From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1755210Ab2AaUed (ORCPT ); Tue, 31 Jan 2012 15:34:33 -0500 Received: from casper.infradead.org ([85.118.1.10]:57572 "EHLO casper.infradead.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753569Ab2AaUec convert rfc822-to-8bit (ORCPT ); Tue, 31 Jan 2012 15:34:32 -0500 Message-ID: <1328042063.2446.250.camel@twins> Subject: Re: [PATCH] backing-dev: use synchronize_rcu_expedited instead of synchronize_rcu From: Peter Zijlstra To: Mikulas Patocka Cc: axboe@kernel.dk, "Alasdair G. Kergon" , dm-devel@redhat.com, linux-kernel@vger.kernel.org, "Paul E. McKenney" Date: Tue, 31 Jan 2012 21:34:23 +0100 In-Reply-To: References: Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7BIT X-Mailer: Evolution 3.2.1- Mime-Version: 1.0 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, 2011-07-20 at 20:29 -0400, Mikulas Patocka wrote: > Hi Jens > > Please would you consider taking this into the block tree? It seems to > speed up device deletion enormously. > > Mikulas > > --- > > backing-dev: use synchronize_rcu_expedited instead of synchronize_rcu > > synchronize_rcu sleeps several timer ticks. synchronize_rcu_expedited is > much faster. > > With 100Hz timer frequency, when we remove 10000 block devices with > "dmsetup remove_all" command, it takes 27 minutes. With this patch, > removing 10000 block devices takes only 15 seconds. > > Signed-off-by: Mikulas Patocka > > --- > mm/backing-dev.c | 2 +- > 1 file changed, 1 insertion(+), 1 deletion(-) > > Index: linux-3.0-rc7-fast/mm/backing-dev.c > =================================================================== > --- linux-3.0-rc7-fast.orig/mm/backing-dev.c 2011-07-19 18:01:00.000000000 +0200 > +++ linux-3.0-rc7-fast/mm/backing-dev.c 2011-07-19 18:01:07.000000000 +0200 > @@ -505,7 +505,7 @@ static void bdi_remove_from_list(struct > list_del_rcu(&bdi->bdi_list); > spin_unlock_bh(&bdi_lock); > > - synchronize_rcu(); > + synchronize_rcu_expedited(); > } > Urgh, I just noticed this crap in my tree.. You realize that what you're effectively hammering a global sync primitive this way? Depending on what RCU flavour you have any SMP variant will at least do a machine wide IPI broadcast for every sync_rcu_exp(), some do significantly more. The much better solution would've been to batch your block-dev removals and use a single sync_rcu as barrier. This is not cool.