From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1755709Ab3I3OY2 (ORCPT ); Mon, 30 Sep 2013 10:24:28 -0400 Received: from merlin.infradead.org ([205.233.59.134]:42455 "EHLO merlin.infradead.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1755233Ab3I3OYZ (ORCPT ); Mon, 30 Sep 2013 10:24:25 -0400 Date: Mon, 30 Sep 2013 16:24:00 +0200 From: Peter Zijlstra To: Oleg Nesterov Cc: Mel Gorman , Rik van Riel , Srikar Dronamraju , Ingo Molnar , Andrea Arcangeli , Johannes Weiner , Linux-MM , LKML , Paul McKenney , Thomas Gleixner , Steven Rostedt , Linus Torvalds Subject: Re: [RFC] introduce synchronize_sched_{enter,exit}() Message-ID: <20130930142400.GK26785@twins.programming.kicks-ass.net> References: <1378805550-29949-1-git-send-email-mgorman@suse.de> <1378805550-29949-38-git-send-email-mgorman@suse.de> <20130917143003.GA29354@twins.programming.kicks-ass.net> <20130929183634.GA15563@redhat.com> <20130930125942.GB12926@twins.programming.kicks-ass.net> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20130930125942.GB12926@twins.programming.kicks-ass.net> User-Agent: Mutt/1.5.21 (2012-12-30) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, Sep 30, 2013 at 02:59:42PM +0200, Peter Zijlstra wrote: > > > > static void cb_rcu_func(struct rcu_head *rcu) > > { > > struct xxx_struct *xxx = container_of(rcu, struct xxx_struct, cb_head); > > long flags; > > > > BUG_ON(xxx->gp_state != GP_PASSED); > > BUG_ON(xxx->cb_state == CB_IDLE); > > > > spin_lock_irqsave(&xxx->xxx_lock, flags); > > if (xxx->gp_count) { > > xxx->cb_state = CB_IDLE; > > This seems to be when a new xxx_begin() has happened after our last > xxx_end() and the sync_sched() from xxx_begin() merges with the > xxx_end() one and we're done. > > > } else if (xxx->cb_state == CB_REPLAY) { > > xxx->cb_state = CB_PENDING; > > call_rcu_sched(&xxx->cb_head, cb_rcu_func); > > A later xxx_exit() has happened, and we need to requeue to catch a later > GP. > > > } else { > > xxx->cb_state = CB_IDLE; > > xxx->gp_state = GP_IDLE; > > Nothing fancy happened and we're done. > > > } > > spin_unlock_irqrestore(&xxx->xxx_lock, flags); > > } > > > > void xxx_exit(struct xxx_struct *xxx) > > { > > spin_lock_irq(&xxx->xxx_lock); > > if (!--xxx->gp_count) { > > if (xxx->cb_state == CB_IDLE) { > > xxx->cb_state = CB_PENDING; > > call_rcu_sched(&xxx->cb_head, cb_rcu_func); > > } else if (xxx->cb_state == CB_PENDING) { > > xxx->cb_state = CB_REPLAY; > > } > > } > > spin_unlock_irq(&xxx->xxx_lock); > > } > > So I don't immediately see the point of the concurrent write side; > percpu_rwsem wouldn't allow this and afaict neither would > freeze_super(). > > Other than that; yes this makes sense if you care about write side > performance and I think its solid. Hmm, wait. I don't see how this is equivalent to: xxx_end() { synchronize_sched(); atomic_dec(&xxx->counter); } For that we'd have to decrement xxx->gp_count from cb_rcu_func(), wouldn't we? Without that there's no guarantee the fast path readers will have a MB to observe the write critical section, unless I'm completely missing something obviuos here.