From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by smtp.lore.kernel.org (Postfix) with ESMTP id DF698C433EF for ; Wed, 9 Feb 2022 22:06:08 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S234984AbiBIWGE (ORCPT ); Wed, 9 Feb 2022 17:06:04 -0500 Received: from gmail-smtp-in.l.google.com ([23.128.96.19]:53658 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S234925AbiBIWGC (ORCPT ); Wed, 9 Feb 2022 17:06:02 -0500 Received: from dfw.source.kernel.org (dfw.source.kernel.org [IPv6:2604:1380:4641:c500::1]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 08E21E015649; Wed, 9 Feb 2022 14:06:03 -0800 (PST) Received: from smtp.kernel.org (relay.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by dfw.source.kernel.org (Postfix) with ESMTPS id A3B9261BF2; Wed, 9 Feb 2022 22:06:02 +0000 (UTC) Received: by smtp.kernel.org (Postfix) with ESMTPSA id 13E94C340EB; Wed, 9 Feb 2022 22:06:02 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1644444362; bh=9eTZzuMwvMRMOvxFPlctPZEOfJOPYNJAQtcV0k5GrM8=; h=Date:From:To:Cc:Subject:Reply-To:References:In-Reply-To:From; b=lsrPkyTqlfqVoGi5RhV4UmkRNs5ctkf/Kp7eRbp9J11M7mwxdlVxRH6zzEpRbaJ3Z cRd99TWZM2EL23O6g/LmNzXweVV6zRZiOEvNjiN32q3qSRZqjBQLMu9iTcvQ15DHMd 96hlHYDh9AYhKyAm2ADp87eJ9sbCc6PwksqSjCyYb4BfOIPiAs5lYtj2MmAtuzk67D Rl2ct4teSgr3/n4sMR9HfsG72nOyYAZA3u9Az9Opdq5YZfmMkNJJjytTQW8r9rnbco dnHetFQYYPQdmGU11US78pxIfRFyHCk7P679KLwH92EtGZtMs3Za9GXz14OescIuI2 40zwIvE6WyMcA== Received: by paulmck-ThinkPad-P17-Gen-1.home (Postfix, from userid 1000) id B30E85C03C6; Wed, 9 Feb 2022 14:06:01 -0800 (PST) Date: Wed, 9 Feb 2022 14:06:01 -0800 From: "Paul E. McKenney" To: Mukesh Ojha Cc: rcu@vger.kernel.org, linux-kernel@vger.kernel.org, kernel-team@fb.com, rostedt@goodmis.org, Tejun Heo Subject: Re: [PATCH rcu 3/3] rcu: Allow expedited RCU grace periods on incoming CPUs Message-ID: <20220209220601.GB4285@paulmck-ThinkPad-P17-Gen-1> Reply-To: paulmck@kernel.org References: <20220204225409.GA4193020@paulmck-ThinkPad-P17-Gen-1> <20220204225507.4193113-3-paulmck@kernel.org> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, Feb 09, 2022 at 11:53:33PM +0530, Mukesh Ojha wrote: > > On 2/5/2022 4:25 AM, Paul E. McKenney wrote: > > Although it is usually safe to invoke synchronize_rcu_expedited() from a > > preemption-enabled CPU-hotplug notifier, if it is invoked from a notifier > > between CPUHP_AP_RCUTREE_ONLINE and CPUHP_AP_ACTIVE, its attempts to > > invoke a workqueue handler will hang due to RCU waiting on a CPU that > > the scheduler is not paying attention to. This commit therefore expands > > use of the existing workqueue-independent synchronize_rcu_expedited() > > from early boot to also include CPUs that are being hotplugged. > > > > Link: https://lore.kernel.org/lkml/7359f994-8aaf-3cea-f5cf-c0d3929689d6@quicinc.com/ > > Reported-by: Mukesh Ojha > > Cc: Tejun Heo > > Signed-off-by: Paul E. McKenney > > --- > > kernel/rcu/tree_exp.h | 14 ++++++++++---- > > 1 file changed, 10 insertions(+), 4 deletions(-) > > > > diff --git a/kernel/rcu/tree_exp.h b/kernel/rcu/tree_exp.h > > index 60197ea24ceb9..1a45667402260 100644 > > --- a/kernel/rcu/tree_exp.h > > +++ b/kernel/rcu/tree_exp.h > > @@ -816,7 +816,7 @@ static int rcu_print_task_exp_stall(struct rcu_node *rnp) > > */ > > void synchronize_rcu_expedited(void) > > { > > - bool boottime = (rcu_scheduler_active == RCU_SCHEDULER_INIT); > > + bool no_wq; > > struct rcu_exp_work rew; > > struct rcu_node *rnp; > > unsigned long s; > > @@ -841,9 +841,15 @@ void synchronize_rcu_expedited(void) > > if (exp_funnel_lock(s)) > > return; /* Someone else did our work for us. */ > > + /* Don't use workqueue during boot or from an incoming CPU. */ > > + preempt_disable(); > > + no_wq = rcu_scheduler_active == RCU_SCHEDULER_INIT || > > + !cpumask_test_cpu(smp_processor_id(), cpu_active_mask); > > + preempt_enable(); > > + > > /* Ensure that load happens before action based on it. */ > > - if (unlikely(boottime)) { > > - /* Direct call during scheduler init and early_initcalls(). */ > > + if (unlikely(no_wq)) { > > + /* Direct call for scheduler init, early_initcall()s, and incoming CPUs. */ > > rcu_exp_sel_wait_wake(s); > > } else { > > /* Marshall arguments & schedule the expedited grace period. */ > > @@ -861,7 +867,7 @@ void synchronize_rcu_expedited(void) > > /* Let the next expedited grace period start. */ > > mutex_unlock(&rcu_state.exp_mutex); > > - if (likely(!boottime)) > > + if (likely(!no_wq)) > > destroy_work_on_stack(&rew.rew_work); > > } > > EXPORT_SYMBOL_GPL(synchronize_rcu_expedited); > > Can we reach a condition after this change where no_wq = true and during > rcu_stall report where exp_task = 0 list and exp_mask contain only this cpu > ? Hello, Mukesh, and thank you for looking this over! At first glance, I do not believe that this can happen because the expedited grace-period machinery avoids waiting on the current CPU. (See sync_rcu_exp_select_node_cpus(), both the raw_smp_processor_id() early in the function and the get_cpu() later in the function.) But please let me know if I am missing something here. But suppose that we could in fact reach this condition. What bad thing would happen? Other than a resched_cpu() having been invoked several times on a not-yet-online CPU, of course. ;-) Thanx, Paul