From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1756860Ab3GQSs6 (ORCPT ); Wed, 17 Jul 2013 14:48:58 -0400 Received: from g5t0009.atlanta.hp.com ([15.192.0.46]:37235 "EHLO g5t0009.atlanta.hp.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1756838Ab3GQSs5 (ORCPT ); Wed, 17 Jul 2013 14:48:57 -0400 Message-ID: <1374086931.1816.2.camel@j-VirtualBox> Subject: Re: [RFC] sched: Limit idle_balance() when it is being used too frequently From: Jason Low To: Peter Zijlstra Cc: Rik van Riel , Ingo Molnar , LKML , Mike Galbraith , Thomas Gleixner , Paul Turner , Alex Shi , Preeti U Murthy , Vincent Guittot , Morten Rasmussen , Namhyung Kim , Andrew Morton , Kees Cook , Mel Gorman , aswin@hp.com, scott.norton@hp.com, chegu_vinod@hp.com Date: Wed, 17 Jul 2013 11:48:51 -0700 In-Reply-To: <20130717180156.GS23818@dyad.programming.kicks-ass.net> References: <1374002463.3944.11.camel@j-VirtualBox> <20130716202015.GX17211@twins.programming.kicks-ass.net> <1374014881.2332.21.camel@j-VirtualBox> <20130717072504.GY17211@twins.programming.kicks-ass.net> <1374048701.6000.21.camel@j-VirtualBox> <20130717093913.GP23818@dyad.programming.kicks-ass.net> <1374076741.7412.35.camel@j-VirtualBox> <20130717161815.GR23818@dyad.programming.kicks-ass.net> <51E6D9B7.1030705@redhat.com> <20130717180156.GS23818@dyad.programming.kicks-ass.net> Content-Type: text/plain; charset="UTF-8" X-Mailer: Evolution 3.2.3-0ubuntu6 Content-Transfer-Encoding: 7bit Mime-Version: 1.0 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, 2013-07-17 at 20:01 +0200, Peter Zijlstra wrote: > On Wed, Jul 17, 2013 at 01:51:51PM -0400, Rik van Riel wrote: > > On 07/17/2013 12:18 PM, Peter Zijlstra wrote: > > > >So the way I see things is that the only way newidle balance can slow down > > >things is if it runs when we could have ran something useful. > > > > Due to contention on the runqueue locks of other CPUs, > > newidle also has the potential to keep _others_ from > > running something useful. > > Right, although that should only happen when we do have an imbalance and want > to go move something. Which in Jason's case is 'rare'. But yes, I suppose > there's other scenarios where this is far more likely. > > > Could we prevent that downside by measuring both the > > time spent idle, and the time spent in idle balancing, > > and making sure the idle balancing time never exceeds > > more than N% of the idle time? > > Sure: > > idle_balance(u64 idle_duration) > { > u64 cost = 0; > > for_each_domain(sd) { > if (cost + sd->cost > idle_duration/N) > break; > > ... > > sd->cost = (sd->cost + this_cost) / 2; > cost += this_cost; > } > } > > I would've initially suggested using something like N=2 since we're dealing > with averages and half should ensure we don't run over except for the worst > peaks. But we could easily use a bigger N. Okay, I'll try this out. Thank you for your suggestions. Jason.