From mboxrd@z Thu Jan  1 00:00:00 1970
Return-Path: <linux-kernel-owner@vger.kernel.org>
Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand
        id S1754321AbcHVL3T (ORCPT <rfc822;w@1wt.eu>);
        Mon, 22 Aug 2016 07:29:19 -0400
Received: from foss.arm.com ([217.140.101.70]:51070 "EHLO foss.arm.com"
        rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP
        id S1752150AbcHVL3R (ORCPT <rfc822;linux-kernel@vger.kernel.org>);
        Mon, 22 Aug 2016 07:29:17 -0400
Date: Mon, 22 Aug 2016 12:29:25 +0100
From: Morten Rasmussen <morten.rasmussen@arm.com>
To: Wanpeng Li <kernellwp@gmail.com>
Cc: Peter Zijlstra <peterz@infradead.org>, Ingo Molnar <mingo@redhat.com>,
        Dietmar Eggemann <dietmar.eggemann@arm.com>,
        Yuyang Du <yuyang.du@intel.com>,
        Vincent Guittot <vincent.guittot@linaro.org>,
        Mike Galbraith <mgalbraith@suse.de>, sgurrappadi@nvidia.com,
        Koan-Sin Tan <freedom.tan@mediatek.com>,
        =?utf-8?B?5bCP5p6X5pWs5aSq?= <keita.kobayashi.ym@renesas.com>,
        "linux-kernel@vger.kernel.org" <linux-kernel@vger.kernel.org>
Subject: Re: [PATCH v3 10/13] sched/fair: Compute task/cpu utilization at
 wake-up more correctly
Message-ID: <20160822112925.GC25262@e105550-lin.cambridge.arm.com>
References: <1469453670-2660-11-git-send-email-morten.rasmussen@arm.com>
 <20160815142342.GV6879@twins.programming.kicks-ass.net>
 <20160815154237.GE3391@e105550-lin.cambridge.arm.com>
 <20160818084053.GG3391@e105550-lin.cambridge.arm.com>
 <20160818102438.GA27873@e105550-lin.cambridge.arm.com>
 <CANRm+Czjkn+gnzEhAtf0muRhBPs0FoXYFgnEYdzvXcjqBgXUyw@mail.gmail.com>
 <20160818134517.GC27873@e105550-lin.cambridge.arm.com>
 <CANRm+Cx7YScEtxhSag_uqzdOei+kEjSYsTMHeoMdYq-ijLGGAQ@mail.gmail.com>
 <20160819140307.GA25262@e105550-lin.cambridge.arm.com>
 <CANRm+CxLiSy2JEohZ2iGNbg4kjJpR-5xS9dvzmxvoBHRHBJL_Q@mail.gmail.com>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
In-Reply-To: <CANRm+CxLiSy2JEohZ2iGNbg4kjJpR-5xS9dvzmxvoBHRHBJL_Q@mail.gmail.com>
User-Agent: Mutt/1.5.21 (2010-09-15)
Sender: linux-kernel-owner@vger.kernel.org
List-ID: <linux-kernel.vger.kernel.org>
X-Mailing-List: linux-kernel@vger.kernel.org

On Mon, Aug 22, 2016 at 09:48:19AM +0800, Wanpeng Li wrote:
> 2016-08-19 22:03 GMT+08:00 Morten Rasmussen <morten.rasmussen@arm.com>:
> > On Fri, Aug 19, 2016 at 09:43:00AM +0800, Wanpeng Li wrote:
> >> 2016-08-18 21:45 GMT+08:00 Morten Rasmussen <morten.rasmussen@arm.com>:
> >> > I assume you are referring to using task_util_peak() instead of
> >> > task_util() in wake_cap()?
> >>
> >> Yes.
> >>
> >> >
> >> > The peak value should never exceed the util_avg accumulated by the task
> >> > last time it ran. So any spike has to be caused by the task accumulating
> >> > more utilization last time it ran. We don't know if it a spike or a more
> >>
> >> I see.
> >>
> >> > permanent change in behaviour, so we have to guess. So a spike on an
> >> > asymmetric system could cause us to disable wake affine in some
> >> > circumstances (either prev_cpu or waker cpu has to be low compute
> >> > capacity) for the following wake-up.
> >> >
> >> > SMP should be unaffected as we should bail out on the previous
> >> > condition.
> >>
> >> Why capacity_orig instead of capacity since it is checked each time
> >> wakeup and maybe rt class/interrupt have already occupied many cpu
> >> utilization.
> >
> > We could switch to capacity for this condition if we also change the
> > spare capacity evaluation in find_idlest_group() to do the same. It
> > would open up for SMP systems to take find_idlest_group() route if the
> > SD_BALANCE_WAKE flag is set.
> >
> > The reason why I have avoided capacity and used capacity_orig instead
> > is that in previous discussions about scheduling behaviour under
> > rt/dl/irq pressure it has been clear to me whether we want to move tasks
> > away from cpus with capacity < capacity_orig or not. The choice depends
> > on the use-case.
> >
> > In some cases taking rt/dl/irq pressure into account is more complicated
> > as we don't know the capacities available in a sched_group without
> > iterating over all the cpus. However, I don't think it would complicate
> > these patches. It is more a question whether everyone are happy with
> > additional conditions in their wake-up path. I guess we could make it a
> > sched_feature if people are interested?
> >
> > In short, I used capacity_orig to play it safe ;-)
> 
> Actually you mixed capacity_orig and capacity when evaluating max spare cap.

Right, that is a mistake. Thanks for pointing that out :-)

Morten