* [RFC] sched/core: Fix up load metric exposed to cpuidle
From: Sai Gurrappadi @ 2016-09-23 21:49 UTC
  To: rafael.j.wysocki
  Cc: Peter Boonstoppel, Peter Zijlstra, Colin Cross, Arjan van de Ven,
	linux-pm

While triaging a ~5% performance degradation on some use cases between
our k3.18 and k4.4 trees, we found that the menu cpuidle governor on
k4.4 was far more aggressive about requesting deeper idle states. It
would often get it wrong, though, resulting in a performance loss.

The menu governor biases its choice toward shallower idle states based
on the historical load on the CPU: the busier the CPU, the shallower
the idle state.

However, after commit "3289bdb sched: Move the loadavg code to a more
obvious location", the load metric it looks at is rq->load.weight, which
is the instantaneous sum of se->load.weight over the top-level entities
on the rq. On idle entry this is always 0 (in the common case at least)
because there is nothing on the cfs rq.

The previous metric the menu governor used was rq->cpu_load[0], which is
a snapshot of weighted_cpuload() at the previous load update point, so
it isn't always 0 on idle entry.

Unfortunately, it isn't straightforward to switch the metric to
rq->cfs.load_avg or rq->cfs.util_avg because they overestimate the load
far more than rq->cpu_load[0] does (they include blocked-task
contributions). That would potentially require redoing the magic
constants in the menu governor's performance_multiplier, so for now,
use rq->cpu_load[0] instead to preserve the old behaviour.

Reported-by: Juha Lainema <jlainema@nvidia.com>
Signed-off-by: Sai Gurrappadi <sgurrappadi@nvidia.com>
---

* I realize this might not be the best thing to do, hence the RFC tag.
Thoughts?
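
For context, the governor consumes the value returned by
get_iowait_load() roughly as follows. This is a simplified sketch of
menu's performance_multiplier idea, not the exact kernel code (the real
constants live in drivers/cpuidle/governors/menu.c):

	/*
	 * Simplified sketch: the menu governor turns the load reported
	 * by the scheduler into a multiplier and divides the predicted
	 * idle duration by it, shrinking the exit latency it will
	 * tolerate -- busier CPU, shallower idle state.
	 */
	static int performance_multiplier_sketch(unsigned long nr_iowaiters,
						 unsigned long load)
	{
		int mult = 1;

		/* More loaded CPUs are more reluctant to idle deeply. */
		mult += 2 * load;

		/* Tasks waiting on IO make deep idle costlier still. */
		mult += 10 * nr_iowaiters;

		return mult;
	}

	/* Used as: interactivity_req = predicted_us / mult; states whose
	 * exit latency exceeds that bound are rejected. */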

 kernel/sched/core.c | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/kernel/sched/core.c b/kernel/sched/core.c
index 44817c6..d1aea12 100644
--- a/kernel/sched/core.c
+++ b/kernel/sched/core.c
@@ -2955,7 +2955,7 @@ void get_iowait_load(unsigned long *nr_waiters, unsigned long *load)
 {
 	struct rq *rq = this_rq();
 	*nr_waiters = atomic_read(&rq->nr_iowait);
-	*load = rq->load.weight;
+	*load = rq->cpu_load[0];
 }
 
 #ifdef CONFIG_SMP
-- 
2.1.4

* Re: [RFC] sched/core: Fix up load metric exposed to cpuidle
From: Peter Zijlstra @ 2016-09-23 22:06 UTC
  To: Sai Gurrappadi
  Cc: rafael.j.wysocki, Peter Boonstoppel, Colin Cross,
	Arjan van de Ven, linux-pm

On Fri, Sep 23, 2016 at 02:49:47PM -0700, Sai Gurrappadi wrote:
> While triaging a ~5% performance degradation on some use cases between
> our k3.18 and k4.4 trees, we found that the menu cpuidle governor on
> k4.4 was far more aggressive about requesting deeper idle states. It
> would often get it wrong, though, resulting in a performance loss.
> 
> The menu governor biases its choice toward shallower idle states based
> on the historical load on the CPU: the busier the CPU, the shallower
> the idle state.
> 
> However, after commit "3289bdb sched: Move the loadavg code to a more

The normal quoting style is:

  3289bdb42988 ("sched: Move the loadavg code to a more obvious location")

Use at least 12 SHA1 characters, collisions on 7-8 char abbrevs have
already happened.

Also, that was more than a year ago, nobody noticed?

> obvious location", the load metric it looks at is rq->load.weight, which
> is the instantaneous sum of se->load.weight over the top-level entities
> on the rq. On idle entry this is always 0 (in the common case at least)
> because there is nothing on the cfs rq.
> 
> The previous metric the menu governor used was rq->cpu_load[0], which is
> a snapshot of weighted_cpuload() at the previous load update point, so
> it isn't always 0 on idle entry.

Right, basically a 'random' number :)

> Unfortunately, it isn't straightforward to switch the metric to
> rq->cfs.load_avg or rq->cfs.util_avg because they overestimate the load
> far more than rq->cpu_load[0] does (they include blocked-task
> contributions). That would potentially require redoing the magic
> constants in the menu governor's performance_multiplier, so for now,
> use rq->cpu_load[0] instead to preserve the old behaviour.

Works for me, I suppose, but I would indeed encourage people to
investigate better options.

* Re: [RFC] sched/core: Fix up load metric exposed to cpuidle
From: Sai Gurrappadi @ 2016-09-23 22:44 UTC
  To: Peter Zijlstra
  Cc: rafael.j.wysocki, Peter Boonstoppel, Colin Cross,
	Arjan van de Ven, linux-pm


On 09/23/2016 03:06 PM, Peter Zijlstra wrote:
> On Fri, Sep 23, 2016 at 02:49:47PM -0700, Sai Gurrappadi wrote:
>> While triaging a ~5% performance degradation on some use cases between
>> our k3.18 and k4.4 trees, we found that the menu cpuidle governor on
>> k4.4 was far more aggressive about requesting deeper idle states. It
>> would often get it wrong, though, resulting in a performance loss.
>>
>> The menu governor biases its choice toward shallower idle states based
>> on the historical load on the CPU: the busier the CPU, the shallower
>> the idle state.
>>
>> However, after commit "3289bdb sched: Move the loadavg code to a more
> 
> The normal quoting style is:
> 
>   3289bdb42988 ("sched: Move the loadavg code to a more obvious location")
> 
> Use at least 12 SHA1 characters, collisions on 7-8 char abbrevs have
> already happened.
> 
> Also, that was more than a year ago, nobody noticed?

Ah, sorry. Will do next time.

> 
>> obvious location", the load metric it looks at is rq->load.weight, which
>> is the instantaneous sum of se->load.weight over the top-level entities
>> on the rq. On idle entry this is always 0 (in the common case at least)
>> because there is nothing on the cfs rq.
>>
>> The previous metric the menu governor used was rq->cpu_load[0], which is
>> a snapshot of weighted_cpuload() at the previous load update point, so
>> it isn't always 0 on idle entry.
> 
> Right, basically a 'random' number :)

Indeed. I never really understood how things worked with the cpu_load stuff
given how random it seemed.

> 
>> Unfortunately, it isn't straightforward to switch the metric to
>> rq->cfs.load_avg or rq->cfs.util_avg because they overestimate the load
>> far more than rq->cpu_load[0] does (they include blocked-task
>> contributions). That would potentially require redoing the magic
>> constants in the menu governor's performance_multiplier, so for now,
>> use rq->cpu_load[0] instead to preserve the old behaviour.
> 
> Works for me, I suppose, but I would indeed encourage people to
> investigate better options.

Thanks for the review :)

-Sai

* Re: [RFC] sched/core: Fix up load metric exposed to cpuidle
From: Rafael J. Wysocki @ 2016-09-29 13:22 UTC
  To: Sai Gurrappadi
  Cc: Peter Zijlstra, rafael.j.wysocki, Peter Boonstoppel, Colin Cross,
	Arjan van de Ven, linux-pm

On Friday, September 23, 2016 03:44:23 PM Sai Gurrappadi wrote:
> 
> On 09/23/2016 03:06 PM, Peter Zijlstra wrote:
> > On Fri, Sep 23, 2016 at 02:49:47PM -0700, Sai Gurrappadi wrote:
> >> While triaging a ~5% performance degradation on some use cases between
> >> our k3.18 and k4.4 trees, we found that the menu cpuidle governor on
> >> k4.4 was far more aggressive about requesting deeper idle states. It
> >> would often get it wrong, though, resulting in a performance loss.
> >>
> >> The menu governor biases its choice toward shallower idle states based
> >> on the historical load on the CPU: the busier the CPU, the shallower
> >> the idle state.
> >>
> >> However, after commit "3289bdb sched: Move the loadavg code to a more
> > 
> > The normal quoting style is:
> > 
> >   3289bdb42988 ("sched: Move the loadavg code to a more obvious location")
> > 
> > Use at least 12 SHA1 characters, collisions on 7-8 char abbrevs have
> > already happened.
> > 
> > Also, that was more than a year ago, nobody noticed?
> 
> Ah, sorry. Will do next time.
> 
> > 
> >> obvious location", the load metric it looks at is rq->load.weight, which
> >> is the instantaneous sum of se->load.weight over the top-level entities
> >> on the rq. On idle entry this is always 0 (in the common case at least)
> >> because there is nothing on the cfs rq.
> >>
> >> The previous metric the menu governor used was rq->cpu_load[0], which is
> >> a snapshot of weighted_cpuload() at the previous load update point, so
> >> it isn't always 0 on idle entry.
> > 
> > Right, basically a 'random' number :)
> 
> Indeed. I never really understood how things worked with the cpu_load stuff
> given how random it seemed.

Well, the choice seems to be between "better performance, but we don't really
know how we get that" and "more idle and we understand how it works".

Honestly, I prefer to understand how it works in the first place.

Of course, the fact that the metric used currently is (almost) always 0 is a
problem, but it doesn't seem like going back to the old one would be a huge
improvement either.

Thanks,
Rafael


* Re: [RFC] sched/core: Fix up load metric exposed to cpuidle
From: Sai Gurrappadi @ 2016-09-29 18:22 UTC
  To: Rafael J. Wysocki
  Cc: Peter Zijlstra, rafael.j.wysocki, Peter Boonstoppel, Colin Cross,
	Arjan van de Ven, linux-pm


On 09/29/2016 06:22 AM, Rafael J. Wysocki wrote:
> On Friday, September 23, 2016 03:44:23 PM Sai Gurrappadi wrote:
>>
>> On 09/23/2016 03:06 PM, Peter Zijlstra wrote:
>>> On Fri, Sep 23, 2016 at 02:49:47PM -0700, Sai Gurrappadi wrote:

...

>>>> The previous metric the menu governor used was rq->cpu_load[0], which is
>>>> a snapshot of weighted_cpuload() at the previous load update point, so
>>>> it isn't always 0 on idle entry.
>>>
>>> Right, basically a 'random' number :)
>>
>> Indeed. I never really understood how things worked with the cpu_load stuff
>> given how random it seemed.
> 
> Well, the choice seems to be between "better performance, but we don't really
> know how we get that" and "more idle and we understand how it works".
> 
> Honestly, I prefer to understand how it works in the first place.

A busier CPU wants to enter shallower idle states because the cost of
entering deeper idle states (exit-latency-wise) is higher. The
performance_multiplier logic in the menu governor tries to map a CPU
load metric to an exit latency tolerance by tweaking a fudge factor.
The idea at least makes sense.

The problem is that the CPU load metric it used (rq->cpu_load[0]) isn't
the best thing for this purpose because it is highly dependent on how
the scheduler tick aligns with the workload.
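
For reference, the per-cpu load array is only refreshed at scheduler
tick time, roughly like this (a simplified sketch of the
__update_cpu_load() idea; the real code also handles tickless periods):

	/*
	 * Simplified sketch: at each tick, cpu_load[0] snapshots the
	 * instantaneous weighted rq load and the longer-horizon entries
	 * decay toward it.  Anything that runs entirely between two
	 * ticks never shows up, hence the tick-alignment dependence.
	 */
	static void update_cpu_load_sketch(struct rq *rq,
					   unsigned long this_load)
	{
		int i;

		rq->cpu_load[0] = this_load;	/* snapshot at this tick */

		for (i = 1; i < CPU_LOAD_IDX_MAX; i++) {
			unsigned long old = rq->cpu_load[i];

			/* new = (old * (2^i - 1) + this_load) / 2^i */
			rq->cpu_load[i] =
				(old * ((1 << i) - 1) + this_load) >> i;
		}
	}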

We found that not considering CPU load in the menu governor did result
in worse idle-state prediction, leading to performance loss due to the
higher exit latency of the deeper idle states.

> 
> Of course, the fact that the metric used currently is (almost) always 0 is a
> problem, but it doesn't seem like going back to the old one would be a huge
> improvement either.

Yup, I agree. Ideally we would redo the performance_multiplier logic in
the menu governor to use a better metric (rq->cfs.avg.load_avg?), or
maybe address why the predictor fails in the first place some other way.

Do note that this logic was tuned when rq->cpu_load was still computed
from rq->load.weight. That input was switched once the PELT metric was
added, but the fudge factors in performance_multiplier haven't changed,
so that in itself might be a good enough reason to redo it.
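
An untested sketch of what switching get_iowait_load() to the PELT
metric could look like, just to show the shape of the change (the fudge
factors in performance_multiplier would need re-deriving alongside it):

	void get_iowait_load(unsigned long *nr_waiters, unsigned long *load)
	{
		struct rq *rq = this_rq();

		*nr_waiters = atomic_read(&rq->nr_iowait);
		/* PELT load: decays over time and includes blocked-task
		 * contributions, unlike the old rq->cpu_load[0] snapshot. */
		*load = rq->cfs.avg.load_avg;
	}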

Thanks,
-Sai
