From: Suren Baghdasaryan <surenb@google.com> To: Patrick Bellasi <patrick.bellasi@arm.com> Cc: LKML <linux-kernel@vger.kernel.org>, linux-pm@vger.kernel.org, linux-api@vger.kernel.org, Ingo Molnar <mingo@redhat.com>, Peter Zijlstra <peterz@infradead.org>, Tejun Heo <tj@kernel.org>, "Rafael J . Wysocki" <rafael.j.wysocki@intel.com>, Vincent Guittot <vincent.guittot@linaro.org>, Viresh Kumar <viresh.kumar@linaro.org>, Paul Turner <pjt@google.com>, Quentin Perret <quentin.perret@arm.com>, Dietmar Eggemann <dietmar.eggemann@arm.com>, Morten Rasmussen <morten.rasmussen@arm.com>, Juri Lelli <juri.lelli@redhat.com>, Todd Kjos <tkjos@google.com>, Joel Fernandes <joelaf@google.com>, Steve Muckle <smuckle@google.com> Subject: Re: [PATCH v7 01/15] sched/core: uclamp: Add CPU's clamp buckets refcounting Date: Thu, 14 Mar 2019 08:29:48 -0700 [thread overview] Message-ID: <CAJuCfpFDVi7O+yqQ+F0DT3pRZN=BtRD8NyUH5Tys0VVkd9JAGg@mail.gmail.com> (raw) In-Reply-To: <20190314144600.2ulpeipad7jbxyiy@e110439-lin> On Thu, Mar 14, 2019 at 7:46 AM Patrick Bellasi <patrick.bellasi@arm.com> wrote: > > On 13-Mar 14:32, Suren Baghdasaryan wrote: > > On Fri, Feb 8, 2019 at 2:06 AM Patrick Bellasi <patrick.bellasi@arm.com> wrote: > > > > > > Utilization clamping allows to clamp the CPU's utilization within a > > > [util_min, util_max] range, depending on the set of RUNNABLE tasks on > > > that CPU. Each task references two "clamp buckets" defining its minimum > > > and maximum (util_{min,max}) utilization "clamp values". A CPU's clamp > > > bucket is active if there is at least one RUNNABLE tasks enqueued on > > > that CPU and refcounting that bucket. > > > > > > When a task is {en,de}queued {on,from} a rq, the set of active clamp > > > buckets on that CPU can change. Since each clamp bucket enforces a > > > different utilization clamp value, when the set of active clamp buckets > > > changes, a new "aggregated" clamp value is computed for that CPU. > > > > > > Clamp values are always MAX aggregated for both util_min and util_max. > > > This ensures that no tasks can affect the performance of other > > > co-scheduled tasks which are more boosted (i.e. with higher util_min > > > clamp) or less capped (i.e. with higher util_max clamp). > > > > > > Each task has a: > > > task_struct::uclamp[clamp_id]::bucket_id > > > to track the "bucket index" of the CPU's clamp bucket it refcounts while > > > enqueued, for each clamp index (clamp_id). > > > > > > Each CPU's rq has a: > > > rq::uclamp[clamp_id]::bucket[bucket_id].tasks > > > to track how many tasks, currently RUNNABLE on that CPU, refcount each > > > clamp bucket (bucket_id) of a clamp index (clamp_id). > > > > > > Each CPU's rq has also a: > > > rq::uclamp[clamp_id]::bucket[bucket_id].value > > > to track the clamp value of each clamp bucket (bucket_id) of a clamp > > > index (clamp_id). > > > > > > The rq::uclamp::bucket[clamp_id][] array is scanned every time we need > > > to find a new MAX aggregated clamp value for a clamp_id. This operation > > > is required only when we dequeue the last task of a clamp bucket > > > tracking the current MAX aggregated clamp value. In these cases, the CPU > > > is either entering IDLE or going to schedule a less boosted or more > > > clamped task. > > > The expected number of different clamp values, configured at build time, > > > is small enough to fit the full unordered array into a single cache > > > line. > > > > I assume you are talking about "struct uclamp_rq uclamp[UCLAMP_CNT]" > > here. > > No, I'm talking about the rq::uclamp::bucket[clamp_id][], which is an > array of: > > struct uclamp_bucket { > unsigned long value : bits_per(SCHED_CAPACITY_SCALE); > unsigned long tasks : BITS_PER_LONG - bits_per(SCHED_CAPACITY_SCALE); > }; > > defined as part of: > > struct uclamp_rq { > unsigned int value; > struct uclamp_bucket bucket[UCLAMP_BUCKETS]; > }; > > > So, it's an array of UCLAMP_BUCKETS (value, tasks) pairs. > > > uclamp_rq size depends on UCLAMP_BUCKETS configurable to be up > > to 20. sizeof(long)*20 is already more than 64 bytes. What am I > > missing? > > Right, the comment above refers to the default configuration, which is > 5 buckets. With that configuration we have: > > > $> pahole kernel/sched/core.o > > ---8<--- > struct uclamp_bucket { > long unsigned int value:11; /* 0:53 8 */ > long unsigned int tasks:53; /* 0: 0 8 */ > > /* size: 8, cachelines: 1, members: 2 */ > /* last cacheline: 8 bytes */ > }; > > struct uclamp_rq { > unsigned int value; /* 0 4 */ > > /* XXX 4 bytes hole, try to pack */ > > struct uclamp_bucket bucket[5]; /* 8 40 */ > > /* size: 48, cachelines: 1, members: 2 */ > /* sum members: 44, holes: 1, sum holes: 4 */ > /* last cacheline: 48 bytes */ > }; > > struct rq { > // ... > /* --- cacheline 2 boundary (128 bytes) --- */ > struct uclamp_rq uclamp[2]; /* 128 96 */ > /* --- cacheline 3 boundary (192 bytes) was 32 bytes ago --- */ > // ... > }; > ---8<--- > > Where you see the array fits into a single cache line. > > Actually I notice now that, since when we removed the bucket dedicated > to the default values, we now have some spare space and we can > probably increase the default (and minimum) value of UCLAMP_BUCKETS to > be 7. > > This will uses two full cache lines in struct rq, one for each clamp > index... Although 7 it's a bit of a odd number and gives by default > buckets of ~14% size instead of the ~20%. > > Thoughts ? Got it. From reading the documentation at the beginning my impression was that whatever value I choose within allowed 5-20 range it would still fit in a cache line. To disambiguate it might be worse mentioning that this is true for the default value or for values up to 7. Thanks! > [...] > > -- > #include <best/regards.h> > > Patrick Bellasi
next prev parent reply other threads:[~2019-03-14 15:30 UTC|newest] Thread overview: 84+ messages / expand[flat|nested] mbox.gz Atom feed top 2019-02-08 10:05 [PATCH v7 00/15] Add utilization clamping support Patrick Bellasi 2019-02-08 10:05 ` [PATCH v7 01/15] sched/core: uclamp: Add CPU's clamp buckets refcounting Patrick Bellasi 2019-03-12 12:52 ` Dietmar Eggemann 2019-03-13 15:15 ` Patrick Bellasi 2019-03-13 21:01 ` Suren Baghdasaryan 2019-03-14 14:54 ` Patrick Bellasi 2019-03-14 15:00 ` Patrick Bellasi 2019-03-12 15:20 ` Peter Zijlstra 2019-03-12 15:50 ` Patrick Bellasi 2019-03-13 8:19 ` Peter Zijlstra 2019-03-13 11:37 ` Patrick Bellasi 2019-03-13 13:40 ` Peter Zijlstra 2019-03-13 16:12 ` Patrick Bellasi 2019-03-13 17:22 ` Peter Zijlstra 2019-03-13 18:22 ` Patrick Bellasi 2019-03-13 19:48 ` Peter Zijlstra 2019-03-14 12:13 ` Patrick Bellasi 2019-03-14 13:32 ` Peter Zijlstra 2019-03-14 15:07 ` Patrick Bellasi 2019-03-14 19:18 ` Peter Zijlstra 2019-03-13 13:52 ` Peter Zijlstra 2019-03-13 15:59 ` Patrick Bellasi 2019-03-13 19:30 ` Peter Zijlstra 2019-03-14 11:03 ` Patrick Bellasi 2019-03-14 13:27 ` Peter Zijlstra 2019-03-13 19:39 ` Peter Zijlstra 2019-03-14 11:18 ` Patrick Bellasi 2019-03-13 21:23 ` Suren Baghdasaryan 2019-03-14 12:43 ` Patrick Bellasi 2019-03-13 14:06 ` Peter Zijlstra 2019-03-13 15:28 ` Patrick Bellasi 2019-03-13 14:09 ` Peter Zijlstra 2019-03-13 15:23 ` Patrick Bellasi 2019-03-13 19:46 ` Peter Zijlstra 2019-03-13 21:08 ` Suren Baghdasaryan 2019-03-14 12:22 ` Patrick Bellasi 2019-03-14 11:45 ` Patrick Bellasi 2019-03-13 21:32 ` Suren Baghdasaryan 2019-03-14 14:46 ` Patrick Bellasi 2019-03-14 15:29 ` Suren Baghdasaryan [this message] 2019-03-14 15:40 ` Patrick Bellasi 2019-03-14 16:39 ` Suren Baghdasaryan 2019-02-08 10:05 ` [PATCH v7 02/15] sched/core: uclamp: Enforce last task UCLAMP_MAX Patrick Bellasi 2019-03-13 14:10 ` Peter Zijlstra 2019-03-13 16:20 ` Patrick Bellasi 2019-03-13 17:29 ` Peter Zijlstra 2019-03-13 18:29 ` Patrick Bellasi 2019-03-13 14:12 ` Peter Zijlstra 2019-03-13 16:16 ` Patrick Bellasi 2019-03-14 0:29 ` Suren Baghdasaryan 2019-03-14 17:06 ` Patrick Bellasi 2019-02-08 10:05 ` [PATCH v7 03/15] sched/core: uclamp: Add system default clamps Patrick Bellasi 2019-03-13 14:32 ` Peter Zijlstra 2019-03-13 17:09 ` Patrick Bellasi 2019-03-13 19:58 ` Peter Zijlstra 2019-03-13 20:10 ` Peter Zijlstra 2019-03-15 13:41 ` Patrick Bellasi 2019-03-13 20:13 ` Peter Zijlstra 2019-03-13 20:18 ` Peter Zijlstra 2019-03-18 12:18 ` Patrick Bellasi 2019-03-18 13:10 ` Peter Zijlstra 2019-03-18 14:21 ` Patrick Bellasi 2019-03-18 14:29 ` Peter Zijlstra 2019-02-08 10:05 ` [PATCH v7 04/15] sched/core: Allow sched_setattr() to use the current policy Patrick Bellasi 2019-02-08 10:05 ` [PATCH v7 05/15] sched/core: uclamp: Extend sched_setattr() to support utilization clamping Patrick Bellasi 2019-02-08 10:05 ` [PATCH v7 06/15] sched/core: uclamp: Reset uclamp values on RESET_ON_FORK Patrick Bellasi 2019-03-13 20:52 ` Peter Zijlstra 2019-03-18 12:58 ` Patrick Bellasi 2019-02-08 10:05 ` [PATCH v7 07/15] sched/core: uclamp: Set default clamps for RT tasks Patrick Bellasi 2019-02-08 10:05 ` [PATCH v7 08/15] sched/cpufreq: uclamp: Add clamps for FAIR and " Patrick Bellasi 2019-02-08 10:05 ` [PATCH v7 09/15] sched/core: uclamp: Add uclamp_util_with() Patrick Bellasi 2019-02-08 10:05 ` [PATCH v7 10/15] sched/fair: uclamp: Add uclamp support to energy_compute() Patrick Bellasi 2019-03-06 17:21 ` Quentin Perret 2019-03-18 15:19 ` Patrick Bellasi 2019-02-08 10:05 ` [PATCH v7 11/15] sched/core: uclamp: Extend CPU's cgroup controller Patrick Bellasi 2019-02-14 15:48 ` Tejun Heo 2019-03-19 10:00 ` Patrick Bellasi 2019-02-08 10:05 ` [PATCH v7 12/15] sched/core: uclamp: Propagate parent clamps Patrick Bellasi 2019-03-14 16:17 ` Suren Baghdasaryan 2019-03-18 16:54 ` Patrick Bellasi 2019-03-18 16:58 ` Suren Baghdasaryan 2019-02-08 10:05 ` [PATCH v7 13/15] sched/core: uclamp: Propagate system defaults to root group Patrick Bellasi 2019-02-08 10:05 ` [PATCH v7 14/15] sched/core: uclamp: Use TG's clamps to restrict TASK's clamps Patrick Bellasi 2019-02-08 10:05 ` [PATCH v7 15/15] sched/core: uclamp: Update CPU's refcount on TG's clamp changes Patrick Bellasi
Reply instructions: You may reply publicly to this message via plain-text email using any one of the following methods: * Save the following mbox file, import it into your mail client, and reply-to-all from there: mbox Avoid top-posting and favor interleaved quoting: https://en.wikipedia.org/wiki/Posting_style#Interleaved_style * Reply using the --to, --cc, and --in-reply-to switches of git-send-email(1): git send-email \ --in-reply-to='CAJuCfpFDVi7O+yqQ+F0DT3pRZN=BtRD8NyUH5Tys0VVkd9JAGg@mail.gmail.com' \ --to=surenb@google.com \ --cc=dietmar.eggemann@arm.com \ --cc=joelaf@google.com \ --cc=juri.lelli@redhat.com \ --cc=linux-api@vger.kernel.org \ --cc=linux-kernel@vger.kernel.org \ --cc=linux-pm@vger.kernel.org \ --cc=mingo@redhat.com \ --cc=morten.rasmussen@arm.com \ --cc=patrick.bellasi@arm.com \ --cc=peterz@infradead.org \ --cc=pjt@google.com \ --cc=quentin.perret@arm.com \ --cc=rafael.j.wysocki@intel.com \ --cc=smuckle@google.com \ --cc=tj@kernel.org \ --cc=tkjos@google.com \ --cc=vincent.guittot@linaro.org \ --cc=viresh.kumar@linaro.org \ --subject='Re: [PATCH v7 01/15] sched/core: uclamp: Add CPU'\''s clamp buckets refcounting' \ /path/to/YOUR_REPLY https://kernel.org/pub/software/scm/git/docs/git-send-email.html * If your mail client supports setting the In-Reply-To header via mailto: links, try the mailto: link
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).