From: Jakub Kicinski <kuba@kernel.org>
To: Johannes Weiner <hannes@cmpxchg.org>
Cc: akpm@linux-foundation.org, linux-mm@kvack.org,
kernel-team@fb.com, tj@kernel.org, chris@chrisdown.name,
cgroups@vger.kernel.org, shakeelb@google.com, mhocko@kernel.org
Subject: Re: [PATCH mm v5 RESEND 4/4] mm: automatically penalize tasks with high swap use
Date: Tue, 26 May 2020 13:11:57 -0700 [thread overview]
Message-ID: <20200526131157.79c17940@kicinski-fedora-PC1C0HJN.hsd1.ca.comcast.net> (raw)
In-Reply-To: <20200526153309.GD848026@cmpxchg.org>
On Tue, 26 May 2020 11:33:09 -0400 Johannes Weiner wrote:
> On Wed, May 20, 2020 at 05:24:11PM -0700, Jakub Kicinski wrote:
> > Add a memory.swap.high knob, which can be used to protect the system
> > from SWAP exhaustion. The mechanism used for penalizing is similar
> > to memory.high penalty (sleep on return to user space), but with
> > a less steep slope.
>
> The last part is no longer true after incorporating Michal's feedback.
>
> > + /*
> > + * Make the swap curve more gradual, swap can be considered "cheaper",
> > + * and is allocated in larger chunks. We want the delays to be gradual.
> > + */
>
> This comment is also out-of-date, as the same curve is being applied.
Indeed :S
> > + penalty_jiffies += calculate_high_delay(memcg, nr_pages,
> > + swap_find_max_overage(memcg));
> > +
> > /*
> > * Clamp the max delay per usermode return so as to still keep the
> > * application moving forwards and also permit diagnostics, albeit
> > @@ -2585,12 +2608,25 @@ static int try_charge(struct mem_cgroup *memcg, gfp_t gfp_mask,
> > * reclaim, the cost of mismatch is negligible.
> > */
> > do {
> > - if (page_counter_is_above_high(&memcg->memory)) {
> > - /* Don't bother a random interrupted task */
> > - if (in_interrupt()) {
> > + bool mem_high, swap_high;
> > +
> > + mem_high = page_counter_is_above_high(&memcg->memory);
> > + swap_high = page_counter_is_above_high(&memcg->swap);
>
> Please open-code these checks instead - we don't really do getters and
> predicates for these, and only have the setters because they are more
> complicated operations.
I added this helper because the calculation doesn't fit into 80 chars.
In particular reclaim_high will need a temporary variable or IMHO
questionable line split.
static void reclaim_high(struct mem_cgroup *memcg,
unsigned int nr_pages,
gfp_t gfp_mask)
{
do {
if (!page_counter_is_above_high(&memcg->memory))
continue;
memcg_memory_event(memcg, MEMCG_HIGH);
try_to_free_mem_cgroup_pages(memcg, nr_pages, gfp_mask, true);
} while ((memcg = parent_mem_cgroup(memcg)) &&
!mem_cgroup_is_root(memcg));
}
What's your preference? Mine is a helper, but I'm probably not
sensitive enough to the ontology here :)
> > + if (mem_high || swap_high) {
> > + /* Use one counter for number of pages allocated
> > + * under pressure to save struct task space and
> > + * avoid two separate hierarchy walks.
> > + /*
> > current->memcg_nr_pages_over_high += batch;
>
> That comment style is leaking out of the networking code ;-) Please
> use the customary style in this code base, /*\n *...
>
> As for one counter instead of two: I'm not sure that question arises
> in the reader. There have also been some questions recently what the
> counter actually means. How about the following:
>
> /*
> * The allocating tasks in this cgroup will need to do
> * reclaim or be throttled to prevent further growth
> * of the memory or swap footprints.
> *
> * Target some best-effort fairness between the tasks,
> * and distribute reclaim work and delay penalties
> * based on how much each task is actually allocating.
> */
sounds good!
> Otherwise, the patch looks good to me.
Thanks!
next prev parent reply other threads:[~2020-05-26 20:12 UTC|newest]
Thread overview: 11+ messages / expand[flat|nested] mbox.gz Atom feed top
2020-05-21 0:24 [PATCH mm v5 RESEND 0/4] memcg: Slow down swap allocation as the available space gets depleted Jakub Kicinski
2020-05-21 0:24 ` [PATCH mm v5 RESEND 1/4] mm: prepare for swap over-high accounting and penalty calculation Jakub Kicinski
2020-05-26 14:35 ` Johannes Weiner
2020-05-21 0:24 ` [PATCH mm v5 RESEND 2/4] mm: move penalty delay clamping out of calculate_high_delay() Jakub Kicinski
2020-05-26 14:36 ` Johannes Weiner
2020-05-21 0:24 ` [PATCH mm v5 RESEND 3/4] mm: move cgroup high memory limit setting into struct page_counter Jakub Kicinski
2020-05-26 14:42 ` Johannes Weiner
2020-05-21 0:24 ` [PATCH mm v5 RESEND 4/4] mm: automatically penalize tasks with high swap use Jakub Kicinski
2020-05-26 15:33 ` Johannes Weiner
2020-05-26 20:11 ` Jakub Kicinski [this message]
2020-05-27 15:51 ` Johannes Weiner
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20200526131157.79c17940@kicinski-fedora-PC1C0HJN.hsd1.ca.comcast.net \
--to=kuba@kernel.org \
--cc=akpm@linux-foundation.org \
--cc=cgroups@vger.kernel.org \
--cc=chris@chrisdown.name \
--cc=hannes@cmpxchg.org \
--cc=kernel-team@fb.com \
--cc=linux-mm@kvack.org \
--cc=mhocko@kernel.org \
--cc=shakeelb@google.com \
--cc=tj@kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).