From: "Huang, Ying" <ying.huang@intel.com>
To: Shakeel Butt <shakeelb@google.com>
Cc: Tejun Heo <tj@kernel.org>,
Andrew Morton <akpm@linux-foundation.org>,
Linux MM <linux-mm@kvack.org>,
LKML <linux-kernel@vger.kernel.org>,
Mel Gorman <mgorman@suse.de>,
Johannes Weiner <hannes@cmpxchg.org>,
Vladimir Davydov <vdavydov.dev@gmail.com>,
Michal Hocko <mhocko@suse.cz>,
Joonsoo Kim <iamjoonsoo.kim@lge.com>, Tejun Heo <tj@kernel.org>
Subject: Re: [PATCH] vmscan: retry without cache trim mode if nothing scanned
Date: Thu, 11 Mar 2021 16:52:47 +0800 [thread overview]
Message-ID: <87v99yvzq8.fsf@yhuang-dev.intel.com> (raw)
In-Reply-To: <CALvZod7QNEXdKCJ3H3eoZKsRj5jtOESkmHm1dTC-ZjSBAcW7ng@mail.gmail.com> (Shakeel Butt's message of "Wed, 10 Mar 2021 16:57:49 -0800")
Hi, Butt,
Shakeel Butt <shakeelb@google.com> writes:
> On Wed, Mar 10, 2021 at 4:47 PM Huang, Ying <ying.huang@intel.com> wrote:
>>
>> From: Huang Ying <ying.huang@intel.com>
>>
>> In shrink_node(), to determine whether to enable cache trim mode, the
>> LRU size is gotten via lruvec_page_state(). That gets the value from
>> a per-CPU counter (mem_cgroup_per_node->lruvec_stat[]). The error of
>> the per-CPU counter from CPU local counting and the descendant memory
>> cgroups may cause some issues. We run into this in 0-Day performance
>> test.
>>
>> 0-Day uses the RAM file system as root file system, so the number of
>> the reclaimable file pages is very small. In the swap testing, the
>> inactive file LRU list will become almost empty soon. But the size of
>> the inactive file LRU list gotten from the per-CPU counter may keep a
>> much larger value (say, 33, 50, etc.). This will enable cache trim
>> mode, but nothing can be scanned in fact. The following pattern
>> repeats for long time in the test,
>>
>> priority inactive_file_size cache_trim_mode
>> 12 33 0
>> 11 33 0
>> ...
>> 6 33 0
>> 5 33 1
>> ...
>> 1 33 1
>>
>> That is, the cache_trim_mode will be enabled wrongly when the scan
>> priority decreases to 5. And the problem will not be recovered for
>> long time.
>>
>> It's hard to get the more accurate size of the inactive file list
>> without much more overhead. And it's hard to estimate the error of
>> the per-CPU counter too, because there may be many descendant memory
>> cgroups. But after the actual scanning, if nothing can be scanned
>> with the cache trim mode, it should be wrong to enable the cache trim
>> mode. So we can retry with the cache trim mode disabled. This patch
>> implement this policy.
>
> Instead of playing with the already complicated heuristics, we should
> improve the accuracy of the lruvec stats. Johannes already fixed the
> memcg stats using rstat infrastructure and Tejun has suggestions on
> how to use rstat infrastructure efficiently for lruvec stats at
> https://lore.kernel.org/linux-mm/YCFgr300eRiEZwpL@slm.duckdns.org/.
Thanks for your information! It should be better if we can improve the
accuracy of lruvec stats without much overhead. But that may be not a
easy task.
If my understanding were correct, what Tejun suggested is to add a fast
read interface to rstat to be used in hot path. And its accuracy is
similar as that of traditional per-CPU counter. But if we can regularly
update the lruvec rstat with something like vmstat_update(), that should
be OK for the issue described in this patch.
Best Regards,
Huang, Ying
next prev parent reply other threads:[~2021-03-11 8:52 UTC|newest]
Thread overview: 5+ messages / expand[flat|nested] mbox.gz Atom feed top
2021-03-11 0:44 [PATCH] vmscan: retry without cache trim mode if nothing scanned Huang, Ying
2021-03-11 0:57 ` Shakeel Butt
2021-03-11 8:52 ` Huang, Ying [this message]
2021-03-14 20:58 ` Shakeel Butt
2021-03-14 22:51 ` Tejun Heo
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=87v99yvzq8.fsf@yhuang-dev.intel.com \
--to=ying.huang@intel.com \
--cc=akpm@linux-foundation.org \
--cc=hannes@cmpxchg.org \
--cc=iamjoonsoo.kim@lge.com \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=mgorman@suse.de \
--cc=mhocko@suse.cz \
--cc=shakeelb@google.com \
--cc=tj@kernel.org \
--cc=vdavydov.dev@gmail.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).