From: "Jürgen Groß" <jgross@suse.com>
To: "Roger Pau Monné" <roger.pau@citrix.com>,
"SeongJae Park" <sj38.park@gmail.com>
Cc: axboe@kernel.dk, sjpark@amazon.com, konrad.wilk@oracle.com,
pdurrant@amazon.com, SeongJae Park <sjpark@amazon.de>,
linux-kernel@vger.kernel.org, linux-block@vger.kernel.org,
xen-devel@lists.xenproject.org
Subject: Re: [Xen-devel] [PATCH v7 2/3] xen/blkback: Squeeze page pools if a memory pressure is detected
Date: Fri, 13 Dec 2019 10:33:45 +0100 [thread overview]
Message-ID: <8425d77b-37cf-d959-9466-7bc1d4d99642@suse.com> (raw)
In-Reply-To: <20191213092742.GG11756@Air-de-Roger>
On 13.12.19 10:27, Roger Pau Monné wrote:
> On Thu, Dec 12, 2019 at 05:06:58PM +0100, SeongJae Park wrote:
>> On Thu, 12 Dec 2019 16:27:57 +0100 "Roger Pau Monné" <roger.pau@citrix.com> wrote:
>>
>>>> diff --git a/drivers/block/xen-blkback/blkback.c b/drivers/block/xen-blkback/blkback.c
>>>> index fd1e19f1a49f..98823d150905 100644
>>>> --- a/drivers/block/xen-blkback/blkback.c
>>>> +++ b/drivers/block/xen-blkback/blkback.c
>>>> @@ -142,6 +142,21 @@ static inline bool persistent_gnt_timeout(struct persistent_gnt *persistent_gnt)
>>>> HZ * xen_blkif_pgrant_timeout);
>>>> }
>>>>
>>>> +/* Once a memory pressure is detected, squeeze free page pools for a while. */
>>>> +static unsigned int buffer_squeeze_duration_ms = 10;
>>>> +module_param_named(buffer_squeeze_duration_ms,
>>>> + buffer_squeeze_duration_ms, int, 0644);
>>>> +MODULE_PARM_DESC(buffer_squeeze_duration_ms,
>>>> +"Duration in ms to squeeze pages buffer when a memory pressure is detected");
>>>> +
>>>> +static unsigned long buffer_squeeze_end;
>>>> +
>>>> +void xen_blkbk_reclaim_memory(struct xenbus_device *dev)
>>>> +{
>>>> + buffer_squeeze_end = jiffies +
>>>> + msecs_to_jiffies(buffer_squeeze_duration_ms);
>>>
>>> I'm not sure this is fully correct. This function will be called for
>>> each blkback instance, but the timeout is stored in a global variable
>>> that's shared between all blkback instances. Shouldn't this timeout be
>>> stored in xen_blkif so each instance has it's own local variable?
>>>
>>> Or else in the case you have 1k blkback instances the timeout is
>>> certainly going to be longer than expected, because each call to
>>> xen_blkbk_reclaim_memory will move it forward.
>>
>> Agreed that. I think the extended timeout would not make a visible
>> performance, though, because the time that 1k-loop take would be short enough
>> to be ignored compared to the millisecond-scope duration.
>>
>> I took this way because I wanted to minimize such structural changes as far as
>> I can, as this is just a point-fix rather than ultimate solution. That said,
>> it is not fully correct and very confusing. My another colleague also pointed
>> out it in internal review. Correct solution would be to adding a variable in
>> the struct as you suggested or avoiding duplicated update of the variable by
>> initializing the variable once the squeezing duration passes. I would prefer
>> the later way, as it is more straightforward and still not introducing
>> structural change. For example, it might be like below:
>>
>> diff --git a/drivers/block/xen-blkback/blkback.c b/drivers/block/xen-blkback/blkback.c
>> index f41c698dd854..6856c8ef88de 100644
>> --- a/drivers/block/xen-blkback/blkback.c
>> +++ b/drivers/block/xen-blkback/blkback.c
>> @@ -152,8 +152,9 @@ static unsigned long buffer_squeeze_end;
>>
>> void xen_blkbk_reclaim_memory(struct xenbus_device *dev)
>> {
>> - buffer_squeeze_end = jiffies +
>> - msecs_to_jiffies(buffer_squeeze_duration_ms);
>> + if (!buffer_squeeze_end)
>> + buffer_squeeze_end = jiffies +
>> + msecs_to_jiffies(buffer_squeeze_duration_ms);
>> }
>>
>> static inline int get_free_page(struct xen_blkif_ring *ring, struct page **page)
>> @@ -669,10 +670,13 @@ int xen_blkif_schedule(void *arg)
>> }
>>
>> /* Shrink the free pages pool if it is too large. */
>> - if (time_before(jiffies, buffer_squeeze_end))
>> + if (time_before(jiffies, buffer_squeeze_end)) {
>> shrink_free_pagepool(ring, 0);
>> - else
>> + } else {
>> + if (unlikely(buffer_squeeze_end))
>> + buffer_squeeze_end = 0;
>> shrink_free_pagepool(ring, max_buffer_pages);
>> + }
>>
>> if (log_stats && time_after(jiffies, ring->st_print))
>> print_stats(ring);
>>
>> May I ask you what way would you prefer?
>
> I'm not particularly found of this approach, as I think it's racy. Ie:
> you would have to add some kind of lock to make sure the contents of
> buffer_squeeze_end stay unmodified during the read and set cycle, or
> else xen_blkif_schedule will race with xen_blkbk_reclaim_memory.
>
> This is likely not a big deal ATM since the code will work as
> expected in most cases AFAICT, but I would still prefer to have a
> per-instance buffer_squeeze_end added to xen_blkif, given that the
> callback is per-instance. I wouldn't call it a structural change, it's
> just adding a variable to a struct instead of having a shared one, but
> the code is almost the same as the current version.
FWIW, I agree.
Juergen
next prev parent reply other threads:[~2019-12-13 9:33 UTC|newest]
Thread overview: 14+ messages / expand[flat|nested] mbox.gz Atom feed top
2019-12-11 18:10 [PATCH v7 0/2] xenbus/backend: Add a memory pressure handler callback SeongJae Park
2019-12-11 18:10 ` [PATCH v7 1/3] xenbus/backend: Add " SeongJae Park
2019-12-12 9:46 ` [Xen-devel] " Roger Pau Monné
2019-12-11 18:10 ` [PATCH v7 2/3] xen/blkback: Squeeze page pools if a memory pressure is detected SeongJae Park
2019-12-12 11:42 ` [Xen-devel] " Roger Pau Monné
2019-12-12 13:39 ` SeongJae Park
2019-12-12 15:23 ` Roger Pau Monné
2019-12-12 16:15 ` SeongJae Park
2019-12-12 15:27 ` Roger Pau Monné
2019-12-12 16:06 ` SeongJae Park
2019-12-13 9:27 ` Roger Pau Monné
2019-12-13 9:33 ` Jürgen Groß [this message]
2019-12-13 11:47 ` SeongJae Park
2019-12-11 18:10 ` [PATCH v7 3/3] xen/blkback: Remove unnecessary static variable name prefixes SeongJae Park
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=8425d77b-37cf-d959-9466-7bc1d4d99642@suse.com \
--to=jgross@suse.com \
--cc=axboe@kernel.dk \
--cc=konrad.wilk@oracle.com \
--cc=linux-block@vger.kernel.org \
--cc=linux-kernel@vger.kernel.org \
--cc=pdurrant@amazon.com \
--cc=roger.pau@citrix.com \
--cc=sj38.park@gmail.com \
--cc=sjpark@amazon.com \
--cc=sjpark@amazon.de \
--cc=xen-devel@lists.xenproject.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).