All of lore.kernel.org
 help / color / mirror / Atom feed
From: "Michael R. Hines" <mrhines@digitalocean.com>
To: Peter Lieven <pl@kamp.de>, qemu-devel@nongnu.org
Cc: kwolf@redhat.com, peter.maydell@linaro.org, mst@redhat.com,
	dgilbert@redhat.com, mreitz@redhat.com, kraxel@redhat.com,
	pbonzini@redhat.com, blemasurier@digitalocean.com,
	patrick@digitalocean.com
Subject: Re: [Qemu-devel] [PATCH 00/15] optimize Qemu RSS usage
Date: Mon, 31 Oct 2016 17:00:21 -0500	[thread overview]
Message-ID: <f43fdbd2-7f8c-a241-4d24-5722d326d0e5@digitalocean.com> (raw)
In-Reply-To: <73ad59dc-cc88-96ee-2867-9fb628823782@kamp.de>

On 10/18/2016 05:47 AM, Peter Lieven wrote:
> Am 12.10.2016 um 23:18 schrieb Michael R. Hines:
>> Peter,
>>
>> Greetings from DigitalOcean. We're experiencing the same symptoms 
>> without this patch.
>> We have, collectively, many gigabytes of un-planned-for RSS being 
>> used per-hypervisor
>> that we would like to get rid of =).
>>
>> Without explicitly trying this patch (will do that ASAP), we 
>> immediately noticed that the
>> 192MB mentioned immediately melts away (Yay) when we disabled the 
>> coroutine thread pool explicitly,
>> with another ~100MB in additional stack usage that would likely also 
>> go away if we
>> applied the entirety of your patch.
>>
>> Is there any chance you have revisited this or have a timeline for it?
>
> Hi Michael,
>
> the current master already includes some of the patches of this 
> original series. There are still some changes left, but
> what works for me is the current master +
>
> diff --git a/util/qemu-coroutine.c b/util/qemu-coroutine.c
> index 5816702..3eaef68 100644
> --- a/util/qemu-coroutine.c
> +++ b/util/qemu-coroutine.c
> @@ -25,8 +25,6 @@ enum {
>  };
>
>  /** Free list to speed up creation */
> -static QSLIST_HEAD(, Coroutine) release_pool = 
> QSLIST_HEAD_INITIALIZER(pool);
> -static unsigned int release_pool_size;
>  static __thread QSLIST_HEAD(, Coroutine) alloc_pool = 
> QSLIST_HEAD_INITIALIZER(pool);
>  static __thread unsigned int alloc_pool_size;
>  static __thread Notifier coroutine_pool_cleanup_notifier;
> @@ -49,20 +47,10 @@ Coroutine *qemu_coroutine_create(CoroutineEntry 
> *entry)
>      if (CONFIG_COROUTINE_POOL) {
>          co = QSLIST_FIRST(&alloc_pool);
>          if (!co) {
> -            if (release_pool_size > POOL_BATCH_SIZE) {
> -                /* Slow path; a good place to register the 
> destructor, too.  */
> -                if (!coroutine_pool_cleanup_notifier.notify) {
> -                    coroutine_pool_cleanup_notifier.notify = 
> coroutine_pool_cleanup;
> - qemu_thread_atexit_add(&coroutine_pool_cleanup_notifier);
> -                }
> -
> -                /* This is not exact; there could be a little skew 
> between
> -                 * release_pool_size and the actual size of 
> release_pool.  But
> -                 * it is just a heuristic, it does not need to be 
> perfect.
> -                 */
> -                alloc_pool_size = atomic_xchg(&release_pool_size, 0);
> -                QSLIST_MOVE_ATOMIC(&alloc_pool, &release_pool);
> -                co = QSLIST_FIRST(&alloc_pool);
> +            /* Slow path; a good place to register the destructor, 
> too.  */
> +            if (!coroutine_pool_cleanup_notifier.notify) {
> +                coroutine_pool_cleanup_notifier.notify = 
> coroutine_pool_cleanup;
> + qemu_thread_atexit_add(&coroutine_pool_cleanup_notifier);
>              }
>          }
>          if (co) {
> @@ -85,11 +73,6 @@ static void coroutine_delete(Coroutine *co)
>      co->caller = NULL;
>
>      if (CONFIG_COROUTINE_POOL) {
> -        if (release_pool_size < POOL_BATCH_SIZE * 2) {
> -            QSLIST_INSERT_HEAD_ATOMIC(&release_pool, co, pool_next);
> -            atomic_inc(&release_pool_size);
> -            return;
> -        }
>          if (alloc_pool_size < POOL_BATCH_SIZE) {
>              QSLIST_INSERT_HEAD(&alloc_pool, co, pool_next);
>              alloc_pool_size++;
>
> + invoking qemu with the following environemnet variable set:
>
> MALLOC_MMAP_THRESHOLD_=32768 qemu-system-x86_64 ....
>
> The last one makes glibc automatically using mmap when the malloced 
> memory exceeds 32kByte.
>

Peter,

I tested the above patch (and the environment variable --- it doesn't 
quite come close to as lean of
an RSS tally as the original patchset -------- there's still about 70-80 
MB of remaining RSS.

Any chance you could trim the remaining fat before merging this? =)


/*
  * Michael R. Hines
  * Senior Engineer, DigitalOcean.
  */

  parent reply	other threads:[~2016-10-31 22:00 UTC|newest]

Thread overview: 78+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2016-06-28  9:01 [Qemu-devel] [PATCH 00/15] optimize Qemu RSS usage Peter Lieven
2016-06-28  9:01 ` [Qemu-devel] [PATCH 01/15] coroutine-ucontext: mmap stack memory Peter Lieven
2016-06-28 10:02   ` Peter Maydell
2016-06-28 10:21     ` Peter Lieven
2016-06-28 11:04   ` Paolo Bonzini
2016-06-28  9:01 ` [Qemu-devel] [PATCH 02/15] coroutine-ucontext: add a switch to monitor maximum stack size Peter Lieven
2016-06-28  9:01 ` [Qemu-devel] [PATCH 03/15] coroutine-ucontext: reduce stack size to 64kB Peter Lieven
2016-06-28 10:54   ` Paolo Bonzini
2016-06-28 10:57     ` Dr. David Alan Gilbert
2016-06-28 11:17       ` Peter Lieven
2016-06-28 11:35         ` Dr. David Alan Gilbert
2016-06-28 12:09           ` Peter Lieven
2016-06-28 14:20             ` Dr. David Alan Gilbert
2016-06-30  6:34               ` Peter Lieven
2016-06-28 11:13     ` Peter Lieven
2016-06-28 11:26       ` Paolo Bonzini
2016-06-28  9:01 ` [Qemu-devel] [PATCH 04/15] coroutine: add a knob to disable the shared release pool Peter Lieven
2016-06-28 10:41   ` Paolo Bonzini
2016-06-28 10:47     ` Peter Lieven
2016-06-28  9:01 ` [Qemu-devel] [PATCH 05/15] util: add a helper to mmap private anonymous memory Peter Lieven
2016-10-16  2:10   ` Michael S. Tsirkin
2016-10-18 13:50     ` Alex Bennée
2016-06-28  9:01 ` [Qemu-devel] [PATCH 06/15] exec: use mmap for subpages Peter Lieven
2016-06-28 10:48   ` Paolo Bonzini
2016-06-28  9:01 ` [Qemu-devel] [PATCH 07/15] qapi: use mmap for QmpInputVisitor Peter Lieven
2016-06-28  9:29   ` Dr. David Alan Gilbert
2016-06-28  9:39     ` Peter Lieven
2016-06-28 10:10       ` Daniel P. Berrange
2016-06-28 10:17         ` Dr. David Alan Gilbert
2016-06-28 10:21           ` Daniel P. Berrange
2016-06-28 14:10           ` Eric Blake
2016-06-28 11:36   ` Paolo Bonzini
2016-06-28 14:14     ` Eric Blake
2016-06-30 14:12   ` Markus Armbruster
2016-07-04  9:02     ` Paolo Bonzini
2016-07-04 11:18       ` Markus Armbruster
2016-07-04 11:36         ` Peter Lieven
2016-07-04 11:42         ` Paolo Bonzini
2016-06-28  9:01 ` [Qemu-devel] [PATCH 08/15] virtio: use mmap for VirtQueue Peter Lieven
2016-06-28  9:01 ` [Qemu-devel] [PATCH 09/15] loader: use mmap for ROMs Peter Lieven
2016-06-28 10:41   ` Paolo Bonzini
2016-06-28 11:26     ` Peter Lieven
2016-07-04  7:30     ` Peter Lieven
2016-06-28  9:01 ` [Qemu-devel] [PATCH 10/15] vmware_svga: use mmap for scratch pad Peter Lieven
2016-06-28  9:01 ` [Qemu-devel] [PATCH 11/15] qom: use mmap for bigger Objects Peter Lieven
2016-06-28 10:08   ` Daniel P. Berrange
2016-06-28 10:10   ` Peter Maydell
2016-06-28 10:19     ` Peter Lieven
2016-06-28 10:42   ` Paolo Bonzini
2016-06-28 10:49     ` Peter Lieven
2016-06-30 14:15       ` Markus Armbruster
2016-06-28  9:01 ` [Qemu-devel] [PATCH 12/15] util: add a function to realloc mmapped memory Peter Lieven
2016-06-28  9:01 ` [Qemu-devel] [PATCH 13/15] exec: use mmap for PhysPageMap->nodes Peter Lieven
2016-06-28 10:43   ` Paolo Bonzini
2016-06-28 10:48     ` Peter Lieven
2016-07-11  9:31     ` Peter Lieven
2016-07-11  9:44       ` Peter Lieven
2016-07-11 10:37       ` Paolo Bonzini
2016-07-12 14:34         ` Peter Lieven
2016-07-13 10:27           ` Paolo Bonzini
2016-07-14 14:47             ` Peter Lieven
2016-06-28  9:01 ` [Qemu-devel] [PATCH 14/15] vnc-tight: make the encoding palette static Peter Lieven
2016-06-28 11:12   ` Paolo Bonzini
2016-06-28 11:18     ` Peter Lieven
2016-06-28  9:01 ` [Qemu-devel] [PATCH 15/15] vnc: use mmap for VncState Peter Lieven
2016-06-28 11:37 ` [Qemu-devel] [PATCH 00/15] optimize Qemu RSS usage Paolo Bonzini
2016-06-28 12:14   ` Peter Lieven
2016-06-28 12:29     ` Paolo Bonzini
2016-06-28 12:33       ` Peter Lieven
2016-06-28 12:56         ` Paolo Bonzini
2016-06-28 12:56         ` Dr. David Alan Gilbert
2016-06-28 14:43           ` Peter Lieven
2016-06-28 14:52             ` Peter Lieven
2016-10-12 21:18 ` Michael R. Hines
2016-10-18 10:47   ` Peter Lieven
2016-10-19 17:40     ` Michael R. Hines
2016-10-31 22:00     ` Michael R. Hines [this message]
2016-11-01 22:02       ` Michael R. Hines

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=f43fdbd2-7f8c-a241-4d24-5722d326d0e5@digitalocean.com \
    --to=mrhines@digitalocean.com \
    --cc=blemasurier@digitalocean.com \
    --cc=dgilbert@redhat.com \
    --cc=kraxel@redhat.com \
    --cc=kwolf@redhat.com \
    --cc=mreitz@redhat.com \
    --cc=mst@redhat.com \
    --cc=patrick@digitalocean.com \
    --cc=pbonzini@redhat.com \
    --cc=peter.maydell@linaro.org \
    --cc=pl@kamp.de \
    --cc=qemu-devel@nongnu.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.