From: Rafael Aquini <aquini@redhat.com>
To: "Michael S. Tsirkin" <mst@redhat.com>
Cc: "Paul E. McKenney" <paulmck@linux.vnet.ibm.com>,
	Peter Zijlstra <peterz@infradead.org>,
	linux-mm@kvack.org, linux-kernel@vger.kernel.org,
	virtualization@lists.linux-foundation.org,
	Rusty Russell <rusty@rustcorp.com.au>,
	Rik van Riel <riel@redhat.com>, Mel Gorman <mel@csn.ul.ie>,
	Andi Kleen <andi@firstfloor.org>,
	Andrew Morton <akpm@linux-foundation.org>,
	Konrad Rzeszutek Wilk <konrad.wilk@oracle.com>,
	Minchan Kim <minchan@kernel.org>
Subject: Re: [PATCH v8 1/5] mm: introduce a common interface for balloon pages mobility
Date: Thu, 23 Aug 2012 21:33:53 -0300
Message-ID: <20120824003353.GG10777@t510.redhat.com>
In-Reply-To: <20120823233616.GB2775@redhat.com>

On Fri, Aug 24, 2012 at 02:36:16AM +0300, Michael S. Tsirkin wrote:
> I would wake it each time after adding a page, then it
> can stop waiting when it leaks enough.
> But again, it's cleaner to just keep tracking all
> pages, let mm hang on to them by keeping a reference.
> 
Here is a rough idea of how it's shaping up:

Basically, I'm introducing an atomic counter to track isolated pages, and I
have also changed vb->num_pages into an atomic counter. All inc/dec operations
take place under the pages_lock spinlock, and we only do work on a page while
holding its page lock.

The wait part is still missing (I'll write it over the weekend); once it is in,
your concerns (and mine) should be addressed, IMHO.

---8<---
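+/*
+ * Wait for isolated pages to come back to the vb->pages list when the leak
+ * target cannot be met from the pages currently on it.
+ * Called with vb->pages_lock held; the lock is dropped while waiting.
+ */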
+static inline void __wait_on_isolated_pages(struct virtio_balloon *vb,
+                                           size_t num)
+{
+       /* There are no isolated pages for this balloon device */
+       if (!atomic_read(&vb->num_isolated_pages))
+               return;
+
+       /* the leak target is smaller than # of pages on vb->pages list */
+       if (num < (atomic_read(&vb->num_pages) -
+           atomic_read(&vb->num_isolated_pages)))
+               return;
+       else {
+               spin_unlock(&vb->pages_lock);
+               /* wait stuff goes here */
+               spin_lock(&vb->pages_lock);
+       }
+}
+
 static void leak_balloon(struct virtio_balloon *vb, size_t num)
 {
-       struct page *page;
+       /* The array of pfns we tell the Host about. */
+       unsigned int num_pfns;
+       u32 pfns[VIRTIO_BALLOON_ARRAY_PFNS_MAX];

        /* We can only do one array worth at a time. */
-       num = min(num, ARRAY_SIZE(vb->pfns));
+       num = min(num, ARRAY_SIZE(pfns));

-       for (vb->num_pfns = 0; vb->num_pfns < num;
-            vb->num_pfns += VIRTIO_BALLOON_PAGES_PER_PAGE) {
-               page = list_first_entry(&vb->pages, struct page, lru);
-               list_del(&page->lru);
-               set_page_pfns(vb->pfns + vb->num_pfns, page);
-               vb->num_pages -= VIRTIO_BALLOON_PAGES_PER_PAGE;
+       for (num_pfns = 0; num_pfns < num;
+            num_pfns += VIRTIO_BALLOON_PAGES_PER_PAGE) {
+               struct page *page = NULL;
+               spin_lock(&vb->pages_lock);
+               __wait_on_isolated_pages(vb, num);
+
+               if (!list_empty(&vb->pages))
+                       page = list_first_entry(&vb->pages, struct page, lru);
+               /*
+                * Grab the page lock to avoid racing against threads isolating
+                * pages from, or migrating pages back to vb->pages list.
+                * (both tasks are done under page lock protection)
+                *
+                * Failing to grab the page lock here means this page is being
+                * isolated already, or its migration has not finished yet.
+                */
+               if (page && trylock_page(page)) {
+                       clear_balloon_mapping(page);
+                       list_del(&page->lru);
+                       set_page_pfns(pfns + num_pfns, page);
+                       atomic_sub(VIRTIO_BALLOON_PAGES_PER_PAGE,
+                                  &vb->num_pages);
+                       unlock_page(page);
+               }
+               spin_unlock(&vb->pages_lock);
        }

        /*
@@ -182,8 +251,10 @@ static void leak_balloon(struct virtio_balloon *vb, size_t num)
         * virtio_has_feature(vdev, VIRTIO_BALLOON_F_MUST_TELL_HOST);
         * is true, we *have* to do it in this order
         */
+       mutex_lock(&vb->balloon_lock);
        tell_host(vb, vb->deflate_vq);
-       release_pages_by_pfn(vb->pfns, vb->num_pfns);
+       mutex_unlock(&vb->balloon_lock);
+       release_pages_by_pfn(pfns, num_pfns);
 }
---8<---
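
For readers following the counters, the early-return test in
__wait_on_isolated_pages() can be modelled in plain userspace C. This is only
a sketch under stated assumptions, not the driver code: struct
balloon_counters and must_wait_for_isolated() are hypothetical names, C11
atomics stand in for the kernel's atomic_t, and a non-positive difference is
treated here as "must wait", whereas the comparison in the patch would convert
a negative int to a large size_t and return early.

```c
#include <stdatomic.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical userspace stand-in for the two counters in the patch. */
struct balloon_counters {
	atomic_int num_pages;          /* pages accounted to the balloon */
	atomic_int num_isolated_pages; /* pages currently isolated */
};

/*
 * Returns true when the leak path would have to wait for isolated pages:
 * some pages are isolated, and the leak target is not smaller than the
 * number of pages still available (num_pages - num_isolated_pages).
 */
static bool must_wait_for_isolated(struct balloon_counters *c, size_t num)
{
	int isolated = atomic_load(&c->num_isolated_pages);
	int available = atomic_load(&c->num_pages) - isolated;

	/* No isolated pages: nothing to wait for. */
	if (isolated == 0)
		return false;

	/* Assumption: only compare once the difference is known positive. */
	if (available > 0 && num < (size_t)available)
		return false;

	return true;
}
```

With num_pages = 10 and num_isolated_pages = 4, a leak target of 5 proceeds
immediately, while a target of 6 or more would wait.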
