From: "Michael S. Tsirkin" <mst@redhat.com>
To: Matthew Wilcox <willy@infradead.org>
Cc: Andrea Arcangeli <aarcange@redhat.com>,
	virtio-dev@lists.oasis-open.org, Amit Shah <amit.shah@redhat.com>,
	kvm@vger.kernel.org, Dave Hansen <dave.hansen@intel.com>,
	linux-kernel@vger.kernel.org, Liang Li <liang.z.li@intel.com>,
	qemu-devel@nongnu.org, virtualization@lists.linux-foundation.org,
	Liang Li <liliang324@gmail.com>,
	linux-mm@kvack.org, Cornelia Huck <cornelia.huck@de.ibm.com>,
	Paolo Bonzini <pbonzini@redhat.com>
Subject: Re: [PATCH v7 kernel 3/5] virtio-balloon: implementation of VIRTIO_BALLOON_F_CHUNK_TRANSFER
Date: Fri, 10 Mar 2017 21:10:53 +0200
Message-ID: <20170310211037-mutt-send-email-mst__35548.4680542306$1489173065$gmane$org@kernel.org>
In-Reply-To: <20170310171143.GA16328@bombadil.infradead.org>

On Fri, Mar 10, 2017 at 09:11:44AM -0800, Matthew Wilcox wrote:
> On Fri, Mar 10, 2017 at 05:58:28PM +0200, Michael S. Tsirkin wrote:
> > One of the issues of current balloon is the 4k page size
> > assumption. For example if you free a huge page you
> > have to split it up and pass 4k chunks to host.
> > Quite often host can't free these 4k chunks at all (e.g.
> > when it's using huge tlb fs).
> > It's even sillier for architectures with base page size >4k.
> 
> I completely agree with you that we should be able to pass a hugepage
> as a single chunk.  Also we shouldn't assume that host and guest have
> the same page size.  I think we can come up with a scheme that actually
> lets us encode that into a 64-bit word, something like this:
> 
> bit 0 clear => bits 1-11 encode a page count, bits 12-63 encode a PFN, page size 4k.
> bit 0 set, bit 1 clear => bits 2-12 encode a page count, bits 13-63 encode a PFN, page size 8k
> bits 0+1 set, bit 2 clear => bits 3-13 for page count, bits 14-63 for PFN, page size 16k.
> bits 0-2 set, bit 3 clear => bits 4-14 for page count, bits 15-63 for PFN, page size 32k
> bits 0-3 set, bit 4 clear => bits 5-15 for page count, bits 16-63 for PFN, page size 64k

huge page sizes go up to gigabytes.
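
For illustration, here is a minimal sketch of how a receiver might decode a chunk word if the pattern quoted above is continued past 64k: the number of trailing one bits selects the page size, the next 11 bits hold the count, and the rest is the PFN. The names chunk_info and chunk_decode are hypothetical and not part of the patch series. The sketch also assumes the count field stores the number of pages minus one. Under those assumptions a 1G page with a 4k base would be order 18: 18 low bits set, count in bits 19-29, PFN in bits 30-63.

```c
#include <stdint.h>

/* Hypothetical decoded form of one chunk word (illustration only). */
struct chunk_info {
	unsigned int page_shift;	/* log2 of the page size, 12 for 4k */
	unsigned int nr_pages;		/* 1..2048 */
	uint64_t addr;			/* byte address of the first page */
};

/* Returns 0 on success, -1 if the word leaves no room for a PFN field. */
static int chunk_decode(uint64_t chunk, struct chunk_info *ci)
{
	unsigned int order = 0;

	/* Trailing one bits select the page size: 4k << order. */
	while (order <= 51 && (chunk & (1ULL << order)))
		order++;
	if (order > 51)
		return -1;

	ci->page_shift = 12 + order;
	/* Assumption: an 11-bit count field of 0 means one page. */
	ci->nr_pages = (unsigned int)((chunk >> (order + 1)) & 0x7ff) + 1;
	/* The PFN is in units of the page size; convert it back to bytes. */
	ci->addr = (chunk >> ci->page_shift) << ci->page_shift;
	return 0;
}
```

Since the PFN field always starts at bit 12 + order, the addressable range in bytes stays the same as the page size grows; only the granularity changes.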

> That means we can always pass 2048 pages (of whatever page size) in a single chunk.  And
> we support arbitrary power of two page sizes.  I suggest something like this:
> 
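> u64 page_to_chunk(struct page *page)
> {
> 	/* PFN field: physical address of the (naturally aligned) compound page */
> 	u64 chunk = (u64)page_to_pfn(page) << PAGE_SHIFT;
> 	/* Size selector: low compound_order() bits set, next bit clear */
> 	chunk |= (1ULL << compound_order(page)) - 1;
> 	return chunk;
> }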
> 
> (note this is a single page of order N, so we leave the page count bits
> set to 0, meaning one page).
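
To make the bit layout concrete, here is a userspace sketch of an encoder that also fills in the page count. It is not taken from the patch series: chunk_encode and the CHUNK_* macros are hypothetical names, and it assumes a 4k base size, an 11-bit count field that stores the number of pages minus one, and a start address aligned to the page size.

```c
#include <stdint.h>

#define CHUNK_BASE_SHIFT 12	/* assumed base page size of 4k */
#define CHUNK_COUNT_BITS 11	/* up to 2048 pages per chunk */

/*
 * Hypothetical helper: pack nr_pages pages of size (4k << order),
 * starting at byte address addr, into one 64-bit chunk word.
 * Returns 0 on success, -1 if the arguments do not fit the layout.
 */
static int chunk_encode(uint64_t addr, unsigned int order,
			unsigned int nr_pages, uint64_t *out)
{
	unsigned int shift = CHUNK_BASE_SHIFT + order;

	if (order > 51 || nr_pages < 1 || nr_pages > (1u << CHUNK_COUNT_BITS))
		return -1;
	/* The address must be aligned to the page size being described. */
	if (addr & ((1ULL << shift) - 1))
		return -1;

	*out = addr					/* PFN field: bits shift..63 */
	     | ((uint64_t)(nr_pages - 1) << (order + 1))	/* count field */
	     | ((1ULL << order) - 1);			/* size selector bits */
	return 0;
}
```

With these assumptions, a single 2M page (order 9) at address 0x40000000 encodes as 0x400001ff, and 2048 such pages starting at the same address encode as 0x401ffdff.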
> 
> > Two things to consider:
> > - host should pass its base page size to guest
> >   this can be a separate patch and for now we can fall back on 12 bit if not there
> 
> With this encoding scheme, I don't think we need to do this?  As long as
> it's *at least* 12 bit, then we're fine.
> 
> > - guest should pass full huge pages to host
> >   this should be done correctly to avoid breaking up huge pages
> >   I would say yes let's use a single format but drop the "normal chunk"
> >   and always use the extended one.
> >   Also, size is in units of 4k, right? Please document that low 12 bit
> >   are reserved, they will be handy as e.g. flags.
> 
> What per-chunk flags are you thinking would be useful?
