All of lore.kernel.org
 help / color / mirror / Atom feed
From: "Michael S. Tsirkin" <mst@redhat.com>
To: qemu-devel@nongnu.org
Cc: Peter Maydell <peter.maydell@linaro.org>,
	Richard Henderson <rth@twiddle.net>,
	"Dr . David Alan Gilbert" <dgilbert@redhat.com>,
	Paolo Bonzini <pbonzini@redhat.com>,
	David Hildenbrand <david@redhat.com>
Subject: [PULL v2 05/41] exec: Introduce ram_block_discard_(disable|require)()
Date: Sat, 4 Jul 2020 14:29:35 -0400	[thread overview]
Message-ID: <20200704182750.1088103-6-mst@redhat.com> (raw)
In-Reply-To: <20200704182750.1088103-1-mst@redhat.com>

From: David Hildenbrand <david@redhat.com>

We want to replace qemu_balloon_inhibit() by something more generic.
Especially, we want to make sure that technologies that really rely on
RAM block discards to work reliably to run mutual exclusive with
technologies that effectively break it.

E.g., vfio will usually pin all guest memory, turning the virtio-balloon
basically useless and make the VM consume more memory than reported via
the balloon. While the balloon is special already (=> no guarantees, same
behavior possible afer reboots and with huge pages), this will be
different, especially, with virtio-mem.

Let's implement a way such that we can make both types of technology run
mutually exclusive. We'll convert existing balloon inhibitors in successive
patches and add some new ones. Add the check to
qemu_balloon_is_inhibited() for now. We might want to make
virtio-balloon an acutal inhibitor in the future - however, that
requires more thought to not break existing setups.

Reviewed-by: Dr. David Alan Gilbert <dgilbert@redhat.com>
Cc: "Michael S. Tsirkin" <mst@redhat.com>
Cc: Richard Henderson <rth@twiddle.net>
Cc: Paolo Bonzini <pbonzini@redhat.com>
Signed-off-by: David Hildenbrand <david@redhat.com>
Message-Id: <20200626072248.78761-3-david@redhat.com>
Reviewed-by: Michael S. Tsirkin <mst@redhat.com>
Signed-off-by: Michael S. Tsirkin <mst@redhat.com>
---
 include/exec/memory.h | 41 ++++++++++++++++++++++++++++++++++
 balloon.c             |  3 ++-
 exec.c                | 52 +++++++++++++++++++++++++++++++++++++++++++
 3 files changed, 95 insertions(+), 1 deletion(-)

diff --git a/include/exec/memory.h b/include/exec/memory.h
index 7207025bd4..38ec38b9a8 100644
--- a/include/exec/memory.h
+++ b/include/exec/memory.h
@@ -2472,6 +2472,47 @@ static inline MemOp devend_memop(enum device_endian end)
 }
 #endif
 
+/*
+ * Inhibit technologies that require discarding of pages in RAM blocks, e.g.,
+ * to manage the actual amount of memory consumed by the VM (then, the memory
+ * provided by RAM blocks might be bigger than the desired memory consumption).
+ * This *must* be set if:
+ * - Discarding parts of a RAM blocks does not result in the change being
+ *   reflected in the VM and the pages getting freed.
+ * - All memory in RAM blocks is pinned or duplicated, invaldiating any previous
+ *   discards blindly.
+ * - Discarding parts of a RAM blocks will result in integrity issues (e.g.,
+ *   encrypted VMs).
+ * Technologies that only temporarily pin the current working set of a
+ * driver are fine, because we don't expect such pages to be discarded
+ * (esp. based on guest action like balloon inflation).
+ *
+ * This is *not* to be used to protect from concurrent discards (esp.,
+ * postcopy).
+ *
+ * Returns 0 if successful. Returns -EBUSY if a technology that relies on
+ * discards to work reliably is active.
+ */
+int ram_block_discard_disable(bool state);
+
+/*
+ * Inhibit technologies that disable discarding of pages in RAM blocks.
+ *
+ * Returns 0 if successful. Returns -EBUSY if discards are already set to
+ * broken.
+ */
+int ram_block_discard_require(bool state);
+
+/*
+ * Test if discarding of memory in ram blocks is disabled.
+ */
+bool ram_block_discard_is_disabled(void);
+
+/*
+ * Test if discarding of memory in ram blocks is required to work reliably.
+ */
+bool ram_block_discard_is_required(void);
+
 #endif
 
 #endif
diff --git a/balloon.c b/balloon.c
index f104b42961..5fff79523a 100644
--- a/balloon.c
+++ b/balloon.c
@@ -40,7 +40,8 @@ static int balloon_inhibit_count;
 
 bool qemu_balloon_is_inhibited(void)
 {
-    return atomic_read(&balloon_inhibit_count) > 0;
+    return atomic_read(&balloon_inhibit_count) > 0 ||
+           ram_block_discard_is_disabled();
 }
 
 void qemu_balloon_inhibit(bool state)
diff --git a/exec.c b/exec.c
index 21926dc9c7..893636176e 100644
--- a/exec.c
+++ b/exec.c
@@ -4115,4 +4115,56 @@ void mtree_print_dispatch(AddressSpaceDispatch *d, MemoryRegion *root)
     }
 }
 
+/*
+ * If positive, discarding RAM is disabled. If negative, discarding RAM is
+ * required to work and cannot be disabled.
+ */
+static int ram_block_discard_disabled;
+
+int ram_block_discard_disable(bool state)
+{
+    int old;
+
+    if (!state) {
+        atomic_dec(&ram_block_discard_disabled);
+        return 0;
+    }
+
+    do {
+        old = atomic_read(&ram_block_discard_disabled);
+        if (old < 0) {
+            return -EBUSY;
+        }
+    } while (atomic_cmpxchg(&ram_block_discard_disabled, old, old + 1) != old);
+    return 0;
+}
+
+int ram_block_discard_require(bool state)
+{
+    int old;
+
+    if (!state) {
+        atomic_inc(&ram_block_discard_disabled);
+        return 0;
+    }
+
+    do {
+        old = atomic_read(&ram_block_discard_disabled);
+        if (old > 0) {
+            return -EBUSY;
+        }
+    } while (atomic_cmpxchg(&ram_block_discard_disabled, old, old - 1) != old);
+    return 0;
+}
+
+bool ram_block_discard_is_disabled(void)
+{
+    return atomic_read(&ram_block_discard_disabled) > 0;
+}
+
+bool ram_block_discard_is_required(void)
+{
+    return atomic_read(&ram_block_discard_disabled) < 0;
+}
+
 #endif
-- 
MST



  parent reply	other threads:[~2020-07-04 18:34 UTC|newest]

Thread overview: 55+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2020-07-04 18:29 [PULL v2 00/41] virtio,acpi: features, fixes, cleanups Michael S. Tsirkin
2020-07-04 18:29 ` [PULL v2 01/41] tests: disassemble-aml.sh: generate AML in readable format Michael S. Tsirkin
2020-07-04 18:29 ` [PULL v2 02/41] Revert "tests/migration: Reduce autoconverge initial bandwidth" Michael S. Tsirkin
2020-07-04 18:29 ` [PULL v2 03/41] virtio-balloon: always indicate S_DONE when migration fails Michael S. Tsirkin
2020-07-04 18:29 ` [PULL v2 04/41] pc: Support coldplugging of virtio-pmem-pci devices on all buses Michael S. Tsirkin
2020-07-04 18:29 ` Michael S. Tsirkin [this message]
2020-07-04 18:29 ` [PULL v2 06/41] vfio: Convert to ram_block_discard_disable() Michael S. Tsirkin
2020-07-04 18:29 ` [PULL v2 07/41] accel/kvm: " Michael S. Tsirkin
2020-07-04 18:29   ` Michael S. Tsirkin
2020-07-04 18:29 ` [PULL v2 08/41] s390x/pv: " Michael S. Tsirkin
2020-07-04 18:29 ` [PULL v2 09/41] virtio-balloon: Rip out qemu_balloon_inhibit() Michael S. Tsirkin
2020-07-04 18:29 ` [PULL v2 10/41] target/i386: sev: Use ram_block_discard_disable() Michael S. Tsirkin
2020-07-04 18:29 ` [PULL v2 11/41] migration/rdma: " Michael S. Tsirkin
2020-07-04 18:29 ` [PULL v2 12/41] migration/colo: " Michael S. Tsirkin
2020-07-04 18:29 ` [PULL v2 13/41] virtio-mem: Paravirtualized memory hot(un)plug Michael S. Tsirkin
2020-07-04 18:30 ` [PULL v2 14/41] virtio-pci: Proxy for virtio-mem Michael S. Tsirkin
2020-07-04 18:30 ` [PULL v2 15/41] MAINTAINERS: Add myself as virtio-mem maintainer Michael S. Tsirkin
2020-07-04 18:30 ` [PULL v2 16/41] hmp: Handle virtio-mem when printing memory device info Michael S. Tsirkin
2020-07-04 18:30 ` [PULL v2 17/41] numa: Handle virtio-mem in NUMA stats Michael S. Tsirkin
2020-07-04 18:30 ` [PULL v2 18/41] pc: Support for virtio-mem-pci Michael S. Tsirkin
2020-07-04 18:30 ` [PULL v2 19/41] virtio-mem: Allow notifiers for size changes Michael S. Tsirkin
2020-07-04 18:30 ` [PULL v2 20/41] virtio-pci: Send qapi events when the virtio-mem " Michael S. Tsirkin
2020-07-04 18:30 ` [PULL v2 21/41] virtio-mem: Migration sanity checks Michael S. Tsirkin
2020-07-04 18:30 ` [PULL v2 22/41] virtio-mem: Add trace events Michael S. Tsirkin
2020-07-04 18:30 ` [PULL v2 23/41] virtio-mem: Exclude unplugged memory during migration Michael S. Tsirkin
2020-07-04 18:30 ` [PULL v2 24/41] numa: Auto-enable NUMA when any memory devices are possible Michael S. Tsirkin
2020-07-04 18:30 ` [PULL v2 25/41] tests/acpi: remove stale allowed tables Michael S. Tsirkin
2020-07-04 18:30 ` [PULL v2 26/41] docs: vhost-user: add Virtio status protocol feature Michael S. Tsirkin
2020-07-04 18:30 ` [PULL v2 27/41] MAINTAINERS: add VT-d entry Michael S. Tsirkin
2020-07-04 18:30 ` [PULL v2 28/41] net: introduce qemu_get_peer Michael S. Tsirkin
2020-07-04 18:30 ` [PULL v2 29/41] vhost_net: use the function qemu_get_peer Michael S. Tsirkin
2020-07-04 18:30 ` [PULL v2 30/41] virtio-bus: introduce queue_enabled method Michael S. Tsirkin
2020-07-04 18:30 ` [PULL v2 31/41] virtio-pci: implement " Michael S. Tsirkin
2020-07-27 13:51   ` Laurent Vivier
2020-07-27 14:00     ` Michael S. Tsirkin
2020-07-27 14:06       ` Laurent Vivier
2020-07-04 18:30 ` [PULL v2 32/41] vhost: check the existence of vhost_set_iotlb_callback Michael S. Tsirkin
2020-07-04 18:30 ` [PULL v2 33/41] vhost: introduce new VhostOps vhost_dev_start Michael S. Tsirkin
2020-07-04 18:30 ` [PULL v2 34/41] vhost: implement vhost_dev_start method Michael S. Tsirkin
2020-07-04 18:30 ` [PULL v2 35/41] vhost: introduce new VhostOps vhost_vq_get_addr Michael S. Tsirkin
2020-07-04 18:30 ` [PULL v2 36/41] vhost: implement vhost_vq_get_addr method Michael S. Tsirkin
2020-07-04 18:30 ` [PULL v2 37/41] vhost: introduce new VhostOps vhost_force_iommu Michael S. Tsirkin
2020-07-04 18:31 ` [PULL v2 38/41] vhost: implement vhost_force_iommu method Michael S. Tsirkin
2020-07-04 18:31 ` [PULL v2 39/41] vhost_net: introduce set_config & get_config Michael S. Tsirkin
2020-07-04 18:31 ` [PULL v2 40/41] vhost-vdpa: introduce vhost-vdpa backend Michael S. Tsirkin
2020-07-09 15:14   ` Peter Maydell
2020-07-10  2:07     ` Cindy Lu
2020-07-04 18:31 ` [PULL v2 41/41] vhost-vdpa: introduce vhost-vdpa net client Michael S. Tsirkin
2020-07-07 11:40 ` [PULL v2 00/41] virtio,acpi: features, fixes, cleanups Peter Maydell
2020-07-07 12:03   ` Michael S. Tsirkin
2020-07-07 17:50     ` Peter Maydell
2020-07-08  6:45       ` Michael S. Tsirkin
2020-07-08 10:19         ` Peter Maydell
2020-07-09 10:32       ` Peter Maydell
2020-07-09 17:46         ` Michael S. Tsirkin

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20200704182750.1088103-6-mst@redhat.com \
    --to=mst@redhat.com \
    --cc=david@redhat.com \
    --cc=dgilbert@redhat.com \
    --cc=pbonzini@redhat.com \
    --cc=peter.maydell@linaro.org \
    --cc=qemu-devel@nongnu.org \
    --cc=rth@twiddle.net \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.