All of lore.kernel.org
 help / color / mirror / Atom feed
From: Juan Quintela <quintela@redhat.com>
To: "Dr. David Alan Gilbert" <dgilbert@redhat.com>
Cc: Laurent Vivier <lvivier@redhat.com>,
	"Michael S. Tsirkin" <mst@redhat.com>,
	Jason Wang <jasowang@redhat.com>,
	qemu-devel@nongnu.org, Markus Armbruster <armbru@redhat.com>,
	Eric Blake <eblake@redhat.com>
Subject: Re: [PATCH] failover: allow to pause the VM during the migration
Date: Fri, 29 Oct 2021 15:49:38 +0200	[thread overview]
Message-ID: <87mtmsrle5.fsf@secure.mitica> (raw)
In-Reply-To: <YWguh5+m84rldUzg@work-vm> (David Alan Gilbert's message of "Thu,  14 Oct 2021 14:20:07 +0100")

"Dr. David Alan Gilbert" <dgilbert@redhat.com> wrote:
> * Laurent Vivier (lvivier@redhat.com) wrote:
>> If we want to save a snapshot of a VM to a file, we used to follow the
>> following steps:
>> 
>> 1- stop the VM:
>>    (qemu) stop
>> 
>> 2- migrate the VM to a file:
>>    (qemu) migrate "exec:cat > snapshot"
>> 
>> 3- resume the VM:
>>    (qemu) cont
>> 
>> After that we can restore the snapshot with:
>>   qemu-system-x86_64 ... -incoming "exec:cat snapshot"
>>   (qemu) cont
>> 
>> But when failover is configured, it doesn't work anymore.
>> 
>> As the failover needs to ask the guest OS to unplug the card
>> the machine cannot be paused.
>> 
>> This patch introduces a new migration parameter, "pause-vm", that
>> asks the migration to pause the VM during the migration startup
>> phase after the the card is unplugged.
>> 
>> Once the migration is done, we only need to resume the VM with
>> "cont" and the card is plugged back:
>> 
>> 1- set the parameter:
>>    (qemu) migrate_set_parameter pause-vm on
>> 
>> 2- migrate the VM to a file:
>>    (qemu) migrate "exec:cat > snapshot"
>> 
>>    The primary failover card (VFIO) is unplugged and the VM is paused.
>> 
>> 3- resume the VM:
>>    (qemu) cont
>> 
>>    The VM restarts and the primary failover card is plugged back
>> 
>> The VM state sent in the migration stream is "paused", it means
>> when the snapshot is loaded or if the stream is sent to a destination
>> QEMU, the VM needs to be resumed manually.
>> 
>> Signed-off-by: Laurent Vivier <lvivier@redhat.com>
>
> A mix of comments:
>   a) As a boolean, this should be a MigrationCapability rather than a
> parameter
>   b) We already have a pause-before-switchover capability for a pause
> that happens later in the flow; so this would be something like
> pause-after-unplug
>   c) Is this really the right answer?  Could this be done a different
> way by doing the unplugs using (a possibly new) qmp command - so
> that you can explicitly trigger the unplug prior to the migration?

Not if you want the wait to be minimal.
What managedsave wants to do is doing the migration with the guest
stopped.  And wait for it until the last moment.

Doing this is qemu is "relatively" simple.  Doing that on libvirt is
extremely complex, because you basically have to :
- unplug the device
- wait for unplug to finish
- stop the guest
- migrate paused
- (restart the guest)

If you do it in libvirt, you are increasing the time betwee wait for
unplug to finish and stop the guest.  But the biggest problem is what
happens if the migration (or anything else fails).
qemu failover code already knows how to handle the stop/continuation of
the vfio device.  It is what happens on a normal run.  If you do this on
libvirt, it needs to be able to recover for all scenarios, what is much
more complex in my hunble opinion.

Later, Juan.



  reply	other threads:[~2021-10-29 13:51 UTC|newest]

Thread overview: 8+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2021-09-30 17:09 [PATCH] failover: allow to pause the VM during the migration Laurent Vivier
2021-09-30 20:17 ` Laine Stump
2021-10-01  6:48   ` Laurent Vivier
2021-10-01  7:37   ` Peter Krempa
2021-10-01  9:01   ` Daniel P. Berrangé
2021-10-14 13:20 ` Dr. David Alan Gilbert
2021-10-29 13:49   ` Juan Quintela [this message]
2021-10-29 13:56 ` Juan Quintela

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=87mtmsrle5.fsf@secure.mitica \
    --to=quintela@redhat.com \
    --cc=armbru@redhat.com \
    --cc=dgilbert@redhat.com \
    --cc=eblake@redhat.com \
    --cc=jasowang@redhat.com \
    --cc=lvivier@redhat.com \
    --cc=mst@redhat.com \
    --cc=qemu-devel@nongnu.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.