From: Chris Friesen <chris.friesen@windriver.com>
To: Ladi Prosek <lprosek@redhat.com>, Stefan Hajnoczi <stefanha@gmail.com>
Cc: qemu-devel <qemu-devel@nongnu.org>,
	"Dr. David Alan Gilbert" <dgilbert@redhat.com>
Subject: Re: [Qemu-devel] hitting intermittent issue with live migration from qemu-kvm-ev 2.3.0 to qemu-kvm-ev 2.6.0
Date: Tue, 4 Apr 2017 08:28:51 -0600
Message-ID: <58E3ADA3.2040305@windriver.com>
In-Reply-To: <CABdb737NXTBJGvk+nuyFTccK_-g=N=PHO+ZRd1qSZcTgai=feQ@mail.gmail.com>

On 04/04/2017 07:56 AM, Ladi Prosek wrote:
> On Mon, Apr 3, 2017 at 9:11 PM, Stefan Hajnoczi <stefanha@gmail.com> wrote:
>> On Fri, Mar 31, 2017 at 02:12:36PM -0600, Chris Friesen wrote:

>>> Initially we have a bunch of guests running on compute-2 (which is running
>>> qemu-kvm-ev 2.3.0).  We then started live-migrating them one at a time to
>>> compute-0 (which is running qemu-kvm-ev 2.6.0).  Three of them migrated
>>> successfully.  The fourth (which was essentially identical in configuration
>>> to the first three) failed, as per the following logs in
>>> /var/log/libvirt/qemu/instance-0000000e.log:
>>>
>>>
>>> 2017-03-29T06:38:37.886940Z qemu-kvm: VQ 2 size 0x80 < last_avail_idx 0x47b - used_idx 0x47c
>>> 2017-03-29T06:38:37.886974Z qemu-kvm: error while loading state for instance 0x0 of device '0000:00:07.0/virtio-balloon'
>>> 2017-03-29T06:38:37.888684Z qemu-kvm: load of migration failed: Operation not permitted
>>> 2017-03-29 06:38:37.896+0000: shutting down
>>>
>>>
>>> Does anyone know of an existing bug report covering this issue?  (I took a
>>> look and didn't see anything obviously related.)
>>
>> This is the virtio-balloon device.  If you remove the device the live
>> migration should work reliably.
>>
>> Alternatively, you can temporarily rmmod virtio_balloon inside the guest
>> for live migration.  After migration you can modprobe virtio_balloon
>> again.
>>
>> last_avail_idx 0x47b with used_idx 0x47c is an invalid device state.
>> I've diffed qemu-kvm-ev 2.6.0-27.1 hw/virtio/virtio-balloon.c against
>> qemu.git/master and do not see an obvious bug.  I also compared
>> qemu-kvm-ev 2.3.0-31 with qemu-kvm-ev 2.6.0-27.1.
>
> The device likely got into the invalid state as part of a previous
> migration to an unfixed QEMU. I second Stefan's suggestion to
> temporarily remove the device or unload the driver.
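
For anyone following along, here is why those two indices count as an
invalid state. This is only a rough Python sketch of the kind of sanity
check the error message implies, not QEMU's actual code. The function names
are made up for illustration, and it assumes the ring indices are
free-running 16-bit counters that wrap around, as in the virtio ring layout.

```python
# Hypothetical helpers, named for illustration only (not QEMU functions).

def vq_inuse(last_avail_idx: int, used_idx: int) -> int:
    """Number of buffers taken by the device but not yet returned.

    Assumes both indices are 16-bit counters, so the subtraction
    is done modulo 2**16.
    """
    return (last_avail_idx - used_idx) & 0xFFFF


def vq_state_valid(size: int, last_avail_idx: int, used_idx: int) -> bool:
    """A queue cannot have more buffers in flight than it has entries."""
    return vq_inuse(last_avail_idx, used_idx) <= size


# Values from the log above: VQ 2, size 0x80.
size, last_avail_idx, used_idx = 0x80, 0x47B, 0x47C

print(hex(vq_inuse(last_avail_idx, used_idx)))
print(vq_state_valid(size, last_avail_idx, used_idx))
```

With used_idx one ahead of last_avail_idx, the difference wraps to 0xffff,
which is far larger than the queue size of 0x80, so the incoming state is
rejected.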

I'll give that a try (been busy with a separate issue).

If I have a guest already running, can I unilaterally hot-remove the device from 
the host side or does the guest need to be involved as well?  (I'm just trying 
to figure out how to deal with existing guests.)

Thanks,
Chris
