From: Chris J Arges <1297218@bugs.launchpad.net>
To: qemu-devel@nongnu.org
Subject: [Qemu-devel] [Bug 1297218] Re: guest hangs after live migration due to tsc jump
Date: Wed, 06 Jul 2016 13:44:24 -0000
Message-ID: <20160706134424.18528.94062.malone@wampee.canonical.com>
In-Reply-To: 20140325110518.5612.54837.malonedeb@chaenomeles.canonical.com

Hello Paul, or anyone else affected,

Accepted qemu into trusty-proposed. The package will build now and be
available in a few hours at
https://launchpad.net/ubuntu/+source/qemu/2.0.0+dfsg-2ubuntu1.25
and then in the -proposed repository.

Please help us by testing this new package.  See
https://wiki.ubuntu.com/Testing/EnableProposed for documentation on how
to enable and use -proposed.  Your feedback will aid us in getting this
update out to other Ubuntu users.

If this package fixes the bug for you, please add a comment to this bug,
mentioning the version of the package you tested, and change the tag
from verification-needed to verification-done. If it does not fix the
bug for you, please add a comment stating that, and change the tag to
verification-failed.  In either case, details of your testing will help
us make a better decision.

Further information regarding the verification process can be found at
https://wiki.ubuntu.com/QATeam/PerformingSRUVerification .  Thank you in
advance!

** Changed in: qemu (Ubuntu Trusty)
       Status: Confirmed => Fix Committed

** Tags added: verification-needed

-- 
You received this bug notification because you are a member of
qemu-devel-ml, which is subscribed to QEMU.
https://bugs.launchpad.net/bugs/1297218

Title:
  guest hangs after live migration due to tsc jump

Status in QEMU:
  New
Status in glusterfs package in Ubuntu:
  Invalid
Status in qemu package in Ubuntu:
  Fix Released
Status in glusterfs source package in Trusty:
  Confirmed
Status in qemu source package in Trusty:
  Fix Committed

Bug description:
  =====================================
  SRU Justification:
  1. Impact: guests hang after live migration with 100% CPU
  2. Upstream fix: a set of four patches fixes this upstream
  3. Stable fix: we have a backport of the four patches into a single patch.
  4. Test case: try a set of migrations of different VMs (it is unfortunately not 100% reproducible)
  5. Regression potential: the patch is not trivial; however, the lp:qa-regression-tests testsuite passed 100% with this package.
  =====================================
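
  The test case in item 4 could be driven by something like the
  following minimal sketch. It only builds the virsh command lines for
  repeated back-and-forth live migrations and prints them. The guest
  name, the host names and the qemu+ssh connection URIs are hypothetical
  placeholders, not values taken from this report.

```python
import shlex


def migration_commands(domain, host_a, host_b, round_trips):
    """Build virsh commands that live-migrate `domain` A -> B -> A, repeatedly.

    Host names and the qemu+ssh URI scheme are assumptions for illustration.
    """
    cmds = []
    src, dst = host_a, host_b
    for _ in range(2 * round_trips):
        cmds.append([
            "virsh", "-c", f"qemu+ssh://{src}/system",
            "migrate", "--live", domain,
            f"qemu+ssh://{dst}/system",
        ])
        # The next migration goes in the opposite direction.
        src, dst = dst, src
    return cmds


# Print the commands instead of running them; "guest1", "server-a" and
# "server-b" are made-up names.
for cmd in migration_commands("guest1", "server-a", "server-b", 1):
    print(shlex.join(cmd))
```

  Since the hang is not 100% reproducible, several guests and many round
  trips are likely needed before a pass says much.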

  We have two identical Ubuntu servers running libvirt/kvm/qemu, sharing
  a Gluster filesystem. Guests can be live migrated between them.
  However, live migration often leaves the guest stuck at 100% CPU for a
  while. In that case, the dmesg output for such a guest will show (once
  it recovers): Clocksource tsc unstable (delta = 662463064082 ns). In
  this particular example, a guest was migrated and only became
  responsive again after 11 minutes (662 seconds).
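
  As a quick check of the arithmetic above, here is a minimal sketch that
  pulls the delta out of such a dmesg line and converts it to seconds and
  minutes. The regular expression matches only the wording quoted in
  this report; other kernel versions may phrase the message differently.

```python
import re

# Matches the message as quoted in this report (an assumption about its
# exact wording on other kernels).
DELTA_RE = re.compile(r"Clocksource tsc unstable \(delta = (-?\d+) ns\)")


def tsc_delta_seconds(line):
    """Return the reported delta in seconds, or None if the line does not match."""
    match = DELTA_RE.search(line)
    if match is None:
        return None
    return int(match.group(1)) / 1e9


line = "Clocksource tsc unstable (delta = 662463064082 ns)"
seconds = tsc_delta_seconds(line)
minutes, rest = divmod(seconds, 60)
print(f"{seconds:.1f} s = {int(minutes)} min {rest:.0f} s")
```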

  It seems that newly booted guests do not suffer from this problem;
  they can be migrated back and forth at will. After a day or so, the
  problem becomes apparent. It also seems that migrating from server A
  to server B causes many more problems than going from B back to A. If
  necessary, I can do more measurements to quantify these observations.

  The VM servers run Ubuntu 13.04 with these packages:
  Kernel: 3.8.0-35-generic x86_64
  Libvirt: 1.0.2
  Qemu: 1.4.0
  Gluster-fs: 3.4.2 (libvirt accesses the images via the filesystem, not using libgfapi yet, as the Ubuntu libvirt is not linked against libgfapi).
  The interconnect between both machines (both for migration and gluster) is 10GbE.
  Both servers are synced to NTP and well within 1ms of one another.

  Guests are either Ubuntu 13.04 or 13.10.

  On the guests, the current_clocksource is kvm-clock.
  The XML definition of the guests only contains:  <clock offset='utc'/>
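
  For anyone wanting to confirm the clocksource on their own guests, a
  minimal sketch: it reads the standard Linux sysfs files for the active
  and available clocksources. The helper name read_clocksource is made
  up for this example.

```python
from pathlib import Path

# Standard sysfs location on Linux; present on typical guests.
CLOCKSOURCE_DIR = Path("/sys/devices/system/clocksource/clocksource0")


def read_clocksource(base=CLOCKSOURCE_DIR):
    """Return (current, available) clocksources read from sysfs under `base`."""
    base = Path(base)
    current = (base / "current_clocksource").read_text().strip()
    available = (base / "available_clocksource").read_text().split()
    return current, available


if __name__ == "__main__":
    if CLOCKSOURCE_DIR.is_dir():
        current, available = read_clocksource()
        print("current:  ", current)
        print("available:", ", ".join(available))
    else:
        print("no clocksource sysfs directory found (not a Linux system?)")
```

  On an affected guest as described here, the current value would be
  expected to read kvm-clock.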

  As far as I've read in the documentation of kvm-clock, it specifically
  supports live migration, so I'm a bit surprised at these problems.
  There isn't much information to be found on this issue, although I have
  found postings by others who seem to have run into the same problem,
  but without a solution.
  ---
  ApportVersion: 2.14.1-0ubuntu3
  Architecture: amd64
  DistroRelease: Ubuntu 14.04
  Package: libvirt (not installed)
  ProcCmdline: BOOT_IMAGE=/boot/vmlinuz-3.13.0-24-generic root=UUID=1b0c3c6d-a9b8-4e84-b076-117ae267d178 ro console=ttyS1,115200n8 BOOTIF=01-00-25-90-75-b5-c8
  ProcVersionSignature: Ubuntu 3.13.0-24.47-generic 3.13.9
  Tags:  trusty apparmor apparmor apparmor apparmor apparmor
  Uname: Linux 3.13.0-24-generic x86_64
  UpgradeStatus: No upgrade log present (probably fresh install)
  UserGroups:

  _MarkForUpload: True
  modified.conffile..etc.default.libvirt.bin: [modified]
  modified.conffile..etc.libvirt.libvirtd.conf: [modified]
  modified.conffile..etc.libvirt.qemu.conf: [modified]
  modified.conffile..etc.libvirt.qemu.networks.default.xml: [deleted]
  mtime.conffile..etc.default.libvirt.bin: 2014-05-12T19:07:40.020662
  mtime.conffile..etc.libvirt.libvirtd.conf: 2014-05-13T14:40:25.894837
  mtime.conffile..etc.libvirt.qemu.conf: 2014-05-12T18:58:27.885506

To manage notifications about this bug go to:
https://bugs.launchpad.net/qemu/+bug/1297218/+subscriptions
