From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-0.8 required=3.0 tests=HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,SPF_HELO_NONE,SPF_PASS autolearn=no autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id 797D4C35280 for ; Wed, 2 Oct 2019 09:04:54 +0000 (UTC) Received: from lists.gnu.org (lists.gnu.org [209.51.188.17]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPS id 49A6C21783 for ; Wed, 2 Oct 2019 09:04:54 +0000 (UTC) DMARC-Filter: OpenDMARC Filter v1.3.2 mail.kernel.org 49A6C21783 Authentication-Results: mail.kernel.org; dmarc=none (p=none dis=none) header.from=bugs.launchpad.net Authentication-Results: mail.kernel.org; spf=pass smtp.mailfrom=qemu-devel-bounces+qemu-devel=archiver.kernel.org@nongnu.org Received: from localhost ([::1]:52682 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1iFaZ3-00048v-7N for qemu-devel@archiver.kernel.org; Wed, 02 Oct 2019 05:04:53 -0400 Received: from eggs.gnu.org ([2001:470:142:3::10]:42269) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1iFaVD-0002fw-Vt for qemu-devel@nongnu.org; Wed, 02 Oct 2019 05:00:57 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1iFaVA-0000xs-GP for qemu-devel@nongnu.org; Wed, 02 Oct 2019 05:00:54 -0400 Received: from indium.canonical.com ([91.189.90.7]:34316) by eggs.gnu.org with esmtps (TLS1.0:RSA_AES_128_CBC_SHA1:16) (Exim 4.71) (envelope-from ) id 1iFaVA-0000wu-Ac for qemu-devel@nongnu.org; Wed, 02 Oct 2019 05:00:52 -0400 Received: from loganberry.canonical.com ([91.189.90.37]) by indium.canonical.com with esmtp (Exim 4.86_2 #2 (Debian)) id 1iFaV6-0001iF-RV for ; Wed, 02 Oct 2019 09:00:48 +0000 Received: from loganberry.canonical.com (localhost [127.0.0.1]) by loganberry.canonical.com (Postfix) with ESMTP id BCF322E80C3 for ; Wed, 2 Oct 2019 09:00:48 +0000 (UTC) MIME-Version: 1.0 Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: quoted-printable Date: Wed, 02 Oct 2019 08:40:54 -0000 From: ustcweizhou <1618431@bugs.launchpad.net> To: qemu-devel@nongnu.org X-Launchpad-Notification-Type: bug X-Launchpad-Bug: product=qemu; status=Fix Released; importance=Undecided; assignee=None; X-Launchpad-Bug-Information-Type: Public Security X-Launchpad-Bug-Private: no X-Launchpad-Bug-Security-Vulnerability: yes X-Launchpad-Bug-Commenters: dgilbert-h sp2-blub th-huth w-link X-Launchpad-Bug-Reporter: Wolfgang Bumiller (sp2-blub) X-Launchpad-Bug-Modifier: ustcweizhou (ustcweizhou) References: <20160830115830.15931.71622.malonedeb@soybean.canonical.com> Message-Id: <157000565717.20799.13930525229317474578.launchpad@chaenomeles.canonical.com> Subject: [Bug 1618431] Re: windows hangs after live migration with virtio X-Launchpad-Message-Rationale: Subscriber (QEMU) @qemu-devel-ml X-Launchpad-Message-For: qemu-devel-ml Precedence: bulk X-Generated-By: Launchpad (canonical.com); Revision="19066"; Instance="production-secrets-lazr.conf" X-Launchpad-Hash: 3dedf4432ece5a1e3c0fcc889f45005dfdce0689 X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] [fuzzy] X-Received-From: 91.189.90.7 X-BeenThere: qemu-devel@nongnu.org X-Mailman-Version: 2.1.23 List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Reply-To: Bug 1618431 <1618431@bugs.launchpad.net> Errors-To: qemu-devel-bounces+qemu-devel=archiver.kernel.org@nongnu.org Sender: "Qemu-devel" ** Information type changed from Public to Public Security -- = You received this bug notification because you are a member of qemu- devel-ml, which is subscribed to QEMU. https://bugs.launchpad.net/bugs/1618431 Title: windows hangs after live migration with virtio Status in QEMU: Fix Released Bug description: Several of our users reported problems with windows machines hanging after live migrations. The common denominator _seems_ to be virtio devices. I've managed to reproduce this reliably on a windows 10 (+ virtio-win-0.1.118) guest, always within 1 to 5 migrations, with a virtio-scsi hard drive and a virtio-net network device. (When I replace the virtio-net device with an e1000 it takes 10 or more migrations, and without virtio devices I have not (yet) been able to reproduce this problem. I also could not reproduce this with a linux guest. Also spice seems to improve the situation, but doesn't solve it completely). I've tested quite a few tags from qemu-git (v2.2.0 through v2.6.1, and 2.6.1 with the patches mentioned on qemu-stable by Peter Lieven) and the behavior is the same everywhere. The reproducibility seems to be somewhat dependent on the host hardware, which makes investigating this issue that much harder. Symptoms: After the migration the windows graphics stack just hangs. Background processes are still running (eg. after installing an ssh server I could still login and get a command prompt after the hang was triggered... not that I'd know what to do with that on a windows machine...) - commands which need no GUI access work, the rest just hangs there on the command line, too. It's also capable of responding to an NMI sent via the qemu monitor: it then seems to "recover" and manages to show the blue sad-face screen that something happened, reboots successfully and is usable again without restarting the qemu process in between. From there whole the process can be repeated. Here's what our command line usually looks like: /usr/bin/qemu -daemonize \ -enable-kvm \ -chardev socket,id=3Dqmp,path=3D/var/run/qemu-server/101.qmp,server,nowa= it -mon chardev=3Dqmp,mode=3Dcontrol \ -pidfile /var/run/qemu-server/101.pid \ -smbios type=3D1,uuid=3D07fc916e-24c2-4eef-9827-4ab4026501d4 \ -name win10 \ -smp 6,sockets=3D1,cores=3D6,maxcpus=3D6 \ -nodefaults \ -boot menu=3Don,strict=3Don,reboot-timeout=3D1000 \ -vga std \ -vnc unix:/var/run/qemu-server/101.vnc \ -no-hpet \ -cpu kvm64,hv_spinlocks=3D0x1fff,hv_vapic,hv_time,hv_reset,hv_vpindex,hv= _runtime,hv_relaxed,+lahf_lm,+sep,+kvm_pv_unhalt,+kvm_pv_eoi,enforce \ -m 2048 \ -device pci-bridge,id=3Dpci.2,chassis_nr=3D2,bus=3Dpci.0,addr=3D0x1f \ -device pci-bridge,id=3Dpci.1,chassis_nr=3D1,bus=3Dpci.0,addr=3D0x1e \ -device piix3-usb-uhci,id=3Duhci,bus=3Dpci.0,addr=3D0x1.0x2 \ -device usb-tablet,id=3Dtablet,bus=3Duhci.0,port=3D1 \ -device virtio-balloon-pci,id=3Dballoon0,bus=3Dpci.0,addr=3D0x3 \ -iscsi initiator-name=3Diqn.1993-08.org.debian:01:1ba48d46fb8 \ -drive if=3Dnone,id=3Ddrive-ide0,media=3Dcdrom,aio=3Dthreads \ -device ide-cd,bus=3Dide.0,unit=3D0,drive=3Ddrive-ide0,id=3Dide0,bootind= ex=3D200 \ -device virtio-scsi-pci,id=3Dscsihw0,bus=3Dpci.0,addr=3D0x5 \ -drive file=3D/mnt/pve/data1/images/101/vm-101-disk-1.qcow2,if=3Dnone,id= =3Ddrive-scsi0,cache=3Dwriteback,discard=3Don,format=3Dqcow2,aio=3Dthreads,= detect-zeroes=3Dunmap \ -device scsi-hd,bus=3Dscsihw0.0,channel=3D0,scsi-id=3D0,lun=3D0,drive=3D= drive-scsi0,id=3Dscsi0,bootindex=3D100 \ -netdev type=3Dtap,id=3Dnet0,ifname=3Dtap101i0,script=3D/var/lib/qemu-se= rver/pve-bridge,downscript=3D/var/lib/qemu-server/pve-bridgedown,vhost=3Don= \ -device virtio-net-pci,mac=3DF2:2B:20:37:E6:D7,netdev=3Dnet0,bus=3Dpci.0= ,addr=3D0x12,id=3Dnet0,bootindex=3D300 \ -rtc driftfix=3Dslew,base=3Dlocaltime \ -global kvm-pit.lost_tick_policy=3Ddiscard To manage notifications about this bug go to: https://bugs.launchpad.net/qemu/+bug/1618431/+subscriptions