From mboxrd@z Thu Jan 1 00:00:00 1970 From: Nate Custer Subject: kvm deadlock Date: Mon, 5 Dec 2011 16:48:16 -0600 Message-ID: <54FC5923-2123-4BDD-A506-EA57DCE0C1F6@cpanel.net> Mime-Version: 1.0 (Apple Message framework v1084) Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 8BIT To: kvm@vger.kernel.org Return-path: Received: from mx1.cpanel.net ([208.74.121.68]:41720 "EHLO mx1.cpanel.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S932702Ab1LEWsS convert rfc822-to-8bit (ORCPT ); Mon, 5 Dec 2011 17:48:18 -0500 Received: from ng1.cptxoffice.net ([208.74.121.102]:9957 helo=[10.4.130.1]) by mx1.cpanel.net with esmtpsa (TLSv1:AES128-SHA:128) (Exim 4.77) (envelope-from ) id 1RXhKa-0007sA-Kl for kvm@vger.kernel.org; Mon, 05 Dec 2011 16:48:16 -0600 Sender: kvm-owner@vger.kernel.org List-ID: Hello, I am struggling with repeatable full hardware locks when running 8-12 KVM vms. At some point before the hard lock I get a inconsistent lock state warning. An example of this can be found here: http://pastebin.com/8wKhgE2C After that the server continues to run for a while and then starts its death spiral. When it reaches that point it fails to log anything further to the disk, but by attaching a console I have been able to get a stack trace documenting the final implosion: http://pastebin.com/PbcN76bd All of the cores end up hung and the server stops responding to all input, including SysRq commands. I have seen this behavior on two machines (dual E5606 running Fedora 16) both passed cpuburnin testing and memtest86 scans without error. I have reproduced the crash and stack traces from a Fedora debugging kernel - 3.1.2-1 and with a vanilla 3.1.4 kernel. Nate Custer QA Analyst cPanel Inc