All of lore.kernel.org
 help / color / mirror / Atom feed
* KVM Guest Lock up (100%) again!
       [not found] <2122448695.1031741.1365086071426.JavaMail.root@innovot.com>
@ 2013-04-04 14:36 ` Phil Daws
  2013-04-08 20:02   ` Phil Daws
  0 siblings, 1 reply; 6+ messages in thread
From: Phil Daws @ 2013-04-04 14:36 UTC (permalink / raw)
  To: kvm

One of my KVM guests locked up again at 100% CPU!  Any thoughts on how I can diagnose it ? We would love to put into production but am very concerned about the current stability. I have tried to re-direct the console, through screen, to see whether there is a spin lock that is causing the problem; though all I got in the log file was a login prompt. What is the correct way of redirecting the console in KVM on a CentOS 6.4 system please ? Thanks.

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: KVM Guest Lock up (100%) again!
  2013-04-04 14:36 ` KVM Guest Lock up (100%) again! Phil Daws
@ 2013-04-08 20:02   ` Phil Daws
  2013-04-10 14:10     ` Gleb Natapov
  0 siblings, 1 reply; 6+ messages in thread
From: Phil Daws @ 2013-04-08 20:02 UTC (permalink / raw)
  To: kvm

Hello all,

Another lock up again this evening :( am wondering whether should consider upgrading the kernel to 3.7.10 and the latest version of KVM. Thoughts ?

Thanks.

----- Original Message -----
To: kvm@vger.kernel.org
Sent: Thursday, 4 April, 2013 3:36:11 PM
Subject: KVM Guest Lock up (100%) again!

One of my KVM guests locked up again at 100% CPU!  Any thoughts on how I can diagnose it ? We would love to put into production but am very concerned about the current stability. I have tried to re-direct the console, through screen, to see whether there is a spin lock that is causing the problem; though all I got in the log file was a login prompt. What is the correct way of redirecting the console in KVM on a CentOS 6.4 system please ? Thanks.
--
To unsubscribe from this list: send the line "unsubscribe kvm" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: KVM Guest Lock up (100%) again!
  2013-04-08 20:02   ` Phil Daws
@ 2013-04-10 14:10     ` Gleb Natapov
  2013-04-12 14:10       ` Phil Daws
  0 siblings, 1 reply; 6+ messages in thread
From: Gleb Natapov @ 2013-04-10 14:10 UTC (permalink / raw)
  To: Phil Daws; +Cc: kvm

On Mon, Apr 08, 2013 at 09:02:04PM +0100, Phil Daws wrote:
> Hello all,
> 
> Another lock up again this evening :( am wondering whether should consider upgrading the kernel to 3.7.10 and the latest version of KVM. Thoughts ?
> 
Please try with latest kvm/qemu. This is not the list to report vendor specific problems.

> Thanks.
> 
> ----- Original Message -----
> To: kvm@vger.kernel.org
> Sent: Thursday, 4 April, 2013 3:36:11 PM
> Subject: KVM Guest Lock up (100%) again!
> 
> One of my KVM guests locked up again at 100% CPU!  Any thoughts on how I can diagnose it ? We would love to put into production but am very concerned about the current stability. I have tried to re-direct the console, through screen, to see whether there is a spin lock that is causing the problem; though all I got in the log file was a login prompt. What is the correct way of redirecting the console in KVM on a CentOS 6.4 system please ? Thanks.
> --
> To unsubscribe from this list: send the line "unsubscribe kvm" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
> --
> To unsubscribe from this list: send the line "unsubscribe kvm" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html

--
			Gleb.

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: KVM Guest Lock up (100%) again!
  2013-04-10 14:10     ` Gleb Natapov
@ 2013-04-12 14:10       ` Phil Daws
  2013-04-12 15:13         ` Gleb Natapov
  0 siblings, 1 reply; 6+ messages in thread
From: Phil Daws @ 2013-04-12 14:10 UTC (permalink / raw)
  To: Gleb Natapov; +Cc: kvm

Well this is still happening ... I have tried to isolate what could be causing but not much luck yet.  Thought the VMs may have been IO bound but that not the case and even tried upping the vCPU allocation from one to two as plenty of head room.  When it locks up I see this on a strace:

[pid  1343] read(14, 0x7fff82aeb360, 4096) = -1 EAGAIN (Resource temporarily unavailable)
[pid  1343] read(7, "\0", 512)          = 1
[pid  1343] read(7, 0x7fff82aec160, 512) = -1 EAGAIN (Resource temporarily unavailable)
[pid  1343] select(26, [7 10 13 14 16 17 22 25], [], [], {1, 0}) = 1 (in [16], left {0, 999981})
[pid  1343] read(16, "\16\0\0\0\0\0\0\0\376\377\377\377\0\0\0\0\0\0\0\0\0\0\0\0\1\0\0\0\0\0\0\0"..., 128) = 128
[pid  1343] rt_sigaction(SIGALRM, NULL, {0x7f210b2c0510, ~[KILL STOP RTMIN RT_1], SA_RESTORER, 0x7f210ac22500}, 8) = 0
[pid  1343] write(8, "\0", 1)           = 1
[pid  1343] write(15, "\1\0\0\0\0\0\0\0", 8) = 8
[pid  1343] read(16, 0x7fff82aec2d0, 128) = -1 EAGAIN (Resource temporarily unavailable)
[pid  1343] timer_gettime(0x1, {it_interval={0, 0}, it_value={0, 0}}) = 0
[pid  1343] timer_settime(0x1, 0, {it_interval={0, 0}, it_value={0, 656000000}}, NULL) = 0
[pid  1343] select(26, [7 10 13 14 16 17 22 25], [], [], {1, 0}) = 2 (in [7 14], left {0, 999998})
[pid  1343] read(14, "\1\0\0\0\0\0\0\0", 4096) = 8
[pid  1343] read(14, 0x7fff82aeb360, 4096) = -1 EAGAIN (Resource temporarily unavailable)
[pid  1343] read(7, "\0", 512)          = 1
[pid  1343] read(7, 0x7fff82aec160, 512) = -1 EAGAIN (Resource temporarily unavailable)

Does that shed any light ? Trying to find a how to for upgrading to the latest KVM/QEMU.

Thanks.

----- Original Message -----
From: "Gleb Natapov" <gleb@redhat.com>
To: "Phil Daws" <uxbod@splatnix.net>
Cc: kvm@vger.kernel.org
Sent: Wednesday, 10 April, 2013 3:10:27 PM
Subject: Re: KVM Guest Lock up (100%) again!

On Mon, Apr 08, 2013 at 09:02:04PM +0100, Phil Daws wrote:
> Hello all,
> 
> Another lock up again this evening :( am wondering whether should consider upgrading the kernel to 3.7.10 and the latest version of KVM. Thoughts ?
> 
Please try with latest kvm/qemu. This is not the list to report vendor specific problems.

> Thanks.
> 
> ----- Original Message -----
> To: kvm@vger.kernel.org
> Sent: Thursday, 4 April, 2013 3:36:11 PM
> Subject: KVM Guest Lock up (100%) again!
> 
> One of my KVM guests locked up again at 100% CPU!  Any thoughts on how I can diagnose it ? We would love to put into production but am very concerned about the current stability. I have tried to re-direct the console, through screen, to see whether there is a spin lock that is causing the problem; though all I got in the log file was a login prompt. What is the correct way of redirecting the console in KVM on a CentOS 6.4 system please ? Thanks.
> --
> To unsubscribe from this list: send the line "unsubscribe kvm" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
> --
> To unsubscribe from this list: send the line "unsubscribe kvm" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html

--
			Gleb.

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: KVM Guest Lock up (100%) again!
  2013-04-12 14:10       ` Phil Daws
@ 2013-04-12 15:13         ` Gleb Natapov
  2013-04-12 16:31           ` Phil Daws
  0 siblings, 1 reply; 6+ messages in thread
From: Gleb Natapov @ 2013-04-12 15:13 UTC (permalink / raw)
  To: Phil Daws; +Cc: kvm

On Fri, Apr 12, 2013 at 03:10:43PM +0100, Phil Daws wrote:
> Well this is still happening ... I have tried to isolate what could be causing but not much luck yet.  Thought the VMs may have been IO bound but that not the case and even tried upping the vCPU allocation from one to two as plenty of head room.  When it locks up I see this on a strace:
> 
> [pid  1343] read(14, 0x7fff82aeb360, 4096) = -1 EAGAIN (Resource temporarily unavailable)
> [pid  1343] read(7, "\0", 512)          = 1
> [pid  1343] read(7, 0x7fff82aec160, 512) = -1 EAGAIN (Resource temporarily unavailable)
> [pid  1343] select(26, [7 10 13 14 16 17 22 25], [], [], {1, 0}) = 1 (in [16], left {0, 999981})
> [pid  1343] read(16, "\16\0\0\0\0\0\0\0\376\377\377\377\0\0\0\0\0\0\0\0\0\0\0\0\1\0\0\0\0\0\0\0"..., 128) = 128
> [pid  1343] rt_sigaction(SIGALRM, NULL, {0x7f210b2c0510, ~[KILL STOP RTMIN RT_1], SA_RESTORER, 0x7f210ac22500}, 8) = 0
> [pid  1343] write(8, "\0", 1)           = 1
> [pid  1343] write(15, "\1\0\0\0\0\0\0\0", 8) = 8
> [pid  1343] read(16, 0x7fff82aec2d0, 128) = -1 EAGAIN (Resource temporarily unavailable)
> [pid  1343] timer_gettime(0x1, {it_interval={0, 0}, it_value={0, 0}}) = 0
> [pid  1343] timer_settime(0x1, 0, {it_interval={0, 0}, it_value={0, 656000000}}, NULL) = 0
> [pid  1343] select(26, [7 10 13 14 16 17 22 25], [], [], {1, 0}) = 2 (in [7 14], left {0, 999998})
> [pid  1343] read(14, "\1\0\0\0\0\0\0\0", 4096) = 8
> [pid  1343] read(14, 0x7fff82aeb360, 4096) = -1 EAGAIN (Resource temporarily unavailable)
> [pid  1343] read(7, "\0", 512)          = 1
> [pid  1343] read(7, 0x7fff82aec160, 512) = -1 EAGAIN (Resource temporarily unavailable)
> 
> Does that shed any light ? Trying to find a how to for upgrading to the latest KVM/QEMU.
> 
Is the lockup with upstream now? strace is not very helpful to
diagnose kvm problems. Try to run ftrace: http://www.linux-kvm.org/page/Tracing

--
			Gleb.

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: KVM Guest Lock up (100%) again!
  2013-04-12 15:13         ` Gleb Natapov
@ 2013-04-12 16:31           ` Phil Daws
  0 siblings, 0 replies; 6+ messages in thread
From: Phil Daws @ 2013-04-12 16:31 UTC (permalink / raw)
  To: Gleb Natapov; +Cc: kvm

Was running two guests on k3.7.10 but have now switched one to stock 2.6.32; and neither have crashed yet.  Will leave running as and if it stays stable will switch out the other kernel to stock as-well.  Am wondering if have hit a kernel buglet. Thank you for the ftrace info. Have a great weekend.

----- Original Message -----
From: "Gleb Natapov" <gleb@redhat.com>
To: "Phil Daws" <uxbod@splatnix.net>
Cc: kvm@vger.kernel.org
Sent: Friday, 12 April, 2013 4:13:16 PM
Subject: Re: KVM Guest Lock up (100%) again!

On Fri, Apr 12, 2013 at 03:10:43PM +0100, Phil Daws wrote:
> Well this is still happening ... I have tried to isolate what could be causing but not much luck yet.  Thought the VMs may have been IO bound but that not the case and even tried upping the vCPU allocation from one to two as plenty of head room.  When it locks up I see this on a strace:
> 
> [pid  1343] read(14, 0x7fff82aeb360, 4096) = -1 EAGAIN (Resource temporarily unavailable)
> [pid  1343] read(7, "\0", 512)          = 1
> [pid  1343] read(7, 0x7fff82aec160, 512) = -1 EAGAIN (Resource temporarily unavailable)
> [pid  1343] select(26, [7 10 13 14 16 17 22 25], [], [], {1, 0}) = 1 (in [16], left {0, 999981})
> [pid  1343] read(16, "\16\0\0\0\0\0\0\0\376\377\377\377\0\0\0\0\0\0\0\0\0\0\0\0\1\0\0\0\0\0\0\0"..., 128) = 128
> [pid  1343] rt_sigaction(SIGALRM, NULL, {0x7f210b2c0510, ~[KILL STOP RTMIN RT_1], SA_RESTORER, 0x7f210ac22500}, 8) = 0
> [pid  1343] write(8, "\0", 1)           = 1
> [pid  1343] write(15, "\1\0\0\0\0\0\0\0", 8) = 8
> [pid  1343] read(16, 0x7fff82aec2d0, 128) = -1 EAGAIN (Resource temporarily unavailable)
> [pid  1343] timer_gettime(0x1, {it_interval={0, 0}, it_value={0, 0}}) = 0
> [pid  1343] timer_settime(0x1, 0, {it_interval={0, 0}, it_value={0, 656000000}}, NULL) = 0
> [pid  1343] select(26, [7 10 13 14 16 17 22 25], [], [], {1, 0}) = 2 (in [7 14], left {0, 999998})
> [pid  1343] read(14, "\1\0\0\0\0\0\0\0", 4096) = 8
> [pid  1343] read(14, 0x7fff82aeb360, 4096) = -1 EAGAIN (Resource temporarily unavailable)
> [pid  1343] read(7, "\0", 512)          = 1
> [pid  1343] read(7, 0x7fff82aec160, 512) = -1 EAGAIN (Resource temporarily unavailable)
> 
> Does that shed any light ? Trying to find a how to for upgrading to the latest KVM/QEMU.
> 
Is the lockup with upstream now? strace is not very helpful to
diagnose kvm problems. Try to run ftrace: http://www.linux-kvm.org/page/Tracing

--
			Gleb.

^ permalink raw reply	[flat|nested] 6+ messages in thread

end of thread, other threads:[~2013-04-12 16:33 UTC | newest]

Thread overview: 6+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
     [not found] <2122448695.1031741.1365086071426.JavaMail.root@innovot.com>
2013-04-04 14:36 ` KVM Guest Lock up (100%) again! Phil Daws
2013-04-08 20:02   ` Phil Daws
2013-04-10 14:10     ` Gleb Natapov
2013-04-12 14:10       ` Phil Daws
2013-04-12 15:13         ` Gleb Natapov
2013-04-12 16:31           ` Phil Daws

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.