* Hypervisor crash(!) on xl cpupool-numa-split
@ 2011-01-27 23:18 Andre Przywara
  2011-01-28  6:47 ` Juergen Gross
  0 siblings, 1 reply; 53+ messages in thread
From: Andre Przywara @ 2011-01-27 23:18 UTC (permalink / raw)
  To: Keir Fraser, Ian Jackson, Juergen Gross; +Cc: xen-devel

Hi,

when I boot my machine without restricting Dom0 (dom0_mem= 
dom0_max_vcpus=) I get a _hypervisor_ crash when I run
# xl cpupool-numa-split
If Dom0's resources are limited on the Xen cmdline, everything works fine.
The crash dump points to a scheduling problem with weights, so I assume 
the NUMA distribution algorithm somehow fools the hypervisor completely.

I will investigate this further tomorrow, but maybe someone has a good 
idea.

Regards,
Andre.

root@dosorca:/data/images# xl cpupool-numa-split
(XEN) Xen BUG at sched_credit.c:990
(XEN) ----[ Xen-4.1.0-rc2-pre  x86_64  debug=y  Not tainted ]----
(XEN) CPU:    0
(XEN) RIP:    e008:[<ffff82c4801180f8>] csched_acct+0x11f/0x419
(XEN) RFLAGS: 0000000000010006   CONTEXT: hypervisor
(XEN) rax: 0000000000000010   rbx: 0000000000000f00   rcx: 0000000000000100
(XEN) rdx: 0000000000001000   rsi: ffff830437ffa600   rdi: 0000000000000010
(XEN) rbp: ffff82c480297e10   rsp: ffff82c480297d80   r8:  0000000000000100
(XEN) r9:  0000000000000006   r10: ffff82c4802d4100   r11: 000000afc7df0edf
(XEN) r12: ffff830437ffa5e0   r13: ffff82c480117fd9   r14: ffff830437f9f2e8
(XEN) r15: ffff830434321ec0   cr0: 000000008005003b   cr4: 00000000000006f0
(XEN) cr3: 000000080df4e000   cr2: ffff88179af79618
(XEN) ds: 002b   es: 002b   fs: 0000   gs: 0000   ss: e010   cs: e008
(XEN) Xen stack trace from rsp=ffff82c480297d80:
(XEN)    0000000000000282 fffffed4802d3f80 0000000000000eff ffff830437ffa5e0
(XEN)    ffff830437ffa5e8 ffff830437ffa870 ffff830437ffa5e0 0000000000000282
(XEN)    ffff830437ffa5e8 00002a3037ffa870 00000f0000000f00 0000000000000000
(XEN)    ffff82c400000000 ffff82c4802d3f80 ffff830437ffa5e0 ffff82c480117fd9
(XEN)    ffff830437f9f2e8 ffff830437f9f2e0 ffff82c480297e40 ffff82c480125f34
(XEN)    0000000000000002 ffff830437ffa600 ffff82c4802d3f80 000000afb6f8667f
(XEN)    ffff82c480297e90 ffff82c480126259 ffff82c48024ae20 ffff82c4802d3f80
(XEN)    ffff830437f9f2e0 0000000000000000 0000000000000000 ffff82c4802b0880
(XEN)    ffff82c480297f18 ffffffffffffffff ffff82c480297ed0 ffff82c480123327
(XEN)    ffff82c4802d4a00 ffff82c480297f18 ffff82c48024ae20 ffff82c480297f18
(XEN)    000000afb6abd652 ffff82c4802d3ec0 ffff82c480297ee0 ffff82c4801233a2
(XEN)    ffff82c480297f10 ffff82c4801563f5 0000000000000000 ffff8300c7cd6000
(XEN)    0000000000000000 ffff8300c7ad4000 ffff82c480297d48 0000000000000000
(XEN)    0000000000000000 0000000000000000 ffffffff81a69060 ffff8817a8503f10
(XEN)    ffff8817a8503fd8 0000000000000246 ffff8817a8503e80 ffff880000000001
(XEN)    0000000000000000 0000000000000000 ffffffff810093aa 000000aafab2f86e
(XEN)    00000000deadbeef 00000000deadbeef 0000010000000000 ffffffff810093aa
(XEN)    000000000000e033 0000000000000246 ffff8817a8503ef8 000000000000e02b
(XEN)    0000000000000000 0000000000000000 0000000000000000 0000000000000000
(XEN)    0000000000000000 ffff8300c7cd6000 0000000000000000 0000000000000000
(XEN) Xen call trace:
(XEN)    [<ffff82c4801180f8>] csched_acct+0x11f/0x419
(XEN)    [<ffff82c480125f34>] execute_timer+0x4e/0x6c
(XEN)    [<ffff82c480126259>] timer_softirq_action+0xf2/0x245
(XEN)    [<ffff82c480123327>] __do_softirq+0x88/0x99
(XEN)    [<ffff82c4801233a2>] do_softirq+0x6a/0x7a
(XEN)    [<ffff82c4801563f5>] idle_loop+0x6a/0x6f
(XEN)
(XEN)
(XEN) ****************************************
(XEN) Panic on CPU 0:
(XEN) Xen BUG at sched_credit.c:990
(XEN) ****************************************
(XEN)
(XEN) Reboot in five seconds...


-- 
Andre Przywara
AMD-OSRC (Dresden)
Tel: x29712


* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-01-27 23:18 Hypervisor crash(!) on xl cpupool-numa-split Andre Przywara
@ 2011-01-28  6:47 ` Juergen Gross
  2011-01-28 11:07   ` Andre Przywara
  2011-01-28 11:13   ` George Dunlap
  0 siblings, 2 replies; 53+ messages in thread
From: Juergen Gross @ 2011-01-28  6:47 UTC (permalink / raw)
  To: Andre Przywara; +Cc: xen-devel, Ian Jackson, Keir Fraser

On 01/28/11 00:18, Andre Przywara wrote:
> Hi,
>
> when I boot my machine without restricting Dom0 (dom0_mem=
> dom0_max_vcpus=) I get an _hypervisor_ crash when I run
> # xl cpupool-numa-split
> If Dom0's resources are limited on the Xen cmdline, everything works fine.
> The crashdump points to a scheduling problem with weights, so I assume
> the NUMA distribution algorithm some fools the hypervisor completely.
>
> I will investigate this further tomorrow, but maybe someone has some
> good idea.

I've seen this once with an older cpupool version on a 24 processor machine.
It was NOT related to NUMA, but did occur only on reboot after a Dom0 panic.
The machine had an init script creating a cpupool and populating it with
cpus. The machine was then stuck in a panic loop due to the BUG in sched_acct
until it was reset manually. After the reset the problem was gone.

As I was never able to reproduce the problem later (the same software is
running on dozens of machines!), I assumed there was a problem related to
the first Dom0 panic, maybe some corrupted BIOS tables.

Can the crash be reproduced easily?


Juergen

>
> Regards,
> Andre.
>
> root@dosorca:/data/images# xl cpupool-numa-split
> (XEN) Xen BUG at sched_credit.c:990
> (XEN) ----[ Xen-4.1.0-rc2-pre x86_64 debug=y Not tainted ]----
> (XEN) CPU: 0
> (XEN) RIP: e008:[<ffff82c4801180f8>] csched_acct+0x11f/0x419
> (XEN) RFLAGS: 0000000000010006 CONTEXT: hypervisor
> (XEN) rax: 0000000000000010 rbx: 0000000000000f00 rcx: 0000000000000100
> (XEN) rdx: 0000000000001000 rsi: ffff830437ffa600 rdi: 0000000000000010
> (XEN) rbp: ffff82c480297e10 rsp: ffff82c480297d80 r8: 0000000000000100
> (XEN) r9: 0000000000000006 r10: ffff82c4802d4100 r11: 000000afc7df0edf
> (XEN) r12: ffff830437ffa5e0 r13: ffff82c480117fd9 r14: ffff830437f9f2e8
> (XEN) r15: ffff830434321ec0 cr0: 000000008005003b cr4: 00000000000006f0
> (XEN) cr3: 000000080df4e000 cr2: ffff88179af79618
> (XEN) ds: 002b es: 002b fs: 0000 gs: 0000 ss: e010 cs: e008
> (XEN) Xen stack trace from rsp=ffff82c480297d80:
> (XEN) 0000000000000282 fffffed4802d3f80 0000000000000eff ffff830437ffa5e0
> (XEN) ffff830437ffa5e8 ffff830437ffa870 ffff830437ffa5e0 0000000000000282
> (XEN) ffff830437ffa5e8 00002a3037ffa870 00000f0000000f00 0000000000000000
> (XEN) ffff82c400000000 ffff82c4802d3f80 ffff830437ffa5e0 ffff82c480117fd9
> (XEN) ffff830437f9f2e8 ffff830437f9f2e0 ffff82c480297e40 ffff82c480125f34
> (XEN) 0000000000000002 ffff830437ffa600 ffff82c4802d3f80 000000afb6f8667f
> (XEN) ffff82c480297e90 ffff82c480126259 ffff82c48024ae20 ffff82c4802d3f80
> (XEN) ffff830437f9f2e0 0000000000000000 0000000000000000 ffff82c4802b0880
> (XEN) ffff82c480297f18 ffffffffffffffff ffff82c480297ed0 ffff82c480123327
> (XEN) ffff82c4802d4a00 ffff82c480297f18 ffff82c48024ae20 ffff82c480297f18
> (XEN) 000000afb6abd652 ffff82c4802d3ec0 ffff82c480297ee0 ffff82c4801233a2
> (XEN) ffff82c480297f10 ffff82c4801563f5 0000000000000000 ffff8300c7cd6000
> (XEN) 0000000000000000 ffff8300c7ad4000 ffff82c480297d48 0000000000000000
> (XEN) 0000000000000000 0000000000000000 ffffffff81a69060 ffff8817a8503f10
> (XEN) ffff8817a8503fd8 0000000000000246 ffff8817a8503e80 ffff880000000001
> (XEN) 0000000000000000 0000000000000000 ffffffff810093aa 000000aafab2f86e
> (XEN) 00000000deadbeef 00000000deadbeef 0000010000000000 ffffffff810093aa
> (XEN) 000000000000e033 0000000000000246 ffff8817a8503ef8 000000000000e02b
> (XEN) 0000000000000000 0000000000000000 0000000000000000 0000000000000000
> (XEN) 0000000000000000 ffff8300c7cd6000 0000000000000000 0000000000000000
> (XEN) Xen call trace:
> (XEN) [<ffff82c4801180f8>] csched_acct+0x11f/0x419
> (XEN) [<ffff82c480125f34>] execute_timer+0x4e/0x6c
> (XEN) [<ffff82c480126259>] timer_softirq_action+0xf2/0x245
> (XEN) [<ffff82c480123327>] __do_softirq+0x88/0x99
> (XEN) [<ffff82c4801233a2>] do_softirq+0x6a/0x7a
> (XEN) [<ffff82c4801563f5>] idle_loop+0x6a/0x6f
> (XEN)
> (XEN)
> (XEN) ****************************************
> (XEN) Panic on CPU 0:
> (XEN) Xen BUG at sched_credit.c:990
> (XEN) ****************************************
> (XEN)
> (XEN) Reboot in five seconds...
>
>


-- 
Juergen Gross                 Principal Developer Operating Systems
TSP ES&S SWE OS6                       Telephone: +49 (0) 89 3222 2967
Fujitsu Technology Solutions              e-mail: juergen.gross@ts.fujitsu.com
Domagkstr. 28                           Internet: ts.fujitsu.com
D-80807 Muenchen                 Company details: ts.fujitsu.com/imprint.html


* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-01-28  6:47 ` Juergen Gross
@ 2011-01-28 11:07   ` Andre Przywara
  2011-01-28 11:44     ` Juergen Gross
  2011-01-28 11:13   ` George Dunlap
  1 sibling, 1 reply; 53+ messages in thread
From: Andre Przywara @ 2011-01-28 11:07 UTC (permalink / raw)
  To: Juergen Gross; +Cc: xen-devel, Ian Jackson, Keir Fraser

Juergen Gross wrote:
> On 01/28/11 00:18, Andre Przywara wrote:
>> Hi,
>>
>> when I boot my machine without restricting Dom0 (dom0_mem=
>> dom0_max_vcpus=) I get an _hypervisor_ crash when I run
>> # xl cpupool-numa-split
>> If Dom0's resources are limited on the Xen cmdline, everything works fine.
>> The crashdump points to a scheduling problem with weights, so I assume
>> the NUMA distribution algorithm some fools the hypervisor completely.
>>
>> I will investigate this further tomorrow, but maybe someone has some
>> good idea.
> 
> I've seen this once with an older cpupool version on a 24 processor machine.
> It was NOT related to NUMA, but did occur only on reboot after a Dom0 panic.
> The machine had an init script creating a cpupool and populating it with
> cpus. The machine was in a panic loop due to the BUG in sched_acct then until
> it was resetted manually. After the reset the problem was gone.
> 
> As I was never able to reproduce the problem later (the same software is
> running on dozens of machines!), I assumed there was a problem related to
> the first Dom0 panic, may be some destroyed BIOS tables.
> 
> Can the crash be reproduced easily?
Yes.
If I don't specify dom0_max_vcpus= and dom0_mem= on the Xen cmdline, I 
can reliably trigger the crash with xl cpupool-numa-split.
Omitting only dom0_max_vcpus= does not suffice to trigger it.

Will continue after lunch-break ;-)

Regards,
Andre.

> 
> 
> Juergen
> 
>> Regards,
>> Andre.
>>
>> root@dosorca:/data/images# xl cpupool-numa-split
>> (XEN) Xen BUG at sched_credit.c:990
>> (XEN) ----[ Xen-4.1.0-rc2-pre x86_64 debug=y Not tainted ]----
>> (XEN) CPU: 0
>> (XEN) RIP: e008:[<ffff82c4801180f8>] csched_acct+0x11f/0x419
>> (XEN) RFLAGS: 0000000000010006 CONTEXT: hypervisor
>> (XEN) rax: 0000000000000010 rbx: 0000000000000f00 rcx: 0000000000000100
>> (XEN) rdx: 0000000000001000 rsi: ffff830437ffa600 rdi: 0000000000000010
>> (XEN) rbp: ffff82c480297e10 rsp: ffff82c480297d80 r8: 0000000000000100
>> (XEN) r9: 0000000000000006 r10: ffff82c4802d4100 r11: 000000afc7df0edf
>> (XEN) r12: ffff830437ffa5e0 r13: ffff82c480117fd9 r14: ffff830437f9f2e8
>> (XEN) r15: ffff830434321ec0 cr0: 000000008005003b cr4: 00000000000006f0
>> (XEN) cr3: 000000080df4e000 cr2: ffff88179af79618
>> (XEN) ds: 002b es: 002b fs: 0000 gs: 0000 ss: e010 cs: e008
>> (XEN) Xen stack trace from rsp=ffff82c480297d80:
>> (XEN) 0000000000000282 fffffed4802d3f80 0000000000000eff ffff830437ffa5e0
>> (XEN) ffff830437ffa5e8 ffff830437ffa870 ffff830437ffa5e0 0000000000000282
>> (XEN) ffff830437ffa5e8 00002a3037ffa870 00000f0000000f00 0000000000000000
>> (XEN) ffff82c400000000 ffff82c4802d3f80 ffff830437ffa5e0 ffff82c480117fd9
>> (XEN) ffff830437f9f2e8 ffff830437f9f2e0 ffff82c480297e40 ffff82c480125f34
>> (XEN) 0000000000000002 ffff830437ffa600 ffff82c4802d3f80 000000afb6f8667f
>> (XEN) ffff82c480297e90 ffff82c480126259 ffff82c48024ae20 ffff82c4802d3f80
>> (XEN) ffff830437f9f2e0 0000000000000000 0000000000000000 ffff82c4802b0880
>> (XEN) ffff82c480297f18 ffffffffffffffff ffff82c480297ed0 ffff82c480123327
>> (XEN) ffff82c4802d4a00 ffff82c480297f18 ffff82c48024ae20 ffff82c480297f18
>> (XEN) 000000afb6abd652 ffff82c4802d3ec0 ffff82c480297ee0 ffff82c4801233a2
>> (XEN) ffff82c480297f10 ffff82c4801563f5 0000000000000000 ffff8300c7cd6000
>> (XEN) 0000000000000000 ffff8300c7ad4000 ffff82c480297d48 0000000000000000
>> (XEN) 0000000000000000 0000000000000000 ffffffff81a69060 ffff8817a8503f10
>> (XEN) ffff8817a8503fd8 0000000000000246 ffff8817a8503e80 ffff880000000001
>> (XEN) 0000000000000000 0000000000000000 ffffffff810093aa 000000aafab2f86e
>> (XEN) 00000000deadbeef 00000000deadbeef 0000010000000000 ffffffff810093aa
>> (XEN) 000000000000e033 0000000000000246 ffff8817a8503ef8 000000000000e02b
>> (XEN) 0000000000000000 0000000000000000 0000000000000000 0000000000000000
>> (XEN) 0000000000000000 ffff8300c7cd6000 0000000000000000 0000000000000000
>> (XEN) Xen call trace:
>> (XEN) [<ffff82c4801180f8>] csched_acct+0x11f/0x419
>> (XEN) [<ffff82c480125f34>] execute_timer+0x4e/0x6c
>> (XEN) [<ffff82c480126259>] timer_softirq_action+0xf2/0x245
>> (XEN) [<ffff82c480123327>] __do_softirq+0x88/0x99
>> (XEN) [<ffff82c4801233a2>] do_softirq+0x6a/0x7a
>> (XEN) [<ffff82c4801563f5>] idle_loop+0x6a/0x6f
>> (XEN)
>> (XEN)
>> (XEN) ****************************************
>> (XEN) Panic on CPU 0:
>> (XEN) Xen BUG at sched_credit.c:990
>> (XEN) ****************************************
>> (XEN)
>> (XEN) Reboot in five seconds...
>>
>>
> 
> 


-- 
Andre Przywara
AMD-OSRC (Dresden)
Tel: x29712


* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-01-28  6:47 ` Juergen Gross
  2011-01-28 11:07   ` Andre Przywara
@ 2011-01-28 11:13   ` George Dunlap
  2011-01-28 13:05     ` Andre Przywara
  1 sibling, 1 reply; 53+ messages in thread
From: George Dunlap @ 2011-01-28 11:13 UTC (permalink / raw)
  To: Juergen Gross; +Cc: Andre Przywara, xen-devel, Ian Jackson, Keir Fraser

Hmm, strange... looks like it has something to do with the code which
keeps track of which vcpus are earning credits.  You say this is done
immediately after boot, with no VMs running other than dom0?

What are the dom0_max_vcpus and dom0_mem settings required to make it work?

 -George

On Fri, Jan 28, 2011 at 6:47 AM, Juergen Gross
<juergen.gross@ts.fujitsu.com> wrote:
> On 01/28/11 00:18, Andre Przywara wrote:
>>
>> Hi,
>>
>> when I boot my machine without restricting Dom0 (dom0_mem=
>> dom0_max_vcpus=) I get an _hypervisor_ crash when I run
>> # xl cpupool-numa-split
>> If Dom0's resources are limited on the Xen cmdline, everything works fine.
>> The crashdump points to a scheduling problem with weights, so I assume
>> the NUMA distribution algorithm some fools the hypervisor completely.
>>
>> I will investigate this further tomorrow, but maybe someone has some
>> good idea.
>
> I've seen this once with an older cpupool version on a 24 processor machine.
> It was NOT related to NUMA, but did occur only on reboot after a Dom0 panic.
> The machine had an init script creating a cpupool and populating it with
> cpus. The machine was in a panic loop due to the BUG in sched_acct then
> until
> it was resetted manually. After the reset the problem was gone.
>
> As I was never able to reproduce the problem later (the same software is
> running on dozens of machines!), I assumed there was a problem related to
> the first Dom0 panic, may be some destroyed BIOS tables.
>
> Can the crash be reproduced easily?
>
>
> Juergen
>
>>
>> Regards,
>> Andre.
>>
>> root@dosorca:/data/images# xl cpupool-numa-split
>> (XEN) Xen BUG at sched_credit.c:990
>> (XEN) ----[ Xen-4.1.0-rc2-pre x86_64 debug=y Not tainted ]----
>> (XEN) CPU: 0
>> (XEN) RIP: e008:[<ffff82c4801180f8>] csched_acct+0x11f/0x419
>> (XEN) RFLAGS: 0000000000010006 CONTEXT: hypervisor
>> (XEN) rax: 0000000000000010 rbx: 0000000000000f00 rcx: 0000000000000100
>> (XEN) rdx: 0000000000001000 rsi: ffff830437ffa600 rdi: 0000000000000010
>> (XEN) rbp: ffff82c480297e10 rsp: ffff82c480297d80 r8: 0000000000000100
>> (XEN) r9: 0000000000000006 r10: ffff82c4802d4100 r11: 000000afc7df0edf
>> (XEN) r12: ffff830437ffa5e0 r13: ffff82c480117fd9 r14: ffff830437f9f2e8
>> (XEN) r15: ffff830434321ec0 cr0: 000000008005003b cr4: 00000000000006f0
>> (XEN) cr3: 000000080df4e000 cr2: ffff88179af79618
>> (XEN) ds: 002b es: 002b fs: 0000 gs: 0000 ss: e010 cs: e008
>> (XEN) Xen stack trace from rsp=ffff82c480297d80:
>> (XEN) 0000000000000282 fffffed4802d3f80 0000000000000eff ffff830437ffa5e0
>> (XEN) ffff830437ffa5e8 ffff830437ffa870 ffff830437ffa5e0 0000000000000282
>> (XEN) ffff830437ffa5e8 00002a3037ffa870 00000f0000000f00 0000000000000000
>> (XEN) ffff82c400000000 ffff82c4802d3f80 ffff830437ffa5e0 ffff82c480117fd9
>> (XEN) ffff830437f9f2e8 ffff830437f9f2e0 ffff82c480297e40 ffff82c480125f34
>> (XEN) 0000000000000002 ffff830437ffa600 ffff82c4802d3f80 000000afb6f8667f
>> (XEN) ffff82c480297e90 ffff82c480126259 ffff82c48024ae20 ffff82c4802d3f80
>> (XEN) ffff830437f9f2e0 0000000000000000 0000000000000000 ffff82c4802b0880
>> (XEN) ffff82c480297f18 ffffffffffffffff ffff82c480297ed0 ffff82c480123327
>> (XEN) ffff82c4802d4a00 ffff82c480297f18 ffff82c48024ae20 ffff82c480297f18
>> (XEN) 000000afb6abd652 ffff82c4802d3ec0 ffff82c480297ee0 ffff82c4801233a2
>> (XEN) ffff82c480297f10 ffff82c4801563f5 0000000000000000 ffff8300c7cd6000
>> (XEN) 0000000000000000 ffff8300c7ad4000 ffff82c480297d48 0000000000000000
>> (XEN) 0000000000000000 0000000000000000 ffffffff81a69060 ffff8817a8503f10
>> (XEN) ffff8817a8503fd8 0000000000000246 ffff8817a8503e80 ffff880000000001
>> (XEN) 0000000000000000 0000000000000000 ffffffff810093aa 000000aafab2f86e
>> (XEN) 00000000deadbeef 00000000deadbeef 0000010000000000 ffffffff810093aa
>> (XEN) 000000000000e033 0000000000000246 ffff8817a8503ef8 000000000000e02b
>> (XEN) 0000000000000000 0000000000000000 0000000000000000 0000000000000000
>> (XEN) 0000000000000000 ffff8300c7cd6000 0000000000000000 0000000000000000
>> (XEN) Xen call trace:
>> (XEN) [<ffff82c4801180f8>] csched_acct+0x11f/0x419
>> (XEN) [<ffff82c480125f34>] execute_timer+0x4e/0x6c
>> (XEN) [<ffff82c480126259>] timer_softirq_action+0xf2/0x245
>> (XEN) [<ffff82c480123327>] __do_softirq+0x88/0x99
>> (XEN) [<ffff82c4801233a2>] do_softirq+0x6a/0x7a
>> (XEN) [<ffff82c4801563f5>] idle_loop+0x6a/0x6f
>> (XEN)
>> (XEN)
>> (XEN) ****************************************
>> (XEN) Panic on CPU 0:
>> (XEN) Xen BUG at sched_credit.c:990
>> (XEN) ****************************************
>> (XEN)
>> (XEN) Reboot in five seconds...
>>
>>
>
>
> --
> Juergen Gross                 Principal Developer Operating Systems
> TSP ES&S SWE OS6                       Telephone: +49 (0) 89 3222 2967
> Fujitsu Technology Solutions              e-mail:
> juergen.gross@ts.fujitsu.com
> Domagkstr. 28                           Internet: ts.fujitsu.com
> D-80807 Muenchen                 Company details:
> ts.fujitsu.com/imprint.html
>
> _______________________________________________
> Xen-devel mailing list
> Xen-devel@lists.xensource.com
> http://lists.xensource.com/xen-devel
>


* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-01-28 11:07   ` Andre Przywara
@ 2011-01-28 11:44     ` Juergen Gross
  2011-01-28 13:14       ` Andre Przywara
  0 siblings, 1 reply; 53+ messages in thread
From: Juergen Gross @ 2011-01-28 11:44 UTC (permalink / raw)
  To: Andre Przywara; +Cc: xen-devel, Ian Jackson, Keir Fraser

On 01/28/11 12:07, Andre Przywara wrote:
> Juergen Gross wrote:
>> On 01/28/11 00:18, Andre Przywara wrote:
>>> Hi,
>>>
>>> when I boot my machine without restricting Dom0 (dom0_mem=
>>> dom0_max_vcpus=) I get an _hypervisor_ crash when I run
>>> # xl cpupool-numa-split
>>> If Dom0's resources are limited on the Xen cmdline, everything works
>>> fine.
>>> The crashdump points to a scheduling problem with weights, so I assume
>>> the NUMA distribution algorithm some fools the hypervisor completely.
>>>
>>> I will investigate this further tomorrow, but maybe someone has some
>>> good idea.
>>
>> I've seen this once with an older cpupool version on a 24 processor
>> machine.
>> It was NOT related to NUMA, but did occur only on reboot after a Dom0
>> panic.
>> The machine had an init script creating a cpupool and populating it with
>> cpus. The machine was in a panic loop due to the BUG in sched_acct
>> then until
>> it was resetted manually. After the reset the problem was gone.
>>
>> As I was never able to reproduce the problem later (the same software is
>> running on dozens of machines!), I assumed there was a problem related to
>> the first Dom0 panic, may be some destroyed BIOS tables.
>>
>> Can the crash be reproduced easily?
> Yes.
> If I don't specify dom0_max_vcpus= and dom0_mem= on the Xen cmdline, I
> can reliably trigger the crash with xl cpupool-numa-split.
> Omitting dom0_max_vcpus only does not suffice.

Do I understand correctly?
No crash with only dom0_max_vcpus= and no crash with only dom0_mem= ?

Could you try this patch?

diff -r b59f04eb8978 xen/common/schedule.c
--- a/xen/common/schedule.c     Fri Jan 21 18:06:23 2011 +0000
+++ b/xen/common/schedule.c     Fri Jan 28 12:42:46 2011 +0100
@@ -1301,7 +1301,9 @@ void schedule_cpu_switch(unsigned int cp

      idle = idle_vcpu[cpu];
      ppriv = SCHED_OP(new_ops, alloc_pdata, cpu);
+    BUG_ON(ppriv == NULL);
      vpriv = SCHED_OP(new_ops, alloc_vdata, idle, idle->domain->sched_priv);
+    BUG_ON(vpriv == NULL);

      pcpu_schedule_lock_irqsave(cpu, flags);



-- 
Juergen Gross                 Principal Developer Operating Systems
TSP ES&S SWE OS6                       Telephone: +49 (0) 89 3222 2967
Fujitsu Technology Solutions              e-mail: juergen.gross@ts.fujitsu.com
Domagkstr. 28                           Internet: ts.fujitsu.com
D-80807 Muenchen                 Company details: ts.fujitsu.com/imprint.html


* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-01-28 11:13   ` George Dunlap
@ 2011-01-28 13:05     ` Andre Przywara
  0 siblings, 0 replies; 53+ messages in thread
From: Andre Przywara @ 2011-01-28 13:05 UTC (permalink / raw)
  To: George Dunlap; +Cc: Keir Fraser, Juergen Gross, xen-devel, Ian Jackson

George Dunlap wrote:
> Hmm, strange... looks like it has something to do with the code which
> keeps track of which vcpus are earning credits.  You say this is done
> immediately after boot, with no VMs running other than dom0?
Right, after Dom0's prompt I just start xl cpupool-numa-split and the 
machine crashes.
> 
> What are the dom0_max_vcpus and dom0_mem settings required to make it work?
dom0_mem=8192M dom0_max_vcpus=6: works
dom0_mem=8192M: works
dom0_max_vcpus=6: works
(no settings): crashes
dom0_mem=20480M dom0_max_vcpus=8: works
The machine has 8 nodes with 6 CPUs each; the nodes have alternating 16 GB 
and 8 GB of memory (4 twelve-core (MCM, aka dual-node) Opterons with 96 GB 
RAM in total).
If I try to reproduce the actions of xl cpupool-numa-split via a shell 
script, it also crashes, just before the creation of the last pool. I will 
insert some instrumentation into the code to find the offending action.

Regards,
Andre.

> On Fri, Jan 28, 2011 at 6:47 AM, Juergen Gross
> <juergen.gross@ts.fujitsu.com> wrote:
>> On 01/28/11 00:18, Andre Przywara wrote:
>>> Hi,
>>>
>>> when I boot my machine without restricting Dom0 (dom0_mem=
>>> dom0_max_vcpus=) I get an _hypervisor_ crash when I run
>>> # xl cpupool-numa-split
>>> If Dom0's resources are limited on the Xen cmdline, everything works fine.
>>> The crashdump points to a scheduling problem with weights, so I assume
>>> the NUMA distribution algorithm some fools the hypervisor completely.
>>>
>>> I will investigate this further tomorrow, but maybe someone has some
>>> good idea.
>> I've seen this once with an older cpupool version on a 24 processor machine.
>> It was NOT related to NUMA, but did occur only on reboot after a Dom0 panic.
>> The machine had an init script creating a cpupool and populating it with
>> cpus. The machine was in a panic loop due to the BUG in sched_acct then
>> until
>> it was resetted manually. After the reset the problem was gone.
>>
>> As I was never able to reproduce the problem later (the same software is
>> running on dozens of machines!), I assumed there was a problem related to
>> the first Dom0 panic, may be some destroyed BIOS tables.
>>
>> Can the crash be reproduced easily?
>>
>>
>> Juergen
>>
>>> Regards,
>>> Andre.
>>>
>>> root@dosorca:/data/images# xl cpupool-numa-split
>>> (XEN) Xen BUG at sched_credit.c:990
>>> (XEN) ----[ Xen-4.1.0-rc2-pre x86_64 debug=y Not tainted ]----
>>> (XEN) CPU: 0
>>> (XEN) RIP: e008:[<ffff82c4801180f8>] csched_acct+0x11f/0x419
>>> (XEN) RFLAGS: 0000000000010006 CONTEXT: hypervisor
>>> (XEN) rax: 0000000000000010 rbx: 0000000000000f00 rcx: 0000000000000100
>>> (XEN) rdx: 0000000000001000 rsi: ffff830437ffa600 rdi: 0000000000000010
>>> (XEN) rbp: ffff82c480297e10 rsp: ffff82c480297d80 r8: 0000000000000100
>>> (XEN) r9: 0000000000000006 r10: ffff82c4802d4100 r11: 000000afc7df0edf
>>> (XEN) r12: ffff830437ffa5e0 r13: ffff82c480117fd9 r14: ffff830437f9f2e8
>>> (XEN) r15: ffff830434321ec0 cr0: 000000008005003b cr4: 00000000000006f0
>>> (XEN) cr3: 000000080df4e000 cr2: ffff88179af79618
>>> (XEN) ds: 002b es: 002b fs: 0000 gs: 0000 ss: e010 cs: e008
>>> (XEN) Xen stack trace from rsp=ffff82c480297d80:
>>> (XEN) 0000000000000282 fffffed4802d3f80 0000000000000eff ffff830437ffa5e0
>>> (XEN) ffff830437ffa5e8 ffff830437ffa870 ffff830437ffa5e0 0000000000000282
>>> (XEN) ffff830437ffa5e8 00002a3037ffa870 00000f0000000f00 0000000000000000
>>> (XEN) ffff82c400000000 ffff82c4802d3f80 ffff830437ffa5e0 ffff82c480117fd9
>>> (XEN) ffff830437f9f2e8 ffff830437f9f2e0 ffff82c480297e40 ffff82c480125f34
>>> (XEN) 0000000000000002 ffff830437ffa600 ffff82c4802d3f80 000000afb6f8667f
>>> (XEN) ffff82c480297e90 ffff82c480126259 ffff82c48024ae20 ffff82c4802d3f80
>>> (XEN) ffff830437f9f2e0 0000000000000000 0000000000000000 ffff82c4802b0880
>>> (XEN) ffff82c480297f18 ffffffffffffffff ffff82c480297ed0 ffff82c480123327
>>> (XEN) ffff82c4802d4a00 ffff82c480297f18 ffff82c48024ae20 ffff82c480297f18
>>> (XEN) 000000afb6abd652 ffff82c4802d3ec0 ffff82c480297ee0 ffff82c4801233a2
>>> (XEN) ffff82c480297f10 ffff82c4801563f5 0000000000000000 ffff8300c7cd6000
>>> (XEN) 0000000000000000 ffff8300c7ad4000 ffff82c480297d48 0000000000000000
>>> (XEN) 0000000000000000 0000000000000000 ffffffff81a69060 ffff8817a8503f10
>>> (XEN) ffff8817a8503fd8 0000000000000246 ffff8817a8503e80 ffff880000000001
>>> (XEN) 0000000000000000 0000000000000000 ffffffff810093aa 000000aafab2f86e
>>> (XEN) 00000000deadbeef 00000000deadbeef 0000010000000000 ffffffff810093aa
>>> (XEN) 000000000000e033 0000000000000246 ffff8817a8503ef8 000000000000e02b
>>> (XEN) 0000000000000000 0000000000000000 0000000000000000 0000000000000000
>>> (XEN) 0000000000000000 ffff8300c7cd6000 0000000000000000 0000000000000000
>>> (XEN) Xen call trace:
>>> (XEN) [<ffff82c4801180f8>] csched_acct+0x11f/0x419
>>> (XEN) [<ffff82c480125f34>] execute_timer+0x4e/0x6c
>>> (XEN) [<ffff82c480126259>] timer_softirq_action+0xf2/0x245
>>> (XEN) [<ffff82c480123327>] __do_softirq+0x88/0x99
>>> (XEN) [<ffff82c4801233a2>] do_softirq+0x6a/0x7a
>>> (XEN) [<ffff82c4801563f5>] idle_loop+0x6a/0x6f
>>> (XEN)
>>> (XEN)
>>> (XEN) ****************************************
>>> (XEN) Panic on CPU 0:
>>> (XEN) Xen BUG at sched_credit.c:990
>>> (XEN) ****************************************
>>> (XEN)
>>> (XEN) Reboot in five seconds...
>>>
>>>
>>
>> --
>> Juergen Gross                 Principal Developer Operating Systems
>> TSP ES&S SWE OS6                       Telephone: +49 (0) 89 3222 2967
>> Fujitsu Technology Solutions              e-mail:
>> juergen.gross@ts.fujitsu.com
>> Domagkstr. 28                           Internet: ts.fujitsu.com
>> D-80807 Muenchen                 Company details:
>> ts.fujitsu.com/imprint.html
>>
>> _______________________________________________
>> Xen-devel mailing list
>> Xen-devel@lists.xensource.com
>> http://lists.xensource.com/xen-devel
>>
> 


-- 
Andre Przywara
AMD-OSRC (Dresden)
Tel: x29712


* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-01-28 11:44     ` Juergen Gross
@ 2011-01-28 13:14       ` Andre Przywara
  2011-01-31  7:04         ` Juergen Gross
  0 siblings, 1 reply; 53+ messages in thread
From: Andre Przywara @ 2011-01-28 13:14 UTC (permalink / raw)
  To: Juergen Gross; +Cc: Ian Jackson, Keir Fraser, xen-devel

> 
> Do I understand correctly?
> No crash with only dom0_max_vcpus= and no crash with only dom0_mem= ?
Yes, see my previous mail to George.

> 
> Could you try this patch?
Ok, the crash dump is as follows:
(XEN) Xen BUG at sched_credit.c:384
(XEN) ----[ Xen-4.1.0-rc2-pre  x86_64  debug=y  Not tainted ]----
(XEN) CPU:    2
(XEN) RIP:    e008:[<ffff82c480117fa0>] csched_alloc_pdata+0x146/0x17f
(XEN) RFLAGS: 0000000000010093   CONTEXT: hypervisor
(XEN) rax: ffff830434322000   rbx: ffff830434418748   rcx: 0000000000000024
(XEN) rdx: ffff82c4802d3ec0   rsi: 0000000000000003   rdi: ffff8304343c9100
(XEN) rbp: ffff83043457fce8   rsp: ffff83043457fca8   r8:  0000000000000001
(XEN) r9:  ffff830434418748   r10: ffff82c48021a0a0   r11: 0000000000000286
(XEN) r12: 0000000000000024   r13: ffff83123a3b2b60   r14: ffff830434418730
(XEN) r15: 0000000000000024   cr0: 000000008005003b   cr4: 00000000000006f0
(XEN) cr3: 00000008061df000   cr2: ffff8817a21f87a0
(XEN) ds: 0000   es: 0000   fs: 0000   gs: 0000   ss: e010   cs: e008
(XEN) Xen stack trace from rsp=ffff83043457fca8:
(XEN)    ffff83043457fcb8 ffff83123a3b2b60 0000000000000286 0000000000000024
(XEN)    ffff830434418820 ffff83123a3b2a70 0000000000000024 ffff82c4802b0880
(XEN)    ffff83043457fd58 ffff82c48011fa63 ffff82f60102aa80 0000000000081554
(XEN)    ffff8300c7cfa000 0000000000000000 0000400000000000 ffff82c480248e00
(XEN)    0000000000000002 0000000000000024 ffff830434418820 0000000000305000
(XEN)    ffff82c4802550e4 ffff82c4802b0880 ffff83043457fd78 ffff82c48010188c
(XEN)    ffff83043457fe40 0000000000000024 ffff83043457fdb8 ffff82c480101b94
(XEN)    ffff83043457fdb8 ffff82c4801836f2 fffffffe00000286 ffff83043457ff18
(XEN)    0000000002170004 0000000000305000 ffff83043457fef8 ffff82c480125281
(XEN)    ffff83043457fdd8 0000000180153c9d 0000000000000000 ffff82c4801068f8
(XEN)    0000000000000296 ffff8300c7e0a1c8 aaaaaaaaaaaaaaaa 0000000000000000
(XEN)    ffff88007d1ac170 ffff88007d1ac170 ffff83043457fef8 ffff82c480113d8a
(XEN)    ffff83043457fe78 ffff83043457fe88 0000000800000012 0000000600000004
(XEN)    0000000000000000 ffffffff00000024 0000000000000000 00007fac2e0e5a00
(XEN)    0000000002170000 0000000000000000 0000000000000000 ffffffffffffffff
(XEN)    0000000000000000 0000000000000080 000000000000002f 0000000002170004
(XEN)    0000000002172004 0000000002174004 00007fff878f1c80 0000000000000033
(XEN)    ffff83043457fed8 ffff8300c7e0a000 00007fff878f1b30 0000000000305000
(XEN)    0000000000000003 0000000000000003 00007cfbcba800c7 ffff82c480207dd8
(XEN)    ffffffff8100946a 0000000000000023 0000000000000003 0000000000000003
(XEN) Xen call trace:
(XEN)    [<ffff82c480117fa0>] csched_alloc_pdata+0x146/0x17f
(XEN)    [<ffff82c48011fa63>] schedule_cpu_switch+0x75/0x1eb
(XEN)    [<ffff82c48010188c>] cpupool_assign_cpu_locked+0x44/0x8b
(XEN)    [<ffff82c480101b94>] cpupool_do_sysctl+0x1fb/0x461
(XEN)    [<ffff82c480125281>] do_sysctl+0x921/0xa30
(XEN)    [<ffff82c480207dd8>] syscall_enter+0xc8/0x122
(XEN)
(XEN)
(XEN) ****************************************
(XEN) Panic on CPU 2:
(XEN) Xen BUG at sched_credit.c:384
(XEN) ****************************************
(XEN)
(XEN) Reboot in five seconds...

Regards,
Andre.

> 
> diff -r b59f04eb8978 xen/common/schedule.c
> --- a/xen/common/schedule.c     Fri Jan 21 18:06:23 2011 +0000
> +++ b/xen/common/schedule.c     Fri Jan 28 12:42:46 2011 +0100
> @@ -1301,7 +1301,9 @@ void schedule_cpu_switch(unsigned int cp
> 
>       idle = idle_vcpu[cpu];
>       ppriv = SCHED_OP(new_ops, alloc_pdata, cpu);
> +    BUG_ON(ppriv == NULL);
>       vpriv = SCHED_OP(new_ops, alloc_vdata, idle, idle->domain->sched_priv);
> +    BUG_ON(vpriv == NULL);
> 
>       pcpu_schedule_lock_irqsave(cpu, flags);
> 
> 
> 


-- 
Andre Przywara
AMD-OSRC (Dresden)
Tel: x29712


* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-01-28 13:14       ` Andre Przywara
@ 2011-01-31  7:04         ` Juergen Gross
  2011-01-31 14:59           ` Andre Przywara
  0 siblings, 1 reply; 53+ messages in thread
From: Juergen Gross @ 2011-01-31  7:04 UTC (permalink / raw)
  To: Andre Przywara; +Cc: Ian Jackson, xen-devel, Keir Fraser

On 01/28/11 14:14, Andre Przywara wrote:
>>
>> Do I understand correctly?
>> No crash with only dom0_max_vcpus= and no crash with only dom0_mem= ?
> Yes, see my previous mail to George.
>
>>
>> Could you try this patch?
> Ok, the crash dump is as follows:

Hmm, is the new crash reproducible as well?
Seems not to be directly related to my diagnosis patch...

Currently I have no NUMA machine available. I tried to use the numa=fake=...
boot parameter, but this seems to fake only NUMA memory nodes; all cpus are
still in node 0:

(XEN) 'u' pressed -> dumping numa info (now-0x120:5D5E0203)
(XEN) idx0 -> NODE0 start->0 size->524288
(XEN) phys_to_nid(0000000000001000) -> 0 should be 0
(XEN) idx1 -> NODE1 start->524288 size->524288
(XEN) phys_to_nid(0000000080001000) -> 1 should be 1
(XEN) idx2 -> NODE2 start->1048576 size->524288
(XEN) phys_to_nid(0000000100001000) -> 2 should be 2
(XEN) idx3 -> NODE3 start->1572864 size->1835008
(XEN) phys_to_nid(0000000180001000) -> 3 should be 3
(XEN) CPU0 -> NODE0
(XEN) CPU1 -> NODE0
(XEN) CPU2 -> NODE0
(XEN) CPU3 -> NODE0
(XEN) Memory location of each domain:
(XEN) Domain 0 (total: 3003121):
(XEN)     Node 0: 433864
(XEN)     Node 1: 258522
(XEN)     Node 2: 514315
(XEN)     Node 3: 1796420

I suspect a problem with the __cpuinit stuff overwriting some node info.
Andre, could you check this? I hope to reproduce your problem on my machine.

> (XEN) Xen BUG at sched_credit.c:384
> (XEN) ----[ Xen-4.1.0-rc2-pre x86_64 debug=y Not tainted ]----
> (XEN) CPU: 2
> (XEN) RIP: e008:[<ffff82c480117fa0>] csched_alloc_pdata+0x146/0x17f
> (XEN) RFLAGS: 0000000000010093 CONTEXT: hypervisor
> (XEN) rax: ffff830434322000 rbx: ffff830434418748 rcx: 0000000000000024
> (XEN) rdx: ffff82c4802d3ec0 rsi: 0000000000000003 rdi: ffff8304343c9100
> (XEN) rbp: ffff83043457fce8 rsp: ffff83043457fca8 r8: 0000000000000001
> (XEN) r9: ffff830434418748 r10: ffff82c48021a0a0 r11: 0000000000000286
> (XEN) r12: 0000000000000024 r13: ffff83123a3b2b60 r14: ffff830434418730
> (XEN) r15: 0000000000000024 cr0: 000000008005003b cr4: 00000000000006f0
> (XEN) cr3: 00000008061df000 cr2: ffff8817a21f87a0
> (XEN) ds: 0000 es: 0000 fs: 0000 gs: 0000 ss: e010 cs: e008
> (XEN) Xen stack trace from rsp=ffff83043457fca8:
> (XEN) ffff83043457fcb8 ffff83123a3b2b60 0000000000000286 0000000000000024
> (XEN) ffff830434418820 ffff83123a3b2a70 0000000000000024 ffff82c4802b0880
> (XEN) ffff83043457fd58 ffff82c48011fa63 ffff82f60102aa80 0000000000081554
> (XEN) ffff8300c7cfa000 0000000000000000 0000400000000000 ffff82c480248e00
> (XEN) 0000000000000002 0000000000000024 ffff830434418820 0000000000305000
> (XEN) ffff82c4802550e4 ffff82c4802b0880 ffff83043457fd78 ffff82c48010188c
> (XEN) ffff83043457fe40 0000000000000024 ffff83043457fdb8 ffff82c480101b94
> (XEN) ffff83043457fdb8 ffff82c4801836f2 fffffffe00000286 ffff83043457ff18
> (XEN) 0000000002170004 0000000000305000 ffff83043457fef8 ffff82c480125281
> (XEN) ffff83043457fdd8 0000000180153c9d 0000000000000000 ffff82c4801068f8
> (XEN) 0000000000000296 ffff8300c7e0a1c8 aaaaaaaaaaaaaaaa 0000000000000000
> (XEN) ffff88007d1ac170 ffff88007d1ac170 ffff83043457fef8 ffff82c480113d8a
> (XEN) ffff83043457fe78 ffff83043457fe88 0000000800000012 0000000600000004
> (XEN) 0000000000000000 ffffffff00000024 0000000000000000 00007fac2e0e5a00
> (XEN) 0000000002170000 0000000000000000 0000000000000000 ffffffffffffffff
> (XEN) 0000000000000000 0000000000000080 000000000000002f 0000000002170004
> (XEN) 0000000002172004 0000000002174004 00007fff878f1c80 0000000000000033
> (XEN) ffff83043457fed8 ffff8300c7e0a000 00007fff878f1b30 0000000000305000
> (XEN) 0000000000000003 0000000000000003 00007cfbcba800c7 ffff82c480207dd8
> (XEN) ffffffff8100946a 0000000000000023 0000000000000003 0000000000000003
> (XEN) Xen call trace:
> (XEN) [<ffff82c480117fa0>] csched_alloc_pdata+0x146/0x17f
> (XEN) [<ffff82c48011fa63>] schedule_cpu_switch+0x75/0x1eb
> (XEN) [<ffff82c48010188c>] cpupool_assign_cpu_locked+0x44/0x8b
> (XEN) [<ffff82c480101b94>] cpupool_do_sysctl+0x1fb/0x461
> (XEN) [<ffff82c480125281>] do_sysctl+0x921/0xa30
> (XEN) [<ffff82c480207dd8>] syscall_enter+0xc8/0x122
> (XEN)
> (XEN)
> (XEN) ****************************************
> (XEN) Panic on CPU 2:
> (XEN) Xen BUG at sched_credit.c:384
> (XEN) ****************************************
> (XEN)
> (XEN) Reboot in five seconds...


Juergen

-- 
Juergen Gross                 Principal Developer Operating Systems
TSP ES&S SWE OS6                       Telephone: +49 (0) 89 3222 2967
Fujitsu Technology Solutions              e-mail: juergen.gross@ts.fujitsu.com
Domagkstr. 28                           Internet: ts.fujitsu.com
D-80807 Muenchen                 Company details: ts.fujitsu.com/imprint.html


* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-01-31  7:04         ` Juergen Gross
@ 2011-01-31 14:59           ` Andre Przywara
  2011-01-31 15:28             ` George Dunlap
  0 siblings, 1 reply; 53+ messages in thread
From: Andre Przywara @ 2011-01-31 14:59 UTC (permalink / raw)
  To: Juergen Gross, George Dunlap; +Cc: Ian Jackson, xen-devel, Keir Fraser

Juergen Gross wrote:
> On 01/28/11 14:14, Andre Przywara wrote:
>>> Do I understand correctly?
>>> No crash with only dom0_max_vcpus= and no crash with only dom0_mem= ?
>> Yes, see my previous mail to George.
>>
>>> Could you try this patch?
>> Ok, the crash dump is as follows:
> 
> Hmm, is the new crash reproducable as well?
> Seems not to be directly related to my diagnosis patch...
Right, that was also my impression.

I seem to have gotten a bit further, though:
By accident I found that in c/s 22846 the issue is fixed; it now works 
without crashing. I bisected it down to my own patch, which disables the 
NODEID_MSR in Dom0. I could confirm this theory by a) applying this 
single line (clear_bit(NODEID_MSR)) to 22799 and _not_ seeing it crash, 
and b) removing this line from 22846 and seeing it crash.

So my theory is that Dom0 sees different nodes on its virtual CPUs via 
the physical NodeID MSR, but this association can (and will) be changed 
at any moment by the Xen scheduler. So Dom0 will build a bogus topology 
based upon these values. As soon as all vCPUs of Dom0 are confined to 
one node (node 0; this is caused by the cpupool-numa-split call), the 
Xen scheduler somehow hiccups.
So it seems to be a bad combination of the NodeID MSR (on newer AMD 
platforms: sockets C32 and G34) and a NodeID-MSR-aware Dom0 (2.6.32.27).
Since this is a hypervisor crash, I assume that the bug is still there; 
the current tip only makes it much less likely to be triggered.

Hope that helps; I will dig deeper now.

Regards,
Andre.

> 
> Currently I have no NUMA machine available. I tried to use numa=fake=...
> boot parameter, but this seems to fake only NUMA memory nodes, all cpus are
> still in node 0:
> 
> (XEN) 'u' pressed -> dumping numa info (now-0x120:5D5E0203)
> (XEN) idx0 -> NODE0 start->0 size->524288
> (XEN) phys_to_nid(0000000000001000) -> 0 should be 0
> (XEN) idx1 -> NODE1 start->524288 size->524288
> (XEN) phys_to_nid(0000000080001000) -> 1 should be 1
> (XEN) idx2 -> NODE2 start->1048576 size->524288
> (XEN) phys_to_nid(0000000100001000) -> 2 should be 2
> (XEN) idx3 -> NODE3 start->1572864 size->1835008
> (XEN) phys_to_nid(0000000180001000) -> 3 should be 3
> (XEN) CPU0 -> NODE0
> (XEN) CPU1 -> NODE0
> (XEN) CPU2 -> NODE0
> (XEN) CPU3 -> NODE0
> (XEN) Memory location of each domain:
> (XEN) Domain 0 (total: 3003121):
> (XEN)     Node 0: 433864
> (XEN)     Node 1: 258522
> (XEN)     Node 2: 514315
> (XEN)     Node 3: 1796420
> 
> I suspect a problem with the __cpuinit stuff overwriting some node info.
> Andre, could you check this? I hope to reproduce your problem on my machine.
> 
>> (XEN) Xen BUG at sched_credit.c:384
>> (XEN) ----[ Xen-4.1.0-rc2-pre x86_64 debug=y Not tainted ]----
>> (XEN) CPU: 2
>> (XEN) RIP: e008:[<ffff82c480117fa0>] csched_alloc_pdata+0x146/0x17f
>> (XEN) RFLAGS: 0000000000010093 CONTEXT: hypervisor
>> (XEN) rax: ffff830434322000 rbx: ffff830434418748 rcx: 0000000000000024
>> (XEN) rdx: ffff82c4802d3ec0 rsi: 0000000000000003 rdi: ffff8304343c9100
>> (XEN) rbp: ffff83043457fce8 rsp: ffff83043457fca8 r8: 0000000000000001
>> (XEN) r9: ffff830434418748 r10: ffff82c48021a0a0 r11: 0000000000000286
>> (XEN) r12: 0000000000000024 r13: ffff83123a3b2b60 r14: ffff830434418730
>> (XEN) r15: 0000000000000024 cr0: 000000008005003b cr4: 00000000000006f0
>> (XEN) cr3: 00000008061df000 cr2: ffff8817a21f87a0
>> (XEN) ds: 0000 es: 0000 fs: 0000 gs: 0000 ss: e010 cs: e008
>> (XEN) Xen stack trace from rsp=ffff83043457fca8:
>> (XEN) ffff83043457fcb8 ffff83123a3b2b60 0000000000000286 0000000000000024
>> (XEN) ffff830434418820 ffff83123a3b2a70 0000000000000024 ffff82c4802b0880
>> (XEN) ffff83043457fd58 ffff82c48011fa63 ffff82f60102aa80 0000000000081554
>> (XEN) ffff8300c7cfa000 0000000000000000 0000400000000000 ffff82c480248e00
>> (XEN) 0000000000000002 0000000000000024 ffff830434418820 0000000000305000
>> (XEN) ffff82c4802550e4 ffff82c4802b0880 ffff83043457fd78 ffff82c48010188c
>> (XEN) ffff83043457fe40 0000000000000024 ffff83043457fdb8 ffff82c480101b94
>> (XEN) ffff83043457fdb8 ffff82c4801836f2 fffffffe00000286 ffff83043457ff18
>> (XEN) 0000000002170004 0000000000305000 ffff83043457fef8 ffff82c480125281
>> (XEN) ffff83043457fdd8 0000000180153c9d 0000000000000000 ffff82c4801068f8
>> (XEN) 0000000000000296 ffff8300c7e0a1c8 aaaaaaaaaaaaaaaa 0000000000000000
>> (XEN) ffff88007d1ac170 ffff88007d1ac170 ffff83043457fef8 ffff82c480113d8a
>> (XEN) ffff83043457fe78 ffff83043457fe88 0000000800000012 0000000600000004
>> (XEN) 0000000000000000 ffffffff00000024 0000000000000000 00007fac2e0e5a00
>> (XEN) 0000000002170000 0000000000000000 0000000000000000 ffffffffffffffff
>> (XEN) 0000000000000000 0000000000000080 000000000000002f 0000000002170004
>> (XEN) 0000000002172004 0000000002174004 00007fff878f1c80 0000000000000033
>> (XEN) ffff83043457fed8 ffff8300c7e0a000 00007fff878f1b30 0000000000305000
>> (XEN) 0000000000000003 0000000000000003 00007cfbcba800c7 ffff82c480207dd8
>> (XEN) ffffffff8100946a 0000000000000023 0000000000000003 0000000000000003
>> (XEN) Xen call trace:
>> (XEN) [<ffff82c480117fa0>] csched_alloc_pdata+0x146/0x17f
>> (XEN) [<ffff82c48011fa63>] schedule_cpu_switch+0x75/0x1eb
>> (XEN) [<ffff82c48010188c>] cpupool_assign_cpu_locked+0x44/0x8b
>> (XEN) [<ffff82c480101b94>] cpupool_do_sysctl+0x1fb/0x461
>> (XEN) [<ffff82c480125281>] do_sysctl+0x921/0xa30
>> (XEN) [<ffff82c480207dd8>] syscall_enter+0xc8/0x122
>> (XEN)
>> (XEN)
>> (XEN) ****************************************
>> (XEN) Panic on CPU 2:
>> (XEN) Xen BUG at sched_credit.c:384
>> (XEN) ****************************************
>> (XEN)
>> (XEN) Reboot in five seconds...
> 
> 
> Juergen
> 


-- 
Andre Przywara
AMD-Operating System Research Center (OSRC), Dresden, Germany


* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-01-31 14:59           ` Andre Przywara
@ 2011-01-31 15:28             ` George Dunlap
  2011-02-01 16:32               ` Andre Przywara
  0 siblings, 1 reply; 53+ messages in thread
From: George Dunlap @ 2011-01-31 15:28 UTC (permalink / raw)
  To: Andre Przywara; +Cc: Keir Fraser, Juergen Gross, xen-devel, Ian Jackson

On Mon, Jan 31, 2011 at 2:59 PM, Andre Przywara <andre.przywara@amd.com> wrote:
> Right, that was also my impression.
>
> I seemed to get a bit further, though:
> By accident I found that in c/s 22846 the issue is fixed, it works now
> without crashing. I bisected it down to my own patch, which disables the
> NODEID_MSR in Dom0. I could confirm this theory by a) applying this single
> line (clear_bit(NODEID_MSR)) to 22799 and _not_ seeing it crash and b) by
> removing this line from 22846 and seeing it crash.
>
> So my theory is that Dom0 sees different nodes on its virtual CPUs via the
> physical NodeID MSR, but this association can (and will) be changed every
> moment by the Xen scheduler. So Dom0 will build a bogus topology based upon
> these values. As soon as all vCPUs of Dom0 are contained into one node (node
> 0, this is caused by the cpupool-numa-split call), the Xen scheduler somehow
> hicks up.
> So it seems to be bad combination caused by the NodeID-MSR (on newer AMD
> platforms: sockets C32 and G34) and a NodeID MSR aware Dom0 (2.6.32.27).
> Since this is a hypervisor crash, I assume that the bug is still there, only
> the current tip will make it much less likely to be triggered.
>
> Hope that help, I will dig deeper now.

Thanks.  The crashes you're getting are in fact very strange.  They
have to do with assumptions that the credit scheduler makes as part of
its accounting process.  It would only make sense for those to be
triggered if a vcpu was moved from one pool to another pool without
the proper accounting being done.  (Specifically, each vcpu is
classified as either "active" or "inactive"; and each scheduler
instance keeps track of the total weight of all "active" vcpus.  The
BUGs you're tripping over are saying that this invariant has been
violated.)  However, I've looked at the cpupools vcpu-migrate code,
and it looks like it does everything right.  So I'm a bit mystified.
My only thought is that possibly a cpumask somewhere wasn't getting
set properly, such that a vcpu was being run on a cpu from another
pool.
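
For reference, the bookkeeping in question looks roughly like this (a
simplified sketch of the 4.1-era xen/common/sched_credit.c; structure and
field names are recalled from that code and may not match the tree exactly,
locking and statistics omitted):

/* Sketch: when a vcpu becomes "active", its domain's weight is added to
 * the scheduler instance's running total.  csched_acct() later assumes
 * this total is consistent with the per-domain active_vcpu_count values;
 * that is the invariant the tripped BUG_ONs check. */
static void sketch_vcpu_acct_start(struct csched_private *prv,
                                   struct csched_vcpu *svc)
{
    struct csched_dom * const sdom = svc->sdom;

    if ( list_empty(&svc->active_vcpu_elem) )
    {
        sdom->active_vcpu_count++;
        list_add(&svc->active_vcpu_elem, &sdom->active_vcpu);
        prv->weight += sdom->weight;   /* weight is accounted per active vcpu */
        if ( list_empty(&sdom->active_sdom_elem) )
            list_add(&sdom->active_sdom_elem, &prv->active_sdom);
    }
}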

Unfortunately I can't take a good look at this right now; hopefully
I'll be able to take a look next week.

Andre, if you were keen, you might go through the credit code and put
in a bunch of ASSERTs that the current pcpu is in the mask of the
current vcpu; and that the current vcpu is assigned to the pool of the
current pcpu, and so on.
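
A minimal sketch of such an ASSERT helper (illustrative only; the vcpu
fields and the per-cpu cpupool pointer are assumed from the 4.1-era tree
and may need adjusting):

/* Sketch: check that a vcpu only ever runs on a pcpu it is allowed on,
 * and that this pcpu actually belongs to the vcpu's own cpupool. */
static inline void check_vcpu_placement(struct vcpu *vc)
{
    unsigned int cpu = vc->processor;

    /* The current pcpu must be in the vcpu's affinity mask... */
    ASSERT(cpu_isset(cpu, vc->cpu_affinity));
    /* ...and the vcpu's domain must belong to the pool owning this pcpu. */
    ASSERT(vc->domain->cpupool == per_cpu(cpupool, cpu));
}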

 -George


* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-01-31 15:28             ` George Dunlap
@ 2011-02-01 16:32               ` Andre Przywara
  2011-02-02  6:27                 ` Juergen Gross
  2011-02-02 14:39                 ` Stephan Diestelhorst
  0 siblings, 2 replies; 53+ messages in thread
From: Andre Przywara @ 2011-02-01 16:32 UTC (permalink / raw)
  To: George Dunlap
  Cc: xen-devel, Keir Fraser, Juergen Gross, Ian Jackson, Stephan Diestelhorst

Hi folks,

I asked Stephan Diestelhorst for help, and after I convinced him that 
removing credit and making SEDF the default again is not an option, he 
worked together with me on this ;-) Many thanks for that!
We haven't come to a final solution yet, but we could gather some debug data.
I will simply dump some data here; maybe somebody has a clue. We 
will work further on this tomorrow.

First I replaced the BUG_ON with some printks to get some insight:
(XEN) sdom->active_vcpu_count: 18
(XEN) sdom->weight: 256
(XEN) weight_left: 4096, weight_total: 4096
(XEN) credit_balance: 0, credit_xtra: 0, credit_cap: 0
(XEN) Xen BUG at sched_credit.c:591
(XEN) ----[ Xen-4.1.0-rc2-pre  x86_64  debug=y  Not tainted ]----

So this shows that the number of VCPUs is not in sync with the 
computed weight sum; we have seen a difference of one or two VCPUs (in 
this case the weight has been computed from 16 VCPUs). It also 
shows that the assertion triggers in the first iteration of the loop, 
where weight_left and weight_total are still equal.
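
For completeness, the instrumentation behind the output above is roughly
the following (a sketch standing in for the weight-consistency BUG_ON() in
csched_acct(); the exact types and format specifiers in the real tree may
differ):

if ( (sdom->weight * sdom->active_vcpu_count) > weight_left )
{
    /* All values printed are locals / fields used by csched_acct(). */
    printk("sdom->active_vcpu_count: %d\n", sdom->active_vcpu_count);
    printk("sdom->weight: %d\n", sdom->weight);
    printk("weight_left: %d, weight_total: %d\n", weight_left, weight_total);
    printk("credit_balance: %d, credit_xtra: %d, credit_cap: %d\n",
           credit_balance, credit_xtra, credit_cap);
    BUG();   /* keep or drop the BUG(), depending on whether the run should continue */
}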

So I additionally instrumented alloc_pdata and free_pdata; the 
unprefixed lines below come from a shell script mimicking the functionality 
of cpupool-numa-split.
------------
Removing CPUs from Pool 0
Creating new pool
Using config file "cpupool.test"
cpupool name:   Pool-node6
scheduler:      credit
number of cpus: 1
(XEN) adding CPU 36, now 1 CPUs
(XEN) removing CPU 36, remaining: 17
Populating new pool
(XEN) sdom->active_vcpu_count: 9
(XEN) sdom->weight: 256
(XEN) weight_left: 2048, weight_total: 2048
(XEN) credit_balance: 0, credit_xtra: 0, credit_cap: 0
(XEN) adding CPU 37, now 2 CPUs
(XEN) removing CPU 37, remaining: 16
(XEN) adding CPU 38, now 3 CPUs
(XEN) removing CPU 38, remaining: 15
(XEN) adding CPU 39, now 4 CPUs
(XEN) removing CPU 39, remaining: 14
(XEN) adding CPU 40, now 5 CPUs
(XEN) removing CPU 40, remaining: 13
(XEN) sdom->active_vcpu_count: 17
(XEN) sdom->weight: 256
(XEN) weight_left: 4096, weight_total: 4096
(XEN) credit_balance: 0, credit_xtra: 0, credit_cap: 0
(XEN) adding CPU 41, now 6 CPUs
(XEN) removing CPU 41, remaining: 12
...
Two things startled me:
1) There is quite some delay between the "Removing CPUs" message from the 
script and the actual HV printk showing it's done; why is that not 
synchronous? Looking at the code, it seems that 
__csched_vcpu_acct_start() is eventually triggered by a timer (see the 
sketch below); shouldn't that be triggered synchronously by add/removal 
events?
2) It clearly shows that each CPU gets added to the new pool _before_ it 
gets removed from the old one (Pool-0); isn't that violating the "only 
one pool per CPU" rule? Even if that is fine for a short period of time, 
maybe the timer kicks in at this very moment, resulting in violated 
invariants?
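
For reference, the timer-driven path mentioned in 1) looks roughly like
this (recalled from the 4.1-era sched_credit.c; details and names may
differ):

/* Sketch: each pcpu runs a periodic ticker; accounting activation happens
 * from here rather than from the pool add/remove operations themselves. */
static void csched_tick(void *_cpu)
{
    unsigned int cpu = (unsigned long)_cpu;
    struct csched_pcpu *spc = CSCHED_PCPU(cpu);
    struct csched_private *prv = CSCHED_PRIV(per_cpu(scheduler, cpu));

    spc->tick++;

    /* Accounting for non-idle vcpus only -- this is where
     * __csched_vcpu_acct_start() can eventually be reached. */
    if ( !is_idle_vcpu(current) )
        csched_vcpu_acct(prv, cpu);

    /* ... credit ticking / housekeeping elided ... */

    set_timer(&spc->ticker, NOW() + MILLISECS(CSCHED_MSECS_PER_TICK));
}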

Yours confused,
Andre.

George Dunlap wrote:
> On Mon, Jan 31, 2011 at 2:59 PM, Andre Przywara <andre.przywara@amd.com> wrote:
>> Right, that was also my impression.
>>
>> I seemed to get a bit further, though:
>> By accident I found that in c/s 22846 the issue is fixed, it works now
>> without crashing. I bisected it down to my own patch, which disables the
>> NODEID_MSR in Dom0. I could confirm this theory by a) applying this single
>> line (clear_bit(NODEID_MSR)) to 22799 and _not_ seeing it crash and b) by
>> removing this line from 22846 and seeing it crash.
>>
>> So my theory is that Dom0 sees different nodes on its virtual CPUs via the
>> physical NodeID MSR, but this association can (and will) be changed every
>> moment by the Xen scheduler. So Dom0 will build a bogus topology based upon
>> these values. As soon as all vCPUs of Dom0 are contained into one node (node
>> 0, this is caused by the cpupool-numa-split call), the Xen scheduler somehow
>> hicks up.
>> So it seems to be bad combination caused by the NodeID-MSR (on newer AMD
>> platforms: sockets C32 and G34) and a NodeID MSR aware Dom0 (2.6.32.27).
>> Since this is a hypervisor crash, I assume that the bug is still there, only
>> the current tip will make it much less likely to be triggered.
>>
>> Hope that help, I will dig deeper now.
> 
> Thanks.  The crashes you're getting are in fact very strange.  They
> have to do with assumptions that the credit scheduler makes as part of
> its accounting process.  It would only make sense for those to be
> triggered if a vcpu was moved from one pool to another pool without
> the proper accounting being done.  (Specifically, each vcpu is
> classified as either "active" or "inactive"; and each scheduler
> instance keeps track of the total weight of all "active" vcpus.  The
> BUGs you're tripping over are saying that this invariant has been
> violated.)  However, I've looked at the cpupools vcpu-migrate code,
> and it looks like it does everything right.  So I'm a bit mystified.
> My only thought is if possibly a cpumask somewhere that wasn't getting
> set properly, such that a vcpu was being run on a cpu from another
> pool.
> 
> Unfortunately I can't take a good look at this right now; hopefully
> I'll be able to take a look next week.
> 
> Andre, if you were keen, you might go through the credit code and put
> in a bunch of ASSERTs that the current pcpu is in the mask of the
> current vcpu; and that the current vcpu is assigned to the pool of the
> current pcpu, and so on.
> 
>  -George
> 


-- 
Andre Przywara
AMD-Operating System Research Center (OSRC), Dresden, Germany


* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-02-01 16:32               ` Andre Przywara
@ 2011-02-02  6:27                 ` Juergen Gross
  2011-02-02  8:49                   ` Juergen Gross
  2011-02-02 14:39                 ` Stephan Diestelhorst
  1 sibling, 1 reply; 53+ messages in thread
From: Juergen Gross @ 2011-02-02  6:27 UTC (permalink / raw)
  To: Andre Przywara
  Cc: George Dunlap, Ian Jackson, xen-devel, Keir Fraser, Stephan Diestelhorst

On 02/01/11 17:32, Andre Przywara wrote:
> Hi folks,
>
> I asked Stephan Diestelhorst for help and after I convinced him that
> removing credit and making SEDF the default again is not an option he
> worked together with me on that ;-) Many thanks for that!
> We haven't come to a final solution but could gather some debug data.
> I will simply dump some data here, maybe somebody has got a clue. We
> will work further on this tomorrow.
>
> First I replaced the BUG_ON with some printks to get some insight:
> (XEN) sdom->active_vcpu_count: 18
> (XEN) sdom->weight: 256
> (XEN) weight_left: 4096, weight_total: 4096
> (XEN) credit_balance: 0, credit_xtra: 0, credit_cap: 0
> (XEN) Xen BUG at sched_credit.c:591
> (XEN) ----[ Xen-4.1.0-rc2-pre x86_64 debug=y Not tainted ]----
>
> So that one shows that the number of VCPUs is not up-to-date with the
> computed weight sum, we have seen a difference of one or two VCPUs (in
> this case here the weight has been computed from 16 VCPUs). Also it
> shows that the assertion kicks in in the first iteration of the loop,
> where weight_left and weight_total are still equal.
>
> So I additionally instrumented alloc_pdata and free_pdata, the
> unprefixed lines come from a shell script mimicking the functionality of
> cpupool-numa-split.
> ------------
> Removing CPUs from Pool 0
> Creating new pool
> Using config file "cpupool.test"
> cpupool name: Pool-node6
> scheduler: credit
> number of cpus: 1
> (XEN) adding CPU 36, now 1 CPUs
> (XEN) removing CPU 36, remaining: 17
> Populating new pool
> (XEN) sdom->active_vcpu_count: 9
> (XEN) sdom->weight: 256
> (XEN) weight_left: 2048, weight_total: 2048
> (XEN) credit_balance: 0, credit_xtra: 0, credit_cap: 0
> (XEN) adding CPU 37, now 2 CPUs
> (XEN) removing CPU 37, remaining: 16
> (XEN) adding CPU 38, now 3 CPUs
> (XEN) removing CPU 38, remaining: 15
> (XEN) adding CPU 39, now 4 CPUs
> (XEN) removing CPU 39, remaining: 14
> (XEN) adding CPU 40, now 5 CPUs
> (XEN) removing CPU 40, remaining: 13
> (XEN) sdom->active_vcpu_count: 17
> (XEN) sdom->weight: 256
> (XEN) weight_left: 4096, weight_total: 4096
> (XEN) credit_balance: 0, credit_xtra: 0, credit_cap: 0
> (XEN) adding CPU 41, now 6 CPUs
> (XEN) removing CPU 41, remaining: 12
> ...
> Two thing startled me:
> 1) There is quite some between the "Removing CPUs" message from the
> script and the actual HV printk showing it's done, why is that not
> synchronous?

Removing cpus from Pool-0 requires no switching of the scheduler, so you
see no calls of alloc/free_pdata here.

 > Looking at the code it shows that
> __csched_vcpu_acct_start() is eventually triggered by a timer, shouldn't
> that be triggered synchronously by add/removal events?

The vcpus are not moved explicitly; they are migrated by the normal
scheduler mechanisms, the same as for vcpu-pin.

> 2) It clearly shows that each CPU gets added to the new pool _before_ it
> gets removed from the old one (Pool-0), isn't that violating the "only
> one pool per CPU" rule? Even it that is fine for a short period of time,
> maybe the timer kicks in in this very moment resulting in violated
> invariants?

The sequence you are seeing seems to be okay. The alloc_pdata for the new pool
is called before the free_pdata for the old pool.
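
For illustration, the relevant part of schedule_cpu_switch()
(xen/common/schedule.c, 4.1-era) is roughly the following, heavily
abridged; the first lines match the context of the diagnosis patch above:

idle  = idle_vcpu[cpu];
ppriv = SCHED_OP(new_ops, alloc_pdata, cpu);        /* new pool: allocate first */
vpriv = SCHED_OP(new_ops, alloc_vdata, idle, idle->domain->sched_priv);

pcpu_schedule_lock_irqsave(cpu, flags);
/* ... install new_ops and the new private pointers, saving the old ones
 *     into ppriv_old / vpriv_old ... */
pcpu_schedule_unlock_irqrestore(cpu, flags);

SCHED_OP(old_ops, free_vdata, vpriv_old);           /* old pool: free afterwards */
SCHED_OP(old_ops, free_pdata, ppriv_old, cpu);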

And the timer is not relevant, as only the idle vcpu should be running on the
moving cpu and the accounting stuff is never called during idle.



Juergen

-- 
Juergen Gross                 Principal Developer Operating Systems
TSP ES&S SWE OS6                       Telephone: +49 (0) 89 3222 2967
Fujitsu Technology Solutions              e-mail: juergen.gross@ts.fujitsu.com
Domagkstr. 28                           Internet: ts.fujitsu.com
D-80807 Muenchen                 Company details: ts.fujitsu.com/imprint.html

^ permalink raw reply	[flat|nested] 53+ messages in thread

* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-02-02  6:27                 ` Juergen Gross
@ 2011-02-02  8:49                   ` Juergen Gross
  2011-02-02 10:05                     ` Juergen Gross
  0 siblings, 1 reply; 53+ messages in thread
From: Juergen Gross @ 2011-02-02  8:49 UTC (permalink / raw)
  To: Andre Przywara
  Cc: George Dunlap, Keir Fraser, xen-devel, Ian Jackson, Stephan Diestelhorst

On 02/02/11 07:27, Juergen Gross wrote:
> On 02/01/11 17:32, Andre Przywara wrote:
>> Hi folks,
>>
>> I asked Stephan Diestelhorst for help and after I convinced him that
>> removing credit and making SEDF the default again is not an option he
>> worked together with me on that ;-) Many thanks for that!
>> We haven't come to a final solution but could gather some debug data.
>> I will simply dump some data here, maybe somebody has got a clue. We
>> will work further on this tomorrow.
>>
>> First I replaced the BUG_ON with some printks to get some insight:
>> (XEN) sdom->active_vcpu_count: 18
>> (XEN) sdom->weight: 256
>> (XEN) weight_left: 4096, weight_total: 4096
>> (XEN) credit_balance: 0, credit_xtra: 0, credit_cap: 0
>> (XEN) Xen BUG at sched_credit.c:591
>> (XEN) ----[ Xen-4.1.0-rc2-pre x86_64 debug=y Not tainted ]----
>>
>> So that one shows that the number of VCPUs is not up-to-date with the
>> computed weight sum, we have seen a difference of one or two VCPUs (in
>> this case here the weight has been computed from 16 VCPUs). Also it
>> shows that the assertion kicks in in the first iteration of the loop,
>> where weight_left and weight_total are still equal.
>>
>> So I additionally instrumented alloc_pdata and free_pdata, the
>> unprefixed lines come from a shell script mimicking the functionality of
>> cpupool-numa-split.
>> ------------
>> Removing CPUs from Pool 0
>> Creating new pool
>> Using config file "cpupool.test"
>> cpupool name: Pool-node6
>> scheduler: credit
>> number of cpus: 1
>> (XEN) adding CPU 36, now 1 CPUs
>> (XEN) removing CPU 36, remaining: 17
>> Populating new pool
>> (XEN) sdom->active_vcpu_count: 9
>> (XEN) sdom->weight: 256
>> (XEN) weight_left: 2048, weight_total: 2048
>> (XEN) credit_balance: 0, credit_xtra: 0, credit_cap: 0
>> (XEN) adding CPU 37, now 2 CPUs
>> (XEN) removing CPU 37, remaining: 16
>> (XEN) adding CPU 38, now 3 CPUs
>> (XEN) removing CPU 38, remaining: 15
>> (XEN) adding CPU 39, now 4 CPUs
>> (XEN) removing CPU 39, remaining: 14
>> (XEN) adding CPU 40, now 5 CPUs
>> (XEN) removing CPU 40, remaining: 13
>> (XEN) sdom->active_vcpu_count: 17
>> (XEN) sdom->weight: 256
>> (XEN) weight_left: 4096, weight_total: 4096
>> (XEN) credit_balance: 0, credit_xtra: 0, credit_cap: 0
>> (XEN) adding CPU 41, now 6 CPUs
>> (XEN) removing CPU 41, remaining: 12
>> ...
>> Two things startled me:
>> 1) There is quite some delay between the "Removing CPUs" message from the
>> script and the actual HV printk showing it's done, why is that not
>> synchronous?
>
> Removing cpus from Pool-0 requires no switching of the scheduler, so you
> see no calls of alloc/free_pdata here.
>
>  > Looking at the code it shows that
>> __csched_vcpu_acct_start() is eventually triggered by a timer, shouldn't
>> that be triggered synchronously by add/removal events?
>
> The vcpus are not moved explicitly, they are migrated by the normal
> scheduler mechanisms, same as for vcpu-pin.
>
>> 2) It clearly shows that each CPU gets added to the new pool _before_ it
>> gets removed from the old one (Pool-0), isn't that violating the "only
>> one pool per CPU" rule? Even if that is fine for a short period of time,
>> maybe the timer kicks in in this very moment resulting in violated
>> invariants?
>
> The sequence you are seeing seems to be okay. The alloc_pdata for the
> new pool
> is called before the free_pdata for the old pool.
>
> And the timer is not relevant, as only the idle vcpu should be running
> on the
> moving cpu and the accounting stuff is never called during idle.

Uhh, this could be wrong!
The normal ticker doesn't call accounting in idle and it is stopped during
cpu move. The master_ticker is handled wrong, perhaps. I'll check this and
prepare a patch if necessary.


Juergen

-- 
Juergen Gross                 Principal Developer Operating Systems
TSP ES&S SWE OS6                       Telephone: +49 (0) 89 3222 2967
Fujitsu Technology Solutions              e-mail: juergen.gross@ts.fujitsu.com
Domagkstr. 28                           Internet: ts.fujitsu.com
D-80807 Muenchen                 Company details: ts.fujitsu.com/imprint.html

^ permalink raw reply	[flat|nested] 53+ messages in thread

* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-02-02  8:49                   ` Juergen Gross
@ 2011-02-02 10:05                     ` Juergen Gross
  2011-02-02 10:59                       ` Andre Przywara
  0 siblings, 1 reply; 53+ messages in thread
From: Juergen Gross @ 2011-02-02 10:05 UTC (permalink / raw)
  To: Andre Przywara
  Cc: George Dunlap, Ian Jackson, xen-devel, Keir Fraser, Stephan Diestelhorst

[-- Attachment #1: Type: text/plain, Size: 4658 bytes --]

Hi Andre,

could you try the attached patch?
It should verify if your problems are due to the master ticker
kicking in at a time when the cpu is already gone from the cpupool.

I'm not sure if the patch is complete - Disabling the master ticker
in csched_tick_suspend might lead to problems with cstates. The
functionality is different, at least.

George, do you think this is correct?


Juergen

On 02/02/11 09:49, Juergen Gross wrote:
> On 02/02/11 07:27, Juergen Gross wrote:
>> On 02/01/11 17:32, Andre Przywara wrote:
>>> Hi folks,
>>>
>>> I asked Stephan Diestelhorst for help and after I convinced him that
>>> removing credit and making SEDF the default again is not an option he
>>> worked together with me on that ;-) Many thanks for that!
>>> We haven't come to a final solution but could gather some debug data.
>>> I will simply dump some data here, maybe somebody has got a clue. We
>>> will work further on this tomorrow.
>>>
>>> First I replaced the BUG_ON with some printks to get some insight:
>>> (XEN) sdom->active_vcpu_count: 18
>>> (XEN) sdom->weight: 256
>>> (XEN) weight_left: 4096, weight_total: 4096
>>> (XEN) credit_balance: 0, credit_xtra: 0, credit_cap: 0
>>> (XEN) Xen BUG at sched_credit.c:591
>>> (XEN) ----[ Xen-4.1.0-rc2-pre x86_64 debug=y Not tainted ]----
>>>
>>> So that one shows that the number of VCPUs is not up-to-date with the
>>> computed weight sum, we have seen a difference of one or two VCPUs (in
>>> this case here the weight has been computed from 16 VCPUs). Also it
>>> shows that the assertion kicks in in the first iteration of the loop,
>>> where weight_left and weight_total are still equal.
>>>
>>> So I additionally instrumented alloc_pdata and free_pdata, the
>>> unprefixed lines come from a shell script mimicking the functionality of
>>> cpupool-numa-split.
>>> ------------
>>> Removing CPUs from Pool 0
>>> Creating new pool
>>> Using config file "cpupool.test"
>>> cpupool name: Pool-node6
>>> scheduler: credit
>>> number of cpus: 1
>>> (XEN) adding CPU 36, now 1 CPUs
>>> (XEN) removing CPU 36, remaining: 17
>>> Populating new pool
>>> (XEN) sdom->active_vcpu_count: 9
>>> (XEN) sdom->weight: 256
>>> (XEN) weight_left: 2048, weight_total: 2048
>>> (XEN) credit_balance: 0, credit_xtra: 0, credit_cap: 0
>>> (XEN) adding CPU 37, now 2 CPUs
>>> (XEN) removing CPU 37, remaining: 16
>>> (XEN) adding CPU 38, now 3 CPUs
>>> (XEN) removing CPU 38, remaining: 15
>>> (XEN) adding CPU 39, now 4 CPUs
>>> (XEN) removing CPU 39, remaining: 14
>>> (XEN) adding CPU 40, now 5 CPUs
>>> (XEN) removing CPU 40, remaining: 13
>>> (XEN) sdom->active_vcpu_count: 17
>>> (XEN) sdom->weight: 256
>>> (XEN) weight_left: 4096, weight_total: 4096
>>> (XEN) credit_balance: 0, credit_xtra: 0, credit_cap: 0
>>> (XEN) adding CPU 41, now 6 CPUs
>>> (XEN) removing CPU 41, remaining: 12
>>> ...
>>> Two things startled me:
>>> 1) There is quite some delay between the "Removing CPUs" message from the
>>> script and the actual HV printk showing it's done, why is that not
>>> synchronous?
>>
>> Removing cpus from Pool-0 requires no switching of the scheduler, so you
>> see no calls of alloc/free_pdata here.
>>
>> > Looking at the code it shows that
>>> __csched_vcpu_acct_start() is eventually triggered by a timer, shouldn't
>>> that be triggered synchronously by add/removal events?
>>
>> The vcpus are not moved explicitly, they are migrated by the normal
>> scheduler mechanisms, same as for vcpu-pin.
>>
>>> 2) It clearly shows that each CPU gets added to the new pool _before_ it
>>> gets removed from the old one (Pool-0), isn't that violating the "only
>>> one pool per CPU" rule? Even if that is fine for a short period of time,
>>> maybe the timer kicks in in this very moment resulting in violated
>>> invariants?
>>
>> The sequence you are seeing seems to be okay. The alloc_pdata for the
>> new pool
>> is called before the free_pdata for the old pool.
>>
>> And the timer is not relevant, as only the idle vcpu should be running
>> on the
>> moving cpu and the accounting stuff is never called during idle.
>
> Uhh, this could be wrong!
> The normal ticker doesn't call accounting in idle and it is stopped during
> cpu move. The master_ticker is handled wrong, perhaps. I'll check this and
> prepare a patch if necessary.
>
>
> Juergen
>


-- 
Juergen Gross                 Principal Developer Operating Systems
TSP ES&S SWE OS6                       Telephone: +49 (0) 89 3222 2967
Fujitsu Technology Solutions              e-mail: juergen.gross@ts.fujitsu.com
Domagkstr. 28                           Internet: ts.fujitsu.com
D-80807 Muenchen                 Company details: ts.fujitsu.com/imprint.html

[-- Attachment #2: patch.txt --]
[-- Type: text/plain, Size: 2720 bytes --]

diff -r 76e1f7018b01 xen/common/sched_credit.c
--- a/xen/common/sched_credit.c Mon Jan 31 08:10:00 2011 +0100
+++ b/xen/common/sched_credit.c Wed Feb 02 10:59:44 2011 +0100
@@ -50,6 +50,8 @@
     (CSCHED_CREDITS_PER_MSEC * CSCHED_MSECS_PER_TSLICE)
 #define CSCHED_CREDITS_PER_ACCT     \
     (CSCHED_CREDITS_PER_MSEC * CSCHED_MSECS_PER_TICK * CSCHED_TICKS_PER_ACCT)
+#define CSCHED_ACCT_TSLICE          \
+    (MILLISECS(CSCHED_MSECS_PER_TICK) * CSCHED_TICKS_PER_ACCT)
 
 
 /*
@@ -320,6 +322,7 @@ csched_free_pdata(const struct scheduler
     struct csched_private *prv = CSCHED_PRIV(ops);
     struct csched_pcpu *spc = pcpu;
     unsigned long flags;
+    uint64_t now = NOW();
 
     if ( spc == NULL )
         return;
@@ -334,6 +337,8 @@ csched_free_pdata(const struct scheduler
     {
         prv->master = first_cpu(prv->cpus);
         migrate_timer(&prv->master_ticker, prv->master);
+        set_timer(&prv->master_ticker, now + CSCHED_ACCT_TSLICE
+            - now % CSCHED_ACCT_TSLICE);
     }
     kill_timer(&spc->ticker);
     if ( prv->ncpus == 0 )
@@ -367,8 +372,7 @@ csched_alloc_pdata(const struct schedule
     {
         prv->master = cpu;
         init_timer(&prv->master_ticker, csched_acct, prv, cpu);
-        set_timer(&prv->master_ticker, NOW() +
-                  MILLISECS(CSCHED_MSECS_PER_TICK) * CSCHED_TICKS_PER_ACCT);
+        set_timer(&prv->master_ticker, NOW() + CSCHED_ACCT_TSLICE);
     }
 
     init_timer(&spc->ticker, csched_tick, (void *)(unsigned long)cpu, cpu);
@@ -1138,8 +1142,7 @@ csched_acct(void* dummy)
     prv->runq_sort++;
 
 out:
-    set_timer( &prv->master_ticker, NOW() +
-            MILLISECS(CSCHED_MSECS_PER_TICK) * CSCHED_TICKS_PER_ACCT );
+    set_timer( &prv->master_ticker, NOW() + CSCHED_ACCT_TSLICE );
 }
 
 static void
@@ -1531,22 +1534,31 @@ csched_deinit(const struct scheduler *op
 
 static void csched_tick_suspend(const struct scheduler *ops, unsigned int cpu)
 {
+    struct csched_private *prv;
     struct csched_pcpu *spc;
 
+    prv = CSCHED_PRIV(ops);
     spc = CSCHED_PCPU(cpu);
 
     stop_timer(&spc->ticker);
+    if ( prv->master == cpu )
+        stop_timer(&prv->master_ticker);
 }
 
 static void csched_tick_resume(const struct scheduler *ops, unsigned int cpu)
 {
+    struct csched_private *prv;
     struct csched_pcpu *spc;
     uint64_t now = NOW();
 
+    prv = CSCHED_PRIV(ops);
     spc = CSCHED_PCPU(cpu);
 
     set_timer(&spc->ticker, now + MILLISECS(CSCHED_MSECS_PER_TICK)
             - now % MILLISECS(CSCHED_MSECS_PER_TICK) );
+    if ( prv->master == cpu )
+        set_timer(&prv->master_ticker, now + CSCHED_ACCT_TSLICE
+            - now % CSCHED_ACCT_TSLICE);
 }
 
 static struct csched_private _csched_priv;

[-- Attachment #3: Type: text/plain, Size: 138 bytes --]

_______________________________________________
Xen-devel mailing list
Xen-devel@lists.xensource.com
http://lists.xensource.com/xen-devel

^ permalink raw reply	[flat|nested] 53+ messages in thread

* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-02-02 10:05                     ` Juergen Gross
@ 2011-02-02 10:59                       ` Andre Przywara
  0 siblings, 0 replies; 53+ messages in thread
From: Andre Przywara @ 2011-02-02 10:59 UTC (permalink / raw)
  To: Juergen Gross
  Cc: George Dunlap, Ian Jackson, xen-devel, Keir Fraser, Diestelhorst,
	Stephan

Juergen Gross wrote:
> Hi Andre,
> 
> could you try the attached patch?
> It should verify if your problems are due to the master ticker
> kicking in at a time when the cpu is already gone from the cpupool.
That's what we also found yesterday. If the timer routine fires just
before the timer is stopped but is still _running_ afterwards, this
could lead to problems.

Anyway, the hypervisor still crashes, now at a different BUG_ON():

     /* Start off idling... */
     BUG_ON(!is_idle_vcpu(per_cpu(schedule_data, cpu).curr));
     cpu_set(cpu, prv->idlers);

The complete crash dump was this:

(XEN) Xen BUG at sched_credit.c:389
(XEN) ----[ Xen-4.1.0-rc2-pre  x86_64  debug=y  Not tainted ]----
(XEN) CPU:    3
(XEN) RIP:    e008:[<ffff82c480118020>] csched_alloc_pdata+0x146/0x197
(XEN) RFLAGS: 0000000000010093   CONTEXT: hypervisor
(XEN) rax: ffff830434322000   rbx: ffff830434492478   rcx: 0000000000000018
(XEN) rdx: ffff82c4802d3ec0   rsi: 0000000000000006   rdi: ffff83043445e100
(XEN) rbp: ffff83043456fce8   rsp: ffff83043456fca8   r8:  00000000deadbeef
(XEN) r9:  ffff830434492478   r10: ffff82c48021a1c0   r11: 0000000000000286
(XEN) r12: 0000000000000018   r13: ffff830a3c70c780   r14: ffff830434492460
(XEN) r15: 0000000000000018   cr0: 000000008005003b   cr4: 00000000000006f0
(XEN) cr3: 0000000805bac000   cr2: 00007fbbdaf71116
(XEN) ds: 0000   es: 0000   fs: 0000   gs: 0000   ss: e010   cs: e008
(XEN) Xen stack trace from rsp=ffff83043456fca8:
(XEN)    ffff83043456fcb8 ffff830a3c70c780 0000000000000286 0000000000000018
(XEN)    ffff830434492550 ffff830a3c70c690 0000000000000018 ffff82c4802b0880
(XEN)    ffff83043456fd58 ffff82c48011fbb3 ffff82f601020900 0000000000081048
(XEN)    ffff8300c7e42000 0000000000000000 0000800000000000 ffff82c480249000
(XEN)    0000000000000002 0000000000000018 ffff830434492550 0000000000305000
(XEN)    ffff82c4802550e4 ffff82c4802b0880 ffff83043456fd78 ffff82c48010188c
(XEN)    ffff83043456fe40 0000000000000018 ffff83043456fdb8 ffff82c480101b94
(XEN)    ffff83043456fdb8 ffff82c48018380a fffffffe00000286 ffff83043456ff18
(XEN)    0000000001669004 0000000000305000 ffff83043456fef8 ffff82c4801253c1
(XEN)    ffff83043456fde8 ffff8300c7ac0000 0000000000000000 0000000000000246
(XEN)    ffff83043456fe18 ffff82c480106c7f ffff830434577100 ffff8300c7ac0000
(XEN)    ffff83043456fe28 ffff82c480125de4 0000000000000003 ffff82c4802d3f80
(XEN)    ffff83043456fe78 0000000000000282 0000000800000012 0000000400000004
(XEN)    0000000000000000 ffffffff00000018 0000000000000000 00007f7e6a549a00
(XEN)    0000000001669000 0000000000000000 0000000000000000 ffffffffffffffff
(XEN)    0000000000000000 0000000000000080 000000000000002f 0000000001669004
(XEN)    000000000166b004 000000000166d004 00007fffa59ff250 0000000000000033
(XEN)    ffff83043456fed8 ffff8300c7ac0000 00007fffa59ff100 0000000000305000
(XEN)    0000000000000003 0000000000000003 00007cfbcba900c7 ffff82c480207ee8
(XEN)    ffffffff8100946a 0000000000000023 0000000000000003 0000000000000003
(XEN) Xen call trace:
(XEN)    [<ffff82c480118020>] csched_alloc_pdata+0x146/0x197
(XEN)    [<ffff82c48011fbb3>] schedule_cpu_switch+0x75/0x1cd
(XEN)    [<ffff82c48010188c>] cpupool_assign_cpu_locked+0x44/0x8b
(XEN)    [<ffff82c480101b94>] cpupool_do_sysctl+0x1fb/0x461
(XEN)    [<ffff82c4801253c1>] do_sysctl+0x921/0xa30
(XEN)    [<ffff82c480207ee8>] syscall_enter+0xc8/0x122
(XEN)
(XEN)
(XEN) ****************************************
(XEN) Panic on CPU 3:
(XEN) Xen BUG at sched_credit.c:389
(XEN) ****************************************
(XEN)
(XEN) Reboot in five seconds...


Regards,
Andre.

> 
> I'm not sure if the patch is complete - Disabling the master ticker
> in csched_tick_suspend might lead to problems with cstates. The
> functionality is different, at least.
> 
> George, do you think this is correct?
> 
> 
> Juergen
> 
> On 02/02/11 09:49, Juergen Gross wrote:
>> On 02/02/11 07:27, Juergen Gross wrote:
>>> On 02/01/11 17:32, Andre Przywara wrote:
>>>> Hi folks,
>>>>
>>>> I asked Stephan Diestelhorst for help and after I convinced him that
>>>> removing credit and making SEDF the default again is not an option he
>>>> worked together with me on that ;-) Many thanks for that!
>>>> We haven't come to a final solution but could gather some debug data.
>>>> I will simply dump some data here, maybe somebody has got a clue. We
>>>> will work further on this tomorrow.
>>>>
>>>> First I replaced the BUG_ON with some printks to get some insight:
>>>> (XEN) sdom->active_vcpu_count: 18
>>>> (XEN) sdom->weight: 256
>>>> (XEN) weight_left: 4096, weight_total: 4096
>>>> (XEN) credit_balance: 0, credit_xtra: 0, credit_cap: 0
>>>> (XEN) Xen BUG at sched_credit.c:591
>>>> (XEN) ----[ Xen-4.1.0-rc2-pre x86_64 debug=y Not tainted ]----
>>>>
>>>> So that one shows that the number of VCPUs is not up-to-date with the
>>>> computed weight sum, we have seen a difference of one or two VCPUs (in
>>>> this case here the weight has been computed from 16 VCPUs). Also it
>>>> shows that the assertion kicks in in the first iteration of the loop,
>>>> where weight_left and weight_total are still equal.
>>>>
>>>> So I additionally instrumented alloc_pdata and free_pdata, the
>>>> unprefixed lines come from a shell script mimicking the functionality of
>>>> cpupool-numa-split.
>>>> ------------
>>>> Removing CPUs from Pool 0
>>>> Creating new pool
>>>> Using config file "cpupool.test"
>>>> cpupool name: Pool-node6
>>>> scheduler: credit
>>>> number of cpus: 1
>>>> (XEN) adding CPU 36, now 1 CPUs
>>>> (XEN) removing CPU 36, remaining: 17
>>>> Populating new pool
>>>> (XEN) sdom->active_vcpu_count: 9
>>>> (XEN) sdom->weight: 256
>>>> (XEN) weight_left: 2048, weight_total: 2048
>>>> (XEN) credit_balance: 0, credit_xtra: 0, credit_cap: 0
>>>> (XEN) adding CPU 37, now 2 CPUs
>>>> (XEN) removing CPU 37, remaining: 16
>>>> (XEN) adding CPU 38, now 3 CPUs
>>>> (XEN) removing CPU 38, remaining: 15
>>>> (XEN) adding CPU 39, now 4 CPUs
>>>> (XEN) removing CPU 39, remaining: 14
>>>> (XEN) adding CPU 40, now 5 CPUs
>>>> (XEN) removing CPU 40, remaining: 13
>>>> (XEN) sdom->active_vcpu_count: 17
>>>> (XEN) sdom->weight: 256
>>>> (XEN) weight_left: 4096, weight_total: 4096
>>>> (XEN) credit_balance: 0, credit_xtra: 0, credit_cap: 0
>>>> (XEN) adding CPU 41, now 6 CPUs
>>>> (XEN) removing CPU 41, remaining: 12
>>>> ...
>>>> Two things startled me:
>>>> 1) There is quite some delay between the "Removing CPUs" message from the
>>>> script and the actual HV printk showing it's done, why is that not
>>>> synchronous?
>>> Removing cpus from Pool-0 requires no switching of the scheduler, so you
>>> see no calls of alloc/free_pdata here.
>>>
>>>> Looking at the code it shows that
>>>> __csched_vcpu_acct_start() is eventually triggered by a timer, shouldn't
>>>> that be triggered synchronously by add/removal events?
>>> The vcpus are not moved explicitly, they are migrated by the normal
>>> scheduler mechanisms, same as for vcpu-pin.
>>>
>>>> 2) It clearly shows that each CPU gets added to the new pool _before_ it
>>>> gets removed from the old one (Pool-0), isn't that violating the "only
>>>> one pool per CPU" rule? Even if that is fine for a short period of time,
>>>> maybe the timer kicks in in this very moment resulting in violated
>>>> invariants?
>>> The sequence you are seeing seems to be okay. The alloc_pdata for the
>>> new pool
>>> is called before the free_pdata for the old pool.
>>>
>>> And the timer is not relevant, as only the idle vcpu should be running
>>> on the
>>> moving cpu and the accounting stuff is never called during idle.
>> Uhh, this could be wrong!
>> The normal ticker doesn't call accounting in idle and it is stopped during
>> cpu move. The master_ticker is handled wrong, perhaps. I'll check this and
>> prepare a patch if necessary.
>>
>>
>> Juergen
>>
> 
> 

-- 
Andre Przywara
AMD-Operating System Research Center (OSRC), Dresden, Germany

^ permalink raw reply	[flat|nested] 53+ messages in thread

* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-02-01 16:32               ` Andre Przywara
  2011-02-02  6:27                 ` Juergen Gross
@ 2011-02-02 14:39                 ` Stephan Diestelhorst
  2011-02-02 15:14                   ` Juergen Gross
  1 sibling, 1 reply; 53+ messages in thread
From: Stephan Diestelhorst @ 2011-02-02 14:39 UTC (permalink / raw)
  To: Przywara, Andre
  Cc: George Dunlap, Keir Fraser, Juergen Gross, xen-devel, Ian Jackson

Hi folks,
  long time no see. :-)

On Tuesday 01 February 2011 17:32:25 Andre Przywara wrote:
> I asked Stephan Diestelhorst for help and after I convinced him that 
> removing credit and making SEDF the default again is not an option he 
> worked together with me on that ;-) Many thanks for that!
> We haven't come to a final solution but could gather some debug data.
> I will simply dump some data here, maybe somebody has got a clue. We 
> will work further on this tomorrow.

Andre and I have been looking through this further, in particular sanity
checking the invariant

prv->weight >= sdom->weight * sdom->active_vcpu_count

each time someone tweaks the active vcpu count. This happens only in
__csched_vcpu_acct_start and __csched_vcpu_acct_stop_locked. We managed
to observe the broken invariant when splitting cpupools.
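
Concretely, the check amounts to something like this (an illustrative
sketch only, not the exact hunk we used; the fields are the ones from
sched_credit.c, and it would be called wherever active_vcpu_count
changes):

  static void check_weight_invariant(const struct csched_private *prv,
                                     const struct csched_dom *sdom)
  {
      /* Print instead of BUG()ing so the box stays alive for more data. */
      if ( prv->weight < sdom->weight * sdom->active_vcpu_count )
          printk("credit invariant violated: prv->weight=%u "
                 "sdom->weight=%u active_vcpu_count=%u\n",
                 prv->weight, (unsigned int)sdom->weight,
                 (unsigned int)sdom->active_vcpu_count);
  }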

We have the following theory of what happens:
* some vcpus of a particular domain are currently in the process of
  being moved to the new pool

* some are still left on the old pool (vcpus_old) and some are already
  in the new pool (vcpus_new)

* we now have vcpus_old->sdom = vcpus_new->sdom and following from this
  * vcpus_old->sdom->weight = vcpus_new->sdom->weight
  * vcpus_old->sdom->active_vcpu_count = vcpus_new->sdom->active_vcpu_count

* active_vcpu_count thus does not represent the separation of the
  actual vcpus (may be the sum, only the old or new ones, does not
  matter)

* however, sched_old != sched_new, and thus 
  * sched_old->prv != sched_new->prv
  * sched_old->prv->weight != sched_new->prv->weight

* the prv->weight field hence sees the incremental move of VCPUs
  (through modifications in *acct_start and *acct_stop_locked)

* if at any point in this half-way migration, the scheduler wants to
  csched_acct, it erroneously checks the wrong active_vcpu_count

Workarounds / fixes (none tried):
* disable scheduler accounting while half-way migrating a domain
  (dom->pool_migrating flag and then checking in csched_acct; sketched below)
* temporarily split the sdom structures while migrating to account for
  transient split of vcpus
* synchronously disable all vcpus, migrate and then re-enable
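
The first of these could be as simple as the following untested sketch;
"pool_has_migrating_domain" is a hypothetical helper that would test the
proposed flag for every domain in the pool:

  /* at the top of csched_acct(): skip the accounting pass while a
   * domain is half-way between pools, just re-arm the master ticker */
  if ( pool_has_migrating_domain(prv) )
  {
      set_timer(&prv->master_ticker, NOW() +
                MILLISECS(CSCHED_MSECS_PER_TICK) * CSCHED_TICKS_PER_ACCT);
      return;
  }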

Caveats:
* prv->lock does not guarantee mutual exclusion between (same)
  schedulers of different pools

<rant>
The general locking policy vs the comment situation is a nightmare.
I know that we have some advanced data-structure folks here, but
intuitively reasoning about when specific things are atomic and
mutually excluded is a pain in the scheduler / cpupool code, see the
issue with the separate prv->locks above.

E.g. cpupool_unassign_cpu and cpupool_unassign_cpu_helper interplay:
* cpupool_unassign_cpu unlocks cpupool_lock
* sets up the continuation calling cpupool_unassign_cpu_helper
* cpupool_unassign_cpu_helper locks cpupool_lock
* while intuitively, one would think that both should see a consistent
  snapshot and hence freeing the lock in the middle is a bad idea
* also communicating continuation-local state through global variables
  mandates that only a single global continuation can be pending

* reading cpu outside of the lock protection in
  cpupool_unassign_cpu_helper also smells
</rant>

Despite the rant, it is amazing to see the ability to move running
things around through this remote continuation trick! In my (ancient)
balancer experiments I added hypervisor-threads just for side-
stepping this issue..

Stephan
-- 
Stephan Diestelhorst, AMD Operating System Research Center
stephan.diestelhorst@amd.com
Tel. +49 (0)351 448 356 719

Advanced Micro Devices GmbH
Einsteinring 24
85609 Aschheim
Germany
Geschaeftsfuehrer: Alberto Bozzo u. Andrew Bowd
Sitz: Dornach, Gemeinde Aschheim, Landkreis Muenchen
Registergericht Muenchen, HRB Nr. 43632, WEEE-Reg-Nr: DE 12919551

^ permalink raw reply	[flat|nested] 53+ messages in thread

* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-02-02 14:39                 ` Stephan Diestelhorst
@ 2011-02-02 15:14                   ` Juergen Gross
  2011-02-02 16:01                     ` Stephan Diestelhorst
  0 siblings, 1 reply; 53+ messages in thread
From: Juergen Gross @ 2011-02-02 15:14 UTC (permalink / raw)
  To: Stephan Diestelhorst
  Cc: George Dunlap, Przywara, Andre, Keir Fraser, xen-devel, Ian Jackson

On 02/02/11 15:39, Stephan Diestelhorst wrote:
> Hi folks,
>    long time no see. :-)
>
> On Tuesday 01 February 2011 17:32:25 Andre Przywara wrote:
>> I asked Stephan Diestelhorst for help and after I convinced him that
>> removing credit and making SEDF the default again is not an option he
>> worked together with me on that ;-) Many thanks for that!
>> We haven't come to a final solution but could gather some debug data.
>> I will simply dump some data here, maybe somebody has got a clue. We
>> will work further on this tomorrow.
>
> Andre and I have been looking through this further, in particular sanity
> checking the invariant
>
> prv->weight >= sdom->weight * sdom->active_vcpu_count
>
> each time someone tweaks the active vcpu count. This happens only in
> __csched_vcpu_acct_start and __csched_vcpu_acct_stop_locked. We managed
> to observe the broken invariant when splitting cpupools.
>
> We have the following theory of what happens:
> * some vcpus of a particular domain are currently in the process of
>    being moved to the new pool

The only _vcpus_ to be moved between pools are the idle vcpus. And those
never contribute to accounting in credit scheduler.

We are moving _pcpus_ only (well, moving a domain between pools actually
moves vcpus as well, but then the domain is paused).
On the pcpu to be moved the idle vcpu should be running. Obviously you
have found a scenario where this isn't true. I have no idea how this could
happen, as vcpus other than the idle vcpu are taken into account for scheduling
only if the pcpu is valid in the cpupool. And the pcpu is set valid after the
BUG_ON you have triggered in your tests.

>
> * some are still left on the old pool (vcpus_old) and some are already
>    in the new pool (vcpus_new)
>
> * we now have vcpus_old->sdom = vcpus_new->sdom and following from this
>    * vcpus_old->sdom->weight = vcpus_new->sdom->weight
>    * vcpus_old->sdom->active_vcpu_count = vcpus_new->sdom->active_vcpu_count
>
> * active_vcpu_count thus does not represent the separation of the
>    actual vcpus (may be the sum, only the old or new ones, does not
>    matter)
>
> * however, sched_old != sched_new, and thus
>    * sched_old->prv != sched_new->prv
>    * sched_old->prv->weight != sched_new->prv->weight
>
> * the prv->weight field hence sees the incremental move of VCPUs
>    (through modifications in *acct_start and *acct_stop_locked)
>
> * if at any point in this half-way migration, the scheduler wants to
>    csched_acct, it erroneously checks the wrong active_vcpu_count
>
> Workarounds / fixes (none tried):
> * disable scheduler accounting while half-way migrating a domain
>    (dom->pool_migrating flag and then checking in csched_acct)
> * temporarily split the sdom structures while migrating to account for
>    transient split of vcpus
> * synchronously disable all vcpus, migrate and then re-enable
>
> Caveats:
> * prv->lock does not guarantee mutual exclusion between (same)
>    schedulers of different pools
>
> <rant>
> The general locking policy vs the comment situation is a nightmare.
> I know that we have some advanced data-structure folks here, but
> intuitively reasoning about when specific things are atomic and
> mutually excluded is a pain in the scheduler / cpupool code, see the
> issue with the separate prv->locks above.
>
> E.g. cpupool_unassign_cpu and cpupool_unassign_cpu_helper interplay:
> * cpupool_unassign_cpu unlocks cpupool_lock
> * sets up the continuation calling cpupool_unassign_cpu_helper
> * cpupool_unassign_cpu_helper locks cpupool_lock
> * while intuitively, one would think that both should see a consistent
>    snapshot and hence freeing the lock in the middle is a bad idea
> * also communicating continuation-local state through global variables
>    mandates that only a single global continuation can be pending
>
> * reading cpu outside of the lock protection in
>    cpupool_unassign_cpu_helper also smells
> </rant>
>
> Despite the rant, it is amazing to see the ability to move running
> things around through this remote continuation trick! In my (ancient)
> balancer experiments I added hypervisor-threads just for side-
> stepping this issue..

I think the easiest way to solve the problem would be to move the cpu to the
new pool in a tasklet. This is possible now, because tasklets are always
executed in the idle vcpus.
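
Roughly like this (an untested sketch just to illustrate the idea; the
real code would of course live in cpupool.c):

  static struct tasklet cpupool_move_tasklet;

  static void cpupool_move_fn(unsigned long cpu)
  {
      /* Runs in the idle vcpu of the cpu it was scheduled on. */
      ASSERT(is_idle_vcpu(current));
      /* ... do the actual assign/unassign work for 'cpu' here ... */
  }

  /* caller side: */
  tasklet_init(&cpupool_move_tasklet, cpupool_move_fn, cpu);
  tasklet_schedule_on_cpu(&cpupool_move_tasklet, cpu);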

OTOH I'd like to understand what is wrong with my current approach...


Juergen

-- 
Juergen Gross                 Principal Developer Operating Systems
TSP ES&S SWE OS6                       Telephone: +49 (0) 89 3222 2967
Fujitsu Technology Solutions              e-mail: juergen.gross@ts.fujitsu.com
Domagkstr. 28                           Internet: ts.fujitsu.com
D-80807 Muenchen                 Company details: ts.fujitsu.com/imprint.html

^ permalink raw reply	[flat|nested] 53+ messages in thread

* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-02-02 15:14                   ` Juergen Gross
@ 2011-02-02 16:01                     ` Stephan Diestelhorst
  2011-02-03  5:57                       ` Juergen Gross
  0 siblings, 1 reply; 53+ messages in thread
From: Stephan Diestelhorst @ 2011-02-02 16:01 UTC (permalink / raw)
  To: Juergen Gross
  Cc: George Dunlap, Przywara, Andre, Keir Fraser, xen-devel, Ian Jackson

On Wednesday 02 February 2011 16:14:25 Juergen Gross wrote:
> On 02/02/11 15:39, Stephan Diestelhorst wrote:
> > We have the following theory of what happens:
> > * some vcpus of a particular domain are currently in the process of
> >    being moved to the new pool
> 
> The only _vcpus_ to be moved between pools are the idle vcpus. And those
> never contribute to accounting in credit scheduler.
> 
> We are moving _pcpus_ only (well, moving a domain between pools actually
> moves vcpus as well, but then the domain is paused).

How do you ensure that the domain is paused and stays that way? Pausing
the domain was what I had in mind, too...

> > Despite the rant, it is amazing to see the ability to move running
> > things around through this remote continuation trick! In my (ancient)
> > balancer experiments I added hypervisor-threads just for side-
> > stepping this issue..
> 
> I think the easiest way to solve the problem would be to move the cpu to the
> new pool in a tasklet. This is possible now, because tasklets are always
> executed in the idle vcpus.

Yep. That was exactly what I built. At the time stuff like that did
not exist (2005).

> OTOH I'd like to understand what is wrong with my current approach...

Nothing, in fact I like it. In my rant I complained about the fact
that splitting the critical section across this continuation looks
scary, basically causing some generic red lights to turn on :-) And
making reasoning about the correctness a little complicated, but that
may well be a local issue ;-)

Stephan

-- 
Stephan Diestelhorst, AMD Operating System Research Center
stephan.diestelhorst@amd.com
Tel. +49 (0)351 448 356 719

Advanced Micro Devices GmbH
Einsteinring 24
85609 Aschheim
Germany

Geschaeftsfuehrer: Alberto Bozzo u. Andrew Bowd; 
Sitz: Dornach, Gemeinde Aschheim, Landkreis Muenchen
Registergericht Muenchen, HRB Nr. 43632, WEEE-Reg-Nr: DE 12919551

^ permalink raw reply	[flat|nested] 53+ messages in thread

* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-02-02 16:01                     ` Stephan Diestelhorst
@ 2011-02-03  5:57                       ` Juergen Gross
  2011-02-03  9:18                         ` Juergen Gross
  0 siblings, 1 reply; 53+ messages in thread
From: Juergen Gross @ 2011-02-03  5:57 UTC (permalink / raw)
  To: Stephan Diestelhorst
  Cc: George Dunlap, Przywara, Andre, xen-devel, Keir Fraser, Ian Jackson

On 02/02/11 17:01, Stephan Diestelhorst wrote:
> On Wednesday 02 February 2011 16:14:25 Juergen Gross wrote:
>> On 02/02/11 15:39, Stephan Diestelhorst wrote:
>>> We have the following theory of what happens:
>>> * some vcpus of a particular domain are currently in the process of
>>>     being moved to the new pool
>>
>> The only _vcpus_ to be moved between pools are the idle vcpus. And those
>> never contribute to accounting in credit scheduler.
>>
>> We are moving _pcpus_ only (well, moving a domain between pools actually
>> moves vcpus as well, but then the domain is paused).
>
> How do you ensure that the domain is paused and stays that way? Pausing
> the domain was what I had in mind, too...

Look at sched_move_domain() in schedule.c: I'm calling domain_pause()
before moving the vcpus and domain_unpause() after that.
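
In outline (heavily simplified, see schedule.c for the real thing):

  int sched_move_domain(struct domain *d, struct cpupool *c)
  {
      domain_pause(d);     /* no vcpu of d can be scheduled from here on */

      /* detach the vcpus from the old pool's scheduler, set up per-vcpu
       * data for the new pool's scheduler, fix up v->processor, ... */

      d->cpupool = c;
      domain_unpause(d);   /* vcpus come back up on the new pool's cpus */
      return 0;
  }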

>
>>> Despite the rant, it is amazing to see the ability to move running
>>> things around through this remote continuation trick! In my (ancient)
>>> balancer experiments I added hypervisor-threads just for side-
>>> stepping this issue..
>>
>> I think the easiest way to solve the problem would be to move the cpu to the
>> new pool in a tasklet. This is possible now, because tasklets are always
>> executed in the idle vcpus.
>
> Yep. That was exactly what I built. At the time stuff like that did
> not exist (2005).
>
>> OTOH I'd like to understand what is wrong with my current approach...
>
> Nothing, in fact I like it. In my rant I complained about the fact
> that splitting the critical section across this continuation looks
> scary, basically causing some generic red lights to turn on :-) And
> making reasoning about the correctness a little complicated, but that
> may well be a local issue ;-)

Perhaps you can help solve the mystery:

Could you replace the BUG_ON in sched_credit.c:389 with something like this:

if (!is_idle_vcpu(per_cpu(schedule_data, cpu).curr)) {
   extern void dump_runq(unsigned char key);
   struct vcpu *vc = per_cpu(schedule_data, cpu).curr;

   printk("+++ (%d.%d) instead idle vcpu on cpu %d\n", vc->domain->domain_id,
           vc->vcpu_id, cpu);
   dump_runq('q');
   BUG();
}


Juergen

-- 
Juergen Gross                 Principal Developer Operating Systems
TSP ES&S SWE OS6                       Telephone: +49 (0) 89 3222 2967
Fujitsu Technology Solutions              e-mail: juergen.gross@ts.fujitsu.com
Domagkstr. 28                           Internet: ts.fujitsu.com
D-80807 Muenchen                 Company details: ts.fujitsu.com/imprint.html

^ permalink raw reply	[flat|nested] 53+ messages in thread

* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-02-03  5:57                       ` Juergen Gross
@ 2011-02-03  9:18                         ` Juergen Gross
  2011-02-04 14:09                           ` Andre Przywara
  0 siblings, 1 reply; 53+ messages in thread
From: Juergen Gross @ 2011-02-03  9:18 UTC (permalink / raw)
  To: Stephan Diestelhorst
  Cc: George Dunlap, Przywara, Andre, xen-devel, Keir Fraser, Ian Jackson

[-- Attachment #1: Type: text/plain, Size: 968 bytes --]

Andre, Stephan,

could you give the attached patch a try?
It moves the cpu assigning/unassigning into a tasklet always executed on the
cpu to be moved. This should avoid critical races.

Regarding Stephan's rant:
You should be aware that the main critical sections are only in the tasklets.
The locking in the main routines is needed only to avoid the cpupool being
destroyed in between.

I'm not sure whether the master_ticker patch is still needed. It seems to
break something, as my machine hung up after several 100 cpu moves (without
the new patch). I'm still investigating this problem.


Juergen

-- 
Juergen Gross                 Principal Developer Operating Systems
TSP ES&S SWE OS6                       Telephone: +49 (0) 89 3222 2967
Fujitsu Technology Solutions              e-mail: juergen.gross@ts.fujitsu.com
Domagkstr. 28                           Internet: ts.fujitsu.com
D-80807 Muenchen                 Company details: ts.fujitsu.com/imprint.html

[-- Attachment #2: cpupool-idle.patch --]
[-- Type: text/x-patch, Size: 4879 bytes --]

diff -r 4bdb78db22b6 xen/common/cpupool.c
--- a/xen/common/cpupool.c	Wed Feb 02 17:06:36 2011 +0000
+++ b/xen/common/cpupool.c	Thu Feb 03 10:09:53 2011 +0100
@@ -217,14 +217,30 @@ static int cpupool_assign_cpu_locked(str
     return 0;
 }
 
+static long cpupool_assign_cpu_helper(void *info)
+{
+    int cpu = cpupool_moving_cpu;
+    long ret;
+
+    cpupool_dprintk("cpupool_assign_cpu(pool=%d,cpu=%d) ret %ld\n",
+                    cpupool_cpu_moving->cpupool_id, cpu, ret);
+    BUG_ON(!is_idle_vcpu(current));
+    BUG_ON(cpu != smp_processor_id());
+    spin_lock(&cpupool_lock);
+    ret = cpupool_assign_cpu_locked(cpupool_cpu_moving, cpu);
+    spin_unlock(&cpupool_lock);
+    return ret;
+}
+
 static long cpupool_unassign_cpu_helper(void *info)
 {
     int cpu = cpupool_moving_cpu;
     long ret;
 
     cpupool_dprintk("cpupool_unassign_cpu(pool=%d,cpu=%d) ret %ld\n",
-                    cpupool_id, cpu, ret);
-
+                    cpupool_cpu_moving->cpupool_id, cpu, ret);
+    BUG_ON(!is_idle_vcpu(current));
+    BUG_ON(cpu != smp_processor_id());
     spin_lock(&cpupool_lock);
     ret = cpu_disable_scheduler(cpu);
     cpu_set(cpu, cpupool_free_cpus);
@@ -241,9 +257,51 @@ static long cpupool_unassign_cpu_helper(
 }
 
 /*
+ * assign a specific cpu to a cpupool
+ * we must be sure to run on the cpu to be assigned in idle! to achieve this
+ * the main functionality is performed via continue_hypercall_on_cpu on the
+ * specific cpu.
+ * possible failures:
+ * - cpu not free
+ * - cpu just being unplugged
+ */
+int cpupool_assign_cpu(struct cpupool *c, unsigned int cpu)
+{
+    int ret;
+
+    cpupool_dprintk("cpupool_assign_cpu(pool=%d,cpu=%d)\n",
+                    c->cpupool_id, cpu);
+
+    spin_lock(&cpupool_lock);
+    ret = -EBUSY;
+    if ( (cpupool_moving_cpu != -1) && (cpu != cpupool_moving_cpu) )
+        goto out;
+    if ( cpu_isset(cpu, cpupool_locked_cpus) )
+        goto out;
+
+    ret = 0;
+    if ( !cpu_isset(cpu, cpupool_free_cpus) && (cpu != cpupool_moving_cpu) )
+        goto out;
+
+    cpupool_moving_cpu = cpu;
+    atomic_inc(&c->refcnt);
+    cpupool_cpu_moving = c;
+    cpu_clear(cpu, c->cpu_valid);
+    spin_unlock(&cpupool_lock);
+
+    return continue_hypercall_on_cpu(cpu, cpupool_assign_cpu_helper, c);
+
+out:
+    spin_unlock(&cpupool_lock);
+    cpupool_dprintk("cpupool_assign_cpu(pool=%d,cpu=%d) ret %d\n",
+                    cpupool_id, cpu, ret);
+    return ret;
+}
+
+/*
  * unassign a specific cpu from a cpupool
- * we must be sure not to run on the cpu to be unassigned! to achieve this
- * the main functionality is performed via continue_hypercall_on_cpu on a
+ * we must be sure to run on the cpu to be unassigned in idle! to achieve this
+ * the main functionality is performed via continue_hypercall_on_cpu on the
  * specific cpu.
  * if the cpu to be removed is the last one of the cpupool no active domain
  * must be bound to the cpupool. dying domains are moved to cpupool0 as they
@@ -254,7 +312,6 @@ static long cpupool_unassign_cpu_helper(
  */
 int cpupool_unassign_cpu(struct cpupool *c, unsigned int cpu)
 {
-    int work_cpu;
     int ret;
     struct domain *d;
 
@@ -302,14 +359,7 @@ int cpupool_unassign_cpu(struct cpupool 
     cpu_clear(cpu, c->cpu_valid);
     spin_unlock(&cpupool_lock);
 
-    work_cpu = smp_processor_id();
-    if ( work_cpu == cpu )
-    {
-        work_cpu = first_cpu(cpupool0->cpu_valid);
-        if ( work_cpu == cpu )
-            work_cpu = next_cpu(cpu, cpupool0->cpu_valid);
-    }
-    return continue_hypercall_on_cpu(work_cpu, cpupool_unassign_cpu_helper, c);
+    return continue_hypercall_on_cpu(cpu, cpupool_unassign_cpu_helper, c);
 
 out:
     spin_unlock(&cpupool_lock);
@@ -455,27 +505,15 @@ int cpupool_do_sysctl(struct xen_sysctl_
     {
         unsigned cpu;
 
+        c = __cpupool_get_by_id(op->cpupool_id, 0);
+        ret = -ENOENT;
+        if ( c == NULL )
+            break;
         cpu = op->cpu;
-        cpupool_dprintk("cpupool_assign_cpu(pool=%d,cpu=%d)\n",
-                        op->cpupool_id, cpu);
-        spin_lock(&cpupool_lock);
         if ( cpu == XEN_SYSCTL_CPUPOOL_PAR_ANY )
             cpu = first_cpu(cpupool_free_cpus);
-        ret = -EINVAL;
-        if ( cpu >= NR_CPUS )
-            goto addcpu_out;
-        ret = -EBUSY;
-        if ( !cpu_isset(cpu, cpupool_free_cpus) )
-            goto addcpu_out;
-        c = cpupool_find_by_id(op->cpupool_id, 0);
-        ret = -ENOENT;
-        if ( c == NULL )
-            goto addcpu_out;
-        ret = cpupool_assign_cpu_locked(c, cpu);
-    addcpu_out:
-        spin_unlock(&cpupool_lock);
-        cpupool_dprintk("cpupool_assign_cpu(pool=%d,cpu=%d) ret %d\n",
-                        op->cpupool_id, cpu, ret);
+        ret = (cpu < NR_CPUS) ? cpupool_assign_cpu(c, cpu) : -EINVAL;
+        cpupool_put(c);
     }
     break;
 

[-- Attachment #3: Type: text/plain, Size: 138 bytes --]

_______________________________________________
Xen-devel mailing list
Xen-devel@lists.xensource.com
http://lists.xensource.com/xen-devel

^ permalink raw reply	[flat|nested] 53+ messages in thread

* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-02-03  9:18                         ` Juergen Gross
@ 2011-02-04 14:09                           ` Andre Przywara
  2011-02-07 12:38                             ` Andre Przywara
  0 siblings, 1 reply; 53+ messages in thread
From: Andre Przywara @ 2011-02-04 14:09 UTC (permalink / raw)
  To: Juergen Gross
  Cc: George Dunlap, Ian Jackson, xen-devel, Keir Fraser, Diestelhorst,
	Stephan

Juergen Gross wrote:
> Andre, Stephan,
> 
> could you give the attached patch a try?
> It moves the cpu assigning/unassigning into a tasklet always executed on the
> cpu to be moved. This should avoid critical races.

Done. I checked it twice, but sadly it does not fix the issue. It still 
BUGs:
(XEN) Xen BUG at sched_credit.c:990
(XEN) ----[ Xen-4.1.0-rc3-pre  x86_64  debug=y  Not tainted ]----
(XEN) CPU:    0
(XEN) RIP:    e008:[<ffff82c480118208>] csched_acct+0x11f/0x419
(XEN) RFLAGS: 0000000000010006   CONTEXT: hypervisor
(XEN) rax: 0000000000000010   rbx: 0000000000000f00   rcx: 0000000000000100
(XEN) rdx: 0000000000001000   rsi: ffff830437ffa600   rdi: 0000000000000010
(XEN) rbp: ffff82c480297e10   rsp: ffff82c480297d80   r8:  0000000000000100
(XEN) r9:  0000000000000006   r10: ffff82c4802d4100   r11: 0000017322fea49a
(XEN) r12: ffff830437ffa5e0   r13: ffff82c4801180e9   r14: ffff83043399f018
(XEN) r15: ffff830434321ec0   cr0: 000000008005003b   cr4: 00000000000006f0
(XEN) cr3: 00000000c7c9c000   cr2: 0000000001ec8048
(XEN) ds: 002b   es: 002b   fs: 0000   gs: 0000   ss: e010   cs: e008
(XEN) Xen stack trace from rsp=ffff82c480297d80:
(XEN)    ffff82c480297f18 fffffed4c7cd6000 ffff830000000eff ffff830437ffa5e0
(XEN)    ffff830437ffa5e8 ffff82c480297df8 ffff830437ffa5e0 0000000000000282
(XEN)    ffff830437ffa5e8 00001c200000000f 00000f0000000f00 0000000000000000
(XEN)    ffff82c400000000 ffff82c4802d3f80 ffff830437ffa5e0 ffff82c4801180e9
(XEN)    ffff83043399f018 ffff83043399f010 ffff82c480297e40 ffff82c480126044
(XEN)    0000000000000002 ffff830437ffa600 ffff82c4802d3f80 00000173010849b7
(XEN)    ffff82c480297e90 ffff82c480126369 ffff82c48024aea0 ffff82c4802d3f80
(XEN)    ffff83043399f010 0000000000000000 0000000000000000 ffff82c4802b0880
(XEN)    ffff82c480297f18 ffffffffffffffff ffff82c480297ed0 ffff82c480123437
(XEN)    ffff8300c7e1e0f8 ffff82c480297f18 ffff82c48024aea0 ffff82c480297f18
(XEN)    0000017301008665 ffff82c4802d3ec0 ffff82c480297ee0 ffff82c4801234b2
(XEN)    ffff82c480297f10 ffff82c4801564f5 0000000000000000 ffff8300c7cd6000
(XEN)    0000000000000000 ffff8300c7e1e000 ffff82c480297d48 0000000000000000
(XEN)    0000000000000000 0000000000000000 ffffffff81a69060 ffff8817a8553f10
(XEN)    ffff8817a8553fd8 0000000000000246 ffff8817a8553e80 ffff880000000001
(XEN)    0000000000000000 0000000000000000 ffffffff810093aa 000000000000e030
(XEN)    00000000deadbeef 00000000deadbeef 0000010000000000 ffffffff810093aa
(XEN)    000000000000e033 0000000000000246 ffff8817a8553ef8 000000000000e02b
(XEN)    0000000000000000 0000000000000000 0000000000000000 0000000000000000
(XEN)    0000000000000000 ffff8300c7cd6000 0000000000000000 0000000000000000
(XEN) Xen call trace:
(XEN)    [<ffff82c480118208>] csched_acct+0x11f/0x419
(XEN)    [<ffff82c480126044>] execute_timer+0x4e/0x6c
(XEN)    [<ffff82c480126369>] timer_softirq_action+0xf2/0x245
(XEN)    [<ffff82c480123437>] __do_softirq+0x88/0x99
(XEN)    [<ffff82c4801234b2>] do_softirq+0x6a/0x7a
(XEN)    [<ffff82c4801564f5>] idle_loop+0x6a/0x6f
(XEN)
(XEN) ****************************************
(XEN) Panic on CPU 0:
(XEN) Xen BUG at sched_credit.c:990
(XEN) ****************************************
(XEN)
(XEN) Reboot in five seconds...


Stephan had created more printk debug patches; we will summarize the
results soon.

Regards,
Andre.


> 
> Regarding Stephan's rant:
> You should be aware that the main critical sections are only in the tasklets.
> The locking in the main routines is needed only to avoid the cpupool being
> destroyed in between.
> 
> I'm not sure whether the master_ticker patch is still needed. It seems to
> break something, as my machine hung up after several 100 cpu moves (without
> the new patch). I'm still investigating this problem.
> 
> 
> Juergen
> 
> 


-- 
Andre Przywara
AMD-Operating System Research Center (OSRC), Dresden, Germany

^ permalink raw reply	[flat|nested] 53+ messages in thread

* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-02-04 14:09                           ` Andre Przywara
@ 2011-02-07 12:38                             ` Andre Przywara
  2011-02-07 13:32                               ` Juergen Gross
  0 siblings, 1 reply; 53+ messages in thread
From: Andre Przywara @ 2011-02-07 12:38 UTC (permalink / raw)
  To: Juergen Gross; +Cc: George Dunlap, xen-devel, Diestelhorst, Stephan

[-- Attachment #1: Type: text/plain, Size: 985 bytes --]

Juergen,

as promised, some more debug data. This is from c/s 22858 with Stephan's
debug patch (attached).
We get the following dump when the hypervisor crashes; note that the
first lock address is different from the second and subsequent ones:

(XEN) sched_credit.c, 572: prv: ffff831836df2970 &prv->lock: ffff831836df2970 prv->weight: 256 sdom->active_vcpu_count: 3 sdom->weight: 256
(XEN) sched_credit.c, 572: prv: ffff830437ffa5e0 &prv->lock: ffff830437ffa5e0 prv->weight: 768 sdom->active_vcpu_count: 4 sdom->weight: 256
(XEN) sched_credit.c, 572: prv: ffff830437ffa5e0 &prv->lock: ffff830437ffa5e0 prv->weight: 1024 sdom->active_vcpu_count: 5 sdom->weight: 256
(XEN) sched_credit.c, 572: prv: ffff830437ffa5e0 &prv->lock: ffff830437ffa5e0 prv->weight: 1280 sdom->active_vcpu_count: 6 sdom->weight: 256

....

Hope that gives you an idea. I attach the whole log for your reference.
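
(For reference, the printk behind these lines is roughly the following;
reconstructed from the output here, the attached patch is authoritative:)

  printk("%s, %d: prv: %p &prv->lock: %p prv->weight: %u "
         "sdom->active_vcpu_count: %u sdom->weight: %u\n",
         __FILE__, __LINE__, prv, &prv->lock, prv->weight,
         (unsigned int)sdom->active_vcpu_count, (unsigned int)sdom->weight);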

Regards,
Andre

-- 
Andre Przywara
AMD-Operating System Research Center (OSRC), Dresden, Germany

[-- Attachment #2: hv_cpupools_crash.txt --]
[-- Type: text/plain, Size: 12532 bytes --]

Welcome to Linux 2.6.32.27-pvops (hvc0)

dosorca login: root
Password: 
Linux 2.6.32.27-pvops.
Last login: Fri Jan 28 00:15:40 +0100 2011 on hvc0.
You have mail.
root@dosorca:~# sync
root@dosorca:~# cd /data/images/
root@dosorca:/data/images# sh numasplit.sh 
Removing CPUs from Pool 0
Rewriting config file
Creating new pool
Using config file "cpupool.test"
cpupool name:   Pool-node1
scheduler:      credit
number of cpus: 1
Populating new pool
Removing CPUs from Pool 0
Rewriting config file
Creating new pool
Using config file "cpupool.test"
cpupool name:   Pool-node2
scheduler:      credit
number of cpus: 1
Populating new pool
Removing CPUs from Pool 0
Rewriting config file
Creating new pool
Using config file "cpupool.test"
cpupool name:   Pool-node3
scheduler:      credit
number of cpus: 1
Populating new pool
Removing CPUs from Pool 0
Rewriting config file
Creating new pool
Using config file "cpupool.test"
cpupool name:   Pool-node4
scheduler:      credit
number of cpus: 1
Populating new pool
Removing CPUs from Pool 0
Rewriting config file
Creating new pool
Using config file "cpupool.test"
cpupool name:   Pool-node5
scheduler:      credit
number of cpus: 1
Populating new pool
Removing CPUs from Pool 0
Rewriting config file
Creating new pool
Using config file "cpupool.test"
cpupool name:   Pool-node6
scheduler:      credit
number of cpus: 1
Populating new pool
Removing CPUs from Pool 0
Rewriting config file
Creating new pool
Using config file "cpupool.test"
cpupool name:   Pool-node7
scheduler:      credit
number of cpus: 1
Populating new pool
root@dosorca:/data/images# sh numasplit.sh revert
Destroying Pool 1
adding freed CPUs to pool 0
Destroying Pool 2
adding freed CPUs to pool 0
Destroying Pool 3
adding freed CPUs to pool 0
Destroying Pool 4
adding freed CPUs to pool 0
Destroying Pool 5
adding freed CPUs to pool 0
Destroying Pool 6
adding freed CPUs to pool 0
Destroying Pool 7
adding freed CPUs to pool 0
root@dosorca:/data/images# sh numasplit.sh
Removing CPUs from Pool 0
Rewriting config file
Creating new pool
Using config file "cpupool.test"
cpupool name:   Pool-node1
scheduler:      credit
number of cpus: 1
Populating new pool
Removing CPUs from Pool 0
Rewriting config file
Creating new pool
Using config file "cpupool.test"
cpupool name:   Pool-node2
scheduler:      credit
number of cpus: 1
Populating new pool
Removing CPUs from Pool 0
(XEN) sched_credit.c, 572: prv: ffff831836df2970 &prv->lock: ffff831836df2970 prv->weight: 256 sdom->active_vcpu_count: 3 sdom->weight: 256
(XEN) sched_credit.c, 572: prv: ffff830437ffa5e0 &prv->lock: ffff830437ffa5e0 prv->weight: 768 sdom->active_vcpu_count: 4 sdom->weight: 256
(XEN) sched_credit.c, 572: prv: ffff830437ffa5e0 &prv->lock: ffff830437ffa5e0 prv->weight: 1024 sdom->active_vcpu_count: 5 sdom->weight: 256
(XEN) sched_credit.c, 572: prv: ffff830437ffa5e0 &prv->lock: ffff830437ffa5e0 prv->weight: 1280 sdom->active_vcpu_count: 6 sdom->weight: 256
(XEN) sched_credit.c, 572: prv: ffff830437ffa5e0 &prv->lock: ffff830437ffa5e0 prv->weight: 1536 sdom->active_vcpu_count: 7 sdom->weight: 256
(XEN) sched_credit.c, 572: prv: ffff830437ffa5e0 &prv->lock: ffff830437ffa5e0 prv->weight: 1792 sdom->active_vcpu_count: 8 sdom->weight: 256
(XEN) sched_credit.c, 572: prv: ffff830437ffa5e0 &prv->lock: ffff830437ffa5e0 prv->weight: 2048 sdom->active_vcpu_count: 9 sdom->weight: 256
(XEN) sched_credit.c, 572: prv: ffff830437ffa5e0 &prv->lock: ffff830437ffa5e0 prv->weight: 2304 sdom->active_vcpu_count: 10 sdom->weight: 256
(XEN) sched_credit.c, 572: prv: ffff830437ffa5e0 &prv->lock: ffff830437ffa5e0 prv->weight: 2560 sdom->active_vcpu_count: 11 sdom->weight: 256
(XEN) sched_credit.c, 572: prv: ffff830437ffa5e0 &prv->lock: ffff830437ffa5e0 prv->weight: 2816 sdom->active_vcpu_count: 12 sdom->weight: 256
(XEN) sched_credit.c, 572: prv: ffff830437ffa5e0 &prv->lock: ffff830437ffa5e0 prv->weight: 3072 sdom->active_vcpu_count: 13 sdom->weight: 256
(XEN) sched_credit.c, 572: prv: ffff830437ffa5e0 &prv->lock: ffff830437ffa5e0 prv->weight: 3328 sdom->active_vcpu_count: 14 sdom->weight: 256
(XEN) sched_credit.c, 572: prv: ffff830437ffa5e0 &prv->lock: ffff830437ffa5e0 prv->weight: 3584 sdom->active_vcpu_count: 15 sdom->weight: 256
(XEN) sched_credit.c, 572: prv: ffff830437ffa5e0 &prv->lock: ffff830437ffa5e0 prv->weight: 3840 sdom->active_vcpu_count: 16 sdom->weight: 256
(XEN) sched_credit.c, 572: prv: ffff830437ffa5e0 &prv->lock: ffff830437ffa5e0 prv->weight: 4096 sdom->active_vcpu_count: 17 sdom->weight: 256
(XEN) sched_credit.c, 572: prv: ffff830437ffa5e0 &prv->lock: ffff830437ffa5e0 prv->weight: 4352 sdom->active_vcpu_count: 18 sdom->weight: 256
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 0 on processor: 33 with state 0 violates invariant!
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 1 on processor: 35 with state 2 violates invariant!
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 2 on processor: 20 with state 2 violates invariant!
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 3 on processor: 26 with state 2 violates invariant!
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 4 on processor: 37 with state 2 violates invariant!
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 5 on processor: 36 with state 2 violates invariant!
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 6 on processor: 2 with state 2 violates invariant!
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 7 on processor: 24 with state 2 violates invariant!
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 8 on processor: 28 with state 2 violates invariant!
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 9 on processor: 40 with state 2 violates invariant!
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 10 on processor: 4 with state 2 violates invariant!
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 11 on processor: 44 with state 2 violates invariant!
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 12 on processor: 36 with state 2 violates invariant!
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 13 on processor: 29 with state 2 violates invariant!
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 14 on processor: 3 with state 2 violates invariant!
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 15 on processor: 13 with state 2 violates invariant!
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 16 on processor: 21 with state 2 violates invariant!
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 17 on processor: 1 with state 2 violates invariant!
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 18 on processor: 20 with state 2 violates invariant!
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 19 on processor: 28 with state 2 violates invariant!
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 20 on processor: 39 with state 2 violates invariant!
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 21 on processor: 34 with state 2 violates invariant!
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 22 on processor: 41 with state 2 violates invariant!
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 23 on processor: 0 with state 2 violates invariant!
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 24 on processor: 2 with state 2 violates invariant!
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 25 on processor: 22 with state 2 violates invariant!
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 26 on processor: 42 with state 2 violates invariant!
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 27 on processor: 43 with state 2 violates invariant!
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 28 on processor: 30 with state 2 violates invariant!
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 29 on processor: 27 with state 2 violates invariant!
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 30 on processor: 23 with state 2 violates invariant!
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 31 on processor: 32 with state 2 violates invariant!
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 32 on processor: 25 with state 0 violates invariant!
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 33 on processor: 46 with state 2 violates invariant!
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 34 on processor: 38 with state 2 violates invariant!
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 35 on processor: 4 with state 2 violates invariant!
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 36 on processor: 45 with state 2 violates invariant!
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 37 on processor: 34 with state 2 violates invariant!
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 38 on processor: 5 with state 2 violates invariant!
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 39 on processor: 1 with state 2 violates invariant!
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 40 on processor: 30 with state 2 violates invariant!
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 41 on processor: 28 with state 2 violates invariant!
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 42 on processor: 31 with state 2 violates invariant!
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 43 on processor: 0 with state 1 violates invariant!
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 44 on processor: 47 with state 2 violates invariant!
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 45 on processor: 29 with state 2 violates invariant!
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 46 on processor: 44 with state 2 violates invariant!
(XEN) BUG in sched_credit.c,1008: Domain 0 VCPU: 47 on processor: 20 with state 2 violates invariant!
(XEN) Xen BUG at sched_credit.c:1013
(XEN) ----[ Xen-4.1.0-rc3-pre  x86_64  debug=y  Not tainted ]----
(XEN) CPU:    0
(XEN) RIP:    e008:[<ffff82c4801182f3>] csched_acct+0x197/0x51d
(XEN) RFLAGS: 0000000000010087   CONTEXT: hypervisor
(XEN) rax: 0000000000000012   rbx: ffff830434321ec0   rcx: 0000000000000000
(XEN) rdx: 0000000000001200   rsi: 0000000000000012   rdi: 0000000000000100
(XEN) rbp: ffff82c480297e10   rsp: ffff82c480297d70   r8:  0000000000000100
(XEN) r9:  ffff82c480214a20   r10: 00000000fffffffc   r11: 0000000000000001
(XEN) r12: ffff830434322000   r13: ffff82c48011815c   r14: ffff83043399f018
(XEN) r15: ffff83043399f010   cr0: 000000008005003b   cr4: 00000000000006f0
(XEN) cr3: 0000000621001000   cr2: 00007f3818efa000
(XEN) ds: 002b   es: 002b   fs: 0000   gs: 0000   ss: e010   cs: e008
(XEN) Xen stack trace from rsp=ffff82c480297d70:
(XEN)    ffff830400000002 ffff82c480297e38 fffffed480118b9e 00000000000010ff
(XEN)    ffff830437ffa5e0 ffff830437ffa5e8 ffff82c4802d3ec0 ffff830437ffa5e0
(XEN)    0000000000000282 ffff830437ffa5e8 ffff830434321ec0 00002a309695b272
(XEN)    0000110000001100 0000000000000000 ffff82c400000000 ffff82c4802d3f80
(XEN)    ffff830437ffa5e0 ffff82c48011815c ffff83043399f018 ffff83043399f010
(XEN)    ffff82c480297e40 ffff82c480126144 0000000000000002 ffff830437ffa600
(XEN)    ffff82c4802d3f80 0000001de513cb60 ffff82c480297e90 ffff82c480126469
(XEN)    ffff82c48024b020 ffff82c4802d3f80 ffff83043399f010 0000000000000000
(XEN)    0000000000000000 ffff82c4802b0880 ffff82c480297f18 ffffffffffffffff
(XEN)    ffff82c480297ed0 ffff82c480123537 ffff8300c7e340f8 ffff82c480297f18
(XEN)    ffff82c48024b020 ffff82c480297f18 0000001de5129a7f ffff82c4802d3ec0
(XEN)    ffff82c480297ee0 ffff82c4801235b2 ffff82c480297f10 ffff82c4801565f5
(XEN)    0000000000000000 ffff8300c7cd6000 0000000000000000 ffff8300c7e34000
(XEN)    ffff82c480297d48 0000000000000000 0000000000000000 0000000000000000
(XEN)    ffffffff81a69060 ffff8817a8535f10 ffff8817a8535fd8 0000000000000246
(XEN)    ffff8817a8535e80 ffff880000000001 0000000000000000 0000000000000000
(XEN)    ffffffff810093aa 000000193592cbd4 00000000deadbeef 00000000deadbeef
(XEN)    0000010000000000 ffffffff810093aa 000000000000e033 0000000000000246
(XEN)    ffff8817a8535ef8 000000000000e02b 0000000000000000 0000000000000000
(XEN)    0000000000000000 0000000000000000 0000000000000000 ffff8300c7cd6000
(XEN) Xen call trace:
(XEN)    [<ffff82c4801182f3>] csched_acct+0x197/0x51d
(XEN)    [<ffff82c480126144>] execute_timer+0x4e/0x6c
(XEN)    [<ffff82c480126469>] timer_softirq_action+0xf2/0x245
(XEN)    [<ffff82c480123537>] __do_softirq+0x88/0x99
(XEN)    [<ffff82c4801235b2>] do_softirq+0x6a/0x7a
(XEN)    [<ffff82c4801565f5>] idle_loop+0x6a/0x6f
(XEN)    
(XEN) 
(XEN) ****************************************
(XEN) Panic on CPU 0:
(XEN) Xen BUG at sched_credit.c:1013
(XEN) ****************************************
(XEN) 
(XEN) Reboot in five seconds...
(XEN) Resetting with ACPI MEMORY or I/O RESET_REG.

[-- Attachment #3: sd_xen_caution_04.patch --]
[-- Type: text/x-patch, Size: 3991 bytes --]

diff -r 9a6458e0c3f5 xen/common/cpupool.c
--- a/xen/common/cpupool.c	Tue Feb 01 19:26:36 2011 +0000
+++ b/xen/common/cpupool.c	Thu Feb 03 18:51:40 2011 +0100
@@ -30,6 +30,7 @@
 static int cpupool_moving_cpu = -1;
 static struct cpupool *cpupool_cpu_moving = NULL;
 static cpumask_t cpupool_locked_cpus = CPU_MASK_NONE;
+static int cpupool_debug_move_continue = 0;
 
 static DEFINE_SPINLOCK(cpupool_lock);
 
@@ -226,6 +227,8 @@
                     cpupool_id, cpu, ret);
 
     spin_lock(&cpupool_lock);
+	BUG_ON(!cpupool_debug_move_continue); // Continuation still flagged?
+	BUG_ON(cpu != *((volatile int*)&cpupool_moving_cpu));
     ret = cpu_disable_scheduler(cpu);
     cpu_set(cpu, cpupool_free_cpus);
     if ( !ret )
@@ -236,6 +239,7 @@
         cpupool_put(cpupool_cpu_moving);
         cpupool_cpu_moving = NULL;
     }
+	cpupool_debug_move_continue = 0; // Continuation done.
     spin_unlock(&cpupool_lock);
     return ret;
 }
@@ -300,6 +304,8 @@
     atomic_inc(&c->refcnt);
     cpupool_cpu_moving = c;
     cpu_clear(cpu, c->cpu_valid);
+	BUG_ON(cpupool_debug_move_continue); // Only one outstanding continuation!
+	cpupool_debug_move_continue = 1;
     spin_unlock(&cpupool_lock);
 
     work_cpu = smp_processor_id();
@@ -309,6 +315,7 @@
         if ( work_cpu == cpu )
             work_cpu = next_cpu(cpu, cpupool0->cpu_valid);
     }
+	// SD NOTE:  Why not keep the protection through cpupool_lock until here?
     return continue_hypercall_on_cpu(work_cpu, cpupool_unassign_cpu_helper, c);
 
 out:
diff -r 9a6458e0c3f5 xen/common/sched_credit.c
--- a/xen/common/sched_credit.c	Tue Feb 01 19:26:36 2011 +0000
+++ b/xen/common/sched_credit.c	Thu Feb 03 18:51:40 2011 +0100
@@ -567,6 +567,14 @@
         list_add(&svc->active_vcpu_elem, &sdom->active_vcpu);
         /* Make weight per-vcpu */
         prv->weight += sdom->weight;
+        if (prv->weight < sdom->active_vcpu_count * sdom->weight) {
+            printk("%s, %i: Dom: %i VCPU: %i prv: %p &prv->lock: %p prv->weight: %i "\
+                   "sdom->active_vcpu_count: %i sdom->weight: %i\n",
+                   __FILE__, __LINE__, sdom->dom->domain_id, svc->vcpu->vcpu_id,
+                   (void*) prv, &(prv->lock), prv->weight,
+                   sdom->active_vcpu_count, sdom->weight);
+        }
+        //BUG_ON(prv->weight < sdom->active_vcpu_count * sdom->weight);
         if ( list_empty(&sdom->active_sdom_elem) )
         {
             list_add(&sdom->active_sdom_elem, &prv->active_sdom);
@@ -591,6 +599,14 @@
     sdom->active_vcpu_count--;
     list_del_init(&svc->active_vcpu_elem);
     prv->weight -= sdom->weight;
+    if (prv->weight < sdom->active_vcpu_count * sdom->weight) {
+         printk("%s, %i: Dom: %i VCPU: %i prv: %p &prv->lock: %p prv->weight: %i "\
+                "sdom->active_vcpu_count: %i sdom->weight: %i\n",
+                __FILE__, __LINE__, sdom->dom->domain_id, svc->vcpu->vcpu_id,
+                (void*) prv, &(prv->lock), prv->weight,
+                sdom->active_vcpu_count, sdom->weight);
+    }
+    //BUG_ON(prv->weight < sdom->active_vcpu_count * sdom->weight);
     if ( list_empty(&sdom->active_vcpu) )
     {
         list_del_init(&sdom->active_sdom_elem);
@@ -987,6 +1003,17 @@
         BUG_ON( is_idle_domain(sdom->dom) );
         BUG_ON( sdom->active_vcpu_count == 0 );
         BUG_ON( sdom->weight == 0 );
+        if ( (sdom->weight * sdom->active_vcpu_count) > weight_left ) {
+            struct domain *d = sdom->dom;
+            struct vcpu   *v;
+            for_each_vcpu ( d, v ) {
+                printk("BUG in %s,%i: Domain %i VCPU: %i on processor: %i with "\
+                       "state %i violates invariant!\n",
+                       __FILE__,__LINE__, d->domain_id, v->vcpu_id, v->processor,
+                       v->runstate.state);
+            }
+        }
+
         BUG_ON( (sdom->weight * sdom->active_vcpu_count) > weight_left );
 
         weight_left -= ( sdom->weight * sdom->active_vcpu_count );

[-- Attachment #4: Type: text/plain, Size: 138 bytes --]

_______________________________________________
Xen-devel mailing list
Xen-devel@lists.xensource.com
http://lists.xensource.com/xen-devel

^ permalink raw reply	[flat|nested] 53+ messages in thread

* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-02-07 12:38                             ` Andre Przywara
@ 2011-02-07 13:32                               ` Juergen Gross
  2011-02-07 15:55                                 ` George Dunlap
  0 siblings, 1 reply; 53+ messages in thread
From: Juergen Gross @ 2011-02-07 13:32 UTC (permalink / raw)
  To: Andre Przywara; +Cc: George Dunlap, xen-devel, Diestelhorst, Stephan

[-- Attachment #1: Type: text/plain, Size: 1842 bytes --]

On 02/07/11 13:38, Andre Przywara wrote:
> Juergen,
>
> as promised some more debug data. This is from c/s 22858 with Stephans
> debug patch (attached).
> We get the following dump when the hypervisor crashes, note that the
> first lock is different from the second and subsequent ones:
>
> (XEN) sched_credit.c, 572: prv: ffff831836df2970 &prv->lock:
> ffff831836df2970 prv->weight: 256 sdom->active_vcpu_count: 3
> sdom->weight: 256
> (XEN) sched_credit.c, 572: prv: ffff830437ffa5e0 &prv->lock:
> ffff830437ffa5e0 prv->weight: 768 sdom->active_vcpu_count: 4
> sdom->weight: 256
> (XEN) sched_credit.c, 572: prv: ffff830437ffa5e0 &prv->lock:
> ffff830437ffa5e0 prv->weight: 1024 sdom->active_vcpu_count: 5
> sdom->weight: 256
> (XEN) sched_credit.c, 572: prv: ffff830437ffa5e0 &prv->lock:
> ffff830437ffa5e0 prv->weight: 1280 sdom->active_vcpu_count: 6
> sdom->weight: 256
>
> ....
>
> Hope that gives you an idea. I attach the whole log for your reference.

Hmm, could it be that your log wasn't created with the attached patch? The
Dom-Id and VCPU are missing from the printk() above, and those would be
interesting (at least I hope so)...
Additionally, printing the local pcpu number would help, too.
And could you add a printk for the new prv address in csched_init()?

It would be nice if you could enable cpupool diag output. Please use the
attached patch (includes the previous patch for executing the cpu move on the
cpu to be moved, plus some diag printk corrections).


Juergen

-- 
Juergen Gross                 Principal Developer Operating Systems
TSP ES&S SWE OS6                       Telephone: +49 (0) 89 3222 2967
Fujitsu Technology Solutions              e-mail: juergen.gross@ts.fujitsu.com
Domagkstr. 28                           Internet: ts.fujitsu.com
D-80807 Muenchen                 Company details: ts.fujitsu.com/imprint.html

[-- Attachment #2: diag.patch --]
[-- Type: text/x-patch, Size: 5480 bytes --]

diff -r 7ada6faef565 xen/common/cpupool.c
--- a/xen/common/cpupool.c	Sun Feb 06 17:26:31 2011 +0000
+++ b/xen/common/cpupool.c	Mon Feb 07 14:26:50 2011 +0100
@@ -35,7 +35,7 @@ static DEFINE_SPINLOCK(cpupool_lock);
 
 DEFINE_PER_CPU(struct cpupool *, cpupool);
 
-#define cpupool_dprintk(x...) ((void)0)
+#define cpupool_dprintk(x...) printk(x)
 
 static struct cpupool *alloc_cpupool_struct(void)
 {
@@ -227,14 +227,30 @@ static int cpupool_assign_cpu_locked(str
     return 0;
 }
 
+static long cpupool_assign_cpu_helper(void *info)
+{
+    int cpu = cpupool_moving_cpu;
+    long ret;
+
+    cpupool_dprintk("cpupool_assign_cpu(pool=%d,cpu=%d)\n",
+                    cpupool_cpu_moving->cpupool_id, cpu);
+    BUG_ON(!is_idle_vcpu(current));
+    BUG_ON(cpu != smp_processor_id());
+    spin_lock(&cpupool_lock);
+    ret = cpupool_assign_cpu_locked(cpupool_cpu_moving, cpu);
+    spin_unlock(&cpupool_lock);
+    return ret;
+}
+
 static long cpupool_unassign_cpu_helper(void *info)
 {
     int cpu = cpupool_moving_cpu;
     long ret;
 
-    cpupool_dprintk("cpupool_unassign_cpu(pool=%d,cpu=%d) ret %ld\n",
-                    cpupool_id, cpu, ret);
-
+    cpupool_dprintk("cpupool_unassign_cpu(pool=%d,cpu=%d)\n",
+                    cpupool_cpu_moving->cpupool_id, cpu);
+    BUG_ON(!is_idle_vcpu(current));
+    BUG_ON(cpu != smp_processor_id());
     spin_lock(&cpupool_lock);
     ret = cpu_disable_scheduler(cpu);
     cpu_set(cpu, cpupool_free_cpus);
@@ -258,9 +274,51 @@ out:
 }
 
 /*
+ * assign a specific cpu to a cpupool
+ * we must be sure to run on the cpu to be assigned in idle! to achieve this
+ * the main functionality is performed via continue_hypercall_on_cpu on the
+ * specific cpu.
+ * possible failures:
+ * - cpu not free
+ * - cpu just being unplugged
+ */
+int cpupool_assign_cpu(struct cpupool *c, unsigned int cpu)
+{
+    int ret;
+
+    cpupool_dprintk("cpupool_assign_cpu(pool=%d,cpu=%d)\n",
+                    c->cpupool_id, cpu);
+
+    spin_lock(&cpupool_lock);
+    ret = -EBUSY;
+    if ( (cpupool_moving_cpu != -1) && (cpu != cpupool_moving_cpu) )
+        goto out;
+    if ( cpu_isset(cpu, cpupool_locked_cpus) )
+        goto out;
+
+    ret = 0;
+    if ( !cpu_isset(cpu, cpupool_free_cpus) && (cpu != cpupool_moving_cpu) )
+        goto out;
+
+    cpupool_moving_cpu = cpu;
+    atomic_inc(&c->refcnt);
+    cpupool_cpu_moving = c;
+    cpu_clear(cpu, c->cpu_valid);
+    spin_unlock(&cpupool_lock);
+
+    return continue_hypercall_on_cpu(cpu, cpupool_assign_cpu_helper, c);
+
+out:
+    spin_unlock(&cpupool_lock);
+    cpupool_dprintk("cpupool_assign_cpu(pool=%d,cpu=%d) ret %d\n",
+                    c->cpupool_id, cpu, ret);
+    return ret;
+}
+
+/*
  * unassign a specific cpu from a cpupool
- * we must be sure not to run on the cpu to be unassigned! to achieve this
- * the main functionality is performed via continue_hypercall_on_cpu on a
+ * we must be sure to run on the cpu to be unassigned in idle! to achieve this
+ * the main functionality is performed via continue_hypercall_on_cpu on the
  * specific cpu.
  * if the cpu to be removed is the last one of the cpupool no active domain
  * must be bound to the cpupool. dying domains are moved to cpupool0 as they
@@ -271,7 +329,6 @@ out:
  */
 int cpupool_unassign_cpu(struct cpupool *c, unsigned int cpu)
 {
-    int work_cpu;
     int ret;
     struct domain *d;
 
@@ -319,19 +376,12 @@ int cpupool_unassign_cpu(struct cpupool 
     cpu_clear(cpu, c->cpu_valid);
     spin_unlock(&cpupool_lock);
 
-    work_cpu = smp_processor_id();
-    if ( work_cpu == cpu )
-    {
-        work_cpu = first_cpu(cpupool0->cpu_valid);
-        if ( work_cpu == cpu )
-            work_cpu = next_cpu(cpu, cpupool0->cpu_valid);
-    }
-    return continue_hypercall_on_cpu(work_cpu, cpupool_unassign_cpu_helper, c);
+    return continue_hypercall_on_cpu(cpu, cpupool_unassign_cpu_helper, c);
 
 out:
     spin_unlock(&cpupool_lock);
     cpupool_dprintk("cpupool_unassign_cpu(pool=%d,cpu=%d) ret %d\n",
-                    cpupool_id, cpu, ret);
+                    c->cpupool_id, cpu, ret);
     return ret;
 }
 
@@ -345,7 +395,7 @@ int cpupool_add_domain(struct domain *d,
 {
     struct cpupool *c;
     int rc = 1;
-    int n_dom;
+    int n_dom = 0;
 
     if ( poolid == CPUPOOLID_NONE )
         return 0;
@@ -472,27 +522,15 @@ int cpupool_do_sysctl(struct xen_sysctl_
     {
         unsigned cpu;
 
+        c = __cpupool_get_by_id(op->cpupool_id, 0);
+        ret = -ENOENT;
+        if ( c == NULL )
+            break;
         cpu = op->cpu;
-        cpupool_dprintk("cpupool_assign_cpu(pool=%d,cpu=%d)\n",
-                        op->cpupool_id, cpu);
-        spin_lock(&cpupool_lock);
         if ( cpu == XEN_SYSCTL_CPUPOOL_PAR_ANY )
             cpu = first_cpu(cpupool_free_cpus);
-        ret = -EINVAL;
-        if ( cpu >= NR_CPUS )
-            goto addcpu_out;
-        ret = -EBUSY;
-        if ( !cpu_isset(cpu, cpupool_free_cpus) )
-            goto addcpu_out;
-        c = cpupool_find_by_id(op->cpupool_id, 0);
-        ret = -ENOENT;
-        if ( c == NULL )
-            goto addcpu_out;
-        ret = cpupool_assign_cpu_locked(c, cpu);
-    addcpu_out:
-        spin_unlock(&cpupool_lock);
-        cpupool_dprintk("cpupool_assign_cpu(pool=%d,cpu=%d) ret %d\n",
-                        op->cpupool_id, cpu, ret);
+        ret = (cpu < NR_CPUS) ? cpupool_assign_cpu(c, cpu) : -EINVAL;
+        cpupool_put(c);
     }
     break;
 

[-- Attachment #3: Type: text/plain, Size: 138 bytes --]

_______________________________________________
Xen-devel mailing list
Xen-devel@lists.xensource.com
http://lists.xensource.com/xen-devel

^ permalink raw reply	[flat|nested] 53+ messages in thread

* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-02-07 13:32                               ` Juergen Gross
@ 2011-02-07 15:55                                 ` George Dunlap
  2011-02-08  5:43                                   ` Juergen Gross
  0 siblings, 1 reply; 53+ messages in thread
From: George Dunlap @ 2011-02-07 15:55 UTC (permalink / raw)
  To: Juergen Gross; +Cc: Andre Przywara, xen-devel, Diestelhorst, Stephan

[-- Attachment #1: Type: text/plain, Size: 3406 bytes --]

Juergen,

What is supposed to happen if a domain is in cpupool0, and then all of
the cpus are taken out of cpupool0?  Is that possible?

It looks like there's code in cpupool.c:cpupool_unassign_cpu() which
will move all VMs in a cpupool to cpupool0 before removing the last
cpu.  But what happens if cpupool0 is the pool that has become empty?
It seems like that breaks a lot of the assumptions; e.g.,
sched_move_domain() seems to assume that the pool we're moving a VM to
actually has cpus.

While we're at it, what's with the "(cpu != cpupool_moving_cpu)" check in
the first half of cpupool_unassign_cpu()?  Under what conditions are you
anticipating cpupool_unassign_cpu() being called a second time before
the first completes?  If you have to abort the move because
schedule_cpu_switch() failed, wouldn't it be better just to roll the
whole transaction back, rather than leaving it hanging in the middle?
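
(By "roll the whole transaction back" I mean something like undoing the
state that was set up before the continuation.  A hypothetical sketch only;
the helper name is made up and the exact fields may differ:)

/* Hypothetical rollback helper -- for illustration only */
static void cpupool_unassign_cpu_rollback(struct cpupool *c, unsigned int cpu)
{
    spin_lock(&cpupool_lock);
    cpu_set(cpu, c->cpu_valid);   /* let the pool schedule on this cpu again */
    cpupool_moving_cpu = -1;      /* no move in flight any more */
    cpupool_cpu_moving = NULL;
    cpupool_put(c);               /* drop the reference taken for the move */
    spin_unlock(&cpupool_lock);
}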

Hmm, and why does RMCPU call cpupool_get_by_id() with exact==0?  What
could possibly be the use of grabbing a random cpupool and then trying
to remove the specified cpu from it?

Andre, you might think about folding the attached patch into your debug patch.

 -George

On Mon, Feb 7, 2011 at 1:32 PM, Juergen Gross
<juergen.gross@ts.fujitsu.com> wrote:
> On 02/07/11 13:38, Andre Przywara wrote:
>>
>> Juergen,
>>
>> as promised some more debug data. This is from c/s 22858 with Stephans
>> debug patch (attached).
>> We get the following dump when the hypervisor crashes, note that the
>> first lock is different from the second and subsequent ones:
>>
>> (XEN) sched_credit.c, 572: prv: ffff831836df2970 &prv->lock:
>> ffff831836df2970 prv->weight: 256 sdom->active_vcpu_count: 3
>> sdom->weight: 256
>> (XEN) sched_credit.c, 572: prv: ffff830437ffa5e0 &prv->lock:
>> ffff830437ffa5e0 prv->weight: 768 sdom->active_vcpu_count: 4
>> sdom->weight: 256
>> (XEN) sched_credit.c, 572: prv: ffff830437ffa5e0 &prv->lock:
>> ffff830437ffa5e0 prv->weight: 1024 sdom->active_vcpu_count: 5
>> sdom->weight: 256
>> (XEN) sched_credit.c, 572: prv: ffff830437ffa5e0 &prv->lock:
>> ffff830437ffa5e0 prv->weight: 1280 sdom->active_vcpu_count: 6
>> sdom->weight: 256
>>
>> ....
>>
>> Hope that gives you an idea. I attach the whole log for your reference.
>
> Hmm, could it be your log wasn't created with the attached patch? I'm
> missing
> Dom-Id and VCPU from the printk() above, which would be interesting (at
> least
> I hope so)...
> Additionally printing the local pcpu number would help, too.
> And could you add a printk for the new prv address in csched_init()?
>
> It would be nice if you could enable cpupool diag output. Please use the
> attached patch (includes the previous patch for executing the cpu move on
> the
> cpu to be moved, plus some diag printk corrections).
>
>
> Juergen
>
> --
> Juergen Gross                 Principal Developer Operating Systems
> TSP ES&S SWE OS6                       Telephone: +49 (0) 89 3222 2967
> Fujitsu Technology Solutions              e-mail:
> juergen.gross@ts.fujitsu.com
> Domagkstr. 28                           Internet: ts.fujitsu.com
> D-80807 Muenchen                 Company details:
> ts.fujitsu.com/imprint.html
>
> _______________________________________________
> Xen-devel mailing list
> Xen-devel@lists.xensource.com
> http://lists.xensource.com/xen-devel
>
>

[-- Attachment #2: cpupools-bug-on-move-to-self.diff --]
[-- Type: text/x-diff, Size: 364 bytes --]

diff -r 0be7c0cd27ad xen/common/schedule.c
--- a/xen/common/schedule.c	Mon Feb 07 14:50:21 2011 +0000
+++ b/xen/common/schedule.c	Mon Feb 07 15:53:56 2011 +0000
@@ -234,6 +234,8 @@
     void **vcpu_priv;
     void *domdata;
 
+    BUG_ON(d->cpupool == c);
+
     domdata = SCHED_OP(c->sched, alloc_domdata, d);
     if ( domdata == NULL )
         return -ENOMEM;

[-- Attachment #3: Type: text/plain, Size: 138 bytes --]

_______________________________________________
Xen-devel mailing list
Xen-devel@lists.xensource.com
http://lists.xensource.com/xen-devel

^ permalink raw reply	[flat|nested] 53+ messages in thread

* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-02-07 15:55                                 ` George Dunlap
@ 2011-02-08  5:43                                   ` Juergen Gross
  2011-02-08 12:08                                     ` George Dunlap
  0 siblings, 1 reply; 53+ messages in thread
From: Juergen Gross @ 2011-02-08  5:43 UTC (permalink / raw)
  To: George Dunlap; +Cc: Andre Przywara, xen-devel, Diestelhorst, Stephan

On 02/07/11 16:55, George Dunlap wrote:
> Juergen,
>
> What is supposed to happen if a domain is in cpupool0, and then all of
> the cpus are taken out of cpupool0?  Is that possible?

No. Cpupool0 can't be without any cpu, as Dom0 is always a member of cpupool0.

>
> It looks like there's code in cpupools.c:cpupool_unassign_cpu() which
> will move all VMs in a cpupool to cpupool0 before removing the last
> cpu.  But what happens if cpupool0 is the pool that has become empty?
> It seems like that breaks a lot of the assumptions; e.g.,
> sched_move_domain() seems to assume that the pool we're moving a VM to
> actually has cpus.

The move of VMs to cpupool0 is done only for domains which are dying.
If there are any active domains in the cpupool, removing the last cpu from
it will be denied.
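
For reference, the relevant part of cpupool_unassign_cpu() looks roughly
like this (simplified; the "last cpu of the pool" condition is paraphrased):

/* only reached when the last cpu of the pool is about to be removed */
for_each_domain(d)
{
    if ( d->cpupool != c )
        continue;
    if ( !d->is_dying )
    {
        ret = -EBUSY;                         /* active domain left -> refuse */
        break;
    }
    ret = sched_move_domain(d, cpupool0);     /* dying domains go to cpupool0 */
    if ( ret )
        break;
}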

>
> While we're at it, what's with the "(cpu != cpu_moving_cpu)" in the
> first half of cpupool_unassign_cpu()?  Under what conditions are you
> anticipating cpupool_unassign_cpu() being called a second time before
> the first completes?  If you have to abort the move because
> schedule_cpu_switch() failed, wouldn't it be better just to roll the
> whole transaction back, rather than leaving it hanging in the middle?

Not really. It can take some time until all vcpus have been migrated to
another cpu. In this case -EAGAIN is returned, and the cpu has already been
removed from the cpumask of valid cpus for that cpupool so that no other
vcpus get scheduled on it. Without cpupool_moving_cpu, forward progress
would not be guaranteed.
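
To make that concrete, the guard at the top of cpupool_unassign_cpu() is
essentially (simplified):

spin_lock(&cpupool_lock);
ret = -EBUSY;
if ( (cpupool_moving_cpu != -1) && (cpu != cpupool_moving_cpu) )
    goto out;                     /* another move is still in flight */
/* ... further checks ... */
cpupool_moving_cpu = cpu;         /* remember the move in progress */
atomic_inc(&c->refcnt);
cpupool_cpu_moving = c;
cpu_clear(cpu, c->cpu_valid);     /* nothing gets scheduled on it any more */
spin_unlock(&cpupool_lock);

So a retry for the same cpu is allowed (and needed to finish the -EAGAIN
case), while moving a different cpu is refused until the pending move has
completed.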

>
> Hmm, and why does RMCPU call cpupool_get_by_id() with exact==0?  What
> could possibly be the use of grabbing a random cpupool and then trying
> to remove the specified cpu from it?

This is a very good question :-)
I think this should be fixed. Seems to be a copy and paste error. I'll send a
patch.


Thanks for your thoughts,


Juergen

-- 
Juergen Gross                 Principal Developer Operating Systems
TSP ES&S SWE OS6                       Telephone: +49 (0) 89 3222 2967
Fujitsu Technology Solutions              e-mail: juergen.gross@ts.fujitsu.com
Domagkstr. 28                           Internet: ts.fujitsu.com
D-80807 Muenchen                 Company details: ts.fujitsu.com/imprint.html

^ permalink raw reply	[flat|nested] 53+ messages in thread

* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-02-08  5:43                                   ` Juergen Gross
@ 2011-02-08 12:08                                     ` George Dunlap
  2011-02-08 12:14                                       ` George Dunlap
  2011-02-08 12:23                                       ` Juergen Gross
  0 siblings, 2 replies; 53+ messages in thread
From: George Dunlap @ 2011-02-08 12:08 UTC (permalink / raw)
  To: Juergen Gross; +Cc: Andre Przywara, xen-devel, Diestelhorst, Stephan

On Tue, Feb 8, 2011 at 5:43 AM, Juergen Gross
<juergen.gross@ts.fujitsu.com> wrote:
> On 02/07/11 16:55, George Dunlap wrote:
>>
>> Juergen,
>>
>> What is supposed to happen if a domain is in cpupool0, and then all of
>> the cpus are taken out of cpupool0?  Is that possible?
>
> No. Cpupool0 can't be without any cpu, as Dom0 is always member of cpupool0.

If that's the case, then since Andre is running this immediately after
boot, he shouldn't be seeing any vcpus in the new pools; and all of
the dom0 vcpus should be migrated to cpupool0, right?  Is it possible
that the migration process isn't happening properly?

It looks like schedule.c:cpu_disable_scheduler() will try to migrate
all vcpus, and if it fails to migrate, it returns -EAGAIN so that the
tools will try again.  It's probably worth instrumenting that whole
code-path to make sure it actually happens as we expect.  Are we
certain, for example, that a hypercall continued on another cpu
will actually return the new error value properly?
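
(For reference, the shape of the per-vcpu code in cpu_disable_scheduler()
is roughly the following -- simplified, from memory:)

if ( v->processor == cpu )
{
    set_bit(_VPF_migrating, &v->pause_flags);
    vcpu_schedule_unlock_irq(v);
    vcpu_sleep_nosync(v);
    vcpu_migrate(v);
}
/* if the vcpu is still on this cpu afterwards, ask the caller to retry */
if ( v->processor == cpu )
    ret = -EAGAIN;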

Another minor thing: In cpupool.c:cpupool_unassign_cpu_helper(), why
is the cpu's bit set in cpupool_free_cpus without checking to see if
the cpu_disable_scheduler() call actually worked?  Shouldn't that also
be inside the if() statement?

 -George

^ permalink raw reply	[flat|nested] 53+ messages in thread

* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-02-08 12:08                                     ` George Dunlap
@ 2011-02-08 12:14                                       ` George Dunlap
  2011-02-08 16:33                                         ` Andre Przywara
  2011-02-08 12:23                                       ` Juergen Gross
  1 sibling, 1 reply; 53+ messages in thread
From: George Dunlap @ 2011-02-08 12:14 UTC (permalink / raw)
  To: Juergen Gross; +Cc: Andre Przywara, xen-devel, Diestelhorst, Stephan

[-- Attachment #1: Type: text/plain, Size: 1498 bytes --]

Andre,

Can you try again with the attached patch?

Thanks,
 -George

On Tue, Feb 8, 2011 at 12:08 PM, George Dunlap
<George.Dunlap@eu.citrix.com> wrote:
> On Tue, Feb 8, 2011 at 5:43 AM, Juergen Gross
> <juergen.gross@ts.fujitsu.com> wrote:
>> On 02/07/11 16:55, George Dunlap wrote:
>>>
>>> Juergen,
>>>
>>> What is supposed to happen if a domain is in cpupool0, and then all of
>>> the cpus are taken out of cpupool0?  Is that possible?
>>
>> No. Cpupool0 can't be without any cpu, as Dom0 is always member of cpupool0.
>
> If that's the case, then since Andre is running this immediately after
> boot, he shouldn't be seeing any vcpus in the new pools; and all of
> the dom0 vcpus should be migrated to cpupool0, right?  Is it possible
> that migration process isn't happening properly?
>
> It looks like schedule.c:cpu_disable_scheduler() will try to migrate
> all vcpus, and if it fails to migrate, it returns -EAGAIN so that the
> tools will try again.  It's probably worth instrumenting that whole
> code-path to make sure it actually happens as we expect.  Are we
> certain, for example, that if a hypercall continued on another cpu
> will actually return the new error value properly?
>
> Another minor thing: In cpupool.c:cpupool_unassign_cpu_helper(), why
> is the cpu's bit set in cpupool_free_cpus without checking to see if
> the cpu_disable_scheduler() call actually worked?  Shouldn't that also
> be inside the if() statement?
>
>  -George
>

[-- Attachment #2: cpupools-vcpu-migrate-debug.diff --]
[-- Type: text/x-diff, Size: 1231 bytes --]

diff -r 9e463cb15658 xen/common/cpupool.c
--- a/xen/common/cpupool.c	Mon Feb 07 17:02:46 2011 +0000
+++ b/xen/common/cpupool.c	Tue Feb 08 12:13:35 2011 +0000
@@ -297,6 +297,8 @@
         {
             if ( d->cpupool != c )
                 continue;
+            /* Don't allow a cpu to be moved if there's a live
+             * domain still running on it */
             if ( !d->is_dying )
             {
                 ret = -EBUSY;
diff -r 9e463cb15658 xen/common/schedule.c
--- a/xen/common/schedule.c	Mon Feb 07 17:02:46 2011 +0000
+++ b/xen/common/schedule.c	Tue Feb 08 12:13:35 2011 +0000
@@ -495,6 +495,8 @@
 
             if ( v->processor == cpu )
             {
+                printk("%s: Migrating d%dv%d from cpu %d\n",
+                       __func__, d->domain_id, v->vcpu_id, cpu);
                 set_bit(_VPF_migrating, &v->pause_flags);
                 vcpu_schedule_unlock_irq(v);
                 vcpu_sleep_nosync(v);
@@ -511,7 +513,10 @@
              * all locks.
              */
             if ( v->processor == cpu )
+            {
+                printk("  Migration failed, must retry later.\n");
                 ret = -EAGAIN;
+            }
         }
 
         if ( affinity_broken )

[-- Attachment #3: Type: text/plain, Size: 138 bytes --]

_______________________________________________
Xen-devel mailing list
Xen-devel@lists.xensource.com
http://lists.xensource.com/xen-devel

^ permalink raw reply	[flat|nested] 53+ messages in thread

* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-02-08 12:08                                     ` George Dunlap
  2011-02-08 12:14                                       ` George Dunlap
@ 2011-02-08 12:23                                       ` Juergen Gross
  1 sibling, 0 replies; 53+ messages in thread
From: Juergen Gross @ 2011-02-08 12:23 UTC (permalink / raw)
  To: George Dunlap; +Cc: Andre Przywara, xen-devel, Diestelhorst, Stephan

On 02/08/11 13:08, George Dunlap wrote:
> On Tue, Feb 8, 2011 at 5:43 AM, Juergen Gross
> <juergen.gross@ts.fujitsu.com>  wrote:
>> On 02/07/11 16:55, George Dunlap wrote:
>>>
>>> Juergen,
>>>
>>> What is supposed to happen if a domain is in cpupool0, and then all of
>>> the cpus are taken out of cpupool0?  Is that possible?
>>
>> No. Cpupool0 can't be without any cpu, as Dom0 is always member of cpupool0.
>
> If that's the case, then since Andre is running this immediately after
> boot, he shouldn't be seeing any vcpus in the new pools; and all of
> the dom0 vcpus should be migrated to cpupool0, right?  Is it possible
> that migration process isn't happening properly?

Again: it is not the vcpus that are migrated to cpupool0, but the physical
cpus that are taken away from it, so the vcpus active on the cpu to be moved
MUST be migrated to other cpus of cpupool0.
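
In other words, after a cpu has been removed from cpupool0 the following
must hold for every Dom0 vcpu (just an illustration of the invariant, not a
proposed patch):

for_each_vcpu ( dom0, v )
{
    BUG_ON( v->processor == cpu );                           /* nobody left behind */
    BUG_ON( !cpu_isset(v->processor, cpupool0->cpu_valid) ); /* still in cpupool0 */
}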

>
> It looks like schedule.c:cpu_disable_scheduler() will try to migrate
> all vcpus, and if it fails to migrate, it returns -EAGAIN so that the
> tools will try again.  It's probably worth instrumenting that whole
> code-path to make sure it actually happens as we expect.  Are we
> certain, for example, that if a hypercall continued on another cpu
> will actually return the new error value properly?

I have checked that and never saw any problem. And yes, I did see
the EAGAIN case happen.
With my test patch, which always executes cpu_disable_scheduler() on the
cpu to be moved, this should not be a problem at all, since the tasklet
always runs in the idle vcpu.
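
(That is exactly what the two BUG_ONs at the top of the helper in the diag
patch assert:)

BUG_ON(!is_idle_vcpu(current));      /* continuation runs in the idle vcpu... */
BUG_ON(cpu != smp_processor_id());   /* ...and on the cpu that is being moved */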

>
> Another minor thing: In cpupool.c:cpupool_unassign_cpu_helper(), why
> is the cpu's bit set in cpupool_free_cpus without checking to see if
> the cpu_disable_scheduler() call actually worked?  Shouldn't that also
> be inside the if() statement?

No, I don't think so. If removing a cpu fails permanently after having
returned -EAGAIN before, it should be easy to add it back to the original
cpupool. That is only possible if it is flagged as free. Adding it to
another cpupool will still be denied, as cpupool_cpu_moving is set.
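
For reference, the helper does roughly this (simplified; the
schedule_cpu_switch() call and some bookkeeping are elided):

spin_lock(&cpupool_lock);
ret = cpu_disable_scheduler(cpu);
cpu_set(cpu, cpupool_free_cpus);    /* marked free even when ret == -EAGAIN */
if ( !ret )
{
    /* only on success is the move fully completed */
    cpupool_moving_cpu = -1;
    cpupool_put(cpupool_cpu_moving);
    cpupool_cpu_moving = NULL;
}
spin_unlock(&cpupool_lock);
return ret;

With the cpu flagged as free, a removal that ultimately fails can simply be
undone by adding the cpu back to the original pool.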


Juergen

-- 
Juergen Gross                 Principal Developer Operating Systems
TSP ES&S SWE OS6                       Telephone: +49 (0) 89 3222 2967
Fujitsu Technology Solutions              e-mail: juergen.gross@ts.fujitsu.com
Domagkstr. 28                           Internet: ts.fujitsu.com
D-80807 Muenchen                 Company details: ts.fujitsu.com/imprint.html

^ permalink raw reply	[flat|nested] 53+ messages in thread

* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-02-08 12:14                                       ` George Dunlap
@ 2011-02-08 16:33                                         ` Andre Przywara
  2011-02-09 12:27                                           ` George Dunlap
  0 siblings, 1 reply; 53+ messages in thread
From: Andre Przywara @ 2011-02-08 16:33 UTC (permalink / raw)
  To: George Dunlap; +Cc: Juergen Gross, xen-devel, Diestelhorst, Stephan

[-- Attachment #1: Type: text/plain, Size: 1935 bytes --]

George Dunlap wrote:
> Andre,
> 
> Can you try again with the attached patch?
Sure. Unfortunately (or is this a good sign?) the "Migration failed"
message didn't trigger; I only saw various instances of the other
printk, see the attached log file.
Migration happens quite often, because Dom0 has 48 vCPUs and in the
end they are squashed into fewer and fewer pCPUs. I guess that is the
reason why I see it on my machine.

Regards,
Andre.

> 
> Thanks,
>  -George
> 
> On Tue, Feb 8, 2011 at 12:08 PM, George Dunlap
> <George.Dunlap@eu.citrix.com> wrote:
>> On Tue, Feb 8, 2011 at 5:43 AM, Juergen Gross
>> <juergen.gross@ts.fujitsu.com> wrote:
>>> On 02/07/11 16:55, George Dunlap wrote:
>>>> Juergen,
>>>>
>>>> What is supposed to happen if a domain is in cpupool0, and then all of
>>>> the cpus are taken out of cpupool0?  Is that possible?
>>> No. Cpupool0 can't be without any cpu, as Dom0 is always member of cpupool0.
>> If that's the case, then since Andre is running this immediately after
>> boot, he shouldn't be seeing any vcpus in the new pools; and all of
>> the dom0 vcpus should be migrated to cpupool0, right?  Is it possible
>> that migration process isn't happening properly?
>>
>> It looks like schedule.c:cpu_disable_scheduler() will try to migrate
>> all vcpus, and if it fails to migrate, it returns -EAGAIN so that the
>> tools will try again.  It's probably worth instrumenting that whole
>> code-path to make sure it actually happens as we expect.  Are we
>> certain, for example, that if a hypercall continued on another cpu
>> will actually return the new error value properly?
>>
>> Another minor thing: In cpupool.c:cpupool_unassign_cpu_helper(), why
>> is the cpu's bit set in cpupool_free_cpus without checking to see if
>> the cpu_disable_scheduler() call actually worked?  Shouldn't that also
>> be inside the if() statement?
>>
>>  -George
>>


-- 
Andre Przywara
AMD-OSRC (Dresden)
Tel: x29712

[-- Attachment #2: george_debug.log --]
[-- Type: text/plain, Size: 8076 bytes --]

root@dosorca:/data/images# sh numasplit.sh
Removing CPUs from Pool 0
(XEN) cpu_disable_scheduler: Migrating d0v14 from cpu 6
(XEN) cpu_disable_scheduler: Migrating d0v26 from cpu 6
(XEN) cpu_disable_scheduler: Migrating d0v9 from cpu 7
(XEN) cpu_disable_scheduler: Migrating d0v23 from cpu 7
(XEN) cpu_disable_scheduler: Migrating d0v9 from cpu 8
(XEN) cpu_disable_scheduler: Migrating d0v19 from cpu 8
(XEN) cpu_disable_scheduler: Migrating d0v0 from cpu 9
(XEN) cpu_disable_scheduler: Migrating d0v9 from cpu 9
(XEN) cpu_disable_scheduler: Migrating d0v19 from cpu 9
(XEN) cpu_disable_scheduler: Migrating d0v0 from cpu 10
(XEN) cpu_disable_scheduler: Migrating d0v9 from cpu 10
(XEN) cpu_disable_scheduler: Migrating d0v19 from cpu 10
(XEN) cpu_disable_scheduler: Migrating d0v0 from cpu 11
(XEN) cpu_disable_scheduler: Migrating d0v9 from cpu 11
(XEN) cpu_disable_scheduler: Migrating d0v19 from cpu 11
(XEN) cpu_disable_scheduler: Migrating d0v31 from cpu 11
Rewriting config file
Creating new pool
Using config file "cpupool.test"
cpupool name:   Pool-node1
scheduler:      credit
number of cpus: 1
Populating new pool
Removing CPUs from Pool 0
(XEN) cpu_disable_scheduler: Migrating d0v44 from cpu 12
(XEN) cpu_disable_scheduler: Migrating d0v14 from cpu 13
(XEN) cpu_disable_scheduler: Migrating d0v33 from cpu 13
(XEN) cpu_disable_scheduler: Migrating d0v44 from cpu 13
(XEN) cpu_disable_scheduler: Migrating d0v10 from cpu 14
(XEN) cpu_disable_scheduler: Migrating d0v33 from cpu 14
(XEN) cpu_disable_scheduler: Migrating d0v44 from cpu 14
(XEN) cpu_disable_scheduler: Migrating d0v10 from cpu 15
(XEN) cpu_disable_scheduler: Migrating d0v33 from cpu 15
(XEN) cpu_disable_scheduler: Migrating d0v44 from cpu 15
(XEN) cpu_disable_scheduler: Migrating d0v10 from cpu 16
(XEN) cpu_disable_scheduler: Migrating d0v33 from cpu 16
(XEN) cpu_disable_scheduler: Migrating d0v41 from cpu 16
(XEN) cpu_disable_scheduler: Migrating d0v10 from cpu 17
(XEN) cpu_disable_scheduler: Migrating d0v32 from cpu 17
(XEN) cpu_disable_scheduler: Migrating d0v41 from cpu 17
Rewriting config file
Creating new pool
Using config file "cpupool.test"
cpupool name:   Pool-node2
scheduler:      credit
number of cpus: 1
Populating new pool
Removing CPUs from Pool 0
(XEN) cpu_disable_scheduler: Migrating d0v10 from cpu 18
(XEN) cpu_disable_scheduler: Migrating d0v29 from cpu 18
(XEN) cpu_disable_scheduler: Migrating d0v41 from cpu 18
(XEN) cpu_disable_scheduler: Migrating d0v29 from cpu 19
(XEN) cpu_disable_scheduler: Migrating d0v41 from cpu 19
(XEN) cpu_disable_scheduler: Migrating d0v6 from cpu 20
(XEN) cpu_disable_scheduler: Migrating d0v29 from cpu 20
(XEN) cpu_disable_scheduler: Migrating d0v41 from cpu 20
(XEN) cpu_disable_scheduler: Migrating d0v3 from cpu 21
(XEN) cpu_disable_scheduler: Migrating d0v14 from cpu 21
(XEN) cpu_disable_scheduler: Migrating d0v29 from cpu 21
(XEN) cpu_disable_scheduler: Migrating d0v41 from cpu 21
(XEN) cpu_disable_scheduler: Migrating d0v3 from cpu 22
(XEN) cpu_disable_scheduler: Migrating d0v14 from cpu 22
(XEN) cpu_disable_scheduler: Migrating d0v23 from cpu 22
(XEN) cpu_disable_scheduler: Migrating d0v29 from cpu 22
(XEN) cpu_disable_scheduler: Migrating d0v41 from cpu 22
(XEN) cpu_disable_scheduler: Migrating d0v3 from cpu 23
(XEN) cpu_disable_scheduler: Migrating d0v14 from cpu 23
(XEN) cpu_disable_scheduler: Migrating d0v23 from cpu 23
(XEN) cpu_disable_scheduler: Migrating d0v29 from cpu 23
Rewriting config file
Creating new pool
Using config file "cpupool.test"
cpupool name:   Pool-node3
scheduler:      credit
number of cpus: 1
Populating new pool
Removing CPUs from Pool 0
(XEN) cpu_disable_scheduler: Migrating d0v18 from cpu 24
(XEN) cpu_disable_scheduler: Migrating d0v34 from cpu 24
(XEN) cpu_disable_scheduler: Migrating d0v42 from cpu 24
(XEN) cpu_disable_scheduler: Migrating d0v18 from cpu 25
(XEN) cpu_disable_scheduler: Migrating d0v34 from cpu 25
(XEN) cpu_disable_scheduler: Migrating d0v42 from cpu 25
(XEN) cpu_disable_scheduler: Migrating d0v18 from cpu 26
(XEN) cpu_disable_scheduler: Migrating d0v32 from cpu 26
(XEN) cpu_disable_scheduler: Migrating d0v42 from cpu 26
(XEN) cpu_disable_scheduler: Migrating d0v18 from cpu 27
(XEN) cpu_disable_scheduler: Migrating d0v24 from cpu 27
(XEN) cpu_disable_scheduler: Migrating d0v32 from cpu 27
(XEN) cpu_disable_scheduler: Migrating d0v42 from cpu 27
(XEN) cpu_disable_scheduler: Migrating d0v3 from cpu 28
(XEN) cpu_disable_scheduler: Migrating d0v18 from cpu 28
(XEN) cpu_disable_scheduler: Migrating d0v25 from cpu 28
(XEN) cpu_disable_scheduler: Migrating d0v32 from cpu 28
(XEN) cpu_disable_scheduler: Migrating d0v39 from cpu 28
(XEN) cpu_disable_scheduler: Migrating d0v3 from cpu 29
(XEN) cpu_disable_scheduler: Migrating d0v18 from cpu 29
(XEN) cpu_disable_scheduler: Migrating d0v25 from cpu 29
(XEN) cpu_disable_scheduler: Migrating d0v32 from cpu 29
(XEN) cpu_disable_scheduler: Migrating d0v39 from cpu 29
Rewriting config file
Creating new pool
Using config file "cpupool.test"
cpupool name:   Pool-node4
scheduler:      credit
number of cpus: 1
(XEN) Xen BUG at sched_credit.c:384
(XEN) ----[ Xen-4.1.0-rc3-pre  x86_64  debug=y  Not tainted ]----
(XEN) CPU:    32
(XEN) RIP:    e008:[<ffff82c480117fa0>] csched_alloc_pdata+0x146/0x17f
(XEN) RFLAGS: 0000000000010093   CONTEXT: hypervisor
(XEN) rax: ffff830434322000   rbx: ffff830a3800f1e8   rcx: 0000000000000018
(XEN) rdx: ffff82c4802d3ec0   rsi: 0000000000000002   rdi: ffff83043445e100
(XEN) rbp: ffff8304343efce8   rsp: ffff8304343efca8   r8:  0000000000000001
(XEN) r9:  ffff830a3800f1e8   r10: ffff82c480219dc0   r11: 0000000000000286
(XEN) r12: 0000000000000018   r13: ffff8310341a7d50   r14: ffff830a3800f1d0
(XEN) r15: 0000000000000018   cr0: 000000008005003b   cr4: 00000000000006f0
(XEN) cr3: 0000000806aed000   cr2: 00007f50c671def5
(XEN) ds: 0000   es: 0000   fs: 0000   gs: 0000   ss: e010   cs: e008
(XEN) Xen stack trace from rsp=ffff8304343efca8:
(XEN)    ffff8304343efcb8 ffff8310341a7d50 0000000000000282 0000000000000018
(XEN)    ffff830a3800f460 ffff8310341a7c60 0000000000000018 ffff82c4802b0880
(XEN)    ffff8304343efd58 ffff82c48011fa63 ffff82f601024d80 000000000008126c
(XEN)    ffff8300c7e42000 0000000000000000 0000080000000000 ffff82c480248b80
(XEN)    0000000000000002 0000000000000018 ffff830a3800f460 0000000000305000
(XEN)    ffff82c4802550e4 ffff82c4802b0880 ffff8304343efd78 ffff82c48010188c
(XEN)    ffff8304343efe40 0000000000000018 ffff8304343efdb8 ffff82c480101b94
(XEN)    ffff8304343efdb8 ffff82c480183562 fffffffe00000286 ffff8304343eff18
(XEN)    000000000066e004 0000000000305000 ffff8304343efef8 ffff82c4801252a1
(XEN)    ffff8304343efdd8 0000000180153c8d 0000000000000000 ffff82c4801068f8
(XEN)    0000000000000296 ffff8300c7e1e1c8 aaaaaaaaaaaaaaaa 0000000000000000
(XEN)    ffff88007d094170 ffff88007d094170 ffff8304343efef8 ffff82c480113d8a
(XEN)    ffff8304343efe78 ffff8304343efe88 0000000800000012 0000000400000004
(XEN)    00007fff00000001 0000000000000018 00000000000000b3 0000000000000072
(XEN)    00007f50c64e5960 0000000000000018 00007fff85f117c0 00007f50c6b48342
(XEN)    0000000000000001 0000000000000000 0000000000000018 0000000000000004
(XEN)    000000000066d050 000000000066e000 85f1189c00000000 0000000000000033
(XEN)    ffff8304343efed8 ffff8300c7e1e000 00007fff85f11600 0000000000305000
(XEN)    0000000000000003 0000000000000003 00007cfbcbc100c7 ffff82c480207be8
(XEN)    ffffffff8100946a 0000000000000023 0000000000000003 0000000000000003
(XEN) Xen call trace:
(XEN)    [<ffff82c480117fa0>] csched_alloc_pdata+0x146/0x17f
(XEN)    [<ffff82c48011fa63>] schedule_cpu_switch+0x75/0x1cd
(XEN)    [<ffff82c48010188c>] cpupool_assign_cpu_locked+0x44/0x8b
(XEN)    [<ffff82c480101b94>] cpupool_do_sysctl+0x1fb/0x461
(XEN)    [<ffff82c4801252a1>] do_sysctl+0x921/0xa30
(XEN)    [<ffff82c480207be8>] syscall_enter+0xc8/0x122
(XEN)    
(XEN) 
(XEN) ****************************************
(XEN) Panic on CPU 32:
(XEN) Xen BUG at sched_credit.c:384
(XEN) ****************************************
(XEN) 
(XEN) Reboot in five seconds...

[-- Attachment #3: Type: text/plain, Size: 138 bytes --]

_______________________________________________
Xen-devel mailing list
Xen-devel@lists.xensource.com
http://lists.xensource.com/xen-devel

^ permalink raw reply	[flat|nested] 53+ messages in thread

* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-02-08 16:33                                         ` Andre Przywara
@ 2011-02-09 12:27                                           ` George Dunlap
  2011-02-09 12:27                                             ` George Dunlap
  0 siblings, 1 reply; 53+ messages in thread
From: George Dunlap @ 2011-02-09 12:27 UTC (permalink / raw)
  To: Andre Przywara; +Cc: Juergen Gross, xen-devel, Diestelhorst, Stephan

On Tue, Feb 8, 2011 at 4:33 PM, Andre Przywara <andre.przywara@amd.com> wrote:
> (XEN) cpu_disable_scheduler: Migrating d0v18 from cpu 24
> (XEN) cpu_disable_scheduler: Migrating d0v34 from cpu 24
> (XEN) cpu_disable_scheduler: Migrating d0v42 from cpu 24
> (XEN) cpu_disable_scheduler: Migrating d0v18 from cpu 25
> (XEN) cpu_disable_scheduler: Migrating d0v34 from cpu 25
> (XEN) cpu_disable_scheduler: Migrating d0v42 from cpu 25
> (XEN) cpu_disable_scheduler: Migrating d0v18 from cpu 26
> (XEN) cpu_disable_scheduler: Migrating d0v32 from cpu 26
> (XEN) cpu_disable_scheduler: Migrating d0v42 from cpu 26
> (XEN) cpu_disable_scheduler: Migrating d0v18 from cpu 27
> (XEN) cpu_disable_scheduler: Migrating d0v24 from cpu 27
> (XEN) cpu_disable_scheduler: Migrating d0v32 from cpu 27
> (XEN) cpu_disable_scheduler: Migrating d0v42 from cpu 27
> (XEN) cpu_disable_scheduler: Migrating d0v3 from cpu 28
> (XEN) cpu_disable_scheduler: Migrating d0v18 from cpu 28
> (XEN) cpu_disable_scheduler: Migrating d0v25 from cpu 28
> (XEN) cpu_disable_scheduler: Migrating d0v32 from cpu 28
> (XEN) cpu_disable_scheduler: Migrating d0v39 from cpu 28
> (XEN) cpu_disable_scheduler: Migrating d0v3 from cpu 29

Interesting -- what seems to happen here is that as cpus are disabled,
vcpus are "shovelled" in an accumulative fashion from one cpu to the
next:
* v18,34,42 start on cpu 24.
* When 24 is brought down, they're all migrated to 25; then when 25 is
brought down, to 26, then to 27.
* v24 is running on cpu 27, so when 27 is brought down, v24 is added to the mix.
* v3 is running on cpu 28, so all of them plus v3 are shovelled onto cpu 29.

While that behavior may not be ideal, it should certainly be bug-free.

Another interesting thing to note is that the bug happened on pcpu 32,
but there were no advertised migrations from that cpu.

Andre, can you fold the attached patch into your testing?

Thanks for all your work on this.

 -George

^ permalink raw reply	[flat|nested] 53+ messages in thread

* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-02-09 12:27                                           ` George Dunlap
@ 2011-02-09 12:27                                             ` George Dunlap
  2011-02-09 13:04                                               ` Juergen Gross
  2011-02-09 13:51                                               ` Andre Przywara
  0 siblings, 2 replies; 53+ messages in thread
From: George Dunlap @ 2011-02-09 12:27 UTC (permalink / raw)
  To: Andre Przywara; +Cc: Juergen Gross, xen-devel, Diestelhorst, Stephan

[-- Attachment #1: Type: text/plain, Size: 2155 bytes --]

Sorry, forgot the patch...
 -G

On Wed, Feb 9, 2011 at 12:27 PM, George Dunlap
<George.Dunlap@eu.citrix.com> wrote:
> On Tue, Feb 8, 2011 at 4:33 PM, Andre Przywara <andre.przywara@amd.com> wrote:
>> (XEN) cpu_disable_scheduler: Migrating d0v18 from cpu 24
>> (XEN) cpu_disable_scheduler: Migrating d0v34 from cpu 24
>> (XEN) cpu_disable_scheduler: Migrating d0v42 from cpu 24
>> (XEN) cpu_disable_scheduler: Migrating d0v18 from cpu 25
>> (XEN) cpu_disable_scheduler: Migrating d0v34 from cpu 25
>> (XEN) cpu_disable_scheduler: Migrating d0v42 from cpu 25
>> (XEN) cpu_disable_scheduler: Migrating d0v18 from cpu 26
>> (XEN) cpu_disable_scheduler: Migrating d0v32 from cpu 26
>> (XEN) cpu_disable_scheduler: Migrating d0v42 from cpu 26
>> (XEN) cpu_disable_scheduler: Migrating d0v18 from cpu 27
>> (XEN) cpu_disable_scheduler: Migrating d0v24 from cpu 27
>> (XEN) cpu_disable_scheduler: Migrating d0v32 from cpu 27
>> (XEN) cpu_disable_scheduler: Migrating d0v42 from cpu 27
>> (XEN) cpu_disable_scheduler: Migrating d0v3 from cpu 28
>> (XEN) cpu_disable_scheduler: Migrating d0v18 from cpu 28
>> (XEN) cpu_disable_scheduler: Migrating d0v25 from cpu 28
>> (XEN) cpu_disable_scheduler: Migrating d0v32 from cpu 28
>> (XEN) cpu_disable_scheduler: Migrating d0v39 from cpu 28
>> (XEN) cpu_disable_scheduler: Migrating d0v3 from cpu 29
>
> Interesting -- what seems to happen here is that as cpus are disabled,
> vcpus are "shovelled" in an accumulative fashion from one cpu to the
> next:
> * v18,34,42 start on cpu 24.
> * When 24 is brought down, they're all migrated to 25; then when 25 is
> brougth down, to 26, then to 27
> * v24 is running on cpu 27, so when 27 is brought down, v24 is added to the mix
> * v3 is running on cpu 28, so all of them plus v3 are shoveled onto cpu 29.
>
> While that behavior may not be ideal, it should certainly be bug-free.
>
> Another interesting thing to note is that the bug happened on pcpu 32,
> but there were no advertised migrations from that cpu.
>
> Andre, can you fold the attached patch into your testing?
>
> Thanks for all your work on this.
>
>  -George
>

[-- Attachment #2: cpupools-debug-curr-not-idle.diff --]
[-- Type: text/x-diff, Size: 673 bytes --]

diff -r 9ddf07022b3f xen/common/sched_credit.c
--- a/xen/common/sched_credit.c	Wed Feb 09 10:29:53 2011 +0000
+++ b/xen/common/sched_credit.c	Wed Feb 09 10:51:05 2011 +0000
@@ -381,6 +381,14 @@
         per_cpu(schedule_data, cpu).sched_priv = spc;
 
     /* Start off idling... */
+    if ( !is_idle_vcpu(per_cpu(schedule_data, cpu).curr) )
+    {
+        printk("%s: curr d%dv%d on p%d!\n",
+               __func__,
+               per_cpu(schedule_data, cpu).curr->domain->domain_id,
+               per_cpu(schedule_data, cpu).curr->vcpu_id,
+               cpu);
+    }
     BUG_ON(!is_idle_vcpu(per_cpu(schedule_data, cpu).curr));
     cpu_set(cpu, prv->idlers);
 

[-- Attachment #3: Type: text/plain, Size: 138 bytes --]

_______________________________________________
Xen-devel mailing list
Xen-devel@lists.xensource.com
http://lists.xensource.com/xen-devel

^ permalink raw reply	[flat|nested] 53+ messages in thread

* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-02-09 12:27                                             ` George Dunlap
@ 2011-02-09 13:04                                               ` Juergen Gross
  2011-02-09 13:39                                                 ` Andre Przywara
  2011-02-09 13:51                                               ` Andre Przywara
  1 sibling, 1 reply; 53+ messages in thread
From: Juergen Gross @ 2011-02-09 13:04 UTC (permalink / raw)
  To: George Dunlap; +Cc: Andre Przywara, xen-devel, Diestelhorst, Stephan

On 02/09/11 13:27, George Dunlap wrote:
> Sorry, forgot the patch...
>   -G
>
> On Wed, Feb 9, 2011 at 12:27 PM, George Dunlap
> <George.Dunlap@eu.citrix.com>  wrote:
>> On Tue, Feb 8, 2011 at 4:33 PM, Andre Przywara<andre.przywara@amd.com>  wrote:
>>> (XEN) cpu_disable_scheduler: Migrating d0v18 from cpu 24
>>> (XEN) cpu_disable_scheduler: Migrating d0v34 from cpu 24
>>> (XEN) cpu_disable_scheduler: Migrating d0v42 from cpu 24
>>> (XEN) cpu_disable_scheduler: Migrating d0v18 from cpu 25
>>> (XEN) cpu_disable_scheduler: Migrating d0v34 from cpu 25
>>> (XEN) cpu_disable_scheduler: Migrating d0v42 from cpu 25
>>> (XEN) cpu_disable_scheduler: Migrating d0v18 from cpu 26
>>> (XEN) cpu_disable_scheduler: Migrating d0v32 from cpu 26
>>> (XEN) cpu_disable_scheduler: Migrating d0v42 from cpu 26
>>> (XEN) cpu_disable_scheduler: Migrating d0v18 from cpu 27
>>> (XEN) cpu_disable_scheduler: Migrating d0v24 from cpu 27
>>> (XEN) cpu_disable_scheduler: Migrating d0v32 from cpu 27
>>> (XEN) cpu_disable_scheduler: Migrating d0v42 from cpu 27
>>> (XEN) cpu_disable_scheduler: Migrating d0v3 from cpu 28
>>> (XEN) cpu_disable_scheduler: Migrating d0v18 from cpu 28
>>> (XEN) cpu_disable_scheduler: Migrating d0v25 from cpu 28
>>> (XEN) cpu_disable_scheduler: Migrating d0v32 from cpu 28
>>> (XEN) cpu_disable_scheduler: Migrating d0v39 from cpu 28
>>> (XEN) cpu_disable_scheduler: Migrating d0v3 from cpu 29
>>
>> Interesting -- what seems to happen here is that as cpus are disabled,
>> vcpus are "shovelled" in an accumulative fashion from one cpu to the
>> next:
>> * v18,34,42 start on cpu 24.
>> * When 24 is brought down, they're all migrated to 25; then when 25 is
>> brougth down, to 26, then to 27
>> * v24 is running on cpu 27, so when 27 is brought down, v24 is added to the mix
>> * v3 is running on cpu 28, so all of them plus v3 are shoveled onto cpu 29.
>>
>> While that behavior may not be ideal, it should certainly be bug-free.
>>
>> Another interesting thing to note is that the bug happened on pcpu 32,
>> but there were no advertised migrations from that cpu.

If I understand the configuration of Andre's machine correctly, pcpu 32
will be the target of the next migrations. This pcpu is a member of the
next NUMA node, correct?

Could it be that there is a problem with the call of
domain_update_node_affinity() from cpu_disable_scheduler()?

Hmm, I think this could really be the problem.
Andre, could you try the following patch?

diff -r f1fac30a531b xen/common/schedule.c
--- a/xen/common/schedule.c     Wed Feb 09 08:58:11 2011 +0000
+++ b/xen/common/schedule.c     Wed Feb 09 14:02:12 2011 +0100
@@ -491,6 +491,10 @@ int cpu_disable_scheduler(unsigned int c
                          v->domain->domain_id, v->vcpu_id);
                  cpus_setall(v->cpu_affinity);
                  affinity_broken = 1;
+            }
+            if ( cpus_weight(v->cpu_affinity) < NR_CPUS )
+            {
+                cpu_clear(cpu, v->cpu_affinity);
              }

              if ( v->processor == cpu )


Juergen

-- 
Juergen Gross                 Principal Developer Operating Systems
TSP ES&S SWE OS6                       Telephone: +49 (0) 89 3222 2967
Fujitsu Technology Solutions              e-mail: juergen.gross@ts.fujitsu.com
Domagkstr. 28                           Internet: ts.fujitsu.com
D-80807 Muenchen                 Company details: ts.fujitsu.com/imprint.html

^ permalink raw reply	[flat|nested] 53+ messages in thread

* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-02-09 13:04                                               ` Juergen Gross
@ 2011-02-09 13:39                                                 ` Andre Przywara
  0 siblings, 0 replies; 53+ messages in thread
From: Andre Przywara @ 2011-02-09 13:39 UTC (permalink / raw)
  To: Juergen Gross; +Cc: George Dunlap, xen-devel, Diestelhorst, Stephan

Juergen Gross wrote:
>>> Another interesting thing to note is that the bug happened on pcpu 32,
>>> but there were no advertised migrations from that cpu.
> 
> If I understand the configuration of Andre's machine correctly, pcpu32 will
> be the target of the next migrations. This pcpu is member of the next numa
> node, correct?
No, this is a box with 6 cores per node, so the NUMA node spans pcpus 30-35.
> 
> Could it be there is a problem with the call of domain_update_node_affinity()
> from cpu_disable_scheduler() ?
> 
> Hmm, I think this could really be the problem.
> Andre, could you try the following patch?
Sorry, but that one didn't help. It crashed with the well-known BUG_ON:
(XEN) Xen BUG at sched_credit.c:990
(which is the weight assert in csched_acct (c/s 22858))

Regards,
Andre.

> 
> diff -r f1fac30a531b xen/common/schedule.c
> --- a/xen/common/schedule.c     Wed Feb 09 08:58:11 2011 +0000
> +++ b/xen/common/schedule.c     Wed Feb 09 14:02:12 2011 +0100
> @@ -491,6 +491,10 @@ int cpu_disable_scheduler(unsigned int c
>                           v->domain->domain_id, v->vcpu_id);
>                   cpus_setall(v->cpu_affinity);
>                   affinity_broken = 1;
> +            }
> +            if ( cpus_weight(v->cpu_affinity) < NR_CPUS )
> +            {
> +                cpu_clear(cpu, v->cpu_affinity);
>               }
> 
>               if ( v->processor == cpu )
> 
> 
> Juergen
> 


-- 
Andre Przywara
AMD-OSRC (Dresden)
Tel: x29712

^ permalink raw reply	[flat|nested] 53+ messages in thread

* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-02-09 12:27                                             ` George Dunlap
  2011-02-09 13:04                                               ` Juergen Gross
@ 2011-02-09 13:51                                               ` Andre Przywara
  2011-02-09 14:21                                                 ` Juergen Gross
  1 sibling, 1 reply; 53+ messages in thread
From: Andre Przywara @ 2011-02-09 13:51 UTC (permalink / raw)
  To: George Dunlap; +Cc: Juergen Gross, xen-devel, Diestelhorst, Stephan

[-- Attachment #1: Type: text/plain, Size: 2741 bytes --]

George Dunlap wrote:
> <George.Dunlap@eu.citrix.com> wrote:
>> On Tue, Feb 8, 2011 at 4:33 PM, Andre Przywara <andre.przywara@amd.com> wrote:
>>> (XEN) cpu_disable_scheduler: Migrating d0v18 from cpu 24
>>> (XEN) cpu_disable_scheduler: Migrating d0v34 from cpu 24
>>> (XEN) cpu_disable_scheduler: Migrating d0v42 from cpu 24
>>> (XEN) cpu_disable_scheduler: Migrating d0v18 from cpu 25
>>> (XEN) cpu_disable_scheduler: Migrating d0v34 from cpu 25
>>> (XEN) cpu_disable_scheduler: Migrating d0v42 from cpu 25
>>> (XEN) cpu_disable_scheduler: Migrating d0v18 from cpu 26
>>> (XEN) cpu_disable_scheduler: Migrating d0v32 from cpu 26
>>> (XEN) cpu_disable_scheduler: Migrating d0v42 from cpu 26
>>> (XEN) cpu_disable_scheduler: Migrating d0v18 from cpu 27
>>> (XEN) cpu_disable_scheduler: Migrating d0v24 from cpu 27
>>> (XEN) cpu_disable_scheduler: Migrating d0v32 from cpu 27
>>> (XEN) cpu_disable_scheduler: Migrating d0v42 from cpu 27
>>> (XEN) cpu_disable_scheduler: Migrating d0v3 from cpu 28
>>> (XEN) cpu_disable_scheduler: Migrating d0v18 from cpu 28
>>> (XEN) cpu_disable_scheduler: Migrating d0v25 from cpu 28
>>> (XEN) cpu_disable_scheduler: Migrating d0v32 from cpu 28
>>> (XEN) cpu_disable_scheduler: Migrating d0v39 from cpu 28
>>> (XEN) cpu_disable_scheduler: Migrating d0v3 from cpu 29
>> Interesting -- what seems to happen here is that as cpus are disabled,
>> vcpus are "shovelled" in an accumulative fashion from one cpu to the
>> next:
>> * v18,34,42 start on cpu 24.
>> * When 24 is brought down, they're all migrated to 25; then when 25 is
>> brougth down, to 26, then to 27
>> * v24 is running on cpu 27, so when 27 is brought down, v24 is added to the mix
>> * v3 is running on cpu 28, so all of them plus v3 are shoveled onto cpu 29.
>>
>> While that behavior may not be ideal, it should certainly be bug-free.
>>
>> Another interesting thing to note is that the bug happened on pcpu 32,
>> but there were no advertised migrations from that cpu.
>>
>> Andre, can you fold the attached patch into your testing?
Sorry, but that bug (and its output) didn't trigger on two tries.
Instead I now saw two occurrences of the "Migration failed, must retry
later" message. Interestingly enough, it does not seem to be fatal: the
first time it triggers, the numa-split even completes; after I roll it
back and repeat the split, the message shows up again, but the run
crashes later on that old BUG_ON().

See the attached log for more details.

Thanks for the try, anyway.

Regards,
Andre.


>>
>> Thanks for all your work on this.
I am glad for all your help. I am only starting to really understand
the scheduler, so your support is much appreciated.

>>
>>  -George
>>


-- 
Andre Przywara
AMD-Operating System Research Center (OSRC), Dresden, Germany

[-- Attachment #2: george_2_debug.log --]
[-- Type: text/plain, Size: 20965 bytes --]

root@dosorca:/data/images# sh numasplit.sh create
Removing CPUs from Pool 0
(XEN) cpu_disable_scheduler: Migrating d0v29 from cpu 6
(XEN) cpu_disable_scheduler: Migrating d0v7 from cpu 7
(XEN) cpu_disable_scheduler: Migrating d0v29 from cpu 7
(XEN) cpu_disable_scheduler: Migrating d0v7 from cpu 8
(XEN) cpu_disable_scheduler: Migrating d0v29 from cpu 8
(XEN) cpu_disable_scheduler: Migrating d0v24 from cpu 9
(XEN) cpu_disable_scheduler: Migrating d0v24 from cpu 10
(XEN) cpu_disable_scheduler: Migrating d0v24 from cpu 11
Rewriting config file
Creating new pool
Using config file "cpupool.test"
cpupool name:   Pool-node1
scheduler:      credit
number of cpus: 1
Populating new pool
Removing CPUs from Pool 0
(XEN) cpu_disable_scheduler: Migrating d0v24 from cpu 12
(XEN) cpu_disable_scheduler: Migrating d0v34 from cpu 12
(XEN) cpu_disable_scheduler: Migrating d0v24 from cpu 13
(XEN) cpu_disable_scheduler: Migrating d0v34 from cpu 13
(XEN) cpu_disable_scheduler: Migrating d0v24 from cpu 14
(XEN) cpu_disable_scheduler: Migrating d0v34 from cpu 14
(XEN) cpu_disable_scheduler: Migrating d0v0 from cpu 15
(XEN)   Migration failed, must retry later.
(XEN) cpu_disable_scheduler: Migrating d0v24 from cpu 15
(XEN) cpu_disable_scheduler: Migrating d0v34 from cpu 15
(XEN) cpu_disable_scheduler: Migrating d0v15 from cpu 16
(XEN) cpu_disable_scheduler: Migrating d0v24 from cpu 16
(XEN) cpu_disable_scheduler: Migrating d0v34 from cpu 16
(XEN) cpu_disable_scheduler: Migrating d0v44 from cpu 16
(XEN) cpu_disable_scheduler: Migrating d0v8 from cpu 17
(XEN) cpu_disable_scheduler: Migrating d0v15 from cpu 17
(XEN) cpu_disable_scheduler: Migrating d0v24 from cpu 17
(XEN) cpu_disable_scheduler: Migrating d0v34 from cpu 17
(XEN) cpu_disable_scheduler: Migrating d0v44 from cpu 17
Rewriting config file
Creating new pool
Using config file "cpupool.test"
cpupool name:   Pool-node2
scheduler:      credit
number of cpus: 1
Populating new pool
Removing CPUs from Pool 0
(XEN) cpu_disable_scheduler: Migrating d0v7 from cpu 18
(XEN) cpu_disable_scheduler: Migrating d0v1 from cpu 19
(XEN) cpu_disable_scheduler: Migrating d0v41 from cpu 19
(XEN) cpu_disable_scheduler: Migrating d0v1 from cpu 20
(XEN) cpu_disable_scheduler: Migrating d0v25 from cpu 20
(XEN) cpu_disable_scheduler: Migrating d0v41 from cpu 20
(XEN) cpu_disable_scheduler: Migrating d0v1 from cpu 21
(XEN) cpu_disable_scheduler: Migrating d0v25 from cpu 21
(XEN) cpu_disable_scheduler: Migrating d0v41 from cpu 21
(XEN) cpu_disable_scheduler: Migrating d0v1 from cpu 22
(XEN) cpu_disable_scheduler: Migrating d0v25 from cpu 22
(XEN) cpu_disable_scheduler: Migrating d0v38 from cpu 22
(XEN) cpu_disable_scheduler: Migrating d0v1 from cpu 23
(XEN) cpu_disable_scheduler: Migrating d0v20 from cpu 23
(XEN) cpu_disable_scheduler: Migrating d0v38 from cpu 23
Rewriting config file
Creating new pool
Using config file "cpupool.test"
cpupool name:   Pool-node3
scheduler:      credit
number of cpus: 1
Populating new pool
Removing CPUs from Pool 0
(XEN) cpu_disable_scheduler: Migrating d0v12 from cpu 24
(XEN) cpu_disable_scheduler: Migrating d0v30 from cpu 24
(XEN) cpu_disable_scheduler: Migrating d0v5 from cpu 25
(XEN) cpu_disable_scheduler: Migrating d0v30 from cpu 25
(XEN) cpu_disable_scheduler: Migrating d0v5 from cpu 26
(XEN) cpu_disable_scheduler: Migrating d0v16 from cpu 26
(XEN) cpu_disable_scheduler: Migrating d0v28 from cpu 26
(XEN) cpu_disable_scheduler: Migrating d0v44 from cpu 26
(XEN) cpu_disable_scheduler: Migrating d0v5 from cpu 27
(XEN) cpu_disable_scheduler: Migrating d0v16 from cpu 27
(XEN) cpu_disable_scheduler: Migrating d0v28 from cpu 27
(XEN) cpu_disable_scheduler: Migrating d0v34 from cpu 27
(XEN) cpu_disable_scheduler: Migrating d0v41 from cpu 27
(XEN) cpu_disable_scheduler: Migrating d0v5 from cpu 28
(XEN) cpu_disable_scheduler: Migrating d0v16 from cpu 28
(XEN) cpu_disable_scheduler: Migrating d0v22 from cpu 28
(XEN) cpu_disable_scheduler: Migrating d0v29 from cpu 28
(XEN) cpu_disable_scheduler: Migrating d0v38 from cpu 28
(XEN) cpu_disable_scheduler: Migrating d0v5 from cpu 29
(XEN) cpu_disable_scheduler: Migrating d0v14 from cpu 29
(XEN) cpu_disable_scheduler: Migrating d0v26 from cpu 29
(XEN) cpu_disable_scheduler: Migrating d0v29 from cpu 29
(XEN) cpu_disable_scheduler: Migrating d0v39 from cpu 29
Rewriting config file
Creating new pool
Using config file "cpupool.test"
cpupool name:   Pool-node4
scheduler:      credit
number of cpus: 1
Populating new pool
Removing CPUs from Pool 0
(XEN) cpu_disable_scheduler: Migrating d0v22 from cpu 30
(XEN) cpu_disable_scheduler: Migrating d0v40 from cpu 30
(XEN) cpu_disable_scheduler: Migrating d0v3 from cpu 31
(XEN) cpu_disable_scheduler: Migrating d0v20 from cpu 31
(XEN) cpu_disable_scheduler: Migrating d0v40 from cpu 31
(XEN) cpu_disable_scheduler: Migrating d0v3 from cpu 32
(XEN) cpu_disable_scheduler: Migrating d0v20 from cpu 32
(XEN) cpu_disable_scheduler: Migrating d0v40 from cpu 32
(XEN) cpu_disable_scheduler: Migrating d0v3 from cpu 33
(XEN) cpu_disable_scheduler: Migrating d0v20 from cpu 33
(XEN) cpu_disable_scheduler: Migrating d0v35 from cpu 33
(XEN) cpu_disable_scheduler: Migrating d0v3 from cpu 34
(XEN) cpu_disable_scheduler: Migrating d0v20 from cpu 34
(XEN) cpu_disable_scheduler: Migrating d0v35 from cpu 34
(XEN) cpu_disable_scheduler: Migrating d0v3 from cpu 35
(XEN) cpu_disable_scheduler: Migrating d0v14 from cpu 35
(XEN) cpu_disable_scheduler: Migrating d0v26 from cpu 35
(XEN) cpu_disable_scheduler: Migrating d0v35 from cpu 35
Rewriting config file
Creating new pool
Using config file "cpupool.test"
cpupool name:   Pool-node5
scheduler:      credit
number of cpus: 1
Populating new pool
Removing CPUs from Pool 0
(XEN) cpu_disable_scheduler: Migrating d0v14 from cpu 36
(XEN) cpu_disable_scheduler: Migrating d0v45 from cpu 36
(XEN) cpu_disable_scheduler: Migrating d0v5 from cpu 37
(XEN) cpu_disable_scheduler: Migrating d0v14 from cpu 37
(XEN) cpu_disable_scheduler: Migrating d0v22 from cpu 37
(XEN) cpu_disable_scheduler: Migrating d0v45 from cpu 37
(XEN) cpu_disable_scheduler: Migrating d0v3 from cpu 38
(XEN) cpu_disable_scheduler: Migrating d0v13 from cpu 38
(XEN) cpu_disable_scheduler: Migrating d0v22 from cpu 38
(XEN) cpu_disable_scheduler: Migrating d0v28 from cpu 38
(XEN) cpu_disable_scheduler: Migrating d0v41 from cpu 38
(XEN) cpu_disable_scheduler: Migrating d0v3 from cpu 39
(XEN) cpu_disable_scheduler: Migrating d0v13 from cpu 39
(XEN) cpu_disable_scheduler: Migrating d0v18 from cpu 39
(XEN) cpu_disable_scheduler: Migrating d0v26 from cpu 39
(XEN) cpu_disable_scheduler: Migrating d0v31 from cpu 39
(XEN) cpu_disable_scheduler: Migrating d0v38 from cpu 39
(XEN) cpu_disable_scheduler: Migrating d0v3 from cpu 40
(XEN) cpu_disable_scheduler: Migrating d0v13 from cpu 40
(XEN) cpu_disable_scheduler: Migrating d0v18 from cpu 40
(XEN) cpu_disable_scheduler: Migrating d0v25 from cpu 40
(XEN) cpu_disable_scheduler: Migrating d0v31 from cpu 40
(XEN) cpu_disable_scheduler: Migrating d0v38 from cpu 40
(XEN) cpu_disable_scheduler: Migrating d0v3 from cpu 41
(XEN) cpu_disable_scheduler: Migrating d0v13 from cpu 41
(XEN) cpu_disable_scheduler: Migrating d0v20 from cpu 41
(XEN) cpu_disable_scheduler: Migrating d0v25 from cpu 41
(XEN) cpu_disable_scheduler: Migrating d0v31 from cpu 41
(XEN) cpu_disable_scheduler: Migrating d0v38 from cpu 41
(XEN) cpu_disable_scheduler: Migrating d0v42 from cpu 41
Rewriting config file
Creating new pool
Using config file "cpupool.test"
cpupool name:   Pool-node6
scheduler:      credit
number of cpus: 1
Populating new pool
Removing CPUs from Pool 0
(XEN) cpu_disable_scheduler: Migrating d0v8 from cpu 42
(XEN) cpu_disable_scheduler: Migrating d0v25 from cpu 42
(XEN) cpu_disable_scheduler: Migrating d0v35 from cpu 42
(XEN) cpu_disable_scheduler: Migrating d0v46 from cpu 42
(XEN) cpu_disable_scheduler: Migrating d0v0 from cpu 43
(XEN) cpu_disable_scheduler: Migrating d0v8 from cpu 43
(XEN) cpu_disable_scheduler: Migrating d0v12 from cpu 43
(XEN) cpu_disable_scheduler: Migrating d0v19 from cpu 43
(XEN) cpu_disable_scheduler: Migrating d0v25 from cpu 43
(XEN) cpu_disable_scheduler: Migrating d0v31 from cpu 43
(XEN) cpu_disable_scheduler: Migrating d0v43 from cpu 43
(XEN) cpu_disable_scheduler: Migrating d0v0 from cpu 44
(XEN) cpu_disable_scheduler: Migrating d0v8 from cpu 44
(XEN) cpu_disable_scheduler: Migrating d0v15 from cpu 44
(XEN) cpu_disable_scheduler: Migrating d0v23 from cpu 44
(XEN) cpu_disable_scheduler: Migrating d0v31 from cpu 44
(XEN) cpu_disable_scheduler: Migrating d0v40 from cpu 44
(XEN) cpu_disable_scheduler: Migrating d0v0 from cpu 45
(XEN) cpu_disable_scheduler: Migrating d0v8 from cpu 45
(XEN) cpu_disable_scheduler: Migrating d0v13 from cpu 45
(XEN) cpu_disable_scheduler: Migrating d0v21 from cpu 45
(XEN) cpu_disable_scheduler: Migrating d0v31 from cpu 45
(XEN) cpu_disable_scheduler: Migrating d0v37 from cpu 45
(XEN) cpu_disable_scheduler: Migrating d0v42 from cpu 45
(XEN) cpu_disable_scheduler: Migrating d0v8 from cpu 46
(XEN) cpu_disable_scheduler: Migrating d0v21 from cpu 46
(XEN) cpu_disable_scheduler: Migrating d0v24 from cpu 46
(XEN) cpu_disable_scheduler: Migrating d0v31 from cpu 46
(XEN) cpu_disable_scheduler: Migrating d0v42 from cpu 46
(XEN) cpu_disable_scheduler: Migrating d0v12 from cpu 47
(XEN) cpu_disable_scheduler: Migrating d0v16 from cpu 47
(XEN) cpu_disable_scheduler: Migrating d0v24 from cpu 47
(XEN) cpu_disable_scheduler: Migrating d0v33 from cpu 47
(XEN) cpu_disable_scheduler: Migrating d0v42 from cpu 47
(XEN) cpu_disable_scheduler: Migrating d0v43 from cpu 47
Rewriting config file
Creating new pool
Using config file "cpupool.test"
cpupool name:   Pool-node7
scheduler:      credit
number of cpus: 1
Populating new pool
root@dosorca:/data/images# sh numasplit.sh create revert
Destroying Pool 1
adding freed CPUs to pool 0
Destroying Pool 2
adding freed CPUs to pool 0
Destroying Pool 3
adding freed CPUs to pool 0
Destroying Pool 4
adding freed CPUs to pool 0
Destroying Pool 5
adding freed CPUs to pool 0
Destroying Pool 6
adding freed CPUs to pool 0
Destroying Pool 7
adding freed CPUs to pool 0
root@dosorca:/data/images# sh numasplit.sh create
Removing CPUs from Pool 0
(XEN) cpu_disable_scheduler: Migrating d0v6 from cpu 6
(XEN) cpu_disable_scheduler: Migrating d0v31 from cpu 6
(XEN) cpu_disable_scheduler: Migrating d0v6 from cpu 7
(XEN) cpu_disable_scheduler: Migrating d0v14 from cpu 7
(XEN) cpu_disable_scheduler: Migrating d0v31 from cpu 7
(XEN) cpu_disable_scheduler: Migrating d0v6 from cpu 8
(XEN) cpu_disable_scheduler: Migrating d0v14 from cpu 8
(XEN) cpu_disable_scheduler: Migrating d0v22 from cpu 8
(XEN) cpu_disable_scheduler: Migrating d0v31 from cpu 8
(XEN) cpu_disable_scheduler: Migrating d0v6 from cpu 9
(XEN) cpu_disable_scheduler: Migrating d0v14 from cpu 9
(XEN) cpu_disable_scheduler: Migrating d0v22 from cpu 9
(XEN) cpu_disable_scheduler: Migrating d0v31 from cpu 9
(XEN) cpu_disable_scheduler: Migrating d0v6 from cpu 10
(XEN) cpu_disable_scheduler: Migrating d0v22 from cpu 10
(XEN) cpu_disable_scheduler: Migrating d0v41 from cpu 10
(XEN) cpu_disable_scheduler: Migrating d0v6 from cpu 11
(XEN) cpu_disable_scheduler: Migrating d0v17 from cpu 11
(XEN) cpu_disable_scheduler: Migrating d0v41 from cpu 11
Rewriting config file
Creating new pool
Using config file "cpupool.test"
cpupool name:   Pool-node1
scheduler:      credit
number of cpus: 1
Populating new pool
Removing CPUs from Pool 0
(XEN) cpu_disable_scheduler: Migrating d0v6 from cpu 12
(XEN) cpu_disable_scheduler: Migrating d0v41 from cpu 12
(XEN) cpu_disable_scheduler: Migrating d0v6 from cpu 13
(XEN) cpu_disable_scheduler: Migrating d0v29 from cpu 13
(XEN) cpu_disable_scheduler: Migrating d0v41 from cpu 13
(XEN) cpu_disable_scheduler: Migrating d0v6 from cpu 14
(XEN) cpu_disable_scheduler: Migrating d0v19 from cpu 14
(XEN) cpu_disable_scheduler: Migrating d0v29 from cpu 14
(XEN) cpu_disable_scheduler: Migrating d0v41 from cpu 14
(XEN) cpu_disable_scheduler: Migrating d0v6 from cpu 15
(XEN) cpu_disable_scheduler: Migrating d0v19 from cpu 15
(XEN) cpu_disable_scheduler: Migrating d0v29 from cpu 15
(XEN) cpu_disable_scheduler: Migrating d0v37 from cpu 15
(XEN) cpu_disable_scheduler: Migrating d0v41 from cpu 15
(XEN) cpu_disable_scheduler: Migrating d0v6 from cpu 16
(XEN) cpu_disable_scheduler: Migrating d0v19 from cpu 16
(XEN) cpu_disable_scheduler: Migrating d0v29 from cpu 16
(XEN) cpu_disable_scheduler: Migrating d0v37 from cpu 16
(XEN) cpu_disable_scheduler: Migrating d0v6 from cpu 17
(XEN) cpu_disable_scheduler: Migrating d0v19 from cpu 17
(XEN) cpu_disable_scheduler: Migrating d0v29 from cpu 17
(XEN) cpu_disable_scheduler: Migrating d0v37 from cpu 17
Rewriting config file
Creating new pool
Using config file "cpupool.test"
cpupool name:   Pool-node2
scheduler:      credit
number of cpus: 1
Populating new pool
Removing CPUs from Pool 0
(XEN) cpu_disable_scheduler: Migrating d0v6 from cpu 18
(XEN) cpu_disable_scheduler: Migrating d0v29 from cpu 18
(XEN) cpu_disable_scheduler: Migrating d0v6 from cpu 19
(XEN) cpu_disable_scheduler: Migrating d0v29 from cpu 19
(XEN) cpu_disable_scheduler: Migrating d0v6 from cpu 20
(XEN) cpu_disable_scheduler: Migrating d0v21 from cpu 20
(XEN) cpu_disable_scheduler: Migrating d0v29 from cpu 20
(XEN) cpu_disable_scheduler: Migrating d0v6 from cpu 21
(XEN) cpu_disable_scheduler: Migrating d0v21 from cpu 21
(XEN) cpu_disable_scheduler: Migrating d0v29 from cpu 21
(XEN) cpu_disable_scheduler: Migrating d0v6 from cpu 22
(XEN) cpu_disable_scheduler: Migrating d0v21 from cpu 22
(XEN) cpu_disable_scheduler: Migrating d0v29 from cpu 22
(XEN) cpu_disable_scheduler: Migrating d0v6 from cpu 23
(XEN) cpu_disable_scheduler: Migrating d0v21 from cpu 23
(XEN) cpu_disable_scheduler: Migrating d0v29 from cpu 23
Rewriting config file
Creating new pool
Using config file "cpupool.test"
cpupool name:   Pool-node3
scheduler:      credit
number of cpus: 1
Populating new pool
Removing CPUs from Pool 0
(XEN) cpu_disable_scheduler: Migrating d0v29 from cpu 24
(XEN) cpu_disable_scheduler: Migrating d0v7 from cpu 25
(XEN) cpu_disable_scheduler: Migrating d0v17 from cpu 25
(XEN) cpu_disable_scheduler: Migrating d0v29 from cpu 25
(XEN) cpu_disable_scheduler: Migrating d0v7 from cpu 26
(XEN) cpu_disable_scheduler: Migrating d0v17 from cpu 26
(XEN) cpu_disable_scheduler: Migrating d0v29 from cpu 26
(XEN) cpu_disable_scheduler: Migrating d0v43 from cpu 26
(XEN) cpu_disable_scheduler: Migrating d0v7 from cpu 27
(XEN) cpu_disable_scheduler: Migrating d0v17 from cpu 27
(XEN) cpu_disable_scheduler: Migrating d0v24 from cpu 27
(XEN) cpu_disable_scheduler: Migrating d0v39 from cpu 27
(XEN) cpu_disable_scheduler: Migrating d0v7 from cpu 28
(XEN) cpu_disable_scheduler: Migrating d0v17 from cpu 28
(XEN) cpu_disable_scheduler: Migrating d0v24 from cpu 28
(XEN) cpu_disable_scheduler: Migrating d0v39 from cpu 28
(XEN) cpu_disable_scheduler: Migrating d0v7 from cpu 29
(XEN) cpu_disable_scheduler: Migrating d0v17 from cpu 29
(XEN) cpu_disable_scheduler: Migrating d0v24 from cpu 29
(XEN) cpu_disable_scheduler: Migrating d0v38 from cpu 29
Rewriting config file
Creating new pool
Using config file "cpupool.test"
cpupool name:   Pool-node4
scheduler:      credit
number of cpus: 1
Populating new pool
Removing CPUs from Pool 0
(XEN) cpu_disable_scheduler: Migrating d0v21 from cpu 31
(XEN) cpu_disable_scheduler: Migrating d0v21 from cpu 32
(XEN) cpu_disable_scheduler: Migrating d0v46 from cpu 32
(XEN)   Migration failed, must retry later.
(XEN) cpu_disable_scheduler: Migrating d0v14 from cpu 33
(XEN) cpu_disable_scheduler: Migrating d0v8 from cpu 34
(XEN) cpu_disable_scheduler: Migrating d0v18 from cpu 34
(XEN) cpu_disable_scheduler: Migrating d0v28 from cpu 34
(XEN) cpu_disable_scheduler: Migrating d0v4 from cpu 35
(XEN) cpu_disable_scheduler: Migrating d0v18 from cpu 35
(XEN) cpu_disable_scheduler: Migrating d0v28 from cpu 35
Rewriting config file
Creating new pool
Using config file "cpupool.test"
cpupool name:   Pool-node5
scheduler:      credit
number of cpus: 1
Populating new pool
Removing CPUs from Pool 0
(XEN) cpu_disable_scheduler: Migrating d0v1 from cpu 36
(XEN) cpu_disable_scheduler: Migrating d0v15 from cpu 36
(XEN) cpu_disable_scheduler: Migrating d0v35 from cpu 36
(XEN) cpu_disable_scheduler: Migrating d0v44 from cpu 36
(XEN) cpu_disable_scheduler: Migrating d0v1 from cpu 37
(XEN) cpu_disable_scheduler: Migrating d0v15 from cpu 37
(XEN) cpu_disable_scheduler: Migrating d0v35 from cpu 37
(XEN) cpu_disable_scheduler: Migrating d0v44 from cpu 37
(XEN) cpu_disable_scheduler: Migrating d0v1 from cpu 38
(XEN) cpu_disable_scheduler: Migrating d0v15 from cpu 38
(XEN) cpu_disable_scheduler: Migrating d0v28 from cpu 38
(XEN) cpu_disable_scheduler: Migrating d0v35 from cpu 38
(XEN) cpu_disable_scheduler: Migrating d0v44 from cpu 38
(XEN) cpu_disable_scheduler: Migrating d0v1 from cpu 39
(XEN) cpu_disable_scheduler: Migrating d0v13 from cpu 39
(XEN) cpu_disable_scheduler: Migrating d0v24 from cpu 39
(XEN) cpu_disable_scheduler: Migrating d0v28 from cpu 39
(XEN) cpu_disable_scheduler: Migrating d0v34 from cpu 39
(XEN) cpu_disable_scheduler: Migrating d0v39 from cpu 39
(XEN) cpu_disable_scheduler: Migrating d0v0 from cpu 40
(XEN) cpu_disable_scheduler: Migrating d0v13 from cpu 40
(XEN) cpu_disable_scheduler: Migrating d0v24 from cpu 40
(XEN) cpu_disable_scheduler: Migrating d0v34 from cpu 40
(XEN) cpu_disable_scheduler: Migrating d0v39 from cpu 40
(XEN) cpu_disable_scheduler: Migrating d0v47 from cpu 40
(XEN) cpu_disable_scheduler: Migrating d0v0 from cpu 41
(XEN) cpu_disable_scheduler: Migrating d0v8 from cpu 41
(XEN) cpu_disable_scheduler: Migrating d0v13 from cpu 41
(XEN) cpu_disable_scheduler: Migrating d0v19 from cpu 41
(XEN) cpu_disable_scheduler: Migrating d0v26 from cpu 41
(XEN) cpu_disable_scheduler: Migrating d0v34 from cpu 41
(XEN) cpu_disable_scheduler: Migrating d0v45 from cpu 41
Rewriting config file
Creating new pool
Using config file "cpupool.test"
cpupool name:   Pool-node6
scheduler:      credit
number of cpus: 1
Populating new pool
Removing CPUs from Pool 0
(XEN) Xen BUG at sched_credit.c:998
(XEN) ----[ Xen-4.1.0-rc3-pre  x86_64  debug=y  Not tainted ]----
(XEN) CPU:    0
(XEN) RIP:    e008:[<ffff82c48011814d>] csched_acct+0x11f/0x41c
(XEN) RFLAGS: 0000000000010006   CONTEXT: hypervisor
(XEN) rax: 0000000000000010   rbx: 0000000000000f00   rcx: 0000000000000100
(XEN) rdx: 0000000000001000   rsi: ffff830437ffa600   rdi: 0000000000000010
(XEN) rbp: ffff82c480297e38   rsp: ffff82c480297da8   r8:  0000000000000100
(XEN) r9:  0000000000000007   r10: ffff82c4802cbfe0   r11: 0000009d467684b5
(XEN) r12: ffff830437ffa5e0   r13: ffff82c48011802e   r14: ffff830433af2018
(XEN) r15: ffff830434321ec0   cr0: 000000008005003b   cr4: 00000000000006f0
(XEN) cr3: 00000008067e3000   cr2: 00007f5a56dce590
(XEN) ds: 0000   es: 0000   fs: 0000   gs: 0000   ss: e010   cs: e008
(XEN) Xen stack trace from rsp=ffff82c480297da8:
(XEN)    ffff82c480297dc8 fffffed480153c5e ffff830400000eff ffff830437ffa5e0
(XEN)    ffff830437ffa5e8 ffff82c480153ced ffff830437ffa5e0 0000000000000292
(XEN)    ffff830437ffa5e8 00000e1034322000 00000f0000000f00 0000000000000000
(XEN)    ffff82c400000000 ffff82c4802d3f80 ffff830437ffa5e0 ffff82c48011802e
(XEN)    ffff830433af2018 ffff830433af2010 ffff82c480297e68 ffff82c480125fc4
(XEN)    0000000000000002 ffff830437ffa600 ffff82c4802d3f80 0000009d44e55c65
(XEN)    ffff82c480297eb8 ffff82c4801262e9 0000000000000001 ffff82c4802d3f80
(XEN)    ffff830433af2010 0000000000000000 0000000000000000 ffff82c4802b0880
(XEN)    ffff82c480297f18 ffffffffffffffff ffff82c480297ef8 ffff82c4801233b7
(XEN)    ffff82c480297ed8 ffff8300c7e0a000 ffff88007ce88f40 ffff8817a7c47000
(XEN)    0000000000000001 000000000000002c ffff82c480297f08 ffff82c480123432
(XEN)    00007d3b7fd680c7 ffff82c480207d16 000000000000002c 0000000000000001
(XEN)    ffff8817a7c47000 ffff88007ce88f40 ffff88179f9ffce8 ffff8817a84df000
(XEN)    0000000000000286 000000000000000f ffff88179f9ffcf8 ffff88007ce898c0
(XEN)    0000000000000000 ffffffff8100940a ffff88007ce75000 00000000deadbeef
(XEN)    00000000deadbeef 0000010000000000 ffffffff8100940a 000000000000e033
(XEN)    0000000000000286 ffff88179f9ffca0 000000000000e02b 0000000000000000
(XEN)    0000000000000000 0000000000000000 0000000000000000 0000000000000000
(XEN)    ffff8300c7e0a000 0000000000000000 0000000000000000
(XEN) Xen call trace:
(XEN)    [<ffff82c48011814d>] csched_acct+0x11f/0x41c
(XEN)    [<ffff82c480125fc4>] execute_timer+0x4e/0x6c
(XEN)    [<ffff82c4801262e9>] timer_softirq_action+0xf2/0x245
(XEN)    [<ffff82c4801233b7>] __do_softirq+0x88/0x99
(XEN)    [<ffff82c480123432>] do_softirq+0x6a/0x7a
(XEN)    
(XEN) 
(XEN) ****************************************
(XEN) Panic on CPU 0:
(XEN) Xen BUG at sched_credit.c:998
(XEN) ****************************************
(XEN) 
(XEN) Reboot in five seconds...
(XEN) Resetting with ACPI MEMORY or I/O RESET_REG.

[-- Attachment #3: Type: text/plain, Size: 138 bytes --]

_______________________________________________
Xen-devel mailing list
Xen-devel@lists.xensource.com
http://lists.xensource.com/xen-devel

^ permalink raw reply	[flat|nested] 53+ messages in thread

* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-02-09 13:51                                               ` Andre Przywara
@ 2011-02-09 14:21                                                 ` Juergen Gross
  2011-02-10  6:42                                                   ` Juergen Gross
  0 siblings, 1 reply; 53+ messages in thread
From: Juergen Gross @ 2011-02-09 14:21 UTC (permalink / raw)
  To: Andre Przywara; +Cc: George Dunlap, xen-devel, Diestelhorst, Stephan

Andre, George,


What seems interesting: I think the problem always occurred when a new
cpupool was created and the first cpu was moved to it.

I think my previous assumption regarding the master_ticker was not too bad.
Somehow the master_ticker of the new cpupool seems to become active before
the scheduler is really initialized properly. This could happen if enough
time passes between alloc_pdata for the cpu to be moved and the critical
section in schedule_cpu_switch().

The solution should be to activate the timers only once the scheduler is
ready for them.

George, do you think the master_ticker should be stopped in suspend_ticker
as well? I still see potential problems when entering deep C-states. I think
I'll prepare a patch which keeps the master_ticker active for the C-state
case and migrates it for the schedule_cpu_switch() case.
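Roughly, this is the direction I have in mind (just a sketch, not the final
patch - alloc_pdata would no longer arm the timers itself, the resume path
would do it once the cpu is really taken over, and the free_pdata path still
has to migrate the master ticker):

    /* Sketch only: master_active would be a new field in csched_private. */
    static void csched_tick_resume(const struct scheduler *ops, unsigned int cpu)
    {
        struct csched_private *prv = CSCHED_PRIV(ops);
        struct csched_pcpu *spc = CSCHED_PCPU(cpu);
        uint64_t now = NOW();

        /* Re-arm the per-cpu tick as today. */
        set_timer(&spc->ticker, now + MILLISECS(CSCHED_MSECS_PER_TICK)
                  - now % MILLISECS(CSCHED_MSECS_PER_TICK));

        /* Only the master cpu arms the accounting timer, and only once
         * the scheduler really owns this cpu. */
        if ( (prv->master == cpu) && !prv->master_active )
        {
            set_timer(&prv->master_ticker, now +
                      MILLISECS(CSCHED_MSECS_PER_TICK) * CSCHED_TICKS_PER_ACCT);
            prv->master_active = 1;
        }
    }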


Juergen

On 02/09/11 14:51, Andre Przywara wrote:
> George Dunlap wrote:
>> <George.Dunlap@eu.citrix.com> wrote:
>>> On Tue, Feb 8, 2011 at 4:33 PM, Andre Przywara
>>> <andre.przywara@amd.com> wrote:
>>>> (XEN) cpu_disable_scheduler: Migrating d0v18 from cpu 24
>>>> (XEN) cpu_disable_scheduler: Migrating d0v34 from cpu 24
>>>> (XEN) cpu_disable_scheduler: Migrating d0v42 from cpu 24
>>>> (XEN) cpu_disable_scheduler: Migrating d0v18 from cpu 25
>>>> (XEN) cpu_disable_scheduler: Migrating d0v34 from cpu 25
>>>> (XEN) cpu_disable_scheduler: Migrating d0v42 from cpu 25
>>>> (XEN) cpu_disable_scheduler: Migrating d0v18 from cpu 26
>>>> (XEN) cpu_disable_scheduler: Migrating d0v32 from cpu 26
>>>> (XEN) cpu_disable_scheduler: Migrating d0v42 from cpu 26
>>>> (XEN) cpu_disable_scheduler: Migrating d0v18 from cpu 27
>>>> (XEN) cpu_disable_scheduler: Migrating d0v24 from cpu 27
>>>> (XEN) cpu_disable_scheduler: Migrating d0v32 from cpu 27
>>>> (XEN) cpu_disable_scheduler: Migrating d0v42 from cpu 27
>>>> (XEN) cpu_disable_scheduler: Migrating d0v3 from cpu 28
>>>> (XEN) cpu_disable_scheduler: Migrating d0v18 from cpu 28
>>>> (XEN) cpu_disable_scheduler: Migrating d0v25 from cpu 28
>>>> (XEN) cpu_disable_scheduler: Migrating d0v32 from cpu 28
>>>> (XEN) cpu_disable_scheduler: Migrating d0v39 from cpu 28
>>>> (XEN) cpu_disable_scheduler: Migrating d0v3 from cpu 29
>>> Interesting -- what seems to happen here is that as cpus are disabled,
>>> vcpus are "shovelled" in an accumulative fashion from one cpu to the
>>> next:
>>> * v18,34,42 start on cpu 24.
>>> * When 24 is brought down, they're all migrated to 25; then when 25 is
>>> brougth down, to 26, then to 27
>>> * v24 is running on cpu 27, so when 27 is brought down, v24 is added
>>> to the mix
>>> * v3 is running on cpu 28, so all of them plus v3 are shoveled onto
>>> cpu 29.
>>>
>>> While that behavior may not be ideal, it should certainly be bug-free.
>>>
>>> Another interesting thing to note is that the bug happened on pcpu 32,
>>> but there were no advertised migrations from that cpu.
>>>
>>> Andre, can you fold the attached patch into your testing?
> Sorry, but that bug (and its output) didn't trigger on two tries.
> Instead I now saw two occasions of the "migration failed, must retry
> later" message. Interestingly enough is does not seem to be fatal. The
> first time it triggers, the numa-split even completes, then after I roll
> it back and repeat it it shows again, but crashes later on that old
> BUG_ON().
>
> See the attached log for more details.
>
> Thanks for the try, anyway.
>
> Regards,
> Andre.
>
>
>>>
>>> Thanks for all your work on this.
> I am glad for all your help. I only start to really understand the
> scheduler, so your support is much appreciated.
>
>>>
>>> -George
>>>
>
>


-- 
Juergen Gross                 Principal Developer Operating Systems
TSP ES&S SWE OS6                       Telephone: +49 (0) 89 3222 2967
Fujitsu Technology Solutions              e-mail: juergen.gross@ts.fujitsu.com
Domagkstr. 28                           Internet: ts.fujitsu.com
D-80807 Muenchen                 Company details: ts.fujitsu.com/imprint.html

^ permalink raw reply	[flat|nested] 53+ messages in thread

* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-02-09 14:21                                                 ` Juergen Gross
@ 2011-02-10  6:42                                                   ` Juergen Gross
  2011-02-10  9:25                                                     ` Andre Przywara
  0 siblings, 1 reply; 53+ messages in thread
From: Juergen Gross @ 2011-02-10  6:42 UTC (permalink / raw)
  To: Andre Przywara; +Cc: George Dunlap, xen-devel, Diestelhorst, Stephan

[-- Attachment #1: Type: text/plain, Size: 1428 bytes --]

On 02/09/11 15:21, Juergen Gross wrote:
> Andre, George,
>
>
> What seems to be interesting: I think the problem did always occur when
> a new cpupool was created and the first cpu was moved to it.
>
> I think my previous assumption regarding the master_ticker was not too bad.
> I think somehow the master_ticker of the new cpupool is becoming active
> before the scheduler is really initialized properly. This could happen, if
> enough time is spent between alloc_pdata for the cpu to be moved and the
> critical section in schedule_cpu_switch().
>
> The solution should be to activate the timers only if the scheduler is
> ready for them.
>
> George, do you think the master_ticker should be stopped in suspend_ticker
> as well? I still see potential problems for entering deep C-States. I think
> I'll prepare a patch which will keep the master_ticker active for the
> C-State case and migrate it for the schedule_cpu_switch() case.

Okay, here is a patch for this. It ran on my 4-core machine without any
problems.
Andre, could you give it a try?


Juergen

-- 
Juergen Gross                 Principal Developer Operating Systems
TSP ES&S SWE OS6                       Telephone: +49 (0) 89 3222 2967
Fujitsu Technology Solutions              e-mail: juergen.gross@ts.fujitsu.com
Domagkstr. 28                           Internet: ts.fujitsu.com
D-80807 Muenchen                 Company details: ts.fujitsu.com/imprint.html

[-- Attachment #2: ticker.patch --]
[-- Type: text/x-patch, Size: 5447 bytes --]

diff -r 1967c7c290eb xen/common/sched_credit.c
--- a/xen/common/sched_credit.c	Wed Feb 09 12:03:09 2011 +0000
+++ b/xen/common/sched_credit.c	Thu Feb 10 07:39:27 2011 +0100
@@ -50,6 +50,8 @@
     (CSCHED_CREDITS_PER_MSEC * CSCHED_MSECS_PER_TSLICE)
 #define CSCHED_CREDITS_PER_ACCT     \
     (CSCHED_CREDITS_PER_MSEC * CSCHED_MSECS_PER_TICK * CSCHED_TICKS_PER_ACCT)
+#define CSCHED_ACCT_TSLICE          \
+    (MILLISECS(CSCHED_MSECS_PER_TICK) * CSCHED_TICKS_PER_ACCT)
 
 
 /*
@@ -170,6 +172,7 @@ struct csched_private {
     uint32_t ncpus;
     struct timer  master_ticker;
     unsigned int master;
+    int master_active;
     cpumask_t idlers;
     cpumask_t cpus;
     uint32_t weight;
@@ -320,6 +323,7 @@ csched_free_pdata(const struct scheduler
     struct csched_private *prv = CSCHED_PRIV(ops);
     struct csched_pcpu *spc = pcpu;
     unsigned long flags;
+    uint64_t now = NOW();
 
     if ( spc == NULL )
         return;
@@ -334,10 +338,16 @@ csched_free_pdata(const struct scheduler
     {
         prv->master = first_cpu(prv->cpus);
         migrate_timer(&prv->master_ticker, prv->master);
+        if ( prv->master_active )
+            set_timer(&prv->master_ticker, now + CSCHED_ACCT_TSLICE
+                - now % CSCHED_ACCT_TSLICE);
     }
     kill_timer(&spc->ticker);
     if ( prv->ncpus == 0 )
+    {
         kill_timer(&prv->master_ticker);
+        prv->master_active = 0;
+    }
 
     spin_unlock_irqrestore(&prv->lock, flags);
 
@@ -367,12 +377,10 @@ csched_alloc_pdata(const struct schedule
     {
         prv->master = cpu;
         init_timer(&prv->master_ticker, csched_acct, prv, cpu);
-        set_timer(&prv->master_ticker, NOW() +
-                  MILLISECS(CSCHED_MSECS_PER_TICK) * CSCHED_TICKS_PER_ACCT);
+        prv->master_active = 0;
     }
 
     init_timer(&spc->ticker, csched_tick, (void *)(unsigned long)cpu, cpu);
-    set_timer(&spc->ticker, NOW() + MILLISECS(CSCHED_MSECS_PER_TICK));
 
     INIT_LIST_HEAD(&spc->runq);
     spc->runq_sort_last = prv->runq_sort;
@@ -1138,8 +1146,7 @@ csched_acct(void* dummy)
     prv->runq_sort++;
 
 out:
-    set_timer( &prv->master_ticker, NOW() +
-            MILLISECS(CSCHED_MSECS_PER_TICK) * CSCHED_TICKS_PER_ACCT );
+    set_timer( &prv->master_ticker, NOW() + CSCHED_ACCT_TSLICE );
 }
 
 static void
@@ -1529,24 +1536,39 @@ csched_deinit(const struct scheduler *op
         xfree(prv);
 }
 
-static void csched_tick_suspend(const struct scheduler *ops, unsigned int cpu)
+static void csched_tick_suspend(const struct scheduler *ops, unsigned int cpu, int temp)
 {
+    struct csched_private *prv;
     struct csched_pcpu *spc;
 
+    prv = CSCHED_PRIV(ops);
     spc = CSCHED_PCPU(cpu);
 
     stop_timer(&spc->ticker);
+    if ( (prv->master == cpu) && !temp )
+    {
+        prv->master = cycle_cpu(prv->master, prv->cpus);
+        migrate_timer(&prv->master_ticker, prv->master);
+    }
 }
 
 static void csched_tick_resume(const struct scheduler *ops, unsigned int cpu)
 {
+    struct csched_private *prv;
     struct csched_pcpu *spc;
     uint64_t now = NOW();
 
+    prv = CSCHED_PRIV(ops);
     spc = CSCHED_PCPU(cpu);
 
     set_timer(&spc->ticker, now + MILLISECS(CSCHED_MSECS_PER_TICK)
             - now % MILLISECS(CSCHED_MSECS_PER_TICK) );
+    if ( (prv->master == cpu) && !prv->master_active )
+    {
+        set_timer(&prv->master_ticker, now + CSCHED_ACCT_TSLICE
+            - now % CSCHED_ACCT_TSLICE);
+        prv->master_active = 1;
+    }
 }
 
 static struct csched_private _csched_priv;
diff -r 1967c7c290eb xen/common/schedule.c
--- a/xen/common/schedule.c	Wed Feb 09 12:03:09 2011 +0000
+++ b/xen/common/schedule.c	Thu Feb 10 07:39:27 2011 +0100
@@ -1208,6 +1208,8 @@ static int cpu_schedule_up(unsigned int 
     if ( (ops.alloc_pdata != NULL) &&
          ((sd->sched_priv = ops.alloc_pdata(&ops, cpu)) == NULL) )
         return -ENOMEM;
+    if ( ops.tick_resume != NULL )
+        ops.tick_resume(&ops, cpu);
 
     return 0;
 }
@@ -1286,6 +1288,8 @@ void __init scheduler_init(void)
     if ( ops.alloc_pdata &&
          !(this_cpu(schedule_data).sched_priv = ops.alloc_pdata(&ops, 0)) )
         BUG();
+    if ( ops.tick_resume != NULL )
+        ops.tick_resume(&ops, 0);
 }
 
 int schedule_cpu_switch(unsigned int cpu, struct cpupool *c)
@@ -1312,7 +1316,7 @@ int schedule_cpu_switch(unsigned int cpu
 
     pcpu_schedule_lock_irqsave(cpu, flags);
 
-    SCHED_OP(old_ops, tick_suspend, cpu);
+    SCHED_OP(old_ops, tick_suspend, cpu, 0);
     vpriv_old = idle->sched_priv;
     idle->sched_priv = vpriv;
     per_cpu(scheduler, cpu) = new_ops;
@@ -1392,7 +1396,7 @@ void sched_tick_suspend(void)
     unsigned int cpu = smp_processor_id();
 
     sched = per_cpu(scheduler, cpu);
-    SCHED_OP(sched, tick_suspend, cpu);
+    SCHED_OP(sched, tick_suspend, cpu, 1);
 }
 
 void sched_tick_resume(void)
diff -r 1967c7c290eb xen/include/xen/sched-if.h
--- a/xen/include/xen/sched-if.h	Wed Feb 09 12:03:09 2011 +0000
+++ b/xen/include/xen/sched-if.h	Thu Feb 10 07:39:27 2011 +0100
@@ -175,7 +175,7 @@ struct scheduler {
     void         (*dump_settings)  (const struct scheduler *);
     void         (*dump_cpu_state) (const struct scheduler *, int);
 
-    void         (*tick_suspend)    (const struct scheduler *, unsigned int);
+    void         (*tick_suspend)    (const struct scheduler *, unsigned int, int);
     void         (*tick_resume)     (const struct scheduler *, unsigned int);
 };
 

[-- Attachment #3: Type: text/plain, Size: 138 bytes --]

_______________________________________________
Xen-devel mailing list
Xen-devel@lists.xensource.com
http://lists.xensource.com/xen-devel

^ permalink raw reply	[flat|nested] 53+ messages in thread

* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-02-10  6:42                                                   ` Juergen Gross
@ 2011-02-10  9:25                                                     ` Andre Przywara
  2011-02-10 14:18                                                       ` Andre Przywara
  0 siblings, 1 reply; 53+ messages in thread
From: Andre Przywara @ 2011-02-10  9:25 UTC (permalink / raw)
  To: Juergen Gross; +Cc: George Dunlap, xen-devel, Diestelhorst, Stephan

On 02/10/2011 07:42 AM, Juergen Gross wrote:
> On 02/09/11 15:21, Juergen Gross wrote:
>> Andre, George,
>>
>>
>> What seems to be interesting: I think the problem did always occur when
>> a new cpupool was created and the first cpu was moved to it.
>>
>> I think my previous assumption regarding the master_ticker was not too bad.
>> I think somehow the master_ticker of the new cpupool is becoming active
>> before the scheduler is really initialized properly. This could happen, if
>> enough time is spent between alloc_pdata for the cpu to be moved and the
>> critical section in schedule_cpu_switch().
>>
>> The solution should be to activate the timers only if the scheduler is
>> ready for them.
>>
>> George, do you think the master_ticker should be stopped in suspend_ticker
>> as well? I still see potential problems for entering deep C-States. I think
>> I'll prepare a patch which will keep the master_ticker active for the
>> C-State case and migrate it for the schedule_cpu_switch() case.
>
> Okay, here is a patch for this. It ran on my 4-core machine without any
> problems.
> Andre, could you give it a try?
Did, but unfortunately it crashed as always. Tried twice and made sure I 
booted the right kernel. Sorry.
The idea of a race between the timer and the state change sounded very
appealing; actually, that was suspicious to me from the beginning.

I will add some code that dumps the state of all cpupools when the BUG_ON
triggers, so we can see which situation we are in when the bug hits.

Regards,
Andre.

-- 
Andre Przywara
AMD-OSRC (Dresden)
Tel: x29712

^ permalink raw reply	[flat|nested] 53+ messages in thread

* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-02-10  9:25                                                     ` Andre Przywara
@ 2011-02-10 14:18                                                       ` Andre Przywara
  2011-02-11  6:17                                                         ` Juergen Gross
  0 siblings, 1 reply; 53+ messages in thread
From: Andre Przywara @ 2011-02-10 14:18 UTC (permalink / raw)
  To: Juergen Gross; +Cc: George Dunlap, xen-devel, Diestelhorst, Stephan

Andre Przywara wrote:
> On 02/10/2011 07:42 AM, Juergen Gross wrote:
>> On 02/09/11 15:21, Juergen Gross wrote:
>>> Andre, George,
>>>
>>>
>>> What seems to be interesting: I think the problem did always occur when
>>> a new cpupool was created and the first cpu was moved to it.
>>>
>>> I think my previous assumption regarding the master_ticker was not too bad.
>>> I think somehow the master_ticker of the new cpupool is becoming active
>>> before the scheduler is really initialized properly. This could happen, if
>>> enough time is spent between alloc_pdata for the cpu to be moved and the
>>> critical section in schedule_cpu_switch().
>>>
>>> The solution should be to activate the timers only if the scheduler is
>>> ready for them.
>>>
>>> George, do you think the master_ticker should be stopped in suspend_ticker
>>> as well? I still see potential problems for entering deep C-States. I think
>>> I'll prepare a patch which will keep the master_ticker active for the
>>> C-State case and migrate it for the schedule_cpu_switch() case.
>> Okay, here is a patch for this. It ran on my 4-core machine without any
>> problems.
>> Andre, could you give it a try?
> Did, but unfortunately it crashed as always. Tried twice and made sure I 
> booted the right kernel. Sorry.
> The idea with the race between the timer and the state changing sounded 
> very appealing, actually that was suspicious to me from the beginning.
> 
> I will add some code to dump the state of all cpupools to the BUG_ON to 
> see in which situation we are when the bug triggers.
OK, here is a first try of this: the patch iterates over all CPU pools
and prints some data when the BUG_ON condition
((sdom->weight * sdom->active_vcpu_count) > weight_left) triggers:
(XEN) CPU pool #0: 1 domains (SMP Credit Scheduler), mask: fffffffc003f
(XEN) CPU pool #1: 0 domains (SMP Credit Scheduler), mask: fc0
(XEN) CPU pool #2: 0 domains (SMP Credit Scheduler), mask: 1000
(XEN) Xen BUG at sched_credit.c:1010
....
The masks look proper (6 cores per node); the bug triggers when the
first CPU is about to be(?) inserted.
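The dump itself is nothing special; it roughly does the following right
before the assert (treat the snippet as a sketch: the field names are from
memory, and IIRC cpupool_list had to be made reachable from sched_credit.c,
since it is not exported in the stock tree):

    static void dump_all_cpupools(void)
    {
        struct cpupool *c;

        /* Walk all pools; print id, #domains, scheduler name and cpu mask. */
        for ( c = cpupool_list; c != NULL; c = c->next )
            printk("CPU pool #%d: %u domains (%s), mask: %lx\n",
                   c->cpupool_id, c->n_dom, c->sched->name,
                   cpus_addr(c->cpu_valid)[0]);
    }

    /* in csched_acct(), just before the existing check: */
    if ( (sdom->weight * sdom->active_vcpu_count) > weight_left )
        dump_all_cpupools();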

HTH,
Andre.

-- 
Andre Przywara
AMD-Operating System Research Center (OSRC), Dresden, Germany

^ permalink raw reply	[flat|nested] 53+ messages in thread

* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-02-10 14:18                                                       ` Andre Przywara
@ 2011-02-11  6:17                                                         ` Juergen Gross
  2011-02-11  7:39                                                           ` Andre Przywara
  0 siblings, 1 reply; 53+ messages in thread
From: Juergen Gross @ 2011-02-11  6:17 UTC (permalink / raw)
  To: Andre Przywara; +Cc: George Dunlap, xen-devel, Diestelhorst, Stephan

On 02/10/11 15:18, Andre Przywara wrote:
> Andre Przywara wrote:
>> On 02/10/2011 07:42 AM, Juergen Gross wrote:
>>> On 02/09/11 15:21, Juergen Gross wrote:
>>>> Andre, George,
>>>>
>>>>
>>>> What seems to be interesting: I think the problem did always occur when
>>>> a new cpupool was created and the first cpu was moved to it.
>>>>
>>>> I think my previous assumption regarding the master_ticker was not
>>>> too bad.
>>>> I think somehow the master_ticker of the new cpupool is becoming active
>>>> before the scheduler is really initialized properly. This could
>>>> happen, if
>>>> enough time is spent between alloc_pdata for the cpu to be moved and
>>>> the
>>>> critical section in schedule_cpu_switch().
>>>>
>>>> The solution should be to activate the timers only if the scheduler is
>>>> ready for them.
>>>>
>>>> George, do you think the master_ticker should be stopped in
>>>> suspend_ticker
>>>> as well? I still see potential problems for entering deep C-States.
>>>> I think
>>>> I'll prepare a patch which will keep the master_ticker active for the
>>>> C-State case and migrate it for the schedule_cpu_switch() case.
>>> Okay, here is a patch for this. It ran on my 4-core machine without any
>>> problems.
>>> Andre, could you give it a try?
>> Did, but unfortunately it crashed as always. Tried twice and made sure
>> I booted the right kernel. Sorry.
>> The idea with the race between the timer and the state changing
>> sounded very appealing, actually that was suspicious to me from the
>> beginning.
>>
>> I will add some code to dump the state of all cpupools to the BUG_ON
>> to see in which situation we are when the bug triggers.
> OK, here is a first try of this, the patch iterates over all CPU pools
> and outputs some data if the BUG_ON
> ((sdom->weight * sdom->active_vcpu_count) > weight_left) condition
> triggers:
> (XEN) CPU pool #0: 1 domains (SMP Credit Scheduler), mask: fffffffc003f
> (XEN) CPU pool #1: 0 domains (SMP Credit Scheduler), mask: fc0
> (XEN) CPU pool #2: 0 domains (SMP Credit Scheduler), mask: 1000
> (XEN) Xen BUG at sched_credit.c:1010
> ....
> The masks look proper (6 cores per node), the bug triggers when the
> first CPU is about to be(?) inserted.

Sure? I'm missing the cpu with mask 2000.
I'll try to reproduce the problem on a larger machine here (24 cores, 4 numa
nodes).
Andre, can you give me your xen boot parameters? Which xen changeset are you
running, and do you have any additional patches in use?


Juergen

-- 
Juergen Gross                 Principal Developer Operating Systems
TSP ES&S SWE OS6                       Telephone: +49 (0) 89 3222 2967
Fujitsu Technology Solutions              e-mail: juergen.gross@ts.fujitsu.com
Domagkstr. 28                           Internet: ts.fujitsu.com
D-80807 Muenchen                 Company details: ts.fujitsu.com/imprint.html

^ permalink raw reply	[flat|nested] 53+ messages in thread

* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-02-11  6:17                                                         ` Juergen Gross
@ 2011-02-11  7:39                                                           ` Andre Przywara
  2011-02-14 17:57                                                             ` George Dunlap
  0 siblings, 1 reply; 53+ messages in thread
From: Andre Przywara @ 2011-02-11  7:39 UTC (permalink / raw)
  To: Juergen Gross; +Cc: George Dunlap, xen-devel, Diestelhorst, Stephan

Juergen Gross wrote:
> On 02/10/11 15:18, Andre Przywara wrote:
>> Andre Przywara wrote:
>>> On 02/10/2011 07:42 AM, Juergen Gross wrote:
>>>> On 02/09/11 15:21, Juergen Gross wrote:
>>>>> Andre, George,
>>>>>
>>>>>
>>>>> What seems to be interesting: I think the problem did always occur when
>>>>> a new cpupool was created and the first cpu was moved to it.
>>>>>
>>>>> I think my previous assumption regarding the master_ticker was not
>>>>> too bad.
>>>>> I think somehow the master_ticker of the new cpupool is becoming active
>>>>> before the scheduler is really initialized properly. This could
>>>>> happen, if
>>>>> enough time is spent between alloc_pdata for the cpu to be moved and
>>>>> the
>>>>> critical section in schedule_cpu_switch().
>>>>>
>>>>> The solution should be to activate the timers only if the scheduler is
>>>>> ready for them.
>>>>>
>>>>> George, do you think the master_ticker should be stopped in
>>>>> suspend_ticker
>>>>> as well? I still see potential problems for entering deep C-States.
>>>>> I think
>>>>> I'll prepare a patch which will keep the master_ticker active for the
>>>>> C-State case and migrate it for the schedule_cpu_switch() case.
>>>> Okay, here is a patch for this. It ran on my 4-core machine without any
>>>> problems.
>>>> Andre, could you give it a try?
>>> Did, but unfortunately it crashed as always. Tried twice and made sure
>>> I booted the right kernel. Sorry.
>>> The idea with the race between the timer and the state changing
>>> sounded very appealing, actually that was suspicious to me from the
>>> beginning.
>>>
>>> I will add some code to dump the state of all cpupools to the BUG_ON
>>> to see in which situation we are when the bug triggers.
>> OK, here is a first try of this, the patch iterates over all CPU pools
>> and outputs some data if the BUG_ON
>> ((sdom->weight * sdom->active_vcpu_count) > weight_left) condition
>> triggers:
>> (XEN) CPU pool #0: 1 domains (SMP Credit Scheduler), mask: fffffffc003f
>> (XEN) CPU pool #1: 0 domains (SMP Credit Scheduler), mask: fc0
>> (XEN) CPU pool #2: 0 domains (SMP Credit Scheduler), mask: 1000
>> (XEN) Xen BUG at sched_credit.c:1010
>> ....
>> The masks look proper (6 cores per node), the bug triggers when the
>> first CPU is about to be(?) inserted.
> 
> Sure? I'm missing the cpu with mask 2000.
> I'll try to reproduce the problem on a larger machine here (24 cores, 4 numa
> nodes).
> Andre, can you give me your xen boot parameters? Which xen changeset are you
> running, and do you have any additional patches in use?

The grub lines:
kernel (hd1,0)/boot/xen-22858_debug_04.gz console=com1,vga com1=115200
module (hd1,0)/boot/vmlinuz-2.6.32.27_pvops console=tty0 
console=ttyS0,115200 ro root=/dev/sdb1 xencons=hvc0

All of my experiments use c/s 22858 as a base.
If you use an AMD Magny-Cours box for your experiments (socket C32 or 
G34), you should add the following patch (removing the line):
--- a/xen/arch/x86/traps.c
+++ b/xen/arch/x86/traps.c
@@ -803,7 +803,6 @@ static void pv_cpuid(struct cpu_user_regs *regs)
          __clear_bit(X86_FEATURE_SKINIT % 32, &c);
          __clear_bit(X86_FEATURE_WDT % 32, &c);
          __clear_bit(X86_FEATURE_LWP % 32, &c);
-        __clear_bit(X86_FEATURE_NODEID_MSR % 32, &c);
          __clear_bit(X86_FEATURE_TOPOEXT % 32, &c);
          break;
      case 5: /* MONITOR/MWAIT */

This is not necessary (in fact it reverts my patch c/s 22815), but it 
raises the probability of triggering the bug, probably because it increases 
the pressure on the Dom0 scheduler. If you cannot trigger it with Dom0, 
try to create a guest with many VCPUs and squeeze it into a small CPU pool.

Good luck ;-)
Andre.

-- 
Andre Przywara
AMD-OSRC (Dresden)
Tel: x29712

^ permalink raw reply	[flat|nested] 53+ messages in thread

* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-02-11  7:39                                                           ` Andre Przywara
@ 2011-02-14 17:57                                                             ` George Dunlap
  2011-02-15  7:22                                                               ` Juergen Gross
  0 siblings, 1 reply; 53+ messages in thread
From: George Dunlap @ 2011-02-14 17:57 UTC (permalink / raw)
  To: Andre Przywara; +Cc: Juergen Gross, xen-devel, Diestelhorst, Stephan

[-- Attachment #1: Type: text/plain, Size: 5299 bytes --]

The good news is, I've managed to reproduce this on my local test
hardware with 1x4x2 (1 socket, 4 cores, 2 threads per core) using the
attached script.  It's time to go home now, but I should be able to
dig something up tomorrow.

To use the script:
* Rename cpupool0 to "p0", and create an empty second pool, "p1"
* You can modify parameters by adding "arg=val" as arguments.
* Arguments are:
 + dryrun={true,false} Do the work, but don't actually execute any xl
arguments.  Default false.
 + left: Number commands to execute.  Default 10.
 + maxcpus: highest numerical value for a cpu.  Default 7 (i.e., 0-7 is 8 cpus).
 + verbose={true,false} Print what you're doing.  Default is true.

The script sometimes attempts to remove the last cpu from cpupool0; in
this case, libxl will print an error.  If the script gets an error
under that condition, it will ignore it; under any other condition, it
will print diagnostic information.

What finally crashed it for me was this command:
# ./cpupool-test.sh verbose=false left=1000

 -George

On Fri, Feb 11, 2011 at 7:39 AM, Andre Przywara <andre.przywara@amd.com> wrote:
> Juergen Gross wrote:
>>
>> On 02/10/11 15:18, Andre Przywara wrote:
>>>
>>> Andre Przywara wrote:
>>>>
>>>> On 02/10/2011 07:42 AM, Juergen Gross wrote:
>>>>>
>>>>> On 02/09/11 15:21, Juergen Gross wrote:
>>>>>>
>>>>>> Andre, George,
>>>>>>
>>>>>>
>>>>>> What seems to be interesting: I think the problem did always occur
>>>>>> when
>>>>>> a new cpupool was created and the first cpu was moved to it.
>>>>>>
>>>>>> I think my previous assumption regarding the master_ticker was not
>>>>>> too bad.
>>>>>> I think somehow the master_ticker of the new cpupool is becoming
>>>>>> active
>>>>>> before the scheduler is really initialized properly. This could
>>>>>> happen, if
>>>>>> enough time is spent between alloc_pdata for the cpu to be moved and
>>>>>> the
>>>>>> critical section in schedule_cpu_switch().
>>>>>>
>>>>>> The solution should be to activate the timers only if the scheduler is
>>>>>> ready for them.
>>>>>>
>>>>>> George, do you think the master_ticker should be stopped in
>>>>>> suspend_ticker
>>>>>> as well? I still see potential problems for entering deep C-States.
>>>>>> I think
>>>>>> I'll prepare a patch which will keep the master_ticker active for the
>>>>>> C-State case and migrate it for the schedule_cpu_switch() case.
>>>>>
>>>>> Okay, here is a patch for this. It ran on my 4-core machine without any
>>>>> problems.
>>>>> Andre, could you give it a try?
>>>>
>>>> Did, but unfortunately it crashed as always. Tried twice and made sure
>>>> I booted the right kernel. Sorry.
>>>> The idea with the race between the timer and the state changing
>>>> sounded very appealing, actually that was suspicious to me from the
>>>> beginning.
>>>>
>>>> I will add some code to dump the state of all cpupools to the BUG_ON
>>>> to see in which situation we are when the bug triggers.
>>>
>>> OK, here is a first try of this, the patch iterates over all CPU pools
>>> and outputs some data if the BUG_ON
>>> ((sdom->weight * sdom->active_vcpu_count) > weight_left) condition
>>> triggers:
>>> (XEN) CPU pool #0: 1 domains (SMP Credit Scheduler), mask: fffffffc003f
>>> (XEN) CPU pool #1: 0 domains (SMP Credit Scheduler), mask: fc0
>>> (XEN) CPU pool #2: 0 domains (SMP Credit Scheduler), mask: 1000
>>> (XEN) Xen BUG at sched_credit.c:1010
>>> ....
>>> The masks look proper (6 cores per node), the bug triggers when the
>>> first CPU is about to be(?) inserted.
>>
>> Sure? I'm missing the cpu with mask 2000.
>> I'll try to reproduce the problem on a larger machine here (24 cores, 4
>> numa
>> nodes).
>> Andre, can you give me your xen boot parameters? Which xen changeset are
>> you
>> running, and do you have any additional patches in use?
>
> The grub lines:
> kernel (hd1,0)/boot/xen-22858_debug_04.gz console=com1,vga com1=115200
> module (hd1,0)/boot/vmlinuz-2.6.32.27_pvops console=tty0
> console=ttyS0,115200 ro root=/dev/sdb1 xencons=hvc0
>
> All of my experiments are use c/s 22858 as a base.
> If you use a AMD Magny-Cours box for your experiments (socket C32 or G34),
> you should add the following patch (removing the line)
> --- a/xen/arch/x86/traps.c
> +++ b/xen/arch/x86/traps.c
> @@ -803,7 +803,6 @@ static void pv_cpuid(struct cpu_user_regs *regs)
>         __clear_bit(X86_FEATURE_SKINIT % 32, &c);
>         __clear_bit(X86_FEATURE_WDT % 32, &c);
>         __clear_bit(X86_FEATURE_LWP % 32, &c);
> -        __clear_bit(X86_FEATURE_NODEID_MSR % 32, &c);
>         __clear_bit(X86_FEATURE_TOPOEXT % 32, &c);
>         break;
>     case 5: /* MONITOR/MWAIT */
>
> This is not necessary (in fact that reverts my patch c/s 22815), but raises
> the probability to trigger the bug, probably because it increases the
> pressure of the Dom0 scheduler. If you cannot trigger it with Dom0, try to
> create a guest with many VCPUs and squeeze it into a small CPU-pool.
>
> Good luck ;-)
> Andre.
>
> --
> Andre Przywara
> AMD-OSRC (Dresden)
> Tel: x29712
>
>
> _______________________________________________
> Xen-devel mailing list
> Xen-devel@lists.xensource.com
> http://lists.xensource.com/xen-devel
>

[-- Attachment #2: cpupool-test.sh --]
[-- Type: application/x-sh, Size: 3043 bytes --]

[-- Attachment #3: Type: text/plain, Size: 138 bytes --]

_______________________________________________
Xen-devel mailing list
Xen-devel@lists.xensource.com
http://lists.xensource.com/xen-devel

^ permalink raw reply	[flat|nested] 53+ messages in thread

* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-02-14 17:57                                                             ` George Dunlap
@ 2011-02-15  7:22                                                               ` Juergen Gross
  2011-02-16  9:47                                                                 ` Juergen Gross
  0 siblings, 1 reply; 53+ messages in thread
From: Juergen Gross @ 2011-02-15  7:22 UTC (permalink / raw)
  To: George Dunlap; +Cc: Andre Przywara, xen-devel, Diestelhorst, Stephan

On 02/14/11 18:57, George Dunlap wrote:
> The good news is, I've managed to reproduce this on my local test
> hardware with 1x4x2 (1 socket, 4 cores, 2 threads per core) using the
> attached script.  It's time to go home now, but I should be able to
> dig something up tomorrow.
>
> To use the script:
> * Rename cpupool0 to "p0", and create an empty second pool, "p1"
> * You can modify elements by adding "arg=val" as arguments.
> * Arguments are:
>   + dryrun={true,false} Do the work, but don't actually execute any xl
> arguments.  Default false.
>   + left: Number commands to execute.  Default 10.
>   + maxcpus: highest numerical value for a cpu.  Default 7 (i.e., 0-7 is 8 cpus).
>   + verbose={true,false} Print what you're doing.  Default is true.
>
> The script sometimes attempts to remove the last cpu from cpupool0; in
> this case, libxl will print an error.  If the script gets an error
> under that condition, it will ignore it; under any other condition, it
> will print diagnostic information.
>
> What finally crashed it for me was this command:
> # ./cpupool-test.sh verbose=false left=1000

Nice!
With your script I finally managed to get the error, too. On my box (2 sockets
with 6 cores each) I had to use

./cpupool-test.sh verbose=false left=10000 maxcpus=11

to trigger it.
Looking for more data now...


Juergen

>
>   -George
>
> On Fri, Feb 11, 2011 at 7:39 AM, Andre Przywara<andre.przywara@amd.com>  wrote:
>> Juergen Gross wrote:
>>>
>>> On 02/10/11 15:18, Andre Przywara wrote:
>>>>
>>>> Andre Przywara wrote:
>>>>>
>>>>> On 02/10/2011 07:42 AM, Juergen Gross wrote:
>>>>>>
>>>>>> On 02/09/11 15:21, Juergen Gross wrote:
>>>>>>>
>>>>>>> Andre, George,
>>>>>>>
>>>>>>>
>>>>>>> What seems to be interesting: I think the problem did always occur
>>>>>>> when
>>>>>>> a new cpupool was created and the first cpu was moved to it.
>>>>>>>
>>>>>>> I think my previous assumption regarding the master_ticker was not
>>>>>>> too bad.
>>>>>>> I think somehow the master_ticker of the new cpupool is becoming
>>>>>>> active
>>>>>>> before the scheduler is really initialized properly. This could
>>>>>>> happen, if
>>>>>>> enough time is spent between alloc_pdata for the cpu to be moved and
>>>>>>> the
>>>>>>> critical section in schedule_cpu_switch().
>>>>>>>
>>>>>>> The solution should be to activate the timers only if the scheduler is
>>>>>>> ready for them.
>>>>>>>
>>>>>>> George, do you think the master_ticker should be stopped in
>>>>>>> suspend_ticker
>>>>>>> as well? I still see potential problems for entering deep C-States.
>>>>>>> I think
>>>>>>> I'll prepare a patch which will keep the master_ticker active for the
>>>>>>> C-State case and migrate it for the schedule_cpu_switch() case.
>>>>>>
>>>>>> Okay, here is a patch for this. It ran on my 4-core machine without any
>>>>>> problems.
>>>>>> Andre, could you give it a try?
>>>>>
>>>>> Did, but unfortunately it crashed as always. Tried twice and made sure
>>>>> I booted the right kernel. Sorry.
>>>>> The idea with the race between the timer and the state changing
>>>>> sounded very appealing, actually that was suspicious to me from the
>>>>> beginning.
>>>>>
>>>>> I will add some code to dump the state of all cpupools to the BUG_ON
>>>>> to see in which situation we are when the bug triggers.
>>>>
>>>> OK, here is a first try of this, the patch iterates over all CPU pools
>>>> and outputs some data if the BUG_ON
>>>> ((sdom->weight * sdom->active_vcpu_count)>  weight_left) condition
>>>> triggers:
>>>> (XEN) CPU pool #0: 1 domains (SMP Credit Scheduler), mask: fffffffc003f
>>>> (XEN) CPU pool #1: 0 domains (SMP Credit Scheduler), mask: fc0
>>>> (XEN) CPU pool #2: 0 domains (SMP Credit Scheduler), mask: 1000
>>>> (XEN) Xen BUG at sched_credit.c:1010
>>>> ....
>>>> The masks look proper (6 cores per node), the bug triggers when the
>>>> first CPU is about to be(?) inserted.
>>>
>>> Sure? I'm missing the cpu with mask 2000.
>>> I'll try to reproduce the problem on a larger machine here (24 cores, 4
>>> numa
>>> nodes).
>>> Andre, can you give me your xen boot parameters? Which xen changeset are
>>> you
>>> running, and do you have any additional patches in use?
>>
>> The grub lines:
>> kernel (hd1,0)/boot/xen-22858_debug_04.gz console=com1,vga com1=115200
>> module (hd1,0)/boot/vmlinuz-2.6.32.27_pvops console=tty0
>> console=ttyS0,115200 ro root=/dev/sdb1 xencons=hvc0
>>
>> All of my experiments are use c/s 22858 as a base.
>> If you use a AMD Magny-Cours box for your experiments (socket C32 or G34),
>> you should add the following patch (removing the line)
>> --- a/xen/arch/x86/traps.c
>> +++ b/xen/arch/x86/traps.c
>> @@ -803,7 +803,6 @@ static void pv_cpuid(struct cpu_user_regs *regs)
>>          __clear_bit(X86_FEATURE_SKINIT % 32,&c);
>>          __clear_bit(X86_FEATURE_WDT % 32,&c);
>>          __clear_bit(X86_FEATURE_LWP % 32,&c);
>> -        __clear_bit(X86_FEATURE_NODEID_MSR % 32,&c);
>>          __clear_bit(X86_FEATURE_TOPOEXT % 32,&c);
>>          break;
>>      case 5: /* MONITOR/MWAIT */
>>
>> This is not necessary (in fact that reverts my patch c/s 22815), but raises
>> the probability to trigger the bug, probably because it increases the
>> pressure of the Dom0 scheduler. If you cannot trigger it with Dom0, try to
>> create a guest with many VCPUs and squeeze it into a small CPU-pool.
>>
>> Good luck ;-)
>> Andre.
>>
>> --
>> Andre Przywara
>> AMD-OSRC (Dresden)
>> Tel: x29712
>>
>>
>> _______________________________________________
>> Xen-devel mailing list
>> Xen-devel@lists.xensource.com
>> http://lists.xensource.com/xen-devel
>>
>>
>>
>> _______________________________________________
>> Xen-devel mailing list
>> Xen-devel@lists.xensource.com
>> http://lists.xensource.com/xen-devel


-- 
Juergen Gross                 Principal Developer Operating Systems
TSP ES&S SWE OS6                       Telephone: +49 (0) 89 3222 2967
Fujitsu Technology Solutions              e-mail: juergen.gross@ts.fujitsu.com
Domagkstr. 28                           Internet: ts.fujitsu.com
D-80807 Muenchen                 Company details: ts.fujitsu.com/imprint.html

^ permalink raw reply	[flat|nested] 53+ messages in thread

* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-02-15  7:22                                                               ` Juergen Gross
@ 2011-02-16  9:47                                                                 ` Juergen Gross
  2011-02-16 13:54                                                                   ` George Dunlap
  0 siblings, 1 reply; 53+ messages in thread
From: Juergen Gross @ 2011-02-16  9:47 UTC (permalink / raw)
  To: George Dunlap; +Cc: Andre Przywara, xen-devel, Diestelhorst, Stephan

Okay, I have some more data.

I activated cpupool_dprintk() and added checks in sched_credit.c to
test for weight inconsistencies. To reduce the window for races I've
also applied my patch that always executes cpu assigning/unassigning
in a tasklet on the cpu to be moved.
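
The check is essentially of the following shape (a sketch only, not the
literal hunk; the helper name is made up, the fields are the ones
csched_acct() already works with):

    /* Hypothetical debug helper, called from csched_tick(): complain as
     * soon as one domain's active vcpus account for more weight than the
     * pool-wide total recorded in csched_private. */
    static void csched_check_weight(struct csched_private *prv, int cpu)
    {
        struct csched_dom *sdom;

        list_for_each_entry( sdom, &prv->active_sdom, active_sdom_elem )
        {
            if ( sdom->weight * sdom->active_vcpu_count > prv->weight )
            {
                printk("cpu %d, weight %u, prv %p, dom %d:\n",
                       cpu, prv->weight, prv, sdom->dom->domain_id);
                printk("sdom->weight: %d, sdom->active_vcpu_count: %d\n",
                       sdom->weight, sdom->active_vcpu_count);
                BUG();
            }
        }
    }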

Here is the result:

(XEN) cpupool_unassign_cpu(pool=0,cpu=6)
(XEN) cpupool_unassign_cpu(pool=0,cpu=6) ret -16
(XEN) cpupool_unassign_cpu(pool=0,cpu=6)
(XEN) cpupool_unassign_cpu(pool=0,cpu=6) ret -16
(XEN) cpupool_assign_cpu(pool=0,cpu=1)
(XEN) cpupool_assign_cpu(pool=0,cpu=1) ffff83083fff74c0
(XEN) cpupool_assign_cpu(cpu=1) ret 0
(XEN) cpupool_assign_cpu(pool=1,cpu=4)
(XEN) cpupool_assign_cpu(pool=1,cpu=4) ffff831002ad5e40
(XEN) cpupool_assign_cpu(cpu=4) ret 0
(XEN) cpu 4, weight 0,prv ffff831002ad5e40, dom 0:
(XEN) sdom->weight: 256, sdom->active_vcpu_count: 1
(XEN) Xen BUG at sched_credit.c:570
(XEN) ----[ Xen-4.1.0-rc5-pre  x86_64  debug=y  Tainted:    C ]----
(XEN) CPU:    4
(XEN) RIP:    e008:[<ffff82c4801197d7>] csched_tick+0x186/0x37f
(XEN) RFLAGS: 0000000000010086   CONTEXT: hypervisor
(XEN) rax: 0000000000000000   rbx: ffff830839d3ec30   rcx: 0000000000000000
(XEN) rdx: ffff830839dcff18   rsi: 000000000000000a   rdi: ffff82c4802542e8
(XEN) rbp: ffff830839dcfe38   rsp: ffff830839dcfde8   r8:  0000000000000004
(XEN) r9:  ffff82c480213520   r10: 00000000fffffffc   r11: 0000000000000001
(XEN) r12: 0000000000000004   r13: ffff830839d3ec40   r14: ffff831002ad5e40
(XEN) r15: ffff830839d66f90   cr0: 000000008005003b   cr4: 00000000000026f0
(XEN) cr3: 0000001020a98000   cr2: 00007fc5e9b79d98
(XEN) ds: 0000   es: 0000   fs: 0000   gs: 0000   ss: e010   cs: e008
(XEN) Xen stack trace from rsp=ffff830839dcfde8:
(XEN)    ffff83083ffa3ba0 ffff831002ad5e40 0000000000000246 ffff830839d6c000
(XEN)    0000000000000000 ffff830839dd1100 0000000000000004 ffff82c480119651
(XEN)    ffff831002b28018 ffff831002b28010 ffff830839dcfe68 ffff82c480126204
(XEN)    0000000000000002 ffff83083ffa3bb8 ffff830839dd1100 000000cae439ea7e
(XEN)    ffff830839dcfeb8 ffff82c480126539 00007fc5e9fa5b20 ffff830839dd1100
(XEN)    ffff831002b28010 0000000000000004 0000000000000004 ffff82c4802b0880
(XEN)    ffff830839dcff18 ffffffffffffffff ffff830839dcfef8 ffff82c480123647
(XEN)    ffff830839dcfed8 ffff830077eee000 00007fc5e9b79d98 00007fc5e9fa5b20
(XEN)    0000000000000002 00007fff46826f20 ffff830839dcff08 ffff82c4801236c2
(XEN)    00007cf7c62300c7 ffff82c480206ad6 00007fff46826f20 0000000000000002
(XEN)    00007fc5e9fa5b20 00007fc5e9b79d98 00007fff46827260 00007fff46826f50
(XEN)    0000000000000246 0000000000000032 0000000000000000 00000000ffffffff
(XEN)    0000000000000009 00007fc5e9d9de1a 0000000000000003 0000000000004848
(XEN)    00007fc5e9b7a000 0000010000000000 ffffffff800073f0 000000000000e033
(XEN)    0000000000000246 ffff880f97b51fc8 000000000000e02b 0000000000000000
(XEN)    0000000000000000 0000000000000000 0000000000000000 0000000000000004
(XEN)    ffff830077eee000 00000043b9afd180 0000000000000000
(XEN) Xen call trace:
(XEN)    [<ffff82c4801197d7>] csched_tick+0x186/0x37f
(XEN)    [<ffff82c480126204>] execute_timer+0x4e/0x6c
(XEN)    [<ffff82c480126539>] timer_softirq_action+0xf6/0x239
(XEN)    [<ffff82c480123647>] __do_softirq+0x88/0x99
(XEN)    [<ffff82c4801236c2>] do_softirq+0x6a/0x7a
(XEN)
(XEN)
(XEN) ****************************************
(XEN) Panic on CPU 4:
(XEN) Xen BUG at sched_credit.c:570
(XEN) ****************************************

As you can see, a Dom0 vcpu is becoming active on a pool 1 cpu. The BUG_ON
triggered in csched_acct() is a logical consequence of this.

How this can happen I don't know yet.
Does anyone have an idea? I'll keep searching...
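
(For context: csched_acct() starts with weight_left = prv->weight and then,
for each active domain of its pool, checks

    BUG_ON( (sdom->weight * sdom->active_vcpu_count) > weight_left );

before subtracting that product. So a vcpu whose weight is accounted in a
pool whose own weight is still 0, as in the dump above, trips it right away.
This is only quoted from memory, not a verbatim excerpt.)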


Juergen

On 02/15/11 08:22, Juergen Gross wrote:
> On 02/14/11 18:57, George Dunlap wrote:
>> The good news is, I've managed to reproduce this on my local test
>> hardware with 1x4x2 (1 socket, 4 cores, 2 threads per core) using the
>> attached script. It's time to go home now, but I should be able to
>> dig something up tomorrow.
>>
>> To use the script:
>> * Rename cpupool0 to "p0", and create an empty second pool, "p1"
>> * You can modify elements by adding "arg=val" as arguments.
>> * Arguments are:
>> + dryrun={true,false} Do the work, but don't actually execute any xl
>> arguments. Default false.
>> + left: Number commands to execute. Default 10.
>> + maxcpus: highest numerical value for a cpu. Default 7 (i.e., 0-7 is
>> 8 cpus).
>> + verbose={true,false} Print what you're doing. Default is true.
>>
>> The script sometimes attempts to remove the last cpu from cpupool0; in
>> this case, libxl will print an error. If the script gets an error
>> under that condition, it will ignore it; under any other condition, it
>> will print diagnostic information.
>>
>> What finally crashed it for me was this command:
>> # ./cpupool-test.sh verbose=false left=1000
>
> Nice!
> With your script I finally managed to get the error, too. On my box (2
> sockets
> a 6 cores) I had to use
>
> ./cpupool-test.sh verbose=false left=10000 maxcpus=11
>
> to trigger it.
> Looking for more data now...
>
>
> Juergen
>
>>
>> -George
>>
>> On Fri, Feb 11, 2011 at 7:39 AM, Andre
>> Przywara<andre.przywara@amd.com> wrote:
>>> Juergen Gross wrote:
>>>>
>>>> On 02/10/11 15:18, Andre Przywara wrote:
>>>>>
>>>>> Andre Przywara wrote:
>>>>>>
>>>>>> On 02/10/2011 07:42 AM, Juergen Gross wrote:
>>>>>>>
>>>>>>> On 02/09/11 15:21, Juergen Gross wrote:
>>>>>>>>
>>>>>>>> Andre, George,
>>>>>>>>
>>>>>>>>
>>>>>>>> What seems to be interesting: I think the problem did always occur
>>>>>>>> when
>>>>>>>> a new cpupool was created and the first cpu was moved to it.
>>>>>>>>
>>>>>>>> I think my previous assumption regarding the master_ticker was not
>>>>>>>> too bad.
>>>>>>>> I think somehow the master_ticker of the new cpupool is becoming
>>>>>>>> active
>>>>>>>> before the scheduler is really initialized properly. This could
>>>>>>>> happen, if
>>>>>>>> enough time is spent between alloc_pdata for the cpu to be moved
>>>>>>>> and
>>>>>>>> the
>>>>>>>> critical section in schedule_cpu_switch().
>>>>>>>>
>>>>>>>> The solution should be to activate the timers only if the
>>>>>>>> scheduler is
>>>>>>>> ready for them.
>>>>>>>>
>>>>>>>> George, do you think the master_ticker should be stopped in
>>>>>>>> suspend_ticker
>>>>>>>> as well? I still see potential problems for entering deep C-States.
>>>>>>>> I think
>>>>>>>> I'll prepare a patch which will keep the master_ticker active
>>>>>>>> for the
>>>>>>>> C-State case and migrate it for the schedule_cpu_switch() case.
>>>>>>>
>>>>>>> Okay, here is a patch for this. It ran on my 4-core machine
>>>>>>> without any
>>>>>>> problems.
>>>>>>> Andre, could you give it a try?
>>>>>>
>>>>>> Did, but unfortunately it crashed as always. Tried twice and made
>>>>>> sure
>>>>>> I booted the right kernel. Sorry.
>>>>>> The idea with the race between the timer and the state changing
>>>>>> sounded very appealing, actually that was suspicious to me from the
>>>>>> beginning.
>>>>>>
>>>>>> I will add some code to dump the state of all cpupools to the BUG_ON
>>>>>> to see in which situation we are when the bug triggers.
>>>>>
>>>>> OK, here is a first try of this, the patch iterates over all CPU pools
>>>>> and outputs some data if the BUG_ON
>>>>> ((sdom->weight * sdom->active_vcpu_count)> weight_left) condition
>>>>> triggers:
>>>>> (XEN) CPU pool #0: 1 domains (SMP Credit Scheduler), mask:
>>>>> fffffffc003f
>>>>> (XEN) CPU pool #1: 0 domains (SMP Credit Scheduler), mask: fc0
>>>>> (XEN) CPU pool #2: 0 domains (SMP Credit Scheduler), mask: 1000
>>>>> (XEN) Xen BUG at sched_credit.c:1010
>>>>> ....
>>>>> The masks look proper (6 cores per node), the bug triggers when the
>>>>> first CPU is about to be(?) inserted.
>>>>
>>>> Sure? I'm missing the cpu with mask 2000.
>>>> I'll try to reproduce the problem on a larger machine here (24 cores, 4
>>>> numa
>>>> nodes).
>>>> Andre, can you give me your xen boot parameters? Which xen changeset
>>>> are
>>>> you
>>>> running, and do you have any additional patches in use?
>>>
>>> The grub lines:
>>> kernel (hd1,0)/boot/xen-22858_debug_04.gz console=com1,vga com1=115200
>>> module (hd1,0)/boot/vmlinuz-2.6.32.27_pvops console=tty0
>>> console=ttyS0,115200 ro root=/dev/sdb1 xencons=hvc0
>>>
>>> All of my experiments are use c/s 22858 as a base.
>>> If you use a AMD Magny-Cours box for your experiments (socket C32 or
>>> G34),
>>> you should add the following patch (removing the line)
>>> --- a/xen/arch/x86/traps.c
>>> +++ b/xen/arch/x86/traps.c
>>> @@ -803,7 +803,6 @@ static void pv_cpuid(struct cpu_user_regs *regs)
>>> __clear_bit(X86_FEATURE_SKINIT % 32,&c);
>>> __clear_bit(X86_FEATURE_WDT % 32,&c);
>>> __clear_bit(X86_FEATURE_LWP % 32,&c);
>>> - __clear_bit(X86_FEATURE_NODEID_MSR % 32,&c);
>>> __clear_bit(X86_FEATURE_TOPOEXT % 32,&c);
>>> break;
>>> case 5: /* MONITOR/MWAIT */
>>>
>>> This is not necessary (in fact that reverts my patch c/s 22815), but
>>> raises
>>> the probability to trigger the bug, probably because it increases the
>>> pressure of the Dom0 scheduler. If you cannot trigger it with Dom0,
>>> try to
>>> create a guest with many VCPUs and squeeze it into a small CPU-pool.
>>>
>>> Good luck ;-)
>>> Andre.
>>>
>>> --
>>> Andre Przywara
>>> AMD-OSRC (Dresden)
>>> Tel: x29712
>>>
>>>
>>> _______________________________________________
>>> Xen-devel mailing list
>>> Xen-devel@lists.xensource.com
>>> http://lists.xensource.com/xen-devel
>>>
>>>
>>>
>>> _______________________________________________
>>> Xen-devel mailing list
>>> Xen-devel@lists.xensource.com
>>> http://lists.xensource.com/xen-devel
>
>


-- 
Juergen Gross                 Principal Developer Operating Systems
TSP ES&S SWE OS6                       Telephone: +49 (0) 89 3222 2967
Fujitsu Technology Solutions              e-mail: juergen.gross@ts.fujitsu.com
Domagkstr. 28                           Internet: ts.fujitsu.com
D-80807 Muenchen                 Company details: ts.fujitsu.com/imprint.html

^ permalink raw reply	[flat|nested] 53+ messages in thread

* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-02-16  9:47                                                                 ` Juergen Gross
@ 2011-02-16 13:54                                                                   ` George Dunlap
       [not found]                                                                     ` <4D6237C6.1050206@amd.com>
                                                                                       ` (3 more replies)
  0 siblings, 4 replies; 53+ messages in thread
From: George Dunlap @ 2011-02-16 13:54 UTC (permalink / raw)
  To: Juergen Gross; +Cc: Andre Przywara, xen-devel, Diestelhorst, Stephan

[-- Attachment #1: Type: text/plain, Size: 11892 bytes --]

Andre (and Juergen), can you try again with the attached patch?

What the patch basically does is try to make "cpu_disable_scheduler()"
do what it seems to say it does. :-)  Namely, the various
scheduler-related interrupts (both the per-cpu ticks and the master tick)
are part of the scheduler, so disable them before doing anything, and
don't enable them until the cpu is really ready to go again.

To be precise:
* cpu_disable_scheduler() disables ticks
* schedule_cpu_switch() only enables ticks if adding a cpu to a pool,
and does it after inserting the idle vcpu
* Modify semantics, s.t., {alloc,free}_pdata() don't actually start or
stop tickers
 + Call tick_{resume,suspend} in cpu_{up,down}, respectively
* Modify credit1's tick_{suspend,resume} to handle the master ticker as well.

With this patch (if dom0 doesn't get wedged due to all 8 vcpus being
on one pcpu), I can perform thousands of operations successfully.
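
The core of it, in schedule_cpu_switch(), then looks roughly like this
(condensed from the attached diff; allocation, error paths and the old-ops
cleanup are left out):

    pcpu_schedule_lock_irqsave(cpu, flags);

    /* Switch the idle vcpu's private data, the scheduler ops and the
     * pcpu private data while no tick can fire... */
    idle->sched_priv = vpriv;
    per_cpu(scheduler, cpu) = new_ops;
    per_cpu(schedule_data, cpu).sched_priv = ppriv;
    SCHED_OP(new_ops, insert_vcpu, idle);

    /* ...and only start ticking again as the very last step, and only
     * if the cpu is actually being put into a pool. */
    if ( c != NULL )
        SCHED_OP(new_ops, tick_resume, cpu);

    pcpu_schedule_unlock_irqrestore(cpu, flags);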

(NB this is not ready for application yet, I just wanted to check to
see if it fixes Andre's problem)

 -George

On Wed, Feb 16, 2011 at 9:47 AM, Juergen Gross
<juergen.gross@ts.fujitsu.com> wrote:
> Okay, I have some more data.
>
> I activated cpupool_dprintk() and included checks in sched_credit.c to
> test for weight inconsistencies. To reduce race possibilities I've added
> my patch to execute cpu assigning/unassigning always in a tasklet on the
> cpu to be moved.
>
> Here is the result:
>
> (XEN) cpupool_unassign_cpu(pool=0,cpu=6)
> (XEN) cpupool_unassign_cpu(pool=0,cpu=6) ret -16
> (XEN) cpupool_unassign_cpu(pool=0,cpu=6)
> (XEN) cpupool_unassign_cpu(pool=0,cpu=6) ret -16
> (XEN) cpupool_assign_cpu(pool=0,cpu=1)
> (XEN) cpupool_assign_cpu(pool=0,cpu=1) ffff83083fff74c0
> (XEN) cpupool_assign_cpu(cpu=1) ret 0
> (XEN) cpupool_assign_cpu(pool=1,cpu=4)
> (XEN) cpupool_assign_cpu(pool=1,cpu=4) ffff831002ad5e40
> (XEN) cpupool_assign_cpu(cpu=4) ret 0
> (XEN) cpu 4, weight 0,prv ffff831002ad5e40, dom 0:
> (XEN) sdom->weight: 256, sdom->active_vcpu_count: 1
> (XEN) Xen BUG at sched_credit.c:570
> (XEN) ----[ Xen-4.1.0-rc5-pre  x86_64  debug=y  Tainted:    C ]----
> (XEN) CPU:    4
> (XEN) RIP:    e008:[<ffff82c4801197d7>] csched_tick+0x186/0x37f
> (XEN) RFLAGS: 0000000000010086   CONTEXT: hypervisor
> (XEN) rax: 0000000000000000   rbx: ffff830839d3ec30   rcx: 0000000000000000
> (XEN) rdx: ffff830839dcff18   rsi: 000000000000000a   rdi: ffff82c4802542e8
> (XEN) rbp: ffff830839dcfe38   rsp: ffff830839dcfde8   r8:  0000000000000004
> (XEN) r9:  ffff82c480213520   r10: 00000000fffffffc   r11: 0000000000000001
> (XEN) r12: 0000000000000004   r13: ffff830839d3ec40   r14: ffff831002ad5e40
> (XEN) r15: ffff830839d66f90   cr0: 000000008005003b   cr4: 00000000000026f0
> (XEN) cr3: 0000001020a98000   cr2: 00007fc5e9b79d98
> (XEN) ds: 0000   es: 0000   fs: 0000   gs: 0000   ss: e010   cs: e008
> (XEN) Xen stack trace from rsp=ffff830839dcfde8:
> (XEN)    ffff83083ffa3ba0 ffff831002ad5e40 0000000000000246 ffff830839d6c000
> (XEN)    0000000000000000 ffff830839dd1100 0000000000000004 ffff82c480119651
> (XEN)    ffff831002b28018 ffff831002b28010 ffff830839dcfe68 ffff82c480126204
> (XEN)    0000000000000002 ffff83083ffa3bb8 ffff830839dd1100 000000cae439ea7e
> (XEN)    ffff830839dcfeb8 ffff82c480126539 00007fc5e9fa5b20 ffff830839dd1100
> (XEN)    ffff831002b28010 0000000000000004 0000000000000004 ffff82c4802b0880
> (XEN)    ffff830839dcff18 ffffffffffffffff ffff830839dcfef8 ffff82c480123647
> (XEN)    ffff830839dcfed8 ffff830077eee000 00007fc5e9b79d98 00007fc5e9fa5b20
> (XEN)    0000000000000002 00007fff46826f20 ffff830839dcff08 ffff82c4801236c2
> (XEN)    00007cf7c62300c7 ffff82c480206ad6 00007fff46826f20 0000000000000002
> (XEN)    00007fc5e9fa5b20 00007fc5e9b79d98 00007fff46827260 00007fff46826f50
> (XEN)    0000000000000246 0000000000000032 0000000000000000 00000000ffffffff
> (XEN)    0000000000000009 00007fc5e9d9de1a 0000000000000003 0000000000004848
> (XEN)    00007fc5e9b7a000 0000010000000000 ffffffff800073f0 000000000000e033
> (XEN)    0000000000000246 ffff880f97b51fc8 000000000000e02b 0000000000000000
> (XEN)    0000000000000000 0000000000000000 0000000000000000 0000000000000004
> (XEN)    ffff830077eee000 00000043b9afd180 0000000000000000
> (XEN) Xen call trace:
> (XEN)    [<ffff82c4801197d7>] csched_tick+0x186/0x37f
> (XEN)    [<ffff82c480126204>] execute_timer+0x4e/0x6c
> (XEN)    [<ffff82c480126539>] timer_softirq_action+0xf6/0x239
> (XEN)    [<ffff82c480123647>] __do_softirq+0x88/0x99
> (XEN)    [<ffff82c4801236c2>] do_softirq+0x6a/0x7a
> (XEN)
> (XEN)
> (XEN) ****************************************
> (XEN) Panic on CPU 4:
> (XEN) Xen BUG at sched_credit.c:570
> (XEN) ****************************************
>
> As you can see, a Dom0 vcpus is becoming active on a pool 1 cpu. The BUG_ON
> triggered in csched_acct() is a logical result of this.
>
> How this can happen I don't know yet.
> Anyone any idea? I'll keep searching...
>
>
> Juergen
>
> On 02/15/11 08:22, Juergen Gross wrote:
>>
>> On 02/14/11 18:57, George Dunlap wrote:
>>>
>>> The good news is, I've managed to reproduce this on my local test
>>> hardware with 1x4x2 (1 socket, 4 cores, 2 threads per core) using the
>>> attached script. It's time to go home now, but I should be able to
>>> dig something up tomorrow.
>>>
>>> To use the script:
>>> * Rename cpupool0 to "p0", and create an empty second pool, "p1"
>>> * You can modify elements by adding "arg=val" as arguments.
>>> * Arguments are:
>>> + dryrun={true,false} Do the work, but don't actually execute any xl
>>> arguments. Default false.
>>> + left: Number commands to execute. Default 10.
>>> + maxcpus: highest numerical value for a cpu. Default 7 (i.e., 0-7 is
>>> 8 cpus).
>>> + verbose={true,false} Print what you're doing. Default is true.
>>>
>>> The script sometimes attempts to remove the last cpu from cpupool0; in
>>> this case, libxl will print an error. If the script gets an error
>>> under that condition, it will ignore it; under any other condition, it
>>> will print diagnostic information.
>>>
>>> What finally crashed it for me was this command:
>>> # ./cpupool-test.sh verbose=false left=1000
>>
>> Nice!
>> With your script I finally managed to get the error, too. On my box (2
>> sockets
>> a 6 cores) I had to use
>>
>> ./cpupool-test.sh verbose=false left=10000 maxcpus=11
>>
>> to trigger it.
>> Looking for more data now...
>>
>>
>> Juergen
>>
>>>
>>> -George
>>>
>>> On Fri, Feb 11, 2011 at 7:39 AM, Andre
>>> Przywara<andre.przywara@amd.com> wrote:
>>>>
>>>> Juergen Gross wrote:
>>>>>
>>>>> On 02/10/11 15:18, Andre Przywara wrote:
>>>>>>
>>>>>> Andre Przywara wrote:
>>>>>>>
>>>>>>> On 02/10/2011 07:42 AM, Juergen Gross wrote:
>>>>>>>>
>>>>>>>> On 02/09/11 15:21, Juergen Gross wrote:
>>>>>>>>>
>>>>>>>>> Andre, George,
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> What seems to be interesting: I think the problem did always occur
>>>>>>>>> when
>>>>>>>>> a new cpupool was created and the first cpu was moved to it.
>>>>>>>>>
>>>>>>>>> I think my previous assumption regarding the master_ticker was not
>>>>>>>>> too bad.
>>>>>>>>> I think somehow the master_ticker of the new cpupool is becoming
>>>>>>>>> active
>>>>>>>>> before the scheduler is really initialized properly. This could
>>>>>>>>> happen, if
>>>>>>>>> enough time is spent between alloc_pdata for the cpu to be moved
>>>>>>>>> and
>>>>>>>>> the
>>>>>>>>> critical section in schedule_cpu_switch().
>>>>>>>>>
>>>>>>>>> The solution should be to activate the timers only if the
>>>>>>>>> scheduler is
>>>>>>>>> ready for them.
>>>>>>>>>
>>>>>>>>> George, do you think the master_ticker should be stopped in
>>>>>>>>> suspend_ticker
>>>>>>>>> as well? I still see potential problems for entering deep C-States.
>>>>>>>>> I think
>>>>>>>>> I'll prepare a patch which will keep the master_ticker active
>>>>>>>>> for the
>>>>>>>>> C-State case and migrate it for the schedule_cpu_switch() case.
>>>>>>>>
>>>>>>>> Okay, here is a patch for this. It ran on my 4-core machine
>>>>>>>> without any
>>>>>>>> problems.
>>>>>>>> Andre, could you give it a try?
>>>>>>>
>>>>>>> Did, but unfortunately it crashed as always. Tried twice and made
>>>>>>> sure
>>>>>>> I booted the right kernel. Sorry.
>>>>>>> The idea with the race between the timer and the state changing
>>>>>>> sounded very appealing, actually that was suspicious to me from the
>>>>>>> beginning.
>>>>>>>
>>>>>>> I will add some code to dump the state of all cpupools to the BUG_ON
>>>>>>> to see in which situation we are when the bug triggers.
>>>>>>
>>>>>> OK, here is a first try of this, the patch iterates over all CPU pools
>>>>>> and outputs some data if the BUG_ON
>>>>>> ((sdom->weight * sdom->active_vcpu_count)> weight_left) condition
>>>>>> triggers:
>>>>>> (XEN) CPU pool #0: 1 domains (SMP Credit Scheduler), mask:
>>>>>> fffffffc003f
>>>>>> (XEN) CPU pool #1: 0 domains (SMP Credit Scheduler), mask: fc0
>>>>>> (XEN) CPU pool #2: 0 domains (SMP Credit Scheduler), mask: 1000
>>>>>> (XEN) Xen BUG at sched_credit.c:1010
>>>>>> ....
>>>>>> The masks look proper (6 cores per node), the bug triggers when the
>>>>>> first CPU is about to be(?) inserted.
>>>>>
>>>>> Sure? I'm missing the cpu with mask 2000.
>>>>> I'll try to reproduce the problem on a larger machine here (24 cores, 4
>>>>> numa
>>>>> nodes).
>>>>> Andre, can you give me your xen boot parameters? Which xen changeset
>>>>> are
>>>>> you
>>>>> running, and do you have any additional patches in use?
>>>>
>>>> The grub lines:
>>>> kernel (hd1,0)/boot/xen-22858_debug_04.gz console=com1,vga com1=115200
>>>> module (hd1,0)/boot/vmlinuz-2.6.32.27_pvops console=tty0
>>>> console=ttyS0,115200 ro root=/dev/sdb1 xencons=hvc0
>>>>
>>>> All of my experiments are use c/s 22858 as a base.
>>>> If you use a AMD Magny-Cours box for your experiments (socket C32 or
>>>> G34),
>>>> you should add the following patch (removing the line)
>>>> --- a/xen/arch/x86/traps.c
>>>> +++ b/xen/arch/x86/traps.c
>>>> @@ -803,7 +803,6 @@ static void pv_cpuid(struct cpu_user_regs *regs)
>>>> __clear_bit(X86_FEATURE_SKINIT % 32,&c);
>>>> __clear_bit(X86_FEATURE_WDT % 32,&c);
>>>> __clear_bit(X86_FEATURE_LWP % 32,&c);
>>>> - __clear_bit(X86_FEATURE_NODEID_MSR % 32,&c);
>>>> __clear_bit(X86_FEATURE_TOPOEXT % 32,&c);
>>>> break;
>>>> case 5: /* MONITOR/MWAIT */
>>>>
>>>> This is not necessary (in fact that reverts my patch c/s 22815), but
>>>> raises
>>>> the probability to trigger the bug, probably because it increases the
>>>> pressure of the Dom0 scheduler. If you cannot trigger it with Dom0,
>>>> try to
>>>> create a guest with many VCPUs and squeeze it into a small CPU-pool.
>>>>
>>>> Good luck ;-)
>>>> Andre.
>>>>
>>>> --
>>>> Andre Przywara
>>>> AMD-OSRC (Dresden)
>>>> Tel: x29712
>>>>
>>>>
>>>> _______________________________________________
>>>> Xen-devel mailing list
>>>> Xen-devel@lists.xensource.com
>>>> http://lists.xensource.com/xen-devel
>>>>
>>>>
>>>>
>>>> _______________________________________________
>>>> Xen-devel mailing list
>>>> Xen-devel@lists.xensource.com
>>>> http://lists.xensource.com/xen-devel
>>
>>
>
>
> --
> Juergen Gross                 Principal Developer Operating Systems
> TSP ES&S SWE OS6                       Telephone: +49 (0) 89 3222 2967
> Fujitsu Technology Solutions              e-mail:
> juergen.gross@ts.fujitsu.com
> Domagkstr. 28                           Internet: ts.fujitsu.com
> D-80807 Muenchen                 Company details:
> ts.fujitsu.com/imprint.html
>
> _______________________________________________
> Xen-devel mailing list
> Xen-devel@lists.xensource.com
> http://lists.xensource.com/xen-devel
>

[-- Attachment #2: cpupools-tick-rearrange.diff --]
[-- Type: text/x-diff, Size: 8999 bytes --]

diff -r 4ea36cce2519 xen/common/cpupool.c
--- a/xen/common/cpupool.c	Mon Feb 14 09:10:22 2011 +0000
+++ b/xen/common/cpupool.c	Wed Feb 16 12:21:37 2011 +0000
@@ -291,49 +291,76 @@
 
     spin_lock(&cpupool_lock);
     ret = -EBUSY;
-    if ( (cpupool_moving_cpu != -1) && (cpu != cpupool_moving_cpu) )
-        goto out;
-    if ( cpu_isset(cpu, cpupool_locked_cpus) )
-        goto out;
+    if ( cpu != cpupool_moving_cpu )
+    {
+        /* Don't start a second operation until the first has completed */
+        if ( (cpupool_moving_cpu != -1) )
+            goto out;
 
-    ret = 0;
-    if ( !cpu_isset(cpu, c->cpu_valid) && (cpu != cpupool_moving_cpu) )
-        goto out;
+        /* Don't start an op on a locked cpu (?)*/
+        if ( cpu_isset(cpu, cpupool_locked_cpus) )
+            goto out;
 
-    if ( (c->n_dom > 0) && (cpus_weight(c->cpu_valid) == 1) &&
-         (cpu != cpupool_moving_cpu) )
+        /* Can't take the last cpu out of cpupool0 */
+        ret = -EINVAL;
+        if ( cpus_weight(c->cpu_valid) == 1
+             && c == cpupool0)
+            goto out;
+
+        ret = 0;
+        if ( !cpu_isset(cpu, c->cpu_valid) )
+            goto out;
+
+        if ( (c->n_dom > 0) && (cpus_weight(c->cpu_valid) == 1) )
+        {
+            for_each_domain(d)
+            {
+                if ( d->cpupool != c )
+                    continue;
+                /* Don't allow the last cpu from a pool to be moved if there's a live
+                 * domain still running on it */
+                if ( !d->is_dying )
+                {
+                    printk("%s: cpu %d pool %p: d%d still present\n",
+                           __func__, cpu, c, d->domain_id);
+                    ret = -EBUSY;
+                    break;
+                }
+                c->n_dom--;
+                ret = sched_move_domain(d, cpupool0);
+                if ( ret )
+                {
+                    c->n_dom++;
+                    break;
+                }
+                cpupool0->n_dom++;
+            }
+            if ( ret )
+                goto out;
+        }
+        cpupool_moving_cpu = cpu;
+        cpupool_cpu_moving = c;
+        cpu_clear(cpu, c->cpu_valid);
+    }
+    else
     {
-        for_each_domain(d)
-        {
-            if ( d->cpupool != c )
-                continue;
-            if ( !d->is_dying )
-            {
-                ret = -EBUSY;
-                break;
-            }
-            c->n_dom--;
-            ret = sched_move_domain(d, cpupool0);
-            if ( ret )
-            {
-                c->n_dom++;
-                break;
-            }
-            cpupool0->n_dom++;
-        }
-        if ( ret )
-            goto out;
+        /* Make sure all the things we did last time still hold */
+        BUG_ON(cpupool_cpu_moving != c);
+        BUG_ON(cpu_isset(cpu, c->cpu_valid));
+        /* compare cpu_valid to 0, since we cleared cpu in cpu_valid above */
+        BUG_ON((c->n_dom > 0) && (cpus_weight(c->cpu_valid) == 0));
     }
-    cpupool_moving_cpu = cpu;
+    /* Increase the refcount both times through, because the return path released
+     * the reference. */
     atomic_inc(&c->refcnt);
-    cpupool_cpu_moving = c;
-    cpu_clear(cpu, c->cpu_valid);
+
     spin_unlock(&cpupool_lock);
 
     work_cpu = smp_processor_id();
     if ( work_cpu == cpu )
     {
         work_cpu = first_cpu(cpupool0->cpu_valid);
+        /* If cpu is in cpupool0, then cpupool0 must contain at least one other cpu */
         if ( work_cpu == cpu )
             work_cpu = next_cpu(cpu, cpupool0->cpu_valid);
     }
diff -r 4ea36cce2519 xen/common/sched_credit.c
--- a/xen/common/sched_credit.c	Mon Feb 14 09:10:22 2011 +0000
+++ b/xen/common/sched_credit.c	Wed Feb 16 12:21:37 2011 +0000
@@ -330,11 +330,14 @@
     prv->ncpus--;
     cpu_clear(cpu, prv->idlers);
     cpu_clear(cpu, prv->cpus);
+#if 0
+    /* This should have been disabled already */
     if ( (prv->master == cpu) && (prv->ncpus > 0) )
     {
         prv->master = first_cpu(prv->cpus);
         migrate_timer(&prv->master_ticker, prv->master);
     }
+#endif
     kill_timer(&spc->ticker);
     if ( prv->ncpus == 0 )
         kill_timer(&prv->master_ticker);
@@ -367,12 +370,16 @@
     {
         prv->master = cpu;
         init_timer(&prv->master_ticker, csched_acct, prv, cpu);
+#if 0
         set_timer(&prv->master_ticker, NOW() +
                   MILLISECS(CSCHED_MSECS_PER_TICK) * CSCHED_TICKS_PER_ACCT);
+#endif
     }
 
     init_timer(&spc->ticker, csched_tick, (void *)(unsigned long)cpu, cpu);
+#if 0
     set_timer(&spc->ticker, NOW() + MILLISECS(CSCHED_MSECS_PER_TICK));
+#endif
 
     INIT_LIST_HEAD(&spc->runq);
     spc->runq_sort_last = prv->runq_sort;
@@ -1531,15 +1538,28 @@
 
 static void csched_tick_suspend(const struct scheduler *ops, unsigned int cpu)
 {
+    struct csched_private *prv = CSCHED_PRIV(ops);
     struct csched_pcpu *spc;
 
     spc = CSCHED_PCPU(cpu);
 
+    if (prv->master == cpu)
+    {
+        if ( (prv->ncpus > 0) )
+        {
+            prv->master = first_cpu(prv->cpus);
+            migrate_timer(&prv->master_ticker, prv->master);
+        }
+        else
+            stop_timer(&prv->master_ticker);
+    }
+
     stop_timer(&spc->ticker);
 }
 
 static void csched_tick_resume(const struct scheduler *ops, unsigned int cpu)
 {
+    struct csched_private *prv = CSCHED_PRIV(ops);
     struct csched_pcpu *spc;
     uint64_t now = NOW();
 
@@ -1547,6 +1567,12 @@
 
     set_timer(&spc->ticker, now + MILLISECS(CSCHED_MSECS_PER_TICK)
             - now % MILLISECS(CSCHED_MSECS_PER_TICK) );
+
+    if (prv->master == cpu)
+    {
+        set_timer(&prv->master_ticker, NOW() +
+                  MILLISECS(CSCHED_MSECS_PER_TICK) * CSCHED_TICKS_PER_ACCT);
+    }
 }
 
 static struct csched_private _csched_priv;
diff -r 4ea36cce2519 xen/common/schedule.c
--- a/xen/common/schedule.c	Mon Feb 14 09:10:22 2011 +0000
+++ b/xen/common/schedule.c	Wed Feb 16 12:21:37 2011 +0000
@@ -469,10 +469,21 @@
     cpumask_t online_affinity;
     int    ret = 0;
     bool_t affinity_broken;
+    struct scheduler *cpu_ops;
 
+    cpu_ops = per_cpu(scheduler, cpu);
     c = per_cpu(cpupool, cpu);
-    if ( c == NULL )
+    if ( c == NULL || cpu_ops == NULL )
+    {
+        printk("%s: no scheduler for cpu %d\n",
+               __func__, cpu);
         return ret;
+    }
+
+    pcpu_schedule_lock_irq(cpu);
+    SCHED_OP(cpu_ops, tick_suspend, cpu);
+    pcpu_schedule_unlock_irq(cpu);
+
 
     for_each_domain ( d )
     {
@@ -1211,6 +1222,8 @@
          ((sd->sched_priv = ops.alloc_pdata(&ops, cpu)) == NULL) )
         return -ENOMEM;
 
+    ops.tick_resume(&ops, cpu);
+
     return 0;
 }
 
@@ -1219,7 +1232,11 @@
     struct schedule_data *sd = &per_cpu(schedule_data, cpu);
 
     if ( sd->sched_priv != NULL )
+    {
+        /* FIXME: What if scheduler has different ops? */
+        SCHED_OP(&ops, tick_suspend, cpu);
         SCHED_OP(&ops, free_pdata, sd->sched_priv, cpu);
+    }
 
     kill_timer(&sd->s_timer);
 }
@@ -1288,6 +1305,8 @@
     if ( ops.alloc_pdata &&
          !(this_cpu(schedule_data).sched_priv = ops.alloc_pdata(&ops, 0)) )
         BUG();
+
+    ops.tick_resume(&ops, 0);
 }
 
 int schedule_cpu_switch(unsigned int cpu, struct cpupool *c)
@@ -1298,31 +1317,51 @@
     struct scheduler *old_ops = per_cpu(scheduler, cpu);
     struct scheduler *new_ops = (c == NULL) ? &ops : c->sched;
 
+    BUG_ON(!is_idle_vcpu(per_cpu(schedule_data, cpu).curr));
+
     if ( old_ops == new_ops )
+    {
+        //printk("%s: cpu %d pool %p: no change\n",
+        //     __func__,  cpu, c);
         return 0;
+    }
 
     idle = idle_vcpu[cpu];
     ppriv = SCHED_OP(new_ops, alloc_pdata, cpu);
     if ( ppriv == NULL )
+    {
+        printk("%s: cpu %d pool %p: alloc_pdata failed\n",
+               __func__, cpu, c);
         return -ENOMEM;
+    }
     vpriv = SCHED_OP(new_ops, alloc_vdata, idle, idle->domain->sched_priv);
     if ( vpriv == NULL )
     {
+        printk("%s: cpu %d pool %p: alloc_vdata(idle) failed\n",
+               __func__, cpu, c);
         SCHED_OP(new_ops, free_pdata, ppriv, cpu);
         return -ENOMEM;
     }
 
     pcpu_schedule_lock_irqsave(cpu, flags);
-
-    SCHED_OP(old_ops, tick_suspend, cpu);
+    //SCHED_OP(old_ops, tick_suspend, cpu);
+    /* Switch idle private */
     vpriv_old = idle->sched_priv;
     idle->sched_priv = vpriv;
+
+    /* Switch ops */
     per_cpu(scheduler, cpu) = new_ops;
+
+    /* Switch pcpu private */
     ppriv_old = per_cpu(schedule_data, cpu).sched_priv;
     per_cpu(schedule_data, cpu).sched_priv = ppriv;
-    SCHED_OP(new_ops, tick_resume, cpu);
+
     SCHED_OP(new_ops, insert_vcpu, idle);
 
+    /* Enabling ticks should be the last thing done,
+     * and only if moving it to a cpu pool */
+    if ( c != NULL )
+        SCHED_OP(new_ops, tick_resume, cpu);
     pcpu_schedule_unlock_irqrestore(cpu, flags);
 
     SCHED_OP(old_ops, free_vdata, vpriv_old);

[-- Attachment #3: Type: text/plain, Size: 138 bytes --]

_______________________________________________
Xen-devel mailing list
Xen-devel@lists.xensource.com
http://lists.xensource.com/xen-devel

^ permalink raw reply	[flat|nested] 53+ messages in thread

* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-02-16 13:54                                                                   ` George Dunlap
       [not found]                                                                     ` <4D6237C6.1050206@amd.com>
@ 2011-02-16 14:11                                                                     ` Juergen Gross
  2011-02-16 14:28                                                                       ` Juergen Gross
  2011-02-17  0:05                                                                       ` André Przywara
  2011-02-17  7:05                                                                     ` Juergen Gross
  2011-02-21 10:00                                                                     ` Andre Przywara
  3 siblings, 2 replies; 53+ messages in thread
From: Juergen Gross @ 2011-02-16 14:11 UTC (permalink / raw)
  To: George Dunlap; +Cc: Andre Przywara, xen-devel, Diestelhorst, Stephan

[-- Attachment #1: Type: text/plain, Size: 12728 bytes --]

On 02/16/11 14:54, George Dunlap wrote:
> Andre (and Juergen), can you try again with the attached patch?
>
> What the patch basically does is try to make "cpu_disable_scheduler()"
> do what it seems to say it does. :-)  Namely, the various
> scheduler-related interrutps (both per-cpu ticks and the master tick)
> is a part of the scheduler, so disable them before doing anything, and
> don't enable them until the cpu is really ready to go again.
>
> To be precise:
> * cpu_disable_scheduler() disables ticks
> * scheduler_cpu_switch() only enables ticks if adding a cpu to a pool,
> and does it after inserting the idle vcpu
> * Modify semantics, s.t., {alloc,free}_pdata() don't actually start or
> stop tickers
>   + Call tick_{resume,suspend} in cpu_{up,down}, respectively

I tried this before :-)
It didn't work for Andre, but maybe there were some bits missing.

> * Modify credit1's tick_{suspend,resume} to handle the master ticker as well.
>
> With this patch (if dom0 doesn't get wedged due to all 8 vcpus being
> on one pcpu), I can perform thousands of operations successfully.

Nice. I'll try it later. At the moment I'm testing another patch (attached
for review, if you like). I think I've identified two possible races.
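
In short (only a sketch, condensed from the attached diff):
csched_load_balance() must not steal work from a peer cpu that is no longer
in the pool's online map, and vcpu_migrate() must take the new cpu's
schedule lock and re-check the pool's cpu_valid mask before it switches
v->processor. The second part becomes roughly:

    /* Entry of vcpu_migrate(), simplified: pick the new cpu and take its
     * schedule lock before committing, retrying whenever the pick no
     * longer matches the pool. */
    for ( ; ; )
    {
        vcpu_schedule_lock_irqsave(v, flags);

        old_cpu = v->processor;
        new_cpu = SCHED_OP(VCPU2OP(v), pick_cpu, v);
        if ( new_cpu == old_cpu )
            break;

        if ( pcpu_schedule_trylock(new_cpu) )
        {
            if ( cpu_isset(new_cpu, v->domain->cpupool->cpu_valid) )
                break;
            pcpu_schedule_unlock(new_cpu);
        }
        vcpu_schedule_unlock_irqrestore(v, flags);
    }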


Juergen

>
> (NB this is not ready for application yet, I just wanted to check to
> see if it fixes Andre's problem)
>
>   -George
>
> On Wed, Feb 16, 2011 at 9:47 AM, Juergen Gross
> <juergen.gross@ts.fujitsu.com>  wrote:
>> Okay, I have some more data.
>>
>> I activated cpupool_dprintk() and included checks in sched_credit.c to
>> test for weight inconsistencies. To reduce race possibilities I've added
>> my patch to execute cpu assigning/unassigning always in a tasklet on the
>> cpu to be moved.
>>
>> Here is the result:
>>
>> (XEN) cpupool_unassign_cpu(pool=0,cpu=6)
>> (XEN) cpupool_unassign_cpu(pool=0,cpu=6) ret -16
>> (XEN) cpupool_unassign_cpu(pool=0,cpu=6)
>> (XEN) cpupool_unassign_cpu(pool=0,cpu=6) ret -16
>> (XEN) cpupool_assign_cpu(pool=0,cpu=1)
>> (XEN) cpupool_assign_cpu(pool=0,cpu=1) ffff83083fff74c0
>> (XEN) cpupool_assign_cpu(cpu=1) ret 0
>> (XEN) cpupool_assign_cpu(pool=1,cpu=4)
>> (XEN) cpupool_assign_cpu(pool=1,cpu=4) ffff831002ad5e40
>> (XEN) cpupool_assign_cpu(cpu=4) ret 0
>> (XEN) cpu 4, weight 0,prv ffff831002ad5e40, dom 0:
>> (XEN) sdom->weight: 256, sdom->active_vcpu_count: 1
>> (XEN) Xen BUG at sched_credit.c:570
>> (XEN) ----[ Xen-4.1.0-rc5-pre  x86_64  debug=y  Tainted:    C ]----
>> (XEN) CPU:    4
>> (XEN) RIP:    e008:[<ffff82c4801197d7>] csched_tick+0x186/0x37f
>> (XEN) RFLAGS: 0000000000010086   CONTEXT: hypervisor
>> (XEN) rax: 0000000000000000   rbx: ffff830839d3ec30   rcx: 0000000000000000
>> (XEN) rdx: ffff830839dcff18   rsi: 000000000000000a   rdi: ffff82c4802542e8
>> (XEN) rbp: ffff830839dcfe38   rsp: ffff830839dcfde8   r8:  0000000000000004
>> (XEN) r9:  ffff82c480213520   r10: 00000000fffffffc   r11: 0000000000000001
>> (XEN) r12: 0000000000000004   r13: ffff830839d3ec40   r14: ffff831002ad5e40
>> (XEN) r15: ffff830839d66f90   cr0: 000000008005003b   cr4: 00000000000026f0
>> (XEN) cr3: 0000001020a98000   cr2: 00007fc5e9b79d98
>> (XEN) ds: 0000   es: 0000   fs: 0000   gs: 0000   ss: e010   cs: e008
>> (XEN) Xen stack trace from rsp=ffff830839dcfde8:
>> (XEN)    ffff83083ffa3ba0 ffff831002ad5e40 0000000000000246 ffff830839d6c000
>> (XEN)    0000000000000000 ffff830839dd1100 0000000000000004 ffff82c480119651
>> (XEN)    ffff831002b28018 ffff831002b28010 ffff830839dcfe68 ffff82c480126204
>> (XEN)    0000000000000002 ffff83083ffa3bb8 ffff830839dd1100 000000cae439ea7e
>> (XEN)    ffff830839dcfeb8 ffff82c480126539 00007fc5e9fa5b20 ffff830839dd1100
>> (XEN)    ffff831002b28010 0000000000000004 0000000000000004 ffff82c4802b0880
>> (XEN)    ffff830839dcff18 ffffffffffffffff ffff830839dcfef8 ffff82c480123647
>> (XEN)    ffff830839dcfed8 ffff830077eee000 00007fc5e9b79d98 00007fc5e9fa5b20
>> (XEN)    0000000000000002 00007fff46826f20 ffff830839dcff08 ffff82c4801236c2
>> (XEN)    00007cf7c62300c7 ffff82c480206ad6 00007fff46826f20 0000000000000002
>> (XEN)    00007fc5e9fa5b20 00007fc5e9b79d98 00007fff46827260 00007fff46826f50
>> (XEN)    0000000000000246 0000000000000032 0000000000000000 00000000ffffffff
>> (XEN)    0000000000000009 00007fc5e9d9de1a 0000000000000003 0000000000004848
>> (XEN)    00007fc5e9b7a000 0000010000000000 ffffffff800073f0 000000000000e033
>> (XEN)    0000000000000246 ffff880f97b51fc8 000000000000e02b 0000000000000000
>> (XEN)    0000000000000000 0000000000000000 0000000000000000 0000000000000004
>> (XEN)    ffff830077eee000 00000043b9afd180 0000000000000000
>> (XEN) Xen call trace:
>> (XEN)    [<ffff82c4801197d7>] csched_tick+0x186/0x37f
>> (XEN)    [<ffff82c480126204>] execute_timer+0x4e/0x6c
>> (XEN)    [<ffff82c480126539>] timer_softirq_action+0xf6/0x239
>> (XEN)    [<ffff82c480123647>] __do_softirq+0x88/0x99
>> (XEN)    [<ffff82c4801236c2>] do_softirq+0x6a/0x7a
>> (XEN)
>> (XEN)
>> (XEN) ****************************************
>> (XEN) Panic on CPU 4:
>> (XEN) Xen BUG at sched_credit.c:570
>> (XEN) ****************************************
>>
>> As you can see, a Dom0 vcpus is becoming active on a pool 1 cpu. The BUG_ON
>> triggered in csched_acct() is a logical result of this.
>>
>> How this can happen I don't know yet.
>> Anyone any idea? I'll keep searching...
>>
>>
>> Juergen
>>
>> On 02/15/11 08:22, Juergen Gross wrote:
>>>
>>> On 02/14/11 18:57, George Dunlap wrote:
>>>>
>>>> The good news is, I've managed to reproduce this on my local test
>>>> hardware with 1x4x2 (1 socket, 4 cores, 2 threads per core) using the
>>>> attached script. It's time to go home now, but I should be able to
>>>> dig something up tomorrow.
>>>>
>>>> To use the script:
>>>> * Rename cpupool0 to "p0", and create an empty second pool, "p1"
>>>> * You can modify elements by adding "arg=val" as arguments.
>>>> * Arguments are:
>>>> + dryrun={true,false} Do the work, but don't actually execute any xl
>>>> arguments. Default false.
>>>> + left: Number commands to execute. Default 10.
>>>> + maxcpus: highest numerical value for a cpu. Default 7 (i.e., 0-7 is
>>>> 8 cpus).
>>>> + verbose={true,false} Print what you're doing. Default is true.
>>>>
>>>> The script sometimes attempts to remove the last cpu from cpupool0; in
>>>> this case, libxl will print an error. If the script gets an error
>>>> under that condition, it will ignore it; under any other condition, it
>>>> will print diagnostic information.
>>>>
>>>> What finally crashed it for me was this command:
>>>> # ./cpupool-test.sh verbose=false left=1000
>>>
>>> Nice!
>>> With your script I finally managed to get the error, too. On my box (2
>>> sockets
>>> a 6 cores) I had to use
>>>
>>> ./cpupool-test.sh verbose=false left=10000 maxcpus=11
>>>
>>> to trigger it.
>>> Looking for more data now...
>>>
>>>
>>> Juergen
>>>
>>>>
>>>> -George
>>>>
>>>> On Fri, Feb 11, 2011 at 7:39 AM, Andre
>>>> Przywara<andre.przywara@amd.com>  wrote:
>>>>>
>>>>> Juergen Gross wrote:
>>>>>>
>>>>>> On 02/10/11 15:18, Andre Przywara wrote:
>>>>>>>
>>>>>>> Andre Przywara wrote:
>>>>>>>>
>>>>>>>> On 02/10/2011 07:42 AM, Juergen Gross wrote:
>>>>>>>>>
>>>>>>>>> On 02/09/11 15:21, Juergen Gross wrote:
>>>>>>>>>>
>>>>>>>>>> Andre, George,
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>> What seems to be interesting: I think the problem did always occur
>>>>>>>>>> when
>>>>>>>>>> a new cpupool was created and the first cpu was moved to it.
>>>>>>>>>>
>>>>>>>>>> I think my previous assumption regarding the master_ticker was not
>>>>>>>>>> too bad.
>>>>>>>>>> I think somehow the master_ticker of the new cpupool is becoming
>>>>>>>>>> active
>>>>>>>>>> before the scheduler is really initialized properly. This could
>>>>>>>>>> happen, if
>>>>>>>>>> enough time is spent between alloc_pdata for the cpu to be moved
>>>>>>>>>> and
>>>>>>>>>> the
>>>>>>>>>> critical section in schedule_cpu_switch().
>>>>>>>>>>
>>>>>>>>>> The solution should be to activate the timers only if the
>>>>>>>>>> scheduler is
>>>>>>>>>> ready for them.
>>>>>>>>>>
>>>>>>>>>> George, do you think the master_ticker should be stopped in
>>>>>>>>>> suspend_ticker
>>>>>>>>>> as well? I still see potential problems for entering deep C-States.
>>>>>>>>>> I think
>>>>>>>>>> I'll prepare a patch which will keep the master_ticker active
>>>>>>>>>> for the
>>>>>>>>>> C-State case and migrate it for the schedule_cpu_switch() case.
>>>>>>>>>
>>>>>>>>> Okay, here is a patch for this. It ran on my 4-core machine
>>>>>>>>> without any
>>>>>>>>> problems.
>>>>>>>>> Andre, could you give it a try?
>>>>>>>>
>>>>>>>> Did, but unfortunately it crashed as always. Tried twice and made
>>>>>>>> sure
>>>>>>>> I booted the right kernel. Sorry.
>>>>>>>> The idea with the race between the timer and the state changing
>>>>>>>> sounded very appealing, actually that was suspicious to me from the
>>>>>>>> beginning.
>>>>>>>>
>>>>>>>> I will add some code to dump the state of all cpupools to the BUG_ON
>>>>>>>> to see in which situation we are when the bug triggers.
>>>>>>>
>>>>>>> OK, here is a first try of this, the patch iterates over all CPU pools
>>>>>>> and outputs some data if the BUG_ON
>>>>>>> ((sdom->weight * sdom->active_vcpu_count)>  weight_left) condition
>>>>>>> triggers:
>>>>>>> (XEN) CPU pool #0: 1 domains (SMP Credit Scheduler), mask:
>>>>>>> fffffffc003f
>>>>>>> (XEN) CPU pool #1: 0 domains (SMP Credit Scheduler), mask: fc0
>>>>>>> (XEN) CPU pool #2: 0 domains (SMP Credit Scheduler), mask: 1000
>>>>>>> (XEN) Xen BUG at sched_credit.c:1010
>>>>>>> ....
>>>>>>> The masks look proper (6 cores per node), the bug triggers when the
>>>>>>> first CPU is about to be(?) inserted.
>>>>>>
>>>>>> Sure? I'm missing the cpu with mask 2000.
>>>>>> I'll try to reproduce the problem on a larger machine here (24 cores, 4
>>>>>> numa
>>>>>> nodes).
>>>>>> Andre, can you give me your xen boot parameters? Which xen changeset
>>>>>> are
>>>>>> you
>>>>>> running, and do you have any additional patches in use?
>>>>>
>>>>> The grub lines:
>>>>> kernel (hd1,0)/boot/xen-22858_debug_04.gz console=com1,vga com1=115200
>>>>> module (hd1,0)/boot/vmlinuz-2.6.32.27_pvops console=tty0
>>>>> console=ttyS0,115200 ro root=/dev/sdb1 xencons=hvc0
>>>>>
>>>>> All of my experiments are use c/s 22858 as a base.
>>>>> If you use a AMD Magny-Cours box for your experiments (socket C32 or
>>>>> G34),
>>>>> you should add the following patch (removing the line)
>>>>> --- a/xen/arch/x86/traps.c
>>>>> +++ b/xen/arch/x86/traps.c
>>>>> @@ -803,7 +803,6 @@ static void pv_cpuid(struct cpu_user_regs *regs)
>>>>> __clear_bit(X86_FEATURE_SKINIT % 32,&c);
>>>>> __clear_bit(X86_FEATURE_WDT % 32,&c);
>>>>> __clear_bit(X86_FEATURE_LWP % 32,&c);
>>>>> - __clear_bit(X86_FEATURE_NODEID_MSR % 32,&c);
>>>>> __clear_bit(X86_FEATURE_TOPOEXT % 32,&c);
>>>>> break;
>>>>> case 5: /* MONITOR/MWAIT */
>>>>>
>>>>> This is not necessary (in fact that reverts my patch c/s 22815), but
>>>>> raises
>>>>> the probability to trigger the bug, probably because it increases the
>>>>> pressure of the Dom0 scheduler. If you cannot trigger it with Dom0,
>>>>> try to
>>>>> create a guest with many VCPUs and squeeze it into a small CPU-pool.
>>>>>
>>>>> Good luck ;-)
>>>>> Andre.
>>>>>
>>>>> --
>>>>> Andre Przywara
>>>>> AMD-OSRC (Dresden)
>>>>> Tel: x29712
>>>>>
>>>>>
>>>>> _______________________________________________
>>>>> Xen-devel mailing list
>>>>> Xen-devel@lists.xensource.com
>>>>> http://lists.xensource.com/xen-devel
>>>>>
>>>>>
>>>>>
>>>>> _______________________________________________
>>>>> Xen-devel mailing list
>>>>> Xen-devel@lists.xensource.com
>>>>> http://lists.xensource.com/xen-devel
>>>
>>>
>>
>>
>> --
>> Juergen Gross                 Principal Developer Operating Systems
>> TSP ES&S SWE OS6                       Telephone: +49 (0) 89 3222 2967
>> Fujitsu Technology Solutions              e-mail:
>> juergen.gross@ts.fujitsu.com
>> Domagkstr. 28                           Internet: ts.fujitsu.com
>> D-80807 Muenchen                 Company details:
>> ts.fujitsu.com/imprint.html
>>
>> _______________________________________________
>> Xen-devel mailing list
>> Xen-devel@lists.xensource.com
>> http://lists.xensource.com/xen-devel
>>
>>
>>
>> _______________________________________________
>> Xen-devel mailing list
>> Xen-devel@lists.xensource.com
>> http://lists.xensource.com/xen-devel


-- 
Juergen Gross                 Principal Developer Operating Systems
TSP ES&S SWE OS6                       Telephone: +49 (0) 89 3222 2967
Fujitsu Technology Solutions              e-mail: juergen.gross@ts.fujitsu.com
Domagkstr. 28                           Internet: ts.fujitsu.com
D-80807 Muenchen                 Company details: ts.fujitsu.com/imprint.html

[-- Attachment #2: cpupool-race.patch --]
[-- Type: text/x-patch, Size: 2617 bytes --]

diff -r 72470de157ce xen/common/sched_credit.c
--- a/xen/common/sched_credit.c	Wed Feb 16 09:49:33 2011 +0000
+++ b/xen/common/sched_credit.c	Wed Feb 16 15:09:54 2011 +0100
@@ -1268,7 +1268,8 @@ csched_load_balance(struct csched_privat
         /*
          * Any work over there to steal?
          */
-        speer = csched_runq_steal(peer_cpu, cpu, snext->pri);
+        speer = cpu_isset(peer_cpu, *online) ?
+            csched_runq_steal(peer_cpu, cpu, snext->pri) : NULL;
         pcpu_schedule_unlock(peer_cpu);
         if ( speer != NULL )
         {
diff -r 72470de157ce xen/common/schedule.c
--- a/xen/common/schedule.c	Wed Feb 16 09:49:33 2011 +0000
+++ b/xen/common/schedule.c	Wed Feb 16 15:09:54 2011 +0100
@@ -395,7 +395,28 @@ static void vcpu_migrate(struct vcpu *v)
     unsigned long flags;
     int old_cpu, new_cpu;
 
-    vcpu_schedule_lock_irqsave(v, flags);
+    for (;;)
+    {
+        vcpu_schedule_lock_irqsave(v, flags);
+
+        /* Select new CPU. */
+        old_cpu = v->processor;
+        new_cpu = SCHED_OP(VCPU2OP(v), pick_cpu, v);
+
+        if ( new_cpu == old_cpu )
+            break;
+
+        if ( !pcpu_schedule_trylock(new_cpu) )
+        {
+            vcpu_schedule_unlock_irqrestore(v, flags);
+            continue;
+        }
+        if ( cpu_isset(new_cpu, v->domain->cpupool->cpu_valid) )
+            break;
+
+        pcpu_schedule_unlock(new_cpu);
+        vcpu_schedule_unlock_irqrestore(v, flags);
+    }
 
     /*
      * NB. Check of v->running happens /after/ setting migration flag
@@ -405,13 +426,12 @@ static void vcpu_migrate(struct vcpu *v)
     if ( v->is_running ||
          !test_and_clear_bit(_VPF_migrating, &v->pause_flags) )
     {
+        if ( old_cpu != new_cpu )
+            pcpu_schedule_unlock(new_cpu);
+
         vcpu_schedule_unlock_irqrestore(v, flags);
         return;
     }
-
-    /* Select new CPU. */
-    old_cpu = v->processor;
-    new_cpu = SCHED_OP(VCPU2OP(v), pick_cpu, v);
 
     /*
      * Transfer urgency status to new CPU before switching CPUs, as once
@@ -424,9 +444,13 @@ static void vcpu_migrate(struct vcpu *v)
         atomic_dec(&per_cpu(schedule_data, old_cpu).urgent_count);
     }
 
-    /* Switch to new CPU, then unlock old CPU.  This is safe because
+    /* Switch to new CPU, then unlock new and old CPU.  This is safe because
      * the lock pointer cant' change while the current lock is held. */
     v->processor = new_cpu;
+
+    if ( old_cpu != new_cpu )
+        pcpu_schedule_unlock(new_cpu);
+
     spin_unlock_irqrestore(
         per_cpu(schedule_data, old_cpu).schedule_lock, flags);
 

[-- Attachment #3: Type: text/plain, Size: 138 bytes --]

_______________________________________________
Xen-devel mailing list
Xen-devel@lists.xensource.com
http://lists.xensource.com/xen-devel

^ permalink raw reply	[flat|nested] 53+ messages in thread

* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-02-16 14:11                                                                     ` Juergen Gross
@ 2011-02-16 14:28                                                                       ` Juergen Gross
  2011-02-17  0:05                                                                       ` André Przywara
  1 sibling, 0 replies; 53+ messages in thread
From: Juergen Gross @ 2011-02-16 14:28 UTC (permalink / raw)
  To: George Dunlap; +Cc: Andre Przywara, xen-devel, Diestelhorst, Stephan

On 02/16/11 15:11, Juergen Gross wrote:
> On 02/16/11 14:54, George Dunlap wrote:
>> Andre (and Juergen), can you try again with the attached patch?
>>
>> What the patch basically does is try to make "cpu_disable_scheduler()"
>> do what it seems to say it does. :-) Namely, the various
>> scheduler-related interrutps (both per-cpu ticks and the master tick)
>> is a part of the scheduler, so disable them before doing anything, and
>> don't enable them until the cpu is really ready to go again.
>>
>> To be precise:
>> * cpu_disable_scheduler() disables ticks
>> * scheduler_cpu_switch() only enables ticks if adding a cpu to a pool,
>> and does it after inserting the idle vcpu
>> * Modify semantics, s.t., {alloc,free}_pdata() don't actually start or
>> stop tickers
>> + Call tick_{resume,suspend} in cpu_{up,down}, respectively
>
> I tried this before :-)
> It didn't work for Andre, but may be there were some bits missing.
>
>> * Modify credit1's tick_{suspend,resume} to handle the master ticker
>> as well.
>>
>> With this patch (if dom0 doesn't get wedged due to all 8 vcpus being
>> on one pcpu), I can perform thousands of operations successfully.
>
> Nice. I'll try later. In the moment I'm testing another patch (attached
> for review, if you like). I think I've identified two possible races.

My patch works for me. I think I have to rework the locking for credit1, but
that shouldn't be too hard.

My machine survived 10000 iterations of your script with additional
consistency checks in the scheduler. Without my patch the machine crashed
after less than 500 iterations.


Juergen

-- 
Juergen Gross                 Principal Developer Operating Systems
TSP ES&S SWE OS6                       Telephone: +49 (0) 89 3222 2967
Fujitsu Technology Solutions              e-mail: juergen.gross@ts.fujitsu.com
Domagkstr. 28                           Internet: ts.fujitsu.com
D-80807 Muenchen                 Company details: ts.fujitsu.com/imprint.html

^ permalink raw reply	[flat|nested] 53+ messages in thread

* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-02-16 14:11                                                                     ` Juergen Gross
  2011-02-16 14:28                                                                       ` Juergen Gross
@ 2011-02-17  0:05                                                                       ` André Przywara
  1 sibling, 0 replies; 53+ messages in thread
From: André Przywara @ 2011-02-17  0:05 UTC (permalink / raw)
  To: Juergen Gross; +Cc: George Dunlap, xen-devel, Diestelhorst, Stephan

On 02/16/11 15:11, Juergen Gross wrote:
> On 02/16/11 14:54, George Dunlap wrote:
>> Andre (and Juergen), can you try again with the attached patch?
George, Juergen, thanks for all your work on this!
I will try the patch as soon as I am back in the office this afternoon.

Regards,
Andre.

>>
>> What the patch basically does is try to make "cpu_disable_scheduler()"
>> do what it seems to say it does. :-)  Namely, the various
>> scheduler-related interrupts (both per-cpu ticks and the master tick)
>> are part of the scheduler, so disable them before doing anything, and
>> don't enable them until the cpu is really ready to go again.
>>
>> To be precise:
>> * cpu_disable_scheduler() disables ticks
>> * scheduler_cpu_switch() only enables ticks if adding a cpu to a pool,
>> and does it after inserting the idle vcpu
>> * Modify semantics, s.t., {alloc,free}_pdata() don't actually start or
>> stop tickers
>>    + Call tick_{resume,suspend} in cpu_{up,down}, respectively
>
> I tried this before :-)
>> It didn't work for Andre, but maybe there were some bits missing.
>
>> * Modify credit1's tick_{suspend,resume} to handle the master ticker as well.
>>
>> With this patch (if dom0 doesn't get wedged due to all 8 vcpus being
>> on one pcpu), I can perform thousands of operations successfully.
>
>> Nice. I'll try later. At the moment I'm testing another patch (attached
> for review, if you like). I think I've identified two possible races.
>
>
> Juergen
>
>>
>> (NB this is not ready for application yet, I just wanted to check to
>> see if it fixes Andre's problem)
>>
>>    -George
>>
>> On Wed, Feb 16, 2011 at 9:47 AM, Juergen Gross
>> <juergen.gross@ts.fujitsu.com>   wrote:
>>> Okay, I have some more data.
>>>
>>> I activated cpupool_dprintk() and included checks in sched_credit.c to
>>> test for weight inconsistencies. To reduce race possibilities I've added
>>> my patch to execute cpu assigning/unassigning always in a tasklet on the
>>> cpu to be moved.
>>>
>>> Here is the result:
>>>
>>> (XEN) cpupool_unassign_cpu(pool=0,cpu=6)
>>> (XEN) cpupool_unassign_cpu(pool=0,cpu=6) ret -16
>>> (XEN) cpupool_unassign_cpu(pool=0,cpu=6)
>>> (XEN) cpupool_unassign_cpu(pool=0,cpu=6) ret -16
>>> (XEN) cpupool_assign_cpu(pool=0,cpu=1)
>>> (XEN) cpupool_assign_cpu(pool=0,cpu=1) ffff83083fff74c0
>>> (XEN) cpupool_assign_cpu(cpu=1) ret 0
>>> (XEN) cpupool_assign_cpu(pool=1,cpu=4)
>>> (XEN) cpupool_assign_cpu(pool=1,cpu=4) ffff831002ad5e40
>>> (XEN) cpupool_assign_cpu(cpu=4) ret 0
>>> (XEN) cpu 4, weight 0,prv ffff831002ad5e40, dom 0:
>>> (XEN) sdom->weight: 256, sdom->active_vcpu_count: 1
>>> (XEN) Xen BUG at sched_credit.c:570
>>> (XEN) ----[ Xen-4.1.0-rc5-pre  x86_64  debug=y  Tainted:    C ]----
>>> (XEN) CPU:    4
>>> (XEN) RIP:    e008:[<ffff82c4801197d7>] csched_tick+0x186/0x37f
>>> (XEN) RFLAGS: 0000000000010086   CONTEXT: hypervisor
>>> (XEN) rax: 0000000000000000   rbx: ffff830839d3ec30   rcx: 0000000000000000
>>> (XEN) rdx: ffff830839dcff18   rsi: 000000000000000a   rdi: ffff82c4802542e8
>>> (XEN) rbp: ffff830839dcfe38   rsp: ffff830839dcfde8   r8:  0000000000000004
>>> (XEN) r9:  ffff82c480213520   r10: 00000000fffffffc   r11: 0000000000000001
>>> (XEN) r12: 0000000000000004   r13: ffff830839d3ec40   r14: ffff831002ad5e40
>>> (XEN) r15: ffff830839d66f90   cr0: 000000008005003b   cr4: 00000000000026f0
>>> (XEN) cr3: 0000001020a98000   cr2: 00007fc5e9b79d98
>>> (XEN) ds: 0000   es: 0000   fs: 0000   gs: 0000   ss: e010   cs: e008
>>> (XEN) Xen stack trace from rsp=ffff830839dcfde8:
>>> (XEN)    ffff83083ffa3ba0 ffff831002ad5e40 0000000000000246 ffff830839d6c000
>>> (XEN)    0000000000000000 ffff830839dd1100 0000000000000004 ffff82c480119651
>>> (XEN)    ffff831002b28018 ffff831002b28010 ffff830839dcfe68 ffff82c480126204
>>> (XEN)    0000000000000002 ffff83083ffa3bb8 ffff830839dd1100 000000cae439ea7e
>>> (XEN)    ffff830839dcfeb8 ffff82c480126539 00007fc5e9fa5b20 ffff830839dd1100
>>> (XEN)    ffff831002b28010 0000000000000004 0000000000000004 ffff82c4802b0880
>>> (XEN)    ffff830839dcff18 ffffffffffffffff ffff830839dcfef8 ffff82c480123647
>>> (XEN)    ffff830839dcfed8 ffff830077eee000 00007fc5e9b79d98 00007fc5e9fa5b20
>>> (XEN)    0000000000000002 00007fff46826f20 ffff830839dcff08 ffff82c4801236c2
>>> (XEN)    00007cf7c62300c7 ffff82c480206ad6 00007fff46826f20 0000000000000002
>>> (XEN)    00007fc5e9fa5b20 00007fc5e9b79d98 00007fff46827260 00007fff46826f50
>>> (XEN)    0000000000000246 0000000000000032 0000000000000000 00000000ffffffff
>>> (XEN)    0000000000000009 00007fc5e9d9de1a 0000000000000003 0000000000004848
>>> (XEN)    00007fc5e9b7a000 0000010000000000 ffffffff800073f0 000000000000e033
>>> (XEN)    0000000000000246 ffff880f97b51fc8 000000000000e02b 0000000000000000
>>> (XEN)    0000000000000000 0000000000000000 0000000000000000 0000000000000004
>>> (XEN)    ffff830077eee000 00000043b9afd180 0000000000000000
>>> (XEN) Xen call trace:
>>> (XEN)    [<ffff82c4801197d7>] csched_tick+0x186/0x37f
>>> (XEN)    [<ffff82c480126204>] execute_timer+0x4e/0x6c
>>> (XEN)    [<ffff82c480126539>] timer_softirq_action+0xf6/0x239
>>> (XEN)    [<ffff82c480123647>] __do_softirq+0x88/0x99
>>> (XEN)    [<ffff82c4801236c2>] do_softirq+0x6a/0x7a
>>> (XEN)
>>> (XEN)
>>> (XEN) ****************************************
>>> (XEN) Panic on CPU 4:
>>> (XEN) Xen BUG at sched_credit.c:570
>>> (XEN) ****************************************
>>>
>>> As you can see, a Dom0 vcpu is becoming active on a pool 1 cpu. The BUG_ON
>>> triggered in csched_acct() is a logical result of this.
>>>
>>> How this can happen I don't know yet.
>>> Anyone any idea? I'll keep searching...
>>>
>>>
>>> Juergen
>>>
>>> On 02/15/11 08:22, Juergen Gross wrote:
>>>>
>>>> On 02/14/11 18:57, George Dunlap wrote:
>>>>>
>>>>> The good news is, I've managed to reproduce this on my local test
>>>>> hardware with 1x4x2 (1 socket, 4 cores, 2 threads per core) using the
>>>>> attached script. It's time to go home now, but I should be able to
>>>>> dig something up tomorrow.
>>>>>
>>>>> To use the script:
>>>>> * Rename cpupool0 to "p0", and create an empty second pool, "p1"
>>>>> * You can modify elements by adding "arg=val" as arguments.
>>>>> * Arguments are:
>>>>> + dryrun={true,false} Do the work, but don't actually execute any xl
>>>>> arguments. Default false.
>>>>> + left: Number of commands to execute. Default 10.
>>>>> + maxcpus: highest numerical value for a cpu. Default 7 (i.e., 0-7 is
>>>>> 8 cpus).
>>>>> + verbose={true,false} Print what you're doing. Default is true.
>>>>>
>>>>> The script sometimes attempts to remove the last cpu from cpupool0; in
>>>>> this case, libxl will print an error. If the script gets an error
>>>>> under that condition, it will ignore it; under any other condition, it
>>>>> will print diagnostic information.
>>>>>
>>>>> What finally crashed it for me was this command:
>>>>> # ./cpupool-test.sh verbose=false left=1000
>>>>
>>>> Nice!
>>>> With your script I finally managed to get the error, too. On my box (2
>>>> sockets with 6 cores each) I had to use
>>>>
>>>> ./cpupool-test.sh verbose=false left=10000 maxcpus=11
>>>>
>>>> to trigger it.
>>>> Looking for more data now...
>>>>
>>>>
>>>> Juergen
>>>>
>>>>>
>>>>> -George
>>>>>
>>>>> On Fri, Feb 11, 2011 at 7:39 AM, Andre
>>>>> Przywara<andre.przywara@amd.com>   wrote:
>>>>>>
>>>>>> Juergen Gross wrote:
>>>>>>>
>>>>>>> On 02/10/11 15:18, Andre Przywara wrote:
>>>>>>>>
>>>>>>>> Andre Przywara wrote:
>>>>>>>>>
>>>>>>>>> On 02/10/2011 07:42 AM, Juergen Gross wrote:
>>>>>>>>>>
>>>>>>>>>> On 02/09/11 15:21, Juergen Gross wrote:
>>>>>>>>>>>
>>>>>>>>>>> Andre, George,
>>>>>>>>>>>
>>>>>>>>>>>
>>>>>>>>>>> What seems to be interesting: I think the problem did always occur
>>>>>>>>>>> when
>>>>>>>>>>> a new cpupool was created and the first cpu was moved to it.
>>>>>>>>>>>
>>>>>>>>>>> I think my previous assumption regarding the master_ticker was not
>>>>>>>>>>> too bad.
>>>>>>>>>>> I think somehow the master_ticker of the new cpupool is becoming
>>>>>>>>>>> active
>>>>>>>>>>> before the scheduler is really initialized properly. This could
>>>>>>>>>>> happen, if
>>>>>>>>>>> enough time is spent between alloc_pdata for the cpu to be moved
>>>>>>>>>>> and
>>>>>>>>>>> the
>>>>>>>>>>> critical section in schedule_cpu_switch().
>>>>>>>>>>>
>>>>>>>>>>> The solution should be to activate the timers only if the
>>>>>>>>>>> scheduler is
>>>>>>>>>>> ready for them.
>>>>>>>>>>>
>>>>>>>>>>> George, do you think the master_ticker should be stopped in
>>>>>>>>>>> suspend_ticker
>>>>>>>>>>> as well? I still see potential problems for entering deep C-States.
>>>>>>>>>>> I think
>>>>>>>>>>> I'll prepare a patch which will keep the master_ticker active
>>>>>>>>>>> for the
>>>>>>>>>>> C-State case and migrate it for the schedule_cpu_switch() case.
>>>>>>>>>>
>>>>>>>>>> Okay, here is a patch for this. It ran on my 4-core machine
>>>>>>>>>> without any
>>>>>>>>>> problems.
>>>>>>>>>> Andre, could you give it a try?
>>>>>>>>>
>>>>>>>>> Did, but unfortunately it crashed as always. Tried twice and made
>>>>>>>>> sure
>>>>>>>>> I booted the right kernel. Sorry.
>>>>>>>>> The idea with the race between the timer and the state changing
>>>>>>>>> sounded very appealing, actually that was suspicious to me from the
>>>>>>>>> beginning.
>>>>>>>>>
>>>>>>>>> I will add some code to dump the state of all cpupools to the BUG_ON
>>>>>>>>> to see in which situation we are when the bug triggers.
>>>>>>>>
>>>>>>>> OK, here is a first try of this, the patch iterates over all CPU pools
>>>>>>>> and outputs some data if the BUG_ON
>>>>>>>> ((sdom->weight * sdom->active_vcpu_count)>   weight_left) condition
>>>>>>>> triggers:
>>>>>>>> (XEN) CPU pool #0: 1 domains (SMP Credit Scheduler), mask:
>>>>>>>> fffffffc003f
>>>>>>>> (XEN) CPU pool #1: 0 domains (SMP Credit Scheduler), mask: fc0
>>>>>>>> (XEN) CPU pool #2: 0 domains (SMP Credit Scheduler), mask: 1000
>>>>>>>> (XEN) Xen BUG at sched_credit.c:1010
>>>>>>>> ....
>>>>>>>> The masks look proper (6 cores per node), the bug triggers when the
>>>>>>>> first CPU is about to be(?) inserted.
>>>>>>>
>>>>>>> Sure? I'm missing the cpu with mask 2000.
>>>>>>> I'll try to reproduce the problem on a larger machine here (24 cores, 4
>>>>>>> numa
>>>>>>> nodes).
>>>>>>> Andre, can you give me your xen boot parameters? Which xen changeset
>>>>>>> are
>>>>>>> you
>>>>>>> running, and do you have any additional patches in use?
>>>>>>
>>>>>> The grub lines:
>>>>>> kernel (hd1,0)/boot/xen-22858_debug_04.gz console=com1,vga com1=115200
>>>>>> module (hd1,0)/boot/vmlinuz-2.6.32.27_pvops console=tty0
>>>>>> console=ttyS0,115200 ro root=/dev/sdb1 xencons=hvc0
>>>>>>
>>>>>> All of my experiments use c/s 22858 as a base.
>>>>>> If you use an AMD Magny-Cours box for your experiments (socket C32 or
>>>>>> G34),
>>>>>> you should add the following patch (removing the line)
>>>>>> --- a/xen/arch/x86/traps.c
>>>>>> +++ b/xen/arch/x86/traps.c
>>>>>> @@ -803,7 +803,6 @@ static void pv_cpuid(struct cpu_user_regs *regs)
>>>>>> __clear_bit(X86_FEATURE_SKINIT % 32,&c);
>>>>>> __clear_bit(X86_FEATURE_WDT % 32,&c);
>>>>>> __clear_bit(X86_FEATURE_LWP % 32,&c);
>>>>>> - __clear_bit(X86_FEATURE_NODEID_MSR % 32,&c);
>>>>>> __clear_bit(X86_FEATURE_TOPOEXT % 32,&c);
>>>>>> break;
>>>>>> case 5: /* MONITOR/MWAIT */
>>>>>>
>>>>>> This is not necessary (in fact that reverts my patch c/s 22815), but
>>>>>> raises
>>>>>> the probability of triggering the bug, probably because it increases the
>>>>>> pressure on the Dom0 scheduler. If you cannot trigger it with Dom0,
>>>>>> try to
>>>>>> create a guest with many VCPUs and squeeze it into a small CPU-pool.
>>>>>>
>>>>>> Good luck ;-)
>>>>>> Andre.
>>>>>>
>>>>>> --
>>>>>> Andre Przywara
>>>>>> AMD-OSRC (Dresden)
>>>>>> Tel: x29712
>>>>>>
>>>>>>
>>>>>> _______________________________________________
>>>>>> Xen-devel mailing list
>>>>>> Xen-devel@lists.xensource.com
>>>>>> http://lists.xensource.com/xen-devel
>>>>>>
>>>>
>>>>
>>>
>>>
>>> --
>>> Juergen Gross                 Principal Developer Operating Systems
>>> TSP ES&S SWE OS6                       Telephone: +49 (0) 89 3222 2967
>>> Fujitsu Technology Solutions              e-mail:
>>> juergen.gross@ts.fujitsu.com
>>> Domagkstr. 28                           Internet: ts.fujitsu.com
>>> D-80807 Muenchen                 Company details:
>>> ts.fujitsu.com/imprint.html
>>>
>>> _______________________________________________
>>> Xen-devel mailing list
>>> Xen-devel@lists.xensource.com
>>> http://lists.xensource.com/xen-devel
>>>
>
>
> --
> Juergen Gross                 Principal Developer Operating Systems
> TSP ES&S SWE OS6                       Telephone: +49 (0) 89 3222 2967
> Fujitsu Technology Solutions              e-mail: juergen.gross@ts.fujitsu.com
> Domagkstr. 28                           Internet: ts.fujitsu.com
> D-80807 Muenchen                 Company details: ts.fujitsu.com/imprint.html


-- 
Andre Przywara
AMD-Operating System Research Center (OSRC), Dresden, Germany

^ permalink raw reply	[flat|nested] 53+ messages in thread

* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-02-16 13:54                                                                   ` George Dunlap
       [not found]                                                                     ` <4D6237C6.1050206@amd.com>
  2011-02-16 14:11                                                                     ` Juergen Gross
@ 2011-02-17  7:05                                                                     ` Juergen Gross
  2011-02-17  9:11                                                                       ` Juergen Gross
  2011-02-21 10:00                                                                     ` Andre Przywara
  3 siblings, 1 reply; 53+ messages in thread
From: Juergen Gross @ 2011-02-17  7:05 UTC (permalink / raw)
  To: George Dunlap; +Cc: Andre Przywara, xen-devel, Diestelhorst, Stephan

On 02/16/11 14:54, George Dunlap wrote:
> Andre (and Juergen), can you try again with the attached patch?
>
> What the patch basically does is try to make "cpu_disable_scheduler()"
> do what it seems to say it does. :-)  Namely, the various
> scheduler-related interrupts (both per-cpu ticks and the master tick)
> are part of the scheduler, so disable them before doing anything, and
> don't enable them until the cpu is really ready to go again.
>
> To be precise:
> * cpu_disable_scheduler() disables ticks
> * scheduler_cpu_switch() only enables ticks if adding a cpu to a pool,
> and does it after inserting the idle vcpu
> * Modify semantics, s.t., {alloc,free}_pdata() don't actually start or
> stop tickers
>   + Call tick_{resume,suspend} in cpu_{up,down}, respectively
> * Modify credit1's tick_{suspend,resume} to handle the master ticker as well.
>
> With this patch (if dom0 doesn't get wedged due to all 8 vcpus being
> on one pcpu), I can perform thousands of operations successfully.
>
> (NB this is not ready for application yet, I just wanted to check to
> see if it fixes Andre's problem)

After some thousand iterations the machine hung; after dumping Dom0
registers to the console it continued running and crashed about a second later:

(XEN) cpupool_unassign_cpu(pool=0,cpu=9)
(XEN) cpupool_unassign_cpu(pool=0,cpu=9) ffff83083fff74c0
(XEN) cpupool_unassign_cpu ret=0
(XEN) cpupool_unassign_cpu(pool=0,cpu=4)
(XEN) cpupool_unassign_cpu(pool=0,cpu=4) ffff83083fff74c0
(XEN) cpupool_unassign_cpu ret=0
(XEN) cpupool_assign_cpu(pool=1,cpu=9)
(XEN) cpupool_assign_cpu(pool=1,cpu=9) ffff83083002de40
(XEN) Assertion 'timer->status >= TIMER_STATUS_inactive' failed at timer.c:279
(XEN) ----[ Xen-4.1.0-rc5-pre  x86_64  debug=y  Tainted:    C ]----
(XEN) CPU:    9
(XEN) RIP:    e008:[<ffff82c480126100>] active_timer+0xc/0x37
(XEN) RFLAGS: 0000000000010046   CONTEXT: hypervisor
(XEN) rax: 0000000000000000   rbx: 0000000000000000   rcx: 0000000000000000
(XEN) rdx: ffff830839d8ff18   rsi: 0000010dbb628a80   rdi: ffff83083ffbcf98
(XEN) rbp: ffff830839d8fd50   rsp: ffff830839d8fd50   r8:  ffff83083ffbcf90
(XEN) r9:  ffff82c480213680   r10: 00000000ffffffff   r11: 0000000000000010
(XEN) r12: ffff82c4802d3f80   r13: ffff82c4802d3f80   r14: ffff83083ffbcf98
(XEN) r15: ffff83083ffbcfc0   cr0: 000000008005003b   cr4: 00000000000026f0
(XEN) cr3: 000000007809c000   cr2: 0000000000620048
(XEN) ds: 002b   es: 002b   fs: 0000   gs: 0000   ss: e010   cs: e008
(XEN) Xen stack trace from rsp=ffff830839d8fd50:
(XEN)    ffff830839d8fda0 ffff82c480126ef9 0000000000000000 0000010dbb628a80
(XEN)    0000000000000086 0000000000000009 ffff83083002de40 ffff83083002dd50
(XEN)    0000000000000009 0000000000000009 ffff830839d8fdc0 ffff82c480117906
(XEN)    ffff83083ffa3b40 ffff83083ffa5d70 ffff830839d8fe30 ffff82c4801214fa
(XEN)    ffff83083002dd00 0000000900000100 0000000000000286 ffff8300780da000
(XEN)    ffff83083ffbcf80 ffff83083ffbcf90 ffff82c480247e00 0000000000000009
(XEN)    00000000fffffff0 ffff83083002dd00 0000000000000000 ffff8300781cc198
(XEN)    ffff830839d8fe60 ffff82c4801019ff 0000000000000009 0000000000000009
(XEN)    ffff8300781cc198 ffff830839d990d0 ffff830839d8fe80 ffff82c480101bd9
(XEN)    ffff83107e80c5b0 ffff8300781cc000 ffff830839d8fea0 ffff82c480104f21
(XEN)    0000000000000009 ffff830839d990e0 ffff830839d8fee0 ffff82c480125b6c
(XEN)    ffff82c48024a020 ffff830839d8ff18 ffff82c48024a020 ffff830839d8ff18
(XEN)    ffff830839d99060 ffff830839d99040 ffff830839d8ff10 ffff82c48015645a
(XEN)    0000000000000000 ffff8300780da000 ffff8300780da000 ffffffffffffffff
(XEN)    ffff830839d8fe00 0000000000000000 0000000000000000 0000000000000000
(XEN)    0000000000000000 ffffffff8062bda0 ffff880fbb1e5fd8 0000000000000246
(XEN)    0000000000000000 000000010003347d 0000000000000000 0000000000000000
(XEN)    ffffffff800033aa 00000000deadbeef 00000000deadbeef 00000000deadbeef
(XEN)    0000010000000000 ffffffff800033aa 000000000000e033 0000000000000246
(XEN)    ffff880fbb1e5f08 000000000000e02b 0000000000000000 0000000000000000
(XEN) Xen call trace:
(XEN)    [<ffff82c480126100>] active_timer+0xc/0x37
(XEN)    [<ffff82c480126ef9>] set_timer+0x102/0x218
(XEN)    [<ffff82c480117906>] csched_tick_resume+0x53/0x75
(XEN)    [<ffff82c4801214fa>] schedule_cpu_switch+0x1f1/0x25c
(XEN)    [<ffff82c4801019ff>] cpupool_assign_cpu_locked+0x61/0xd6
(XEN)    [<ffff82c480101bd9>] cpupool_assign_cpu_helper+0x9f/0xcd
(XEN)    [<ffff82c480104f21>] continue_hypercall_tasklet_handler+0x51/0xc3
(XEN)    [<ffff82c480125b6c>] do_tasklet+0xe1/0x155
(XEN)    [<ffff82c48015645a>] idle_loop+0x5f/0x67
(XEN)
(XEN)
(XEN) ****************************************
(XEN) Panic on CPU 9:
(XEN) Assertion 'timer->status >= TIMER_STATUS_inactive' failed at timer.c:279
(XEN) ****************************************


Juergen

-- 
Juergen Gross                 Principal Developer Operating Systems
TSP ES&S SWE OS6                       Telephone: +49 (0) 89 3222 2967
Fujitsu Technology Solutions              e-mail: juergen.gross@ts.fujitsu.com
Domagkstr. 28                           Internet: ts.fujitsu.com
D-80807 Muenchen                 Company details: ts.fujitsu.com/imprint.html

^ permalink raw reply	[flat|nested] 53+ messages in thread

* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-02-17  7:05                                                                     ` Juergen Gross
@ 2011-02-17  9:11                                                                       ` Juergen Gross
  0 siblings, 0 replies; 53+ messages in thread
From: Juergen Gross @ 2011-02-17  9:11 UTC (permalink / raw)
  To: George Dunlap; +Cc: Andre Przywara, xen-devel, Diestelhorst, Stephan

On 02/17/11 08:05, Juergen Gross wrote:
> On 02/16/11 14:54, George Dunlap wrote:
>> Andre (and Juergen), can you try again with the attached patch?
>>
>> What the patch basically does is try to make "cpu_disable_scheduler()"
>> do what it seems to say it does. :-) Namely, the various
>> scheduler-related interrupts (both per-cpu ticks and the master tick)
>> are part of the scheduler, so disable them before doing anything, and
>> don't enable them until the cpu is really ready to go again.
>>
>> To be precise:
>> * cpu_disable_scheduler() disables ticks
>> * scheduler_cpu_switch() only enables ticks if adding a cpu to a pool,
>> and does it after inserting the idle vcpu
>> * Modify semantics, s.t., {alloc,free}_pdata() don't actually start or
>> stop tickers
>> + Call tick_{resume,suspend} in cpu_{up,down}, respectively
>> * Modify credit1's tick_{suspend,resume} to handle the master ticker
>> as well.
>>
>> With this patch (if dom0 doesn't get wedged due to all 8 vcpus being
>> on one pcpu), I can perform thousands of operations successfully.
>>
>> (NB this is not ready for application yet, I just wanted to check to
>> see if it fixes Andre's problem)

Tried again, this time with the following patch:

diff -r 72470de157ce xen/common/sched_credit.c
--- a/xen/common/sched_credit.c Wed Feb 16 09:49:33 2011 +0000
+++ b/xen/common/sched_credit.c Wed Feb 16 15:09:54 2011 +0100
@@ -1268,7 +1268,8 @@ csched_load_balance(struct csched_privat
          /*
           * Any work over there to steal?
           */
-        speer = csched_runq_steal(peer_cpu, cpu, snext->pri);
+        speer = cpu_isset(peer_cpu, *online) ?
+            csched_runq_steal(peer_cpu, cpu, snext->pri) : NULL;
          pcpu_schedule_unlock(peer_cpu);
          if ( speer != NULL )
          {


Worked without any flaw for 30000 iterations.
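
[Editor's note: restating the two changed lines with a comment, using only
names visible in the hunk; this annotation is not part of the patch.

        /* Only steal work from a peer cpu that is still in this scheduler's
         * online mask, i.e. still in the same cpupool as this cpu; otherwise
         * a vcpu from a foreign pool (e.g. Dom0 in Pool-0) could be pulled
         * onto a cpu that has just been moved to another pool. */
        speer = cpu_isset(peer_cpu, *online) ?
            csched_runq_steal(peer_cpu, cpu, snext->pri) : NULL;
        pcpu_schedule_unlock(peer_cpu);]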


Juergen

>
> After some thousand iterations the machine hung; after dumping Dom0
> registers to the console it continued running and crashed about a second later:
>
> (XEN) cpupool_unassign_cpu(pool=0,cpu=9)
> (XEN) cpupool_unassign_cpu(pool=0,cpu=9) ffff83083fff74c0
> (XEN) cpupool_unassign_cpu ret=0
> (XEN) cpupool_unassign_cpu(pool=0,cpu=4)
> (XEN) cpupool_unassign_cpu(pool=0,cpu=4) ffff83083fff74c0
> (XEN) cpupool_unassign_cpu ret=0
> (XEN) cpupool_assign_cpu(pool=1,cpu=9)
> (XEN) cpupool_assign_cpu(pool=1,cpu=9) ffff83083002de40
> (XEN) Assertion 'timer->status >= TIMER_STATUS_inactive' failed at
> timer.c:279
> (XEN) ----[ Xen-4.1.0-rc5-pre x86_64 debug=y Tainted: C ]----
> (XEN) CPU: 9
> (XEN) RIP: e008:[<ffff82c480126100>] active_timer+0xc/0x37
> (XEN) RFLAGS: 0000000000010046 CONTEXT: hypervisor
> (XEN) rax: 0000000000000000 rbx: 0000000000000000 rcx: 0000000000000000
> (XEN) rdx: ffff830839d8ff18 rsi: 0000010dbb628a80 rdi: ffff83083ffbcf98
> (XEN) rbp: ffff830839d8fd50 rsp: ffff830839d8fd50 r8: ffff83083ffbcf90
> (XEN) r9: ffff82c480213680 r10: 00000000ffffffff r11: 0000000000000010
> (XEN) r12: ffff82c4802d3f80 r13: ffff82c4802d3f80 r14: ffff83083ffbcf98
> (XEN) r15: ffff83083ffbcfc0 cr0: 000000008005003b cr4: 00000000000026f0
> (XEN) cr3: 000000007809c000 cr2: 0000000000620048
> (XEN) ds: 002b es: 002b fs: 0000 gs: 0000 ss: e010 cs: e008
> (XEN) Xen stack trace from rsp=ffff830839d8fd50:
> (XEN) ffff830839d8fda0 ffff82c480126ef9 0000000000000000 0000010dbb628a80
> (XEN) 0000000000000086 0000000000000009 ffff83083002de40 ffff83083002dd50
> (XEN) 0000000000000009 0000000000000009 ffff830839d8fdc0 ffff82c480117906
> (XEN) ffff83083ffa3b40 ffff83083ffa5d70 ffff830839d8fe30 ffff82c4801214fa
> (XEN) ffff83083002dd00 0000000900000100 0000000000000286 ffff8300780da000
> (XEN) ffff83083ffbcf80 ffff83083ffbcf90 ffff82c480247e00 0000000000000009
> (XEN) 00000000fffffff0 ffff83083002dd00 0000000000000000 ffff8300781cc198
> (XEN) ffff830839d8fe60 ffff82c4801019ff 0000000000000009 0000000000000009
> (XEN) ffff8300781cc198 ffff830839d990d0 ffff830839d8fe80 ffff82c480101bd9
> (XEN) ffff83107e80c5b0 ffff8300781cc000 ffff830839d8fea0 ffff82c480104f21
> (XEN) 0000000000000009 ffff830839d990e0 ffff830839d8fee0 ffff82c480125b6c
> (XEN) ffff82c48024a020 ffff830839d8ff18 ffff82c48024a020 ffff830839d8ff18
> (XEN) ffff830839d99060 ffff830839d99040 ffff830839d8ff10 ffff82c48015645a
> (XEN) 0000000000000000 ffff8300780da000 ffff8300780da000 ffffffffffffffff
> (XEN) ffff830839d8fe00 0000000000000000 0000000000000000 0000000000000000
> (XEN) 0000000000000000 ffffffff8062bda0 ffff880fbb1e5fd8 0000000000000246
> (XEN) 0000000000000000 000000010003347d 0000000000000000 0000000000000000
> (XEN) ffffffff800033aa 00000000deadbeef 00000000deadbeef 00000000deadbeef
> (XEN) 0000010000000000 ffffffff800033aa 000000000000e033 0000000000000246
> (XEN) ffff880fbb1e5f08 000000000000e02b 0000000000000000 0000000000000000
> (XEN) Xen call trace:
> (XEN) [<ffff82c480126100>] active_timer+0xc/0x37
> (XEN) [<ffff82c480126ef9>] set_timer+0x102/0x218
> (XEN) [<ffff82c480117906>] csched_tick_resume+0x53/0x75
> (XEN) [<ffff82c4801214fa>] schedule_cpu_switch+0x1f1/0x25c
> (XEN) [<ffff82c4801019ff>] cpupool_assign_cpu_locked+0x61/0xd6
> (XEN) [<ffff82c480101bd9>] cpupool_assign_cpu_helper+0x9f/0xcd
> (XEN) [<ffff82c480104f21>] continue_hypercall_tasklet_handler+0x51/0xc3
> (XEN) [<ffff82c480125b6c>] do_tasklet+0xe1/0x155
> (XEN) [<ffff82c48015645a>] idle_loop+0x5f/0x67
> (XEN)
> (XEN)
> (XEN) ****************************************
> (XEN) Panic on CPU 9:
> (XEN) Assertion 'timer->status >= TIMER_STATUS_inactive' failed at
> timer.c:279
> (XEN) ****************************************
>
>
> Juergen
>


-- 
Juergen Gross                 Principal Developer Operating Systems
TSP ES&S SWE OS6                       Telephone: +49 (0) 89 3222 2967
Fujitsu Technology Solutions              e-mail: juergen.gross@ts.fujitsu.com
Domagkstr. 28                           Internet: ts.fujitsu.com
D-80807 Muenchen                 Company details: ts.fujitsu.com/imprint.html

^ permalink raw reply	[flat|nested] 53+ messages in thread

* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-02-16 13:54                                                                   ` George Dunlap
                                                                                       ` (2 preceding siblings ...)
  2011-02-17  7:05                                                                     ` Juergen Gross
@ 2011-02-21 10:00                                                                     ` Andre Przywara
  2011-02-21 13:19                                                                       ` Juergen Gross
  3 siblings, 1 reply; 53+ messages in thread
From: Andre Przywara @ 2011-02-21 10:00 UTC (permalink / raw)
  To: George Dunlap; +Cc: Juergen Gross, xen-devel, Diestelhorst, Stephan

[-- Attachment #1: Type: text/plain, Size: 12198 bytes --]

George Dunlap wrote:
> Andre (and Juergen), can you try again with the attached patch?

I applied this patch on top of 22931 and it did _not_ work.
The crash occurred almost immediately after I started my script, so it is the
same behaviour as without the patch.
(attached my script for reference, though it will most likely only make 
sense on bigger NUMA machines)

Regards,
Andre.


> What the patch basically does is try to make "cpu_disable_scheduler()"
> do what it seems to say it does. :-)  Namely, the various
> scheduler-related interrupts (both per-cpu ticks and the master tick)
> are part of the scheduler, so disable them before doing anything, and
> don't enable them until the cpu is really ready to go again.
> 
> To be precise:
> * cpu_disable_scheduler() disables ticks
> * scheduler_cpu_switch() only enables ticks if adding a cpu to a pool,
> and does it after inserting the idle vcpu
> * Modify semantics, s.t., {alloc,free}_pdata() don't actually start or
> stop tickers
>  + Call tick_{resume,suspend} in cpu_{up,down}, respectively
> * Modify credit1's tick_{suspend,resume} to handle the master ticker as well.
> 
> With this patch (if dom0 doesn't get wedged due to all 8 vcpus being
> on one pcpu), I can perform thousands of operations successfully.
> 
> (NB this is not ready for application yet, I just wanted to check to
> see if it fixes Andre's problem)
> 
>  -George
> 
> On Wed, Feb 16, 2011 at 9:47 AM, Juergen Gross
> <juergen.gross@ts.fujitsu.com> wrote:
>> Okay, I have some more data.
>>
>> I activated cpupool_dprintk() and included checks in sched_credit.c to
>> test for weight inconsistencies. To reduce race possibilities I've added
>> my patch to execute cpu assigning/unassigning always in a tasklet on the
>> cpu to be moved.
>>
>> Here is the result:
>>
>> (XEN) cpupool_unassign_cpu(pool=0,cpu=6)
>> (XEN) cpupool_unassign_cpu(pool=0,cpu=6) ret -16
>> (XEN) cpupool_unassign_cpu(pool=0,cpu=6)
>> (XEN) cpupool_unassign_cpu(pool=0,cpu=6) ret -16
>> (XEN) cpupool_assign_cpu(pool=0,cpu=1)
>> (XEN) cpupool_assign_cpu(pool=0,cpu=1) ffff83083fff74c0
>> (XEN) cpupool_assign_cpu(cpu=1) ret 0
>> (XEN) cpupool_assign_cpu(pool=1,cpu=4)
>> (XEN) cpupool_assign_cpu(pool=1,cpu=4) ffff831002ad5e40
>> (XEN) cpupool_assign_cpu(cpu=4) ret 0
>> (XEN) cpu 4, weight 0,prv ffff831002ad5e40, dom 0:
>> (XEN) sdom->weight: 256, sdom->active_vcpu_count: 1
>> (XEN) Xen BUG at sched_credit.c:570
>> (XEN) ----[ Xen-4.1.0-rc5-pre  x86_64  debug=y  Tainted:    C ]----
>> (XEN) CPU:    4
>> (XEN) RIP:    e008:[<ffff82c4801197d7>] csched_tick+0x186/0x37f
>> (XEN) RFLAGS: 0000000000010086   CONTEXT: hypervisor
>> (XEN) rax: 0000000000000000   rbx: ffff830839d3ec30   rcx: 0000000000000000
>> (XEN) rdx: ffff830839dcff18   rsi: 000000000000000a   rdi: ffff82c4802542e8
>> (XEN) rbp: ffff830839dcfe38   rsp: ffff830839dcfde8   r8:  0000000000000004
>> (XEN) r9:  ffff82c480213520   r10: 00000000fffffffc   r11: 0000000000000001
>> (XEN) r12: 0000000000000004   r13: ffff830839d3ec40   r14: ffff831002ad5e40
>> (XEN) r15: ffff830839d66f90   cr0: 000000008005003b   cr4: 00000000000026f0
>> (XEN) cr3: 0000001020a98000   cr2: 00007fc5e9b79d98
>> (XEN) ds: 0000   es: 0000   fs: 0000   gs: 0000   ss: e010   cs: e008
>> (XEN) Xen stack trace from rsp=ffff830839dcfde8:
>> (XEN)    ffff83083ffa3ba0 ffff831002ad5e40 0000000000000246 ffff830839d6c000
>> (XEN)    0000000000000000 ffff830839dd1100 0000000000000004 ffff82c480119651
>> (XEN)    ffff831002b28018 ffff831002b28010 ffff830839dcfe68 ffff82c480126204
>> (XEN)    0000000000000002 ffff83083ffa3bb8 ffff830839dd1100 000000cae439ea7e
>> (XEN)    ffff830839dcfeb8 ffff82c480126539 00007fc5e9fa5b20 ffff830839dd1100
>> (XEN)    ffff831002b28010 0000000000000004 0000000000000004 ffff82c4802b0880
>> (XEN)    ffff830839dcff18 ffffffffffffffff ffff830839dcfef8 ffff82c480123647
>> (XEN)    ffff830839dcfed8 ffff830077eee000 00007fc5e9b79d98 00007fc5e9fa5b20
>> (XEN)    0000000000000002 00007fff46826f20 ffff830839dcff08 ffff82c4801236c2
>> (XEN)    00007cf7c62300c7 ffff82c480206ad6 00007fff46826f20 0000000000000002
>> (XEN)    00007fc5e9fa5b20 00007fc5e9b79d98 00007fff46827260 00007fff46826f50
>> (XEN)    0000000000000246 0000000000000032 0000000000000000 00000000ffffffff
>> (XEN)    0000000000000009 00007fc5e9d9de1a 0000000000000003 0000000000004848
>> (XEN)    00007fc5e9b7a000 0000010000000000 ffffffff800073f0 000000000000e033
>> (XEN)    0000000000000246 ffff880f97b51fc8 000000000000e02b 0000000000000000
>> (XEN)    0000000000000000 0000000000000000 0000000000000000 0000000000000004
>> (XEN)    ffff830077eee000 00000043b9afd180 0000000000000000
>> (XEN) Xen call trace:
>> (XEN)    [<ffff82c4801197d7>] csched_tick+0x186/0x37f
>> (XEN)    [<ffff82c480126204>] execute_timer+0x4e/0x6c
>> (XEN)    [<ffff82c480126539>] timer_softirq_action+0xf6/0x239
>> (XEN)    [<ffff82c480123647>] __do_softirq+0x88/0x99
>> (XEN)    [<ffff82c4801236c2>] do_softirq+0x6a/0x7a
>> (XEN)
>> (XEN)
>> (XEN) ****************************************
>> (XEN) Panic on CPU 4:
>> (XEN) Xen BUG at sched_credit.c:570
>> (XEN) ****************************************
>>
>> As you can see, a Dom0 vcpu is becoming active on a pool 1 cpu. The BUG_ON
>> triggered in csched_acct() is a logical result of this.
>>
>> How this can happen I don't know yet.
>> Anyone any idea? I'll keep searching...
>>
>>
>> Juergen
>>
>> On 02/15/11 08:22, Juergen Gross wrote:
>>> On 02/14/11 18:57, George Dunlap wrote:
>>>> The good news is, I've managed to reproduce this on my local test
>>>> hardware with 1x4x2 (1 socket, 4 cores, 2 threads per core) using the
>>>> attached script. It's time to go home now, but I should be able to
>>>> dig something up tomorrow.
>>>>
>>>> To use the script:
>>>> * Rename cpupool0 to "p0", and create an empty second pool, "p1"
>>>> * You can modify elements by adding "arg=val" as arguments.
>>>> * Arguments are:
>>>> + dryrun={true,false} Do the work, but don't actually execute any xl
>>>> arguments. Default false.
>>>> + left: Number of commands to execute. Default 10.
>>>> + maxcpus: highest numerical value for a cpu. Default 7 (i.e., 0-7 is
>>>> 8 cpus).
>>>> + verbose={true,false} Print what you're doing. Default is true.
>>>>
>>>> The script sometimes attempts to remove the last cpu from cpupool0; in
>>>> this case, libxl will print an error. If the script gets an error
>>>> under that condition, it will ignore it; under any other condition, it
>>>> will print diagnostic information.
>>>>
>>>> What finally crashed it for me was this command:
>>>> # ./cpupool-test.sh verbose=false left=1000
>>> Nice!
>>> With your script I finally managed to get the error, too. On my box (2
>>> sockets with 6 cores each) I had to use
>>>
>>> ./cpupool-test.sh verbose=false left=10000 maxcpus=11
>>>
>>> to trigger it.
>>> Looking for more data now...
>>>
>>>
>>> Juergen
>>>
>>>> -George
>>>>
>>>> On Fri, Feb 11, 2011 at 7:39 AM, Andre
>>>> Przywara<andre.przywara@amd.com> wrote:
>>>>> Juergen Gross wrote:
>>>>>> On 02/10/11 15:18, Andre Przywara wrote:
>>>>>>> Andre Przywara wrote:
>>>>>>>> On 02/10/2011 07:42 AM, Juergen Gross wrote:
>>>>>>>>> On 02/09/11 15:21, Juergen Gross wrote:
>>>>>>>>>> Andre, George,
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>> What seems to be interesting: I think the problem did always occur
>>>>>>>>>> when
>>>>>>>>>> a new cpupool was created and the first cpu was moved to it.
>>>>>>>>>>
>>>>>>>>>> I think my previous assumption regarding the master_ticker was not
>>>>>>>>>> too bad.
>>>>>>>>>> I think somehow the master_ticker of the new cpupool is becoming
>>>>>>>>>> active
>>>>>>>>>> before the scheduler is really initialized properly. This could
>>>>>>>>>> happen, if
>>>>>>>>>> enough time is spent between alloc_pdata for the cpu to be moved
>>>>>>>>>> and
>>>>>>>>>> the
>>>>>>>>>> critical section in schedule_cpu_switch().
>>>>>>>>>>
>>>>>>>>>> The solution should be to activate the timers only if the
>>>>>>>>>> scheduler is
>>>>>>>>>> ready for them.
>>>>>>>>>>
>>>>>>>>>> George, do you think the master_ticker should be stopped in
>>>>>>>>>> suspend_ticker
>>>>>>>>>> as well? I still see potential problems for entering deep C-States.
>>>>>>>>>> I think
>>>>>>>>>> I'll prepare a patch which will keep the master_ticker active
>>>>>>>>>> for the
>>>>>>>>>> C-State case and migrate it for the schedule_cpu_switch() case.
>>>>>>>>> Okay, here is a patch for this. It ran on my 4-core machine
>>>>>>>>> without any
>>>>>>>>> problems.
>>>>>>>>> Andre, could you give it a try?
>>>>>>>> Did, but unfortunately it crashed as always. Tried twice and made
>>>>>>>> sure
>>>>>>>> I booted the right kernel. Sorry.
>>>>>>>> The idea with the race between the timer and the state changing
>>>>>>>> sounded very appealing, actually that was suspicious to me from the
>>>>>>>> beginning.
>>>>>>>>
>>>>>>>> I will add some code to dump the state of all cpupools to the BUG_ON
>>>>>>>> to see in which situation we are when the bug triggers.
>>>>>>> OK, here is a first try of this, the patch iterates over all CPU pools
>>>>>>> and outputs some data if the BUG_ON
>>>>>>> ((sdom->weight * sdom->active_vcpu_count)> weight_left) condition
>>>>>>> triggers:
>>>>>>> (XEN) CPU pool #0: 1 domains (SMP Credit Scheduler), mask:
>>>>>>> fffffffc003f
>>>>>>> (XEN) CPU pool #1: 0 domains (SMP Credit Scheduler), mask: fc0
>>>>>>> (XEN) CPU pool #2: 0 domains (SMP Credit Scheduler), mask: 1000
>>>>>>> (XEN) Xen BUG at sched_credit.c:1010
>>>>>>> ....
>>>>>>> The masks look proper (6 cores per node), the bug triggers when the
>>>>>>> first CPU is about to be(?) inserted.
>>>>>> Sure? I'm missing the cpu with mask 2000.
>>>>>> I'll try to reproduce the problem on a larger machine here (24 cores, 4
>>>>>> numa
>>>>>> nodes).
>>>>>> Andre, can you give me your xen boot parameters? Which xen changeset
>>>>>> are
>>>>>> you
>>>>>> running, and do you have any additional patches in use?
>>>>> The grub lines:
>>>>> kernel (hd1,0)/boot/xen-22858_debug_04.gz console=com1,vga com1=115200
>>>>> module (hd1,0)/boot/vmlinuz-2.6.32.27_pvops console=tty0
>>>>> console=ttyS0,115200 ro root=/dev/sdb1 xencons=hvc0
>>>>>
>>>>> All of my experiments use c/s 22858 as a base.
>>>>> If you use an AMD Magny-Cours box for your experiments (socket C32 or
>>>>> G34),
>>>>> you should add the following patch (removing the line)
>>>>> --- a/xen/arch/x86/traps.c
>>>>> +++ b/xen/arch/x86/traps.c
>>>>> @@ -803,7 +803,6 @@ static void pv_cpuid(struct cpu_user_regs *regs)
>>>>> __clear_bit(X86_FEATURE_SKINIT % 32,&c);
>>>>> __clear_bit(X86_FEATURE_WDT % 32,&c);
>>>>> __clear_bit(X86_FEATURE_LWP % 32,&c);
>>>>> - __clear_bit(X86_FEATURE_NODEID_MSR % 32,&c);
>>>>> __clear_bit(X86_FEATURE_TOPOEXT % 32,&c);
>>>>> break;
>>>>> case 5: /* MONITOR/MWAIT */
>>>>>
>>>>> This is not necessary (in fact that reverts my patch c/s 22815), but
>>>>> raises
>>>>> the probability of triggering the bug, probably because it increases the
>>>>> pressure on the Dom0 scheduler. If you cannot trigger it with Dom0,
>>>>> try to
>>>>> create a guest with many VCPUs and squeeze it into a small CPU-pool.
>>>>>
>>>>> Good luck ;-)
>>>>> Andre.
>>>>>
>>>>> --
>>>>> Andre Przywara
>>>>> AMD-OSRC (Dresden)
>>>>> Tel: x29712
>>>>>
>>>>>
>>>>> _______________________________________________
>>>>> Xen-devel mailing list
>>>>> Xen-devel@lists.xensource.com
>>>>> http://lists.xensource.com/xen-devel
>>>>>
>>>
>>
>> --
>> Juergen Gross                 Principal Developer Operating Systems
>> TSP ES&S SWE OS6                       Telephone: +49 (0) 89 3222 2967
>> Fujitsu Technology Solutions              e-mail:
>> juergen.gross@ts.fujitsu.com
>> Domagkstr. 28                           Internet: ts.fujitsu.com
>> D-80807 Muenchen                 Company details:
>> ts.fujitsu.com/imprint.html
>>
>> _______________________________________________
>> Xen-devel mailing list
>> Xen-devel@lists.xensource.com
>> http://lists.xensource.com/xen-devel
>>


-- 
Andre Przywara
AMD-Operating System Research Center (OSRC), Dresden, Germany

[-- Attachment #2: numasplit.sh --]
[-- Type: text/plain, Size: 1778 bytes --]

#!/bin/sh
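# Split the machine into one cpupool per NUMA node ("create", or "create2"
# which removes all other nodes' CPUs from Pool-0 first), undo the split
# ("revert"), or move every CPU except CPU 0 out of / back into Pool-0
# ("remove" / "add").  Expects a cpupool config file named cpupool.test
# whose pool name matches ${NUMAPREFIX}<n>; uses ./ldxl if present,
# plain xl otherwise.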

XL=./ldxl
ROOTPOOL=Pool-0
NUMAPREFIX=Pool-node
numnodes=`xl info | sed -e 's/^nr_nodes *: \([0-9]*\)/\1/;t;d'`
numcores=`xl info | sed -e 's/^nr_cpus *: \([0-9]*\)/\1/;t;d'`

if [ ! -x ${XL} ]
then
	XL=xl
fi

if [ $# -gt 0 ];
then
	action=$1
else
	action=create
fi


if [ "$action" = "create" ]
then
	$XL cpupool-rename $ROOTPOOL ${NUMAPREFIX}0
	for i in `seq 1 $((numnodes-1))`
	do
		echo "Removing CPUs from Pool 0"
		$XL cpupool-cpu-remove ${NUMAPREFIX}0 node:$i
		echo "Rewriting config file"
		sed -i -e "s/${NUMAPREFIX}./${NUMAPREFIX}${i}/" cpupool.test
		echo "Creating new pool"
		$XL cpupool-create cpupool.test
		echo "Populating new pool"
		$XL cpupool-cpu-add ${NUMAPREFIX}${i} node:$i
	done
elif [ "$action" = "create2" ]
then
	$XL cpupool-rename $ROOTPOOL ${NUMAPREFIX}0
	echo "Removing CPUs from Pool 0"
	for i in `seq 1 $((numnodes-1))`
	do
		$XL cpupool-cpu-remove ${NUMAPREFIX}0 node:$i
	done
	for i in `seq 1 $((numnodes-1))`
	do
		echo "Rewriting config file"
		sed -i -e "s/${NUMAPREFIX}./${NUMAPREFIX}${i}/" cpupool.test
		echo "Creating new pool"
		$XL cpupool-create cpupool.test
		echo "Populating new pool"
		$XL cpupool-cpu-add ${NUMAPREFIX}${i} node:$i
	done
elif [ "$action" = "revert" ]
then
	for i in `seq 1 $((numnodes-1))`
	do
		echo "Destroying Pool $i"
		$XL cpupool-destroy ${NUMAPREFIX}${i}
		echo "adding freed CPUs to pool 0"
		$XL cpupool-cpu-add ${NUMAPREFIX}0 node:$i
	done
	$XL cpupool-rename ${NUMAPREFIX}0 $ROOTPOOL
elif [ "$action" = "remove" ]
then
	for i in `seq 1 $((numcores-1))`
	do
		echo "Removing CPU $i from Pool-0"
		$XL cpupool-cpu-remove $ROOTPOOL $i
	done
elif [ "$action" = "add" ]
then
	for i in `seq 1 $((numcores-1))`
	do
		echo "Removing CPU $i from Pool-0"
		$XL cpupool-cpu-add $ROOTPOOL $i
	done
fi


[-- Attachment #3: Type: text/plain, Size: 138 bytes --]

_______________________________________________
Xen-devel mailing list
Xen-devel@lists.xensource.com
http://lists.xensource.com/xen-devel

^ permalink raw reply	[flat|nested] 53+ messages in thread

* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-02-21 10:00                                                                     ` Andre Przywara
@ 2011-02-21 13:19                                                                       ` Juergen Gross
  2011-02-21 14:45                                                                         ` Andre Przywara
  0 siblings, 1 reply; 53+ messages in thread
From: Juergen Gross @ 2011-02-21 13:19 UTC (permalink / raw)
  To: Andre Przywara; +Cc: George Dunlap, xen-devel, Diestelhorst, Stephan

[-- Attachment #1: Type: text/plain, Size: 13351 bytes --]

On 02/21/11 11:00, Andre Przywara wrote:
> George Dunlap wrote:
>> Andre (and Juergen), can you try again with the attached patch?
>
> I applied this patch on top of 22931 and it did _not_ work.
> The crash occurred almost immediately after I started my script, so it is the
> same behaviour as without the patch.

Did you try my patch addressing races in the scheduler when moving cpus
between cpupools?
I've attached it again. For me it works quite well, while George's patch
seems not to be enough (machine hanging after some tests with cpupools).
OTOH I can't reproduce an error as fast as you even without any patch :-)

> (attached my script for reference, though it will most likely only make
> sense on bigger NUMA machines)

Yeah, on my 2-node system I need several hundred tries to get an error.
But it seems to be more effective than George's script.


Juergen

>
> Regards,
> Andre.
>
>
>> What the patch basically does is try to make "cpu_disable_scheduler()"
>> do what it seems to say it does. :-) Namely, the various
>> scheduler-related interrupts (both per-cpu ticks and the master tick)
>> are part of the scheduler, so disable them before doing anything, and
>> don't enable them until the cpu is really ready to go again.
>>
>> To be precise:
>> * cpu_disable_scheduler() disables ticks
>> * scheduler_cpu_switch() only enables ticks if adding a cpu to a pool,
>> and does it after inserting the idle vcpu
>> * Modify semantics, s.t., {alloc,free}_pdata() don't actually start or
>> stop tickers
>> + Call tick_{resume,suspend} in cpu_{up,down}, respectively
>> * Modify credit1's tick_{suspend,resume} to handle the master ticker
>> as well.
>>
>> With this patch (if dom0 doesn't get wedged due to all 8 vcpus being
>> on one pcpu), I can perform thousands of operations successfully.
>>
>> (NB this is not ready for application yet, I just wanted to check to
>> see if it fixes Andre's problem)
>>
>> -George
>>
>> On Wed, Feb 16, 2011 at 9:47 AM, Juergen Gross
>> <juergen.gross@ts.fujitsu.com> wrote:
>>> Okay, I have some more data.
>>>
>>> I activated cpupool_dprintk() and included checks in sched_credit.c to
>>> test for weight inconsistencies. To reduce race possibilities I've added
>>> my patch to execute cpu assigning/unassigning always in a tasklet on the
>>> cpu to be moved.
>>>
>>> Here is the result:
>>>
>>> (XEN) cpupool_unassign_cpu(pool=0,cpu=6)
>>> (XEN) cpupool_unassign_cpu(pool=0,cpu=6) ret -16
>>> (XEN) cpupool_unassign_cpu(pool=0,cpu=6)
>>> (XEN) cpupool_unassign_cpu(pool=0,cpu=6) ret -16
>>> (XEN) cpupool_assign_cpu(pool=0,cpu=1)
>>> (XEN) cpupool_assign_cpu(pool=0,cpu=1) ffff83083fff74c0
>>> (XEN) cpupool_assign_cpu(cpu=1) ret 0
>>> (XEN) cpupool_assign_cpu(pool=1,cpu=4)
>>> (XEN) cpupool_assign_cpu(pool=1,cpu=4) ffff831002ad5e40
>>> (XEN) cpupool_assign_cpu(cpu=4) ret 0
>>> (XEN) cpu 4, weight 0,prv ffff831002ad5e40, dom 0:
>>> (XEN) sdom->weight: 256, sdom->active_vcpu_count: 1
>>> (XEN) Xen BUG at sched_credit.c:570
>>> (XEN) ----[ Xen-4.1.0-rc5-pre x86_64 debug=y Tainted: C ]----
>>> (XEN) CPU: 4
>>> (XEN) RIP: e008:[<ffff82c4801197d7>] csched_tick+0x186/0x37f
>>> (XEN) RFLAGS: 0000000000010086 CONTEXT: hypervisor
>>> (XEN) rax: 0000000000000000 rbx: ffff830839d3ec30 rcx: 0000000000000000
>>> (XEN) rdx: ffff830839dcff18 rsi: 000000000000000a rdi: ffff82c4802542e8
>>> (XEN) rbp: ffff830839dcfe38 rsp: ffff830839dcfde8 r8: 0000000000000004
>>> (XEN) r9: ffff82c480213520 r10: 00000000fffffffc r11: 0000000000000001
>>> (XEN) r12: 0000000000000004 r13: ffff830839d3ec40 r14: ffff831002ad5e40
>>> (XEN) r15: ffff830839d66f90 cr0: 000000008005003b cr4: 00000000000026f0
>>> (XEN) cr3: 0000001020a98000 cr2: 00007fc5e9b79d98
>>> (XEN) ds: 0000 es: 0000 fs: 0000 gs: 0000 ss: e010 cs: e008
>>> (XEN) Xen stack trace from rsp=ffff830839dcfde8:
>>> (XEN) ffff83083ffa3ba0 ffff831002ad5e40 0000000000000246
>>> ffff830839d6c000
>>> (XEN) 0000000000000000 ffff830839dd1100 0000000000000004
>>> ffff82c480119651
>>> (XEN) ffff831002b28018 ffff831002b28010 ffff830839dcfe68
>>> ffff82c480126204
>>> (XEN) 0000000000000002 ffff83083ffa3bb8 ffff830839dd1100
>>> 000000cae439ea7e
>>> (XEN) ffff830839dcfeb8 ffff82c480126539 00007fc5e9fa5b20
>>> ffff830839dd1100
>>> (XEN) ffff831002b28010 0000000000000004 0000000000000004
>>> ffff82c4802b0880
>>> (XEN) ffff830839dcff18 ffffffffffffffff ffff830839dcfef8
>>> ffff82c480123647
>>> (XEN) ffff830839dcfed8 ffff830077eee000 00007fc5e9b79d98
>>> 00007fc5e9fa5b20
>>> (XEN) 0000000000000002 00007fff46826f20 ffff830839dcff08
>>> ffff82c4801236c2
>>> (XEN) 00007cf7c62300c7 ffff82c480206ad6 00007fff46826f20
>>> 0000000000000002
>>> (XEN) 00007fc5e9fa5b20 00007fc5e9b79d98 00007fff46827260
>>> 00007fff46826f50
>>> (XEN) 0000000000000246 0000000000000032 0000000000000000
>>> 00000000ffffffff
>>> (XEN) 0000000000000009 00007fc5e9d9de1a 0000000000000003
>>> 0000000000004848
>>> (XEN) 00007fc5e9b7a000 0000010000000000 ffffffff800073f0
>>> 000000000000e033
>>> (XEN) 0000000000000246 ffff880f97b51fc8 000000000000e02b
>>> 0000000000000000
>>> (XEN) 0000000000000000 0000000000000000 0000000000000000
>>> 0000000000000004
>>> (XEN) ffff830077eee000 00000043b9afd180 0000000000000000
>>> (XEN) Xen call trace:
>>> (XEN) [<ffff82c4801197d7>] csched_tick+0x186/0x37f
>>> (XEN) [<ffff82c480126204>] execute_timer+0x4e/0x6c
>>> (XEN) [<ffff82c480126539>] timer_softirq_action+0xf6/0x239
>>> (XEN) [<ffff82c480123647>] __do_softirq+0x88/0x99
>>> (XEN) [<ffff82c4801236c2>] do_softirq+0x6a/0x7a
>>> (XEN)
>>> (XEN)
>>> (XEN) ****************************************
>>> (XEN) Panic on CPU 4:
>>> (XEN) Xen BUG at sched_credit.c:570
>>> (XEN) ****************************************
>>>
>>> As you can see, a Dom0 vcpu is becoming active on a pool 1 cpu. The
>>> BUG_ON
>>> triggered in csched_acct() is a logical result of this.
>>>
>>> How this can happen I don't know yet.
>>> Anyone any idea? I'll keep searching...
>>>
>>>
>>> Juergen
>>>
>>> On 02/15/11 08:22, Juergen Gross wrote:
>>>> On 02/14/11 18:57, George Dunlap wrote:
>>>>> The good news is, I've managed to reproduce this on my local test
>>>>> hardware with 1x4x2 (1 socket, 4 cores, 2 threads per core) using the
>>>>> attached script. It's time to go home now, but I should be able to
>>>>> dig something up tomorrow.
>>>>>
>>>>> To use the script:
>>>>> * Rename cpupool0 to "p0", and create an empty second pool, "p1"
>>>>> * You can modify elements by adding "arg=val" as arguments.
>>>>> * Arguments are:
>>>>> + dryrun={true,false} Do the work, but don't actually execute any xl
>>>>> arguments. Default false.
>>>>> + left: Number of commands to execute. Default 10.
>>>>> + maxcpus: highest numerical value for a cpu. Default 7 (i.e., 0-7 is
>>>>> 8 cpus).
>>>>> + verbose={true,false} Print what you're doing. Default is true.
>>>>>
>>>>> The script sometimes attempts to remove the last cpu from cpupool0; in
>>>>> this case, libxl will print an error. If the script gets an error
>>>>> under that condition, it will ignore it; under any other condition, it
>>>>> will print diagnostic information.
>>>>>
>>>>> What finally crashed it for me was this command:
>>>>> # ./cpupool-test.sh verbose=false left=1000
>>>> Nice!
>>>> With your script I finally managed to get the error, too. On my box (2
>>>> sockets with 6 cores each) I had to use
>>>>
>>>> ./cpupool-test.sh verbose=false left=10000 maxcpus=11
>>>>
>>>> to trigger it.
>>>> Looking for more data now...
>>>>
>>>>
>>>> Juergen
>>>>
>>>>> -George
>>>>>
>>>>> On Fri, Feb 11, 2011 at 7:39 AM, Andre
>>>>> Przywara<andre.przywara@amd.com> wrote:
>>>>>> Juergen Gross wrote:
>>>>>>> On 02/10/11 15:18, Andre Przywara wrote:
>>>>>>>> Andre Przywara wrote:
>>>>>>>>> On 02/10/2011 07:42 AM, Juergen Gross wrote:
>>>>>>>>>> On 02/09/11 15:21, Juergen Gross wrote:
>>>>>>>>>>> Andre, George,
>>>>>>>>>>>
>>>>>>>>>>>
>>>>>>>>>>> What seems to be interesting: I think the problem did always
>>>>>>>>>>> occur
>>>>>>>>>>> when
>>>>>>>>>>> a new cpupool was created and the first cpu was moved to it.
>>>>>>>>>>>
>>>>>>>>>>> I think my previous assumption regarding the master_ticker
>>>>>>>>>>> was not
>>>>>>>>>>> too bad.
>>>>>>>>>>> I think somehow the master_ticker of the new cpupool is becoming
>>>>>>>>>>> active
>>>>>>>>>>> before the scheduler is really initialized properly. This could
>>>>>>>>>>> happen, if
>>>>>>>>>>> enough time is spent between alloc_pdata for the cpu to be moved
>>>>>>>>>>> and
>>>>>>>>>>> the
>>>>>>>>>>> critical section in schedule_cpu_switch().
>>>>>>>>>>>
>>>>>>>>>>> The solution should be to activate the timers only if the
>>>>>>>>>>> scheduler is
>>>>>>>>>>> ready for them.
>>>>>>>>>>>
>>>>>>>>>>> George, do you think the master_ticker should be stopped in
>>>>>>>>>>> suspend_ticker
>>>>>>>>>>> as well? I still see potential problems for entering deep
>>>>>>>>>>> C-States.
>>>>>>>>>>> I think
>>>>>>>>>>> I'll prepare a patch which will keep the master_ticker active
>>>>>>>>>>> for the
>>>>>>>>>>> C-State case and migrate it for the schedule_cpu_switch() case.
>>>>>>>>>> Okay, here is a patch for this. It ran on my 4-core machine
>>>>>>>>>> without any
>>>>>>>>>> problems.
>>>>>>>>>> Andre, could you give it a try?
>>>>>>>>> Did, but unfortunately it crashed as always. Tried twice and made
>>>>>>>>> sure
>>>>>>>>> I booted the right kernel. Sorry.
>>>>>>>>> The idea with the race between the timer and the state changing
>>>>>>>>> sounded very appealing, actually that was suspicious to me from
>>>>>>>>> the
>>>>>>>>> beginning.
>>>>>>>>>
>>>>>>>>> I will add some code to dump the state of all cpupools to the
>>>>>>>>> BUG_ON
>>>>>>>>> to see in which situation we are when the bug triggers.
>>>>>>>> OK, here is a first try of this, the patch iterates over all CPU
>>>>>>>> pools
>>>>>>>> and outputs some data if the BUG_ON
>>>>>>>> ((sdom->weight * sdom->active_vcpu_count)> weight_left) condition
>>>>>>>> triggers:
>>>>>>>> (XEN) CPU pool #0: 1 domains (SMP Credit Scheduler), mask:
>>>>>>>> fffffffc003f
>>>>>>>> (XEN) CPU pool #1: 0 domains (SMP Credit Scheduler), mask: fc0
>>>>>>>> (XEN) CPU pool #2: 0 domains (SMP Credit Scheduler), mask: 1000
>>>>>>>> (XEN) Xen BUG at sched_credit.c:1010
>>>>>>>> ....
>>>>>>>> The masks look proper (6 cores per node), the bug triggers when the
>>>>>>>> first CPU is about to be(?) inserted.
>>>>>>> Sure? I'm missing the cpu with mask 2000.
>>>>>>> I'll try to reproduce the problem on a larger machine here (24
>>>>>>> cores, 4
>>>>>>> numa
>>>>>>> nodes).
>>>>>>> Andre, can you give me your xen boot parameters? Which xen changeset
>>>>>>> are
>>>>>>> you
>>>>>>> running, and do you have any additional patches in use?
>>>>>> The grub lines:
>>>>>> kernel (hd1,0)/boot/xen-22858_debug_04.gz console=com1,vga
>>>>>> com1=115200
>>>>>> module (hd1,0)/boot/vmlinuz-2.6.32.27_pvops console=tty0
>>>>>> console=ttyS0,115200 ro root=/dev/sdb1 xencons=hvc0
>>>>>>
>>>>>> All of my experiments use c/s 22858 as a base.
>>>>>> If you use an AMD Magny-Cours box for your experiments (socket C32 or
>>>>>> G34),
>>>>>> you should add the following patch (removing the line)
>>>>>> --- a/xen/arch/x86/traps.c
>>>>>> +++ b/xen/arch/x86/traps.c
>>>>>> @@ -803,7 +803,6 @@ static void pv_cpuid(struct cpu_user_regs *regs)
>>>>>> __clear_bit(X86_FEATURE_SKINIT % 32,&c);
>>>>>> __clear_bit(X86_FEATURE_WDT % 32,&c);
>>>>>> __clear_bit(X86_FEATURE_LWP % 32,&c);
>>>>>> - __clear_bit(X86_FEATURE_NODEID_MSR % 32,&c);
>>>>>> __clear_bit(X86_FEATURE_TOPOEXT % 32,&c);
>>>>>> break;
>>>>>> case 5: /* MONITOR/MWAIT */
>>>>>>
>>>>>> This is not necessary (in fact that reverts my patch c/s 22815), but
>>>>>> raises
>>>>>> the probability of triggering the bug, probably because it increases the
>>>>>> pressure on the Dom0 scheduler. If you cannot trigger it with Dom0,
>>>>>> try to
>>>>>> create a guest with many VCPUs and squeeze it into a small CPU-pool.
>>>>>>
>>>>>> Good luck ;-)
>>>>>> Andre.
>>>>>>
>>>>>> --
>>>>>> Andre Przywara
>>>>>> AMD-OSRC (Dresden)
>>>>>> Tel: x29712
>>>>>>
>>>>>>
>>>>>> _______________________________________________
>>>>>> Xen-devel mailing list
>>>>>> Xen-devel@lists.xensource.com
>>>>>> http://lists.xensource.com/xen-devel
>>>>>>
>>>>
>>>
>>> --
>>> Juergen Gross Principal Developer Operating Systems
>>> TSP ES&S SWE OS6 Telephone: +49 (0) 89 3222 2967
>>> Fujitsu Technology Solutions e-mail:
>>> juergen.gross@ts.fujitsu.com
>>> Domagkstr. 28 Internet: ts.fujitsu.com
>>> D-80807 Muenchen Company details:
>>> ts.fujitsu.com/imprint.html
>>>
>>> _______________________________________________
>>> Xen-devel mailing list
>>> Xen-devel@lists.xensource.com
>>> http://lists.xensource.com/xen-devel
>>>
>
>
>
>
> _______________________________________________
> Xen-devel mailing list
> Xen-devel@lists.xensource.com
> http://lists.xensource.com/xen-devel


-- 
Juergen Gross                 Principal Developer Operating Systems
TSP ES&S SWE OS6                       Telephone: +49 (0) 89 3222 2967
Fujitsu Technology Solutions              e-mail: juergen.gross@ts.fujitsu.com
Domagkstr. 28                           Internet: ts.fujitsu.com
D-80807 Muenchen                 Company details: ts.fujitsu.com/imprint.html

[-- Attachment #2: cpupool-race.patch --]
[-- Type: text/x-patch, Size: 2617 bytes --]

diff -r 72470de157ce xen/common/sched_credit.c
--- a/xen/common/sched_credit.c	Wed Feb 16 09:49:33 2011 +0000
+++ b/xen/common/sched_credit.c	Wed Feb 16 15:09:54 2011 +0100
@@ -1268,7 +1268,8 @@ csched_load_balance(struct csched_privat
         /*
          * Any work over there to steal?
          */
-        speer = csched_runq_steal(peer_cpu, cpu, snext->pri);
+        speer = cpu_isset(peer_cpu, *online) ?
+            csched_runq_steal(peer_cpu, cpu, snext->pri) : NULL;
         pcpu_schedule_unlock(peer_cpu);
         if ( speer != NULL )
         {
diff -r 72470de157ce xen/common/schedule.c
--- a/xen/common/schedule.c	Wed Feb 16 09:49:33 2011 +0000
+++ b/xen/common/schedule.c	Wed Feb 16 15:09:54 2011 +0100
@@ -395,7 +395,28 @@ static void vcpu_migrate(struct vcpu *v)
     unsigned long flags;
     int old_cpu, new_cpu;
 
-    vcpu_schedule_lock_irqsave(v, flags);
+    for (;;)
+    {
+        vcpu_schedule_lock_irqsave(v, flags);
+
+        /* Select new CPU. */
+        old_cpu = v->processor;
+        new_cpu = SCHED_OP(VCPU2OP(v), pick_cpu, v);
+
+        if ( new_cpu == old_cpu )
+            break;
+
+        if ( !pcpu_schedule_trylock(new_cpu) )
+        {
+            vcpu_schedule_unlock_irqrestore(v, flags);
+            continue;
+        }
+        if ( cpu_isset(new_cpu, v->domain->cpupool->cpu_valid) )
+            break;
+
+        pcpu_schedule_unlock(new_cpu);
+        vcpu_schedule_unlock_irqrestore(v, flags);
+    }
 
     /*
      * NB. Check of v->running happens /after/ setting migration flag
@@ -405,13 +426,12 @@ static void vcpu_migrate(struct vcpu *v)
     if ( v->is_running ||
          !test_and_clear_bit(_VPF_migrating, &v->pause_flags) )
     {
+        if ( old_cpu != new_cpu )
+            pcpu_schedule_unlock(new_cpu);
+
         vcpu_schedule_unlock_irqrestore(v, flags);
         return;
     }
-
-    /* Select new CPU. */
-    old_cpu = v->processor;
-    new_cpu = SCHED_OP(VCPU2OP(v), pick_cpu, v);
 
     /*
      * Transfer urgency status to new CPU before switching CPUs, as once
@@ -424,9 +444,13 @@ static void vcpu_migrate(struct vcpu *v)
         atomic_dec(&per_cpu(schedule_data, old_cpu).urgent_count);
     }
 
-    /* Switch to new CPU, then unlock old CPU.  This is safe because
+    /* Switch to new CPU, then unlock new and old CPU.  This is safe because
      * the lock pointer cant' change while the current lock is held. */
     v->processor = new_cpu;
+
+    if ( old_cpu != new_cpu )
+        pcpu_schedule_unlock(new_cpu);
+
     spin_unlock_irqrestore(
         per_cpu(schedule_data, old_cpu).schedule_lock, flags);
 

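A note on the two hunks above: the sched_credit.c change simply refuses to
steal work from a peer cpu that is not (or no longer) online in the pool,
while the schedule.c change replaces vcpu_migrate()'s single lock acquisition
with a retry loop. The new cpu is picked while the vcpu's schedule lock is
held, the target pcpu's schedule lock is then try-locked, and the choice is
only accepted if that cpu is still valid for the domain's cpupool. A
condensed, commented restatement of that loop follows; it is illustrative
only, the patch itself is authoritative:

    for ( ;; )
    {
        /* The cpu choice below must not race with a cpupool change
         * affecting this domain, so hold the vcpu's schedule lock. */
        vcpu_schedule_lock_irqsave(v, flags);

        old_cpu = v->processor;
        new_cpu = SCHED_OP(VCPU2OP(v), pick_cpu, v);

        /* Staying on the same cpu: nothing further to lock. */
        if ( new_cpu == old_cpu )
            break;

        /* Try-lock the target pcpu; blocking on it while already holding
         * the vcpu lock risks a lock-order deadlock, so back off and
         * retry instead. */
        if ( !pcpu_schedule_trylock(new_cpu) )
        {
            vcpu_schedule_unlock_irqrestore(v, flags);
            continue;
        }

        /* Accept new_cpu only if it is still part of the domain's
         * cpupool; otherwise drop both locks and pick again. */
        if ( cpu_isset(new_cpu, v->domain->cpupool->cpu_valid) )
            break;

        pcpu_schedule_unlock(new_cpu);
        vcpu_schedule_unlock_irqrestore(v, flags);
    }

    /* From here on the vcpu lock and, if new_cpu != old_cpu, the target
     * pcpu lock are held; every exit path has to drop both, which is what
     * the remaining hunks add. */
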
[-- Attachment #3: Type: text/plain, Size: 138 bytes --]

_______________________________________________
Xen-devel mailing list
Xen-devel@lists.xensource.com
http://lists.xensource.com/xen-devel

^ permalink raw reply	[flat|nested] 53+ messages in thread

* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-02-21 13:19                                                                       ` Juergen Gross
@ 2011-02-21 14:45                                                                         ` Andre Przywara
  2011-02-21 14:50                                                                           ` Juergen Gross
  0 siblings, 1 reply; 53+ messages in thread
From: Andre Przywara @ 2011-02-21 14:45 UTC (permalink / raw)
  To: Juergen Gross; +Cc: George Dunlap, xen-devel, Diestelhorst, Stephan

Juergen Gross wrote:
> On 02/21/11 11:00, Andre Przywara wrote:
>> George Dunlap wrote:
>>> Andre (and Juergen), can you try again with the attached patch?
>> I applied this patch on top of 22931 and it did _not_ work.
>> The crash occurred almost immediately after I started my script, so the
>> same behaviour as without the patch.
> 
> Did you try my patch addressing races in the scheduler when moving cpus
> between cpupools?
Sorry, I tried yours first, but it didn't apply cleanly to my particular 
tree (sched_jg_fix ;-), so I ended up testing George's instead.

> I've attached it again. For me it works quite well, while George's patch
> seems not to be enough (machine hanging after some tests with cpupools).
OK, it now applied after a rebase.
And yes, I didn't see a crash! At least not until the script stopped while 
a lot of these messages appeared:
(XEN) do_IRQ: 0.89 No irq handler for vector (irq -1)

That is what I reported before and is most probably totally unrelated to 
this issue.
So I consider this fix working!
I will try to match my recent theories and debug results with your patch 
to see whether this fits.

> OTOH I can't reproduce an error as fast as you even without any patch :-)
> 
>> (attached my script for reference, though it will most likely only make
>> sense on bigger NUMA machines)
> 
> Yeah, on my 2-node system I need several hundred tries to get an error.
> But it seems to be more effective than George's script.
I consider the large over-provisioning to be the reason. With Dom0's 48 
VCPUs squashed together onto just 6 pCPUs, my script triggered the bug by 
the second run at the latest.
With your patch it made 24 iterations before the other bug kicked in.

Thanks very much!
Andre.

> 
> 
> Juergen
> 
>> Regards,
>> Andre.
>>
>>
>>> What the patch basically does is try to make "cpu_disable_scheduler()"
>>> do what it seems to say it does. :-) Namely, the various
>>> scheduler-related interrupts (both per-cpu ticks and the master tick)
>>> are part of the scheduler, so disable them before doing anything, and
>>> don't enable them until the cpu is really ready to go again.
>>>
>>> To be precise:
>>> * cpu_disable_scheduler() disables ticks
>>> * scheduler_cpu_switch() only enables ticks if adding a cpu to a pool,
>>> and does it after inserting the idle vcpu
>>> * Modify semantics, s.t., {alloc,free}_pdata() don't actually start or
>>> stop tickers
>>> + Call tick_{resume,suspend} in cpu_{up,down}, respectively
>>> * Modify credit1's tick_{suspend,resume} to handle the master ticker
>>> as well.
>>>
>>> With this patch (if dom0 doesn't get wedged due to all 8 vcpus being
>>> on one pcpu), I can perform thousands of operations successfully.
>>>
>>> (NB this is not ready for application yet, I just wanted to check to
>>> see if it fixes Andre's problem)
>>>
>>> -George
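Sketched as code, the ordering George describes above would look roughly
like the following. SCHED_OP and the tick_suspend/tick_resume hooks are
named in the thread; everything else here (function signatures, parameter
names) is an assumption for illustration, not the actual patch:

    /* cpu_disable_scheduler(): quiesce the scheduler's interrupt sources
     * for this cpu before any vcpu is migrated away from it. */
    static void cpu_disable_scheduler_sketch(unsigned int cpu,
                                             const struct scheduler *old_ops)
    {
        SCHED_OP(old_ops, tick_suspend, cpu);   /* per-cpu and master tick */
        /* ... move the vcpus off the cpu as before ... */
    }

    /* schedule_cpu_switch(): the tickers stay suspended across the switch
     * and are re-enabled only when the cpu is added to a pool, after its
     * idle vcpu is in place, so no tick can run against a half-initialized
     * scheduler instance. */
    static void schedule_cpu_switch_sketch(unsigned int cpu,
                                           struct cpupool *c,
                                           const struct scheduler *new_ops)
    {
        /* ... swap per-cpu scheduler data and the idle vcpu under the
         *     schedule lock ... */
        if ( c != NULL )                        /* adding, not removing */
            SCHED_OP(new_ops, tick_resume, cpu);
    }
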
>>>
>>> On Wed, Feb 16, 2011 at 9:47 AM, Juergen Gross
>>> <juergen.gross@ts.fujitsu.com> wrote:
>>>> Okay, I have some more data.
>>>>
>>>> I activated cpupool_dprintk() and included checks in sched_credit.c to
>>>> test for weight inconsistencies. To reduce race possibilities I've added
>>>> my patch that always executes cpu assigning/unassigning in a tasklet on
>>>> the cpu to be moved.
>>>>
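Such a weight check could be sketched roughly as below. sdom->weight and
sdom->active_vcpu_count are the fields shown in the output that follows; the
active-domain list and the cached prv->weight total are assumptions about the
credit scheduler's private data, and this is not the actual debug patch:

    /* Recompute the aggregate weight of all active domains of one credit
     * scheduler instance and compare it with the incrementally maintained
     * total; a mismatch means the weight accounting got out of sync. */
    static void csched_check_weights(struct csched_private *prv)
    {
        struct csched_dom *sdom;
        unsigned int total = 0;

        list_for_each_entry( sdom, &prv->active_sdom, active_sdom_elem )
            total += sdom->weight * sdom->active_vcpu_count;

        if ( total != prv->weight )
        {
            printk("credit: weight mismatch: computed %u, cached %u\n",
                   total, prv->weight);
            BUG();
        }
    }
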
>>>> Here is the result:
>>>>
>>>> (XEN) cpupool_unassign_cpu(pool=0,cpu=6)
>>>> (XEN) cpupool_unassign_cpu(pool=0,cpu=6) ret -16
>>>> (XEN) cpupool_unassign_cpu(pool=0,cpu=6)
>>>> (XEN) cpupool_unassign_cpu(pool=0,cpu=6) ret -16
>>>> (XEN) cpupool_assign_cpu(pool=0,cpu=1)
>>>> (XEN) cpupool_assign_cpu(pool=0,cpu=1) ffff83083fff74c0
>>>> (XEN) cpupool_assign_cpu(cpu=1) ret 0
>>>> (XEN) cpupool_assign_cpu(pool=1,cpu=4)
>>>> (XEN) cpupool_assign_cpu(pool=1,cpu=4) ffff831002ad5e40
>>>> (XEN) cpupool_assign_cpu(cpu=4) ret 0
>>>> (XEN) cpu 4, weight 0,prv ffff831002ad5e40, dom 0:
>>>> (XEN) sdom->weight: 256, sdom->active_vcpu_count: 1
>>>> (XEN) Xen BUG at sched_credit.c:570
>>>> (XEN) ----[ Xen-4.1.0-rc5-pre x86_64 debug=y Tainted: C ]----
>>>> (XEN) CPU: 4
>>>> (XEN) RIP: e008:[<ffff82c4801197d7>] csched_tick+0x186/0x37f
>>>> (XEN) RFLAGS: 0000000000010086 CONTEXT: hypervisor
>>>> (XEN) rax: 0000000000000000 rbx: ffff830839d3ec30 rcx: 0000000000000000
>>>> (XEN) rdx: ffff830839dcff18 rsi: 000000000000000a rdi: ffff82c4802542e8
>>>> (XEN) rbp: ffff830839dcfe38 rsp: ffff830839dcfde8 r8: 0000000000000004
>>>> (XEN) r9: ffff82c480213520 r10: 00000000fffffffc r11: 0000000000000001
>>>> (XEN) r12: 0000000000000004 r13: ffff830839d3ec40 r14: ffff831002ad5e40
>>>> (XEN) r15: ffff830839d66f90 cr0: 000000008005003b cr4: 00000000000026f0
>>>> (XEN) cr3: 0000001020a98000 cr2: 00007fc5e9b79d98
>>>> (XEN) ds: 0000 es: 0000 fs: 0000 gs: 0000 ss: e010 cs: e008
>>>> (XEN) Xen stack trace from rsp=ffff830839dcfde8:
>>>> (XEN) ffff83083ffa3ba0 ffff831002ad5e40 0000000000000246
>>>> ffff830839d6c000
>>>> (XEN) 0000000000000000 ffff830839dd1100 0000000000000004
>>>> ffff82c480119651
>>>> (XEN) ffff831002b28018 ffff831002b28010 ffff830839dcfe68
>>>> ffff82c480126204
>>>> (XEN) 0000000000000002 ffff83083ffa3bb8 ffff830839dd1100
>>>> 000000cae439ea7e
>>>> (XEN) ffff830839dcfeb8 ffff82c480126539 00007fc5e9fa5b20
>>>> ffff830839dd1100
>>>> (XEN) ffff831002b28010 0000000000000004 0000000000000004
>>>> ffff82c4802b0880
>>>> (XEN) ffff830839dcff18 ffffffffffffffff ffff830839dcfef8
>>>> ffff82c480123647
>>>> (XEN) ffff830839dcfed8 ffff830077eee000 00007fc5e9b79d98
>>>> 00007fc5e9fa5b20
>>>> (XEN) 0000000000000002 00007fff46826f20 ffff830839dcff08
>>>> ffff82c4801236c2
>>>> (XEN) 00007cf7c62300c7 ffff82c480206ad6 00007fff46826f20
>>>> 0000000000000002
>>>> (XEN) 00007fc5e9fa5b20 00007fc5e9b79d98 00007fff46827260
>>>> 00007fff46826f50
>>>> (XEN) 0000000000000246 0000000000000032 0000000000000000
>>>> 00000000ffffffff
>>>> (XEN) 0000000000000009 00007fc5e9d9de1a 0000000000000003
>>>> 0000000000004848
>>>> (XEN) 00007fc5e9b7a000 0000010000000000 ffffffff800073f0
>>>> 000000000000e033
>>>> (XEN) 0000000000000246 ffff880f97b51fc8 000000000000e02b
>>>> 0000000000000000
>>>> (XEN) 0000000000000000 0000000000000000 0000000000000000
>>>> 0000000000000004
>>>> (XEN) ffff830077eee000 00000043b9afd180 0000000000000000
>>>> (XEN) Xen call trace:
>>>> (XEN) [<ffff82c4801197d7>] csched_tick+0x186/0x37f
>>>> (XEN) [<ffff82c480126204>] execute_timer+0x4e/0x6c
>>>> (XEN) [<ffff82c480126539>] timer_softirq_action+0xf6/0x239
>>>> (XEN) [<ffff82c480123647>] __do_softirq+0x88/0x99
>>>> (XEN) [<ffff82c4801236c2>] do_softirq+0x6a/0x7a
>>>> (XEN)
>>>> (XEN)
>>>> (XEN) ****************************************
>>>> (XEN) Panic on CPU 4:
>>>> (XEN) Xen BUG at sched_credit.c:570
>>>> (XEN) ****************************************
>>>>
>>>> As you can see, a Dom0 vcpu is becoming active on a pool 1 cpu. The
>>>> BUG_ON triggered in csched_acct() is a logical result of this.
>>>>
>>>> How this can happen I don't know yet.
>>>> Anyone any idea? I'll keep searching...
>>>>
>>>>
>>>> Juergen
>>>>
>>>> On 02/15/11 08:22, Juergen Gross wrote:
>>>>> On 02/14/11 18:57, George Dunlap wrote:
>>>>>> The good news is, I've managed to reproduce this on my local test
>>>>>> hardware with 1x4x2 (1 socket, 4 cores, 2 threads per core) using the
>>>>>> attached script. It's time to go home now, but I should be able to
>>>>>> dig something up tomorrow.
>>>>>>
>>>>>> To use the script:
>>>>>> * Rename cpupool0 to "p0", and create an empty second pool, "p1"
>>>>>> * You can modify elements by adding "arg=val" as arguments.
>>>>>> * Arguments are:
>>>>>> + dryrun={true,false} Do the work, but don't actually execute any xl
>>>>>> commands. Default false.
>>>>>> + left: Number of commands to execute. Default 10.
>>>>>> + maxcpus: highest numerical value for a cpu. Default 7 (i.e., 0-7 is
>>>>>> 8 cpus).
>>>>>> + verbose={true,false} Print what you're doing. Default is true.
>>>>>>
>>>>>> The script sometimes attempts to remove the last cpu from cpupool0; in
>>>>>> this case, libxl will print an error. If the script gets an error
>>>>>> under that condition, it will ignore it; under any other condition, it
>>>>>> will print diagnostic information.
>>>>>>
>>>>>> What finally crashed it for me was this command:
>>>>>> # ./cpupool-test.sh verbose=false left=1000
>>>>> Nice!
>>>>> With your script I finally managed to get the error, too. On my box
>>>>> (2 sockets with 6 cores each) I had to use
>>>>>
>>>>> ./cpupool-test.sh verbose=false left=10000 maxcpus=11
>>>>>
>>>>> to trigger it.
>>>>> Looking for more data now...
>>>>>
>>>>>
>>>>> Juergen
>>>>>
>>>>>> -George
>>>>>>
>>>>>> On Fri, Feb 11, 2011 at 7:39 AM, Andre
>>>>>> Przywara<andre.przywara@amd.com> wrote:
>>>>>>> Juergen Gross wrote:
>>>>>>>> On 02/10/11 15:18, Andre Przywara wrote:
>>>>>>>>> Andre Przywara wrote:
>>>>>>>>>> On 02/10/2011 07:42 AM, Juergen Gross wrote:
>>>>>>>>>>> On 02/09/11 15:21, Juergen Gross wrote:
>>>>>>>>>>>> Andre, George,
>>>>>>>>>>>>
>>>>>>>>>>>>
>>>>>>>>>>>> What seems to be interesting: I think the problem always occurred
>>>>>>>>>>>> when a new cpupool was created and the first cpu was moved to it.
>>>>>>>>>>>>
>>>>>>>>>>>> I think my previous assumption regarding the master_ticker was
>>>>>>>>>>>> not too bad.
>>>>>>>>>>>> I think somehow the master_ticker of the new cpupool is becoming
>>>>>>>>>>>> active before the scheduler is really initialized properly. This
>>>>>>>>>>>> could happen if enough time is spent between alloc_pdata for the
>>>>>>>>>>>> cpu to be moved and the critical section in schedule_cpu_switch().
>>>>>>>>>>>>
>>>>>>>>>>>> The solution should be to activate the timers only if the
>>>>>>>>>>>> scheduler is ready for them.
>>>>>>>>>>>>
>>>>>>>>>>>> George, do you think the master_ticker should be stopped in
>>>>>>>>>>>> suspend_ticker as well? I still see potential problems for
>>>>>>>>>>>> entering deep C-States. I think I'll prepare a patch which will
>>>>>>>>>>>> keep the master_ticker active for the C-State case and migrate
>>>>>>>>>>>> it for the schedule_cpu_switch() case.
>>>>>>>>>>> Okay, here is a patch for this. It ran on my 4-core machine
>>>>>>>>>>> without any
>>>>>>>>>>> problems.
>>>>>>>>>>> Andre, could you give it a try?
>>>>>>>>>> Did, but unfortunately it crashed as always. Tried twice and made
>>>>>>>>>> sure I booted the right kernel. Sorry.
>>>>>>>>>> The idea of a race between the timer and the state change sounded
>>>>>>>>>> very appealing; actually, that was suspicious to me from the
>>>>>>>>>> beginning.
>>>>>>>>>>
>>>>>>>>>> I will add some code to dump the state of all cpupools at the
>>>>>>>>>> BUG_ON to see which situation we are in when the bug triggers.
>>>>>>>>> OK, here is a first try at this: the patch iterates over all CPU
>>>>>>>>> pools and outputs some data if the BUG_ON
>>>>>>>>> ((sdom->weight * sdom->active_vcpu_count) > weight_left) condition
>>>>>>>>> triggers:
>>>>>>>>> (XEN) CPU pool #0: 1 domains (SMP Credit Scheduler), mask:
>>>>>>>>> fffffffc003f
>>>>>>>>> (XEN) CPU pool #1: 0 domains (SMP Credit Scheduler), mask: fc0
>>>>>>>>> (XEN) CPU pool #2: 0 domains (SMP Credit Scheduler), mask: 1000
>>>>>>>>> (XEN) Xen BUG at sched_credit.c:1010
>>>>>>>>> ....
>>>>>>>>> The masks look proper (6 cores per node); the bug triggers when the
>>>>>>>>> first CPU is about to be(?) inserted.
>>>>>>>> Sure? I'm missing the cpu with mask 2000.
>>>>>>>> I'll try to reproduce the problem on a larger machine here (24
>>>>>>>> cores, 4
>>>>>>>> numa
>>>>>>>> nodes).
>>>>>>>> Andre, can you give me your xen boot parameters? Which xen changeset
>>>>>>>> are
>>>>>>>> you
>>>>>>>> running, and do you have any additional patches in use?
>>>>>>> The grub lines:
>>>>>>> kernel (hd1,0)/boot/xen-22858_debug_04.gz console=com1,vga
>>>>>>> com1=115200
>>>>>>> module (hd1,0)/boot/vmlinuz-2.6.32.27_pvops console=tty0
>>>>>>> console=ttyS0,115200 ro root=/dev/sdb1 xencons=hvc0
>>>>>>>
>>>>>>> All of my experiments use c/s 22858 as a base.
>>>>>>> If you use an AMD Magny-Cours box for your experiments (socket C32 or
>>>>>>> G34),
>>>>>>> you should add the following patch (removing the line)
>>>>>>> --- a/xen/arch/x86/traps.c
>>>>>>> +++ b/xen/arch/x86/traps.c
>>>>>>> @@ -803,7 +803,6 @@ static void pv_cpuid(struct cpu_user_regs *regs)
>>>>>>> __clear_bit(X86_FEATURE_SKINIT % 32,&c);
>>>>>>> __clear_bit(X86_FEATURE_WDT % 32,&c);
>>>>>>> __clear_bit(X86_FEATURE_LWP % 32,&c);
>>>>>>> - __clear_bit(X86_FEATURE_NODEID_MSR % 32,&c);
>>>>>>> __clear_bit(X86_FEATURE_TOPOEXT % 32,&c);
>>>>>>> break;
>>>>>>> case 5: /* MONITOR/MWAIT */
>>>>>>>
>>>>>>> This is not necessary (in fact it reverts my patch c/s 22815), but it
>>>>>>> raises the probability of triggering the bug, probably because it
>>>>>>> increases the pressure on the Dom0 scheduler. If you cannot trigger it
>>>>>>> with Dom0, try to create a guest with many VCPUs and squeeze it into a
>>>>>>> small CPU-pool.
>>>>>>>
>>>>>>> Good luck ;-)
>>>>>>> Andre.
>>>>>>>
>>>>>>> --
>>>>>>> Andre Przywara
>>>>>>> AMD-OSRC (Dresden)
>>>>>>> Tel: x29712
>>>>>>>
>>>>>>>
>>>>>>> _______________________________________________
>>>>>>> Xen-devel mailing list
>>>>>>> Xen-devel@lists.xensource.com
>>>>>>> http://lists.xensource.com/xen-devel
>>>> --
>>>> Juergen Gross Principal Developer Operating Systems
>>>> TSP ES&S SWE OS6 Telephone: +49 (0) 89 3222 2967
>>>> Fujitsu Technology Solutions e-mail:
>>>> juergen.gross@ts.fujitsu.com
>>>> Domagkstr. 28 Internet: ts.fujitsu.com
>>>> D-80807 Muenchen Company details:
>>>> ts.fujitsu.com/imprint.html
>>>>
>>>> _______________________________________________
>>>> Xen-devel mailing list
>>>> Xen-devel@lists.xensource.com
>>>> http://lists.xensource.com/xen-devel
>>>>
>>
>>
>>
>> _______________________________________________
>> Xen-devel mailing list
>> Xen-devel@lists.xensource.com
>> http://lists.xensource.com/xen-devel
> 
> 
> --
> Juergen Gross                 Principal Developer Operating Systems
> TSP ES&S SWE OS6                       Telephone: +49 (0) 89 3222 2967
> Fujitsu Technology Solutions              e-mail: juergen.gross@ts.fujitsu.com
> Domagkstr. 28                           Internet: ts.fujitsu.com
> D-80807 Muenchen                 Company details: ts.fujitsu.com/imprint.html
> 


-- 
Andre Przywara
AMD-OSRC (Dresden)
Tel: x29712

^ permalink raw reply	[flat|nested] 53+ messages in thread

* Re: Hypervisor crash(!) on xl cpupool-numa-split
  2011-02-21 14:45                                                                         ` Andre Przywara
@ 2011-02-21 14:50                                                                           ` Juergen Gross
  0 siblings, 0 replies; 53+ messages in thread
From: Juergen Gross @ 2011-02-21 14:50 UTC (permalink / raw)
  To: Andre Przywara; +Cc: George Dunlap, xen-devel, Diestelhorst, Stephan

On 02/21/11 15:45, Andre Przywara wrote:
> Juergen Gross wrote:
>> On 02/21/11 11:00, Andre Przywara wrote:
>>> George Dunlap wrote:
>>>> Andre (and Juergen), can you try again with the attached patch?
>>> I applied this patch on top of 22931 and it did _not_ work.
>>> The crash occurred almost immediately after I started my script, so the
>>> same behaviour as without the patch.
>>
>> Did you try my patch addressing races in the scheduler when moving cpus
>> between cpupools?
> Sorry, I tried yours first, but it didn't apply cleanly to my particular
> tree (sched_jg_fix ;-), so I ended up testing George's instead.
>
>> I've attached it again. For me it works quite well, while George's patch
>> seems not to be enough (machine hanging after some tests with cpupools).
> OK, it now applied after a rebase.
> And yes, I didn't see a crash! At least not until the script stopped while
> a lot of these messages appeared:
> (XEN) do_IRQ: 0.89 No irq handler for vector (irq -1)
>
> That is what I reported before and is most probably totally unrelated to
> this issue.
> So I consider this fix working!
> I will try to match my recent theories and debug results with your patch
> to see whether this fits.
>
>> OTOH I can't reproduce an error as fast as you even without any patch :-)
>>
>>> (attached my script for reference, though it will most likely only make
>>> sense on bigger NUMA machines)
>>
>> Yeah, on my 2-node system I need several hundred tries to get an error.
>> But it seems to be more effective than George's script.
> I consider the large over-provisioning to be the reason. With Dom0's 48
> VCPUs squashed together onto just 6 pCPUs, my script triggered the bug by
> the second run at the latest.
> With your patch it made 24 iterations before the other bug kicked in.

Okay, I'll prepare an official patch. It might take a few days, as I'm not in
the office until Thursday.


Juergen

-- 
Juergen Gross                 Principal Developer Operating Systems
TSP ES&S SWE OS6                       Telephone: +49 (0) 89 3222 2967
Fujitsu Technology Solutions              e-mail: juergen.gross@ts.fujitsu.com
Domagkstr. 28                           Internet: ts.fujitsu.com
D-80807 Muenchen                 Company details: ts.fujitsu.com/imprint.html

^ permalink raw reply	[flat|nested] 53+ messages in thread

end of thread, other threads:[~2011-02-21 14:50 UTC | newest]

Thread overview: 53+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2011-01-27 23:18 Hypervisor crash(!) on xl cpupool-numa-split Andre Przywara
2011-01-28  6:47 ` Juergen Gross
2011-01-28 11:07   ` Andre Przywara
2011-01-28 11:44     ` Juergen Gross
2011-01-28 13:14       ` Andre Przywara
2011-01-31  7:04         ` Juergen Gross
2011-01-31 14:59           ` Andre Przywara
2011-01-31 15:28             ` George Dunlap
2011-02-01 16:32               ` Andre Przywara
2011-02-02  6:27                 ` Juergen Gross
2011-02-02  8:49                   ` Juergen Gross
2011-02-02 10:05                     ` Juergen Gross
2011-02-02 10:59                       ` Andre Przywara
2011-02-02 14:39                 ` Stephan Diestelhorst
2011-02-02 15:14                   ` Juergen Gross
2011-02-02 16:01                     ` Stephan Diestelhorst
2011-02-03  5:57                       ` Juergen Gross
2011-02-03  9:18                         ` Juergen Gross
2011-02-04 14:09                           ` Andre Przywara
2011-02-07 12:38                             ` Andre Przywara
2011-02-07 13:32                               ` Juergen Gross
2011-02-07 15:55                                 ` George Dunlap
2011-02-08  5:43                                   ` Juergen Gross
2011-02-08 12:08                                     ` George Dunlap
2011-02-08 12:14                                       ` George Dunlap
2011-02-08 16:33                                         ` Andre Przywara
2011-02-09 12:27                                           ` George Dunlap
2011-02-09 12:27                                             ` George Dunlap
2011-02-09 13:04                                               ` Juergen Gross
2011-02-09 13:39                                                 ` Andre Przywara
2011-02-09 13:51                                               ` Andre Przywara
2011-02-09 14:21                                                 ` Juergen Gross
2011-02-10  6:42                                                   ` Juergen Gross
2011-02-10  9:25                                                     ` Andre Przywara
2011-02-10 14:18                                                       ` Andre Przywara
2011-02-11  6:17                                                         ` Juergen Gross
2011-02-11  7:39                                                           ` Andre Przywara
2011-02-14 17:57                                                             ` George Dunlap
2011-02-15  7:22                                                               ` Juergen Gross
2011-02-16  9:47                                                                 ` Juergen Gross
2011-02-16 13:54                                                                   ` George Dunlap
     [not found]                                                                     ` <4D6237C6.1050206@amd.com>
2011-02-16 14:11                                                                     ` Juergen Gross
2011-02-16 14:28                                                                       ` Juergen Gross
2011-02-17  0:05                                                                       ` André Przywara
2011-02-17  7:05                                                                     ` Juergen Gross
2011-02-17  9:11                                                                       ` Juergen Gross
2011-02-21 10:00                                                                     ` Andre Przywara
2011-02-21 13:19                                                                       ` Juergen Gross
2011-02-21 14:45                                                                         ` Andre Przywara
2011-02-21 14:50                                                                           ` Juergen Gross
2011-02-08 12:23                                       ` Juergen Gross
2011-01-28 11:13   ` George Dunlap
2011-01-28 13:05     ` Andre Przywara
