linux-kernel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* [migration] kernel BUG at kernel/irq_work.c:175!
@ 2014-06-25  9:31 Fengguang Wu
  2014-06-25  9:45 ` Srivatsa S. Bhat
  0 siblings, 1 reply; 2+ messages in thread
From: Fengguang Wu @ 2014-06-25  9:31 UTC (permalink / raw)
  To: Linux Memory Management List
  Cc: LKML, Andrew Morton, Stephen Rothwell, Jet Chen, Yuanhan Liu,
	LKP, Su, Tao

Greetings,

0day kernel testing robot got the below dmesg and the first bad commit is

git://git.kernel.org/pub/scm/linux/kernel/git/next/linux-next.git master
commit 68c90b2c635f18ad51ae7440162f6c082ea1288d
Merge: f08af6f ec11f8c
Author:     Stephen Rothwell <sfr@canb.auug.org.au>
AuthorDate: Mon Jun 23 14:12:48 2014 +1000

    Merge branch 'akpm-current/current'

+---------------------------------+------------+------------+------------+---------------+
|                                 | f08af6fa87 | ec11f8c81f | 68c90b2c63 | next-20140623 |
+---------------------------------+------------+------------+------------+---------------+
| boot_successes                  | 60         | 60         | 0          | 0             |
| boot_failures                   | 0          | 0          | 20         | 13            |
| kernel_BUG_at_kernel/irq_work.c | 0          | 0          | 20         | 13            |
| invalid_opcode                  | 0          | 0          | 20         | 13            |
| RIP:irq_work_run                | 0          | 0          | 20         | 13            |
| backtrace:smpboot_thread_fn     | 0          | 0          | 20         | 13            |
+---------------------------------+------------+------------+------------+---------------+

[    2.194744] EDD information not available.
[    2.195290] Unregister pv shared memory for cpu 0
[    2.206025] ------------[ cut here ]------------
[    2.206025] kernel BUG at kernel/irq_work.c:175!
[    2.206025] invalid opcode: 0000 [#1] SMP DEBUG_PAGEALLOC
[    2.206025] CPU: 0 PID: 9 Comm: migration/0 Not tainted 3.16.0-rc2-02039-g68c90b2 #1
[    2.206025] Hardware name: Bochs Bochs, BIOS Bochs 01/01/2011
[    2.206025] task: ffff88001219a7e0 ti: ffff8800121a4000 task.ti: ffff8800121a4000
[    2.206025] RIP: 0010:[<ffffffff810f9318>]  [<ffffffff810f9318>] irq_work_run+0xf/0x1c
[    2.206025] RSP: 0000:ffff8800121a7c48  EFLAGS: 00010046
[    2.206025] RAX: 0000000080000001 RBX: 0000000000000000 RCX: 0000000000000005
[    2.206025] RDX: 0000000000000000 RSI: 0000000000000008 RDI: 0000000000000000
[    2.206025] RBP: ffff8800121a7c68 R08: 0000000000000002 R09: 0000000000000001
[    2.206025] R10: ffffffff810e2a10 R11: ffffffff810b9de3 R12: ffff880012412340
[    2.206025] R13: 0000000000000000 R14: 0000000000000000 R15: ffffffff81c83e50
[    2.206025] FS:  0000000000000000(0000) GS:ffff880012400000(0000) knlGS:0000000000000000
[    2.206025] CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
[    2.206025] CR2: 0000000000000000 CR3: 0000000001c0c000 CR4: 00000000000006b0
[    2.206025] Stack:
[    2.206025]  ffffffff810e87e0 ffff880012412380 00000000fffffff0 ffffffff81c81ba0
[    2.206025]  ffff8800121a7c88 ffffffff810e88f0 0000000000000001 00000000fffffff0
[    2.206025]  ffff8800121a7cd0 ffffffff810b6e23 0000000000000000 0000000000000008
[    2.206025] Call Trace:
[    2.206025]  [<ffffffff810e87e0>] ? flush_smp_call_function_queue+0xa4/0x107
[    2.206025]  [<ffffffff810e88f0>] hotplug_cfd+0xad/0xbb
[    2.206025]  [<ffffffff810b6e23>] notifier_call_chain+0x68/0x8e
[    2.206025]  [<ffffffff810b70c0>] __raw_notifier_call_chain+0x9/0xb
[    2.206025]  [<ffffffff8109b39e>] __cpu_notify+0x1b/0x32
[    2.206025]  [<ffffffff8109b3c3>] cpu_notify+0xe/0x10
[    2.206025]  [<ffffffff817e2817>] take_cpu_down+0x22/0x35
[    2.206025]  [<ffffffff810f4153>] multi_cpu_stop+0x8c/0xe2
[    2.206025]  [<ffffffff810f40c7>] ? cpu_stopper_thread+0x126/0x126
[    2.206025]  [<ffffffff810f402e>] cpu_stopper_thread+0x8d/0x126
[    2.206025]  [<ffffffff810cdab4>] ? lock_acquire+0x94/0x9d
[    2.206025]  [<ffffffff817f25af>] ? _raw_spin_unlock_irqrestore+0x40/0x55
[    2.206025]  [<ffffffff810cbdcd>] ? trace_hardirqs_on_caller+0x171/0x18d
[    2.206025]  [<ffffffff817f25b7>] ? _raw_spin_unlock_irqrestore+0x48/0x55
[    2.206025]  [<ffffffff810b8e39>] smpboot_thread_fn+0x182/0x1a0
[    2.206025]  [<ffffffff810b8cb7>] ? in_egroup_p+0x2e/0x2e
[    2.206025]  [<ffffffff810b372c>] kthread+0xcd/0xd5
[    2.206025]  [<ffffffff810b365f>] ? __kthread_parkme+0x5c/0x5c
[    2.206025]  [<ffffffff817f2f3c>] ret_from_fork+0x7c/0xb0
[    2.206025]  [<ffffffff810b365f>] ? __kthread_parkme+0x5c/0x5c
[    2.206025] Code: 48 c7 c7 65 cd b0 81 e8 43 20 fa ff c6 05 50 e1 c9 00 01 eb 02 31 db 88 d8 5b 5d c3 65 8b 04 25 10 b8 00 00 a9 00 00 0f 00 75 02 <0f> 0b 55 48 89 e5 e8 b5 fd ff ff 5d c3 55 48 89 e5 53 48 89 fb 
[    2.206025] RIP  [<ffffffff810f9318>] irq_work_run+0xf/0x1c
[    2.206025]  RSP <ffff8800121a7c48>
[    2.206025] ---[ end trace f7f1564c3a1f35d0 ]---
[    2.206025] note: migration/0[9] exited with preempt_count 1

git bisect start 58ae500a03a6bf68eee323c342431bfdd3f460b6 f08af6fa87ea33262fe2fe5167119fb55ad9dd2c --
git bisect  bad 68c90b2c635f18ad51ae7440162f6c082ea1288d  # 14:19      0-     20  Merge branch 'akpm-current/current'
git bisect good 6b11d02e25c79a8961983a966b7fafcdc36c7a91  # 14:23     20+      0  slab: do not keep free objects/slabs on dead memcg caches
git bisect good 11709212b3a5479fcc63dda3160f4f4b0251f914  # 14:27     20+      0  mm/util.c: add kstrimdup()
git bisect good 6af20930dcfcd13270de4f29f3830312f3c36a17  # 14:33     20+      0  fork: reset mm->pinned_vm
git bisect good 8e7c32fb574ec1b49fd0e451cb25febf51430dd9  # 14:38     20+      0  fs/qnx6: use pr_fmt and __func__ in logging
git bisect good 6873969c750b85734bc7d06be3c51ad381b3c85a  # 14:41     20+      0  shm: remove unneeded extern for function
git bisect good 2b9ed79abc340e15bc9652048d2e8d8a283bd8a1  # 14:48     20+      0  um: use asm-generic/scatterlist.h
git bisect good ec11f8c81fbc76534c1374e29bdf36f085ed859a  # 15:12     20+      0  lib/scatterlist: clean up useless architecture versions of scatterlist.h
# first bad commit: [68c90b2c635f18ad51ae7440162f6c082ea1288d] Merge branch 'akpm-current/current'
git bisect good f08af6fa87ea33262fe2fe5167119fb55ad9dd2c  # 15:14     60+      0  Merge branch 'rd-docs/master'
git bisect good ec11f8c81fbc76534c1374e29bdf36f085ed859a  # 15:19     60+      0  lib/scatterlist: clean up useless architecture versions of scatterlist.h
git bisect  bad 58ae500a03a6bf68eee323c342431bfdd3f460b6  # 15:19      0-     13  Add linux-next specific files for 20140623
git bisect good a497c3ba1d97fc69c1e78e7b96435ba8c2cb42ee  # 15:25     60+      0  Linux 3.16-rc2
git bisect  bad 58ae500a03a6bf68eee323c342431bfdd3f460b6  # 15:25      0-     13  Add linux-next specific files for 20140623


This script may reproduce the error.

-----------------------------------------------------------------------------
#!/bin/bash

kernel=$1
initrd=quantal-core-x86_64.cgz

wget --no-clobber https://github.com/fengguang/reproduce-kernel-bug/blob/master/initrd/$initrd

kvm=(
	qemu-system-x86_64 -cpu kvm64 -enable-kvm 
	-kernel $kernel
	-initrd $initrd
	-smp 2
	-m 256M
	-net nic,vlan=0,macaddr=00:00:00:00:00:00,model=virtio
	-net user,vlan=0
	-net nic,vlan=1,model=e1000
	-net user,vlan=1
	-boot order=nc
	-no-reboot
	-watchdog i6300esb
	-serial stdio
	-display none
	-monitor null
)

append=(
	debug
	sched_debug
	apic=debug
	ignore_loglevel
	sysrq_always_enabled
	panic=10
	prompt_ramdisk=0
	earlyprintk=ttyS0,115200
	console=ttyS0,115200
	console=tty0
	vga=normal
	root=/dev/ram0
	rw
)

"${kvm[@]}" --append "${append[*]}"
-----------------------------------------------------------------------------

Thanks,
Fengguang

^ permalink raw reply	[flat|nested] 2+ messages in thread

* Re: [migration] kernel BUG at kernel/irq_work.c:175!
  2014-06-25  9:31 [migration] kernel BUG at kernel/irq_work.c:175! Fengguang Wu
@ 2014-06-25  9:45 ` Srivatsa S. Bhat
  0 siblings, 0 replies; 2+ messages in thread
From: Srivatsa S. Bhat @ 2014-06-25  9:45 UTC (permalink / raw)
  To: Fengguang Wu, Linux Memory Management List
  Cc: LKML, Andrew Morton, Stephen Rothwell, Jet Chen, Yuanhan Liu,
	LKP, Su, Tao, Viresh Kumar, Peter Zijlstra, Stephen Warren

On 06/25/2014 03:01 PM, Fengguang Wu wrote:
> Greetings,
> 
> 0day kernel testing robot got the below dmesg and the first bad commit is
>

I think this is the same issue as the one reported by Stephen Warren
here:

https://lkml.org/lkml/2014/6/24/765

Peter Zijlstra is working on a fix for that.

Regards,
Srivatsa S. Bhat

 
> git://git.kernel.org/pub/scm/linux/kernel/git/next/linux-next.git master
> commit 68c90b2c635f18ad51ae7440162f6c082ea1288d
> Merge: f08af6f ec11f8c
> Author:     Stephen Rothwell <sfr@canb.auug.org.au>
> AuthorDate: Mon Jun 23 14:12:48 2014 +1000
> 
>     Merge branch 'akpm-current/current'
> 
> +---------------------------------+------------+------------+------------+---------------+
> |                                 | f08af6fa87 | ec11f8c81f | 68c90b2c63 | next-20140623 |
> +---------------------------------+------------+------------+------------+---------------+
> | boot_successes                  | 60         | 60         | 0          | 0             |
> | boot_failures                   | 0          | 0          | 20         | 13            |
> | kernel_BUG_at_kernel/irq_work.c | 0          | 0          | 20         | 13            |
> | invalid_opcode                  | 0          | 0          | 20         | 13            |
> | RIP:irq_work_run                | 0          | 0          | 20         | 13            |
> | backtrace:smpboot_thread_fn     | 0          | 0          | 20         | 13            |
> +---------------------------------+------------+------------+------------+---------------+
> 
> [    2.194744] EDD information not available.
> [    2.195290] Unregister pv shared memory for cpu 0
> [    2.206025] ------------[ cut here ]------------
> [    2.206025] kernel BUG at kernel/irq_work.c:175!
> [    2.206025] invalid opcode: 0000 [#1] SMP DEBUG_PAGEALLOC
> [    2.206025] CPU: 0 PID: 9 Comm: migration/0 Not tainted 3.16.0-rc2-02039-g68c90b2 #1
> [    2.206025] Hardware name: Bochs Bochs, BIOS Bochs 01/01/2011
> [    2.206025] task: ffff88001219a7e0 ti: ffff8800121a4000 task.ti: ffff8800121a4000
> [    2.206025] RIP: 0010:[<ffffffff810f9318>]  [<ffffffff810f9318>] irq_work_run+0xf/0x1c
> [    2.206025] RSP: 0000:ffff8800121a7c48  EFLAGS: 00010046
> [    2.206025] RAX: 0000000080000001 RBX: 0000000000000000 RCX: 0000000000000005
> [    2.206025] RDX: 0000000000000000 RSI: 0000000000000008 RDI: 0000000000000000
> [    2.206025] RBP: ffff8800121a7c68 R08: 0000000000000002 R09: 0000000000000001
> [    2.206025] R10: ffffffff810e2a10 R11: ffffffff810b9de3 R12: ffff880012412340
> [    2.206025] R13: 0000000000000000 R14: 0000000000000000 R15: ffffffff81c83e50
> [    2.206025] FS:  0000000000000000(0000) GS:ffff880012400000(0000) knlGS:0000000000000000
> [    2.206025] CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
> [    2.206025] CR2: 0000000000000000 CR3: 0000000001c0c000 CR4: 00000000000006b0
> [    2.206025] Stack:
> [    2.206025]  ffffffff810e87e0 ffff880012412380 00000000fffffff0 ffffffff81c81ba0
> [    2.206025]  ffff8800121a7c88 ffffffff810e88f0 0000000000000001 00000000fffffff0
> [    2.206025]  ffff8800121a7cd0 ffffffff810b6e23 0000000000000000 0000000000000008
> [    2.206025] Call Trace:
> [    2.206025]  [<ffffffff810e87e0>] ? flush_smp_call_function_queue+0xa4/0x107
> [    2.206025]  [<ffffffff810e88f0>] hotplug_cfd+0xad/0xbb
> [    2.206025]  [<ffffffff810b6e23>] notifier_call_chain+0x68/0x8e
> [    2.206025]  [<ffffffff810b70c0>] __raw_notifier_call_chain+0x9/0xb
> [    2.206025]  [<ffffffff8109b39e>] __cpu_notify+0x1b/0x32
> [    2.206025]  [<ffffffff8109b3c3>] cpu_notify+0xe/0x10
> [    2.206025]  [<ffffffff817e2817>] take_cpu_down+0x22/0x35
> [    2.206025]  [<ffffffff810f4153>] multi_cpu_stop+0x8c/0xe2
> [    2.206025]  [<ffffffff810f40c7>] ? cpu_stopper_thread+0x126/0x126
> [    2.206025]  [<ffffffff810f402e>] cpu_stopper_thread+0x8d/0x126
> [    2.206025]  [<ffffffff810cdab4>] ? lock_acquire+0x94/0x9d
> [    2.206025]  [<ffffffff817f25af>] ? _raw_spin_unlock_irqrestore+0x40/0x55
> [    2.206025]  [<ffffffff810cbdcd>] ? trace_hardirqs_on_caller+0x171/0x18d
> [    2.206025]  [<ffffffff817f25b7>] ? _raw_spin_unlock_irqrestore+0x48/0x55
> [    2.206025]  [<ffffffff810b8e39>] smpboot_thread_fn+0x182/0x1a0
> [    2.206025]  [<ffffffff810b8cb7>] ? in_egroup_p+0x2e/0x2e
> [    2.206025]  [<ffffffff810b372c>] kthread+0xcd/0xd5
> [    2.206025]  [<ffffffff810b365f>] ? __kthread_parkme+0x5c/0x5c
> [    2.206025]  [<ffffffff817f2f3c>] ret_from_fork+0x7c/0xb0
> [    2.206025]  [<ffffffff810b365f>] ? __kthread_parkme+0x5c/0x5c
> [    2.206025] Code: 48 c7 c7 65 cd b0 81 e8 43 20 fa ff c6 05 50 e1 c9 00 01 eb 02 31 db 88 d8 5b 5d c3 65 8b 04 25 10 b8 00 00 a9 00 00 0f 00 75 02 <0f> 0b 55 48 89 e5 e8 b5 fd ff ff 5d c3 55 48 89 e5 53 48 89 fb 
> [    2.206025] RIP  [<ffffffff810f9318>] irq_work_run+0xf/0x1c
> [    2.206025]  RSP <ffff8800121a7c48>
> [    2.206025] ---[ end trace f7f1564c3a1f35d0 ]---
> [    2.206025] note: migration/0[9] exited with preempt_count 1
> 
> git bisect start 58ae500a03a6bf68eee323c342431bfdd3f460b6 f08af6fa87ea33262fe2fe5167119fb55ad9dd2c --
> git bisect  bad 68c90b2c635f18ad51ae7440162f6c082ea1288d  # 14:19      0-     20  Merge branch 'akpm-current/current'
> git bisect good 6b11d02e25c79a8961983a966b7fafcdc36c7a91  # 14:23     20+      0  slab: do not keep free objects/slabs on dead memcg caches
> git bisect good 11709212b3a5479fcc63dda3160f4f4b0251f914  # 14:27     20+      0  mm/util.c: add kstrimdup()
> git bisect good 6af20930dcfcd13270de4f29f3830312f3c36a17  # 14:33     20+      0  fork: reset mm->pinned_vm
> git bisect good 8e7c32fb574ec1b49fd0e451cb25febf51430dd9  # 14:38     20+      0  fs/qnx6: use pr_fmt and __func__ in logging
> git bisect good 6873969c750b85734bc7d06be3c51ad381b3c85a  # 14:41     20+      0  shm: remove unneeded extern for function
> git bisect good 2b9ed79abc340e15bc9652048d2e8d8a283bd8a1  # 14:48     20+      0  um: use asm-generic/scatterlist.h
> git bisect good ec11f8c81fbc76534c1374e29bdf36f085ed859a  # 15:12     20+      0  lib/scatterlist: clean up useless architecture versions of scatterlist.h
> # first bad commit: [68c90b2c635f18ad51ae7440162f6c082ea1288d] Merge branch 'akpm-current/current'
> git bisect good f08af6fa87ea33262fe2fe5167119fb55ad9dd2c  # 15:14     60+      0  Merge branch 'rd-docs/master'
> git bisect good ec11f8c81fbc76534c1374e29bdf36f085ed859a  # 15:19     60+      0  lib/scatterlist: clean up useless architecture versions of scatterlist.h
> git bisect  bad 58ae500a03a6bf68eee323c342431bfdd3f460b6  # 15:19      0-     13  Add linux-next specific files for 20140623
> git bisect good a497c3ba1d97fc69c1e78e7b96435ba8c2cb42ee  # 15:25     60+      0  Linux 3.16-rc2
> git bisect  bad 58ae500a03a6bf68eee323c342431bfdd3f460b6  # 15:25      0-     13  Add linux-next specific files for 20140623
> 
> 
> This script may reproduce the error.
> 
> -----------------------------------------------------------------------------
> #!/bin/bash
> 
> kernel=$1
> initrd=quantal-core-x86_64.cgz
> 
> wget --no-clobber https://github.com/fengguang/reproduce-kernel-bug/blob/master/initrd/$initrd
> 
> kvm=(
> 	qemu-system-x86_64 -cpu kvm64 -enable-kvm 
> 	-kernel $kernel
> 	-initrd $initrd
> 	-smp 2
> 	-m 256M
> 	-net nic,vlan=0,macaddr=00:00:00:00:00:00,model=virtio
> 	-net user,vlan=0
> 	-net nic,vlan=1,model=e1000
> 	-net user,vlan=1
> 	-boot order=nc
> 	-no-reboot
> 	-watchdog i6300esb
> 	-serial stdio
> 	-display none
> 	-monitor null
> )
> 
> append=(
> 	debug
> 	sched_debug
> 	apic=debug
> 	ignore_loglevel
> 	sysrq_always_enabled
> 	panic=10
> 	prompt_ramdisk=0
> 	earlyprintk=ttyS0,115200
> 	console=ttyS0,115200
> 	console=tty0
> 	vga=normal
> 	root=/dev/ram0
> 	rw
> )
> 
> "${kvm[@]}" --append "${append[*]}"
> -----------------------------------------------------------------------------
> 
> Thanks,
> Fengguang
> 


^ permalink raw reply	[flat|nested] 2+ messages in thread

end of thread, other threads:[~2014-06-25  9:47 UTC | newest]

Thread overview: 2+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2014-06-25  9:31 [migration] kernel BUG at kernel/irq_work.c:175! Fengguang Wu
2014-06-25  9:45 ` Srivatsa S. Bhat

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).