All of lore.kernel.org
 help / color / mirror / Atom feed
* 3.9.2: xfstests triggered panic
       [not found] <40971621.4497871.1369211701112.JavaMail.root@redhat.com>
@ 2013-05-22  8:39   ` CAI Qian
  0 siblings, 0 replies; 16+ messages in thread
From: CAI Qian @ 2013-05-22  8:39 UTC (permalink / raw)
  To: LKML; +Cc: xfs, stable

Reproduced on almost all s390x guests by running xfstests.

14634.396658¨ XFS (dm-1): Mounting Filesystem 
14634.525522¨ XFS (dm-1): Ending clean mount 
14640.413007¨  <000000000017c6d4>¨ idle_balance+0x1a0/0x340 
14640.413010¨  <000000000063303e>¨ __schedule+0xa22/0xaf0 
14640.428279¨  <0000000000630da6>¨ schedule_timeout+0x186/0x2c0 
14640.428289¨  <00000000001cf864>¨ rcu_gp_kthread+0x1bc/0x298 
14640.428300¨  <0000000000158c5a>¨ kthread+0xe6/0xec 
14640.428304¨  <0000000000634de6>¨ kernel_thread_starter+0x6/0xc 
14640.428308¨  <0000000000634de0>¨ kernel_thread_starter+0x0/0xc 
14640.428311¨ Last Breaking-Event-Address: 
14640.428314¨  <000000000016bd76>¨ walk_tg_tree_from+0x3a/0xf4 
14640.428319¨  list_add corruption. next->prev should be prev (0000000000000918 
), but was           (null). (next=          (null)). 
14640.508240¨ ------------�  cut here ¨------------ 
14640.508256¨ WARNING: at lib/list_debug.c:29 
14640.508259¨ Modules linked in: binfmt_misc(F) tun(F) ipt_ULOG(F) af_802154(F) 
 scsi_transport_iscsi(F) af_iucv(F) af_key(F) nfc(F) rds(F) atm(F) btrfs(F) zlib 
_deflate(F) raid6_pq(F) xor(F) vfat(F) fat(F) nfsv3(F) nfs_acl(F) nfs(F) lockd(F 
) sunrpc(F) fscache(F) nfnetlink_log(F) nfnetlink(F) arc4(F) md4(F) nls_utf8(F) 
cifs(F) dns_resolver(F) nf_tproxy_core(F) nls_koi8_u(F) nls_cp932(F) ts_kmp(F) s 
ctp(F) fuse(F) nf_conntrack_netbios_ns(F) nf_conntrack_broadcast(F) ipt_MASQUERA 
DE(F) ip6table_nat(F) nf_nat_ipv6(F) ip6table_mangle(F) ip6t_REJECT(F) nf_conntr 
ack_ipv6(F) nf_defrag_ipv6(F) iptable_nat(F) nf_nat_ipv4(F) nf_nat(F) iptable_ma 
ngle(F) ipt_REJECT(F) nf_conntrack_ipv4(F) nf_defrag_ipv4(F) xt_conntrack(F) nf_ 
conntrack(F) ebtable_filter(F) ebtables(F) ip6table_filter(F) ip6_tables(F) ipta 
ble_filter(F) ip_tables(F) sg(F) qeth_l2(F) vmur(F) xfs(F) libcrc32c(F) dasd_fba 
_mod(F) dasd_eckd_mod(F) dasd_mod(F) qeth(F) qdio(F) lcs(F) ctcm(F) ccwgroup(F) 
fsm(F) dm_mirror(F) dm_region_hash(F) dm_log(F) dm_mod(F) � last unloaded: brd¨ 
14640.508358¨ CPU: 0 Tainted: GF     D      3.9.2 #1 
14640.508361¨ Process rcu_sched (pid: 10, task: 000000003ffc60c0, ksp: 00000000 
3ffe3ba0) 
14640.508363¨ Krnl PSW : 0404c00180000000 0000000000426544 (__list_add+0xd0/0xd 
4) 
14640.508378¨            R:0 T:1 IO:0 EX:0 Key:0 M:1 W:0 P:0 AS:3 CC:0 PM:0 EA: 
3 
Krnl GPRS: 0000000000000002 0400000000a0a500 0000000000000075 0000000000000000 
14640.508383¨            0000000000426540 00000000026af440 000000003da430a8 000 
0000000915c90 
14640.508385¨            00000000009cd800 00000000026b2800 0000000000915c90 000 
0000000000000 
14640.508388¨            0000000000000918 000000003da430d0 000000000 
14640.516392¨  � <0000000000634de6>¨ kernel_thread_starter+0x6/0xc 
14640.516395¨  � <0000000000634de0>¨ kernel_thread_starter+0x0/0xc 
14640.516398¨ Last Breaking-Event-Address: 
14640.516400¨  � <0000000000426544>¨ __list_add+0xd0/0xd4 
14640.516402¨ 
14640.516544¨ Kernel panic - not syncing: Fatal exception in interrupt 
00: HCPGIR450W CP entered; disabled wait PSW 00020001 80000000 00000000 0010F2AC

Sometimes, this pops up,
[16907.275002] WARNING: at kernel/rcutree.c:1960 

or this,
15316.154171¨ XFS (dm-1): Mounting Filesystem 
15316.255796¨ XFS (dm-1): Ending clean mount 
15320.364246¨            00000000006367a2: e310b0080004        lg      %r1,8(%r 
11) 
15320.364249¨            00000000006367a8: 41101010            la      %r1,16(% 
r1) 
15320.364251¨            00000000006367ac: e33010000004        lg      %r3,0(%r 
1) 
15320.364252¨ Call Trace: 
15320.364252¨ Last Breaking-Event-Address: 
15320.364253¨  � <0000000000000000>¨ Kernel stack overflow. 
15320.364308¨ CPU: 0 Tainted: GF       W    3.9.2 #1 
15320.364309¨ Process rhts-test-runne (pid: 625, task: 000000003dccc890, ksp: 0 
000000037433c78) 
15320.364310¨ Krnl PSW : 0404c00180000000 0000000000636796 (do_dat_exception+0x 
26/0x36c) 
15320.364315¨            R:0 T:1 IO:0 EX:0 Key:0 M:1 W:0 P:0 AS:3 CC:0 PM:0 EA: 
3 
15320.364316¨ Krnl GPRS: e500000000000000 0000000000014138 ffffffffffffffbf 000 
0000000901110 
15320.364317¨            00000000001a9fe6 0000000000124438 0000000000020001 000 
0000000000001 
15320.364318¨            00000000000141d8 00000000001a9fe6 0000000000000044 000 
00000000141d8 
0000000014028 
15320.366500¨ Krnl Code: 000000000063678a: b9040082            lgr     %r8,%r2 
15320.366502¨            000000000063678e: a729ffbf            lghi    %r2,-65 
15320.366504¨           #0000000000636792: a7840001            brc     8,636794 
 
15320.366507¨           >0000000000636796: e310f0980024        stg     %r1,152( 
%r15) 
15320.366509¨            000000000063679c: e3b003100004        lg      %r11,784 
 
15320.366512¨            00000000006367a2: e310b0080004        lg      %r1,8(%r 
11) 
15320.366515¨            00000000006367a8: 41101010            la      %r1,16(% 
r1) 
15320.366518¨            00000000006367ac: e33010000004        lg      %r3,0(%r 
1) 
15320.366518¨ Call Trace: 
15320.366518¨ Last Breaking-Event-Address: 
15320.366519¨  � <0000000000000000>¨ Kernel stack overflow. 
15320.366541¨ CPU: 0 Tainted: GF       W    3.9.2 #1 
15320.366542¨ Process rhts-test-runne (pid: 625, task: 000000003dccc890, ksp: 0 
000000037433c78) 
15320.366543¨ Krnl PSW : 0404c00180000000 0000000000636796 (do_dat_exception+0x 
26/0x36c) 
15320.366546¨            R:0 T:1 IO:0 EX:0 Key:0 M:1 W:0 P:0 AS:3 CC:0 PM:0 EA: 
3 
15320.366547¨ Krnl GPRS: e500000000000000 0000000000014138 ffffffffffffffbf 000 
0000000901110 
15320.366548¨            00000000001a9fe6 0000000000124438 0000000000020001 000 
0000000000001 
15320.366549¨            00000000000141d8 00000000001a9fe6 0000000000000044 000 
00000000141d8 
15320.366550¨            0000000037430000 000000000063df78 0000000000634f24 000 
0000000014028 
15320.366557¨ Krnl Code: 000000000063678a: b9040082            lgr     %r8,%r2 
15320.366560¨            000000000063678e: a729ffbf 

CAI Qian

^ permalink raw reply	[flat|nested] 16+ messages in thread

* 3.9.2: xfstests triggered panic
@ 2013-05-22  8:39   ` CAI Qian
  0 siblings, 0 replies; 16+ messages in thread
From: CAI Qian @ 2013-05-22  8:39 UTC (permalink / raw)
  To: LKML; +Cc: stable, xfs

Reproduced on almost all s390x guests by running xfstests.

14634.396658¨ XFS (dm-1): Mounting Filesystem 
14634.525522¨ XFS (dm-1): Ending clean mount 
14640.413007¨  <000000000017c6d4>¨ idle_balance+0x1a0/0x340 
14640.413010¨  <000000000063303e>¨ __schedule+0xa22/0xaf0 
14640.428279¨  <0000000000630da6>¨ schedule_timeout+0x186/0x2c0 
14640.428289¨  <00000000001cf864>¨ rcu_gp_kthread+0x1bc/0x298 
14640.428300¨  <0000000000158c5a>¨ kthread+0xe6/0xec 
14640.428304¨  <0000000000634de6>¨ kernel_thread_starter+0x6/0xc 
14640.428308¨  <0000000000634de0>¨ kernel_thread_starter+0x0/0xc 
14640.428311¨ Last Breaking-Event-Address: 
14640.428314¨  <000000000016bd76>¨ walk_tg_tree_from+0x3a/0xf4 
14640.428319¨  list_add corruption. next->prev should be prev (0000000000000918 
), but was           (null). (next=          (null)). 
14640.508240¨ ------------�  cut here ¨------------ 
14640.508256¨ WARNING: at lib/list_debug.c:29 
14640.508259¨ Modules linked in: binfmt_misc(F) tun(F) ipt_ULOG(F) af_802154(F) 
 scsi_transport_iscsi(F) af_iucv(F) af_key(F) nfc(F) rds(F) atm(F) btrfs(F) zlib 
_deflate(F) raid6_pq(F) xor(F) vfat(F) fat(F) nfsv3(F) nfs_acl(F) nfs(F) lockd(F 
) sunrpc(F) fscache(F) nfnetlink_log(F) nfnetlink(F) arc4(F) md4(F) nls_utf8(F) 
cifs(F) dns_resolver(F) nf_tproxy_core(F) nls_koi8_u(F) nls_cp932(F) ts_kmp(F) s 
ctp(F) fuse(F) nf_conntrack_netbios_ns(F) nf_conntrack_broadcast(F) ipt_MASQUERA 
DE(F) ip6table_nat(F) nf_nat_ipv6(F) ip6table_mangle(F) ip6t_REJECT(F) nf_conntr 
ack_ipv6(F) nf_defrag_ipv6(F) iptable_nat(F) nf_nat_ipv4(F) nf_nat(F) iptable_ma 
ngle(F) ipt_REJECT(F) nf_conntrack_ipv4(F) nf_defrag_ipv4(F) xt_conntrack(F) nf_ 
conntrack(F) ebtable_filter(F) ebtables(F) ip6table_filter(F) ip6_tables(F) ipta 
ble_filter(F) ip_tables(F) sg(F) qeth_l2(F) vmur(F) xfs(F) libcrc32c(F) dasd_fba 
_mod(F) dasd_eckd_mod(F) dasd_mod(F) qeth(F) qdio(F) lcs(F) ctcm(F) ccwgroup(F) 
fsm(F) dm_mirror(F) dm_region_hash(F) dm_log(F) dm_mod(F) � last unloaded: brd¨ 
14640.508358¨ CPU: 0 Tainted: GF     D      3.9.2 #1 
14640.508361¨ Process rcu_sched (pid: 10, task: 000000003ffc60c0, ksp: 00000000 
3ffe3ba0) 
14640.508363¨ Krnl PSW : 0404c00180000000 0000000000426544 (__list_add+0xd0/0xd 
4) 
14640.508378¨            R:0 T:1 IO:0 EX:0 Key:0 M:1 W:0 P:0 AS:3 CC:0 PM:0 EA: 
3 
Krnl GPRS: 0000000000000002 0400000000a0a500 0000000000000075 0000000000000000 
14640.508383¨            0000000000426540 00000000026af440 000000003da430a8 000 
0000000915c90 
14640.508385¨            00000000009cd800 00000000026b2800 0000000000915c90 000 
0000000000000 
14640.508388¨            0000000000000918 000000003da430d0 000000000 
14640.516392¨  � <0000000000634de6>¨ kernel_thread_starter+0x6/0xc 
14640.516395¨  � <0000000000634de0>¨ kernel_thread_starter+0x0/0xc 
14640.516398¨ Last Breaking-Event-Address: 
14640.516400¨  � <0000000000426544>¨ __list_add+0xd0/0xd4 
14640.516402¨ 
14640.516544¨ Kernel panic - not syncing: Fatal exception in interrupt 
00: HCPGIR450W CP entered; disabled wait PSW 00020001 80000000 00000000 0010F2AC

Sometimes, this pops up,
[16907.275002] WARNING: at kernel/rcutree.c:1960 

or this,
15316.154171¨ XFS (dm-1): Mounting Filesystem 
15316.255796¨ XFS (dm-1): Ending clean mount 
15320.364246¨            00000000006367a2: e310b0080004        lg      %r1,8(%r 
11) 
15320.364249¨            00000000006367a8: 41101010            la      %r1,16(% 
r1) 
15320.364251¨            00000000006367ac: e33010000004        lg      %r3,0(%r 
1) 
15320.364252¨ Call Trace: 
15320.364252¨ Last Breaking-Event-Address: 
15320.364253¨  � <0000000000000000>¨ Kernel stack overflow. 
15320.364308¨ CPU: 0 Tainted: GF       W    3.9.2 #1 
15320.364309¨ Process rhts-test-runne (pid: 625, task: 000000003dccc890, ksp: 0 
000000037433c78) 
15320.364310¨ Krnl PSW : 0404c00180000000 0000000000636796 (do_dat_exception+0x 
26/0x36c) 
15320.364315¨            R:0 T:1 IO:0 EX:0 Key:0 M:1 W:0 P:0 AS:3 CC:0 PM:0 EA: 
3 
15320.364316¨ Krnl GPRS: e500000000000000 0000000000014138 ffffffffffffffbf 000 
0000000901110 
15320.364317¨            00000000001a9fe6 0000000000124438 0000000000020001 000 
0000000000001 
15320.364318¨            00000000000141d8 00000000001a9fe6 0000000000000044 000 
00000000141d8 
0000000014028 
15320.366500¨ Krnl Code: 000000000063678a: b9040082            lgr     %r8,%r2 
15320.366502¨            000000000063678e: a729ffbf            lghi    %r2,-65 
15320.366504¨           #0000000000636792: a7840001            brc     8,636794 
 
15320.366507¨           >0000000000636796: e310f0980024        stg     %r1,152( 
%r15) 
15320.366509¨            000000000063679c: e3b003100004        lg      %r11,784 
 
15320.366512¨            00000000006367a2: e310b0080004        lg      %r1,8(%r 
11) 
15320.366515¨            00000000006367a8: 41101010            la      %r1,16(% 
r1) 
15320.366518¨            00000000006367ac: e33010000004        lg      %r3,0(%r 
1) 
15320.366518¨ Call Trace: 
15320.366518¨ Last Breaking-Event-Address: 
15320.366519¨  � <0000000000000000>¨ Kernel stack overflow. 
15320.366541¨ CPU: 0 Tainted: GF       W    3.9.2 #1 
15320.366542¨ Process rhts-test-runne (pid: 625, task: 000000003dccc890, ksp: 0 
000000037433c78) 
15320.366543¨ Krnl PSW : 0404c00180000000 0000000000636796 (do_dat_exception+0x 
26/0x36c) 
15320.366546¨            R:0 T:1 IO:0 EX:0 Key:0 M:1 W:0 P:0 AS:3 CC:0 PM:0 EA: 
3 
15320.366547¨ Krnl GPRS: e500000000000000 0000000000014138 ffffffffffffffbf 000 
0000000901110 
15320.366548¨            00000000001a9fe6 0000000000124438 0000000000020001 000 
0000000000001 
15320.366549¨            00000000000141d8 00000000001a9fe6 0000000000000044 000 
00000000141d8 
15320.366550¨            0000000037430000 000000000063df78 0000000000634f24 000 
0000000014028 
15320.366557¨ Krnl Code: 000000000063678a: b9040082            lgr     %r8,%r2 
15320.366560¨            000000000063678e: a729ffbf 

CAI Qian

_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: 3.9.2: xfstests triggered panic
  2013-05-22  8:39   ` CAI Qian
@ 2013-05-22  9:53     ` Dave Chinner
  -1 siblings, 0 replies; 16+ messages in thread
From: Dave Chinner @ 2013-05-22  9:53 UTC (permalink / raw)
  To: CAI Qian; +Cc: LKML, stable, xfs

On Wed, May 22, 2013 at 04:39:58AM -0400, CAI Qian wrote:
> Reproduced on almost all s390x guests by running xfstests.
> 
> 14634.396658¨ XFS (dm-1): Mounting Filesystem 
> 14634.525522¨ XFS (dm-1): Ending clean mount 
> 14640.413007¨  <000000000017c6d4>¨ idle_balance+0x1a0/0x340 
> 14640.413010¨  <000000000063303e>¨ __schedule+0xa22/0xaf0 
> 14640.428279¨  <0000000000630da6>¨ schedule_timeout+0x186/0x2c0 
> 14640.428289¨  <00000000001cf864>¨ rcu_gp_kthread+0x1bc/0x298 
> 14640.428300¨  <0000000000158c5a>¨ kthread+0xe6/0xec 
> 14640.428304¨  <0000000000634de6>¨ kernel_thread_starter+0x6/0xc 
> 14640.428308¨  <0000000000634de0>¨ kernel_thread_starter+0x0/0xc 
> 14640.428311¨ Last Breaking-Event-Address: 
> 14640.428314¨  <000000000016bd76>¨ walk_tg_tree_from+0x3a/0xf4 
> 14640.428319¨  list_add corruption. next->prev should be prev (0000000000000918 
> ), but was           (null). (next=          (null)). 

Where's XFS in this? walk_tg_tree_from() is part of the scheduler
code. This kind of implies a stack corruption....

> Sometimes, this pops up,
> [16907.275002] WARNING: at kernel/rcutree.c:1960 
> 
> or this,
> 15316.154171¨ XFS (dm-1): Mounting Filesystem 
> 15316.255796¨ XFS (dm-1): Ending clean mount 
> 15320.364246¨            00000000006367a2: e310b0080004        lg      %r1,8(%r 
> 11) 
> 15320.364249¨            00000000006367a8: 41101010            la      %r1,16(% 
> r1) 
> 15320.364251¨            00000000006367ac: e33010000004        lg      %r3,0(%r 
> 1) 
> 15320.364252¨ Call Trace: 
> 15320.364252¨ Last Breaking-Event-Address: 
> 15320.364253¨  � <0000000000000000>¨ Kernel stack overflow. 
> 15320.364308¨ CPU: 0 Tainted: GF       W    3.9.2 #1 
> 15320.364309¨ Process rhts-test-runne (pid: 625, task: 000000003dccc890, ksp: 0 

.... and there you go - a stack overflow. Your kernel stack size is
too small.

I'd suggest that you need 16k stacks on s390 - IIRC every function
call has 128 byte stack frame, and there are call chains 70-80
functions deep in the storage stack...

Cheers,

Dave.
-- 
Dave Chinner
david@fromorbit.com

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: 3.9.2: xfstests triggered panic
@ 2013-05-22  9:53     ` Dave Chinner
  0 siblings, 0 replies; 16+ messages in thread
From: Dave Chinner @ 2013-05-22  9:53 UTC (permalink / raw)
  To: CAI Qian; +Cc: LKML, stable, xfs

On Wed, May 22, 2013 at 04:39:58AM -0400, CAI Qian wrote:
> Reproduced on almost all s390x guests by running xfstests.
> 
> 14634.396658¨ XFS (dm-1): Mounting Filesystem 
> 14634.525522¨ XFS (dm-1): Ending clean mount 
> 14640.413007¨  <000000000017c6d4>¨ idle_balance+0x1a0/0x340 
> 14640.413010¨  <000000000063303e>¨ __schedule+0xa22/0xaf0 
> 14640.428279¨  <0000000000630da6>¨ schedule_timeout+0x186/0x2c0 
> 14640.428289¨  <00000000001cf864>¨ rcu_gp_kthread+0x1bc/0x298 
> 14640.428300¨  <0000000000158c5a>¨ kthread+0xe6/0xec 
> 14640.428304¨  <0000000000634de6>¨ kernel_thread_starter+0x6/0xc 
> 14640.428308¨  <0000000000634de0>¨ kernel_thread_starter+0x0/0xc 
> 14640.428311¨ Last Breaking-Event-Address: 
> 14640.428314¨  <000000000016bd76>¨ walk_tg_tree_from+0x3a/0xf4 
> 14640.428319¨  list_add corruption. next->prev should be prev (0000000000000918 
> ), but was           (null). (next=          (null)). 

Where's XFS in this? walk_tg_tree_from() is part of the scheduler
code. This kind of implies a stack corruption....

> Sometimes, this pops up,
> [16907.275002] WARNING: at kernel/rcutree.c:1960 
> 
> or this,
> 15316.154171¨ XFS (dm-1): Mounting Filesystem 
> 15316.255796¨ XFS (dm-1): Ending clean mount 
> 15320.364246¨            00000000006367a2: e310b0080004        lg      %r1,8(%r 
> 11) 
> 15320.364249¨            00000000006367a8: 41101010            la      %r1,16(% 
> r1) 
> 15320.364251¨            00000000006367ac: e33010000004        lg      %r3,0(%r 
> 1) 
> 15320.364252¨ Call Trace: 
> 15320.364252¨ Last Breaking-Event-Address: 
> 15320.364253¨  � <0000000000000000>¨ Kernel stack overflow. 
> 15320.364308¨ CPU: 0 Tainted: GF       W    3.9.2 #1 
> 15320.364309¨ Process rhts-test-runne (pid: 625, task: 000000003dccc890, ksp: 0 

.... and there you go - a stack overflow. Your kernel stack size is
too small.

I'd suggest that you need 16k stacks on s390 - IIRC every function
call has 128 byte stack frame, and there are call chains 70-80
functions deep in the storage stack...

Cheers,

Dave.
-- 
Dave Chinner
david@fromorbit.com

_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: 3.9.2: xfstests triggered panic
  2013-05-22  9:53     ` Dave Chinner
@ 2013-05-23  3:16       ` CAI Qian
  -1 siblings, 0 replies; 16+ messages in thread
From: CAI Qian @ 2013-05-23  3:16 UTC (permalink / raw)
  To: Dave Chinner; +Cc: LKML, stable, xfs



----- Original Message -----
> From: "Dave Chinner" <david@fromorbit.com>
> To: "CAI Qian" <caiqian@redhat.com>
> Cc: "LKML" <linux-kernel@vger.kernel.org>, stable@vger.kernel.org, xfs@oss.sgi.com
> Sent: Wednesday, May 22, 2013 5:53:00 PM
> Subject: Re: 3.9.2: xfstests triggered panic
> 
> On Wed, May 22, 2013 at 04:39:58AM -0400, CAI Qian wrote:
> > Reproduced on almost all s390x guests by running xfstests.
> > 
> > 14634.396658¨ XFS (dm-1): Mounting Filesystem
> > 14634.525522¨ XFS (dm-1): Ending clean mount
> > 14640.413007¨  <000000000017c6d4>¨ idle_balance+0x1a0/0x340
> > 14640.413010¨  <000000000063303e>¨ __schedule+0xa22/0xaf0
> > 14640.428279¨  <0000000000630da6>¨ schedule_timeout+0x186/0x2c0
> > 14640.428289¨  <00000000001cf864>¨ rcu_gp_kthread+0x1bc/0x298
> > 14640.428300¨  <0000000000158c5a>¨ kthread+0xe6/0xec
> > 14640.428304¨  <0000000000634de6>¨ kernel_thread_starter+0x6/0xc
> > 14640.428308¨  <0000000000634de0>¨ kernel_thread_starter+0x0/0xc
> > 14640.428311¨ Last Breaking-Event-Address:
> > 14640.428314¨  <000000000016bd76>¨ walk_tg_tree_from+0x3a/0xf4
> > 14640.428319¨  list_add corruption. next->prev should be prev
> > (0000000000000918
> > ), but was           (null). (next=          (null)).
> 
> Where's XFS in this? walk_tg_tree_from() is part of the scheduler
> code. This kind of implies a stack corruption....
> 
> > Sometimes, this pops up,
> > [16907.275002] WARNING: at kernel/rcutree.c:1960
> > 
> > or this,
> > 15316.154171¨ XFS (dm-1): Mounting Filesystem
> > 15316.255796¨ XFS (dm-1): Ending clean mount
> > 15320.364246¨            00000000006367a2: e310b0080004        lg
> > %r1,8(%r
> > 11)
> > 15320.364249¨            00000000006367a8: 41101010            la
> > %r1,16(%
> > r1)
> > 15320.364251¨            00000000006367ac: e33010000004        lg
> > %r3,0(%r
> > 1)
> > 15320.364252¨ Call Trace:
> > 15320.364252¨ Last Breaking-Event-Address:
> > 15320.364253¨  � <0000000000000000>¨ Kernel stack overflow.
> > 15320.364308¨ CPU: 0 Tainted: GF       W    3.9.2 #1
> > 15320.364309¨ Process rhts-test-runne (pid: 625, task: 000000003dccc890,
> > ksp: 0
> 
> .... and there you go - a stack overflow. Your kernel stack size is
> too small.
> 
> I'd suggest that you need 16k stacks on s390 - IIRC every function
> call has 128 byte stack frame, and there are call chains 70-80
> functions deep in the storage stack...
Hmm, I am unsure how to set to 16k stack there, and power 7 has looks
like has the same problem.

[14927.117017] XFS (dm-0): Mounting Filesystem 
[14927.299854] XFS (dm-0): Ending clean mount 
[14927.668909] Unable to handle kernel paging request for data at address 0x00000040 
[14927.668913] Unable to handle kernel paging request for data at address 0x000000f8 
[14927.668914] Unable to handle kernel paging request for data at address 0x000000bb 
[14927.668915] Faulting instruction address: 0xc0000000000d1bd8 
[14927.668916] Faulting instruction address: 0xc0000000000d1bd8 
[14927.668919] Unable to handle kernel paging request for data at address 0x00000018 
[14927.668920] Faulting instruction address: 0xc0000000003d34b8 
[14927.668922] Oops: Kernel access of bad area, sig: 11 [#1] 
[14927.668924] SMP NR_CPUS=1024 NUMA pSeries 
[14927.668927] Modules linked in: binfmt_misc(F) tun(F) ipt_ULOG(F) rds(F) scsi_transport_iscsi(F) atm(F) nfc(F) pppoe(F) pppox(F) ppp_generic(F) slhc(F) af_802154(F) af_key(F) sctp(F) btrfs(F) raid6_pq(F) xor(F) vfat(F) fat(F) nfsv3(F) nfs_acl(F) nfs(F) lockd(F) sunrpc(F) fscache(F) nfnetlink_log(F) nfnetlink(F) bluetooth(F) rfkill(F) arc4(F) md4(F) nls_utf8(F) cifs(F) dns_resolver(F) nf_tproxy_core(F) nls_koi8_u(F) nls_cp932(F) ts_kmp(F)[14927.668955] Faulting instruction address: 0xc0000000000d1bd8 
 fuse(F) nf_conntrack_netbios_ns(F) nf_conntrack_broadcast(F) ipt_MASQUERADE(F) ip6table_nat(F) nf_nat_ipv6(F) ip6table_mangle(F) ip6t_REJECT(F) nf_conntrack_ipv6(F) nf_defrag_ipv6(F) iptable_nat(F) nf_nat_ipv4(F) nf_nat(F) iptable_mangle(F) ipt_REJECT(F) nf_conntrack_ipv4(F) nf_defrag_ipv4(F) xt_conntrack(F) nf_conntrack(F) ebtable_filter(F) ebtables(F) ip6table_filter(F) ip6_tables(F) iptable_filter(F) ip_tables(F) sg(F) ehea(F) xfs(F) libcrc32c(F) sd_mod(F) crc_t10dif(F) ibmvscsi(F) scsi_transport_srp(F) scsi_tgt(F) dm_mirror(F) dm_region_hash(F) dm_log(F) dm_mod(F) [last unloaded: brd] 
[14927.669041] NIP: c0000000000d1bd8 LR: c0000000000d1b94 CTR: c0000000000d7e30 
[14927.669048] REGS: c0000001fbfb3120 TRAP: 0300   Tainted: GF             (3.9.3) 
[14927.669053] MSR: 8000000000009032 <SF,EE,ME,IR,DR,RI>  CR: 28000028  XER: 00000000 
[14927.669069] SOFTE: 0 
[14927.669072] CFAR: c00000000000908c 
[14927.669076] DAR: 00000000000000f8, DSISR: 40000000 
[14927.669080] TASK = c0000001fbf14880[0] 'swapper/2' THREAD: c0000001fbfb0000 CPU: 2 
GPR00: c0000000000d1b94 c0000001fbfb33a0 c0000000010f3038 00000d939e66add6  
GPR04: 0000000000000000 00000001001651f2 0000000000000099 c000000000af3038  
GPR08: c000000001163038 0000000000000002 00000000000000b8 000c3420953d115d  
GPR12: 0000000048000022 c00000000ed90800 c0000001fbfb3f90 000000000eee7bc0  
GPR16: 0000000010200040 00000001001651f2 c000000001152100 0000000000000000  
GPR20: c000000000af3f80 c000000001152180 0000000000000000 0000000000000000  
GPR24: c0000000007801e8 0000000000000001 0000000000200200 c0000000015550d0  
GPR28: c000000001554880 0000000000000000 c0000001f5564200 0000000000000000  
[14927.669159] NIP [c0000000000d1bd8] .update_blocked_averages+0xc8/0x5c0 
[14927.669165] LR [c0000000000d1b94] .update_blocked_averages+0x84/0x5c0 
[14927.669170] Call Trace: 
[14927.669174] [c0000001fbfb33a0] [c0000000000d1b94] .update_blocked_averages+0x84/0x5c0 (unreliable) 
[14927.669183] [c0000001fbfb3490] [c0000000000d7c54] .rebalance_domains+0x84/0x260 
[14927.669190] [c0000001fbfb3570] [c0000000000d7eb4] .run_rebalance_domains+0x84/0x230 
[14927.669198] [c0000001fbfb3650] [c000000000091228] .__do_softirq+0x148/0x310 
[14927.669205] [c0000001fbfb3740] [c000000000091608] .irq_exit+0xc8/0xe0 
[14927.669212] [c0000001fbfb37c0] [c00000000001d214] .timer_interrupt+0x154/0x2e0 
[14927.669220] [c0000001fbfb3870] [c0000000000024d4] decrementer_common+0x154/0x180 
[14927.669230] --- Exception: 901 at .plpar_hcall_norets+0x84/0xd4 
[14927.669230]     LR = .check_and_cede_processor+0x24/0x40 
[14927.669240] [c0000001fbfb3b60] [0000000000000001] 0x1 (unreliable) 
[14927.669247] [c0000001fbfb3bd0] [c00000000006d070] .shared_cede_loop+0x50/0xe0 
[14927.669256] [c0000001fbfb3c90] [c0000000005b818c] .cpuidle_enter+0x2c/0x40 
[14927.669263] [c0000001fbfb3d00] [c0000000005b8ad0] .cpuidle_idle_call+0xf0/0x300 
[14927.669270] [c0000001fbfb3db0] [c00000000005dab0] .pSeries_idle+0x10/0x40 
[14927.669278] [c0000001fbfb3e20] [c0000000000171b8] .cpu_idle+0x158/0x2a0 
[14927.669285] [c0000001fbfb3ed0] [c00000000074c030] .start_secondary+0x3a4/0x3ac 
[14927.669293] [c0000001fbfb3f90] [c00000000000976c] .start_secondary_prolog+0x10/0x14 
[14927.669299] Instruction dump: 
[14927.669303] 7fbbf040 3bdeff50 419e01f0 3f400020 3f02ff69 3ae00000 3ac00000 3b18d1b0  
[14927.669314] 635a0200 60000000 e93c0912 e95e00c0 <e90a0040> e94a0048 79291f24 7fe8482a  
[14927.669334] ---[ end trace ac4936baffc8b47b ]--- 
[14927.671261]  
[14927.671266] Oops: Kernel access of bad area, sig: 11 [#2] 
[14927.671272] SMP NR_CPUS=1024 NUMA pSeries 

CAI Qian
> 
> Cheers,
> 
> Dave.
> --
> Dave Chinner
> david@fromorbit.com
> --
> To unsubscribe from this list: send the line "unsubscribe stable" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
> 

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: 3.9.2: xfstests triggered panic
@ 2013-05-23  3:16       ` CAI Qian
  0 siblings, 0 replies; 16+ messages in thread
From: CAI Qian @ 2013-05-23  3:16 UTC (permalink / raw)
  To: Dave Chinner; +Cc: LKML, stable, xfs



----- Original Message -----
> From: "Dave Chinner" <david@fromorbit.com>
> To: "CAI Qian" <caiqian@redhat.com>
> Cc: "LKML" <linux-kernel@vger.kernel.org>, stable@vger.kernel.org, xfs@oss.sgi.com
> Sent: Wednesday, May 22, 2013 5:53:00 PM
> Subject: Re: 3.9.2: xfstests triggered panic
> 
> On Wed, May 22, 2013 at 04:39:58AM -0400, CAI Qian wrote:
> > Reproduced on almost all s390x guests by running xfstests.
> > 
> > 14634.396658¨ XFS (dm-1): Mounting Filesystem
> > 14634.525522¨ XFS (dm-1): Ending clean mount
> > 14640.413007¨  <000000000017c6d4>¨ idle_balance+0x1a0/0x340
> > 14640.413010¨  <000000000063303e>¨ __schedule+0xa22/0xaf0
> > 14640.428279¨  <0000000000630da6>¨ schedule_timeout+0x186/0x2c0
> > 14640.428289¨  <00000000001cf864>¨ rcu_gp_kthread+0x1bc/0x298
> > 14640.428300¨  <0000000000158c5a>¨ kthread+0xe6/0xec
> > 14640.428304¨  <0000000000634de6>¨ kernel_thread_starter+0x6/0xc
> > 14640.428308¨  <0000000000634de0>¨ kernel_thread_starter+0x0/0xc
> > 14640.428311¨ Last Breaking-Event-Address:
> > 14640.428314¨  <000000000016bd76>¨ walk_tg_tree_from+0x3a/0xf4
> > 14640.428319¨  list_add corruption. next->prev should be prev
> > (0000000000000918
> > ), but was           (null). (next=          (null)).
> 
> Where's XFS in this? walk_tg_tree_from() is part of the scheduler
> code. This kind of implies a stack corruption....
> 
> > Sometimes, this pops up,
> > [16907.275002] WARNING: at kernel/rcutree.c:1960
> > 
> > or this,
> > 15316.154171¨ XFS (dm-1): Mounting Filesystem
> > 15316.255796¨ XFS (dm-1): Ending clean mount
> > 15320.364246¨            00000000006367a2: e310b0080004        lg
> > %r1,8(%r
> > 11)
> > 15320.364249¨            00000000006367a8: 41101010            la
> > %r1,16(%
> > r1)
> > 15320.364251¨            00000000006367ac: e33010000004        lg
> > %r3,0(%r
> > 1)
> > 15320.364252¨ Call Trace:
> > 15320.364252¨ Last Breaking-Event-Address:
> > 15320.364253¨  � <0000000000000000>¨ Kernel stack overflow.
> > 15320.364308¨ CPU: 0 Tainted: GF       W    3.9.2 #1
> > 15320.364309¨ Process rhts-test-runne (pid: 625, task: 000000003dccc890,
> > ksp: 0
> 
> .... and there you go - a stack overflow. Your kernel stack size is
> too small.
> 
> I'd suggest that you need 16k stacks on s390 - IIRC every function
> call has 128 byte stack frame, and there are call chains 70-80
> functions deep in the storage stack...
Hmm, I am unsure how to set to 16k stack there, and power 7 has looks
like has the same problem.

[14927.117017] XFS (dm-0): Mounting Filesystem 
[14927.299854] XFS (dm-0): Ending clean mount 
[14927.668909] Unable to handle kernel paging request for data at address 0x00000040 
[14927.668913] Unable to handle kernel paging request for data at address 0x000000f8 
[14927.668914] Unable to handle kernel paging request for data at address 0x000000bb 
[14927.668915] Faulting instruction address: 0xc0000000000d1bd8 
[14927.668916] Faulting instruction address: 0xc0000000000d1bd8 
[14927.668919] Unable to handle kernel paging request for data at address 0x00000018 
[14927.668920] Faulting instruction address: 0xc0000000003d34b8 
[14927.668922] Oops: Kernel access of bad area, sig: 11 [#1] 
[14927.668924] SMP NR_CPUS=1024 NUMA pSeries 
[14927.668927] Modules linked in: binfmt_misc(F) tun(F) ipt_ULOG(F) rds(F) scsi_transport_iscsi(F) atm(F) nfc(F) pppoe(F) pppox(F) ppp_generic(F) slhc(F) af_802154(F) af_key(F) sctp(F) btrfs(F) raid6_pq(F) xor(F) vfat(F) fat(F) nfsv3(F) nfs_acl(F) nfs(F) lockd(F) sunrpc(F) fscache(F) nfnetlink_log(F) nfnetlink(F) bluetooth(F) rfkill(F) arc4(F) md4(F) nls_utf8(F) cifs(F) dns_resolver(F) nf_tproxy_core(F) nls_koi8_u(F) nls_cp932(F) ts_kmp(F)[14927.668955] Faulting instruction address: 0xc0000000000d1bd8 
 fuse(F) nf_conntrack_netbios_ns(F) nf_conntrack_broadcast(F) ipt_MASQUERADE(F) ip6table_nat(F) nf_nat_ipv6(F) ip6table_mangle(F) ip6t_REJECT(F) nf_conntrack_ipv6(F) nf_defrag_ipv6(F) iptable_nat(F) nf_nat_ipv4(F) nf_nat(F) iptable_mangle(F) ipt_REJECT(F) nf_conntrack_ipv4(F) nf_defrag_ipv4(F) xt_conntrack(F) nf_conntrack(F) ebtable_filter(F) ebtables(F) ip6table_filter(F) ip6_tables(F) iptable_filter(F) ip_tables(F) sg(F) ehea(F) xfs(F) libcrc32c(F) sd_mod(F) crc_t10dif(F) ibmvscsi(F) scsi_transport_srp(F) scsi_tgt(F) dm_mirror(F) dm_region_hash(F) dm_log(F) dm_mod(F) [last unloaded: brd] 
[14927.669041] NIP: c0000000000d1bd8 LR: c0000000000d1b94 CTR: c0000000000d7e30 
[14927.669048] REGS: c0000001fbfb3120 TRAP: 0300   Tainted: GF             (3.9.3) 
[14927.669053] MSR: 8000000000009032 <SF,EE,ME,IR,DR,RI>  CR: 28000028  XER: 00000000 
[14927.669069] SOFTE: 0 
[14927.669072] CFAR: c00000000000908c 
[14927.669076] DAR: 00000000000000f8, DSISR: 40000000 
[14927.669080] TASK = c0000001fbf14880[0] 'swapper/2' THREAD: c0000001fbfb0000 CPU: 2 
GPR00: c0000000000d1b94 c0000001fbfb33a0 c0000000010f3038 00000d939e66add6  
GPR04: 0000000000000000 00000001001651f2 0000000000000099 c000000000af3038  
GPR08: c000000001163038 0000000000000002 00000000000000b8 000c3420953d115d  
GPR12: 0000000048000022 c00000000ed90800 c0000001fbfb3f90 000000000eee7bc0  
GPR16: 0000000010200040 00000001001651f2 c000000001152100 0000000000000000  
GPR20: c000000000af3f80 c000000001152180 0000000000000000 0000000000000000  
GPR24: c0000000007801e8 0000000000000001 0000000000200200 c0000000015550d0  
GPR28: c000000001554880 0000000000000000 c0000001f5564200 0000000000000000  
[14927.669159] NIP [c0000000000d1bd8] .update_blocked_averages+0xc8/0x5c0 
[14927.669165] LR [c0000000000d1b94] .update_blocked_averages+0x84/0x5c0 
[14927.669170] Call Trace: 
[14927.669174] [c0000001fbfb33a0] [c0000000000d1b94] .update_blocked_averages+0x84/0x5c0 (unreliable) 
[14927.669183] [c0000001fbfb3490] [c0000000000d7c54] .rebalance_domains+0x84/0x260 
[14927.669190] [c0000001fbfb3570] [c0000000000d7eb4] .run_rebalance_domains+0x84/0x230 
[14927.669198] [c0000001fbfb3650] [c000000000091228] .__do_softirq+0x148/0x310 
[14927.669205] [c0000001fbfb3740] [c000000000091608] .irq_exit+0xc8/0xe0 
[14927.669212] [c0000001fbfb37c0] [c00000000001d214] .timer_interrupt+0x154/0x2e0 
[14927.669220] [c0000001fbfb3870] [c0000000000024d4] decrementer_common+0x154/0x180 
[14927.669230] --- Exception: 901 at .plpar_hcall_norets+0x84/0xd4 
[14927.669230]     LR = .check_and_cede_processor+0x24/0x40 
[14927.669240] [c0000001fbfb3b60] [0000000000000001] 0x1 (unreliable) 
[14927.669247] [c0000001fbfb3bd0] [c00000000006d070] .shared_cede_loop+0x50/0xe0 
[14927.669256] [c0000001fbfb3c90] [c0000000005b818c] .cpuidle_enter+0x2c/0x40 
[14927.669263] [c0000001fbfb3d00] [c0000000005b8ad0] .cpuidle_idle_call+0xf0/0x300 
[14927.669270] [c0000001fbfb3db0] [c00000000005dab0] .pSeries_idle+0x10/0x40 
[14927.669278] [c0000001fbfb3e20] [c0000000000171b8] .cpu_idle+0x158/0x2a0 
[14927.669285] [c0000001fbfb3ed0] [c00000000074c030] .start_secondary+0x3a4/0x3ac 
[14927.669293] [c0000001fbfb3f90] [c00000000000976c] .start_secondary_prolog+0x10/0x14 
[14927.669299] Instruction dump: 
[14927.669303] 7fbbf040 3bdeff50 419e01f0 3f400020 3f02ff69 3ae00000 3ac00000 3b18d1b0  
[14927.669314] 635a0200 60000000 e93c0912 e95e00c0 <e90a0040> e94a0048 79291f24 7fe8482a  
[14927.669334] ---[ end trace ac4936baffc8b47b ]--- 
[14927.671261]  
[14927.671266] Oops: Kernel access of bad area, sig: 11 [#2] 
[14927.671272] SMP NR_CPUS=1024 NUMA pSeries 

CAI Qian
> 
> Cheers,
> 
> Dave.
> --
> Dave Chinner
> david@fromorbit.com
> --
> To unsubscribe from this list: send the line "unsubscribe stable" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
> 

_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: 3.9.2: xfstests triggered panic
  2013-05-23  3:16       ` CAI Qian
@ 2013-05-23  3:46         ` Dave Chinner
  -1 siblings, 0 replies; 16+ messages in thread
From: Dave Chinner @ 2013-05-23  3:46 UTC (permalink / raw)
  To: CAI Qian; +Cc: LKML, stable, xfs

On Wed, May 22, 2013 at 11:16:56PM -0400, CAI Qian wrote:
> ----- Original Message -----
> > From: "Dave Chinner" <david@fromorbit.com>
> > To: "CAI Qian" <caiqian@redhat.com>
> > Cc: "LKML" <linux-kernel@vger.kernel.org>, stable@vger.kernel.org, xfs@oss.sgi.com
> > Sent: Wednesday, May 22, 2013 5:53:00 PM
> > Subject: Re: 3.9.2: xfstests triggered panic
> > 
> > On Wed, May 22, 2013 at 04:39:58AM -0400, CAI Qian wrote:
> > > Reproduced on almost all s390x guests by running xfstests.
> > > 
> > > 14634.396658¨ XFS (dm-1): Mounting Filesystem
> > > 14634.525522¨ XFS (dm-1): Ending clean mount
> > > 14640.413007¨  <000000000017c6d4>¨ idle_balance+0x1a0/0x340
> > > 14640.413010¨  <000000000063303e>¨ __schedule+0xa22/0xaf0
> > > 14640.428279¨  <0000000000630da6>¨ schedule_timeout+0x186/0x2c0
> > > 14640.428289¨  <00000000001cf864>¨ rcu_gp_kthread+0x1bc/0x298
> > > 14640.428300¨  <0000000000158c5a>¨ kthread+0xe6/0xec
> > > 14640.428304¨  <0000000000634de6>¨ kernel_thread_starter+0x6/0xc
> > > 14640.428308¨  <0000000000634de0>¨ kernel_thread_starter+0x0/0xc
> > > 14640.428311¨ Last Breaking-Event-Address:
> > > 14640.428314¨  <000000000016bd76>¨ walk_tg_tree_from+0x3a/0xf4
> > > 14640.428319¨  list_add corruption. next->prev should be prev
> > > (0000000000000918
> > > ), but was           (null). (next=          (null)).
> > 
> > Where's XFS in this? walk_tg_tree_from() is part of the scheduler
> > code. This kind of implies a stack corruption....
> > 
> > > Sometimes, this pops up,
> > > [16907.275002] WARNING: at kernel/rcutree.c:1960
> > > 
> > > or this,
> > > 15316.154171¨ XFS (dm-1): Mounting Filesystem
> > > 15316.255796¨ XFS (dm-1): Ending clean mount
> > > 15320.364246¨            00000000006367a2: e310b0080004        lg
> > > %r1,8(%r
> > > 11)
> > > 15320.364249¨            00000000006367a8: 41101010            la
> > > %r1,16(%
> > > r1)
> > > 15320.364251¨            00000000006367ac: e33010000004        lg
> > > %r3,0(%r
> > > 1)
> > > 15320.364252¨ Call Trace:
> > > 15320.364252¨ Last Breaking-Event-Address:
> > > 15320.364253¨  � <0000000000000000>¨ Kernel stack overflow.
> > > 15320.364308¨ CPU: 0 Tainted: GF       W    3.9.2 #1
> > > 15320.364309¨ Process rhts-test-runne (pid: 625, task: 000000003dccc890,
> > > ksp: 0
> > 
> > .... and there you go - a stack overflow. Your kernel stack size is
> > too small.
> > 
> > I'd suggest that you need 16k stacks on s390 - IIRC every function
> > call has 128 byte stack frame, and there are call chains 70-80
> > functions deep in the storage stack...
> Hmm, I am unsure how to set to 16k stack there

Are you build a 64 bit s390 kernel or a 32 bit kernel? 32 bit
kernels only have an 8k stack size, 64 bit kernels are 16k (see
arch/s390/Makefile).

$ git grep STACK_SIZE arch/s390 |head -2
arch/s390/Makefile:STACK_SIZE   := 8192
arch/s390/Makefile:STACK_SIZE   := 16384

As it is, the stack frame usage is worse than I thought:

$ git grep STACK_FRAME_OVERHEAD arch/s390 |head -2
arch/s390/include/uapi/asm/ptrace.h:#define STACK_FRAME_OVERHEAD 96      /* size of minimum stack frame */
arch/s390/include/uapi/asm/ptrace.h:#define STACK_FRAME_OVERHEAD 160      /* size of minimum stack frame */

Overhead is 96 bytes for 32 bit and 160 bytes for 64 bit. So 16k
stack size is going to have big troubles with a 70-80 function deep
call chain.

As for powerpc:

arch/powerpc/include/asm/ppc_asm.h:#define STACKFRAMESIZE 256

Yeah, same issue.

But, seriously, these stack traces are meaningless to anyone not
familiar with s390 or power7 - they indicate a problem detected
in the idle loop, not where ever the stack overran.

Can you please work with the s390/power7 people to obtain whatever
stack it was that overflowed, and we can go from there.

Cheers,

Dave.
-- 
Dave Chinner
david@fromorbit.com

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: 3.9.2: xfstests triggered panic
@ 2013-05-23  3:46         ` Dave Chinner
  0 siblings, 0 replies; 16+ messages in thread
From: Dave Chinner @ 2013-05-23  3:46 UTC (permalink / raw)
  To: CAI Qian; +Cc: LKML, stable, xfs

On Wed, May 22, 2013 at 11:16:56PM -0400, CAI Qian wrote:
> ----- Original Message -----
> > From: "Dave Chinner" <david@fromorbit.com>
> > To: "CAI Qian" <caiqian@redhat.com>
> > Cc: "LKML" <linux-kernel@vger.kernel.org>, stable@vger.kernel.org, xfs@oss.sgi.com
> > Sent: Wednesday, May 22, 2013 5:53:00 PM
> > Subject: Re: 3.9.2: xfstests triggered panic
> > 
> > On Wed, May 22, 2013 at 04:39:58AM -0400, CAI Qian wrote:
> > > Reproduced on almost all s390x guests by running xfstests.
> > > 
> > > 14634.396658¨ XFS (dm-1): Mounting Filesystem
> > > 14634.525522¨ XFS (dm-1): Ending clean mount
> > > 14640.413007¨  <000000000017c6d4>¨ idle_balance+0x1a0/0x340
> > > 14640.413010¨  <000000000063303e>¨ __schedule+0xa22/0xaf0
> > > 14640.428279¨  <0000000000630da6>¨ schedule_timeout+0x186/0x2c0
> > > 14640.428289¨  <00000000001cf864>¨ rcu_gp_kthread+0x1bc/0x298
> > > 14640.428300¨  <0000000000158c5a>¨ kthread+0xe6/0xec
> > > 14640.428304¨  <0000000000634de6>¨ kernel_thread_starter+0x6/0xc
> > > 14640.428308¨  <0000000000634de0>¨ kernel_thread_starter+0x0/0xc
> > > 14640.428311¨ Last Breaking-Event-Address:
> > > 14640.428314¨  <000000000016bd76>¨ walk_tg_tree_from+0x3a/0xf4
> > > 14640.428319¨  list_add corruption. next->prev should be prev
> > > (0000000000000918
> > > ), but was           (null). (next=          (null)).
> > 
> > Where's XFS in this? walk_tg_tree_from() is part of the scheduler
> > code. This kind of implies a stack corruption....
> > 
> > > Sometimes, this pops up,
> > > [16907.275002] WARNING: at kernel/rcutree.c:1960
> > > 
> > > or this,
> > > 15316.154171¨ XFS (dm-1): Mounting Filesystem
> > > 15316.255796¨ XFS (dm-1): Ending clean mount
> > > 15320.364246¨            00000000006367a2: e310b0080004        lg
> > > %r1,8(%r
> > > 11)
> > > 15320.364249¨            00000000006367a8: 41101010            la
> > > %r1,16(%
> > > r1)
> > > 15320.364251¨            00000000006367ac: e33010000004        lg
> > > %r3,0(%r
> > > 1)
> > > 15320.364252¨ Call Trace:
> > > 15320.364252¨ Last Breaking-Event-Address:
> > > 15320.364253¨  � <0000000000000000>¨ Kernel stack overflow.
> > > 15320.364308¨ CPU: 0 Tainted: GF       W    3.9.2 #1
> > > 15320.364309¨ Process rhts-test-runne (pid: 625, task: 000000003dccc890,
> > > ksp: 0
> > 
> > .... and there you go - a stack overflow. Your kernel stack size is
> > too small.
> > 
> > I'd suggest that you need 16k stacks on s390 - IIRC every function
> > call has 128 byte stack frame, and there are call chains 70-80
> > functions deep in the storage stack...
> Hmm, I am unsure how to set to 16k stack there

Are you build a 64 bit s390 kernel or a 32 bit kernel? 32 bit
kernels only have an 8k stack size, 64 bit kernels are 16k (see
arch/s390/Makefile).

$ git grep STACK_SIZE arch/s390 |head -2
arch/s390/Makefile:STACK_SIZE   := 8192
arch/s390/Makefile:STACK_SIZE   := 16384

As it is, the stack frame usage is worse than I thought:

$ git grep STACK_FRAME_OVERHEAD arch/s390 |head -2
arch/s390/include/uapi/asm/ptrace.h:#define STACK_FRAME_OVERHEAD 96      /* size of minimum stack frame */
arch/s390/include/uapi/asm/ptrace.h:#define STACK_FRAME_OVERHEAD 160      /* size of minimum stack frame */

Overhead is 96 bytes for 32 bit and 160 bytes for 64 bit. So 16k
stack size is going to have big troubles with a 70-80 function deep
call chain.

As for powerpc:

arch/powerpc/include/asm/ppc_asm.h:#define STACKFRAMESIZE 256

Yeah, same issue.

But, seriously, these stack traces are meaningless to anyone not
familiar with s390 or power7 - they indicate a problem detected
in the idle loop, not where ever the stack overran.

Can you please work with the s390/power7 people to obtain whatever
stack it was that overflowed, and we can go from there.

Cheers,

Dave.
-- 
Dave Chinner
david@fromorbit.com

_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: 3.9.2: xfstests triggered panic
  2013-05-23  3:46         ` Dave Chinner
@ 2013-05-23  4:11           ` CAI Qian
  -1 siblings, 0 replies; 16+ messages in thread
From: CAI Qian @ 2013-05-23  4:11 UTC (permalink / raw)
  To: Dave Chinner; +Cc: LKML, stable, xfs



----- Original Message -----
> From: "Dave Chinner" <david@fromorbit.com>
> To: "CAI Qian" <caiqian@redhat.com>
> Cc: "LKML" <linux-kernel@vger.kernel.org>, stable@vger.kernel.org, xfs@oss.sgi.com
> Sent: Thursday, May 23, 2013 11:46:11 AM
> Subject: Re: 3.9.2: xfstests triggered panic
> 
> On Wed, May 22, 2013 at 11:16:56PM -0400, CAI Qian wrote:
> > ----- Original Message -----
> > > From: "Dave Chinner" <david@fromorbit.com>
> > > To: "CAI Qian" <caiqian@redhat.com>
> > > Cc: "LKML" <linux-kernel@vger.kernel.org>, stable@vger.kernel.org,
> > > xfs@oss.sgi.com
> > > Sent: Wednesday, May 22, 2013 5:53:00 PM
> > > Subject: Re: 3.9.2: xfstests triggered panic
> > > 
> > > On Wed, May 22, 2013 at 04:39:58AM -0400, CAI Qian wrote:
> > > > Reproduced on almost all s390x guests by running xfstests.
> > > > 
> > > > 14634.396658¨ XFS (dm-1): Mounting Filesystem
> > > > 14634.525522¨ XFS (dm-1): Ending clean mount
> > > > 14640.413007¨  <000000000017c6d4>¨ idle_balance+0x1a0/0x340
> > > > 14640.413010¨  <000000000063303e>¨ __schedule+0xa22/0xaf0
> > > > 14640.428279¨  <0000000000630da6>¨ schedule_timeout+0x186/0x2c0
> > > > 14640.428289¨  <00000000001cf864>¨ rcu_gp_kthread+0x1bc/0x298
> > > > 14640.428300¨  <0000000000158c5a>¨ kthread+0xe6/0xec
> > > > 14640.428304¨  <0000000000634de6>¨ kernel_thread_starter+0x6/0xc
> > > > 14640.428308¨  <0000000000634de0>¨ kernel_thread_starter+0x0/0xc
> > > > 14640.428311¨ Last Breaking-Event-Address:
> > > > 14640.428314¨  <000000000016bd76>¨ walk_tg_tree_from+0x3a/0xf4
> > > > 14640.428319¨  list_add corruption. next->prev should be prev
> > > > (0000000000000918
> > > > ), but was           (null). (next=          (null)).
> > > 
> > > Where's XFS in this? walk_tg_tree_from() is part of the scheduler
> > > code. This kind of implies a stack corruption....
> > > 
> > > > Sometimes, this pops up,
> > > > [16907.275002] WARNING: at kernel/rcutree.c:1960
> > > > 
> > > > or this,
> > > > 15316.154171¨ XFS (dm-1): Mounting Filesystem
> > > > 15316.255796¨ XFS (dm-1): Ending clean mount
> > > > 15320.364246¨            00000000006367a2: e310b0080004        lg
> > > > %r1,8(%r
> > > > 11)
> > > > 15320.364249¨            00000000006367a8: 41101010            la
> > > > %r1,16(%
> > > > r1)
> > > > 15320.364251¨            00000000006367ac: e33010000004        lg
> > > > %r3,0(%r
> > > > 1)
> > > > 15320.364252¨ Call Trace:
> > > > 15320.364252¨ Last Breaking-Event-Address:
> > > > 15320.364253¨  � <0000000000000000>¨ Kernel stack overflow.
> > > > 15320.364308¨ CPU: 0 Tainted: GF       W    3.9.2 #1
> > > > 15320.364309¨ Process rhts-test-runne (pid: 625, task:
> > > > 000000003dccc890,
> > > > ksp: 0
> > > 
> > > .... and there you go - a stack overflow. Your kernel stack size is
> > > too small.
> > > 
> > > I'd suggest that you need 16k stacks on s390 - IIRC every function
> > > call has 128 byte stack frame, and there are call chains 70-80
> > > functions deep in the storage stack...
> > Hmm, I am unsure how to set to 16k stack there
> 
> Are you build a 64 bit s390 kernel or a 32 bit kernel? 32 bit
> kernels only have an 8k stack size, 64 bit kernels are 16k (see
> arch/s390/Makefile).
It is 64-bit.
> 
> $ git grep STACK_SIZE arch/s390 |head -2
> arch/s390/Makefile:STACK_SIZE   := 8192
> arch/s390/Makefile:STACK_SIZE   := 16384
> 
> As it is, the stack frame usage is worse than I thought:
> 
> $ git grep STACK_FRAME_OVERHEAD arch/s390 |head -2
> arch/s390/include/uapi/asm/ptrace.h:#define STACK_FRAME_OVERHEAD 96      /*
> size of minimum stack frame */
> arch/s390/include/uapi/asm/ptrace.h:#define STACK_FRAME_OVERHEAD 160      /*
> size of minimum stack frame */
> 
> Overhead is 96 bytes for 32 bit and 160 bytes for 64 bit. So 16k
> stack size is going to have big troubles with a 70-80 function deep
> call chain.
> 
> As for powerpc:
> 
> arch/powerpc/include/asm/ppc_asm.h:#define STACKFRAMESIZE 256
> 
> Yeah, same issue.
> 
> But, seriously, these stack traces are meaningless to anyone not
> familiar with s390 or power7 - they indicate a problem detected
> in the idle loop, not where ever the stack overran.
> 
> Can you please work with the s390/power7 people to obtain whatever
> stack it was that overflowed, and we can go from there.
OK, I'll do.
CAI Qian
> 
> Cheers,
> 
> Dave.
> --
> Dave Chinner
> david@fromorbit.com
> 

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: 3.9.2: xfstests triggered panic
@ 2013-05-23  4:11           ` CAI Qian
  0 siblings, 0 replies; 16+ messages in thread
From: CAI Qian @ 2013-05-23  4:11 UTC (permalink / raw)
  To: Dave Chinner; +Cc: LKML, stable, xfs



----- Original Message -----
> From: "Dave Chinner" <david@fromorbit.com>
> To: "CAI Qian" <caiqian@redhat.com>
> Cc: "LKML" <linux-kernel@vger.kernel.org>, stable@vger.kernel.org, xfs@oss.sgi.com
> Sent: Thursday, May 23, 2013 11:46:11 AM
> Subject: Re: 3.9.2: xfstests triggered panic
> 
> On Wed, May 22, 2013 at 11:16:56PM -0400, CAI Qian wrote:
> > ----- Original Message -----
> > > From: "Dave Chinner" <david@fromorbit.com>
> > > To: "CAI Qian" <caiqian@redhat.com>
> > > Cc: "LKML" <linux-kernel@vger.kernel.org>, stable@vger.kernel.org,
> > > xfs@oss.sgi.com
> > > Sent: Wednesday, May 22, 2013 5:53:00 PM
> > > Subject: Re: 3.9.2: xfstests triggered panic
> > > 
> > > On Wed, May 22, 2013 at 04:39:58AM -0400, CAI Qian wrote:
> > > > Reproduced on almost all s390x guests by running xfstests.
> > > > 
> > > > 14634.396658¨ XFS (dm-1): Mounting Filesystem
> > > > 14634.525522¨ XFS (dm-1): Ending clean mount
> > > > 14640.413007¨  <000000000017c6d4>¨ idle_balance+0x1a0/0x340
> > > > 14640.413010¨  <000000000063303e>¨ __schedule+0xa22/0xaf0
> > > > 14640.428279¨  <0000000000630da6>¨ schedule_timeout+0x186/0x2c0
> > > > 14640.428289¨  <00000000001cf864>¨ rcu_gp_kthread+0x1bc/0x298
> > > > 14640.428300¨  <0000000000158c5a>¨ kthread+0xe6/0xec
> > > > 14640.428304¨  <0000000000634de6>¨ kernel_thread_starter+0x6/0xc
> > > > 14640.428308¨  <0000000000634de0>¨ kernel_thread_starter+0x0/0xc
> > > > 14640.428311¨ Last Breaking-Event-Address:
> > > > 14640.428314¨  <000000000016bd76>¨ walk_tg_tree_from+0x3a/0xf4
> > > > 14640.428319¨  list_add corruption. next->prev should be prev
> > > > (0000000000000918
> > > > ), but was           (null). (next=          (null)).
> > > 
> > > Where's XFS in this? walk_tg_tree_from() is part of the scheduler
> > > code. This kind of implies a stack corruption....
> > > 
> > > > Sometimes, this pops up,
> > > > [16907.275002] WARNING: at kernel/rcutree.c:1960
> > > > 
> > > > or this,
> > > > 15316.154171¨ XFS (dm-1): Mounting Filesystem
> > > > 15316.255796¨ XFS (dm-1): Ending clean mount
> > > > 15320.364246¨            00000000006367a2: e310b0080004        lg
> > > > %r1,8(%r
> > > > 11)
> > > > 15320.364249¨            00000000006367a8: 41101010            la
> > > > %r1,16(%
> > > > r1)
> > > > 15320.364251¨            00000000006367ac: e33010000004        lg
> > > > %r3,0(%r
> > > > 1)
> > > > 15320.364252¨ Call Trace:
> > > > 15320.364252¨ Last Breaking-Event-Address:
> > > > 15320.364253¨  � <0000000000000000>¨ Kernel stack overflow.
> > > > 15320.364308¨ CPU: 0 Tainted: GF       W    3.9.2 #1
> > > > 15320.364309¨ Process rhts-test-runne (pid: 625, task:
> > > > 000000003dccc890,
> > > > ksp: 0
> > > 
> > > .... and there you go - a stack overflow. Your kernel stack size is
> > > too small.
> > > 
> > > I'd suggest that you need 16k stacks on s390 - IIRC every function
> > > call has 128 byte stack frame, and there are call chains 70-80
> > > functions deep in the storage stack...
> > Hmm, I am unsure how to set to 16k stack there
> 
> Are you build a 64 bit s390 kernel or a 32 bit kernel? 32 bit
> kernels only have an 8k stack size, 64 bit kernels are 16k (see
> arch/s390/Makefile).
It is 64-bit.
> 
> $ git grep STACK_SIZE arch/s390 |head -2
> arch/s390/Makefile:STACK_SIZE   := 8192
> arch/s390/Makefile:STACK_SIZE   := 16384
> 
> As it is, the stack frame usage is worse than I thought:
> 
> $ git grep STACK_FRAME_OVERHEAD arch/s390 |head -2
> arch/s390/include/uapi/asm/ptrace.h:#define STACK_FRAME_OVERHEAD 96      /*
> size of minimum stack frame */
> arch/s390/include/uapi/asm/ptrace.h:#define STACK_FRAME_OVERHEAD 160      /*
> size of minimum stack frame */
> 
> Overhead is 96 bytes for 32 bit and 160 bytes for 64 bit. So 16k
> stack size is going to have big troubles with a 70-80 function deep
> call chain.
> 
> As for powerpc:
> 
> arch/powerpc/include/asm/ppc_asm.h:#define STACKFRAMESIZE 256
> 
> Yeah, same issue.
> 
> But, seriously, these stack traces are meaningless to anyone not
> familiar with s390 or power7 - they indicate a problem detected
> in the idle loop, not where ever the stack overran.
> 
> Can you please work with the s390/power7 people to obtain whatever
> stack it was that overflowed, and we can go from there.
OK, I'll do.
CAI Qian
> 
> Cheers,
> 
> Dave.
> --
> Dave Chinner
> david@fromorbit.com
> 

_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs

^ permalink raw reply	[flat|nested] 16+ messages in thread

* 3.9.2/3.9.3: stack overrun on s390x and ppc64 (WAS Re: 3.9.2: xfstests triggered panic)
  2013-05-23  3:46         ` Dave Chinner
  (?)
@ 2013-05-23  4:57           ` CAI Qian
  -1 siblings, 0 replies; 16+ messages in thread
From: CAI Qian @ 2013-05-23  4:57 UTC (permalink / raw)
  To: linux-s390, linuxppc-dev
  Cc: LKML, stable, xfs, Steve Best, Hendrik Brueckner, Dave Chinner

Original report:
http://oss.sgi.com/archives/xfs/2013-05/msg00683.html

Also seen on Power7:
http://marc.info/?l=linux-kernel&m=136927904900692&w=2

CAI Qian

----- Original Message -----
> From: "Dave Chinner" <david@fromorbit.com>
> To: "CAI Qian" <caiqian@redhat.com>
> Cc: "LKML" <linux-kernel@vger.kernel.org>, stable@vger.kernel.org, xfs@oss.sgi.com
> Sent: Thursday, May 23, 2013 11:46:11 AM
> Subject: Re: 3.9.2: xfstests triggered panic
> 
> On Wed, May 22, 2013 at 11:16:56PM -0400, CAI Qian wrote:
> > ----- Original Message -----
> > > From: "Dave Chinner" <david@fromorbit.com>
> > > To: "CAI Qian" <caiqian@redhat.com>
> > > Cc: "LKML" <linux-kernel@vger.kernel.org>, stable@vger.kernel.org,
> > > xfs@oss.sgi.com
> > > Sent: Wednesday, May 22, 2013 5:53:00 PM
> > > Subject: Re: 3.9.2: xfstests triggered panic
> > > 
> > > On Wed, May 22, 2013 at 04:39:58AM -0400, CAI Qian wrote:
> > > > Reproduced on almost all s390x guests by running xfstests.
> > > > 
> > > > 14634.396658¨ XFS (dm-1): Mounting Filesystem
> > > > 14634.525522¨ XFS (dm-1): Ending clean mount
> > > > 14640.413007¨  <000000000017c6d4>¨ idle_balance+0x1a0/0x340
> > > > 14640.413010¨  <000000000063303e>¨ __schedule+0xa22/0xaf0
> > > > 14640.428279¨  <0000000000630da6>¨ schedule_timeout+0x186/0x2c0
> > > > 14640.428289¨  <00000000001cf864>¨ rcu_gp_kthread+0x1bc/0x298
> > > > 14640.428300¨  <0000000000158c5a>¨ kthread+0xe6/0xec
> > > > 14640.428304¨  <0000000000634de6>¨ kernel_thread_starter+0x6/0xc
> > > > 14640.428308¨  <0000000000634de0>¨ kernel_thread_starter+0x0/0xc
> > > > 14640.428311¨ Last Breaking-Event-Address:
> > > > 14640.428314¨  <000000000016bd76>¨ walk_tg_tree_from+0x3a/0xf4
> > > > 14640.428319¨  list_add corruption. next->prev should be prev
> > > > (0000000000000918
> > > > ), but was           (null). (next=          (null)).
> > > 
> > > Where's XFS in this? walk_tg_tree_from() is part of the scheduler
> > > code. This kind of implies a stack corruption....
> > > 
> > > > Sometimes, this pops up,
> > > > [16907.275002] WARNING: at kernel/rcutree.c:1960
> > > > 
> > > > or this,
> > > > 15316.154171¨ XFS (dm-1): Mounting Filesystem
> > > > 15316.255796¨ XFS (dm-1): Ending clean mount
> > > > 15320.364246¨            00000000006367a2: e310b0080004        lg
> > > > %r1,8(%r
> > > > 11)
> > > > 15320.364249¨            00000000006367a8: 41101010            la
> > > > %r1,16(%
> > > > r1)
> > > > 15320.364251¨            00000000006367ac: e33010000004        lg
> > > > %r3,0(%r
> > > > 1)
> > > > 15320.364252¨ Call Trace:
> > > > 15320.364252¨ Last Breaking-Event-Address:
> > > > 15320.364253¨  � <0000000000000000>¨ Kernel stack overflow.
> > > > 15320.364308¨ CPU: 0 Tainted: GF       W    3.9.2 #1
> > > > 15320.364309¨ Process rhts-test-runne (pid: 625, task:
> > > > 000000003dccc890,
> > > > ksp: 0
> > > 
> > > .... and there you go - a stack overflow. Your kernel stack size is
> > > too small.
> > > 
> > > I'd suggest that you need 16k stacks on s390 - IIRC every function
> > > call has 128 byte stack frame, and there are call chains 70-80
> > > functions deep in the storage stack...
> > Hmm, I am unsure how to set to 16k stack there
> 
> Are you build a 64 bit s390 kernel or a 32 bit kernel? 32 bit
> kernels only have an 8k stack size, 64 bit kernels are 16k (see
> arch/s390/Makefile).
> 
> $ git grep STACK_SIZE arch/s390 |head -2
> arch/s390/Makefile:STACK_SIZE   := 8192
> arch/s390/Makefile:STACK_SIZE   := 16384
> 
> As it is, the stack frame usage is worse than I thought:
> 
> $ git grep STACK_FRAME_OVERHEAD arch/s390 |head -2
> arch/s390/include/uapi/asm/ptrace.h:#define STACK_FRAME_OVERHEAD 96      /*
> size of minimum stack frame */
> arch/s390/include/uapi/asm/ptrace.h:#define STACK_FRAME_OVERHEAD 160      /*
> size of minimum stack frame */
> 
> Overhead is 96 bytes for 32 bit and 160 bytes for 64 bit. So 16k
> stack size is going to have big troubles with a 70-80 function deep
> call chain.
> 
> As for powerpc:
> 
> arch/powerpc/include/asm/ppc_asm.h:#define STACKFRAMESIZE 256
> 
> Yeah, same issue.
> 
> But, seriously, these stack traces are meaningless to anyone not
> familiar with s390 or power7 - they indicate a problem detected
> in the idle loop, not where ever the stack overran.
> 
> Can you please work with the s390/power7 people to obtain whatever
> stack it was that overflowed, and we can go from there.
> 
> Cheers,
> 
> Dave.
> --
> Dave Chinner
> david@fromorbit.com
> 

^ permalink raw reply	[flat|nested] 16+ messages in thread

* 3.9.2/3.9.3: stack overrun on s390x and ppc64 (WAS Re: 3.9.2: xfstests triggered panic)
@ 2013-05-23  4:57           ` CAI Qian
  0 siblings, 0 replies; 16+ messages in thread
From: CAI Qian @ 2013-05-23  4:57 UTC (permalink / raw)
  To: linux-s390, linuxppc-dev; +Cc: LKML, Steve Best, xfs, stable, Hendrik Brueckner

Original report:
http://oss.sgi.com/archives/xfs/2013-05/msg00683.html

Also seen on Power7:
http://marc.info/?l=linux-kernel&m=136927904900692&w=2

CAI Qian

----- Original Message -----
> From: "Dave Chinner" <david@fromorbit.com>
> To: "CAI Qian" <caiqian@redhat.com>
> Cc: "LKML" <linux-kernel@vger.kernel.org>, stable@vger.kernel.org, xfs@oss.sgi.com
> Sent: Thursday, May 23, 2013 11:46:11 AM
> Subject: Re: 3.9.2: xfstests triggered panic
> 
> On Wed, May 22, 2013 at 11:16:56PM -0400, CAI Qian wrote:
> > ----- Original Message -----
> > > From: "Dave Chinner" <david@fromorbit.com>
> > > To: "CAI Qian" <caiqian@redhat.com>
> > > Cc: "LKML" <linux-kernel@vger.kernel.org>, stable@vger.kernel.org,
> > > xfs@oss.sgi.com
> > > Sent: Wednesday, May 22, 2013 5:53:00 PM
> > > Subject: Re: 3.9.2: xfstests triggered panic
> > > 
> > > On Wed, May 22, 2013 at 04:39:58AM -0400, CAI Qian wrote:
> > > > Reproduced on almost all s390x guests by running xfstests.
> > > > 
> > > > 14634.396658¨ XFS (dm-1): Mounting Filesystem
> > > > 14634.525522¨ XFS (dm-1): Ending clean mount
> > > > 14640.413007¨  <000000000017c6d4>¨ idle_balance+0x1a0/0x340
> > > > 14640.413010¨  <000000000063303e>¨ __schedule+0xa22/0xaf0
> > > > 14640.428279¨  <0000000000630da6>¨ schedule_timeout+0x186/0x2c0
> > > > 14640.428289¨  <00000000001cf864>¨ rcu_gp_kthread+0x1bc/0x298
> > > > 14640.428300¨  <0000000000158c5a>¨ kthread+0xe6/0xec
> > > > 14640.428304¨  <0000000000634de6>¨ kernel_thread_starter+0x6/0xc
> > > > 14640.428308¨  <0000000000634de0>¨ kernel_thread_starter+0x0/0xc
> > > > 14640.428311¨ Last Breaking-Event-Address:
> > > > 14640.428314¨  <000000000016bd76>¨ walk_tg_tree_from+0x3a/0xf4
> > > > 14640.428319¨  list_add corruption. next->prev should be prev
> > > > (0000000000000918
> > > > ), but was           (null). (next=          (null)).
> > > 
> > > Where's XFS in this? walk_tg_tree_from() is part of the scheduler
> > > code. This kind of implies a stack corruption....
> > > 
> > > > Sometimes, this pops up,
> > > > [16907.275002] WARNING: at kernel/rcutree.c:1960
> > > > 
> > > > or this,
> > > > 15316.154171¨ XFS (dm-1): Mounting Filesystem
> > > > 15316.255796¨ XFS (dm-1): Ending clean mount
> > > > 15320.364246¨            00000000006367a2: e310b0080004        lg
> > > > %r1,8(%r
> > > > 11)
> > > > 15320.364249¨            00000000006367a8: 41101010            la
> > > > %r1,16(%
> > > > r1)
> > > > 15320.364251¨            00000000006367ac: e33010000004        lg
> > > > %r3,0(%r
> > > > 1)
> > > > 15320.364252¨ Call Trace:
> > > > 15320.364252¨ Last Breaking-Event-Address:
> > > > 15320.364253¨  � <0000000000000000>¨ Kernel stack overflow.
> > > > 15320.364308¨ CPU: 0 Tainted: GF       W    3.9.2 #1
> > > > 15320.364309¨ Process rhts-test-runne (pid: 625, task:
> > > > 000000003dccc890,
> > > > ksp: 0
> > > 
> > > .... and there you go - a stack overflow. Your kernel stack size is
> > > too small.
> > > 
> > > I'd suggest that you need 16k stacks on s390 - IIRC every function
> > > call has 128 byte stack frame, and there are call chains 70-80
> > > functions deep in the storage stack...
> > Hmm, I am unsure how to set to 16k stack there
> 
> Are you build a 64 bit s390 kernel or a 32 bit kernel? 32 bit
> kernels only have an 8k stack size, 64 bit kernels are 16k (see
> arch/s390/Makefile).
> 
> $ git grep STACK_SIZE arch/s390 |head -2
> arch/s390/Makefile:STACK_SIZE   := 8192
> arch/s390/Makefile:STACK_SIZE   := 16384
> 
> As it is, the stack frame usage is worse than I thought:
> 
> $ git grep STACK_FRAME_OVERHEAD arch/s390 |head -2
> arch/s390/include/uapi/asm/ptrace.h:#define STACK_FRAME_OVERHEAD 96      /*
> size of minimum stack frame */
> arch/s390/include/uapi/asm/ptrace.h:#define STACK_FRAME_OVERHEAD 160      /*
> size of minimum stack frame */
> 
> Overhead is 96 bytes for 32 bit and 160 bytes for 64 bit. So 16k
> stack size is going to have big troubles with a 70-80 function deep
> call chain.
> 
> As for powerpc:
> 
> arch/powerpc/include/asm/ppc_asm.h:#define STACKFRAMESIZE 256
> 
> Yeah, same issue.
> 
> But, seriously, these stack traces are meaningless to anyone not
> familiar with s390 or power7 - they indicate a problem detected
> in the idle loop, not where ever the stack overran.
> 
> Can you please work with the s390/power7 people to obtain whatever
> stack it was that overflowed, and we can go from there.
> 
> Cheers,
> 
> Dave.
> --
> Dave Chinner
> david@fromorbit.com
> 

_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs

^ permalink raw reply	[flat|nested] 16+ messages in thread

* 3.9.2/3.9.3: stack overrun on s390x and ppc64 (WAS Re: 3.9.2: xfstests triggered panic)
@ 2013-05-23  4:57           ` CAI Qian
  0 siblings, 0 replies; 16+ messages in thread
From: CAI Qian @ 2013-05-23  4:57 UTC (permalink / raw)
  To: linux-s390, linuxppc-dev
  Cc: Dave Chinner, LKML, Steve Best, xfs, stable, Hendrik Brueckner

Original report:
http://oss.sgi.com/archives/xfs/2013-05/msg00683.html

Also seen on Power7:
http://marc.info/?l=3Dlinux-kernel&m=3D136927904900692&w=3D2

CAI Qian

----- Original Message -----
> From: "Dave Chinner" <david@fromorbit.com>
> To: "CAI Qian" <caiqian@redhat.com>
> Cc: "LKML" <linux-kernel@vger.kernel.org>, stable@vger.kernel.org, xfs@os=
s.sgi.com
> Sent: Thursday, May 23, 2013 11:46:11 AM
> Subject: Re: 3.9.2: xfstests triggered panic
>=20
> On Wed, May 22, 2013 at 11:16:56PM -0400, CAI Qian wrote:
> > ----- Original Message -----
> > > From: "Dave Chinner" <david@fromorbit.com>
> > > To: "CAI Qian" <caiqian@redhat.com>
> > > Cc: "LKML" <linux-kernel@vger.kernel.org>, stable@vger.kernel.org,
> > > xfs@oss.sgi.com
> > > Sent: Wednesday, May 22, 2013 5:53:00 PM
> > > Subject: Re: 3.9.2: xfstests triggered panic
> > >=20
> > > On Wed, May 22, 2013 at 04:39:58AM -0400, CAI Qian wrote:
> > > > Reproduced on almost all s390x guests by running xfstests.
> > > >=20
> > > > 14634.396658=C2=A8 XFS (dm-1): Mounting Filesystem
> > > > 14634.525522=C2=A8 XFS (dm-1): Ending clean mount
> > > > 14640.413007=C2=A8  <000000000017c6d4>=C2=A8 idle_balance+0x1a0/0x3=
40
> > > > 14640.413010=C2=A8  <000000000063303e>=C2=A8 __schedule+0xa22/0xaf0
> > > > 14640.428279=C2=A8  <0000000000630da6>=C2=A8 schedule_timeout+0x186=
/0x2c0
> > > > 14640.428289=C2=A8  <00000000001cf864>=C2=A8 rcu_gp_kthread+0x1bc/0=
x298
> > > > 14640.428300=C2=A8  <0000000000158c5a>=C2=A8 kthread+0xe6/0xec
> > > > 14640.428304=C2=A8  <0000000000634de6>=C2=A8 kernel_thread_starter+=
0x6/0xc
> > > > 14640.428308=C2=A8  <0000000000634de0>=C2=A8 kernel_thread_starter+=
0x0/0xc
> > > > 14640.428311=C2=A8 Last Breaking-Event-Address:
> > > > 14640.428314=C2=A8  <000000000016bd76>=C2=A8 walk_tg_tree_from+0x3a=
/0xf4
> > > > 14640.428319=C2=A8  list_add corruption. next->prev should be prev
> > > > (0000000000000918
> > > > ), but was           (null). (next=3D          (null)).
> > >=20
> > > Where's XFS in this? walk_tg_tree_from() is part of the scheduler
> > > code. This kind of implies a stack corruption....
> > >=20
> > > > Sometimes, this pops up,
> > > > [16907.275002] WARNING: at kernel/rcutree.c:1960
> > > >=20
> > > > or this,
> > > > 15316.154171=C2=A8 XFS (dm-1): Mounting Filesystem
> > > > 15316.255796=C2=A8 XFS (dm-1): Ending clean mount
> > > > 15320.364246=C2=A8            00000000006367a2: e310b0080004       =
 lg
> > > > %r1,8(%r
> > > > 11)
> > > > 15320.364249=C2=A8            00000000006367a8: 41101010           =
 la
> > > > %r1,16(%
> > > > r1)
> > > > 15320.364251=C2=A8            00000000006367ac: e33010000004       =
 lg
> > > > %r3,0(%r
> > > > 1)
> > > > 15320.364252=C2=A8 Call Trace:
> > > > 15320.364252=C2=A8 Last Breaking-Event-Address:
> > > > 15320.364253=C2=A8  =EF=BF=BD <0000000000000000>=C2=A8 Kernel stack=
 overflow.
> > > > 15320.364308=C2=A8 CPU: 0 Tainted: GF       W    3.9.2 #1
> > > > 15320.364309=C2=A8 Process rhts-test-runne (pid: 625, task:
> > > > 000000003dccc890,
> > > > ksp: 0
> > >=20
> > > .... and there you go - a stack overflow. Your kernel stack size is
> > > too small.
> > >=20
> > > I'd suggest that you need 16k stacks on s390 - IIRC every function
> > > call has 128 byte stack frame, and there are call chains 70-80
> > > functions deep in the storage stack...
> > Hmm, I am unsure how to set to 16k stack there
>=20
> Are you build a 64 bit s390 kernel or a 32 bit kernel? 32 bit
> kernels only have an 8k stack size, 64 bit kernels are 16k (see
> arch/s390/Makefile).
>=20
> $ git grep STACK_SIZE arch/s390 |head -2
> arch/s390/Makefile:STACK_SIZE   :=3D 8192
> arch/s390/Makefile:STACK_SIZE   :=3D 16384
>=20
> As it is, the stack frame usage is worse than I thought:
>=20
> $ git grep STACK_FRAME_OVERHEAD arch/s390 |head -2
> arch/s390/include/uapi/asm/ptrace.h:#define STACK_FRAME_OVERHEAD 96      =
/*
> size of minimum stack frame */
> arch/s390/include/uapi/asm/ptrace.h:#define STACK_FRAME_OVERHEAD 160     =
 /*
> size of minimum stack frame */
>=20
> Overhead is 96 bytes for 32 bit and 160 bytes for 64 bit. So 16k
> stack size is going to have big troubles with a 70-80 function deep
> call chain.
>=20
> As for powerpc:
>=20
> arch/powerpc/include/asm/ppc_asm.h:#define STACKFRAMESIZE 256
>=20
> Yeah, same issue.
>=20
> But, seriously, these stack traces are meaningless to anyone not
> familiar with s390 or power7 - they indicate a problem detected
> in the idle loop, not where ever the stack overran.
>=20
> Can you please work with the s390/power7 people to obtain whatever
> stack it was that overflowed, and we can go from there.
>=20
> Cheers,
>=20
> Dave.
> --
> Dave Chinner
> david@fromorbit.com
>=20

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: 3.9.2/3.9.3: stack overrun on s390x and ppc64 (WAS Re: 3.9.2: xfstests triggered panic)
  2013-05-23  4:57           ` CAI Qian
  (?)
@ 2013-05-24  3:33             ` CAI Qian
  -1 siblings, 0 replies; 16+ messages in thread
From: CAI Qian @ 2013-05-24  3:33 UTC (permalink / raw)
  To: linux-s390, linuxppc-dev
  Cc: LKML, stable, xfs, Steve Best, Hendrik Brueckner, Dave Chinner

OK, here is clearer stack output from the run.
CAI Qian

+ ./check
FSTYP         -- xfs (non-debug)
PLATFORM      -- Linux/s390x ibm-z10-23 3.9.3

001	 29s
002	 3s
003	 2s
004	 [not run] this test requires a valid $SCRATCH_DEV
005	 2s
006	 9s
007	 10s
008	 7s
009	 [not run] this test requires a valid $SCRATCH_DEV
010	 [not run] dbtest was not built for this platform
011	 9s
012	 10s
013	 35s
014	 5s
015	 [not run] this test requires a valid $SCRATCH_DEV
016	 [not run] this test requires a valid $SCRATCH_DEV
017	 [not run] this test requires a valid $SCRATCH_DEV
018	 [not run] this test requires a valid $SCRATCH_DEV
019	 [not run] this test requires a valid $SCRATCH_DEV
020	


[ 1316.571227] XFS (dm-0): Mounting Filesystem
[ 1316.697803] XFS (dm-0): Ending clean mount
[ 1318.080615] XFS (dm-0): Ending clean mount
[ 1348.791125] XFS (dm-0): Mounting Filesystem
[ 1348.989166] XFS (dm-0): Ending clean mount
[ 1353.335478] XFS (dm-0): Mounting Filesystem
[ 1353.496364] XFS (dm-0): Ending clean mount
[ 1357.495427] XFS (dm-0): Mounting Filesystem
[ 1357.676971] XFS (dm-0): Ending clean mount
[ 1361.646399] XFS (dm-0): Mounting Filesystem
[ 1361.890426] XFS (dm-0): Ending clean mount
[ 1371.798944] XFS (dm-0): Mounting Filesystem
[ 1371.976922] XFS (dm-0): Ending clean mount
[ 1384.559103] XFS (dm-0): Mounting Filesystem
[ 1384.725657] XFS (dm-0): Ending clean mount
[ 1393.131347] XFS (dm-0): Mounting Filesystem
[ 1393.357927] XFS (dm-0): Ending clean mount
[ 1407.282708] XFS (dm-0): Mounting Filesystem
[ 1407.745176] XFS (dm-0): Ending clean mount
[ 1422.927074] XFS (dm-0): Mounting Filesystem
[ 1423.136266] XFS (dm-0): Ending clean mount
[ 1425.500910] XFS (dm-0): Mounting Filesystem
[ 1425.608851] XFS (dm-0): Ending clean mount
[ 1450.978110] XFS (dm-0): Mounting Filesystem
[ 1451.255368] XFS (dm-0): Ending clean mount
[ 1453.603742] XFS (dm-0): Mounting Filesystem
[ 1453.680657] XFS (dm-0): Ending clean mount
[ 1456.262266] XFS (dm-0): Mounting Filesystem
[ 1456.330515] XFS (dm-0): Ending clean mount
[ 1457.053767] XFS (dm-0): Mounting Filesystem
[ 1457.107258] XFS (dm-0): Ending clean mount
[ 1462.049374] XFS (dm-0): Mounting Filesystem
[ 1462.111389] XFS (dm-0): Ending clean mount
[ 1471.109589] ODEBUG: deactivate not available (active state 0) object type: ti
mer_list hint: process_timeout+0x0/0x8
[ 1471.109683] ------------[ cut here ]------------
[ 1471.109688] WARNING: at lib/debugobjects.c:260
[ 1471.109692] Modules linked in: lockd(F) sunrpc(F) nf_conntrack_netbios_ns(F)
nf_conntrack_broadcast(F) ipt_MASQUERADE(F) ip6table_nat(F) nf_nat_ipv6(F) ip6ta
ble_mangle(F) ip6t_REJECT(F) nf_conntrack_ipv6(F) nf_defrag_ipv6(F) iptable_nat(
F) nf_nat_ipv4(F) nf_nat(F) iptable_mangle(F) ipt_REJECT(F) nf_conntrack_ipv4(F)
 nf_defrag_ipv4(F) xt_conntrack(F) nf_conntrack(F) ebtable_filter(F) ebtables(F)
 ip6table_filter(F) ip6_tables(F) iptable_filter(F) ip_tables(F) sg(F) qeth_l2(F
) vmur(F) xfs(F) libcrc32c(F) dasd_fba_mod(F) dasd_eckd_mod(F) lcs(F) dasd_mod(F
) ctcm(F) qeth(F) qdio(F) ccwgroup(F) fsm(F) dm_mirror(F) dm_region_hash(F) dm_l
og(F) dm_mod(F)
[ 1471.109848] CPU: 0 Tainted: GF            3.9.3 #2
[ 1471.109858] Process swapper/0 (pid: 0, task: 0000000000a2b4d0, ksp: 000000000
0a17d28)
[ 1471.109868] Krnl PSW : 0404c00180000000 000000000046c84a (debug_print_object+
0xca/0xd8)
[ 1471.114762]            R:0 T:1 IO:0 EX:0 Key:0 M:1 W:0 P:0 AS:3 CC:0 PM:0 EA:
3
Krnl GPRS: 0000000000000000 0000000000a2b4d0 0000000000000067 000000000101f708
[ 1471.114769]            000000000046c846 0000000084a4d448 000000000086936a 000
0000001040700
[ 1471.114773]            0000000001a0f290 0400000000000001 0000000000874cf8 000
0000000a395d8
[ 1471.114777]            000000000195f820 000000000001bd20 000000000046c846 000
000000001bc20
[ 1471.114792] Krnl Code: 000000000046c83a: e34410000004        lg      %r4,0(%r
4,%r1)
           000000000046c840: c0e500139f88       brasl   %r14,6e0750
          #000000000046c846: a7f40001           brc     15,46c848
          >000000000046c84a: a7f4ffc2           brc     15,46c7ce
           000000000046c84e: a7290000           lghi    %r2,0
           000000000046c852: a7f4ffd7           brc     15,46c800
           000000000046c856: 0707               bcr     0,%r7
           000000000046c858: ebaff0680024       stmg    %r10,%r15,104(%r15)
[ 1471.114825] Call Trace:
[ 1471.114828] ([<000000000046c846>] debug_print_object+0xc6/0xd8)
[ 1471.114833]  [<000000000046d35c>] debug_object_deactivate+0x15c/0x160
[ 1471.114838]  [<0000000000148244>] run_timer_softirq+0x180/0x464
[ 1471.114843]  [<000000000013d8d6>] __do_softirq+0x112/0x42c
[ 1471.114847]  [<000000000013ddf8>] irq_exit+0xc8/0xe8
[ 1471.114851]  [<000000000010d55e>] do_extint+0x25e/0x318
[ 1471.114859]  [<00000000006f0d90>] ext_skip+0x40/0x44
[ 1471.114866]  [<00000000006f05d6>] vtime_stop_cpu+0x52/0xbc
[ 1471.114870] ([<00000000006f05b4>] vtime_stop_cpu+0x30/0xbc)
[ 1471.114874]  [<000000000010476e>] cpu_idle+0x112/0x1b8
[ 1471.114879]  [<0000000000aaf99a>] start_kernel+0x42e/0x43c
[ 1471.114885]  [<0000000000100020>] _stext+0x20/0x80
[ 1471.114891] 2 locks held by swapper/0/0:
[ 1471.114894]  #0:  (&(&base->lock)->rlock){-.-.-.}, at: [<0000000000148128>] r
un_timer_softirq+0x64/0x464
[ 1471.114908]  #1:  (&obj_hash[i].lock){-.-.-.}, at: [<000000000046d2ba>] debug
_object_deactivate+0xba/0x160
[ 1471.114918] Last Breaking-Event-Address:
[ 1471.114921]  [<000000000046c846>] debug_print_object+0xc6/0xd8
[ 1471.114927] ---[ end trace dd87895f75677361 ]---
[ 1471.117683] ODEBUG: object is on stack, but not annotated
[ 1471.117723] ------------[ cut here ]------------
[ 1471.117726] WARNING: at lib/debugobjects.c:300
[ 1471.117729] Modules linked in: lockd(F) sunrpc(F) nf_conntrack_netbios_ns(F)
nf_conntrack_broadcast(F) ipt_MASQUERADE(F) ip6table_nat(F) nf_nat_ipv6(F) ip6ta
ble_mangle(F) ip6t_REJECT(F) nf_conntrack_ipv6(F) nf_defrag_ipv6(F) iptable_nat(
F) nf_nat_ipv4(F) nf_nat(F) iptable_mangle(F) ipt_REJECT(F) nf_conntrack_ipv4(F)
 nf_defrag_ipv4(F) xt_conntrack(F) nf_conntrack(F) ebtable_filter(F) ebtables(F)
 ip6table_filter(F) ip6_tables(F) iptable_filter(F) ip_tables(F) sg(F) qeth_l2(F
) vmur(F) xfs(F) libcrc32c(F) dasd_fba_mod(F) dasd_eckd_mod(F) lcs(F) dasd_mod(F
) ctcm(F) qeth(F) qdio(F) ccwgroup(F) fsm(F) dm_mirror(F) dm_region_hash(F) dm_l
og(F) dm_mod(F)
[ 1471.117791] CPU: 0 Tainted: GF       W    3.9.3 #2
[ 1471.117794] Process rcu_sched (pid: 10, task: 00000000fffe8000, ksp: 00000000
fffe7ae0)
[ 1471.117797] Krnl PSW : 0404c00180000000 000000000046ccce (__debug_object_init
+0x35a/0x480)
[ 1471.117804]            R:0 T:1 IO:0 EX:0 Key:0 M:1 W:0 P:0 AS:3 CC:0 PM:0 EA:
3
Krnl GPRS: 0000000000000000 00000000fffe8000 000000000000002d 000000000101f708
[ 1471.117811]            000000000046ccca 0000000000000001 0000000000a395d8 070
0000000000000
[ 1471.117814]            0000000000a78750 0000000001a0f298 00000000fffe7cf0 000
00000f8daa938
[ 1471.117818]            0000000001a0f290 00000000007434c0 000000000046ccca 00
00000fffe7a40
[ 1471.117852] Krnl Code: 000000000046ccbe: c02000217617        larl    %r2,89b8
ec
           000000000046ccc4: c0e500139d46       brasl   %r14,6e0750
          #000000000046ccca: a7f40001           brc     15,46cccc
          >000000000046ccce: a7f4ff03           brc     15,46cad4
           000000000046ccd2: a7380000           lhi     %r3,0
           000000000046ccd6: a7f4ffe9           brc     15,46cca8
           000000000046ccda: c02000217622       larl    %r2,89b91e
           000000000046cce0: c0e500139d38       brasl   %r14,6e0750
[ 1471.117884] Call Trace:
[ 1471.117887] ([<000000000046ccca>] __debug_object_init+0x356/0x480)
[ 1471.117891]  [<0000000000146f42>] init_timer_key+0x3a/0x164
[ 1471.117897]  [<0000000000147eac>] timer_fixup_assert_init+0x50/0xa8
[ 1471.117901]  [<000000000046d7a4>] debug_object_assert_init+0x124/0x154
[ 1471.117906]  [<0000000000148676>] try_to_del_timer_sync+0x2e/0x94
[ 1471.117910]  [<000000000014878e>] del_timer_sync+0xb2/0x104
[ 1471.117915]  [<00000000006ea950>] schedule_timeout+0x190/0x37c
[ 1471.117922]  [<00000000001f1118>] rcu_gp_kthread+0x320/0x59c
[ 1471.117927]  [<0000000000164176>] kthread+0xe6/0xec
[ 1471.117934]  [<00000000006f0906>] kernel_thread_starter+0x6/0xc
[ 1471.117938]  [<00000000006f0900>] kernel_thread_starter+0x0/0xc
[ 1471.117943] 1 lock held by rcu_sched/10:
[ 1471.117945]  #0:  (&obj_hash[i].lock){-.-.-.}, at: [<000000000046caba>] __deb
ug_object_init+0x146/0x480
[ 1471.117955] Last Breaking-Event-Address:
[ 1471.117958]  [<000000000046ccca>] __debug_object_init+0x356/0x480
[ 1471.117962] ---[ end trace dd87895f75677362 ]---
[ 1471.117967] ODEBUG: assert_init not available (active state 0) object type: t
imer_list hint: stub_timer+0x0/0x8
[ 1471.117984] ------------[ cut here ]------------
[ 1471.117987] WARNING: at lib/debugobjects.c:260
[ 1471.117990] Modules linked in: lockd(F) sunrpc(F) nf_conntrack_netbios_ns(F)
nf_conntrack_broadcast(F) ipt_MASQUERADE(F) ip6table_nat(F) nf_nat_ipv6(F) ip6ta
ble_mangle(F) ip6t_REJECT(F) nf_conntrack_ipv6(F) nf_defrag_ipv6(F) iptable_nat(
F) nf_nat_ipv4(F) nf_nat(F) iptable_mangle(F) ipt_REJECT(F) nf_conntrack_ipv4(F)
 nf_defrag_ipv4(F) xt_conntrack(F) nf_conntrack(F) ebtable_filter(F) ebtables(F)
 ip6table_filter(F) ip6_tables(F) iptable_filter(F) ip_tables(F) sg(F) qeth_l2(F
) vmur(F) xfs(F) libcrc32c(F) dasd_fba_mod(F) dasd_eckd_mod(F) lcs(F) dasd_mod(F
) ctcm(F) qeth(F) qdio(F) ccwgroup(F) fsm(F) dm_mirror(F) dm_region_hash(F) dm_l
og(F) dm_mod(F)
[ 1471.118053] CPU: 0 Tainted: GF       W    3.9.3 #2
[ 1471.118075] Process rcu_sched (pid: 10, task: 00000000fffe8000, ksp: 00000000
fffe7ae0)
[ 1471.118079] Krnl PSW : 0704e00180000000 000000000046c84a (debug_print_object+
0xca/0xd8)
[ 1471.118085]            R:0 T:1 IO:1 EX:1 Key:0 M:1 W:0 P:0 AS:3 CC:2 PM:0 EA:
3
Krnl GPRS: 0000000000000000 00000000fffe8000 0000000000000063 0000000000000001
[ 1471.118092]            000000000046c846 0000000000000001 000000000086936a 000
0000000a1d300
[ 1471.118095]            0000000001a0f290 07000000006ed3fc 000000000089ba14 000
0000000a395d8
[ 1471.118099]            000000000195f820 00000000fffe7bc8 000000000046c846 000
00000fffe7ac8
[ 1471.118109] Krnl Code: 000000000046c83a: e34410000004        lg      %r4,0(%r
4,%r1)
           000000000046c840: c0e500139f88       brasl   %r14,6e0750
          #000000000046c846: a7f40001           brc     15,46c848
          >000000000046c84a: a7f4ffc2           brc     15,46c7ce
           000000000046c84e: a7290000           lghi    %r2,0
           000000000046c852: a7f4ffd7           brc     15,46c800
           000000000046c856: 0707               bcr     0,%r7
           000000000046c858: ebaff0680024       stmg    %r10,%r15,104(%r15)
[ 1471.118141] Call Trace:
[ 1471.118144] ([<000000000046c846>] debug_print_object+0xc6/0xd8)
[ 1471.118149]  [<000000000046d7ce>] debug_object_assert_init+0x14e/0x154
[ 1471.118153]  [<0000000000148676>] try_to_del_timer_sync+0x2e/0x94
[ 1471.118158]  [<000000000014878e>] del_timer_sync+0xb2/0x104
[ 1471.118163]  [<00000000006ea950>] schedule_timeout+0x190/0x37c
[ 1471.118167]  [<00000000001f1118>] rcu_gp_kthread+0x320/0x59c
[ 1471.118172]  [<0000000000164176>] kthread+0xe6/0xec
[ 1471.118176]  [<00000000006f0906>] kernel_thread_starter+0x6/0xc
[ 1471.118180]  [<00000000006f0900>] kernel_thread_starter+0x0/0xc
[ 1471.118185] no locks held by rcu_sched/10.
[ 1471.118188] Last Breaking-Event-Address:
[ 1471.118190]  [<000000000046c846>] debug_print_object+0xc6/0xd8
[ 1471.118195] ---[ end trace dd87895f75677363 ]---
[ 1471.121980] Unable to handle kernel pointer dereference at virtual kernel add
ress 000003ff8072c000
[ 1471.123714] Oops: 0011 [#1] SMP
[ 1471.123720] Modules linked in: lockd(F) sunrpc(F) nf_conntrack_netbios_ns(F)
nf_conntrack_broadcast(F) ipt_MASQUERADE(F) ip6table_nat(F) nf_nat_ipv6(F) ip6ta
ble_mangle(F) ip6t_REJECT(F) nf_conntrack_ipv6(F) nf_defrag_ipv6(F) iptable_nat(
F) nf_nat_ipv4(F) nf_nat(F) iptable_mangle(F) ipt_REJECT(F) nf_conntrack_ipv4(F)
 nf_defrag_ipv4(F) xt_conntrack(F) nf_conntrack(F) ebtable_filter(F) ebtables(F)
 ip6table_filter(F) ip6_tables(F) iptable_filter(F) ip_tables(F) sg(F) qeth_l2(F
) vmur(F) xfs(F) libcrc32c(F) dasd_fba_mod(F) dasd_eckd_mod(F) lcs(F) dasd_mod(F
) ctcm(F) qeth(F) qdio(F) ccwgroup(F) fsm(F) dm_mirror(F) dm_region_hash(F) dm_l
og(F) dm_mod(F)
[ 1471.123777] CPU: 0 Tainted: GF       W    3.9.3 #2
[ 1471.123781] Process systemd-journal (pid: 516, task: 00000000f5ad8000, ksp: 0
0000000f544fbd0)
[ 1471.123784] Krnl PSW : 0704e00180000000 000003ff80749cc2 (xfs_vm_page_mkwrite
+0x12/0x1c [xfs])
[ 1471.154773]            R:0 T:1 IO:1 EX:1 Key:0 M:1 W:0 P:0 AS:3 CC:2 PM:0 EA:
3
Krnl GPRS: 00000000000015e8 000003ff8072cc30 0000000000000001 0000000000000000
[ 1471.154781]            00000000006ef176 00000000007036b8 00000000f97b7d98 000
003fffb3e7000
[ 1471.154784]            000003fffb3e7000 00000000f8f65a00 00000000f8472738 000
00000f8ffc2c0
[ 1471.154788]            00000000f8472738 000000000070d5e0 000003ff80749cb6 000
00000f544fc28
[ 1471.154802] Krnl Code: 000003ff80749cb6: 1222                ltr     %r2,%r2
           000003ff80749cb8: a784ffd8           brc     8,3ff80749c68
          #000003ff80749cbc: c010ffff17ba       larl    %r1,3ff8072cc30
          >000003ff80749cc2: e31010000004       lg      %r1,0(%r1)
           000003ff80749cc8: e32010000012       lt      %r2,0(%r1)
           000003ff80749cce: a7740032           brc     7,3ff80749d32
           000003ff80749cd2: e31003180004       lg      %r1,792
           000003ff80749cd8: e330101c0012       lt      %r3,28(%r1)
[ 1471.154838] Call Trace:
[ 1471.154841] ([<0000000000261870>] do_wp_page+0x6f8/0xa4c)
[ 1471.154851]  [<0000000000262f2c>] handle_pte_fault+0x65c/0xa10
[ 1471.154856]  [<0000000000264a22>] handle_mm_fault+0x182/0x260
[ 1471.154860]  [<00000000006f21d0>] do_protection_exception+0x1b8/0x3cc
[ 1471.154867]  [<00000000006f0a44>] pgm_check_handler+0x138/0x13c
[ 1471.154871]  [<000000008001d60a>] 0x8001d60a
[ 1471.154894] INFO: lockdep is turned off.
[ 1471.154897] Last Breaking-Event-Address:
[ 1471.154899]  [<00000000001ed7ca>] rcu_lockdep_current_cpu_online+0x6e/0x84
[ 1471.154909]
[ 1471.154913] ---[ end trace dd87895f75677364 ]---
[ 1471.157438] ------------[ cut here ]------------
[ 1471.157446] Kernel BUG at 00000000006eb186 [verbose debug info unavailable]
[ 1471.157508] addressing exception: 0005 [#2] SMP
[ 1471.157514] Modules linked in: lockd(F) sunrpc(F) nf_conntrack_netbios_ns(F)
nf_conntrack_broadcast(F) ipt_MASQUERADE(F) ip6table_nat(F) nf_nat_ipv6(F) ip6ta
ble_mangle(F) ip6t_REJECT(F) nf_conntrack_ipv6(F) nf_defrag_ipv6(F) iptable_nat(
F) nf_nat_ipv4(F) nf_nat(F) iptable_mangle(F) ipt_REJECT(F) nf_conntrack_ipv4(F)
 nf_defrag_ipv4(F) xt_conntrack(F) nf_conntrack(F) ebtable_filter(F) ebtables(F)
 ip6table_filter(F) ip6_tables(F) iptable_filter(F) ip_tables(F) sg(F) qeth_l2(F
) vmur(F) xfs(F) libcrc32c(F) dasd_fba_mod(F) dasd_eckd_mod(F) lcs(F) dasd_mod(F
) ctcm(F) qeth(F) qdio(F) ccwgroup(F) fsm(F) dm_mirror(F) dm_region_hash(F) dm_l
og(F) dm_mod(F)
[ 1471.157574] CPU: 0 Tainted: GF     D W    3.9.3 #2
[ 1471.157577] Process systemd-journal (pid: 516, task: 00000000f5ad8000, ksp: 0
0000000f544f578)
[ 1471.157580] Krnl PSW : 0404d00180000000 00000000006eb186 (mutex_lock_nested+0
xde/0x36c)
[ 1471.157591]            R:0 T:1 IO:0 EX:0 Key:0 M:1 W:0 P:0 AS:3 CC:1 PM:0 EA:
3
Krnl GPRS: 000000000000166e 0000000000000000 00000000ffffffff 0000000000a78630
[ 1471.157597]            0000000000000000 0000000000000000 0000000000000002 000
00000100073f8
[ 1471.157601]            00000000f5ad8000 000000000101f708 0000000000000000 070
0000000000000
[ 1471.157604]            0000000010007388 0000000010007380 00000000006eb170 000
00000f544f658
[ 1471.157616] Krnl Code: 00000000006eb178: a7180000            lhi     %r1,0
           00000000006eb17c: c027ffffffff       xilf    %r2,4294967295
          #00000000006eb182: ba12d008           cs      %r1,%r2,8(%r13)
          >00000000006eb186: 1211               ltr     %r1,%r1
           00000000006eb188: a774011b           brc     7,6eb3be
           00000000006eb18c: e31090000012       lt      %r1,0(%r9)
           00000000006eb192: a7740007           brc     7,6eb1a0
           00000000006eb196: e3d0d0700020       cg      %r13,112(%r13)
[ 1471.157650] Call Trace:
[ 1471.157652] ([<00000000006eb170>] mutex_lock_nested+0xc8/0x36c)
[ 1471.157657]  [<0000000000260e3e>] unmap_single_vma+0x786/0x7d0
[ 1471.157663]  [<000000000026219c>] unmap_vmas+0x50/0x74
[ 1471.157666]  [<000000000026b7aa>] exit_mmap+0x186/0x224
[ 1471.157671]  [<000000000012ef88>] mmput+0x84/0x110
[ 1471.157677]  [<0000000000139fd2>] do_exit+0x2c6/0xce8
[ 1471.157682]  [<0000000000100ef2>] die+0x13e/0x158
[ 1471.157687]  [<000000000011dd6e>] do_no_context+0xba/0xf0
[ 1471.157692]  [<00000000006f2714>] do_dat_exception+0x330/0x390
[ 1471.157697]  [<00000000006f0a44>] pgm_check_handler+0x138/0x13c
[ 1471.157701]  [<000003ff80749cc2>] xfs_vm_page_mkwrite+0x12/0x1c [xfs]
[ 1471.157800] ([<0000000000261870>] do_wp_page+0x6f8/0xa4c)
[ 1471.157805]  [<0000000000262f2c>] handle_pte_fault+0x65c/0xa10
[ 1471.157809]  [<0000000000264a22>] handle_mm_fault+0x182/0x260
[ 1471.157813]  [<00000000006f21d0>] do_protection_exception+0x1b8/0x3cc
[ 1471.157817]  [<00000000006f0a44>] pgm_check_handler+0x138/0x13c
[ 1471.157821]  [<000000008001d60a>] 0x8001d60a
[ 1471.157827] INFO: lockdep is turned off.
[ 1471.157829] Last Breaking-Event-Address:
[ 1471.157832]  [<00000000001af490>] trace_hardirqs_off_caller+0x7c/0xb0
[ 1471.157840]
[ 1471.157857] ---[ end trace dd87895f75677365 ]---
[ 1471.157863] Fixing recursive fault but reboot is needed!
[ 1471.157867] BUG: scheduling while atomic: systemd-journal/516/0x00000002
[ 1471.157870] INFO: lockdep is turned off.
[ 1471.157872] Modules linked in: lockd(F) sunrpc(F) nf_conntrack_netbios_ns(F)
nf_conntrack_broadcast(F) ipt_MASQUERADE(F) ip6table_nat(F) nf_nat_ipv6(F) ip6ta
ble_mangle(F) ip6t_REJECT(F) nf_conntrack_ipv6(F) nf_defrag_ipv6(F) iptable_nat(
F) nf_nat_ipv4(F) nf_nat(F) iptable_mangle(F) ipt_REJECT(F) nf_conntrack_ipv4(F)
 nf_defrag_ipv4(F) xt_conntrack(F) nf_conntrack(F) ebtable_filter(F) ebtables(F)
 ip6table_filter(F) ip6_tables(F) iptable_filter(F) ip_tables(F) sg(F) qeth_l2(F
) vmur(F) xfs(F) libcrc32c(F) dasd_fba_mod(F) dasd_eckd_mod(F) lcs(F) dasd_mod(F
) ctcm(F) qeth(F) qdio(F) ccwgroup(F) fsm(F) dm_mirror(F) dm_region_hash(F) dm_l
og(F) dm_mod(F)
[ 1471.157928] CPU: 0 Tainted: GF     D W    3.9.3 #2
[ 1471.157931] Process systemd-journal (pid: 516, task: 00000000f5ad8000, ksp: 0
0000000f544f578)
[ 1471.157934]        00000000f544f168 00000000f544f178 0000000000000002 0000000
000000000
       00000000f544f208 00000000f544f180 00000000f544f180 0000000000100a1e
       0000000000000000 0000000000ffb000 000000000000000a 000000000000000a
       00000000f544f1c8 00000000f544f168 0000000000000000 0000000000000000
       00000000f5ad8000 0000000000100a1e 00000000f544f168 00000000f544f1b8
[ 1471.157966] Call Trace:
[ 1471.157968] ([<0000000000100920>] show_trace+0xf0/0x148)
[ 1471.157973]  [<00000000006e0e88>] __schedule_bug+0x80/0x98
[ 1471.157977]  [<00000000006ed9ea>] __schedule+0xc6a/0xcc8
[ 1471.157981]  [<000000000013a99e>] do_exit+0xc92/0xce8
[ 1471.157985]  [<0000000000100ef2>] die+0x13e/0x158
[ 1471.157990]  [<00000000006f02ac>] do_per_trap+0x0/0xb4
[ 1471.157993]  [<00000000006f0a44>] pgm_check_handler+0x138/0x13c
[ 1471.157997]  [<00000000006eb186>] mutex_lock_nested+0xde/0x36c
[ 1471.158002] ([<00000000006eb170>] mutex_lock_nested+0xc8/0x36c)
[ 1471.158006]  [<0000000000260e3e>] unmap_single_vma+0x786/0x7d0
[ 1471.158010]  [<000000000026219c>] unmap_vmas+0x50/0x74
[ 1471.158014]  [<000000000026b7aa>] exit_mmap+0x186/0x224
[ 1471.158018]  [<000000000012ef88>] mmput+0x84/0x110
[ 1471.158021]  [<0000000000139fd2>] do_exit+0x2c6/0xce8
[ 1471.158025]  [<0000000000100ef2>] die+0x13e/0x158
[ 1471.158029]  [<000000000011dd6e>] do_no_context+0xba/0xf0
[ 1471.158033]  [<00000000006f2714>] do_dat_exception+0x330/0x390
[ 1471.158037]  [<00000000006f0a44>] pgm_check_handler+0x138/0x13c
[ 1471.158041]  [<000003ff80749cc2>] xfs_vm_page_mkwrite+0x12/0x1c [xfs]
[ 1471.158070] ([<0000000000261870>] do_wp_page+0x6f8/0xa4c)
[ 1471.158074]  [<0000000000262f2c>] handle_pte_fault+0x65c/0xa10
[ 1471.158078]  [<0000000000264a22>] handle_mm_fault+0x182/0x260
[ 1471.158082]  [<00000000006f21d0>] do_protection_exception+0x1b8/0x3cc
[ 1471.158086]  [<00000000006f0a44>] pgm_check_handler+0x138/0x13c
[ 1471.158090]  [<000000008001d60a>] 0x8001d60a
[ 1471.158094] INFO: lockdep is turned off.
[ 1471.158221] Unable to handle kernel paging request at virtual user address
        (null)
[ 1471.158247] Oops: 0004 [#3] SMP
[ 1471.158251] Modules linked in: lockd(F) sunrpc(F) nf_conntrack_netbios_ns(F)
nf_conntrack_broadcast(F) ipt_MASQUERADE(F) ip6table_nat(F) nf_nat_ipv6(F) ip6ta
ble_mangle(F) ip6t_REJECT(F) nf_conntrack_ipv6(F) nf_defrag_ipv6(F) iptable_nat(
F) nf_nat_ipv4(F) nf_nat(F) iptable_mangle(F) ipt_REJECT(F) nf_conntrack_ipv4(F)
 nf_defrag_ipv4(F) xt_conntrack(F) nf_conntrack(F) ebtable_filter(F) ebtables(F)
 ip6table_filter(F) ip6_tables(F) iptable_filter(F) ip_tables(F) sg(F) qeth_l2(F
) vmur(F) xfs(F) libcrc32c(F) dasd_fba_mod(F) dasd_eckd_mod(F) lcs(F) dasd_mod(F
) ctcm(F) qeth(F) qdio(F) ccwgroup(F) fsm(F) dm_mirror(F) dm_region_hash(F) dm_l
og(F) dm_mod(F)
[ 1471.158306] CPU: 0 Tainted: GF     D W    3.9.3 #2
[ 1471.158309] Process in:imklog (pid: 765, task: 00000000fc7f2430, ksp: 0000000
0fc36fd50)
[ 1471.158312] Krnl PSW : 0404e00180000000 0000000000183520 (tg_load_down+0x60/0
xb0)
[ 1471.158320]            R:0 T:1 IO:0 EX:0 Key:0 M:1 W:0 P:0 AS:3 CC:2 PM:0 EA:
3
Krnl GPRS: 0000000000000040 0000000000000000 0000000000000000 0000000000000008
[ 1471.158327]            0000000000000000 0000000000000000 00000000f9253400 000
0000000aa0854
[ 1471.158330]            0000000000aa0852 00000000001834c0 0000000000170edc 000
0000000000000
[ 1471.158333]            00000000f9187e00 00000000fcd5c800 000000000017a330 000
00000fc36fb48
[ 1471.158343] Krnl Code: 0000000000183512: a7290000            lghi    %r2,0
           0000000000183516: e31310000004       lg      %r1,0(%r3,%r1)
          #000000000018351c: b9040045           lgr     %r4,%r5
          >0000000000183520: e34010980024       stg     %r4,152(%r1)
           0000000000183526: ebbcf0700004       lmg     %r11,%r12,112(%r15)
           000000000018352c: 07fe               bcr     15,%r14
           000000000018352e: eb430003000d       sllg    %r4,%r3,3
           0000000000183534: c05000451512       larl    %r5,a25f58
[ 1471.158377] Call Trace:
[ 1471.158380] ([<000000000018bd96>] load_balance+0x236/0xc7c)
[ 1471.158385]  [<000000000018be16>] load_balance+0x2b6/0xc7c
[ 1471.158389]  [<000000000018d0a2>] idle_balance+0x212/0x494
[ 1471.158393]  [<00000000006ed9ae>] __schedule+0xc2e/0xcc8
[ 1471.158397]  [<00000000006f0c30>] io_reschedule+0xa/0x1a
[ 1471.158402]  [<000003fffd3bea22>] 0x3fffd3bea22
[ 1471.158406] INFO: lockdep is turned off.
[ 1471.158408] Last Breaking-Event-Address:
[ 1471.158411]  [<000000000017a32e>] walk_tg_tree_from+0x46/0x190
[ 1471.158416]
[ 1471.158418] ---[ end trace dd87895f75677366 ]---
[ 1471.158423] BUG: sleeping function called from invalid context at kernel/rwse
m.c:20
[ 1471.158426] in_atomic(): 1, irqs_disabled(): 0, pid: 765, name: in:imklog
[ 1471.158429] INFO: lockdep is turned off.
[ 1471.158432] CPU: 0 Tainted: GF     D W    3.9.3 #2
[ 1471.158435] Process in:imklog (pid: 765, task: 00000000fc7f2430, ksp: 0000000
0fc36fd50)
[ 1471.158438]        00000000fc36f690 00000000fc36f6a0 0000000000000002 0000000
000000000
       00000000fc36f730 00000000fc36f6a8 00000000fc36f6a8 0000000000100a1e
       0000000000000000 00000000fc7f2870 000000000000000a 0404e0010000000a
       00000000fc36f6f0 00000000fc36f690 0000000000000000 0000000000000000
       00000000006fdbc8 0000000000100a1e 00000000fc36f690 00000000fc36f6e0
[ 1471.158470] Call Trace:
[ 1471.158473] ([<0000000000100920>] show_trace+0xf0/0x148)
[ 1471.158477]  [<0000000000175c0a>] __might_sleep+0x17a/0x238
[ 1471.158482]  [<00000000006ec4de>] down_read+0x42/0xe0
[ 1471.158487]  [<000000000014fea0>] exit_signals+0x38/0x144
[ 1471.158492]  [<0000000000139dd6>] do_exit+0xca/0xce8
[ 1471.158496]  [<0000000000100ef2>] die+0x13e/0x158
[ 1471.158500]  [<000000000011dd6e>] do_no_context+0xba/0xf0
[ 1471.158504]  [<00000000006f23de>] do_protection_exception+0x3c6/0x3cc
[ 1471.158508]  [<00000000006f0a44>] pgm_check_handler+0x138/0x13c
[ 1471.158513]  [<0000000000183520>] tg_load_down+0x60/0xb0
[ 1471.158517] ([<000000000018bd96>] load_balance+0x236/0xc7c)
[ 1471.158521]  [<000000000018be16>] load_balance+0x2b6/0xc7c
[ 1471.158525]  [<000000000018d0a2>] idle_balance+0x212/0x494
[ 1471.158529]  [<00000000006ed9ae>] __schedule+0xc2e/0xcc8
[ 1471.158533]  [<00000000006f0c30>] io_reschedule+0xa/0x1a
[ 1471.158537]  [<000003fffd3bea22>] 0x3fffd3bea22
[ 1471.158541] INFO: lockdep is turned off.
[ 1471.158545] note: in:imklog[765] exited with preempt_count 3
[ 1573.704911] Unable to handle kernel pointer dereference at virtual kernel add
ress 00c040077ff00000
[ 1573.705014] Oops: 0038 [#4] SMP
[ 1573.705029] Modules linked in: lockd(F) sunrpc(F) nf_conntrack_netbios_ns(F)
nf_conntrack_broadcast(F) ipt_MASQUERADE(F) ip6table_nat(F) nf_nat_ipv6(F) ip6ta
ble_mangle(F) ip6t_REJECT(F) nf_conntrack_ipv6(F) nf_defrag_ipv6(F) iptable_nat(
F) nf_nat_ipv4(F) nf_nat(F) iptable_mangle(F) ipt_REJECT(F) nf_conntrack_ipv4(F)
 nf_defrag_ipv4(F) xt_conntrack(F) nf_conntrack(F) ebtable_filter(F) ebtables(F)
 ip6table_filter(F) ip6_tables(F)00: HCPGSP2629I The virtual machine is placed i
n CP mode due to a SIGP stop from
 CPU 01.
 iptable_filter(F) ip_tables(F) sg(F) qeth_l2(F) vmur(F) xfs(F) libcrc32c(F) das
d_fba_mod(F) dasd_eckd_mod(F) lcs(F) dasd_mod(F) ctcm(F) qeth(F) qdio(F) ccwgrou
p(F) fsm(F) dm_mirror(F) dm_region_hash(F) dm_log(F) dm_mod(F)
[ 1573.705173] CPU: 1 Tainted: GF     D W    3.9.3 #2
[ 1573.705177] Process attr (pid: 22262, task: 00000000b1190000, ksp: 00000000b0
f43eb0)
[ 1573.705180] Krnl PSW : 0404e00180000000 000000000046c738 (lookup_object+0x20/
0x68)
[ 1573.705193]            R:0 T:1 IO:0 EX:0 Key:0 M:1 W:0 P:0 AS:3 CC:2 PM:0 EA:
3
Krnl GPRS: 7440000000000000 00c040077ff000e8 00000000fadd1eb8 0000000000000003
[ 1573.705199]            000000000046d2ba 0000000000000000 0000000084c1a828 000
0000084c1a868
[ 1573.705203]            0000000001964250 04000000001b5d64 0000000000a3bdf8 000
0000001964258
[ 1573.705206]            00000000fadd1eb8 00000000007434f0 000000000046d2cc 000
00000feb0b488
[ 1573.705221] Krnl Code: 000000000046c72c: a7380001            lhi     %r3,1
           000000000046c730: a7f40009           brc     15,46c742
          #000000000046c734: a73a0001           ahi     %r3,1
          >000000000046c738: e32010180020       cg      %r2,24(%r1)
           000000000046c73e: a7840016           brc     8,46c76a
           000000000046c742: e31010000002       ltg     %r1,0(%r1)
           000000000046c748: a774fff6           brc     7,46c734
           000000000046c74c: c010002dcfc0       larl    %r1,a266cc
[ 1573.705254] Call Trace:
[ 1573.705257] ([<000000000046d2ba>] debug_object_deactivate+0xba/0x160)
[ 1573.705263]  [<0000000000168890>] __run_hrtimer+0x58/0x474
[ 1573.705269]  [<0000000000169af2>] hrtimer_interrupt+0x116/0x2b0
[ 1573.705273]  [<0000000000104036>] clock_comparator_work+0x4a/0x54
[ 1573.705279]  [<000000000010d5b4>] do_extint+0x2b4/0x318
[ 1573.705285]  [<00000000006f0d90>] ext_skip+0x40/0x44
[ 1573.705292]  [<000003ff80108ba4>] ipt_do_table+0x298/0x8d0 [ip_tables]
[ 1573.705300] ([<000003ff801089d2>] ipt_do_table+0xc6/0x8d0 [ip_tables])
[ 1573.705305]  [<000003ff80165318>] nf_nat_ipv4_fn+0x1b4/0x354 [iptable_nat]
[ 1573.705310]  [<000003ff80165598>] nf_nat_ipv4_in+0x38/0x98 [iptable_nat]
[ 1573.705314]  [<00000000005d047c>] nf_iterate+0xd4/0x1b0
[ 1573.705903]  [<00000000005d0680>] nf_hook_slow+0x128/0x2cc
[ 1573.705909]  [<00000000005de956>] ip_rcv+0x272/0x360
[ 1573.705914]  [<0000000000592af6>] __netif_receive_skb_core+0xac2/0xe70
[ 1573.705919]  [<00000000005957e4>] netif_receive_skb+0x48/0x260
[ 1573.705924]  [<000003ff80095d16>] qeth_l2_poll+0x30a/0x540 [qeth_l2]
[ 1573.705938]  [<0000000000595f58>] net_rx_action+0x158/0x340
[ 1573.705942]  [<000000000013d8d6>] __do_softirq+0x112/0x42c
[ 1573.705947]  [<000000000013ddf8>] irq_exit+0xc8/0xe8
[ 1573.705951]  [<000000000010d55e>] do_extint+0x25e/0x318
[ 1573.705956]  [<00000000006f0d90>] ext_skip+0x40/0x44
[ 1573.705961]  [<000000000045a634>] memmove+0x14/0x5c
[ 1573.705968] ([<000003ff807d7a10>] __func__.41994+0x10/0xffffffffffffb600 [xfs
])
[ 1573.706104]  [<000003ff8076e6d2>] xfs_attr_leaf_compact+0xee/0x24c [xfs]
[ 1573.706224]  [<000003ff80772036>] xfs_attr_leaf_add+0x10a/0x268 [xfs]
[ 1573.706272]  [<000003ff8076a814>] xfs_attr_leaf_addname+0x11c/0x5a4 [xfs]
[ 1573.706303]  [<000003ff8076b028>] xfs_attr_set_int+0x38c/0x4b8 [xfs]
[ 1573.706333]  [<000003ff8076c9aa>] xfs_attr_set+0xb2/0xb8 [xfs]
[ 1573.706363]  [<000003ff8075d17e>] xfs_xattr_set+0x66/0xa0 [xfs]
[ 1573.706393]  [<00000000002d0a36>] generic_setxattr+0x7e/0x98
[ 1573.706400]  [<00000000002d1430>] __vfs_setxattr_noperm+0x78/0x1dc
[ 1573.706404]  [<00000000002d1652>] vfs_setxattr+0xbe/0xc4
[ 1573.706408]  [<00000000002d17b8>] setxattr+0x160/0x1c4
[ 1573.706412]  [<00000000002d1ba0>] SyS_lsetxattr+0xac/0xe8
[ 1573.706416]  [<00000000006f08b4>] sysc_tracego+0x14/0x1a
[ 1573.706420]  [<000003fffd2c2d1a>] 0x3fffd2c2d1a
[ 1573.706427] INFO: lockdep is turned off.
[ 1573.706430] Last Breaking-Event-Address:
[ 1573.706433]  [<000000000046c748>] lookup_object+0x30/0x68
[ 1573.706439]
[ 1573.706443] Kernel panic - not syncing: Fatal exception in interrupt
01: HCPGIR450W CP entered; disabled wait PSW 00020001 80000000 00000000 0010F384

----- Original Message -----
> From: "CAI Qian" <caiqian@redhat.com>
> To: "linux-s390" <linux-s390@vger.kernel.org>, linuxppc-dev@lists.ozlabs.org
> Cc: "LKML" <linux-kernel@vger.kernel.org>, stable@vger.kernel.org, xfs@oss.sgi.com, "Steve Best" <sbest@redhat.com>,
> "Hendrik Brueckner" <bhendrik@redhat.com>, "Dave Chinner" <david@fromorbit.com>
> Sent: Thursday, May 23, 2013 12:57:20 PM
> Subject: 3.9.2/3.9.3: stack overrun on s390x and ppc64 (WAS Re: 3.9.2: xfstests triggered panic)
> 
> Original report:
> http://oss.sgi.com/archives/xfs/2013-05/msg00683.html
> 
> Also seen on Power7:
> http://marc.info/?l=linux-kernel&m=136927904900692&w=2
> 
> CAI Qian
> 
> ----- Original Message -----
> > From: "Dave Chinner" <david@fromorbit.com>
> > To: "CAI Qian" <caiqian@redhat.com>
> > Cc: "LKML" <linux-kernel@vger.kernel.org>, stable@vger.kernel.org,
> > xfs@oss.sgi.com
> > Sent: Thursday, May 23, 2013 11:46:11 AM
> > Subject: Re: 3.9.2: xfstests triggered panic
> > 
> > On Wed, May 22, 2013 at 11:16:56PM -0400, CAI Qian wrote:
> > > ----- Original Message -----
> > > > From: "Dave Chinner" <david@fromorbit.com>
> > > > To: "CAI Qian" <caiqian@redhat.com>
> > > > Cc: "LKML" <linux-kernel@vger.kernel.org>, stable@vger.kernel.org,
> > > > xfs@oss.sgi.com
> > > > Sent: Wednesday, May 22, 2013 5:53:00 PM
> > > > Subject: Re: 3.9.2: xfstests triggered panic
> > > > 
> > > > On Wed, May 22, 2013 at 04:39:58AM -0400, CAI Qian wrote:
> > > > > Reproduced on almost all s390x guests by running xfstests.
> > > > > 
> > > > > 14634.396658¨ XFS (dm-1): Mounting Filesystem
> > > > > 14634.525522¨ XFS (dm-1): Ending clean mount
> > > > > 14640.413007¨  <000000000017c6d4>¨ idle_balance+0x1a0/0x340
> > > > > 14640.413010¨  <000000000063303e>¨ __schedule+0xa22/0xaf0
> > > > > 14640.428279¨  <0000000000630da6>¨ schedule_timeout+0x186/0x2c0
> > > > > 14640.428289¨  <00000000001cf864>¨ rcu_gp_kthread+0x1bc/0x298
> > > > > 14640.428300¨  <0000000000158c5a>¨ kthread+0xe6/0xec
> > > > > 14640.428304¨  <0000000000634de6>¨ kernel_thread_starter+0x6/0xc
> > > > > 14640.428308¨  <0000000000634de0>¨ kernel_thread_starter+0x0/0xc
> > > > > 14640.428311¨ Last Breaking-Event-Address:
> > > > > 14640.428314¨  <000000000016bd76>¨ walk_tg_tree_from+0x3a/0xf4
> > > > > 14640.428319¨  list_add corruption. next->prev should be prev
> > > > > (0000000000000918
> > > > > ), but was           (null). (next=          (null)).
> > > > 
> > > > Where's XFS in this? walk_tg_tree_from() is part of the scheduler
> > > > code. This kind of implies a stack corruption....
> > > > 
> > > > > Sometimes, this pops up,
> > > > > [16907.275002] WARNING: at kernel/rcutree.c:1960
> > > > > 
> > > > > or this,
> > > > > 15316.154171¨ XFS (dm-1): Mounting Filesystem
> > > > > 15316.255796¨ XFS (dm-1): Ending clean mount
> > > > > 15320.364246¨            00000000006367a2: e310b0080004        lg
> > > > > %r1,8(%r
> > > > > 11)
> > > > > 15320.364249¨            00000000006367a8: 41101010            la
> > > > > %r1,16(%
> > > > > r1)
> > > > > 15320.364251¨            00000000006367ac: e33010000004        lg
> > > > > %r3,0(%r
> > > > > 1)
> > > > > 15320.364252¨ Call Trace:
> > > > > 15320.364252¨ Last Breaking-Event-Address:
> > > > > 15320.364253¨  � <0000000000000000>¨ Kernel stack overflow.
> > > > > 15320.364308¨ CPU: 0 Tainted: GF       W    3.9.2 #1
> > > > > 15320.364309¨ Process rhts-test-runne (pid: 625, task:
> > > > > 000000003dccc890,
> > > > > ksp: 0
> > > > 
> > > > .... and there you go - a stack overflow. Your kernel stack size is
> > > > too small.
> > > > 
> > > > I'd suggest that you need 16k stacks on s390 - IIRC every function
> > > > call has 128 byte stack frame, and there are call chains 70-80
> > > > functions deep in the storage stack...
> > > Hmm, I am unsure how to set to 16k stack there
> > 
> > Are you build a 64 bit s390 kernel or a 32 bit kernel? 32 bit
> > kernels only have an 8k stack size, 64 bit kernels are 16k (see
> > arch/s390/Makefile).
> > 
> > $ git grep STACK_SIZE arch/s390 |head -2
> > arch/s390/Makefile:STACK_SIZE   := 8192
> > arch/s390/Makefile:STACK_SIZE   := 16384
> > 
> > As it is, the stack frame usage is worse than I thought:
> > 
> > $ git grep STACK_FRAME_OVERHEAD arch/s390 |head -2
> > arch/s390/include/uapi/asm/ptrace.h:#define STACK_FRAME_OVERHEAD 96      /*
> > size of minimum stack frame */
> > arch/s390/include/uapi/asm/ptrace.h:#define STACK_FRAME_OVERHEAD 160
> > /*
> > size of minimum stack frame */
> > 
> > Overhead is 96 bytes for 32 bit and 160 bytes for 64 bit. So 16k
> > stack size is going to have big troubles with a 70-80 function deep
> > call chain.
> > 
> > As for powerpc:
> > 
> > arch/powerpc/include/asm/ppc_asm.h:#define STACKFRAMESIZE 256
> > 
> > Yeah, same issue.
> > 
> > But, seriously, these stack traces are meaningless to anyone not
> > familiar with s390 or power7 - they indicate a problem detected
> > in the idle loop, not where ever the stack overran.
> > 
> > Can you please work with the s390/power7 people to obtain whatever
> > stack it was that overflowed, and we can go from there.
> > 
> > Cheers,
> > 
> > Dave.
> > --
> > Dave Chinner
> > david@fromorbit.com
> > 
> 

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: 3.9.2/3.9.3: stack overrun on s390x and ppc64 (WAS Re: 3.9.2: xfstests triggered panic)
@ 2013-05-24  3:33             ` CAI Qian
  0 siblings, 0 replies; 16+ messages in thread
From: CAI Qian @ 2013-05-24  3:33 UTC (permalink / raw)
  To: linux-s390, linuxppc-dev; +Cc: LKML, Steve Best, xfs, stable, Hendrik Brueckner

OK, here is clearer stack output from the run.
CAI Qian

+ ./check
FSTYP         -- xfs (non-debug)
PLATFORM      -- Linux/s390x ibm-z10-23 3.9.3

001	 29s
002	 3s
003	 2s
004	 [not run] this test requires a valid $SCRATCH_DEV
005	 2s
006	 9s
007	 10s
008	 7s
009	 [not run] this test requires a valid $SCRATCH_DEV
010	 [not run] dbtest was not built for this platform
011	 9s
012	 10s
013	 35s
014	 5s
015	 [not run] this test requires a valid $SCRATCH_DEV
016	 [not run] this test requires a valid $SCRATCH_DEV
017	 [not run] this test requires a valid $SCRATCH_DEV
018	 [not run] this test requires a valid $SCRATCH_DEV
019	 [not run] this test requires a valid $SCRATCH_DEV
020	


[ 1316.571227] XFS (dm-0): Mounting Filesystem
[ 1316.697803] XFS (dm-0): Ending clean mount
[ 1318.080615] XFS (dm-0): Ending clean mount
[ 1348.791125] XFS (dm-0): Mounting Filesystem
[ 1348.989166] XFS (dm-0): Ending clean mount
[ 1353.335478] XFS (dm-0): Mounting Filesystem
[ 1353.496364] XFS (dm-0): Ending clean mount
[ 1357.495427] XFS (dm-0): Mounting Filesystem
[ 1357.676971] XFS (dm-0): Ending clean mount
[ 1361.646399] XFS (dm-0): Mounting Filesystem
[ 1361.890426] XFS (dm-0): Ending clean mount
[ 1371.798944] XFS (dm-0): Mounting Filesystem
[ 1371.976922] XFS (dm-0): Ending clean mount
[ 1384.559103] XFS (dm-0): Mounting Filesystem
[ 1384.725657] XFS (dm-0): Ending clean mount
[ 1393.131347] XFS (dm-0): Mounting Filesystem
[ 1393.357927] XFS (dm-0): Ending clean mount
[ 1407.282708] XFS (dm-0): Mounting Filesystem
[ 1407.745176] XFS (dm-0): Ending clean mount
[ 1422.927074] XFS (dm-0): Mounting Filesystem
[ 1423.136266] XFS (dm-0): Ending clean mount
[ 1425.500910] XFS (dm-0): Mounting Filesystem
[ 1425.608851] XFS (dm-0): Ending clean mount
[ 1450.978110] XFS (dm-0): Mounting Filesystem
[ 1451.255368] XFS (dm-0): Ending clean mount
[ 1453.603742] XFS (dm-0): Mounting Filesystem
[ 1453.680657] XFS (dm-0): Ending clean mount
[ 1456.262266] XFS (dm-0): Mounting Filesystem
[ 1456.330515] XFS (dm-0): Ending clean mount
[ 1457.053767] XFS (dm-0): Mounting Filesystem
[ 1457.107258] XFS (dm-0): Ending clean mount
[ 1462.049374] XFS (dm-0): Mounting Filesystem
[ 1462.111389] XFS (dm-0): Ending clean mount
[ 1471.109589] ODEBUG: deactivate not available (active state 0) object type: ti
mer_list hint: process_timeout+0x0/0x8
[ 1471.109683] ------------[ cut here ]------------
[ 1471.109688] WARNING: at lib/debugobjects.c:260
[ 1471.109692] Modules linked in: lockd(F) sunrpc(F) nf_conntrack_netbios_ns(F)
nf_conntrack_broadcast(F) ipt_MASQUERADE(F) ip6table_nat(F) nf_nat_ipv6(F) ip6ta
ble_mangle(F) ip6t_REJECT(F) nf_conntrack_ipv6(F) nf_defrag_ipv6(F) iptable_nat(
F) nf_nat_ipv4(F) nf_nat(F) iptable_mangle(F) ipt_REJECT(F) nf_conntrack_ipv4(F)
 nf_defrag_ipv4(F) xt_conntrack(F) nf_conntrack(F) ebtable_filter(F) ebtables(F)
 ip6table_filter(F) ip6_tables(F) iptable_filter(F) ip_tables(F) sg(F) qeth_l2(F
) vmur(F) xfs(F) libcrc32c(F) dasd_fba_mod(F) dasd_eckd_mod(F) lcs(F) dasd_mod(F
) ctcm(F) qeth(F) qdio(F) ccwgroup(F) fsm(F) dm_mirror(F) dm_region_hash(F) dm_l
og(F) dm_mod(F)
[ 1471.109848] CPU: 0 Tainted: GF            3.9.3 #2
[ 1471.109858] Process swapper/0 (pid: 0, task: 0000000000a2b4d0, ksp: 000000000
0a17d28)
[ 1471.109868] Krnl PSW : 0404c00180000000 000000000046c84a (debug_print_object+
0xca/0xd8)
[ 1471.114762]            R:0 T:1 IO:0 EX:0 Key:0 M:1 W:0 P:0 AS:3 CC:0 PM:0 EA:
3
Krnl GPRS: 0000000000000000 0000000000a2b4d0 0000000000000067 000000000101f708
[ 1471.114769]            000000000046c846 0000000084a4d448 000000000086936a 000
0000001040700
[ 1471.114773]            0000000001a0f290 0400000000000001 0000000000874cf8 000
0000000a395d8
[ 1471.114777]            000000000195f820 000000000001bd20 000000000046c846 000
000000001bc20
[ 1471.114792] Krnl Code: 000000000046c83a: e34410000004        lg      %r4,0(%r
4,%r1)
           000000000046c840: c0e500139f88       brasl   %r14,6e0750
          #000000000046c846: a7f40001           brc     15,46c848
          >000000000046c84a: a7f4ffc2           brc     15,46c7ce
           000000000046c84e: a7290000           lghi    %r2,0
           000000000046c852: a7f4ffd7           brc     15,46c800
           000000000046c856: 0707               bcr     0,%r7
           000000000046c858: ebaff0680024       stmg    %r10,%r15,104(%r15)
[ 1471.114825] Call Trace:
[ 1471.114828] ([<000000000046c846>] debug_print_object+0xc6/0xd8)
[ 1471.114833]  [<000000000046d35c>] debug_object_deactivate+0x15c/0x160
[ 1471.114838]  [<0000000000148244>] run_timer_softirq+0x180/0x464
[ 1471.114843]  [<000000000013d8d6>] __do_softirq+0x112/0x42c
[ 1471.114847]  [<000000000013ddf8>] irq_exit+0xc8/0xe8
[ 1471.114851]  [<000000000010d55e>] do_extint+0x25e/0x318
[ 1471.114859]  [<00000000006f0d90>] ext_skip+0x40/0x44
[ 1471.114866]  [<00000000006f05d6>] vtime_stop_cpu+0x52/0xbc
[ 1471.114870] ([<00000000006f05b4>] vtime_stop_cpu+0x30/0xbc)
[ 1471.114874]  [<000000000010476e>] cpu_idle+0x112/0x1b8
[ 1471.114879]  [<0000000000aaf99a>] start_kernel+0x42e/0x43c
[ 1471.114885]  [<0000000000100020>] _stext+0x20/0x80
[ 1471.114891] 2 locks held by swapper/0/0:
[ 1471.114894]  #0:  (&(&base->lock)->rlock){-.-.-.}, at: [<0000000000148128>] r
un_timer_softirq+0x64/0x464
[ 1471.114908]  #1:  (&obj_hash[i].lock){-.-.-.}, at: [<000000000046d2ba>] debug
_object_deactivate+0xba/0x160
[ 1471.114918] Last Breaking-Event-Address:
[ 1471.114921]  [<000000000046c846>] debug_print_object+0xc6/0xd8
[ 1471.114927] ---[ end trace dd87895f75677361 ]---
[ 1471.117683] ODEBUG: object is on stack, but not annotated
[ 1471.117723] ------------[ cut here ]------------
[ 1471.117726] WARNING: at lib/debugobjects.c:300
[ 1471.117729] Modules linked in: lockd(F) sunrpc(F) nf_conntrack_netbios_ns(F)
nf_conntrack_broadcast(F) ipt_MASQUERADE(F) ip6table_nat(F) nf_nat_ipv6(F) ip6ta
ble_mangle(F) ip6t_REJECT(F) nf_conntrack_ipv6(F) nf_defrag_ipv6(F) iptable_nat(
F) nf_nat_ipv4(F) nf_nat(F) iptable_mangle(F) ipt_REJECT(F) nf_conntrack_ipv4(F)
 nf_defrag_ipv4(F) xt_conntrack(F) nf_conntrack(F) ebtable_filter(F) ebtables(F)
 ip6table_filter(F) ip6_tables(F) iptable_filter(F) ip_tables(F) sg(F) qeth_l2(F
) vmur(F) xfs(F) libcrc32c(F) dasd_fba_mod(F) dasd_eckd_mod(F) lcs(F) dasd_mod(F
) ctcm(F) qeth(F) qdio(F) ccwgroup(F) fsm(F) dm_mirror(F) dm_region_hash(F) dm_l
og(F) dm_mod(F)
[ 1471.117791] CPU: 0 Tainted: GF       W    3.9.3 #2
[ 1471.117794] Process rcu_sched (pid: 10, task: 00000000fffe8000, ksp: 00000000
fffe7ae0)
[ 1471.117797] Krnl PSW : 0404c00180000000 000000000046ccce (__debug_object_init
+0x35a/0x480)
[ 1471.117804]            R:0 T:1 IO:0 EX:0 Key:0 M:1 W:0 P:0 AS:3 CC:0 PM:0 EA:
3
Krnl GPRS: 0000000000000000 00000000fffe8000 000000000000002d 000000000101f708
[ 1471.117811]            000000000046ccca 0000000000000001 0000000000a395d8 070
0000000000000
[ 1471.117814]            0000000000a78750 0000000001a0f298 00000000fffe7cf0 000
00000f8daa938
[ 1471.117818]            0000000001a0f290 00000000007434c0 000000000046ccca 00
00000fffe7a40
[ 1471.117852] Krnl Code: 000000000046ccbe: c02000217617        larl    %r2,89b8
ec
           000000000046ccc4: c0e500139d46       brasl   %r14,6e0750
          #000000000046ccca: a7f40001           brc     15,46cccc
          >000000000046ccce: a7f4ff03           brc     15,46cad4
           000000000046ccd2: a7380000           lhi     %r3,0
           000000000046ccd6: a7f4ffe9           brc     15,46cca8
           000000000046ccda: c02000217622       larl    %r2,89b91e
           000000000046cce0: c0e500139d38       brasl   %r14,6e0750
[ 1471.117884] Call Trace:
[ 1471.117887] ([<000000000046ccca>] __debug_object_init+0x356/0x480)
[ 1471.117891]  [<0000000000146f42>] init_timer_key+0x3a/0x164
[ 1471.117897]  [<0000000000147eac>] timer_fixup_assert_init+0x50/0xa8
[ 1471.117901]  [<000000000046d7a4>] debug_object_assert_init+0x124/0x154
[ 1471.117906]  [<0000000000148676>] try_to_del_timer_sync+0x2e/0x94
[ 1471.117910]  [<000000000014878e>] del_timer_sync+0xb2/0x104
[ 1471.117915]  [<00000000006ea950>] schedule_timeout+0x190/0x37c
[ 1471.117922]  [<00000000001f1118>] rcu_gp_kthread+0x320/0x59c
[ 1471.117927]  [<0000000000164176>] kthread+0xe6/0xec
[ 1471.117934]  [<00000000006f0906>] kernel_thread_starter+0x6/0xc
[ 1471.117938]  [<00000000006f0900>] kernel_thread_starter+0x0/0xc
[ 1471.117943] 1 lock held by rcu_sched/10:
[ 1471.117945]  #0:  (&obj_hash[i].lock){-.-.-.}, at: [<000000000046caba>] __deb
ug_object_init+0x146/0x480
[ 1471.117955] Last Breaking-Event-Address:
[ 1471.117958]  [<000000000046ccca>] __debug_object_init+0x356/0x480
[ 1471.117962] ---[ end trace dd87895f75677362 ]---
[ 1471.117967] ODEBUG: assert_init not available (active state 0) object type: t
imer_list hint: stub_timer+0x0/0x8
[ 1471.117984] ------------[ cut here ]------------
[ 1471.117987] WARNING: at lib/debugobjects.c:260
[ 1471.117990] Modules linked in: lockd(F) sunrpc(F) nf_conntrack_netbios_ns(F)
nf_conntrack_broadcast(F) ipt_MASQUERADE(F) ip6table_nat(F) nf_nat_ipv6(F) ip6ta
ble_mangle(F) ip6t_REJECT(F) nf_conntrack_ipv6(F) nf_defrag_ipv6(F) iptable_nat(
F) nf_nat_ipv4(F) nf_nat(F) iptable_mangle(F) ipt_REJECT(F) nf_conntrack_ipv4(F)
 nf_defrag_ipv4(F) xt_conntrack(F) nf_conntrack(F) ebtable_filter(F) ebtables(F)
 ip6table_filter(F) ip6_tables(F) iptable_filter(F) ip_tables(F) sg(F) qeth_l2(F
) vmur(F) xfs(F) libcrc32c(F) dasd_fba_mod(F) dasd_eckd_mod(F) lcs(F) dasd_mod(F
) ctcm(F) qeth(F) qdio(F) ccwgroup(F) fsm(F) dm_mirror(F) dm_region_hash(F) dm_l
og(F) dm_mod(F)
[ 1471.118053] CPU: 0 Tainted: GF       W    3.9.3 #2
[ 1471.118075] Process rcu_sched (pid: 10, task: 00000000fffe8000, ksp: 00000000
fffe7ae0)
[ 1471.118079] Krnl PSW : 0704e00180000000 000000000046c84a (debug_print_object+
0xca/0xd8)
[ 1471.118085]            R:0 T:1 IO:1 EX:1 Key:0 M:1 W:0 P:0 AS:3 CC:2 PM:0 EA:
3
Krnl GPRS: 0000000000000000 00000000fffe8000 0000000000000063 0000000000000001
[ 1471.118092]            000000000046c846 0000000000000001 000000000086936a 000
0000000a1d300
[ 1471.118095]            0000000001a0f290 07000000006ed3fc 000000000089ba14 000
0000000a395d8
[ 1471.118099]            000000000195f820 00000000fffe7bc8 000000000046c846 000
00000fffe7ac8
[ 1471.118109] Krnl Code: 000000000046c83a: e34410000004        lg      %r4,0(%r
4,%r1)
           000000000046c840: c0e500139f88       brasl   %r14,6e0750
          #000000000046c846: a7f40001           brc     15,46c848
          >000000000046c84a: a7f4ffc2           brc     15,46c7ce
           000000000046c84e: a7290000           lghi    %r2,0
           000000000046c852: a7f4ffd7           brc     15,46c800
           000000000046c856: 0707               bcr     0,%r7
           000000000046c858: ebaff0680024       stmg    %r10,%r15,104(%r15)
[ 1471.118141] Call Trace:
[ 1471.118144] ([<000000000046c846>] debug_print_object+0xc6/0xd8)
[ 1471.118149]  [<000000000046d7ce>] debug_object_assert_init+0x14e/0x154
[ 1471.118153]  [<0000000000148676>] try_to_del_timer_sync+0x2e/0x94
[ 1471.118158]  [<000000000014878e>] del_timer_sync+0xb2/0x104
[ 1471.118163]  [<00000000006ea950>] schedule_timeout+0x190/0x37c
[ 1471.118167]  [<00000000001f1118>] rcu_gp_kthread+0x320/0x59c
[ 1471.118172]  [<0000000000164176>] kthread+0xe6/0xec
[ 1471.118176]  [<00000000006f0906>] kernel_thread_starter+0x6/0xc
[ 1471.118180]  [<00000000006f0900>] kernel_thread_starter+0x0/0xc
[ 1471.118185] no locks held by rcu_sched/10.
[ 1471.118188] Last Breaking-Event-Address:
[ 1471.118190]  [<000000000046c846>] debug_print_object+0xc6/0xd8
[ 1471.118195] ---[ end trace dd87895f75677363 ]---
[ 1471.121980] Unable to handle kernel pointer dereference at virtual kernel add
ress 000003ff8072c000
[ 1471.123714] Oops: 0011 [#1] SMP
[ 1471.123720] Modules linked in: lockd(F) sunrpc(F) nf_conntrack_netbios_ns(F)
nf_conntrack_broadcast(F) ipt_MASQUERADE(F) ip6table_nat(F) nf_nat_ipv6(F) ip6ta
ble_mangle(F) ip6t_REJECT(F) nf_conntrack_ipv6(F) nf_defrag_ipv6(F) iptable_nat(
F) nf_nat_ipv4(F) nf_nat(F) iptable_mangle(F) ipt_REJECT(F) nf_conntrack_ipv4(F)
 nf_defrag_ipv4(F) xt_conntrack(F) nf_conntrack(F) ebtable_filter(F) ebtables(F)
 ip6table_filter(F) ip6_tables(F) iptable_filter(F) ip_tables(F) sg(F) qeth_l2(F
) vmur(F) xfs(F) libcrc32c(F) dasd_fba_mod(F) dasd_eckd_mod(F) lcs(F) dasd_mod(F
) ctcm(F) qeth(F) qdio(F) ccwgroup(F) fsm(F) dm_mirror(F) dm_region_hash(F) dm_l
og(F) dm_mod(F)
[ 1471.123777] CPU: 0 Tainted: GF       W    3.9.3 #2
[ 1471.123781] Process systemd-journal (pid: 516, task: 00000000f5ad8000, ksp: 0
0000000f544fbd0)
[ 1471.123784] Krnl PSW : 0704e00180000000 000003ff80749cc2 (xfs_vm_page_mkwrite
+0x12/0x1c [xfs])
[ 1471.154773]            R:0 T:1 IO:1 EX:1 Key:0 M:1 W:0 P:0 AS:3 CC:2 PM:0 EA:
3
Krnl GPRS: 00000000000015e8 000003ff8072cc30 0000000000000001 0000000000000000
[ 1471.154781]            00000000006ef176 00000000007036b8 00000000f97b7d98 000
003fffb3e7000
[ 1471.154784]            000003fffb3e7000 00000000f8f65a00 00000000f8472738 000
00000f8ffc2c0
[ 1471.154788]            00000000f8472738 000000000070d5e0 000003ff80749cb6 000
00000f544fc28
[ 1471.154802] Krnl Code: 000003ff80749cb6: 1222                ltr     %r2,%r2
           000003ff80749cb8: a784ffd8           brc     8,3ff80749c68
          #000003ff80749cbc: c010ffff17ba       larl    %r1,3ff8072cc30
          >000003ff80749cc2: e31010000004       lg      %r1,0(%r1)
           000003ff80749cc8: e32010000012       lt      %r2,0(%r1)
           000003ff80749cce: a7740032           brc     7,3ff80749d32
           000003ff80749cd2: e31003180004       lg      %r1,792
           000003ff80749cd8: e330101c0012       lt      %r3,28(%r1)
[ 1471.154838] Call Trace:
[ 1471.154841] ([<0000000000261870>] do_wp_page+0x6f8/0xa4c)
[ 1471.154851]  [<0000000000262f2c>] handle_pte_fault+0x65c/0xa10
[ 1471.154856]  [<0000000000264a22>] handle_mm_fault+0x182/0x260
[ 1471.154860]  [<00000000006f21d0>] do_protection_exception+0x1b8/0x3cc
[ 1471.154867]  [<00000000006f0a44>] pgm_check_handler+0x138/0x13c
[ 1471.154871]  [<000000008001d60a>] 0x8001d60a
[ 1471.154894] INFO: lockdep is turned off.
[ 1471.154897] Last Breaking-Event-Address:
[ 1471.154899]  [<00000000001ed7ca>] rcu_lockdep_current_cpu_online+0x6e/0x84
[ 1471.154909]
[ 1471.154913] ---[ end trace dd87895f75677364 ]---
[ 1471.157438] ------------[ cut here ]------------
[ 1471.157446] Kernel BUG at 00000000006eb186 [verbose debug info unavailable]
[ 1471.157508] addressing exception: 0005 [#2] SMP
[ 1471.157514] Modules linked in: lockd(F) sunrpc(F) nf_conntrack_netbios_ns(F)
nf_conntrack_broadcast(F) ipt_MASQUERADE(F) ip6table_nat(F) nf_nat_ipv6(F) ip6ta
ble_mangle(F) ip6t_REJECT(F) nf_conntrack_ipv6(F) nf_defrag_ipv6(F) iptable_nat(
F) nf_nat_ipv4(F) nf_nat(F) iptable_mangle(F) ipt_REJECT(F) nf_conntrack_ipv4(F)
 nf_defrag_ipv4(F) xt_conntrack(F) nf_conntrack(F) ebtable_filter(F) ebtables(F)
 ip6table_filter(F) ip6_tables(F) iptable_filter(F) ip_tables(F) sg(F) qeth_l2(F
) vmur(F) xfs(F) libcrc32c(F) dasd_fba_mod(F) dasd_eckd_mod(F) lcs(F) dasd_mod(F
) ctcm(F) qeth(F) qdio(F) ccwgroup(F) fsm(F) dm_mirror(F) dm_region_hash(F) dm_l
og(F) dm_mod(F)
[ 1471.157574] CPU: 0 Tainted: GF     D W    3.9.3 #2
[ 1471.157577] Process systemd-journal (pid: 516, task: 00000000f5ad8000, ksp: 0
0000000f544f578)
[ 1471.157580] Krnl PSW : 0404d00180000000 00000000006eb186 (mutex_lock_nested+0
xde/0x36c)
[ 1471.157591]            R:0 T:1 IO:0 EX:0 Key:0 M:1 W:0 P:0 AS:3 CC:1 PM:0 EA:
3
Krnl GPRS: 000000000000166e 0000000000000000 00000000ffffffff 0000000000a78630
[ 1471.157597]            0000000000000000 0000000000000000 0000000000000002 000
00000100073f8
[ 1471.157601]            00000000f5ad8000 000000000101f708 0000000000000000 070
0000000000000
[ 1471.157604]            0000000010007388 0000000010007380 00000000006eb170 000
00000f544f658
[ 1471.157616] Krnl Code: 00000000006eb178: a7180000            lhi     %r1,0
           00000000006eb17c: c027ffffffff       xilf    %r2,4294967295
          #00000000006eb182: ba12d008           cs      %r1,%r2,8(%r13)
          >00000000006eb186: 1211               ltr     %r1,%r1
           00000000006eb188: a774011b           brc     7,6eb3be
           00000000006eb18c: e31090000012       lt      %r1,0(%r9)
           00000000006eb192: a7740007           brc     7,6eb1a0
           00000000006eb196: e3d0d0700020       cg      %r13,112(%r13)
[ 1471.157650] Call Trace:
[ 1471.157652] ([<00000000006eb170>] mutex_lock_nested+0xc8/0x36c)
[ 1471.157657]  [<0000000000260e3e>] unmap_single_vma+0x786/0x7d0
[ 1471.157663]  [<000000000026219c>] unmap_vmas+0x50/0x74
[ 1471.157666]  [<000000000026b7aa>] exit_mmap+0x186/0x224
[ 1471.157671]  [<000000000012ef88>] mmput+0x84/0x110
[ 1471.157677]  [<0000000000139fd2>] do_exit+0x2c6/0xce8
[ 1471.157682]  [<0000000000100ef2>] die+0x13e/0x158
[ 1471.157687]  [<000000000011dd6e>] do_no_context+0xba/0xf0
[ 1471.157692]  [<00000000006f2714>] do_dat_exception+0x330/0x390
[ 1471.157697]  [<00000000006f0a44>] pgm_check_handler+0x138/0x13c
[ 1471.157701]  [<000003ff80749cc2>] xfs_vm_page_mkwrite+0x12/0x1c [xfs]
[ 1471.157800] ([<0000000000261870>] do_wp_page+0x6f8/0xa4c)
[ 1471.157805]  [<0000000000262f2c>] handle_pte_fault+0x65c/0xa10
[ 1471.157809]  [<0000000000264a22>] handle_mm_fault+0x182/0x260
[ 1471.157813]  [<00000000006f21d0>] do_protection_exception+0x1b8/0x3cc
[ 1471.157817]  [<00000000006f0a44>] pgm_check_handler+0x138/0x13c
[ 1471.157821]  [<000000008001d60a>] 0x8001d60a
[ 1471.157827] INFO: lockdep is turned off.
[ 1471.157829] Last Breaking-Event-Address:
[ 1471.157832]  [<00000000001af490>] trace_hardirqs_off_caller+0x7c/0xb0
[ 1471.157840]
[ 1471.157857] ---[ end trace dd87895f75677365 ]---
[ 1471.157863] Fixing recursive fault but reboot is needed!
[ 1471.157867] BUG: scheduling while atomic: systemd-journal/516/0x00000002
[ 1471.157870] INFO: lockdep is turned off.
[ 1471.157872] Modules linked in: lockd(F) sunrpc(F) nf_conntrack_netbios_ns(F)
nf_conntrack_broadcast(F) ipt_MASQUERADE(F) ip6table_nat(F) nf_nat_ipv6(F) ip6ta
ble_mangle(F) ip6t_REJECT(F) nf_conntrack_ipv6(F) nf_defrag_ipv6(F) iptable_nat(
F) nf_nat_ipv4(F) nf_nat(F) iptable_mangle(F) ipt_REJECT(F) nf_conntrack_ipv4(F)
 nf_defrag_ipv4(F) xt_conntrack(F) nf_conntrack(F) ebtable_filter(F) ebtables(F)
 ip6table_filter(F) ip6_tables(F) iptable_filter(F) ip_tables(F) sg(F) qeth_l2(F
) vmur(F) xfs(F) libcrc32c(F) dasd_fba_mod(F) dasd_eckd_mod(F) lcs(F) dasd_mod(F
) ctcm(F) qeth(F) qdio(F) ccwgroup(F) fsm(F) dm_mirror(F) dm_region_hash(F) dm_l
og(F) dm_mod(F)
[ 1471.157928] CPU: 0 Tainted: GF     D W    3.9.3 #2
[ 1471.157931] Process systemd-journal (pid: 516, task: 00000000f5ad8000, ksp: 0
0000000f544f578)
[ 1471.157934]        00000000f544f168 00000000f544f178 0000000000000002 0000000
000000000
       00000000f544f208 00000000f544f180 00000000f544f180 0000000000100a1e
       0000000000000000 0000000000ffb000 000000000000000a 000000000000000a
       00000000f544f1c8 00000000f544f168 0000000000000000 0000000000000000
       00000000f5ad8000 0000000000100a1e 00000000f544f168 00000000f544f1b8
[ 1471.157966] Call Trace:
[ 1471.157968] ([<0000000000100920>] show_trace+0xf0/0x148)
[ 1471.157973]  [<00000000006e0e88>] __schedule_bug+0x80/0x98
[ 1471.157977]  [<00000000006ed9ea>] __schedule+0xc6a/0xcc8
[ 1471.157981]  [<000000000013a99e>] do_exit+0xc92/0xce8
[ 1471.157985]  [<0000000000100ef2>] die+0x13e/0x158
[ 1471.157990]  [<00000000006f02ac>] do_per_trap+0x0/0xb4
[ 1471.157993]  [<00000000006f0a44>] pgm_check_handler+0x138/0x13c
[ 1471.157997]  [<00000000006eb186>] mutex_lock_nested+0xde/0x36c
[ 1471.158002] ([<00000000006eb170>] mutex_lock_nested+0xc8/0x36c)
[ 1471.158006]  [<0000000000260e3e>] unmap_single_vma+0x786/0x7d0
[ 1471.158010]  [<000000000026219c>] unmap_vmas+0x50/0x74
[ 1471.158014]  [<000000000026b7aa>] exit_mmap+0x186/0x224
[ 1471.158018]  [<000000000012ef88>] mmput+0x84/0x110
[ 1471.158021]  [<0000000000139fd2>] do_exit+0x2c6/0xce8
[ 1471.158025]  [<0000000000100ef2>] die+0x13e/0x158
[ 1471.158029]  [<000000000011dd6e>] do_no_context+0xba/0xf0
[ 1471.158033]  [<00000000006f2714>] do_dat_exception+0x330/0x390
[ 1471.158037]  [<00000000006f0a44>] pgm_check_handler+0x138/0x13c
[ 1471.158041]  [<000003ff80749cc2>] xfs_vm_page_mkwrite+0x12/0x1c [xfs]
[ 1471.158070] ([<0000000000261870>] do_wp_page+0x6f8/0xa4c)
[ 1471.158074]  [<0000000000262f2c>] handle_pte_fault+0x65c/0xa10
[ 1471.158078]  [<0000000000264a22>] handle_mm_fault+0x182/0x260
[ 1471.158082]  [<00000000006f21d0>] do_protection_exception+0x1b8/0x3cc
[ 1471.158086]  [<00000000006f0a44>] pgm_check_handler+0x138/0x13c
[ 1471.158090]  [<000000008001d60a>] 0x8001d60a
[ 1471.158094] INFO: lockdep is turned off.
[ 1471.158221] Unable to handle kernel paging request at virtual user address
        (null)
[ 1471.158247] Oops: 0004 [#3] SMP
[ 1471.158251] Modules linked in: lockd(F) sunrpc(F) nf_conntrack_netbios_ns(F)
nf_conntrack_broadcast(F) ipt_MASQUERADE(F) ip6table_nat(F) nf_nat_ipv6(F) ip6ta
ble_mangle(F) ip6t_REJECT(F) nf_conntrack_ipv6(F) nf_defrag_ipv6(F) iptable_nat(
F) nf_nat_ipv4(F) nf_nat(F) iptable_mangle(F) ipt_REJECT(F) nf_conntrack_ipv4(F)
 nf_defrag_ipv4(F) xt_conntrack(F) nf_conntrack(F) ebtable_filter(F) ebtables(F)
 ip6table_filter(F) ip6_tables(F) iptable_filter(F) ip_tables(F) sg(F) qeth_l2(F
) vmur(F) xfs(F) libcrc32c(F) dasd_fba_mod(F) dasd_eckd_mod(F) lcs(F) dasd_mod(F
) ctcm(F) qeth(F) qdio(F) ccwgroup(F) fsm(F) dm_mirror(F) dm_region_hash(F) dm_l
og(F) dm_mod(F)
[ 1471.158306] CPU: 0 Tainted: GF     D W    3.9.3 #2
[ 1471.158309] Process in:imklog (pid: 765, task: 00000000fc7f2430, ksp: 0000000
0fc36fd50)
[ 1471.158312] Krnl PSW : 0404e00180000000 0000000000183520 (tg_load_down+0x60/0
xb0)
[ 1471.158320]            R:0 T:1 IO:0 EX:0 Key:0 M:1 W:0 P:0 AS:3 CC:2 PM:0 EA:
3
Krnl GPRS: 0000000000000040 0000000000000000 0000000000000000 0000000000000008
[ 1471.158327]            0000000000000000 0000000000000000 00000000f9253400 000
0000000aa0854
[ 1471.158330]            0000000000aa0852 00000000001834c0 0000000000170edc 000
0000000000000
[ 1471.158333]            00000000f9187e00 00000000fcd5c800 000000000017a330 000
00000fc36fb48
[ 1471.158343] Krnl Code: 0000000000183512: a7290000            lghi    %r2,0
           0000000000183516: e31310000004       lg      %r1,0(%r3,%r1)
          #000000000018351c: b9040045           lgr     %r4,%r5
          >0000000000183520: e34010980024       stg     %r4,152(%r1)
           0000000000183526: ebbcf0700004       lmg     %r11,%r12,112(%r15)
           000000000018352c: 07fe               bcr     15,%r14
           000000000018352e: eb430003000d       sllg    %r4,%r3,3
           0000000000183534: c05000451512       larl    %r5,a25f58
[ 1471.158377] Call Trace:
[ 1471.158380] ([<000000000018bd96>] load_balance+0x236/0xc7c)
[ 1471.158385]  [<000000000018be16>] load_balance+0x2b6/0xc7c
[ 1471.158389]  [<000000000018d0a2>] idle_balance+0x212/0x494
[ 1471.158393]  [<00000000006ed9ae>] __schedule+0xc2e/0xcc8
[ 1471.158397]  [<00000000006f0c30>] io_reschedule+0xa/0x1a
[ 1471.158402]  [<000003fffd3bea22>] 0x3fffd3bea22
[ 1471.158406] INFO: lockdep is turned off.
[ 1471.158408] Last Breaking-Event-Address:
[ 1471.158411]  [<000000000017a32e>] walk_tg_tree_from+0x46/0x190
[ 1471.158416]
[ 1471.158418] ---[ end trace dd87895f75677366 ]---
[ 1471.158423] BUG: sleeping function called from invalid context at kernel/rwse
m.c:20
[ 1471.158426] in_atomic(): 1, irqs_disabled(): 0, pid: 765, name: in:imklog
[ 1471.158429] INFO: lockdep is turned off.
[ 1471.158432] CPU: 0 Tainted: GF     D W    3.9.3 #2
[ 1471.158435] Process in:imklog (pid: 765, task: 00000000fc7f2430, ksp: 0000000
0fc36fd50)
[ 1471.158438]        00000000fc36f690 00000000fc36f6a0 0000000000000002 0000000
000000000
       00000000fc36f730 00000000fc36f6a8 00000000fc36f6a8 0000000000100a1e
       0000000000000000 00000000fc7f2870 000000000000000a 0404e0010000000a
       00000000fc36f6f0 00000000fc36f690 0000000000000000 0000000000000000
       00000000006fdbc8 0000000000100a1e 00000000fc36f690 00000000fc36f6e0
[ 1471.158470] Call Trace:
[ 1471.158473] ([<0000000000100920>] show_trace+0xf0/0x148)
[ 1471.158477]  [<0000000000175c0a>] __might_sleep+0x17a/0x238
[ 1471.158482]  [<00000000006ec4de>] down_read+0x42/0xe0
[ 1471.158487]  [<000000000014fea0>] exit_signals+0x38/0x144
[ 1471.158492]  [<0000000000139dd6>] do_exit+0xca/0xce8
[ 1471.158496]  [<0000000000100ef2>] die+0x13e/0x158
[ 1471.158500]  [<000000000011dd6e>] do_no_context+0xba/0xf0
[ 1471.158504]  [<00000000006f23de>] do_protection_exception+0x3c6/0x3cc
[ 1471.158508]  [<00000000006f0a44>] pgm_check_handler+0x138/0x13c
[ 1471.158513]  [<0000000000183520>] tg_load_down+0x60/0xb0
[ 1471.158517] ([<000000000018bd96>] load_balance+0x236/0xc7c)
[ 1471.158521]  [<000000000018be16>] load_balance+0x2b6/0xc7c
[ 1471.158525]  [<000000000018d0a2>] idle_balance+0x212/0x494
[ 1471.158529]  [<00000000006ed9ae>] __schedule+0xc2e/0xcc8
[ 1471.158533]  [<00000000006f0c30>] io_reschedule+0xa/0x1a
[ 1471.158537]  [<000003fffd3bea22>] 0x3fffd3bea22
[ 1471.158541] INFO: lockdep is turned off.
[ 1471.158545] note: in:imklog[765] exited with preempt_count 3
[ 1573.704911] Unable to handle kernel pointer dereference at virtual kernel add
ress 00c040077ff00000
[ 1573.705014] Oops: 0038 [#4] SMP
[ 1573.705029] Modules linked in: lockd(F) sunrpc(F) nf_conntrack_netbios_ns(F)
nf_conntrack_broadcast(F) ipt_MASQUERADE(F) ip6table_nat(F) nf_nat_ipv6(F) ip6ta
ble_mangle(F) ip6t_REJECT(F) nf_conntrack_ipv6(F) nf_defrag_ipv6(F) iptable_nat(
F) nf_nat_ipv4(F) nf_nat(F) iptable_mangle(F) ipt_REJECT(F) nf_conntrack_ipv4(F)
 nf_defrag_ipv4(F) xt_conntrack(F) nf_conntrack(F) ebtable_filter(F) ebtables(F)
 ip6table_filter(F) ip6_tables(F)00: HCPGSP2629I The virtual machine is placed i
n CP mode due to a SIGP stop from
 CPU 01.
 iptable_filter(F) ip_tables(F) sg(F) qeth_l2(F) vmur(F) xfs(F) libcrc32c(F) das
d_fba_mod(F) dasd_eckd_mod(F) lcs(F) dasd_mod(F) ctcm(F) qeth(F) qdio(F) ccwgrou
p(F) fsm(F) dm_mirror(F) dm_region_hash(F) dm_log(F) dm_mod(F)
[ 1573.705173] CPU: 1 Tainted: GF     D W    3.9.3 #2
[ 1573.705177] Process attr (pid: 22262, task: 00000000b1190000, ksp: 00000000b0
f43eb0)
[ 1573.705180] Krnl PSW : 0404e00180000000 000000000046c738 (lookup_object+0x20/
0x68)
[ 1573.705193]            R:0 T:1 IO:0 EX:0 Key:0 M:1 W:0 P:0 AS:3 CC:2 PM:0 EA:
3
Krnl GPRS: 7440000000000000 00c040077ff000e8 00000000fadd1eb8 0000000000000003
[ 1573.705199]            000000000046d2ba 0000000000000000 0000000084c1a828 000
0000084c1a868
[ 1573.705203]            0000000001964250 04000000001b5d64 0000000000a3bdf8 000
0000001964258
[ 1573.705206]            00000000fadd1eb8 00000000007434f0 000000000046d2cc 000
00000feb0b488
[ 1573.705221] Krnl Code: 000000000046c72c: a7380001            lhi     %r3,1
           000000000046c730: a7f40009           brc     15,46c742
          #000000000046c734: a73a0001           ahi     %r3,1
          >000000000046c738: e32010180020       cg      %r2,24(%r1)
           000000000046c73e: a7840016           brc     8,46c76a
           000000000046c742: e31010000002       ltg     %r1,0(%r1)
           000000000046c748: a774fff6           brc     7,46c734
           000000000046c74c: c010002dcfc0       larl    %r1,a266cc
[ 1573.705254] Call Trace:
[ 1573.705257] ([<000000000046d2ba>] debug_object_deactivate+0xba/0x160)
[ 1573.705263]  [<0000000000168890>] __run_hrtimer+0x58/0x474
[ 1573.705269]  [<0000000000169af2>] hrtimer_interrupt+0x116/0x2b0
[ 1573.705273]  [<0000000000104036>] clock_comparator_work+0x4a/0x54
[ 1573.705279]  [<000000000010d5b4>] do_extint+0x2b4/0x318
[ 1573.705285]  [<00000000006f0d90>] ext_skip+0x40/0x44
[ 1573.705292]  [<000003ff80108ba4>] ipt_do_table+0x298/0x8d0 [ip_tables]
[ 1573.705300] ([<000003ff801089d2>] ipt_do_table+0xc6/0x8d0 [ip_tables])
[ 1573.705305]  [<000003ff80165318>] nf_nat_ipv4_fn+0x1b4/0x354 [iptable_nat]
[ 1573.705310]  [<000003ff80165598>] nf_nat_ipv4_in+0x38/0x98 [iptable_nat]
[ 1573.705314]  [<00000000005d047c>] nf_iterate+0xd4/0x1b0
[ 1573.705903]  [<00000000005d0680>] nf_hook_slow+0x128/0x2cc
[ 1573.705909]  [<00000000005de956>] ip_rcv+0x272/0x360
[ 1573.705914]  [<0000000000592af6>] __netif_receive_skb_core+0xac2/0xe70
[ 1573.705919]  [<00000000005957e4>] netif_receive_skb+0x48/0x260
[ 1573.705924]  [<000003ff80095d16>] qeth_l2_poll+0x30a/0x540 [qeth_l2]
[ 1573.705938]  [<0000000000595f58>] net_rx_action+0x158/0x340
[ 1573.705942]  [<000000000013d8d6>] __do_softirq+0x112/0x42c
[ 1573.705947]  [<000000000013ddf8>] irq_exit+0xc8/0xe8
[ 1573.705951]  [<000000000010d55e>] do_extint+0x25e/0x318
[ 1573.705956]  [<00000000006f0d90>] ext_skip+0x40/0x44
[ 1573.705961]  [<000000000045a634>] memmove+0x14/0x5c
[ 1573.705968] ([<000003ff807d7a10>] __func__.41994+0x10/0xffffffffffffb600 [xfs
])
[ 1573.706104]  [<000003ff8076e6d2>] xfs_attr_leaf_compact+0xee/0x24c [xfs]
[ 1573.706224]  [<000003ff80772036>] xfs_attr_leaf_add+0x10a/0x268 [xfs]
[ 1573.706272]  [<000003ff8076a814>] xfs_attr_leaf_addname+0x11c/0x5a4 [xfs]
[ 1573.706303]  [<000003ff8076b028>] xfs_attr_set_int+0x38c/0x4b8 [xfs]
[ 1573.706333]  [<000003ff8076c9aa>] xfs_attr_set+0xb2/0xb8 [xfs]
[ 1573.706363]  [<000003ff8075d17e>] xfs_xattr_set+0x66/0xa0 [xfs]
[ 1573.706393]  [<00000000002d0a36>] generic_setxattr+0x7e/0x98
[ 1573.706400]  [<00000000002d1430>] __vfs_setxattr_noperm+0x78/0x1dc
[ 1573.706404]  [<00000000002d1652>] vfs_setxattr+0xbe/0xc4
[ 1573.706408]  [<00000000002d17b8>] setxattr+0x160/0x1c4
[ 1573.706412]  [<00000000002d1ba0>] SyS_lsetxattr+0xac/0xe8
[ 1573.706416]  [<00000000006f08b4>] sysc_tracego+0x14/0x1a
[ 1573.706420]  [<000003fffd2c2d1a>] 0x3fffd2c2d1a
[ 1573.706427] INFO: lockdep is turned off.
[ 1573.706430] Last Breaking-Event-Address:
[ 1573.706433]  [<000000000046c748>] lookup_object+0x30/0x68
[ 1573.706439]
[ 1573.706443] Kernel panic - not syncing: Fatal exception in interrupt
01: HCPGIR450W CP entered; disabled wait PSW 00020001 80000000 00000000 0010F384

----- Original Message -----
> From: "CAI Qian" <caiqian@redhat.com>
> To: "linux-s390" <linux-s390@vger.kernel.org>, linuxppc-dev@lists.ozlabs.org
> Cc: "LKML" <linux-kernel@vger.kernel.org>, stable@vger.kernel.org, xfs@oss.sgi.com, "Steve Best" <sbest@redhat.com>,
> "Hendrik Brueckner" <bhendrik@redhat.com>, "Dave Chinner" <david@fromorbit.com>
> Sent: Thursday, May 23, 2013 12:57:20 PM
> Subject: 3.9.2/3.9.3: stack overrun on s390x and ppc64 (WAS Re: 3.9.2: xfstests triggered panic)
> 
> Original report:
> http://oss.sgi.com/archives/xfs/2013-05/msg00683.html
> 
> Also seen on Power7:
> http://marc.info/?l=linux-kernel&m=136927904900692&w=2
> 
> CAI Qian
> 
> ----- Original Message -----
> > From: "Dave Chinner" <david@fromorbit.com>
> > To: "CAI Qian" <caiqian@redhat.com>
> > Cc: "LKML" <linux-kernel@vger.kernel.org>, stable@vger.kernel.org,
> > xfs@oss.sgi.com
> > Sent: Thursday, May 23, 2013 11:46:11 AM
> > Subject: Re: 3.9.2: xfstests triggered panic
> > 
> > On Wed, May 22, 2013 at 11:16:56PM -0400, CAI Qian wrote:
> > > ----- Original Message -----
> > > > From: "Dave Chinner" <david@fromorbit.com>
> > > > To: "CAI Qian" <caiqian@redhat.com>
> > > > Cc: "LKML" <linux-kernel@vger.kernel.org>, stable@vger.kernel.org,
> > > > xfs@oss.sgi.com
> > > > Sent: Wednesday, May 22, 2013 5:53:00 PM
> > > > Subject: Re: 3.9.2: xfstests triggered panic
> > > > 
> > > > On Wed, May 22, 2013 at 04:39:58AM -0400, CAI Qian wrote:
> > > > > Reproduced on almost all s390x guests by running xfstests.
> > > > > 
> > > > > 14634.396658¨ XFS (dm-1): Mounting Filesystem
> > > > > 14634.525522¨ XFS (dm-1): Ending clean mount
> > > > > 14640.413007¨  <000000000017c6d4>¨ idle_balance+0x1a0/0x340
> > > > > 14640.413010¨  <000000000063303e>¨ __schedule+0xa22/0xaf0
> > > > > 14640.428279¨  <0000000000630da6>¨ schedule_timeout+0x186/0x2c0
> > > > > 14640.428289¨  <00000000001cf864>¨ rcu_gp_kthread+0x1bc/0x298
> > > > > 14640.428300¨  <0000000000158c5a>¨ kthread+0xe6/0xec
> > > > > 14640.428304¨  <0000000000634de6>¨ kernel_thread_starter+0x6/0xc
> > > > > 14640.428308¨  <0000000000634de0>¨ kernel_thread_starter+0x0/0xc
> > > > > 14640.428311¨ Last Breaking-Event-Address:
> > > > > 14640.428314¨  <000000000016bd76>¨ walk_tg_tree_from+0x3a/0xf4
> > > > > 14640.428319¨  list_add corruption. next->prev should be prev
> > > > > (0000000000000918
> > > > > ), but was           (null). (next=          (null)).
> > > > 
> > > > Where's XFS in this? walk_tg_tree_from() is part of the scheduler
> > > > code. This kind of implies a stack corruption....
> > > > 
> > > > > Sometimes, this pops up,
> > > > > [16907.275002] WARNING: at kernel/rcutree.c:1960
> > > > > 
> > > > > or this,
> > > > > 15316.154171¨ XFS (dm-1): Mounting Filesystem
> > > > > 15316.255796¨ XFS (dm-1): Ending clean mount
> > > > > 15320.364246¨            00000000006367a2: e310b0080004        lg
> > > > > %r1,8(%r
> > > > > 11)
> > > > > 15320.364249¨            00000000006367a8: 41101010            la
> > > > > %r1,16(%
> > > > > r1)
> > > > > 15320.364251¨            00000000006367ac: e33010000004        lg
> > > > > %r3,0(%r
> > > > > 1)
> > > > > 15320.364252¨ Call Trace:
> > > > > 15320.364252¨ Last Breaking-Event-Address:
> > > > > 15320.364253¨  � <0000000000000000>¨ Kernel stack overflow.
> > > > > 15320.364308¨ CPU: 0 Tainted: GF       W    3.9.2 #1
> > > > > 15320.364309¨ Process rhts-test-runne (pid: 625, task:
> > > > > 000000003dccc890,
> > > > > ksp: 0
> > > > 
> > > > .... and there you go - a stack overflow. Your kernel stack size is
> > > > too small.
> > > > 
> > > > I'd suggest that you need 16k stacks on s390 - IIRC every function
> > > > call has 128 byte stack frame, and there are call chains 70-80
> > > > functions deep in the storage stack...
> > > Hmm, I am unsure how to set to 16k stack there
> > 
> > Are you build a 64 bit s390 kernel or a 32 bit kernel? 32 bit
> > kernels only have an 8k stack size, 64 bit kernels are 16k (see
> > arch/s390/Makefile).
> > 
> > $ git grep STACK_SIZE arch/s390 |head -2
> > arch/s390/Makefile:STACK_SIZE   := 8192
> > arch/s390/Makefile:STACK_SIZE   := 16384
> > 
> > As it is, the stack frame usage is worse than I thought:
> > 
> > $ git grep STACK_FRAME_OVERHEAD arch/s390 |head -2
> > arch/s390/include/uapi/asm/ptrace.h:#define STACK_FRAME_OVERHEAD 96      /*
> > size of minimum stack frame */
> > arch/s390/include/uapi/asm/ptrace.h:#define STACK_FRAME_OVERHEAD 160
> > /*
> > size of minimum stack frame */
> > 
> > Overhead is 96 bytes for 32 bit and 160 bytes for 64 bit. So 16k
> > stack size is going to have big troubles with a 70-80 function deep
> > call chain.
> > 
> > As for powerpc:
> > 
> > arch/powerpc/include/asm/ppc_asm.h:#define STACKFRAMESIZE 256
> > 
> > Yeah, same issue.
> > 
> > But, seriously, these stack traces are meaningless to anyone not
> > familiar with s390 or power7 - they indicate a problem detected
> > in the idle loop, not where ever the stack overran.
> > 
> > Can you please work with the s390/power7 people to obtain whatever
> > stack it was that overflowed, and we can go from there.
> > 
> > Cheers,
> > 
> > Dave.
> > --
> > Dave Chinner
> > david@fromorbit.com
> > 
> 

_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: 3.9.2/3.9.3: stack overrun on s390x and ppc64 (WAS Re: 3.9.2: xfstests triggered panic)
@ 2013-05-24  3:33             ` CAI Qian
  0 siblings, 0 replies; 16+ messages in thread
From: CAI Qian @ 2013-05-24  3:33 UTC (permalink / raw)
  To: linux-s390, linuxppc-dev
  Cc: Dave Chinner, LKML, Steve Best, xfs, stable, Hendrik Brueckner

OK, here is clearer stack output from the run.
CAI Qian

+ ./check
FSTYP         -- xfs (non-debug)
PLATFORM      -- Linux/s390x ibm-z10-23 3.9.3

001=09 29s
002=09 3s
003=09 2s
004=09 [not run] this test requires a valid $SCRATCH_DEV
005=09 2s
006=09 9s
007=09 10s
008=09 7s
009=09 [not run] this test requires a valid $SCRATCH_DEV
010=09 [not run] dbtest was not built for this platform
011=09 9s
012=09 10s
013=09 35s
014=09 5s
015=09 [not run] this test requires a valid $SCRATCH_DEV
016=09 [not run] this test requires a valid $SCRATCH_DEV
017=09 [not run] this test requires a valid $SCRATCH_DEV
018=09 [not run] this test requires a valid $SCRATCH_DEV
019=09 [not run] this test requires a valid $SCRATCH_DEV
020=09


[ 1316.571227] XFS (dm-0): Mounting Filesystem
[ 1316.697803] XFS (dm-0): Ending clean mount
[ 1318.080615] XFS (dm-0): Ending clean mount
[ 1348.791125] XFS (dm-0): Mounting Filesystem
[ 1348.989166] XFS (dm-0): Ending clean mount
[ 1353.335478] XFS (dm-0): Mounting Filesystem
[ 1353.496364] XFS (dm-0): Ending clean mount
[ 1357.495427] XFS (dm-0): Mounting Filesystem
[ 1357.676971] XFS (dm-0): Ending clean mount
[ 1361.646399] XFS (dm-0): Mounting Filesystem
[ 1361.890426] XFS (dm-0): Ending clean mount
[ 1371.798944] XFS (dm-0): Mounting Filesystem
[ 1371.976922] XFS (dm-0): Ending clean mount
[ 1384.559103] XFS (dm-0): Mounting Filesystem
[ 1384.725657] XFS (dm-0): Ending clean mount
[ 1393.131347] XFS (dm-0): Mounting Filesystem
[ 1393.357927] XFS (dm-0): Ending clean mount
[ 1407.282708] XFS (dm-0): Mounting Filesystem
[ 1407.745176] XFS (dm-0): Ending clean mount
[ 1422.927074] XFS (dm-0): Mounting Filesystem
[ 1423.136266] XFS (dm-0): Ending clean mount
[ 1425.500910] XFS (dm-0): Mounting Filesystem
[ 1425.608851] XFS (dm-0): Ending clean mount
[ 1450.978110] XFS (dm-0): Mounting Filesystem
[ 1451.255368] XFS (dm-0): Ending clean mount
[ 1453.603742] XFS (dm-0): Mounting Filesystem
[ 1453.680657] XFS (dm-0): Ending clean mount
[ 1456.262266] XFS (dm-0): Mounting Filesystem
[ 1456.330515] XFS (dm-0): Ending clean mount
[ 1457.053767] XFS (dm-0): Mounting Filesystem
[ 1457.107258] XFS (dm-0): Ending clean mount
[ 1462.049374] XFS (dm-0): Mounting Filesystem
[ 1462.111389] XFS (dm-0): Ending clean mount
[ 1471.109589] ODEBUG: deactivate not available (active state 0) object typ=
e: ti
mer_list hint: process_timeout+0x0/0x8
[ 1471.109683] ------------[ cut here ]------------
[ 1471.109688] WARNING: at lib/debugobjects.c:260
[ 1471.109692] Modules linked in: lockd(F) sunrpc(F) nf_conntrack_netbios_n=
s(F)
nf_conntrack_broadcast(F) ipt_MASQUERADE(F) ip6table_nat(F) nf_nat_ipv6(F) =
ip6ta
ble_mangle(F) ip6t_REJECT(F) nf_conntrack_ipv6(F) nf_defrag_ipv6(F) iptable=
_nat(
F) nf_nat_ipv4(F) nf_nat(F) iptable_mangle(F) ipt_REJECT(F) nf_conntrack_ip=
v4(F)
 nf_defrag_ipv4(F) xt_conntrack(F) nf_conntrack(F) ebtable_filter(F) ebtabl=
es(F)
 ip6table_filter(F) ip6_tables(F) iptable_filter(F) ip_tables(F) sg(F) qeth=
_l2(F
) vmur(F) xfs(F) libcrc32c(F) dasd_fba_mod(F) dasd_eckd_mod(F) lcs(F) dasd_=
mod(F
) ctcm(F) qeth(F) qdio(F) ccwgroup(F) fsm(F) dm_mirror(F) dm_region_hash(F)=
 dm_l
og(F) dm_mod(F)
[ 1471.109848] CPU: 0 Tainted: GF            3.9.3 #2
[ 1471.109858] Process swapper/0 (pid: 0, task: 0000000000a2b4d0, ksp: 0000=
00000
0a17d28)
[ 1471.109868] Krnl PSW : 0404c00180000000 000000000046c84a (debug_print_ob=
ject+
0xca/0xd8)
[ 1471.114762]            R:0 T:1 IO:0 EX:0 Key:0 M:1 W:0 P:0 AS:3 CC:0 PM:=
0 EA:
3
Krnl GPRS: 0000000000000000 0000000000a2b4d0 0000000000000067 000000000101f=
708
[ 1471.114769]            000000000046c846 0000000084a4d448 000000000086936=
a 000
0000001040700
[ 1471.114773]            0000000001a0f290 0400000000000001 0000000000874cf=
8 000
0000000a395d8
[ 1471.114777]            000000000195f820 000000000001bd20 000000000046c84=
6 000
000000001bc20
[ 1471.114792] Krnl Code: 000000000046c83a: e34410000004        lg      %r4=
,0(%r
4,%r1)
           000000000046c840: c0e500139f88       brasl   %r14,6e0750
          #000000000046c846: a7f40001           brc     15,46c848
          >000000000046c84a: a7f4ffc2           brc     15,46c7ce
           000000000046c84e: a7290000           lghi    %r2,0
           000000000046c852: a7f4ffd7           brc     15,46c800
           000000000046c856: 0707               bcr     0,%r7
           000000000046c858: ebaff0680024       stmg    %r10,%r15,104(%r15)
[ 1471.114825] Call Trace:
[ 1471.114828] ([<000000000046c846>] debug_print_object+0xc6/0xd8)
[ 1471.114833]  [<000000000046d35c>] debug_object_deactivate+0x15c/0x160
[ 1471.114838]  [<0000000000148244>] run_timer_softirq+0x180/0x464
[ 1471.114843]  [<000000000013d8d6>] __do_softirq+0x112/0x42c
[ 1471.114847]  [<000000000013ddf8>] irq_exit+0xc8/0xe8
[ 1471.114851]  [<000000000010d55e>] do_extint+0x25e/0x318
[ 1471.114859]  [<00000000006f0d90>] ext_skip+0x40/0x44
[ 1471.114866]  [<00000000006f05d6>] vtime_stop_cpu+0x52/0xbc
[ 1471.114870] ([<00000000006f05b4>] vtime_stop_cpu+0x30/0xbc)
[ 1471.114874]  [<000000000010476e>] cpu_idle+0x112/0x1b8
[ 1471.114879]  [<0000000000aaf99a>] start_kernel+0x42e/0x43c
[ 1471.114885]  [<0000000000100020>] _stext+0x20/0x80
[ 1471.114891] 2 locks held by swapper/0/0:
[ 1471.114894]  #0:  (&(&base->lock)->rlock){-.-.-.}, at: [<000000000014812=
8>] r
un_timer_softirq+0x64/0x464
[ 1471.114908]  #1:  (&obj_hash[i].lock){-.-.-.}, at: [<000000000046d2ba>] =
debug
_object_deactivate+0xba/0x160
[ 1471.114918] Last Breaking-Event-Address:
[ 1471.114921]  [<000000000046c846>] debug_print_object+0xc6/0xd8
[ 1471.114927] ---[ end trace dd87895f75677361 ]---
[ 1471.117683] ODEBUG: object is on stack, but not annotated
[ 1471.117723] ------------[ cut here ]------------
[ 1471.117726] WARNING: at lib/debugobjects.c:300
[ 1471.117729] Modules linked in: lockd(F) sunrpc(F) nf_conntrack_netbios_n=
s(F)
nf_conntrack_broadcast(F) ipt_MASQUERADE(F) ip6table_nat(F) nf_nat_ipv6(F) =
ip6ta
ble_mangle(F) ip6t_REJECT(F) nf_conntrack_ipv6(F) nf_defrag_ipv6(F) iptable=
_nat(
F) nf_nat_ipv4(F) nf_nat(F) iptable_mangle(F) ipt_REJECT(F) nf_conntrack_ip=
v4(F)
 nf_defrag_ipv4(F) xt_conntrack(F) nf_conntrack(F) ebtable_filter(F) ebtabl=
es(F)
 ip6table_filter(F) ip6_tables(F) iptable_filter(F) ip_tables(F) sg(F) qeth=
_l2(F
) vmur(F) xfs(F) libcrc32c(F) dasd_fba_mod(F) dasd_eckd_mod(F) lcs(F) dasd_=
mod(F
) ctcm(F) qeth(F) qdio(F) ccwgroup(F) fsm(F) dm_mirror(F) dm_region_hash(F)=
 dm_l
og(F) dm_mod(F)
[ 1471.117791] CPU: 0 Tainted: GF       W    3.9.3 #2
[ 1471.117794] Process rcu_sched (pid: 10, task: 00000000fffe8000, ksp: 000=
00000
fffe7ae0)
[ 1471.117797] Krnl PSW : 0404c00180000000 000000000046ccce (__debug_object=
_init
+0x35a/0x480)
[ 1471.117804]            R:0 T:1 IO:0 EX:0 Key:0 M:1 W:0 P:0 AS:3 CC:0 PM:=
0 EA:
3
Krnl GPRS: 0000000000000000 00000000fffe8000 000000000000002d 000000000101f=
708
[ 1471.117811]            000000000046ccca 0000000000000001 0000000000a395d=
8 070
0000000000000
[ 1471.117814]            0000000000a78750 0000000001a0f298 00000000fffe7cf=
0 000
00000f8daa938
[ 1471.117818]            0000000001a0f290 00000000007434c0 000000000046ccc=
a 00
00000fffe7a40
[ 1471.117852] Krnl Code: 000000000046ccbe: c02000217617        larl    %r2=
,89b8
ec
           000000000046ccc4: c0e500139d46       brasl   %r14,6e0750
          #000000000046ccca: a7f40001           brc     15,46cccc
          >000000000046ccce: a7f4ff03           brc     15,46cad4
           000000000046ccd2: a7380000           lhi     %r3,0
           000000000046ccd6: a7f4ffe9           brc     15,46cca8
           000000000046ccda: c02000217622       larl    %r2,89b91e
           000000000046cce0: c0e500139d38       brasl   %r14,6e0750
[ 1471.117884] Call Trace:
[ 1471.117887] ([<000000000046ccca>] __debug_object_init+0x356/0x480)
[ 1471.117891]  [<0000000000146f42>] init_timer_key+0x3a/0x164
[ 1471.117897]  [<0000000000147eac>] timer_fixup_assert_init+0x50/0xa8
[ 1471.117901]  [<000000000046d7a4>] debug_object_assert_init+0x124/0x154
[ 1471.117906]  [<0000000000148676>] try_to_del_timer_sync+0x2e/0x94
[ 1471.117910]  [<000000000014878e>] del_timer_sync+0xb2/0x104
[ 1471.117915]  [<00000000006ea950>] schedule_timeout+0x190/0x37c
[ 1471.117922]  [<00000000001f1118>] rcu_gp_kthread+0x320/0x59c
[ 1471.117927]  [<0000000000164176>] kthread+0xe6/0xec
[ 1471.117934]  [<00000000006f0906>] kernel_thread_starter+0x6/0xc
[ 1471.117938]  [<00000000006f0900>] kernel_thread_starter+0x0/0xc
[ 1471.117943] 1 lock held by rcu_sched/10:
[ 1471.117945]  #0:  (&obj_hash[i].lock){-.-.-.}, at: [<000000000046caba>] =
__deb
ug_object_init+0x146/0x480
[ 1471.117955] Last Breaking-Event-Address:
[ 1471.117958]  [<000000000046ccca>] __debug_object_init+0x356/0x480
[ 1471.117962] ---[ end trace dd87895f75677362 ]---
[ 1471.117967] ODEBUG: assert_init not available (active state 0) object ty=
pe: t
imer_list hint: stub_timer+0x0/0x8
[ 1471.117984] ------------[ cut here ]------------
[ 1471.117987] WARNING: at lib/debugobjects.c:260
[ 1471.117990] Modules linked in: lockd(F) sunrpc(F) nf_conntrack_netbios_n=
s(F)
nf_conntrack_broadcast(F) ipt_MASQUERADE(F) ip6table_nat(F) nf_nat_ipv6(F) =
ip6ta
ble_mangle(F) ip6t_REJECT(F) nf_conntrack_ipv6(F) nf_defrag_ipv6(F) iptable=
_nat(
F) nf_nat_ipv4(F) nf_nat(F) iptable_mangle(F) ipt_REJECT(F) nf_conntrack_ip=
v4(F)
 nf_defrag_ipv4(F) xt_conntrack(F) nf_conntrack(F) ebtable_filter(F) ebtabl=
es(F)
 ip6table_filter(F) ip6_tables(F) iptable_filter(F) ip_tables(F) sg(F) qeth=
_l2(F
) vmur(F) xfs(F) libcrc32c(F) dasd_fba_mod(F) dasd_eckd_mod(F) lcs(F) dasd_=
mod(F
) ctcm(F) qeth(F) qdio(F) ccwgroup(F) fsm(F) dm_mirror(F) dm_region_hash(F)=
 dm_l
og(F) dm_mod(F)
[ 1471.118053] CPU: 0 Tainted: GF       W    3.9.3 #2
[ 1471.118075] Process rcu_sched (pid: 10, task: 00000000fffe8000, ksp: 000=
00000
fffe7ae0)
[ 1471.118079] Krnl PSW : 0704e00180000000 000000000046c84a (debug_print_ob=
ject+
0xca/0xd8)
[ 1471.118085]            R:0 T:1 IO:1 EX:1 Key:0 M:1 W:0 P:0 AS:3 CC:2 PM:=
0 EA:
3
Krnl GPRS: 0000000000000000 00000000fffe8000 0000000000000063 0000000000000=
001
[ 1471.118092]            000000000046c846 0000000000000001 000000000086936=
a 000
0000000a1d300
[ 1471.118095]            0000000001a0f290 07000000006ed3fc 000000000089ba1=
4 000
0000000a395d8
[ 1471.118099]            000000000195f820 00000000fffe7bc8 000000000046c84=
6 000
00000fffe7ac8
[ 1471.118109] Krnl Code: 000000000046c83a: e34410000004        lg      %r4=
,0(%r
4,%r1)
           000000000046c840: c0e500139f88       brasl   %r14,6e0750
          #000000000046c846: a7f40001           brc     15,46c848
          >000000000046c84a: a7f4ffc2           brc     15,46c7ce
           000000000046c84e: a7290000           lghi    %r2,0
           000000000046c852: a7f4ffd7           brc     15,46c800
           000000000046c856: 0707               bcr     0,%r7
           000000000046c858: ebaff0680024       stmg    %r10,%r15,104(%r15)
[ 1471.118141] Call Trace:
[ 1471.118144] ([<000000000046c846>] debug_print_object+0xc6/0xd8)
[ 1471.118149]  [<000000000046d7ce>] debug_object_assert_init+0x14e/0x154
[ 1471.118153]  [<0000000000148676>] try_to_del_timer_sync+0x2e/0x94
[ 1471.118158]  [<000000000014878e>] del_timer_sync+0xb2/0x104
[ 1471.118163]  [<00000000006ea950>] schedule_timeout+0x190/0x37c
[ 1471.118167]  [<00000000001f1118>] rcu_gp_kthread+0x320/0x59c
[ 1471.118172]  [<0000000000164176>] kthread+0xe6/0xec
[ 1471.118176]  [<00000000006f0906>] kernel_thread_starter+0x6/0xc
[ 1471.118180]  [<00000000006f0900>] kernel_thread_starter+0x0/0xc
[ 1471.118185] no locks held by rcu_sched/10.
[ 1471.118188] Last Breaking-Event-Address:
[ 1471.118190]  [<000000000046c846>] debug_print_object+0xc6/0xd8
[ 1471.118195] ---[ end trace dd87895f75677363 ]---
[ 1471.121980] Unable to handle kernel pointer dereference at virtual kerne=
l add
ress 000003ff8072c000
[ 1471.123714] Oops: 0011 [#1] SMP
[ 1471.123720] Modules linked in: lockd(F) sunrpc(F) nf_conntrack_netbios_n=
s(F)
nf_conntrack_broadcast(F) ipt_MASQUERADE(F) ip6table_nat(F) nf_nat_ipv6(F) =
ip6ta
ble_mangle(F) ip6t_REJECT(F) nf_conntrack_ipv6(F) nf_defrag_ipv6(F) iptable=
_nat(
F) nf_nat_ipv4(F) nf_nat(F) iptable_mangle(F) ipt_REJECT(F) nf_conntrack_ip=
v4(F)
 nf_defrag_ipv4(F) xt_conntrack(F) nf_conntrack(F) ebtable_filter(F) ebtabl=
es(F)
 ip6table_filter(F) ip6_tables(F) iptable_filter(F) ip_tables(F) sg(F) qeth=
_l2(F
) vmur(F) xfs(F) libcrc32c(F) dasd_fba_mod(F) dasd_eckd_mod(F) lcs(F) dasd_=
mod(F
) ctcm(F) qeth(F) qdio(F) ccwgroup(F) fsm(F) dm_mirror(F) dm_region_hash(F)=
 dm_l
og(F) dm_mod(F)
[ 1471.123777] CPU: 0 Tainted: GF       W    3.9.3 #2
[ 1471.123781] Process systemd-journal (pid: 516, task: 00000000f5ad8000, k=
sp: 0
0000000f544fbd0)
[ 1471.123784] Krnl PSW : 0704e00180000000 000003ff80749cc2 (xfs_vm_page_mk=
write
+0x12/0x1c [xfs])
[ 1471.154773]            R:0 T:1 IO:1 EX:1 Key:0 M:1 W:0 P:0 AS:3 CC:2 PM:=
0 EA:
3
Krnl GPRS: 00000000000015e8 000003ff8072cc30 0000000000000001 0000000000000=
000
[ 1471.154781]            00000000006ef176 00000000007036b8 00000000f97b7d9=
8 000
003fffb3e7000
[ 1471.154784]            000003fffb3e7000 00000000f8f65a00 00000000f847273=
8 000
00000f8ffc2c0
[ 1471.154788]            00000000f8472738 000000000070d5e0 000003ff80749cb=
6 000
00000f544fc28
[ 1471.154802] Krnl Code: 000003ff80749cb6: 1222                ltr     %r2=
,%r2
           000003ff80749cb8: a784ffd8           brc     8,3ff80749c68
          #000003ff80749cbc: c010ffff17ba       larl    %r1,3ff8072cc30
          >000003ff80749cc2: e31010000004       lg      %r1,0(%r1)
           000003ff80749cc8: e32010000012       lt      %r2,0(%r1)
           000003ff80749cce: a7740032           brc     7,3ff80749d32
           000003ff80749cd2: e31003180004       lg      %r1,792
           000003ff80749cd8: e330101c0012       lt      %r3,28(%r1)
[ 1471.154838] Call Trace:
[ 1471.154841] ([<0000000000261870>] do_wp_page+0x6f8/0xa4c)
[ 1471.154851]  [<0000000000262f2c>] handle_pte_fault+0x65c/0xa10
[ 1471.154856]  [<0000000000264a22>] handle_mm_fault+0x182/0x260
[ 1471.154860]  [<00000000006f21d0>] do_protection_exception+0x1b8/0x3cc
[ 1471.154867]  [<00000000006f0a44>] pgm_check_handler+0x138/0x13c
[ 1471.154871]  [<000000008001d60a>] 0x8001d60a
[ 1471.154894] INFO: lockdep is turned off.
[ 1471.154897] Last Breaking-Event-Address:
[ 1471.154899]  [<00000000001ed7ca>] rcu_lockdep_current_cpu_online+0x6e/0x=
84
[ 1471.154909]
[ 1471.154913] ---[ end trace dd87895f75677364 ]---
[ 1471.157438] ------------[ cut here ]------------
[ 1471.157446] Kernel BUG at 00000000006eb186 [verbose debug info unavailab=
le]
[ 1471.157508] addressing exception: 0005 [#2] SMP
[ 1471.157514] Modules linked in: lockd(F) sunrpc(F) nf_conntrack_netbios_n=
s(F)
nf_conntrack_broadcast(F) ipt_MASQUERADE(F) ip6table_nat(F) nf_nat_ipv6(F) =
ip6ta
ble_mangle(F) ip6t_REJECT(F) nf_conntrack_ipv6(F) nf_defrag_ipv6(F) iptable=
_nat(
F) nf_nat_ipv4(F) nf_nat(F) iptable_mangle(F) ipt_REJECT(F) nf_conntrack_ip=
v4(F)
 nf_defrag_ipv4(F) xt_conntrack(F) nf_conntrack(F) ebtable_filter(F) ebtabl=
es(F)
 ip6table_filter(F) ip6_tables(F) iptable_filter(F) ip_tables(F) sg(F) qeth=
_l2(F
) vmur(F) xfs(F) libcrc32c(F) dasd_fba_mod(F) dasd_eckd_mod(F) lcs(F) dasd_=
mod(F
) ctcm(F) qeth(F) qdio(F) ccwgroup(F) fsm(F) dm_mirror(F) dm_region_hash(F)=
 dm_l
og(F) dm_mod(F)
[ 1471.157574] CPU: 0 Tainted: GF     D W    3.9.3 #2
[ 1471.157577] Process systemd-journal (pid: 516, task: 00000000f5ad8000, k=
sp: 0
0000000f544f578)
[ 1471.157580] Krnl PSW : 0404d00180000000 00000000006eb186 (mutex_lock_nes=
ted+0
xde/0x36c)
[ 1471.157591]            R:0 T:1 IO:0 EX:0 Key:0 M:1 W:0 P:0 AS:3 CC:1 PM:=
0 EA:
3
Krnl GPRS: 000000000000166e 0000000000000000 00000000ffffffff 0000000000a78=
630
[ 1471.157597]            0000000000000000 0000000000000000 000000000000000=
2 000
00000100073f8
[ 1471.157601]            00000000f5ad8000 000000000101f708 000000000000000=
0 070
0000000000000
[ 1471.157604]            0000000010007388 0000000010007380 00000000006eb17=
0 000
00000f544f658
[ 1471.157616] Krnl Code: 00000000006eb178: a7180000            lhi     %r1=
,0
           00000000006eb17c: c027ffffffff       xilf    %r2,4294967295
          #00000000006eb182: ba12d008           cs      %r1,%r2,8(%r13)
          >00000000006eb186: 1211               ltr     %r1,%r1
           00000000006eb188: a774011b           brc     7,6eb3be
           00000000006eb18c: e31090000012       lt      %r1,0(%r9)
           00000000006eb192: a7740007           brc     7,6eb1a0
           00000000006eb196: e3d0d0700020       cg      %r13,112(%r13)
[ 1471.157650] Call Trace:
[ 1471.157652] ([<00000000006eb170>] mutex_lock_nested+0xc8/0x36c)
[ 1471.157657]  [<0000000000260e3e>] unmap_single_vma+0x786/0x7d0
[ 1471.157663]  [<000000000026219c>] unmap_vmas+0x50/0x74
[ 1471.157666]  [<000000000026b7aa>] exit_mmap+0x186/0x224
[ 1471.157671]  [<000000000012ef88>] mmput+0x84/0x110
[ 1471.157677]  [<0000000000139fd2>] do_exit+0x2c6/0xce8
[ 1471.157682]  [<0000000000100ef2>] die+0x13e/0x158
[ 1471.157687]  [<000000000011dd6e>] do_no_context+0xba/0xf0
[ 1471.157692]  [<00000000006f2714>] do_dat_exception+0x330/0x390
[ 1471.157697]  [<00000000006f0a44>] pgm_check_handler+0x138/0x13c
[ 1471.157701]  [<000003ff80749cc2>] xfs_vm_page_mkwrite+0x12/0x1c [xfs]
[ 1471.157800] ([<0000000000261870>] do_wp_page+0x6f8/0xa4c)
[ 1471.157805]  [<0000000000262f2c>] handle_pte_fault+0x65c/0xa10
[ 1471.157809]  [<0000000000264a22>] handle_mm_fault+0x182/0x260
[ 1471.157813]  [<00000000006f21d0>] do_protection_exception+0x1b8/0x3cc
[ 1471.157817]  [<00000000006f0a44>] pgm_check_handler+0x138/0x13c
[ 1471.157821]  [<000000008001d60a>] 0x8001d60a
[ 1471.157827] INFO: lockdep is turned off.
[ 1471.157829] Last Breaking-Event-Address:
[ 1471.157832]  [<00000000001af490>] trace_hardirqs_off_caller+0x7c/0xb0
[ 1471.157840]
[ 1471.157857] ---[ end trace dd87895f75677365 ]---
[ 1471.157863] Fixing recursive fault but reboot is needed!
[ 1471.157867] BUG: scheduling while atomic: systemd-journal/516/0x00000002
[ 1471.157870] INFO: lockdep is turned off.
[ 1471.157872] Modules linked in: lockd(F) sunrpc(F) nf_conntrack_netbios_n=
s(F)
nf_conntrack_broadcast(F) ipt_MASQUERADE(F) ip6table_nat(F) nf_nat_ipv6(F) =
ip6ta
ble_mangle(F) ip6t_REJECT(F) nf_conntrack_ipv6(F) nf_defrag_ipv6(F) iptable=
_nat(
F) nf_nat_ipv4(F) nf_nat(F) iptable_mangle(F) ipt_REJECT(F) nf_conntrack_ip=
v4(F)
 nf_defrag_ipv4(F) xt_conntrack(F) nf_conntrack(F) ebtable_filter(F) ebtabl=
es(F)
 ip6table_filter(F) ip6_tables(F) iptable_filter(F) ip_tables(F) sg(F) qeth=
_l2(F
) vmur(F) xfs(F) libcrc32c(F) dasd_fba_mod(F) dasd_eckd_mod(F) lcs(F) dasd_=
mod(F
) ctcm(F) qeth(F) qdio(F) ccwgroup(F) fsm(F) dm_mirror(F) dm_region_hash(F)=
 dm_l
og(F) dm_mod(F)
[ 1471.157928] CPU: 0 Tainted: GF     D W    3.9.3 #2
[ 1471.157931] Process systemd-journal (pid: 516, task: 00000000f5ad8000, k=
sp: 0
0000000f544f578)
[ 1471.157934]        00000000f544f168 00000000f544f178 0000000000000002 00=
00000
000000000
       00000000f544f208 00000000f544f180 00000000f544f180 0000000000100a1e
       0000000000000000 0000000000ffb000 000000000000000a 000000000000000a
       00000000f544f1c8 00000000f544f168 0000000000000000 0000000000000000
       00000000f5ad8000 0000000000100a1e 00000000f544f168 00000000f544f1b8
[ 1471.157966] Call Trace:
[ 1471.157968] ([<0000000000100920>] show_trace+0xf0/0x148)
[ 1471.157973]  [<00000000006e0e88>] __schedule_bug+0x80/0x98
[ 1471.157977]  [<00000000006ed9ea>] __schedule+0xc6a/0xcc8
[ 1471.157981]  [<000000000013a99e>] do_exit+0xc92/0xce8
[ 1471.157985]  [<0000000000100ef2>] die+0x13e/0x158
[ 1471.157990]  [<00000000006f02ac>] do_per_trap+0x0/0xb4
[ 1471.157993]  [<00000000006f0a44>] pgm_check_handler+0x138/0x13c
[ 1471.157997]  [<00000000006eb186>] mutex_lock_nested+0xde/0x36c
[ 1471.158002] ([<00000000006eb170>] mutex_lock_nested+0xc8/0x36c)
[ 1471.158006]  [<0000000000260e3e>] unmap_single_vma+0x786/0x7d0
[ 1471.158010]  [<000000000026219c>] unmap_vmas+0x50/0x74
[ 1471.158014]  [<000000000026b7aa>] exit_mmap+0x186/0x224
[ 1471.158018]  [<000000000012ef88>] mmput+0x84/0x110
[ 1471.158021]  [<0000000000139fd2>] do_exit+0x2c6/0xce8
[ 1471.158025]  [<0000000000100ef2>] die+0x13e/0x158
[ 1471.158029]  [<000000000011dd6e>] do_no_context+0xba/0xf0
[ 1471.158033]  [<00000000006f2714>] do_dat_exception+0x330/0x390
[ 1471.158037]  [<00000000006f0a44>] pgm_check_handler+0x138/0x13c
[ 1471.158041]  [<000003ff80749cc2>] xfs_vm_page_mkwrite+0x12/0x1c [xfs]
[ 1471.158070] ([<0000000000261870>] do_wp_page+0x6f8/0xa4c)
[ 1471.158074]  [<0000000000262f2c>] handle_pte_fault+0x65c/0xa10
[ 1471.158078]  [<0000000000264a22>] handle_mm_fault+0x182/0x260
[ 1471.158082]  [<00000000006f21d0>] do_protection_exception+0x1b8/0x3cc
[ 1471.158086]  [<00000000006f0a44>] pgm_check_handler+0x138/0x13c
[ 1471.158090]  [<000000008001d60a>] 0x8001d60a
[ 1471.158094] INFO: lockdep is turned off.
[ 1471.158221] Unable to handle kernel paging request at virtual user addre=
ss
        (null)
[ 1471.158247] Oops: 0004 [#3] SMP
[ 1471.158251] Modules linked in: lockd(F) sunrpc(F) nf_conntrack_netbios_n=
s(F)
nf_conntrack_broadcast(F) ipt_MASQUERADE(F) ip6table_nat(F) nf_nat_ipv6(F) =
ip6ta
ble_mangle(F) ip6t_REJECT(F) nf_conntrack_ipv6(F) nf_defrag_ipv6(F) iptable=
_nat(
F) nf_nat_ipv4(F) nf_nat(F) iptable_mangle(F) ipt_REJECT(F) nf_conntrack_ip=
v4(F)
 nf_defrag_ipv4(F) xt_conntrack(F) nf_conntrack(F) ebtable_filter(F) ebtabl=
es(F)
 ip6table_filter(F) ip6_tables(F) iptable_filter(F) ip_tables(F) sg(F) qeth=
_l2(F
) vmur(F) xfs(F) libcrc32c(F) dasd_fba_mod(F) dasd_eckd_mod(F) lcs(F) dasd_=
mod(F
) ctcm(F) qeth(F) qdio(F) ccwgroup(F) fsm(F) dm_mirror(F) dm_region_hash(F)=
 dm_l
og(F) dm_mod(F)
[ 1471.158306] CPU: 0 Tainted: GF     D W    3.9.3 #2
[ 1471.158309] Process in:imklog (pid: 765, task: 00000000fc7f2430, ksp: 00=
00000
0fc36fd50)
[ 1471.158312] Krnl PSW : 0404e00180000000 0000000000183520 (tg_load_down+0=
x60/0
xb0)
[ 1471.158320]            R:0 T:1 IO:0 EX:0 Key:0 M:1 W:0 P:0 AS:3 CC:2 PM:=
0 EA:
3
Krnl GPRS: 0000000000000040 0000000000000000 0000000000000000 0000000000000=
008
[ 1471.158327]            0000000000000000 0000000000000000 00000000f925340=
0 000
0000000aa0854
[ 1471.158330]            0000000000aa0852 00000000001834c0 0000000000170ed=
c 000
0000000000000
[ 1471.158333]            00000000f9187e00 00000000fcd5c800 000000000017a33=
0 000
00000fc36fb48
[ 1471.158343] Krnl Code: 0000000000183512: a7290000            lghi    %r2=
,0
           0000000000183516: e31310000004       lg      %r1,0(%r3,%r1)
          #000000000018351c: b9040045           lgr     %r4,%r5
          >0000000000183520: e34010980024       stg     %r4,152(%r1)
           0000000000183526: ebbcf0700004       lmg     %r11,%r12,112(%r15)
           000000000018352c: 07fe               bcr     15,%r14
           000000000018352e: eb430003000d       sllg    %r4,%r3,3
           0000000000183534: c05000451512       larl    %r5,a25f58
[ 1471.158377] Call Trace:
[ 1471.158380] ([<000000000018bd96>] load_balance+0x236/0xc7c)
[ 1471.158385]  [<000000000018be16>] load_balance+0x2b6/0xc7c
[ 1471.158389]  [<000000000018d0a2>] idle_balance+0x212/0x494
[ 1471.158393]  [<00000000006ed9ae>] __schedule+0xc2e/0xcc8
[ 1471.158397]  [<00000000006f0c30>] io_reschedule+0xa/0x1a
[ 1471.158402]  [<000003fffd3bea22>] 0x3fffd3bea22
[ 1471.158406] INFO: lockdep is turned off.
[ 1471.158408] Last Breaking-Event-Address:
[ 1471.158411]  [<000000000017a32e>] walk_tg_tree_from+0x46/0x190
[ 1471.158416]
[ 1471.158418] ---[ end trace dd87895f75677366 ]---
[ 1471.158423] BUG: sleeping function called from invalid context at kernel=
/rwse
m.c:20
[ 1471.158426] in_atomic(): 1, irqs_disabled(): 0, pid: 765, name: in:imklo=
g
[ 1471.158429] INFO: lockdep is turned off.
[ 1471.158432] CPU: 0 Tainted: GF     D W    3.9.3 #2
[ 1471.158435] Process in:imklog (pid: 765, task: 00000000fc7f2430, ksp: 00=
00000
0fc36fd50)
[ 1471.158438]        00000000fc36f690 00000000fc36f6a0 0000000000000002 00=
00000
000000000
       00000000fc36f730 00000000fc36f6a8 00000000fc36f6a8 0000000000100a1e
       0000000000000000 00000000fc7f2870 000000000000000a 0404e0010000000a
       00000000fc36f6f0 00000000fc36f690 0000000000000000 0000000000000000
       00000000006fdbc8 0000000000100a1e 00000000fc36f690 00000000fc36f6e0
[ 1471.158470] Call Trace:
[ 1471.158473] ([<0000000000100920>] show_trace+0xf0/0x148)
[ 1471.158477]  [<0000000000175c0a>] __might_sleep+0x17a/0x238
[ 1471.158482]  [<00000000006ec4de>] down_read+0x42/0xe0
[ 1471.158487]  [<000000000014fea0>] exit_signals+0x38/0x144
[ 1471.158492]  [<0000000000139dd6>] do_exit+0xca/0xce8
[ 1471.158496]  [<0000000000100ef2>] die+0x13e/0x158
[ 1471.158500]  [<000000000011dd6e>] do_no_context+0xba/0xf0
[ 1471.158504]  [<00000000006f23de>] do_protection_exception+0x3c6/0x3cc
[ 1471.158508]  [<00000000006f0a44>] pgm_check_handler+0x138/0x13c
[ 1471.158513]  [<0000000000183520>] tg_load_down+0x60/0xb0
[ 1471.158517] ([<000000000018bd96>] load_balance+0x236/0xc7c)
[ 1471.158521]  [<000000000018be16>] load_balance+0x2b6/0xc7c
[ 1471.158525]  [<000000000018d0a2>] idle_balance+0x212/0x494
[ 1471.158529]  [<00000000006ed9ae>] __schedule+0xc2e/0xcc8
[ 1471.158533]  [<00000000006f0c30>] io_reschedule+0xa/0x1a
[ 1471.158537]  [<000003fffd3bea22>] 0x3fffd3bea22
[ 1471.158541] INFO: lockdep is turned off.
[ 1471.158545] note: in:imklog[765] exited with preempt_count 3
[ 1573.704911] Unable to handle kernel pointer dereference at virtual kerne=
l add
ress 00c040077ff00000
[ 1573.705014] Oops: 0038 [#4] SMP
[ 1573.705029] Modules linked in: lockd(F) sunrpc(F) nf_conntrack_netbios_n=
s(F)
nf_conntrack_broadcast(F) ipt_MASQUERADE(F) ip6table_nat(F) nf_nat_ipv6(F) =
ip6ta
ble_mangle(F) ip6t_REJECT(F) nf_conntrack_ipv6(F) nf_defrag_ipv6(F) iptable=
_nat(
F) nf_nat_ipv4(F) nf_nat(F) iptable_mangle(F) ipt_REJECT(F) nf_conntrack_ip=
v4(F)
 nf_defrag_ipv4(F) xt_conntrack(F) nf_conntrack(F) ebtable_filter(F) ebtabl=
es(F)
 ip6table_filter(F) ip6_tables(F)00: HCPGSP2629I The virtual machine is pla=
ced i
n CP mode due to a SIGP stop from
 CPU 01.
 iptable_filter(F) ip_tables(F) sg(F) qeth_l2(F) vmur(F) xfs(F) libcrc32c(F=
) das
d_fba_mod(F) dasd_eckd_mod(F) lcs(F) dasd_mod(F) ctcm(F) qeth(F) qdio(F) cc=
wgrou
p(F) fsm(F) dm_mirror(F) dm_region_hash(F) dm_log(F) dm_mod(F)
[ 1573.705173] CPU: 1 Tainted: GF     D W    3.9.3 #2
[ 1573.705177] Process attr (pid: 22262, task: 00000000b1190000, ksp: 00000=
000b0
f43eb0)
[ 1573.705180] Krnl PSW : 0404e00180000000 000000000046c738 (lookup_object+=
0x20/
0x68)
[ 1573.705193]            R:0 T:1 IO:0 EX:0 Key:0 M:1 W:0 P:0 AS:3 CC:2 PM:=
0 EA:
3
Krnl GPRS: 7440000000000000 00c040077ff000e8 00000000fadd1eb8 0000000000000=
003
[ 1573.705199]            000000000046d2ba 0000000000000000 0000000084c1a82=
8 000
0000084c1a868
[ 1573.705203]            0000000001964250 04000000001b5d64 0000000000a3bdf=
8 000
0000001964258
[ 1573.705206]            00000000fadd1eb8 00000000007434f0 000000000046d2c=
c 000
00000feb0b488
[ 1573.705221] Krnl Code: 000000000046c72c: a7380001            lhi     %r3=
,1
           000000000046c730: a7f40009           brc     15,46c742
          #000000000046c734: a73a0001           ahi     %r3,1
          >000000000046c738: e32010180020       cg      %r2,24(%r1)
           000000000046c73e: a7840016           brc     8,46c76a
           000000000046c742: e31010000002       ltg     %r1,0(%r1)
           000000000046c748: a774fff6           brc     7,46c734
           000000000046c74c: c010002dcfc0       larl    %r1,a266cc
[ 1573.705254] Call Trace:
[ 1573.705257] ([<000000000046d2ba>] debug_object_deactivate+0xba/0x160)
[ 1573.705263]  [<0000000000168890>] __run_hrtimer+0x58/0x474
[ 1573.705269]  [<0000000000169af2>] hrtimer_interrupt+0x116/0x2b0
[ 1573.705273]  [<0000000000104036>] clock_comparator_work+0x4a/0x54
[ 1573.705279]  [<000000000010d5b4>] do_extint+0x2b4/0x318
[ 1573.705285]  [<00000000006f0d90>] ext_skip+0x40/0x44
[ 1573.705292]  [<000003ff80108ba4>] ipt_do_table+0x298/0x8d0 [ip_tables]
[ 1573.705300] ([<000003ff801089d2>] ipt_do_table+0xc6/0x8d0 [ip_tables])
[ 1573.705305]  [<000003ff80165318>] nf_nat_ipv4_fn+0x1b4/0x354 [iptable_na=
t]
[ 1573.705310]  [<000003ff80165598>] nf_nat_ipv4_in+0x38/0x98 [iptable_nat]
[ 1573.705314]  [<00000000005d047c>] nf_iterate+0xd4/0x1b0
[ 1573.705903]  [<00000000005d0680>] nf_hook_slow+0x128/0x2cc
[ 1573.705909]  [<00000000005de956>] ip_rcv+0x272/0x360
[ 1573.705914]  [<0000000000592af6>] __netif_receive_skb_core+0xac2/0xe70
[ 1573.705919]  [<00000000005957e4>] netif_receive_skb+0x48/0x260
[ 1573.705924]  [<000003ff80095d16>] qeth_l2_poll+0x30a/0x540 [qeth_l2]
[ 1573.705938]  [<0000000000595f58>] net_rx_action+0x158/0x340
[ 1573.705942]  [<000000000013d8d6>] __do_softirq+0x112/0x42c
[ 1573.705947]  [<000000000013ddf8>] irq_exit+0xc8/0xe8
[ 1573.705951]  [<000000000010d55e>] do_extint+0x25e/0x318
[ 1573.705956]  [<00000000006f0d90>] ext_skip+0x40/0x44
[ 1573.705961]  [<000000000045a634>] memmove+0x14/0x5c
[ 1573.705968] ([<000003ff807d7a10>] __func__.41994+0x10/0xffffffffffffb600=
 [xfs
])
[ 1573.706104]  [<000003ff8076e6d2>] xfs_attr_leaf_compact+0xee/0x24c [xfs]
[ 1573.706224]  [<000003ff80772036>] xfs_attr_leaf_add+0x10a/0x268 [xfs]
[ 1573.706272]  [<000003ff8076a814>] xfs_attr_leaf_addname+0x11c/0x5a4 [xfs=
]
[ 1573.706303]  [<000003ff8076b028>] xfs_attr_set_int+0x38c/0x4b8 [xfs]
[ 1573.706333]  [<000003ff8076c9aa>] xfs_attr_set+0xb2/0xb8 [xfs]
[ 1573.706363]  [<000003ff8075d17e>] xfs_xattr_set+0x66/0xa0 [xfs]
[ 1573.706393]  [<00000000002d0a36>] generic_setxattr+0x7e/0x98
[ 1573.706400]  [<00000000002d1430>] __vfs_setxattr_noperm+0x78/0x1dc
[ 1573.706404]  [<00000000002d1652>] vfs_setxattr+0xbe/0xc4
[ 1573.706408]  [<00000000002d17b8>] setxattr+0x160/0x1c4
[ 1573.706412]  [<00000000002d1ba0>] SyS_lsetxattr+0xac/0xe8
[ 1573.706416]  [<00000000006f08b4>] sysc_tracego+0x14/0x1a
[ 1573.706420]  [<000003fffd2c2d1a>] 0x3fffd2c2d1a
[ 1573.706427] INFO: lockdep is turned off.
[ 1573.706430] Last Breaking-Event-Address:
[ 1573.706433]  [<000000000046c748>] lookup_object+0x30/0x68
[ 1573.706439]
[ 1573.706443] Kernel panic - not syncing: Fatal exception in interrupt
01: HCPGIR450W CP entered; disabled wait PSW 00020001 80000000 00000000 001=
0F384

----- Original Message -----
> From: "CAI Qian" <caiqian@redhat.com>
> To: "linux-s390" <linux-s390@vger.kernel.org>, linuxppc-dev@lists.ozlabs.=
org
> Cc: "LKML" <linux-kernel@vger.kernel.org>, stable@vger.kernel.org, xfs@os=
s.sgi.com, "Steve Best" <sbest@redhat.com>,
> "Hendrik Brueckner" <bhendrik@redhat.com>, "Dave Chinner" <david@fromorbi=
t.com>
> Sent: Thursday, May 23, 2013 12:57:20 PM
> Subject: 3.9.2/3.9.3: stack overrun on s390x and ppc64 (WAS Re: 3.9.2: xf=
stests triggered panic)
>=20
> Original report:
> http://oss.sgi.com/archives/xfs/2013-05/msg00683.html
>=20
> Also seen on Power7:
> http://marc.info/?l=3Dlinux-kernel&m=3D136927904900692&w=3D2
>=20
> CAI Qian
>=20
> ----- Original Message -----
> > From: "Dave Chinner" <david@fromorbit.com>
> > To: "CAI Qian" <caiqian@redhat.com>
> > Cc: "LKML" <linux-kernel@vger.kernel.org>, stable@vger.kernel.org,
> > xfs@oss.sgi.com
> > Sent: Thursday, May 23, 2013 11:46:11 AM
> > Subject: Re: 3.9.2: xfstests triggered panic
> >=20
> > On Wed, May 22, 2013 at 11:16:56PM -0400, CAI Qian wrote:
> > > ----- Original Message -----
> > > > From: "Dave Chinner" <david@fromorbit.com>
> > > > To: "CAI Qian" <caiqian@redhat.com>
> > > > Cc: "LKML" <linux-kernel@vger.kernel.org>, stable@vger.kernel.org,
> > > > xfs@oss.sgi.com
> > > > Sent: Wednesday, May 22, 2013 5:53:00 PM
> > > > Subject: Re: 3.9.2: xfstests triggered panic
> > > >=20
> > > > On Wed, May 22, 2013 at 04:39:58AM -0400, CAI Qian wrote:
> > > > > Reproduced on almost all s390x guests by running xfstests.
> > > > >=20
> > > > > 14634.396658=C2=A8 XFS (dm-1): Mounting Filesystem
> > > > > 14634.525522=C2=A8 XFS (dm-1): Ending clean mount
> > > > > 14640.413007=C2=A8  <000000000017c6d4>=C2=A8 idle_balance+0x1a0/0=
x340
> > > > > 14640.413010=C2=A8  <000000000063303e>=C2=A8 __schedule+0xa22/0xa=
f0
> > > > > 14640.428279=C2=A8  <0000000000630da6>=C2=A8 schedule_timeout+0x1=
86/0x2c0
> > > > > 14640.428289=C2=A8  <00000000001cf864>=C2=A8 rcu_gp_kthread+0x1bc=
/0x298
> > > > > 14640.428300=C2=A8  <0000000000158c5a>=C2=A8 kthread+0xe6/0xec
> > > > > 14640.428304=C2=A8  <0000000000634de6>=C2=A8 kernel_thread_starte=
r+0x6/0xc
> > > > > 14640.428308=C2=A8  <0000000000634de0>=C2=A8 kernel_thread_starte=
r+0x0/0xc
> > > > > 14640.428311=C2=A8 Last Breaking-Event-Address:
> > > > > 14640.428314=C2=A8  <000000000016bd76>=C2=A8 walk_tg_tree_from+0x=
3a/0xf4
> > > > > 14640.428319=C2=A8  list_add corruption. next->prev should be pre=
v
> > > > > (0000000000000918
> > > > > ), but was           (null). (next=3D          (null)).
> > > >=20
> > > > Where's XFS in this? walk_tg_tree_from() is part of the scheduler
> > > > code. This kind of implies a stack corruption....
> > > >=20
> > > > > Sometimes, this pops up,
> > > > > [16907.275002] WARNING: at kernel/rcutree.c:1960
> > > > >=20
> > > > > or this,
> > > > > 15316.154171=C2=A8 XFS (dm-1): Mounting Filesystem
> > > > > 15316.255796=C2=A8 XFS (dm-1): Ending clean mount
> > > > > 15320.364246=C2=A8            00000000006367a2: e310b0080004     =
   lg
> > > > > %r1,8(%r
> > > > > 11)
> > > > > 15320.364249=C2=A8            00000000006367a8: 41101010         =
   la
> > > > > %r1,16(%
> > > > > r1)
> > > > > 15320.364251=C2=A8            00000000006367ac: e33010000004     =
   lg
> > > > > %r3,0(%r
> > > > > 1)
> > > > > 15320.364252=C2=A8 Call Trace:
> > > > > 15320.364252=C2=A8 Last Breaking-Event-Address:
> > > > > 15320.364253=C2=A8  =EF=BF=BD <0000000000000000>=C2=A8 Kernel sta=
ck overflow.
> > > > > 15320.364308=C2=A8 CPU: 0 Tainted: GF       W    3.9.2 #1
> > > > > 15320.364309=C2=A8 Process rhts-test-runne (pid: 625, task:
> > > > > 000000003dccc890,
> > > > > ksp: 0
> > > >=20
> > > > .... and there you go - a stack overflow. Your kernel stack size is
> > > > too small.
> > > >=20
> > > > I'd suggest that you need 16k stacks on s390 - IIRC every function
> > > > call has 128 byte stack frame, and there are call chains 70-80
> > > > functions deep in the storage stack...
> > > Hmm, I am unsure how to set to 16k stack there
> >=20
> > Are you build a 64 bit s390 kernel or a 32 bit kernel? 32 bit
> > kernels only have an 8k stack size, 64 bit kernels are 16k (see
> > arch/s390/Makefile).
> >=20
> > $ git grep STACK_SIZE arch/s390 |head -2
> > arch/s390/Makefile:STACK_SIZE   :=3D 8192
> > arch/s390/Makefile:STACK_SIZE   :=3D 16384
> >=20
> > As it is, the stack frame usage is worse than I thought:
> >=20
> > $ git grep STACK_FRAME_OVERHEAD arch/s390 |head -2
> > arch/s390/include/uapi/asm/ptrace.h:#define STACK_FRAME_OVERHEAD 96    =
  /*
> > size of minimum stack frame */
> > arch/s390/include/uapi/asm/ptrace.h:#define STACK_FRAME_OVERHEAD 160
> > /*
> > size of minimum stack frame */
> >=20
> > Overhead is 96 bytes for 32 bit and 160 bytes for 64 bit. So 16k
> > stack size is going to have big troubles with a 70-80 function deep
> > call chain.
> >=20
> > As for powerpc:
> >=20
> > arch/powerpc/include/asm/ppc_asm.h:#define STACKFRAMESIZE 256
> >=20
> > Yeah, same issue.
> >=20
> > But, seriously, these stack traces are meaningless to anyone not
> > familiar with s390 or power7 - they indicate a problem detected
> > in the idle loop, not where ever the stack overran.
> >=20
> > Can you please work with the s390/power7 people to obtain whatever
> > stack it was that overflowed, and we can go from there.
> >=20
> > Cheers,
> >=20
> > Dave.
> > --
> > Dave Chinner
> > david@fromorbit.com
> >=20
>=20

^ permalink raw reply	[flat|nested] 16+ messages in thread

end of thread, other threads:[~2013-05-24  3:33 UTC | newest]

Thread overview: 16+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
     [not found] <40971621.4497871.1369211701112.JavaMail.root@redhat.com>
2013-05-22  8:39 ` 3.9.2: xfstests triggered panic CAI Qian
2013-05-22  8:39   ` CAI Qian
2013-05-22  9:53   ` Dave Chinner
2013-05-22  9:53     ` Dave Chinner
2013-05-23  3:16     ` CAI Qian
2013-05-23  3:16       ` CAI Qian
2013-05-23  3:46       ` Dave Chinner
2013-05-23  3:46         ` Dave Chinner
2013-05-23  4:11         ` CAI Qian
2013-05-23  4:11           ` CAI Qian
2013-05-23  4:57         ` 3.9.2/3.9.3: stack overrun on s390x and ppc64 (WAS Re: 3.9.2: xfstests triggered panic) CAI Qian
2013-05-23  4:57           ` CAI Qian
2013-05-23  4:57           ` CAI Qian
2013-05-24  3:33           ` CAI Qian
2013-05-24  3:33             ` CAI Qian
2013-05-24  3:33             ` CAI Qian

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.