* ehea crash on boot
@ 2016-09-23 12:50 Denis Kirjanov
  2016-09-26  7:39 ` Mathieu Malaterre
  2016-10-10 11:51 ` Jan Stancek
  0 siblings, 2 replies; 6+ messages in thread
From: Denis Kirjanov @ 2016-09-23 12:50 UTC (permalink / raw)
  To: linuxppc-dev

Heh, another thing to debug :)

mm: Hashing failure ! EA=0xd000080080124040 access=0x800000000000000e
current=NetworkManager
trap=0x300 vsid=0x13d349c ssize=1 base psize=2 psize 2 pte=0xc0003bc0300301ae
mm: Hashing failure ! EA=0xd000080080124040 access=0x800000000000000e
current=NetworkManager
trap=0x300 vsid=0x13d349c ssize=1 base psize=2 psize 2 pte=0xc0003bc0300301ae
Unable to handle kernel paging request for data at address 0xd000080080124040
Faulting instruction address: 0xc0000000006f21a0
cpu 0x8: Vector: 300 (Data Access) at [c0000005a8b92b50]
pc: c0000000006f21a0: .ehea_create_cq+0x160/0x230
lr: c0000000006f2164: .ehea_create_cq+0x124/0x230
sp: c0000005a8b92dd0
msr: 8000000000009032
dar: d000080080124040
dsisr: 42000000
current = 0xc0000005a8b68200
paca = 0xc00000000ea94000 softe: 0 irq_happened: 0x01
pid = 6787, comm = NetworkManager
Linux version 4.8.0-rc6-00214-g4cea877 (kda@ps700) (gcc version 4.8.5
20150623 (Red Hat 4.8.5-4) (GCC) ) #1 SMP Fri Sep 23 15:01:08 MSK 2016
enter ? for help
[c0000005a8b92dd0] c0000000006f2140 .ehea_create_cq+0x100/0x230 (unreliable)
[c0000005a8b92e70] c0000000006ed448 .ehea_up+0x288/0xed0
[c0000005a8b92fe0] c0000000006ee314 .ehea_open+0x44/0x130
[c0000005a8b93070] c000000000812324 .__dev_open+0x154/0x220
[c0000005a8b93110] c000000000812734 .__dev_change_flags+0xd4/0x1e0
[c0000005a8b931b0] c00000000081286c .dev_change_flags+0x2c/0x80
[c0000005a8b93240] c000000000829f0c .do_setlink+0x37c/0xe50
[c0000005a8b933c0] c00000000082c884 .rtnl_newlink+0x5e4/0x9b0
[c0000005a8b936d0] c00000000082cd08 .rtnetlink_rcv_msg+0xb8/0x2f0
[c0000005a8b937a0] c00000000084e25c .netlink_rcv_skb+0x12c/0x150
[c0000005a8b93830] c000000000829458 .rtnetlink_rcv+0x38/0x60
[c0000005a8b938b0] c00000000084d814 .netlink_unicast+0x1e4/0x350
[c0000005a8b93960] c00000000084def8 .netlink_sendmsg+0x418/0x480
[c0000005a8b93a40] c0000000007defac .sock_sendmsg+0x2c/0x60
[c0000005a8b93ab0] c0000000007e0cbc .___sys_sendmsg+0x30c/0x320
[c0000005a8b93c90] c0000000007e21bc .__sys_sendmsg+0x4c/0xb0
[c0000005a8b93d80] c0000000007e2dec .SyS_socketcall+0x34c/0x3d0
[c0000005a8b93e30] c00000000000946c system_call+0x38/0x108
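A note for readers decoding the dump above: the `dsisr: 42000000` value combines two bits that the kernel defines in arch/powerpc/include/asm/reg.h, DSISR_NOHPTE (no translation found) and DSISR_ISSTORE (the access was a store). The sketch below reproduces that decoding. The struct and the helper name are made up for illustration and are not kernel interfaces.

```c
#include <stdbool.h>
#include <stdint.h>

/* Bit values as defined in arch/powerpc/include/asm/reg.h. */
#define DSISR_NOHPTE    0x40000000u /* no translation found */
#define DSISR_PROTFAULT 0x08000000u /* protection fault */
#define DSISR_ISSTORE   0x02000000u /* access was a store */

/* Hypothetical container for the decoded bits (not a kernel type). */
struct dsisr_info {
	bool no_hpte;
	bool prot_fault;
	bool is_store;
};

/* Hypothetical helper: split a DSISR value into the three flags above. */
static struct dsisr_info decode_dsisr(uint32_t dsisr)
{
	struct dsisr_info info = {
		.no_hpte    = (dsisr & DSISR_NOHPTE) != 0,
		.prot_fault = (dsisr & DSISR_PROTFAULT) != 0,
		.is_store   = (dsisr & DSISR_ISSTORE) != 0,
	};

	return info;
}
```

Calling decode_dsisr(0x42000000) sets no_hpte and is_store and leaves prot_fault clear, which fits the "Hashing failure" messages: a store for which no hash page table entry could be installed, rather than a permission problem.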


* Re: ehea crash on boot
  2016-09-23 12:50 ehea crash on boot Denis Kirjanov
@ 2016-09-26  7:39 ` Mathieu Malaterre
  2016-09-26 10:59   ` Denis Kirjanov
  2016-10-10 11:51 ` Jan Stancek
  1 sibling, 1 reply; 6+ messages in thread
From: Mathieu Malaterre @ 2016-09-26  7:39 UTC (permalink / raw)
  To: Denis Kirjanov; +Cc: linuxppc-dev

On Fri, Sep 23, 2016 at 2:50 PM, Denis Kirjanov <kda@linux-powerpc.org> wrote:
> Heh, another thing to debug :)

Can you turn UBSAN on for this?

-- 
Mathieu


* Re: ehea crash on boot
  2016-09-26  7:39 ` Mathieu Malaterre
@ 2016-09-26 10:59   ` Denis Kirjanov
  0 siblings, 0 replies; 6+ messages in thread
From: Denis Kirjanov @ 2016-09-26 10:59 UTC (permalink / raw)
  To: Mathieu Malaterre; +Cc: linuxppc-dev

On Monday, September 26, 2016, Mathieu Malaterre <
mathieu.malaterre@gmail.com> wrote:

> Can you turn UBSAN on for this ?


I'll get back to the problem and send a fix when I finish my trip.


> --
> Mathieu
>


* Re: ehea crash on boot
  2016-09-23 12:50 ehea crash on boot Denis Kirjanov
  2016-09-26  7:39 ` Mathieu Malaterre
@ 2016-10-10 11:51 ` Jan Stancek
  2016-10-11  5:46   ` Michael Ellerman
  1 sibling, 1 reply; 6+ messages in thread
From: Jan Stancek @ 2016-10-10 11:51 UTC (permalink / raw)
  To: Denis Kirjanov; +Cc: linuxppc-dev

Hi Denis / all,

Do you know if there is a patch or a lead for this problem? I seem
to be hitting the same Oops on a P730 LPAR when running 4.8 (see below),
but 4.7.7 looks OK.

Regards,
Jan

[    8.698424] IPv6: ADDRCONF(NETDEV_UP): eth0: link is not ready 
[    8.713373] IPv6: ADDRCONF(NETDEV_UP): eth1: link is not ready 
[    8.713940] mm: Hashing failure ! EA=0xd000080080004040 access=0x800000000000000e current=NetworkManager 
[    8.713949]     trap=0x300 vsid=0x13d349c ssize=1 base psize=2 psize 2 pte=0xc0003cc033e701ae 
[    8.713958] mm: Hashing failure ! EA=0xd000080080004040 access=0x800000000000000e current=NetworkManager 
[    8.713966]     trap=0x300 vsid=0x13d349c ssize=1 base psize=2 psize 2 pte=0xc0003cc033e701ae 
[    8.713979] Unable to handle kernel paging request for data at address 0xd000080080004040 
[    8.713985] Faulting instruction address: 0xd0000000011cc250 
[    8.713992] Oops: Kernel access of bad area, sig: 7 [#1] 
[    8.713996] SMP NR_CPUS=2048 NUMA pSeries 
[    8.714008] Modules linked in: sg uio_pdrv_genirq uio nfsd auth_rpcgss nfs_acl lockd grace sunrpc ip_tables xfs libcrc32c sd_mod ibmvscsi scsi_transport_srp ibmveth ehea dm_mirror dm_region_hash dm_log dm_mod 
[    8.714063] CPU: 2 PID: 1148 Comm: NetworkManager Not tainted 4.8.0-1.el7.test.ppc64.debug #1 
[    8.714072] task: c0000000065e2080 task.stack: c000000006668000 
[    8.714078] NIP: d0000000011cc250 LR: d0000000011cc118 CTR: 000000000042c120 
[    8.714086] REGS: c00000000666ab00 TRAP: 0300   Not tainted  (4.8.0-1.el7.test.ppc64.debug) 
[    8.714092] MSR: 8000000000009032 <SF,EE,ME,IR,DR,RI>  CR: 24288442  XER: 00000020 
[    8.714120] CFAR: c0000000000087d0 DAR: d000080080004040 DSISR: 42000000 SOFTE: 1  
GPR00: d0000000011cc118 c00000000666ad80 d0000000011dbdd8 c000000006327f80  
GPR04: 0000000000000000 c0000000b0800000 0000000000029000 0000000000028000  
GPR08: c0000000b0800000 0000000000000000 d000080080004000 00000000ffffd953  
GPR12: 0000000080000001 c00000000ea61200 0000000000000000 0000000000000000  
GPR16: 00000000000007fe 0000000000000000 0000000000000001 0000000000000000  
GPR20: c0000000b53ecbd0 c0000000b53ecb00 c0000000b53ec1e8 c0000000b53ec1d0  
GPR24: c0000000b53ec1b8 c0000000b53ec200 0000000000000000 0000000000000015  
GPR28: 00000000000009fd c0000000bbb59418 0000000000000028 c000000006327f80  
[    8.714254] NIP [d0000000011cc250] .ehea_create_cq+0x280/0x340 [ehea] 
[    8.714263] LR [d0000000011cc118] .ehea_create_cq+0x148/0x340 [ehea] 
[    8.714270] Call Trace: 
[    8.714278] [c00000000666ad80] [d0000000011cc118] .ehea_create_cq+0x148/0x340 [ehea] (unreliable) 
[    8.714292] [c00000000666ae30] [d0000000011c5e28] .ehea_up+0x258/0x1200 [ehea] 
[    8.714304] [c00000000666afa0] [d0000000011c6e14] .ehea_open+0x44/0x1a0 [ehea] 
[    8.714316] [c00000000666b030] [c0000000009bc4c4] .__dev_open+0x164/0x310 
[    8.714328] [c00000000666b0d0] [c0000000009c6998] .__dev_change_flags+0x158/0x4f0 
[    8.714339] [c00000000666b180] [c0000000009c6d5c] .dev_change_flags+0x2c/0x220 
[    8.714349] [c00000000666b220] [c0000000009e2d3c] .do_setlink+0x38c/0xef0 
[    8.714359] [c00000000666b3a0] [c0000000009e65cc] .rtnl_newlink+0x97c/0xb10 
[    8.714369] [c00000000666b6b0] [c0000000009e4ae4] .rtnetlink_rcv_msg+0xc4/0x380 
[    8.714379] [c00000000666b7a0] [c000000000a1c05c] .netlink_rcv_skb+0x12c/0x150 
[    8.714388] [c00000000666b830] [c0000000009e1b68] .rtnetlink_rcv+0x38/0x60 
[    8.714396] [c00000000666b8b0] [c000000000a1bb74] .netlink_unicast+0x554/0x6b0 
[    8.714405] [c00000000666b990] [c000000000a1cbcc] .netlink_sendmsg+0x41c/0x490 
[    8.714415] [c00000000666ba70] [c000000000986e18] .___sys_sendmsg+0x278/0x370 
[    8.714425] [c00000000666bc50] [c0000000009892d4] .SyS_sendmsg+0xc4/0x130 
[    8.714436] [c00000000666bd50] [c00000000098a180] .SyS_socketcall+0x3d0/0x4e0 
[    8.714448] [c00000000666be30] [c000000000009590] system_call+0x38/0xec 
[    8.714455] Instruction dump: 
[    8.714462] 38a00001 4bffe7fd 60000000 7fe3fb78 48003081 e8410028 38600000 48000030  
[    8.714484] e95f0038 39200000 7fe3fb78 f93f0010 <f92a0040> 3920ffff 79290004 e95f0038  
[    8.714506] ---[ end trace fe4fbc224578dd0c ]--- 
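For readers following the dump above: the word in angle brackets in the instruction dump, f92a0040, is the instruction at NIP. It is a DS-form store doubleword, `std r9,64(r10)`. With GPR10 = d000080080004000 from the register dump, the target address is d000080080004040, which matches DAR. The sketch below shows the field extraction. The struct and function names are invented for illustration, and the decoder handles only `std` with a non-zero base register.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical container for a decoded DS-form store doubleword. */
struct ds_store {
	unsigned int rs; /* source register number */
	unsigned int ra; /* base register number */
	int32_t disp;    /* signed byte displacement */
};

/*
 * Decode "std RS,DS(RA)": primary opcode 62 with the low two bits
 * (the extended opcode) equal to 0. Returns false for anything else.
 */
static bool decode_std(uint32_t insn, struct ds_store *out)
{
	int32_t disp;

	if ((insn >> 26) != 62 || (insn & 0x3) != 0)
		return false;

	out->rs = (insn >> 21) & 0x1f;
	out->ra = (insn >> 16) & 0x1f;

	/* Low 16 bits with the opcode bits cleared, then sign-extended. */
	disp = (int32_t)(insn & 0xfffc);
	if (disp & 0x8000)
		disp -= 0x10000;
	out->disp = disp;

	return true;
}

/*
 * Effective address of the store, given the value held in RA.
 * Simplification: the RA == 0 case (base of zero) is not modelled.
 */
static uint64_t store_ea(const struct ds_store *st, uint64_t ra_value)
{
	return ra_value + (uint64_t)(int64_t)st->disp;
}
```

Decoding 0xf92a0040 gives rs = 9, ra = 10 and disp = 64. GPR09 is zero in the register dump, so the faulting access is a store of zero to offset 0x40 of whatever mapping r10 points at.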


* Re: ehea crash on boot
  2016-10-10 11:51 ` Jan Stancek
@ 2016-10-11  5:46   ` Michael Ellerman
  2016-10-11  7:45     ` Jan Stancek
  0 siblings, 1 reply; 6+ messages in thread
From: Michael Ellerman @ 2016-10-11  5:46 UTC (permalink / raw)
  To: Jan Stancek, Denis Kirjanov; +Cc: linuxppc-dev

Jan Stancek <jstancek@redhat.com> writes:

> Hi Denis / all,
>
> Do you know if there is a patch or lead for this problem? I seem
> to be hitting same Oops with P730 lpar when running 4.8 (see below),
> but 4.7.7 looks OK.

Does this fix it?

cheers


diff --git a/arch/powerpc/mm/hash_utils_64.c b/arch/powerpc/mm/hash_utils_64.c
index 4cebc31e53de..4e83d872872d 100644
--- a/arch/powerpc/mm/hash_utils_64.c
+++ b/arch/powerpc/mm/hash_utils_64.c
@@ -526,7 +526,7 @@ static bool might_have_hea(void)
 	 */
 #ifdef CONFIG_IBMEBUS
 	return !cpu_has_feature(CPU_FTR_ARCH_207S) &&
-		!firmware_has_feature(FW_FEATURE_SPLPAR);
+		firmware_has_feature(FW_FEATURE_SPLPAR);
 #else
 	return false;
 #endif
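To make the one-character change above easier to read, here is a small model of the return expression before and after the patch, with the two feature tests replaced by plain booleans. The function names are stand-ins for illustration, not kernel code. The sketch assumes the affected partitions lack CPU_FTR_ARCH_207S and have FW_FEATURE_SPLPAR set, which the thread does not state explicitly.

```c
#include <stdbool.h>

/*
 * Simplified models of the return expression in might_have_hea().
 * arch_207s stands for cpu_has_feature(CPU_FTR_ARCH_207S),
 * splpar stands for firmware_has_feature(FW_FEATURE_SPLPAR).
 * These are illustrative stand-ins, not kernel functions.
 */
static bool might_have_hea_before(bool arch_207s, bool splpar)
{
	return !arch_207s && !splpar;
}

static bool might_have_hea_after(bool arch_207s, bool splpar)
{
	return !arch_207s && splpar;
}
```

Under the stated assumption (arch_207s false, splpar true), the old expression returns false and the patched one returns true. When arch_207s is true, both versions return false, so the patch changes nothing there.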


* Re: ehea crash on boot
  2016-10-11  5:46   ` Michael Ellerman
@ 2016-10-11  7:45     ` Jan Stancek
  0 siblings, 0 replies; 6+ messages in thread
From: Jan Stancek @ 2016-10-11  7:45 UTC (permalink / raw)
  To: Michael Ellerman; +Cc: Denis Kirjanov, linuxppc-dev





----- Original Message -----
> From: "Michael Ellerman" <mpe@ellerman.id.au>
> To: "Jan Stancek" <jstancek@redhat.com>, "Denis Kirjanov" <kda@linux-powerpc.org>
> Cc: linuxppc-dev@lists.ozlabs.org
> Sent: Tuesday, 11 October, 2016 7:46:31 AM
> Subject: Re: ehea crash on boot
> 
> Jan Stancek <jstancek@redhat.com> writes:
> 
> > Hi Denis / all,
> >
> > Do you know if there is a patch or lead for this problem? I seem
> > to be hitting same Oops with P730 lpar when running 4.8 (see below),
> > but 4.7.7 looks OK.
> 
> Does this fix it?

Yes, it does. dmesg looks clean and network is up.

Regards,
Jan

