* Issues with 4.4-rt2 with arm64
@ 2016-01-20 14:05 Jaggi, Manish
  2016-01-20 15:00 ` Thomas Gleixner
  0 siblings, 1 reply; 2+ messages in thread
From: Jaggi, Manish @ 2016-01-20 14:05 UTC
  To: linux-rt-users

Hi All,
I have found some issues running the 4.4 kernel with the 4.4-rt2 patch set on a Cavium ThunderX with 48 cores.

a) With 4 cores there is no issue.

b) With 48 cores the boot hangs, SATA doesn't mount, and the network VFs have issues. Probably the threaded IRQs are not being scheduled:
[   37.354634] thunder-nicvf 0002:01:00.4 eth3: PF didn't ack to mbox msg 7 from VF3
[   40.558635] thunder-nicvf 0002:01:00.4 eth3: PF didn't ack to mbox msg 5 from VF3
[   43.762640] thunder-nicvf 0002:01:00.4 eth3: PF didn't ack to mbox msg 14 from VF3
[   46.966631] thunder-nicvf 0002:01:00.4 eth3: PF didn't ack to mbox msg 7 from VF3
[   50.170671] thunder-nicvf 0002:01:00.4 eth3: PF didn't ack to mbox msg 5 from VF3
[   53.374621] thunder-nicvf 0002:01:00.4 eth3: PF didn't ack to mbox msg 14 from VF3
[   56.578615] thunder-nicvf 0002:01:00.4 eth3: PF didn't ack to mbox msg 7 from VF3
[   59.782632] thunder-nicvf 0002:01:00.4 eth3: PF didn't ack to mbox msg 5 from VF3
[   62.986630] thunder-nicvf 0002:01:00.4 eth3: PF didn't ack to mbox msg 14 from VF3
[   66.190629] thunder-nicvf 0002:01:00.4 eth3: PF didn't ack to mbox msg 7 from VF3
[   69.394626] thunder-nicvf 0002:01:00.4 eth3: PF didn't ack to mbox msg 5 from VF3
[   72.598633] thunder-nicvf 0002:01:00.4 eth3: PF didn't ack to mbox msg 14 from VF3
[   75.802680] thunder-nicvf 0002:01:00.4 eth3: PF didn't ack to mbox msg 7 from VF3

c) With NO_HZ_IDLE disabled and PERIODIC enabled, no issues are observed during kernel boot. There are still random issues, e.g. mounting a partition throws errors (run fsck).

d) With the sleep-while-atomic debug config enabled, I got this call trace:
[   19.506039] BUG: sleeping function called from invalid context at kernel/locking/rtmutex.c:930
[   19.506043] in_atomic(): 1, irqs_disabled(): 0, pid: 733, name: dhclient
[   19.506045] 2 locks held by dhclient/733:
[   19.506065]  #0:  (sk_lock-AF_INET){+.+...}, at: [<ffff800000882c14>] sock_setsockopt+0xbc/0x998
[   19.506079]  #1:  (jump_label_mutex){+.+...}, at: [<ffff8000001f53a0>] static_key_slow_inc+0xb0/0x110
[   19.506089] Preemption disabled at:[<ffff80000009aa04>] aarch64_insn_patch_text_nosync+0x1c/0x48
[   19.506090] 
[   19.506096] CPU: 0 PID: 733 Comm: dhclient Not tainted 4.4.0-rt2-00116-g02e26fb-dirty #20
[   19.506098] Hardware name: www.cavium.com ThunderX CRB1S/ThunderX CRB1S, BIOS 0.3 Dec  3 2015

[   19.506100] Call trace:
[   19.506104] [<ffff800000097b30>] dump_backtrace+0x0/0x160
[   19.506107] [<ffff800000097cb4>] show_stack+0x24/0x30
[   19.506112] [<ffff80000059c538>] dump_stack+0x88/0xa8
[   19.506120] [<ffff800000103088>] ___might_sleep+0x240/0x258
[   19.506130] [<ffff8000009c72a8>] rt_spin_lock+0x30/0x68
[   19.506133] [<ffff80000009a7e4>] __aarch64_insn_write+0x34/0x108
[   19.506137] [<ffff80000009aa04>] aarch64_insn_patch_text_nosync+0x1c/0x48
[   19.506140] [<ffff80000009ac2c>] aarch64_insn_patch_text+0xc4/0xd8
[   19.506146] [<ffff8000000a3400>] arch_jump_label_transform+0x70/0x80
[   19.506151] [<ffff8000001f4d8c>] __jump_label_update+0x84/0xa0
[   19.506154] [<ffff8000001f4e50>] jump_label_update+0xa8/0xe0
[   19.506157] [<ffff8000001f53dc>] static_key_slow_inc+0xec/0x110
[   19.506162] [<ffff800000896260>] net_enable_timestamp+0x80/0x90
[   19.506166] [<ffff800000882b4c>] sock_enable_timestamp+0x9c/0xa8
[   19.506169] [<ffff800000882ec8>] sock_setsockopt+0x370/0x998
[   19.506176] [<ffff80000087cae8>] SyS_setsockopt+0xd8/0xe8
[   19.506180] [<ffff800000093a30>] el0_svc_naked+0x24/0x28
[   20.634687] 
[   20.940909] ===============================

PS: Earlier, the 4.1 RT patch worked without these issues on the same hardware.

 Regards, 
Manish Jaggi    


* Re: Issues with 4.4-rt2 with arm64
  2016-01-20 14:05 Issues with 4.4-rt2 with arm64 Jaggi, Manish
@ 2016-01-20 15:00 ` Thomas Gleixner
  0 siblings, 0 replies; 2+ messages in thread
From: Thomas Gleixner @ 2016-01-20 15:00 UTC
  To: Jaggi, Manish; +Cc: linux-rt-users

On Wed, 20 Jan 2016, Jaggi, Manish wrote:

> b) With 48 cores the boot hangs, SATA doesn't mount, and the network VFs have
>    issues. Probably the threaded IRQs are not being scheduled:
> [   37.354634] thunder-nicvf 0002:01:00.4 eth3: PF didn't ack to mbox msg 7 from VF3
... 
> c) With NO_HZ_IDLE disabled and PERIODIC enabled, no issues are observed
> during kernel boot. There are still random issues, e.g. mounting a partition
> throws errors (run fsck).

So is it a NOHZ idle issue or a HIGHRES/PERIODIC issue?
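
One way to narrow this question down is to record which tick-related options each tested kernel was actually built with. The sketch below is a user-space helper, not part of the thread: the function and struct names are hypothetical, and it assumes the question refers to the Kconfig symbols CONFIG_HZ_PERIODIC, CONFIG_NO_HZ_IDLE and CONFIG_HIGH_RES_TIMERS. It scans a .config-style text buffer that the caller has already read from wherever the build keeps it.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <string.h>

/*
 * Hypothetical helper: return true if some line of the .config-style
 * buffer reads exactly "<symbol>=y". Comment lines such as
 * "# CONFIG_FOO is not set" do not match.
 */
bool config_is_enabled(const char *config_text, const char *symbol)
{
	assert(config_text != NULL && symbol != NULL);

	size_t sym_len = strlen(symbol);
	const char *line = config_text;

	while (*line != '\0') {
		const char *eol = strchr(line, '\n');
		size_t line_len = eol ? (size_t)(eol - line) : strlen(line);

		/* The whole line must be "<symbol>=y", nothing more. */
		if (line_len == sym_len + 2 &&
		    strncmp(line, symbol, sym_len) == 0 &&
		    line[sym_len] == '=' && line[sym_len + 1] == 'y')
			return true;

		if (eol == NULL)
			break;
		line = eol + 1;
	}
	return false;
}

/* Tick-related settings assumed to be relevant to the question. */
struct tick_config {
	bool hz_periodic;     /* CONFIG_HZ_PERIODIC */
	bool no_hz_idle;      /* CONFIG_NO_HZ_IDLE */
	bool high_res_timers; /* CONFIG_HIGH_RES_TIMERS */
};

/* Collect the three settings from one config buffer. */
struct tick_config read_tick_config(const char *config_text)
{
	struct tick_config tc = {
		.hz_periodic = config_is_enabled(config_text, "CONFIG_HZ_PERIODIC"),
		.no_hz_idle = config_is_enabled(config_text, "CONFIG_NO_HZ_IDLE"),
		.high_res_timers = config_is_enabled(config_text, "CONFIG_HIGH_RES_TIMERS"),
	};
	return tc;
}
```

Calling read_tick_config() on the configs of the hanging and the booting kernel shows whether the two builds differ only in the NOHZ setting or also in high-resolution timer support, which is the distinction the question asks about.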

> d) Enabling sleep while atomic debug config, got the call traces

> [   19.506039] BUG: sleeping function called from invalid context at kernel/locking/rtmutex.c:930

> [   19.506130] [<ffff8000009c72a8>] rt_spin_lock+0x30/0x68
> [   19.506133] [<ffff80000009a7e4>] __aarch64_insn_write+0x34/0x108

I have no idea how you manage this:

__aarch64_insn_write()
{
	raw_spin_lock_irqsave(&patch_lock, flags);

That is a raw lock. So I have no idea how that might end up in the sleeping
spinlocks.

Thanks,

	tglx
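
The distinction drawn in the reply can be modelled in user space. On a PREEMPT_RT kernel, spinlock_t is replaced by a sleeping lock (the rt_spin_lock seen in the trace), while raw_spinlock_t stays a true spinning lock. The sketch below is only a simplified model with hypothetical names, not kernel code: it reduces the "sleeping function called from invalid context" check to "the lock may sleep and the caller is atomic or has interrupts disabled".

```c
#include <assert.h>
#include <stdbool.h>

/* Lock flavours discussed in the thread. */
enum lock_kind {
	LOCK_RAW_SPINLOCK, /* raw_spinlock_t: always spins, never sleeps */
	LOCK_SPINLOCK,     /* spinlock_t: becomes a sleeping lock on PREEMPT_RT */
};

/* True if acquiring this kind of lock may sleep in the given configuration. */
bool lock_may_sleep(enum lock_kind kind, bool preempt_rt)
{
	assert(kind == LOCK_RAW_SPINLOCK || kind == LOCK_SPINLOCK);
	return preempt_rt && kind == LOCK_SPINLOCK;
}

/*
 * Simplified stand-in for the might-sleep debug check: a warning is
 * expected only when a lock that may sleep is taken while preemption
 * or interrupts are disabled.
 */
bool would_warn_sleep_in_atomic(enum lock_kind kind, bool preempt_rt,
				bool in_atomic, bool irqs_disabled)
{
	return lock_may_sleep(kind, preempt_rt) && (in_atomic || irqs_disabled);
}
```

With the values from the report (in_atomic(): 1, irqs_disabled(): 0) on an RT kernel, the model warns for LOCK_SPINLOCK and stays silent for LOCK_RAW_SPINLOCK. That is why a raw patch_lock should not produce this trace; the model does not explain how the reported kernel ended up in rt_spin_lock.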
