All of lore.kernel.org
 help / color / mirror / Atom feed
* Issue booting Niagara T1 after commit 87fa05aeb3
@ 2013-11-19 12:21 Fabio M. Di Nitto
  2013-12-12 12:33 ` Raúl Porcel
                   ` (3 more replies)
  0 siblings, 4 replies; 5+ messages in thread
From: Fabio M. Di Nitto @ 2013-11-19 12:21 UTC (permalink / raw)
  To: sparclinux

Hi all,

yes I know.. vintage hardware :) It´s running Debian unstable updated to
approx a week ago (yay for tons of rebuilds to bisect ;)).

fabbione@sunfire:~/linux-2.6$ gcc --version
gcc (Debian 4.6.4-5) 4.6.4

kernel config is from debian, applied to the various bisect done on
Linus tree.

sunfire:~# cat /proc/cpuinfo
cpu             : UltraSparc T1 (Niagara)
fpu             : UltraSparc T1 integrated FPU
pmu             : niagara
prom            : OBP 4.28.9 2008/06/30 21:26
type            : sun4v
ncpus probed    : 24
..

commit 87fa05aeb3a5e8e21b1a5510eef6983650eff092
Author: Sam Ravnborg <sam@ravnborg.org>
Date:   Thu Apr 11 21:38:50 2013 +0200

    sparc: Use generic idle loop

    Add generic cpu_idle support

appears to be culprit and I can´t boot any kernel after that. Reverting
the commit makes the machine boot again (tested up to commit 4fbf888acc
from Linus tree).

Sorry I really don´t know that part of the code well enough to propose a
fix, but I can either provide access to the machine or test debugging
patches.

Thanks
Fabio

Few extra info:

with this commit applied (head to the commit):

[SNIP]
[   68.089495] scsi0 : Emulex LPe1150 PCIe Fibre Channel Adapter  on PCI
bus 08 device 00 irq 23
[   68.089495] scsi0 : Emulex LPe1150 PCIe Fibre Channel Adapter  on PCI
bus 08 device 00 irq 23
[   89.429041] INFO: rcu_sched detected stalls on CPUs/tasks:[
89.429041] INFO: rcu_sched detected stalls on CPUs/tasks: { { 3 3 10 10
15 15} } (detected by 0, tS42 jiffies, g\x18446744073709551328,
c\x18446744073709551327, q!64)
(detected by 0, tS42 jiffies, g\x18446744073709551328,
c\x18446744073709551327, q!64)
[   89.431437] * CPU[  0]: TSTATE[0000000080001603]
TPC[000000000042b9d4] TNPC[000000000042b9d8] TASK[swapper/0:0]
[   89.431437] * CPU[  0]: TSTATE[0000000080001603]
TPC[000000000042b9d4] TNPC[000000000042b9d8] TASK[swapper/0:0]
[   89.431462] BUG: soft lockup - CPU#3 stuck for 23s! [swapper/3:0]
[   89.431462] BUG: soft lockup - CPU#3 stuck for 23s! [swapper/3:0]
[   89.431470] BUG: soft lockup - CPU#1 stuck for 23s! [swapper/1:0]
[   89.431470] BUG: soft lockup - CPU#1 stuck for 23s! [swapper/1:0]
[   89.431479] BUG: soft lockup - CPU#2 stuck for 23s! [swapper/2:0]
[   89.431479] BUG: soft lockup - CPU#2 stuck for 23s! [swapper/2:0]
[   89.431484] Modules linked in:
[SNIP]

Most kernels in between that commit to 3.12:

[SNIP]
[   68.392210] UDP hash table entries: 4096 (order: 4, 131072 bytes)
[   68.393319] UDP-Lite hash table entries: 4096 (order: 4, 131072 bytes)
[   68.395425] NET: Registered protocol family 1
[   68.396008] pci 0001:05:02.0: Activating ISA DMA hang workarounds
-- no more output --

With Linus tree up to commit 4fbf888accb39 (didn´t test yet newer trees):

[   60.770765] scsi2 : Emulex LPe1150 PCIe Fibre Channel Adapter  on PCI
bus 08 device 00 irq 23
 [   81.923914] INFO: rcu_sched detected stalls on CPUs/tasks: {}
(detected by 4, tR92 jiffies, g\x18446744073709551336,
c\x18446744073709551335,5)
[   81.924215] INFO: Stall ended before state dump start
[  145.419184] INFO: rcu_sched detected stalls on CPUs/tasks: {}
(detected by 0, t!166 jiffies, g\x18446744073709551336,
c\x18446744073709551335,7)
[  145.419482] INFO: Stall ended before state dump start



^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: Issue booting Niagara T1 after commit 87fa05aeb3
  2013-11-19 12:21 Issue booting Niagara T1 after commit 87fa05aeb3 Fabio M. Di Nitto
@ 2013-12-12 12:33 ` Raúl Porcel
  2013-12-13 11:46 ` Raúl Porcel
                   ` (2 subsequent siblings)
  3 siblings, 0 replies; 5+ messages in thread
From: Raúl Porcel @ 2013-12-12 12:33 UTC (permalink / raw)
  To: sparclinux

Same here, I tried with the default config.

On 11/19/13 13:21, Fabio M. Di Nitto wrote:
> Hi all,
> 
> yes I know.. vintage hardware :) It´s running Debian unstable updated to
> approx a week ago (yay for tons of rebuilds to bisect ;)).
> 
> fabbione@sunfire:~/linux-2.6$ gcc --version
> gcc (Debian 4.6.4-5) 4.6.4
> 
> kernel config is from debian, applied to the various bisect done on
> Linus tree.
> 
> sunfire:~# cat /proc/cpuinfo
> cpu             : UltraSparc T1 (Niagara)
> fpu             : UltraSparc T1 integrated FPU
> pmu             : niagara
> prom            : OBP 4.28.9 2008/06/30 21:26
> type            : sun4v
> ncpus probed    : 24
> ..
> 
> commit 87fa05aeb3a5e8e21b1a5510eef6983650eff092
> Author: Sam Ravnborg <sam@ravnborg.org>
> Date:   Thu Apr 11 21:38:50 2013 +0200
> 
>     sparc: Use generic idle loop
> 
>     Add generic cpu_idle support
> 
> appears to be culprit and I can´t boot any kernel after that. Reverting
> the commit makes the machine boot again (tested up to commit 4fbf888acc
> from Linus tree).
> 
> Sorry I really don´t know that part of the code well enough to propose a
> fix, but I can either provide access to the machine or test debugging
> patches.
> 
> Thanks
> Fabio
> 
> Few extra info:
> 
> with this commit applied (head to the commit):
> 
> [SNIP]
> [   68.089495] scsi0 : Emulex LPe1150 PCIe Fibre Channel Adapter  on PCI
> bus 08 device 00 irq 23
> [   68.089495] scsi0 : Emulex LPe1150 PCIe Fibre Channel Adapter  on PCI
> bus 08 device 00 irq 23
> [   89.429041] INFO: rcu_sched detected stalls on CPUs/tasks:[
> 89.429041] INFO: rcu_sched detected stalls on CPUs/tasks: { { 3 3 10 10
> 15 15} } (detected by 0, tS42 jiffies, g\x18446744073709551328,
> c\x18446744073709551327, q!64)
> (detected by 0, tS42 jiffies, g\x18446744073709551328,
> c\x18446744073709551327, q!64)
> [   89.431437] * CPU[  0]: TSTATE[0000000080001603]
> TPC[000000000042b9d4] TNPC[000000000042b9d8] TASK[swapper/0:0]
> [   89.431437] * CPU[  0]: TSTATE[0000000080001603]
> TPC[000000000042b9d4] TNPC[000000000042b9d8] TASK[swapper/0:0]
> [   89.431462] BUG: soft lockup - CPU#3 stuck for 23s! [swapper/3:0]
> [   89.431462] BUG: soft lockup - CPU#3 stuck for 23s! [swapper/3:0]
> [   89.431470] BUG: soft lockup - CPU#1 stuck for 23s! [swapper/1:0]
> [   89.431470] BUG: soft lockup - CPU#1 stuck for 23s! [swapper/1:0]
> [   89.431479] BUG: soft lockup - CPU#2 stuck for 23s! [swapper/2:0]
> [   89.431479] BUG: soft lockup - CPU#2 stuck for 23s! [swapper/2:0]
> [   89.431484] Modules linked in:
> [SNIP]
> 
> Most kernels in between that commit to 3.12:
> 
> [SNIP]
> [   68.392210] UDP hash table entries: 4096 (order: 4, 131072 bytes)
> [   68.393319] UDP-Lite hash table entries: 4096 (order: 4, 131072 bytes)
> [   68.395425] NET: Registered protocol family 1
> [   68.396008] pci 0001:05:02.0: Activating ISA DMA hang workarounds
> -- no more output --
> 
> With Linus tree up to commit 4fbf888accb39 (didn´t test yet newer trees):
> 
> [   60.770765] scsi2 : Emulex LPe1150 PCIe Fibre Channel Adapter  on PCI
> bus 08 device 00 irq 23
>  [   81.923914] INFO: rcu_sched detected stalls on CPUs/tasks: {}
> (detected by 4, tR92 jiffies, g\x18446744073709551336,
> c\x18446744073709551335,5)
> [   81.924215] INFO: Stall ended before state dump start
> [  145.419184] INFO: rcu_sched detected stalls on CPUs/tasks: {}
> (detected by 0, t!166 jiffies, g\x18446744073709551336,
> c\x18446744073709551335,7)
> [  145.419482] INFO: Stall ended before state dump start
> 
> 
> --
> To unsubscribe from this list: send the line "unsubscribe sparclinux" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
> 


^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: Issue booting Niagara T1 after commit 87fa05aeb3
  2013-11-19 12:21 Issue booting Niagara T1 after commit 87fa05aeb3 Fabio M. Di Nitto
  2013-12-12 12:33 ` Raúl Porcel
@ 2013-12-13 11:46 ` Raúl Porcel
  2014-02-10 21:23 ` [resend] " Fabio M. Di Nitto
  2014-04-04 11:50 ` Raúl Porcel
  3 siblings, 0 replies; 5+ messages in thread
From: Raúl Porcel @ 2013-12-13 11:46 UTC (permalink / raw)
  To: sparclinux

Hello,

On 12/12/13 14:09, Kirill Tkhai wrote:
> Hi,
> 
> 12.12.2013, 16:33, "Raúl Porcel" <armin76@gentoo.org>:
>> Same here, I tried with the default config.
> 
> just hypothesis. Does "nohlt" command line option help anyway?
> 
> Does anything change if you are swapping local_irq_enable() and
> preempt_disable() in smp_callin() (file arch/sparc/kernel/smp_64.c):

Tried both of the suggestions, none worked.

Thanks


^ permalink raw reply	[flat|nested] 5+ messages in thread

* [resend] Re: Issue booting Niagara T1 after commit 87fa05aeb3
  2013-11-19 12:21 Issue booting Niagara T1 after commit 87fa05aeb3 Fabio M. Di Nitto
  2013-12-12 12:33 ` Raúl Porcel
  2013-12-13 11:46 ` Raúl Porcel
@ 2014-02-10 21:23 ` Fabio M. Di Nitto
  2014-04-04 11:50 ` Raúl Porcel
  3 siblings, 0 replies; 5+ messages in thread
From: Fabio M. Di Nitto @ 2014-02-10 21:23 UTC (permalink / raw)
  To: sparclinux

HI all,

another user has reported a similar issue. Anything that I can help with
to have a fix?

Thanks
Fabio

On 11/19/2013 01:21 PM, Fabio M. Di Nitto wrote:
> Hi all,
> 
> yes I know.. vintage hardware :) It´s running Debian unstable updated to
> approx a week ago (yay for tons of rebuilds to bisect ;)).
> 
> fabbione@sunfire:~/linux-2.6$ gcc --version
> gcc (Debian 4.6.4-5) 4.6.4
> 
> kernel config is from debian, applied to the various bisect done on
> Linus tree.
> 
> sunfire:~# cat /proc/cpuinfo
> cpu             : UltraSparc T1 (Niagara)
> fpu             : UltraSparc T1 integrated FPU
> pmu             : niagara
> prom            : OBP 4.28.9 2008/06/30 21:26
> type            : sun4v
> ncpus probed    : 24
> ..
> 
> commit 87fa05aeb3a5e8e21b1a5510eef6983650eff092
> Author: Sam Ravnborg <sam@ravnborg.org>
> Date:   Thu Apr 11 21:38:50 2013 +0200
> 
>     sparc: Use generic idle loop
> 
>     Add generic cpu_idle support
> 
> appears to be culprit and I can´t boot any kernel after that. Reverting
> the commit makes the machine boot again (tested up to commit 4fbf888acc
> from Linus tree).
> 
> Sorry I really don´t know that part of the code well enough to propose a
> fix, but I can either provide access to the machine or test debugging
> patches.
> 
> Thanks
> Fabio
> 
> Few extra info:
> 
> with this commit applied (head to the commit):
> 
> [SNIP]
> [   68.089495] scsi0 : Emulex LPe1150 PCIe Fibre Channel Adapter  on PCI
> bus 08 device 00 irq 23
> [   68.089495] scsi0 : Emulex LPe1150 PCIe Fibre Channel Adapter  on PCI
> bus 08 device 00 irq 23
> [   89.429041] INFO: rcu_sched detected stalls on CPUs/tasks:[
> 89.429041] INFO: rcu_sched detected stalls on CPUs/tasks: { { 3 3 10 10
> 15 15} } (detected by 0, tS42 jiffies, g\x18446744073709551328,
> c\x18446744073709551327, q!64)
> (detected by 0, tS42 jiffies, g\x18446744073709551328,
> c\x18446744073709551327, q!64)
> [   89.431437] * CPU[  0]: TSTATE[0000000080001603]
> TPC[000000000042b9d4] TNPC[000000000042b9d8] TASK[swapper/0:0]
> [   89.431437] * CPU[  0]: TSTATE[0000000080001603]
> TPC[000000000042b9d4] TNPC[000000000042b9d8] TASK[swapper/0:0]
> [   89.431462] BUG: soft lockup - CPU#3 stuck for 23s! [swapper/3:0]
> [   89.431462] BUG: soft lockup - CPU#3 stuck for 23s! [swapper/3:0]
> [   89.431470] BUG: soft lockup - CPU#1 stuck for 23s! [swapper/1:0]
> [   89.431470] BUG: soft lockup - CPU#1 stuck for 23s! [swapper/1:0]
> [   89.431479] BUG: soft lockup - CPU#2 stuck for 23s! [swapper/2:0]
> [   89.431479] BUG: soft lockup - CPU#2 stuck for 23s! [swapper/2:0]
> [   89.431484] Modules linked in:
> [SNIP]
> 
> Most kernels in between that commit to 3.12:
> 
> [SNIP]
> [   68.392210] UDP hash table entries: 4096 (order: 4, 131072 bytes)
> [   68.393319] UDP-Lite hash table entries: 4096 (order: 4, 131072 bytes)
> [   68.395425] NET: Registered protocol family 1
> [   68.396008] pci 0001:05:02.0: Activating ISA DMA hang workarounds
> -- no more output --
> 
> With Linus tree up to commit 4fbf888accb39 (didn´t test yet newer trees):
> 
> [   60.770765] scsi2 : Emulex LPe1150 PCIe Fibre Channel Adapter  on PCI
> bus 08 device 00 irq 23
>  [   81.923914] INFO: rcu_sched detected stalls on CPUs/tasks: {}
> (detected by 4, tR92 jiffies, g\x18446744073709551336,
> c\x18446744073709551335,5)
> [   81.924215] INFO: Stall ended before state dump start
> [  145.419184] INFO: rcu_sched detected stalls on CPUs/tasks: {}
> (detected by 0, t!166 jiffies, g\x18446744073709551336,
> c\x18446744073709551335,7)
> [  145.419482] INFO: Stall ended before state dump start
> 
> 


^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: [resend] Re: Issue booting Niagara T1 after commit 87fa05aeb3
  2013-11-19 12:21 Issue booting Niagara T1 after commit 87fa05aeb3 Fabio M. Di Nitto
                   ` (2 preceding siblings ...)
  2014-02-10 21:23 ` [resend] " Fabio M. Di Nitto
@ 2014-04-04 11:50 ` Raúl Porcel
  3 siblings, 0 replies; 5+ messages in thread
From: Raúl Porcel @ 2014-04-04 11:50 UTC (permalink / raw)
  To: sparclinux

Hi,

Just for the record, commit b3042d609e30e6144024801c89be3925106752b
fixes it.

kernel 3.14 boots fine

On 02/10/14 22:23, Fabio M. Di Nitto wrote:
> HI all,
> 
> another user has reported a similar issue. Anything that I can help with
> to have a fix?
> 
> Thanks
> Fabio
> 
> On 11/19/2013 01:21 PM, Fabio M. Di Nitto wrote:
>> Hi all,
>>
>> yes I know.. vintage hardware :) It´s running Debian unstable updated to
>> approx a week ago (yay for tons of rebuilds to bisect ;)).
>>
>> fabbione@sunfire:~/linux-2.6$ gcc --version
>> gcc (Debian 4.6.4-5) 4.6.4
>>
>> kernel config is from debian, applied to the various bisect done on
>> Linus tree.
>>
>> sunfire:~# cat /proc/cpuinfo
>> cpu             : UltraSparc T1 (Niagara)
>> fpu             : UltraSparc T1 integrated FPU
>> pmu             : niagara
>> prom            : OBP 4.28.9 2008/06/30 21:26
>> type            : sun4v
>> ncpus probed    : 24
>> ..
>>
>> commit 87fa05aeb3a5e8e21b1a5510eef6983650eff092
>> Author: Sam Ravnborg <sam@ravnborg.org>
>> Date:   Thu Apr 11 21:38:50 2013 +0200
>>
>>     sparc: Use generic idle loop
>>
>>     Add generic cpu_idle support
>>
>> appears to be culprit and I can´t boot any kernel after that. Reverting
>> the commit makes the machine boot again (tested up to commit 4fbf888acc
>> from Linus tree).
>>
>> Sorry I really don´t know that part of the code well enough to propose a
>> fix, but I can either provide access to the machine or test debugging
>> patches.
>>
>> Thanks
>> Fabio
>>
>> Few extra info:
>>
>> with this commit applied (head to the commit):
>>
>> [SNIP]
>> [   68.089495] scsi0 : Emulex LPe1150 PCIe Fibre Channel Adapter  on PCI
>> bus 08 device 00 irq 23
>> [   68.089495] scsi0 : Emulex LPe1150 PCIe Fibre Channel Adapter  on PCI
>> bus 08 device 00 irq 23
>> [   89.429041] INFO: rcu_sched detected stalls on CPUs/tasks:[
>> 89.429041] INFO: rcu_sched detected stalls on CPUs/tasks: { { 3 3 10 10
>> 15 15} } (detected by 0, tS42 jiffies, g\x18446744073709551328,
>> c\x18446744073709551327, q!64)
>> (detected by 0, tS42 jiffies, g\x18446744073709551328,
>> c\x18446744073709551327, q!64)
>> [   89.431437] * CPU[  0]: TSTATE[0000000080001603]
>> TPC[000000000042b9d4] TNPC[000000000042b9d8] TASK[swapper/0:0]
>> [   89.431437] * CPU[  0]: TSTATE[0000000080001603]
>> TPC[000000000042b9d4] TNPC[000000000042b9d8] TASK[swapper/0:0]
>> [   89.431462] BUG: soft lockup - CPU#3 stuck for 23s! [swapper/3:0]
>> [   89.431462] BUG: soft lockup - CPU#3 stuck for 23s! [swapper/3:0]
>> [   89.431470] BUG: soft lockup - CPU#1 stuck for 23s! [swapper/1:0]
>> [   89.431470] BUG: soft lockup - CPU#1 stuck for 23s! [swapper/1:0]
>> [   89.431479] BUG: soft lockup - CPU#2 stuck for 23s! [swapper/2:0]
>> [   89.431479] BUG: soft lockup - CPU#2 stuck for 23s! [swapper/2:0]
>> [   89.431484] Modules linked in:
>> [SNIP]
>>
>> Most kernels in between that commit to 3.12:
>>
>> [SNIP]
>> [   68.392210] UDP hash table entries: 4096 (order: 4, 131072 bytes)
>> [   68.393319] UDP-Lite hash table entries: 4096 (order: 4, 131072 bytes)
>> [   68.395425] NET: Registered protocol family 1
>> [   68.396008] pci 0001:05:02.0: Activating ISA DMA hang workarounds
>> -- no more output --
>>
>> With Linus tree up to commit 4fbf888accb39 (didn´t test yet newer trees):
>>
>> [   60.770765] scsi2 : Emulex LPe1150 PCIe Fibre Channel Adapter  on PCI
>> bus 08 device 00 irq 23
>>  [   81.923914] INFO: rcu_sched detected stalls on CPUs/tasks: {}
>> (detected by 4, tR92 jiffies, g\x18446744073709551336,
>> c\x18446744073709551335,5)
>> [   81.924215] INFO: Stall ended before state dump start
>> [  145.419184] INFO: rcu_sched detected stalls on CPUs/tasks: {}
>> (detected by 0, t!166 jiffies, g\x18446744073709551336,
>> c\x18446744073709551335,7)
>> [  145.419482] INFO: Stall ended before state dump start
>>
>>
> 
> --
> To unsubscribe from this list: send the line "unsubscribe sparclinux" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
> 


^ permalink raw reply	[flat|nested] 5+ messages in thread

end of thread, other threads:[~2014-04-04 11:50 UTC | newest]

Thread overview: 5+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2013-11-19 12:21 Issue booting Niagara T1 after commit 87fa05aeb3 Fabio M. Di Nitto
2013-12-12 12:33 ` Raúl Porcel
2013-12-13 11:46 ` Raúl Porcel
2014-02-10 21:23 ` [resend] " Fabio M. Di Nitto
2014-04-04 11:50 ` Raúl Porcel

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.