All of lore.kernel.org
 help / color / mirror / Atom feed
* Re: funny kernel death with ksoftirqd_CPUX taking up almost 100% of cpu?
  2002-07-10 16:36 funny kernel death with ksoftirqd_CPUX taking up almost 100% of cpu? Kevin B. Hendricks
@ 2002-07-10 16:11 ` Benjamin Herrenschmidt
  2002-07-10 16:24 ` Benjamin Herrenschmidt
  2002-07-10 17:20 ` Kevin B. Hendricks
  2 siblings, 0 replies; 5+ messages in thread
From: Benjamin Herrenschmidt @ 2002-07-10 16:11 UTC (permalink / raw)
  To: Kevin B. Hendricks, yellowdog-devel, linuxppc-dev; +Cc: paulus


>I just experienced an alarming form of kernel death running a self compiled
>SMP kernel with HIGHMEM enabled on my dual G4 -1gig machine.
>
>The kernel tree used is Ben's 2.4.19-pre10 one rebuilt for SMP support, aec
>IDE driver and otherwise basically stock.
>
>I was debugging in gdb a large program and noticed typing got slower and
>slower.  I quick check of top showed that ksoftirqd_CPU was taking up
>almost 100% of the cpu.  I exited out of gdb and killed every process I
>could think of but the usage of that kernel demaon stayed at near 100%.
>
>It became so bad I could barely perform a straight shutdown (I had to hit
>return numerous times to allow the other cpu to get some time to handle
>the shutdown.
>
>There were lots of messages like the following as I tried to shutdown:
>
> ../..
>
>Jul 10 11:57:36 localhost kernel: wait_on_irq, CPU 0:
>Jul 10 11:57:36 localhost kernel: irq:  -1 [0 0]
>Jul 10 11:57:36 localhost kernel: bh:   0 [0 0]
>Jul 10 11:57:37 localhost kernel:

Hrm... looks bad. global_irq_count got negative !

So either somebody is doing a mismatched hardirq_enter/leave
pair, which I seriously doubt, or our atomics are broken on
those machines (ugh !!!)

Paul, any good idea at hand ?

Ben.


** Sent via the linuxppc-dev mail list. See http://lists.linuxppc.org/

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: funny kernel death with ksoftirqd_CPUX taking up almost 100% of cpu?
  2002-07-10 16:36 funny kernel death with ksoftirqd_CPUX taking up almost 100% of cpu? Kevin B. Hendricks
  2002-07-10 16:11 ` Benjamin Herrenschmidt
@ 2002-07-10 16:24 ` Benjamin Herrenschmidt
  2002-07-10 17:18   ` Kevin B. Hendricks
  2002-07-10 17:20 ` Kevin B. Hendricks
  2 siblings, 1 reply; 5+ messages in thread
From: Benjamin Herrenschmidt @ 2002-07-10 16:24 UTC (permalink / raw)
  To: Kevin B. Hendricks, yellowdog-devel, linuxppc-dev


>
>I just experienced an alarming form of kernel death running a self compiled
>SMP kernel with HIGHMEM enabled on my dual G4 -1gig machine.
>
>The kernel tree used is Ben's 2.4.19-pre10 one rebuilt for SMP support, aec
>IDE driver and otherwise basically stock.
>
>I was debugging in gdb a large program and noticed typing got slower and
>slower.  I quick check of top showed that ksoftirqd_CPU was taking up
>almost 100% of the cpu.  I exited out of gdb and killed every process I
>could think of but the usage of that kernel demaon stayed at near 100%.

BTW. It would be interesting to know if you can ever reproduce it...

Ben.


** Sent via the linuxppc-dev mail list. See http://lists.linuxppc.org/

^ permalink raw reply	[flat|nested] 5+ messages in thread

* funny kernel death with ksoftirqd_CPUX taking up almost 100% of cpu?
@ 2002-07-10 16:36 Kevin B. Hendricks
  2002-07-10 16:11 ` Benjamin Herrenschmidt
                   ` (2 more replies)
  0 siblings, 3 replies; 5+ messages in thread
From: Kevin B. Hendricks @ 2002-07-10 16:36 UTC (permalink / raw)
  To: yellowdog-devel, linuxppc-dev; +Cc: Benjamin Herrenschmidt


Hi,

I just experienced an alarming form of kernel death running a self compiled
SMP kernel with HIGHMEM enabled on my dual G4 -1gig machine.

The kernel tree used is Ben's 2.4.19-pre10 one rebuilt for SMP support, aec
IDE driver and otherwise basically stock.

I was debugging in gdb a large program and noticed typing got slower and
slower.  I quick check of top showed that ksoftirqd_CPU was taking up
almost 100% of the cpu.  I exited out of gdb and killed every process I
could think of but the usage of that kernel demaon stayed at near 100%.

It became so bad I could barely perform a straight shutdown (I had to hit
return numerous times to allow the other cpu to get some time to handle
the shutdown.

There were lots of messages like the following as I tried to shutdown:

Jul 10 11:56:24 localhost kernel: Serial port locked ON by debugger !
Jul 10 11:56:24 localhost kernel:
Jul 10 11:56:24 localhost kernel: wait_on_irq, CPU 0:
Jul 10 11:56:24 localhost kernel: irq:  -1 [0 0]
Jul 10 11:56:24 localhost kernel: bh:   0 [0 0]
Jul 10 11:57:13 localhost kernel:
Jul 10 11:57:13 localhost kernel: wait_on_irq, CPU 0:
Jul 10 11:57:13 localhost kernel: irq:  -1 [0 0]
Jul 10 11:57:13 localhost kernel: bh:   0 [0 0]
Jul 10 11:57:31 localhost kernel:
Jul 10 11:57:31 localhost kernel: wait_on_irq, CPU 0:
Jul 10 11:57:31 localhost kernel: irq:  -1 [0 0]
Jul 10 11:57:31 localhost kernel: bh:   0 [0 0]
Jul 10 11:57:36 localhost kernel:
Jul 10 11:57:36 localhost kernel: wait_on_irq, CPU 0:
Jul 10 11:57:36 localhost kernel: irq:  -1 [0 0]
Jul 10 11:57:36 localhost kernel: bh:   0 [0 0]
Jul 10 11:57:37 localhost kernel:
...

But I was able to finally get it to shutdown properly and it seems to have
rebooted fine.

Anyone with any ideas of what just hit me?  Anything I should do next time
it happens (the were no relevant message in the /var/log/messages/ that
might indicate a problem excpet for the lines I quoted above.

Any help would be greatly appreciated.

Thanks,

Kevin


** Sent via the linuxppc-dev mail list. See http://lists.linuxppc.org/

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: funny kernel death with ksoftirqd_CPUX taking up almost 100% of cpu?
  2002-07-10 16:24 ` Benjamin Herrenschmidt
@ 2002-07-10 17:18   ` Kevin B. Hendricks
  0 siblings, 0 replies; 5+ messages in thread
From: Kevin B. Hendricks @ 2002-07-10 17:18 UTC (permalink / raw)
  To: Benjamin Herrenschmidt, yellowdog-devel, linuxppc-dev


Well,

I still have to debug that damn OOo 1.0.1 bridge problem so I have to go
back into OOo and gdb (which uses alot of memory) and try again.

If I run into it again, I will let you know.

If so, is there anything debug wise I can do that would help pinpoint the
problem.

Thanks,

Kevin

On July 10, 2002 12:24, Benjamin Herrenschmidt wrote:
> >I just experienced an alarming form of kernel death running a self
> > compiled SMP kernel with HIGHMEM enabled on my dual G4 -1gig machine.
> >
> >The kernel tree used is Ben's 2.4.19-pre10 one rebuilt for SMP support,
> > aec IDE driver and otherwise basically stock.
> >
> >I was debugging in gdb a large program and noticed typing got slower
> > and slower.  I quick check of top showed that ksoftirqd_CPU was taking
> > up almost 100% of the cpu.  I exited out of gdb and killed every
> > process I could think of but the usage of that kernel demaon stayed at
> > near 100%.
>
> BTW. It would be interesting to know if you can ever reproduce it...
>
> Ben.


** Sent via the linuxppc-dev mail list. See http://lists.linuxppc.org/

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: funny kernel death with ksoftirqd_CPUX taking up almost 100% of cpu?
  2002-07-10 16:36 funny kernel death with ksoftirqd_CPUX taking up almost 100% of cpu? Kevin B. Hendricks
  2002-07-10 16:11 ` Benjamin Herrenschmidt
  2002-07-10 16:24 ` Benjamin Herrenschmidt
@ 2002-07-10 17:20 ` Kevin B. Hendricks
  2 siblings, 0 replies; 5+ messages in thread
From: Kevin B. Hendricks @ 2002-07-10 17:20 UTC (permalink / raw)
  To: yellowdog-devel, linuxppc-dev; +Cc: Benjamin Herrenschmidt


Hi,

If it matters at all the kernel was compiled with Franz's gcc 3.1-1a rpm
from his site on PenguinPPC.org

I have been tracking a problem with how structures that are inheritied
seemed to be aligned differntly then under gcc 2.95.4.

Kevin

On July 10, 2002 12:36, Kevin B. Hendricks wrote:
> Hi,
>
> I just experienced an alarming form of kernel death running a self
> compiled SMP kernel with HIGHMEM enabled on my dual G4 -1gig machine.
>
> The kernel tree used is Ben's 2.4.19-pre10 one rebuilt for SMP support,
> aec IDE driver and otherwise basically stock.
>
> I was debugging in gdb a large program and noticed typing got slower and
> slower.  I quick check of top showed that ksoftirqd_CPU was taking up
> almost 100% of the cpu.  I exited out of gdb and killed every process I
> could think of but the usage of that kernel demaon stayed at near 100%.
>
> It became so bad I could barely perform a straight shutdown (I had to
> hit return numerous times to allow the other cpu to get some time to
> handle the shutdown.
>
> There were lots of messages like the following as I tried to shutdown:
>
> Jul 10 11:56:24 localhost kernel: Serial port locked ON by debugger !
> Jul 10 11:56:24 localhost kernel:
> Jul 10 11:56:24 localhost kernel: wait_on_irq, CPU 0:
> Jul 10 11:56:24 localhost kernel: irq:  -1 [0 0]
> Jul 10 11:56:24 localhost kernel: bh:   0 [0 0]
> Jul 10 11:57:13 localhost kernel:
> Jul 10 11:57:13 localhost kernel: wait_on_irq, CPU 0:
> Jul 10 11:57:13 localhost kernel: irq:  -1 [0 0]
> Jul 10 11:57:13 localhost kernel: bh:   0 [0 0]
> Jul 10 11:57:31 localhost kernel:
> Jul 10 11:57:31 localhost kernel: wait_on_irq, CPU 0:
> Jul 10 11:57:31 localhost kernel: irq:  -1 [0 0]
> Jul 10 11:57:31 localhost kernel: bh:   0 [0 0]
> Jul 10 11:57:36 localhost kernel:
> Jul 10 11:57:36 localhost kernel: wait_on_irq, CPU 0:
> Jul 10 11:57:36 localhost kernel: irq:  -1 [0 0]
> Jul 10 11:57:36 localhost kernel: bh:   0 [0 0]
> Jul 10 11:57:37 localhost kernel:
> ...
>
> But I was able to finally get it to shutdown properly and it seems to
> have rebooted fine.
>
> Anyone with any ideas of what just hit me?  Anything I should do next
> time it happens (the were no relevant message in the /var/log/messages/
> that might indicate a problem excpet for the lines I quoted above.
>
> Any help would be greatly appreciated.
>
> Thanks,
>
> Kevin
>
>
> _______________________________________________
> yellowdog-devel mailing list
> yellowdog-devel@lists.terrasoftsolutions.com
> http://lists.terrasoftsolutions.com/mailman/listinfo/yellowdog-devel


** Sent via the linuxppc-dev mail list. See http://lists.linuxppc.org/

^ permalink raw reply	[flat|nested] 5+ messages in thread

end of thread, other threads:[~2002-07-10 17:20 UTC | newest]

Thread overview: 5+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2002-07-10 16:36 funny kernel death with ksoftirqd_CPUX taking up almost 100% of cpu? Kevin B. Hendricks
2002-07-10 16:11 ` Benjamin Herrenschmidt
2002-07-10 16:24 ` Benjamin Herrenschmidt
2002-07-10 17:18   ` Kevin B. Hendricks
2002-07-10 17:20 ` Kevin B. Hendricks

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.