On Mon, 21 Jul 2003 16:12:26 +0200 Udo A. Steinberg (UAS) wrote: UAS> We have a Dual-Xeon machine with Hyperthreading which keeps locking up hard, UAS> so that not even Sysrq works anymore. I have captured such a lockup using the UAS> NMI oopser. Below you'll find the lockup fed through ksymoops. Note that UAS> after CPU3 locked up, CPU2 did too. But that lockup couldn't be captured UAS> anymore. Kernel is a monolithic 2.4.22-pre6. Problem also happened on UAS> plain 2.4.21. I can provide more information wrt. hardware, config etc. UAS> on request. Sorry, I used the wrong System.map. Below is the fixed decode. Looks like the lockup is caused by the 3rd party Compushack FDDI driver. Regards, -Udo. ksymoops 2.4.9 on i686 2.4.22-pre6. Options used -V (default) -K (specified) -l /proc/modules (default) -o /lib/modules/2.4.22-pre6/ (default) -m /boot/System.map-2.4.22 (specified) No modules in ksyms, skipping objects No ksyms, skipping lsmod NMI Watchdog detected LOCKUP on CPU3, eip c01f8364, registers: CPU: 3 EIP: 0010:[] Not tainted Using defaults from ksymoops -t elf32-i386 -a i386 EFLAGS: 00000082 eax: 00000006 ebx: 00000202 ecx: f7ee3400 edx: c01f5d20 esi: f7ee3400 edi: 00000180 ebp: 00000003 esp: f7efff64 ds: 0018 es: 0018 ss: 0018 Process ksoftirqd_CPU3 (pid: 6, stackpage=f7eff000) Stack: f7ee3428 c02d03e6 c0105cfa f6dae000 f7e4ac80 f7efff88 f7efff88 c011f8da f7ee3400 f7ee34ac f7ee34ac c0393434 00000000 c0122cfd c02ebe1c c011f7f5 c011f6a3 00000009 00000001 c0367a00 fffffffe c011f456 c0367a00 00000246 Call Trace: [] [] [] [] [] [] [] [] [] [] Code: 7e f5 e9 e8 d9 ff ff 80 3d 40 be 2e c0 00 f3 90 7e f5 e9 db >>EIP; c01f8364 <.text.lock.csfddi+39/55> <===== >>edx; c01f5d20 Trace; c0105cfa <__switch_to+ca/d0> Trace; c011f8da <__run_task_queue+6a/80> Trace; c0122cfd Trace; c011f7f5 Trace; c011f6a3 Trace; c011f456 Trace; c011f9b5 Trace; c0105000 <_stext+0/0> Trace; c01058ee Trace; c011f8f0 Code; c01f8364 <.text.lock.csfddi+39/55> 00000000 <_EIP>: Code; c01f8364 <.text.lock.csfddi+39/55> <===== 0: 7e f5 jle fffffff7 <_EIP+0xfffffff7> <===== Code; c01f8366 <.text.lock.csfddi+3b/55> 2: e9 e8 d9 ff ff jmp ffffd9ef <_EIP+0xffffd9ef> Code; c01f836b <.text.lock.csfddi+40/55> 7: 80 3d 40 be 2e c0 00 cmpb $0x0,0xc02ebe40 Code; c01f8372 <.text.lock.csfddi+47/55> e: f3 90 repz nop Code; c01f8374 <.text.lock.csfddi+49/55> 10: 7e f5 jle 7 <_EIP+0x7> Code; c01f8376 <.text.lock.csfddi+4b/55> 12: e9 db 00 00 00 jmp f2 <_EIP+0xf2> NMI Watchdog detected LOCKUP on CPU2, eip c01062cd, registers: