linux-kernel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* Tulip (DFE-570tx) & keyboard lockup in 2.6.9 and other 2.6 kernels.
@ 2005-02-23 21:08 Ben Greear
  2005-02-25  0:16 ` Ben Greear
  0 siblings, 1 reply; 2+ messages in thread
From: Ben Greear @ 2005-02-23 21:08 UTC (permalink / raw)
  To: netdev; +Cc: linux-kernel

I finally had some time to debug this one a little more
thoroughly.  On two different machines (Shuttle SB61G1) I
get the same results, so I do not believe it is bad hardware...

The bug is as follows:

I have 1 4-port tulip NIC in the machine.  If I generate traffic
between two interfaces, it runs fine.  But, if I start running traffic
on all 4 interfaces, the keyboard quits taking input, and ethernet
traffic stops on at least a few of the interfaces.  I can still ssh
into the machine (via the rtl8139 interface), so at least one of the
processors (I'm using SMP on an P4 HT processor) is working.  I also
enabled NMI and that does not trigger.

I ran the tulip-diag tool, and the third interface seems to be in a bad way:

[root@lf61g2-blk root]# tulip-diag
tulip-diag.c:v2.07 3/31/2001 Donald Becker (becker@scyld.com)
  http://www.scyld.com/diag/index.html
Index #1: Found a Digital DS21143 Tulip adapter at 0xc000.
  Port selection is MII, full-duplex.
  Transmit started, Receive started, full-duplex.
   The Rx process state is 'Waiting for packets'.
   The Tx process state is 'Idle'.
   The transmit threshold is 256.
   The NWay status register is 000000c6.
   Internal autonegotiation state is 'Autonegotiation disabled'.
Index #2: Found a Digital DS21143 Tulip adapter at 0xc100.
  Port selection is MII, full-duplex.
  Transmit started, Receive started, full-duplex.
   The Rx process state is 'Waiting for packets'.
   The Tx process state is 'Idle'.
   The transmit threshold is 256.
   The NWay status register is 000000c6.
   Internal autonegotiation state is 'Autonegotiation disabled'.
Index #3: Found a Digital DS21143 Tulip adapter at 0xc200.
  Port selection is MII, full-duplex.
  Transmit started, Receive started, full-duplex.
   The Rx process state is 'Suspended -- no Rx buffers'.
   The Tx process state is 'Idle'.
   The transmit threshold is 128.
   The NWay status register is 000000c6.
   Internal autonegotiation state is 'Autonegotiation disabled'.
Index #4: Found a Digital DS21143 Tulip adapter at 0xc300.
  Port selection is MII, full-duplex.
  Transmit started, Receive started, full-duplex.
   The Rx process state is 'Waiting for packets'.
   The Tx process state is 'Idle'.
   The transmit threshold is 128.
   The NWay status register is 000000c6.
   Internal autonegotiation state is 'Autonegotiation disabled'.


The interrupt count for eth3 is not increasing.  I tried to bring
down the interface with 'ifconfig eth3 down' and that command hung
as well.

I tried getting sysrq to print out the stack for ifconfig, but
I'm not sure it worked:

ifconfig      R running  6128  3692   3631                     (NOTLB)
sshd          S D9717E6C  6488  3693   2193  3695          3629 (NOTLB)
d9717eac 00000082 d9721954 d9717e6c 00000000 c0356580 c0335d00 c0335c80
        00000286 b04a5e80 000f44f5 00000019 de056890 00000008 00000000 c1402980
        c1402020 00000001 00000000 b04a5e80 000f44f5 de056df0 de056f58 00000001
Call Trace:
  [<c02e328b>] schedule_timeout+0xc3/0xc5
  [<c01ffd19>] normal_poll+0x0/0x134
  [<c01f9a61>] tty_ldisc_deref+0x63/0x7d
  [<c01fbe86>] tty_poll+0x90/0xb2
  [<c016ba1e>] do_select+0x193/0x2b9
  [<c016b6f1>] __pollwait+0x0/0xc1
  [<c016bdf4>] sys_select+0x29b/0x50c
  [<c0105f91>] sysenter_past_esp+0x52/0x71
bash          R running  6620  3695   3693                     (NOTLB)


I tried this same hardware running RH9 with the 2.4.27 kernel and
it works like a charm.

This is easily repeatable, so please let me know what information I
can offer to help debug this.

Thanks,
Ben


-- 
Ben Greear <greearb@candelatech.com>
Candela Technologies Inc  http://www.candelatech.com


^ permalink raw reply	[flat|nested] 2+ messages in thread

* Re: Tulip (DFE-570tx) & keyboard lockup in 2.6.9 and other 2.6 kernels.
  2005-02-23 21:08 Tulip (DFE-570tx) & keyboard lockup in 2.6.9 and other 2.6 kernels Ben Greear
@ 2005-02-25  0:16 ` Ben Greear
  0 siblings, 0 replies; 2+ messages in thread
From: Ben Greear @ 2005-02-25  0:16 UTC (permalink / raw)
  To: netdev; +Cc: linux-kernel

Ben Greear wrote:
> I finally had some time to debug this one a little more
> thoroughly.  On two different machines (Shuttle SB61G1) I
> get the same results, so I do not believe it is bad hardware...
> 
> The bug is as follows:
> 
> I have 1 4-port tulip NIC in the machine.  If I generate traffic
> between two interfaces, it runs fine.  But, if I start running traffic
> on all 4 interfaces, the keyboard quits taking input, and ethernet
> traffic stops on at least a few of the interfaces.  I can still ssh
> into the machine (via the rtl8139 interface), so at least one of the
> processors (I'm using SMP on an P4 HT processor) is working.  I also
> enabled NMI and that does not trigger.

This was my bug.  I was holding a lock that was required for receiving a packet
while calling the hard_start_xmit method.  When an IRQ happened while I was in
the hard_start_xmit method, the IRQ could not grab the lock, and just sat there
spinning...

Thanks,
Ben

-- 
Ben Greear <greearb@candelatech.com>
Candela Technologies Inc  http://www.candelatech.com


^ permalink raw reply	[flat|nested] 2+ messages in thread

end of thread, other threads:[~2005-02-25  0:22 UTC | newest]

Thread overview: 2+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2005-02-23 21:08 Tulip (DFE-570tx) & keyboard lockup in 2.6.9 and other 2.6 kernels Ben Greear
2005-02-25  0:16 ` Ben Greear

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).