linux-kernel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* tg3 stops working when NFS is involved
@ 2009-06-08  4:41 Christian Kujau
  2009-06-08  5:18 ` Christian Kujau
  0 siblings, 1 reply; 4+ messages in thread
From: Christian Kujau @ 2009-06-08  4:41 UTC (permalink / raw)
  To: LKML; +Cc: netdev, mcarlson

Hi there,

first off: I know there are quite a few reports on the net on a similar 
topic, but they just don't match this particular scenario, so here it 
goes:

I have this Lenovo Ideapad S10 "netbook" with a buitin BCM5906M 10/100Mbps 
NIC. On this box nfs-kernel-server is exporting (ro) a directory to 
another linux client (really, only one). After a while (measured in bytes: after a 
few megabytes..up to a few gigabytes of traffic) the server goes away: not 
just the NFS server but the network card just stops working: I cannot ping 
the server any more and I have to go over to the netbook and reload the 
tg3 module. Before doing this I can verify that the netbook is unable to ping 
anything else - so, I figure it's not a client problem. And I'm not sure 
if it's NFS problem either, because restarting the NFS server doesn't do 
anything, I really have to rmmod/modprobe the tg3 module. OTOH, I cannot 
reproduce this without NFS: running e.g. iperf (TCP, UDP) did not trigger 
it.

As to the network load involved: the client is able to receive a bit over 
2MB/s at best (client---wlan---wrt54---netbook), so the server is not busy 
at all.

I've noticed this behaviour with 2.6.27-14-generic (ubuntu/9.04 kernel) 
but running the latest -git (vanilla from kernel.org) does not change 
anything.

I've started to modprobe the tg3 module with tg3_debug=0x7fffffff so that 
it might print out more debug messages but there are no messages printed 
when tg3 stops working. A few maybe interesting boot messages, below, 
please find more details at: http://nerdbynature.de/bits/2.6.30-rc8/

[    0.000000] ACPI: BIOS bug: multiple APIC/MADT found, using 0
[    0.184315] ACPI: EC: non-query interrupt received, switching to interrupt mode
[    1.756338] tg3.c:v3.98 (February 25, 2009)
[    1.756507] tg3 0000:02:00.0: PCI INT A -> GSI 16 (level, low) -> IRQ 16
[    1.756657] tg3 0000:02:00.0: setting latency timer to 64
[    1.790008] tg3 0000:02:00.0: PME# disabled
[    8.839227] tg3 0000:02:00.0: PME# disabled
[    8.839664] tg3 0000:02:00.0: irq 24 for MSI/MSI-X
[   10.572500] tg3: eth0: Link is up at 100 Mbps, full duplex.
[   10.572589] tg3: eth0: Flow control is on for TX and on for RX.
[   51.892143] CE: hpet increasing min_delta_ns to 15000 nsec
[  738.011394] ACPI: EC: GPE storm detected, transactions will use polling mode
[  768.656172] ACPI: EC: missing confirmations, switch off interrupt mode.
[ 4757.984170] tg3 0000:02:00.0: PCI INT A disabled


If anyone has an idea how to debug this, I'm all ears.

Thank you,
Christian.
-- 
BOFH excuse #384:

it's an ID-10-T error

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: tg3 stops working when NFS is involved
  2009-06-08  4:41 tg3 stops working when NFS is involved Christian Kujau
@ 2009-06-08  5:18 ` Christian Kujau
  2009-06-08 16:40   ` Matt Carlson
  0 siblings, 1 reply; 4+ messages in thread
From: Christian Kujau @ 2009-06-08  5:18 UTC (permalink / raw)
  To: LKML; +Cc: netdev, mcarlson

On Sun, 7 Jun 2009, Christian Kujau wrote:
> anything else - so, I figure it's not a client problem. And I'm not sure 
> if it's NFS problem either, because restarting the NFS server doesn't do 
> anything, I really have to rmmod/modprobe the tg3 module. OTOH, I cannot 
> reproduce this without NFS: running e.g. iperf (TCP, UDP) did not trigger 
> it.

Just tested with Samba - after a few megabytes of transmitted data, tg3 stops 
working, with no messages printed in the kernel log :-\

I don't know why iperf does not trigger it though....

Christian.
-- 
BOFH excuse #271:

The kernel license has expired

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: tg3 stops working when NFS is involved
  2009-06-08  5:18 ` Christian Kujau
@ 2009-06-08 16:40   ` Matt Carlson
  2009-06-08 17:21     ` Christian Kujau
  0 siblings, 1 reply; 4+ messages in thread
From: Matt Carlson @ 2009-06-08 16:40 UTC (permalink / raw)
  To: Christian Kujau; +Cc: LKML, netdev, Matthew Carlson

On Sun, Jun 07, 2009 at 10:18:52PM -0700, Christian Kujau wrote:
> On Sun, 7 Jun 2009, Christian Kujau wrote:
> > anything else - so, I figure it's not a client problem. And I'm not sure 
> > if it's NFS problem either, because restarting the NFS server doesn't do 
> > anything, I really have to rmmod/modprobe the tg3 module. OTOH, I cannot 
> > reproduce this without NFS: running e.g. iperf (TCP, UDP) did not trigger 
> > it.
> 
> Just tested with Samba - after a few megabytes of transmitted data, tg3 stops 
> working, with no messages printed in the kernel log :-\
> 
> I don't know why iperf does not trigger it though....

Hi Christian.  This seems to be a common problem with the 5906 on some
notebooks.  The word on the street is that turning sg off through
ethtool seems to avoid the problem.  While I dig deeper into this
problem, you could try that and see if it helps you.


^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: tg3 stops working when NFS is involved
  2009-06-08 16:40   ` Matt Carlson
@ 2009-06-08 17:21     ` Christian Kujau
  0 siblings, 0 replies; 4+ messages in thread
From: Christian Kujau @ 2009-06-08 17:21 UTC (permalink / raw)
  To: Matt Carlson; +Cc: LKML, netdev

On Mon, 8 Jun 2009, Matt Carlson wrote:
> This seems to be a common problem with the 5906 on some
> notebooks.  The word on the street is that turning sg off through
> ethtool seems to avoid the problem.

Well, I must've been on the wrong streets then, this was news to me...

> While I dig deeper into this
> problem, you could try that and see if it helps you.

Yes, "ethtool -K eth0 sg off" does indeed help here too: I've already 
transmitted a few gigabytes of traffic (the problem occured after a few 
megabytes lastly) and all is still well.

Thanks for your quick response!
Christian.
-- 
BOFH excuse #273:

The cord jumped over and hit the power switch.

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2009-06-08 17:23 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2009-06-08  4:41 tg3 stops working when NFS is involved Christian Kujau
2009-06-08  5:18 ` Christian Kujau
2009-06-08 16:40   ` Matt Carlson
2009-06-08 17:21     ` Christian Kujau

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).