* tg3 adapter losing link - PM related?
@ 2013-05-03 9:28 Nikola Ciprich
2013-05-03 15:13 ` Nithin Nayak Sujir
0 siblings, 1 reply; 6+ messages in thread
From: Nikola Ciprich @ 2013-05-03 9:28 UTC
To: netdev
Hello,
I'd like to ask about a problem I have with a new HP server: the tg3 adapter keeps losing
link every few minutes:
Aug 3 03:58:02 atlovav1a kernel: [616741.147598] tg3 0000:03:00.0: eth0: Link is down
Aug 3 03:58:04 atlovav1a kernel: [616743.943456] tg3 0000:03:00.0: eth0: Link is up at 100 Mbps, full duplex
Aug 3 03:58:04 atlovav1a kernel: [616743.943598] tg3 0000:03:00.0: eth0: Flow control is on for TX and on for RX
Aug 3 03:58:04 atlovav1a kernel: [616743.943736] tg3 0000:03:00.0: eth0: EEE is enabled
Aug 3 04:14:29 atlovav1a kernel: [617727.980487] tg3 0000:03:00.0: eth0: Link is down
Aug 3 04:14:32 atlovav1a kernel: [617730.847245] tg3 0000:03:00.0: eth0: Link is up at 100 Mbps, full duplex
Aug 3 04:14:32 atlovav1a kernel: [617730.847387] tg3 0000:03:00.0: eth0: Flow control is on for TX and on for RX
Aug 3 04:14:32 atlovav1a kernel: [617730.847525] tg3 0000:03:00.0: eth0: EEE is enabled
Aug 3 06:47:13 atlovav1a kernel: [626885.452974] tg3 0000:03:00.0: eth0: Link is down
Aug 3 06:47:15 atlovav1a kernel: [626888.218702] tg3 0000:03:00.0: eth0: Link is up at 100 Mbps, full duplex
Aug 3 06:47:15 atlovav1a kernel: [626888.218844] tg3 0000:03:00.0: eth0: Flow control is on for TX and on for RX
Aug 3 06:47:15 atlovav1a kernel: [626888.218982] tg3 0000:03:00.0: eth0: EEE is enabled
Aug 3 06:51:44 atlovav1a kernel: [627156.293386] tg3 0000:03:00.0: eth0: Link is down
Aug 3 06:51:46 atlovav1a kernel: [627159.123347] tg3 0000:03:00.0: eth0: Link is up at 100 Mbps, full duplex
Aug 3 06:51:46 atlovav1a kernel: [627159.123491] tg3 0000:03:00.0: eth0: Flow control is on for TX and on for RX
Aug 3 06:51:46 atlovav1a kernel: [627159.123629] tg3 0000:03:00.0: eth0: EEE is enabled
Aug 3 07:13:10 atlovav1a kernel: [628441.722197] tg3 0000:03:00.0: eth0: Link is down
Aug 3 07:13:13 atlovav1a kernel: [628444.615548] tg3 0000:03:00.0: eth0: Link is up at 100 Mbps, full duplex
Aug 3 07:13:13 atlovav1a kernel: [628444.615690] tg3 0000:03:00.0: eth0: Flow control is on for TX and on for RX
Aug 3 07:13:13 atlovav1a kernel: [628444.615828] tg3 0000:03:00.0: eth0: EEE is enabled
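To put numbers on the flapping, the log above can be reduced to one record per outage. The following is only a minimal sketch: the names `LINK_RE`, `link_flaps` and `SAMPLE_LOG` are made up for illustration, and the regular expression assumes the tg3 message format shown in this log.

```python
import re

# Kernel uptime timestamp plus the tg3 link message, e.g.
# "[616741.147598] tg3 0000:03:00.0: eth0: Link is down"
LINK_RE = re.compile(r"\[\s*(\d+\.\d+)\]\s+tg3 \S+ (\w+): Link is (down|up)")

def link_flaps(lines):
    """Return (iface, down_timestamp, outage_seconds) for each down/up pair."""
    flaps = []
    down_at = {}  # iface -> timestamp of the pending "Link is down"
    for line in lines:
        m = LINK_RE.search(line)
        if not m:
            continue  # e.g. the "Flow control" and "EEE is enabled" lines
        ts, iface, state = float(m.group(1)), m.group(2), m.group(3)
        if state == "down":
            down_at[iface] = ts
        elif iface in down_at:
            start = down_at.pop(iface)
            flaps.append((iface, start, ts - start))
    return flaps

# First two outages copied from the log above.
SAMPLE_LOG = """\
Aug 3 03:58:02 atlovav1a kernel: [616741.147598] tg3 0000:03:00.0: eth0: Link is down
Aug 3 03:58:04 atlovav1a kernel: [616743.943456] tg3 0000:03:00.0: eth0: Link is up at 100 Mbps, full duplex
Aug 3 04:14:29 atlovav1a kernel: [617727.980487] tg3 0000:03:00.0: eth0: Link is down
Aug 3 04:14:32 atlovav1a kernel: [617730.847245] tg3 0000:03:00.0: eth0: Link is up at 100 Mbps, full duplex
"""

for iface, start, outage in link_flaps(SAMPLE_LOG.splitlines()):
    print(f"{iface}: down at uptime {start:.3f}s, back after {outage:.2f}s")
```

Running the same function over the full /var/log/messages would show whether the outages are evenly spaced or irregular.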
I can't exclude the possibility that it's a switch problem, but I don't have access to the box
right now and the switch is unmanaged, so I'd like to try other approaches first.
Could this somehow be power management related? I don't see anything else PM-related that I could
disable in the BIOS to turn off this EEE stuff. Has anyone met a similar problem?
The system is running CentOS 6 with an x86_64 3.0.76 kernel.
Thanks in advance for any reply.
BR
nik
--
-------------------------------------
Ing. Nikola CIPRICH
LinuxBox.cz, s.r.o.
28.rijna 168, 709 00 Ostrava
tel.: +420 591 166 214
fax: +420 596 621 273
mobil: +420 777 093 799
www.linuxbox.cz
mobil servis: +420 737 238 656
email servis: servis@linuxbox.cz
-------------------------------------
* Re: tg3 adapter losing link - PM related?
2013-05-03 9:28 tg3 adapter losing link - PM related? Nikola Ciprich
@ 2013-05-03 15:13 ` Nithin Nayak Sujir
2013-05-03 18:39 ` Ben Hutchings
` (2 more replies)
0 siblings, 3 replies; 6+ messages in thread
From: Nithin Nayak Sujir @ 2013-05-03 15:13 UTC
To: Nikola Ciprich; +Cc: netdev
Hi Nikola,
1. What device is present on this server? Can you give the tg3 messages
in /var/log/messages? Can you also give the output of "ethtool -i <iface>"?
2. Is it possible for you to try the latest 3.9 upstream kernel?
3. Any reason why the link is at 100Mb? The switch does not support gig?
What switch is it?
4. I don't think you can turn off EEE in the bios, but you can try
turning autoneg off. Try "ethtool -s <iface> speed 100 duplex full
autoneg off". It's not exactly the same thing since the device still has
EEE enabled but not negotiated.
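For anyone scripting the workaround in point 4, here is a rough sketch that builds the same ethtool invocation and runs it only on request. The helper names `force_link_cmd` and `run_cmd` are hypothetical and `eth0` is just a placeholder. Executing the command needs root, and forcing the link on a remote machine can cut off your own access, so the default is a dry run.

```python
import subprocess

def force_link_cmd(iface, speed=100, duplex="full"):
    """Build the 'ethtool -s' argv that turns autoneg off and pins speed/duplex."""
    return ["ethtool", "-s", iface,
            "speed", str(speed), "duplex", duplex, "autoneg", "off"]

def run_cmd(cmd, dry_run=True):
    """Print the command; execute it only when dry_run is False (needs root)."""
    print(" ".join(cmd))
    if not dry_run:
        subprocess.run(cmd, check=True)

# Dry run: shows the command without touching the NIC.
run_cmd(force_link_cmd("eth0"))
```

Note that if the switch port is left autonegotiating while the NIC is forced, the switch side may fall back to half duplex, so it is worth checking the negotiated state on both ends afterwards.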
Nithin.
* Re: tg3 adapter losing link - PM related?
2013-05-03 15:13 ` Nithin Nayak Sujir
@ 2013-05-03 18:39 ` Ben Hutchings
2013-05-03 18:48 ` Nithin Nayak Sujir
2013-05-03 19:28 ` Nikola Ciprich
2013-05-04 6:45 ` Nikola Ciprich
2 siblings, 1 reply; 6+ messages in thread
From: Ben Hutchings @ 2013-05-03 18:39 UTC
To: Nithin Nayak Sujir; +Cc: Nikola Ciprich, netdev
On Fri, 2013-05-03 at 08:13 -0700, Nithin Nayak Sujir wrote:
> Hi Nikola,
> 1. What device is present on this server? Can you give the tg3 messages
> in /var/log/messages? Can you also give the output of "ethtool -i <iface>"?
>
> 2. Is it possible for you to try the latest 3.9 upstream kernel?
>
> 3. Any reason why the link is at 100Mb? The switch does not support gig?
> What switch is it?
>
> 4. I don't think you can turn off EEE in the bios, but you can try
> turning autoneg off. Try "ethtool -s <iface> speed 100 duplex full
> autoneg off". It's not exactly the same thing since the device still has
> EEE enabled but not negotiated.
[...]
It should be possible to disable EEE with ethtool. You really should
implement the EEE configuration operations in tg3 if you're going to
enable it at all.
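As an illustration of what that would look like from user space: ethtool has `--set-eee` and `--show-eee` options, but they only work when the driver implements the EEE operations, which is exactly what is missing here. The sketch below merely assembles the commands; `eee_cmds` is a made-up helper name and `eth0` a placeholder.

```python
def eee_cmds(iface, enable=False):
    """Return argv lists that set EEE on or off and then show the resulting state."""
    state = "on" if enable else "off"
    return [
        ["ethtool", "--set-eee", iface, "eee", state],
        ["ethtool", "--show-eee", iface],
    ]

# Print the commands rather than running them; on a driver without EEE
# support ethtool is expected to report the operation as unsupported.
for cmd in eee_cmds("eth0"):
    print(" ".join(cmd))
```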
Ben.
--
Ben Hutchings, Staff Engineer, Solarflare
Not speaking for my employer; that's the marketing department's job.
They asked us to note that Solarflare product names are trademarked.
* Re: tg3 adapter losing link - PM related?
2013-05-03 18:39 ` Ben Hutchings
@ 2013-05-03 18:48 ` Nithin Nayak Sujir
0 siblings, 0 replies; 6+ messages in thread
From: Nithin Nayak Sujir @ 2013-05-03 18:48 UTC
To: Ben Hutchings; +Cc: Nikola Ciprich, netdev
On 05/03/2013 11:39 AM, Ben Hutchings wrote:
> On Fri, 2013-05-03 at 08:13 -0700, Nithin Nayak Sujir wrote:
>> Hi Nikola,
>> 1. What device is present on this server? Can you give the tg3 messages
>> in /var/log/messages? Can you also give the output of "ethtool -i <iface>"?
>>
>> 2. Is it possible for you to try the latest 3.9 upstream kernel?
>>
>> 3. Any reason why the link is at 100Mb? The switch does not support gig?
>> What switch is it?
>>
>> 4. I don't think you can turn off EEE in the bios, but you can try
>> turning autoneg off. Try "ethtool -s <iface> speed 100 duplex full
>> autoneg off". It's not exactly the same thing since the device still has
>> EEE enabled but not negotiated.
> [...]
>
> It should be possible to disable EEE with ethtool. You really should
> implement the EEE configuration operations in tg3 if you're going to
> enable it at all.
>
I agree. I plan to have a patch when netdev opens.
> Ben.
>
* Re: tg3 adapter losing link - PM related?
2013-05-03 15:13 ` Nithin Nayak Sujir
2013-05-03 18:39 ` Ben Hutchings
@ 2013-05-03 19:28 ` Nikola Ciprich
2013-05-04 6:45 ` Nikola Ciprich
2 siblings, 0 replies; 6+ messages in thread
From: Nikola Ciprich @ 2013-05-03 19:28 UTC
To: Nithin Nayak Sujir; +Cc: netdev
Hello Nithin,
On Fri, May 03, 2013 at 08:13:15AM -0700, Nithin Nayak Sujir wrote:
> Hi Nikola,
> 1. What device is present on this server? Can you give the tg3 messages in
> /var/log/messages? Can you also give the output of "ethtool -i <iface>"?
Sure, here it is:
[ +0.273255] tg3.c:v3.119 (May 18, 2011)
[ +0.000184] tg3 0000:03:00.0: PCI INT A -> GSI 32 (level, low) -> IRQ 32
[ +0.000175] tg3 0000:03:00.0: setting latency timer to 64
[ +0.034616] tg3 0000:03:00.0: eth0: Tigon3 [partno(none) rev 5719001] (PCI Express) MAC address 2c:76:8a:52:a5:1c
[ +0.000255] tg3 0000:03:00.0: eth0: attached PHY is 5719C (10/100/1000Base-T Ethernet) (WireSpeed[1], EEE[1])
[ +0.000251] tg3 0000:03:00.0: eth0: RXcsums[1] LinkChgREG[0] MIirq[0] ASF[1] TSOcap[0]
[ +0.000251] tg3 0000:03:00.0: eth0: dma_rwctrl[00000001] dma_mask[64-bit]
[root@atlovav1a ~]# ethtool -i eth0
driver: tg3
version: 3.119
firmware-version: 5719-v1.29 NCSI v1.0.88.0
bus-info: 0000:03:00.0
supports-statistics: yes
supports-test: yes
supports-eeprom-access: yes
supports-register-dump: yes
supports-priv-flags: no
The box is an HP ProLiant DL360p; the device identifies itself in lspci as Broadcom Corporation NetXtreme BCM5719.
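When the same details have to be collected from several machines, the `ethtool -i` output above is easy to turn into a dictionary. This is only a sketch: `parse_ethtool_i` and `SAMPLE_INFO` are hypothetical names, and it assumes the "key: value" layout shown above.

```python
def parse_ethtool_i(text):
    """Parse 'ethtool -i' output into a dict, splitting each line at the first colon."""
    info = {}
    for line in text.splitlines():
        key, sep, value = line.partition(":")
        if sep:  # skip lines without a colon
            info[key.strip()] = value.strip()
    return info

# Excerpt of the output quoted above.
SAMPLE_INFO = """\
driver: tg3
version: 3.119
firmware-version: 5719-v1.29 NCSI v1.0.88.0
bus-info: 0000:03:00.0
supports-statistics: yes
"""

info = parse_ethtool_i(SAMPLE_INFO)
print(info["driver"], info["version"], info["bus-info"])
```

Splitting at the first colon only matters for values such as the PCI address, which contain colons themselves.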
>
> 2. Is it possible for you to try the latest 3.9 upstream kernel?
Of course, it's compiling now. I'll report how it behaves.
>
> 3. Any reason why the link is at 100Mb? The switch does not support gig?
> What switch is it?
It's some low-cost Cisco; I'm not sure whether it's gigabit capable. I'll find out on Monday
and report back if it's important.
>
> 4. I don't think you can turn off EEE in the bios, but you can try turning
> autoneg off. Try "ethtool -s <iface> speed 100 duplex full autoneg off".
> It's not exactly the same thing since the device still has EEE enabled but
> not negotiated.
Forcing the speed helped; I no longer see the link going down!
Of course, I'll report on the 3.9 kernel too.
Thanks for your reply!
nik
* Re: tg3 adapter losing link - PM related?
2013-05-03 15:13 ` Nithin Nayak Sujir
2013-05-03 18:39 ` Ben Hutchings
2013-05-03 19:28 ` Nikola Ciprich
@ 2013-05-04 6:45 ` Nikola Ciprich
2 siblings, 0 replies; 6+ messages in thread
From: Nikola Ciprich @ 2013-05-04 6:45 UTC
To: Nithin Nayak Sujir; +Cc: netdev
Hello again,
> 2. Is it possible for you to try the latest 3.9 upstream kernel?
So, with 3.9 it's the same.
The box is not used in production yet, so I have a few days for tests and
experiments if that helps. But I shouldn't cut myself off from the network :)
(although there's an IPMI module present, so in the worst case I should be able
to reboot the box).
nik