All of lore.kernel.org
 help / color / mirror / Atom feed
* [BUG] igb: reconnecting of cable not always detected
@ 2018-04-24 15:14 ` Holger Schurig
  0 siblings, 0 replies; 23+ messages in thread
From: Holger Schurig @ 2018-04-24 15:14 UTC (permalink / raw)
  To: jeffrey.t.kirsher, intel-wired-lan, linux-kernel

Hi all,

I'm on kernel 4.16.4 and have an issue with eth0, driver is igb. When I
remove the ethernet cable, this is always detected:

[    2.772360] igb: Intel(R) Gigabit Ethernet Network Driver - version 5.4.0-k
[    2.772363] igb: Copyright (c) 2007-2014 Intel Corporation.
[    3.023707] igb 0000:02:00.0: added PHC on eth0
[    3.023710] igb 0000:02:00.0: Intel(R) Gigabit Ethernet Network Connection
[    3.023713] igb 0000:02:00.0: eth0: (PCIe:2.5Gb/s:Width x1) 00:13:95:1a:54:33
[    3.023758] igb 0000:02:00.0: eth0: PBA No: 000300-000
[    3.023762] igb 0000:02:00.0: Using MSI-X interrupts. 4 rx queue(s), 4 tx queue(s)
[    7.984921] igb 0000:02:00.0 eth0: igb: eth0 NIC Link is Up 1000 Mbps Full Duplex, Flow Control: RX/TX
[   11.184593] igb 0000:02:00.0 eth0: igb: eth0 NIC Link is Down

Sometimes, plugging the cable back in is detected ...

[   43.736922] igb 0000:02:00.0 eth0: igb: eth0 NIC Link is Up 1000 Mbps Full Duplex, Flow Control: RX/TX

... but sometimes this is *NOT* detected. I can put the cable in and
even after two minutes nothing has been detected.

But when I run "rmmod igb" followed by "modpobe igb", the link is
detected again:

[  100.528609] igb 0000:02:00.0 eth0: igb: eth0 NIC Link is Down
[ 2336.583244] igb 0000:02:00.0: removed PHC on eth0
[ 2339.693521] igb: Intel(R) Gigabit Ethernet Network Driver - version 5.4.0-k
[ 2339.693524] igb: Copyright (c) 2007-2014 Intel Corporation.
[ 2339.990553] pps pps0: new PPS source ptp0
[ 2339.990561] igb 0000:02:00.0: added PHC on eth0
[ 2339.990565] igb 0000:02:00.0: Intel(R) Gigabit Ethernet Network Connection
[ 2339.990569] igb 0000:02:00.0: eth0: (PCIe:2.5Gb/s:Width x1) 00:13:95:1a:54:33
[ 2339.990611] igb 0000:02:00.0: eth0: PBA No: 000300-000
[ 2339.990615] igb 0000:02:00.0: Using MSI-X interrupts. 4 rx queue(s), 4 tx queue(s)
[ 2343.001114] igb 0000:02:00.0 eth0: igb: eth0 NIC Link is Up 1000 Mbps Full Duplex, Flow Control: RX/TX

(In above dmesg snippet the ethernet cable was the whole time inserted).


Any tips on how I can debug this further?

PS: I already tried a different switch and also a direct connection from
device-to-device, without a switch.

^ permalink raw reply	[flat|nested] 23+ messages in thread
* [BUG] igb: reconnecting of cable not always detected
@ 2018-06-09 17:15 Thomas Netousek
  0 siblings, 0 replies; 23+ messages in thread
From: Thomas Netousek @ 2018-06-09 17:15 UTC (permalink / raw)
  To: linux-kernel

I have a similar problem.
If I disconnect and reconnect the ethernet cable on a Intel Ethernet
card then the device does not come up again.

For me this problem happens on the first pull of the LAN cable all the time.

It is reproducible on Supermicro X8, X9 and X10 dual CPU mainboards with
onboard networking providing two PHY interfaces using Intel 82576 and
I350 chips.
It is not reproducible on a Supermicro X10SLL single mainboard with
onboard I210 chip providing one PHY for eth0 (tested) and one
I217-LM powered by the e1000e driver (not connected, not tested).

It is reproducible using kernel 4.9.107 and 4.17.0.
It is not reproducible using  kernels 4.1.48, 4.4.136.
So it might be related to the changes in the igb versions from 5.3.0-k
(good) to 5.4.0-k (bad).

After pulling and re-plugging the cable, with the bad driver I get:

# ip -d link show eth0
2: eth0: <NO-CARRIER,BROADCAST,MULTICAST,UP> mtu 1500 qdisc mq state
DOWN mode DEFAULT group default qlen 1000
    link/ether 0c:c4:7a:69:9d:3e brd ff:ff:ff:ff:ff:ff promiscuity 0
numtxqueues 8 numrxqueues 8 gso_max_size 65536 gso_max_segs 65535

# ethtool -i eth0    
Cannot get driver information: No such device

The last lines in the dmesg output are:

[   13.127730] igb 0000:01:00.0 eth0: igb: eth0 NIC Link is Up 1000 Mbps
Full Duplex, Flow Control: RX/TX
[   13.747735] igb 0000:01:00.1 eth1: igb: eth1 NIC Link is Up 1000 Mbps
Full Duplex, Flow Control: RX/TX
[  147.760943] igb 0000:01:00.0 eth0: igb: eth0 NIC Link is Down
[  608.211864] igb 0000:01:00.0 eth0: PCIe link lost, device now detached

Please note that the "PCIe link lost" message arrives 8 minutes after
re-plugging the LAN cable.

I hope that information helps pinning down this bug and fixing it.

Kind regards
Thomas

^ permalink raw reply	[flat|nested] 23+ messages in thread

end of thread, other threads:[~2019-01-17 21:55 UTC | newest]

Thread overview: 23+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2018-04-24 15:14 [BUG] igb: reconnecting of cable not always detected Holger Schurig
2018-04-24 15:14 ` [Intel-wired-lan] " Holger Schurig
2018-04-24 18:09 ` Alexander Duyck
2018-04-24 18:09   ` [Intel-wired-lan] " Alexander Duyck
2018-04-25  3:30   ` Richard Cochran
2018-04-25  3:30     ` [Intel-wired-lan] " Richard Cochran
2018-04-25  9:47   ` Holger Schurig
2018-04-25  9:47     ` [Intel-wired-lan] " Holger Schurig
2018-04-25 16:01     ` Alexander Duyck
2018-04-25 16:01       ` [Intel-wired-lan] " Alexander Duyck
2018-04-26  7:54       ` Holger Schurig
2018-04-26  7:54         ` [Intel-wired-lan] " Holger Schurig
2018-04-26  9:08       ` Holger Schurig
2018-04-26  9:08         ` [Intel-wired-lan] " Holger Schurig
2018-04-26 16:02         ` Alexander Duyck
2018-04-26 16:02           ` [Intel-wired-lan] " Alexander Duyck
2018-04-27 10:39           ` Holger Schurig
2018-04-27 10:39             ` [Intel-wired-lan] " Holger Schurig
2018-05-18  7:35           ` Holger Schurig
2018-05-18  7:35             ` [Intel-wired-lan] " Holger Schurig
2019-01-17 21:55             ` Jeff Kirsher
2019-01-17 21:55               ` Jeff Kirsher
2018-06-09 17:15 Thomas Netousek

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.