From: "Dave, Tushar N" <tushar.n.dave@intel.com>
To: Li Yu <raise.sail@gmail.com>
Cc: Joe Jin <joe.jin@oracle.com>,
	"e1000-devel@lists.sf.net" <e1000-devel@lists.sf.net>,
	"netdev@vger.kernel.org" <netdev@vger.kernel.org>,
	"linux-kernel@vger.kernel.org" <linux-kernel@vger.kernel.org>,
	Mary Mcgrath <mary.mcgrath@oracle.com>
Subject: RE: 82571EB: Detected Hardware Unit Hang
Date: Wed, 14 Nov 2012 03:43:33 +0000	[thread overview]
Message-ID: <061C8A8601E8EE4CA8D8FD6990CEA8913348B0E7@ORSMSX102.amr.corp.intel.com> (raw)
In-Reply-To: <50A311E9.9030702@gmail.com>

>-----Original Message-----
>From: Li Yu [mailto:raise.sail@gmail.com]
>Sent: Tuesday, November 13, 2012 7:37 PM
>To: Dave, Tushar N
>Cc: Joe Jin; e1000-devel@lists.sf.net; netdev@vger.kernel.org;
>linux-kernel@vger.kernel.org; Mary Mcgrath
>Subject: Re: 82571EB: Detected Hardware Unit Hang
>
>On 2012-11-09 04:35, Dave, Tushar N wrote:
>>> -----Original Message-----
>>> From: netdev-owner@vger.kernel.org
>>> [mailto:netdev-owner@vger.kernel.org]
>>> On Behalf Of Joe Jin
>>> Sent: Wednesday, November 07, 2012 10:25 PM
>>> To: e1000-devel@lists.sf.net
>>> Cc: netdev@vger.kernel.org; linux-kernel@vger.kernel.org; Mary
>>> Mcgrath
>>> Subject: 82571EB: Detected Hardware Unit Hang
>>>
>>> Hi list,
>>>
>>> I have a customer who reported "82571EB Detected Hardware Unit Hang"
>>> on an HP ProLiant DL360 G6 and has to reboot the server to recover:
>>>
>>> e1000e 0000:06:00.1: eth3: Detected Hardware Unit Hang:
>>>   TDH                  <1a>
>>>   TDT                  <1a>
>>>   next_to_use          <1a>
>>>   next_to_clean        <18>
>>> buffer_info[next_to_clean]:
>>>   time_stamp           <10047a74e>
>>>   next_to_watch        <18>
>>>   jiffies              <10047a88c>
>>>   next_to_watch.status <1>
>>> MAC Status             <80383>
>>> PHY Status             <792d>
>>> PHY 1000BASE-T Status  <3800>
>>> PHY Extended Status    <3000>
>>> PCI Status             <10>
>>>
>>> With newer kernel 2.0.0.1 the issue is still reproducible.
>>>
>>> Device info:
>>> 06:00.1 Ethernet controller: Intel Corporation 82571EB Gigabit
>>> Ethernet Controller (Copper) (rev 06)
>>> 06:00.1 0200: 8086:10bc (rev 06)
>>>
>>> I compared lspci output before and after the issue; the difference is below:
>>> 06:00.1 Ethernet controller: Intel Corporation 82571EB Gigabit
>>> Ethernet Controller (Copper) (rev 06)
>>> 	Subsystem: Hewlett-Packard Company NC364T PCI Express Quad Port
>>> Gigabit Server Adapter
>>> 	Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr+
>>> Stepping- SERR- FastB2B- DisINTx-
>>> -	Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
>>> +	Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx+
>>
>> Are you sure this is not similar to the issue you reported before?
>> i.e.
>> On Mon, 2012-07-09 at 16:51 +0800, Joe Jin wrote:
>>> I'm seeing a Unit Hang even with the latest e1000e driver 2.0.0 when
>>> doing an scp test. This issue is easy to reproduce on a SUN FIRE X2270
>>> M2: just copying a big file (>500M) from another server will hit it at
>>> once.
>>
>> All devices in the path from the root complex to the 82571 should have
>> the *same* max payload size; otherwise it can cause a hang.
>> Can you double check this?
>>
>
>We also found the same hang problem on the 82599EB (ixgbe driver) with the
>RHEL6.3 kernel. We tried upgrading to the latest version (3.8.21 or
>3.10.17), but it still happens.
>
>Could it also be due to a wrong "max payload size" set in the BIOS?
>
It may or may not be. I would suggest starting a separate thread for that issue, as these two devices are significantly different.

-Tushar
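
For anyone wanting to script the max payload size check suggested above, here is a minimal sketch in Python. It assumes the usual `lspci -vv` text layout, where each device starts with its bus address in column 0 and the PCI Express capability prints "MaxPayload N bytes" under both DevCap (what the device supports) and DevCtl (what is currently programmed). The function names, the sample text and the device path are made up for illustration; the real path from the root complex to the NIC has to be worked out separately, e.g. from `lspci -t`.

```python
import re

# Device header line, e.g. "06:00.1 Ethernet controller: ..." (optional PCI domain prefix).
DEVICE_RE = re.compile(r"^((?:[0-9a-f]{4}:)?[0-9a-f]{2}:[0-9a-f]{2}\.[0-7])\s")
# Indented register label such as "DevCap:" or "DevCtl:".
LABEL_RE = re.compile(r"^\s+(\w+):")
PAYLOAD_RE = re.compile(r"MaxPayload (\d+) bytes")


def parse_max_payload(lspci_text):
    """Map each device address to {'DevCap': n, 'DevCtl': n} (bytes)."""
    result = {}
    device = None
    label = None
    for line in lspci_text.splitlines():
        m = DEVICE_RE.match(line)
        if m:
            device, label = m.group(1), None
            result[device] = {}
            continue
        if device is None:
            continue
        m = LABEL_RE.match(line)
        if m:
            label = m.group(1)  # continuation lines keep the previous label
        m = PAYLOAD_RE.search(line)
        if m and label in ("DevCap", "DevCtl"):
            result[device].setdefault(label, int(m.group(1)))
    return result


def mismatched_devices(payloads, path):
    """Return devices on `path` whose programmed (DevCtl) MaxPayload
    differs from that of the first device on the path."""
    values = {dev: payloads.get(dev, {}).get("DevCtl") for dev in path}
    reference = values[path[0]]
    return {dev: v for dev, v in values.items() if v != reference}


# Hypothetical, trimmed-down lspci -vv output for two devices (not from the report).
SAMPLE = """\
00:01.0 PCI bridge: hypothetical root port
\tCapabilities: [90] Express (v2) Root Port (Slot+), MSI 00
\t\tDevCap:\tMaxPayload 256 bytes, PhantFunc 0
\t\tDevCtl:\tReport errors: Correctable- Non-Fatal- Fatal- Unsupported-
\t\t\tMaxPayload 256 bytes, MaxReadReq 128 bytes
06:00.1 Ethernet controller: hypothetical endpoint
\tCapabilities: [e0] Express (v1) Endpoint, MSI 00
\t\tDevCap:\tMaxPayload 256 bytes, PhantFunc 0
\t\tDevCtl:\tReport errors: Correctable- Non-Fatal- Fatal- Unsupported-
\t\t\tMaxPayload 128 bytes, MaxReadReq 512 bytes
"""

if __name__ == "__main__":
    payloads = parse_max_payload(SAMPLE)
    print(mismatched_devices(payloads, ["00:01.0", "06:00.1"]))
```

On a real system the text would come from running `lspci -vv` as root (unprivileged runs may not show the capability details), and the path list would name every bridge between the root port and the adapter. An empty result only means the programmed values agree; it does not rule out other causes of the hang.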

