All of lore.kernel.org
 help / color / mirror / Atom feed
From: "Fujinaka, Todd" <todd.fujinaka@intel.com>
To: Pavlos Parissis <pavlos.parissis@gmail.com>,
	"netdev@vger.kernel.org" <netdev@vger.kernel.org>,
	"intel-wired-lan@lists.osuosl.org"
	<intel-wired-lan@lists.osuosl.org>
Subject: RE: [Intel-wired-lan] Instability of i40e driver on 4.9 kernel
Date: Sat, 21 Oct 2017 00:07:23 +0000	[thread overview]
Message-ID: <9B4A1B1917080E46B64F07F2989DADD697A3FFD1@ORSMSX114.amr.corp.intel.com> (raw)
In-Reply-To: <92118fc4-8a20-f129-193b-9c8fdf81aa24@gmail.com>

You picked a bunch of places to post this, and you really should've used a different place: e1000-devel@lists.sourceforge.net

Also, since you flagged the "communities" post as "answered", you're not likely to get any follow-up. The Intel communities are also not monitored as much by the wired networking people at Intel.

Please let us know if you have any specific issues, and please provide exact reproduction steps so we can investigate your issues, and please use e1000-devel.

Todd Fujinaka
Software Application Engineer
Datacenter Engineering Group
Intel Corporation
todd.fujinaka@intel.com


-----Original Message-----
From: Intel-wired-lan [mailto:intel-wired-lan-bounces@osuosl.org] On Behalf Of Pavlos Parissis
Sent: Thursday, October 19, 2017 4:03 PM
To: netdev@vger.kernel.org; intel-wired-lan@lists.osuosl.org
Subject: [Intel-wired-lan] Instability of i40e driver on 4.9 kernel

Hi all,

We have been running 4.9 kernels for several months on CentOS 7.3 and for few weeks on CentOS 7.4, and, after we replaced 10GbE cobber cards(X540-AT2 with ixgbe driver) with X710 10GbE SFP cards using i40e driver, we noticed sever instabilities on our servers.

On several servers the links were marked down and up again, without any obvious reasons expect a lot of errors on kernel.log. We run Bird Internet daemon on our servers in order to establish BGP peerings with routers and we have observed flapping on BGP peerings. At the same time we had BGP peering stabilities issues we had kernel errors. We decided to go back to 3.10 kernel from CentOS, but that process wasn't smooth as latest firmware gave us problems with speed detection. We rolled back to two version old and speed detection issue was resolved. We have been running 3.10 several weeks without any problems. Even we want certain functionality from kernel 4.9, we decided to switch back to 3.10 as stability of our systems has higher priority.

I need to mention that in all occurrences of the issue we didn't see any anomalies, such DDOS attacks and etc.

I have opened https://communities.intel.com/message/501682#501682 and there you can find all the error messages and other information.

Since we noticed the issues, I have been following netdev ML and I know that there are a lot of improvements/patched queued up for 4.14 and I am hoping those patches fix our issue and most importantly are sent to linux-stable for inclusion in 4.9 kernel.

Cheers,
Pavlos



WARNING: multiple messages have this Message-ID (diff)
From: Fujinaka, Todd <todd.fujinaka@intel.com>
To: intel-wired-lan@osuosl.org
Subject: [Intel-wired-lan] Instability of i40e driver on 4.9 kernel
Date: Sat, 21 Oct 2017 00:07:23 +0000	[thread overview]
Message-ID: <9B4A1B1917080E46B64F07F2989DADD697A3FFD1@ORSMSX114.amr.corp.intel.com> (raw)
In-Reply-To: <92118fc4-8a20-f129-193b-9c8fdf81aa24@gmail.com>

You picked a bunch of places to post this, and you really should've used a different place: e1000-devel at lists.sourceforge.net

Also, since you flagged the "communities" post as "answered", you're not likely to get any follow-up. The Intel communities are also not monitored as much by the wired networking people at Intel.

Please let us know if you have any specific issues, and please provide exact reproduction steps so we can investigate your issues, and please use e1000-devel.

Todd Fujinaka
Software Application Engineer
Datacenter Engineering Group
Intel Corporation
todd.fujinaka at intel.com


-----Original Message-----
From: Intel-wired-lan [mailto:intel-wired-lan-bounces at osuosl.org] On Behalf Of Pavlos Parissis
Sent: Thursday, October 19, 2017 4:03 PM
To: netdev@vger.kernel.org; intel-wired-lan at lists.osuosl.org
Subject: [Intel-wired-lan] Instability of i40e driver on 4.9 kernel

Hi all,

We have been running 4.9 kernels for several months on CentOS 7.3 and for few weeks on CentOS 7.4, and, after we replaced 10GbE cobber cards(X540-AT2 with ixgbe driver) with X710 10GbE SFP cards using i40e driver, we noticed sever instabilities on our servers.

On several servers the links were marked down and up again, without any obvious reasons expect a lot of errors on kernel.log. We run Bird Internet daemon on our servers in order to establish BGP peerings with routers and we have observed flapping on BGP peerings. At the same time we had BGP peering stabilities issues we had kernel errors. We decided to go back to 3.10 kernel from CentOS, but that process wasn't smooth as latest firmware gave us problems with speed detection. We rolled back to two version old and speed detection issue was resolved. We have been running 3.10 several weeks without any problems. Even we want certain functionality from kernel 4.9, we decided to switch back to 3.10 as stability of our systems has higher priority.

I need to mention that in all occurrences of the issue we didn't see any anomalies, such DDOS attacks and etc.

I have opened https://communities.intel.com/message/501682#501682 and there you can find all the error messages and other information.

Since we noticed the issues, I have been following netdev ML and I know that there are a lot of improvements/patched queued up for 4.14 and I am hoping those patches fix our issue and most importantly are sent to linux-stable for inclusion in 4.9 kernel.

Cheers,
Pavlos



  reply	other threads:[~2017-10-21  0:07 UTC|newest]

Thread overview: 8+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2017-10-19 23:02 Instability of i40e driver on 4.9 kernel Pavlos Parissis
2017-10-19 23:02 ` [Intel-wired-lan] " Pavlos Parissis
2017-10-21  0:07 ` Fujinaka, Todd [this message]
2017-10-21  0:07   ` Fujinaka, Todd
2017-10-25 21:49   ` Pavlos Parissis
2017-10-25 21:49     ` Pavlos Parissis
2017-10-27 23:16     ` Paweł Staszewski
2017-10-27 23:16       ` =?unknown-8bit?q?Pawe=C5=82?= Staszewski

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=9B4A1B1917080E46B64F07F2989DADD697A3FFD1@ORSMSX114.amr.corp.intel.com \
    --to=todd.fujinaka@intel.com \
    --cc=intel-wired-lan@lists.osuosl.org \
    --cc=netdev@vger.kernel.org \
    --cc=pavlos.parissis@gmail.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.