From mboxrd@z Thu Jan 1 00:00:00 1970
From: Vitezslav Samel
Subject: Re: Linux 4.12+ memory leak on router with i40e NICs
Date: Tue, 17 Oct 2017 07:51:55 +0200
Message-ID: <20171017055155.GA19944@pc11.op.pod.cz>
References: <1507121766.30720.4.camel@cohaesio.com>
 <1507180753.20182.8.camel@cohaesio.com>
 <227d17ae-b040-07d0-3c57-e9acd1a3b5b4@itcare.pl>
 <3d783736-a474-d9e3-2de2-e35c765f8249@itcare.pl>
 <39696136-2a4a-9c6c-3a63-4485ed2a1bf3@itcare.pl>
Mime-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: 8bit
Cc: Pavlos Parissis, Alexander Duyck, "Anders K. Pedersen | Cohaesio",
 "netdev@vger.kernel.org", "intel-wired-lan@lists.osuosl.org",
 "alexander.h.duyck@intel.com"
To: Paweł Staszewski
Return-path:
Received: from mail.pod.cz ([213.155.227.146]:55802 "EHLO mail.pod.cz"
 rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP
 id S1751485AbdJQF6Q (ORCPT ); Tue, 17 Oct 2017 01:58:16 -0400
Content-Disposition: inline
In-Reply-To: <39696136-2a4a-9c6c-3a63-4485ed2a1bf3@itcare.pl>
Sender: netdev-owner@vger.kernel.org
List-ID:

On Tue, Oct 17, 2017 at 01:34:29AM +0200, Paweł Staszewski wrote:
> On 2017-10-16 at 18:26, Paweł Staszewski wrote:
> > On 2017-10-16 at 13:20, Pavlos Parissis wrote:
> > > On 15/10/2017 02:58 AM, Alexander Duyck wrote:
> > > > Hi Pawel,
> > > >
> > > > To clarify is that Dave Miller's tree or Linus's that you are talking
> > > > about? If it is Dave's tree how long ago was it you pulled it since I
> > > > think the fix was just pushed by Jeff Kirsher a few days ago.
> > > >
> > > > The issue should be fixed in the following commit:
> > > > https://git.kernel.org/pub/scm/linux/kernel/git/davem/net.git/commit/drivers/net/ethernet/intel/i40e/i40e_txrx.c?id=2b9478ffc550f17c6cd8c69057234e91150f5972
> > >
> > > Do you know when it is going to be available on net-next and
> > > linux-stable repos?
> > >
> > > Cheers,
> > > Pavlos
> > >
> > >
> > I will make some tests today night with "net" git tree where this patch
> > is included.
> > Starting from 0:00 CET
> > :)
> >
> >
> Upgraded and looks like problem is not solved with that patch
> Currently running system with
> https://git.kernel.org/pub/scm/linux/kernel/git/davem/net.git/
> kernel
>
> Still about 0.5GB of memory is leaking somewhere
>
> Also can confirm that the latest kernel where memory is not leaking (with
> use i40e driver intel 710 cards) is 4.11.12
> With kernel 4.11.12 - after hour no change in memory usage.
>
> also checked that with ixgbe instead of i40e with same net.git kernel there
> is no memleak - after hour same memory usage - so for 100% this is i40e
> driver problem.

  I have (probably) the same problem here, but with X520 cards: booting
4.12.x gives me an oops after about 20 minutes of our workload. Booting
4.9.y is OK. This machine is in production, so any testing is very
limited. The machine was stable for >2 months (on the desk, before it went
into production) with 4.12.8, but with no traffic on the X520 cards.

	Cheers,
		Vita