From: "Paweł Staszewski" <pstaszewski@itcare.pl>
To: Alexander Duyck <alexander.duyck@gmail.com>,
	Pavlos Parissis <pavlos.parissis@gmail.com>,
	"Anders K. Pedersen | Cohaesio" <akp@cohaesio.com>,
	"netdev@vger.kernel.org" <netdev@vger.kernel.org>,
	"intel-wired-lan@lists.osuosl.org"
	<intel-wired-lan@lists.osuosl.org>,
	"alexander.h.duyck@intel.com" <alexander.h.duyck@intel.com>
Subject: Re: Linux 4.12+ memory leak on router with i40e NICs
Date: Thu, 19 Oct 2017 01:40:58 +0200
Message-ID: <57579746-77e1-4603-12ed-7d999fdfeabf@itcare.pl>
In-Reply-To: <CAKgT0Udb5DVu6CB5U7rGNERZwMCKZUTii=qxn6hFzc=5zEZc1w@mail.gmail.com>



On 2017-10-19 at 01:29, Alexander Duyck wrote:
> On Mon, Oct 16, 2017 at 10:51 PM, Vitezslav Samel <vitezslav@samel.cz> wrote:
>> On Tue, Oct 17, 2017 at 01:34:29AM +0200, Paweł Staszewski wrote:
>>> On 2017-10-16 at 18:26, Paweł Staszewski wrote:
>>>> On 2017-10-16 at 13:20, Pavlos Parissis wrote:
>>>>> On 15/10/2017 02:58 AM, Alexander Duyck wrote:
>>>>>> Hi Pawel,
>>>>>>
>>>>>> To clarify is that Dave Miller's tree or Linus's that you are talking
>>>>>> about? If it is Dave's tree how long ago was it you pulled it since I
>>>>>> think the fix was just pushed by Jeff Kirsher a few days ago.
>>>>>>
>>>>>> The issue should be fixed in the following commit:
>>>>>> https://git.kernel.org/pub/scm/linux/kernel/git/davem/net.git/commit/drivers/net/ethernet/intel/i40e/i40e_txrx.c?id=2b9478ffc550f17c6cd8c69057234e91150f5972
>>>>> Do you know when it is going to be available on net-next and
>>>>> linux-stable repos?
>>>>>
>>>>> Cheers,
>>>>> Pavlos
>>>>>
>>>>>
>>>> I will run some tests tonight with the "net" git tree, where this patch
>>>> is included.
>>>> Starting from 0:00 CET
>>>> :)
>>>>
>>>>
>>> I upgraded, and it looks like the problem is not solved by that patch.
>>> The system is currently running a kernel built from
>>> https://git.kernel.org/pub/scm/linux/kernel/git/davem/net.git/
>>>
>>> About 0.5GB of memory is still leaking somewhere.
>>>
>>> I can also confirm that the latest kernel where memory does not leak
>>> (using the i40e driver with Intel 710 cards) is 4.11.12.
>>> With kernel 4.11.12 there is no change in memory usage after an hour.
>>>
>>> I also checked that with ixgbe instead of i40e on the same net.git
>>> kernel there is no memleak (same memory usage after an hour), so this
>>> is 100% an i40e driver problem.
>>    I have (probably) the same problem here but with X520 cards: booting
>> 4.12.x gives me oops after circa 20 minutes of our workload. Booting
>> 4.9.y is OK. This machine is in production so any testing is very
>> limited.
>>
>>    The machine was stable for >2 months (on the desk, before it went
>> to production) with 4.12.8 but with no traffic on the X520 cards.
>>
>>          Cheers,
>>
>>                  Vita
> Sorry but it can't be the same issue since we are discussing a
> different driver (i40e) running different hardware (X710 or XL710).
> You might want to start a new thread for your issue, and/or if
> possible file a bug on e1000.sf.net.
>
> Thanks.
>
> - Alex
>
Sorry, but bugs reported on e1000.sf.net get delayed responses, some
after six months or more. When I reported my first bug there, I got a
reply after a year about no activity :):) haha, and that bug is still
active :)
For me it is now better to change NICs (certainly cheaper from the
clients' perspective :) ), either to Mellanox or to cards that use
ixgbe. Neither Mellanox nor ixgbe has this bug: I have many servers
running them with the same configuration, and only the one with i40e,
with that same configuration, has the memleak.

If nobody from Intel wants to reproduce this, cool: this is not my
problem but Intel's :) There are many good NICs to use now, like
Mellanox, or I can just stick with the many 10G cards based on ixgbe,
which is a really good driver. But really? Intel guys have no XL710
cards? I don't want to buy more buggy cards only to do kernel
bisects... sorry.
To bisect this bug properly you need maybe 200-300 bisect steps, and
confirming each one takes maybe 30 minutes, so count how much time you
need: that costs more than maybe 100 cards from Mellanox :)
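
As a rough check on that estimate, here is a minimal Python sketch of
the arithmetic. It takes the 200-300 tests and 30 minutes per test
quoted above, and compares them with a binary search such as git bisect,
which needs roughly log2(N) tests for N candidate commits.
ASSUMED_COMMITS is a hypothetical placeholder, not a count taken from
the kernel tree, and the function names are only illustrative.

```python
import math


def linear_hours(num_tests: int, minutes_per_test: float) -> float:
    """Total hours if every one of num_tests kernels must be booted and watched."""
    return num_tests * minutes_per_test / 60


def bisect_steps(num_commits: int) -> int:
    """Approximate worst-case number of tests for a binary search over num_commits."""
    if num_commits <= 1:
        return 0
    return math.ceil(math.log2(num_commits))


def bisect_hours(num_commits: int, minutes_per_test: float) -> float:
    """Total hours for a binary search when each test takes minutes_per_test."""
    return bisect_steps(num_commits) * minutes_per_test / 60


# Assumption: a made-up size for the commit range between a good and a bad kernel.
ASSUMED_COMMITS = 15000
MINUTES_PER_TEST = 30  # figure quoted in the message

if __name__ == "__main__":
    for tests in (200, 300):
        print(f"{tests} tests: {linear_hours(tests, MINUTES_PER_TEST):.0f} hours")
    steps = bisect_steps(ASSUMED_COMMITS)
    hours = bisect_hours(ASSUMED_COMMITS, MINUTES_PER_TEST)
    print(f"binary search over {ASSUMED_COMMITS} commits: {steps} tests, {hours:.0f} hours")
```

The real cost depends on the actual commit range and on how long the
leak takes to show up reliably under production traffic, so these
numbers are only an order-of-magnitude guide.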

So imagine what I will do :)


Thanks
Paweł

