From: David Arendt
To: "Maciej S. Szmigiero"
Cc: linux-kernel@vger.kernel.org, nic_swsd@realtek.com, netdev@vger.kernel.org
Subject: Re: kernel 4.18.5 Realtek 8111G network adapter stops responding under high system load
Date: Tue, 18 Sep 2018 10:23:49 +0000
In-Reply-To: <6c14f6d0-ea61-b8e6-57a2-940d32330ed2@maciej.szmigiero.name>
Message-ID: <20180918102349.Horde.yHctatUod-_SC696oYKmySa@cloud.luxmemories.lu>

Hi,

Today I had the network adapter problems again. So the patch doesn't
seem to change anything regarding this problem.

This week my time is unfortunately very limited, but I will try to find
some time next weekend to look a bit more into the issue.

Thanks in advance,
David Arendt

Maciej S. Szmigiero – Sun, 16. September 2018 2:12
> [ I've added Realtek Linux NIC and netdev mailing lists to CC ]
>
> Hi David,
>
> On 15.09.2018 23:23, David Arendt wrote:
> > Hi,
> >
> > just a follow up:
> >
> > In kernel 4.18.8 the behaviour is different.
> >
> > The network is unreachable a number of times, but becomes reachable
> > again by itself, before it finally is no longer reachable at all.
> >
> > Here is the logging output:
> >
> > Sep 15 17:44:43 server kernel: NETDEV WATCHDOG: enp3s0 (r8169): transmit queue 0 timed out
> > Sep 15 17:44:43 server kernel: r8169 0000:03:00.0 enp3s0: link up
> > Sep 15 18:10:26 server kernel: r8169 0000:03:00.0 enp3s0: link up
> > Sep 15 18:12:24 server kernel: r8169 0000:03:00.0 enp3s0: link up
> > Sep 15 18:13:19 server kernel: r8169 0000:03:00.0 enp3s0: link up
> > Sep 15 18:14:48 server kernel: r8169 0000:03:00.0 enp3s0: link up
> > Sep 15 18:20:24 server kernel: r8169 0000:03:00.0 enp3s0: link up
> > Sep 15 18:34:19 server kernel: r8169 0000:03:00.0 enp3s0: link up
> > Sep 15 18:43:43 server kernel: r8169 0000:03:00.0 enp3s0: link up
> > Sep 15 18:46:26 server kernel: r8169 0000:03:00.0 enp3s0: link up
> > Sep 15 19:00:24 server kernel: r8169 0000:03:00.0 enp3s0: link up
> >
> > From 17:44 to 18:46 the network is recovering automatically. After the
> > link up at 19:00, the network is no longer reachable, without any
> > additional message.
> >
> > Looking at ifconfig, the counter for TX packets is incrementing, but
> > the counter for RX packets is not.
> >
> > Here again the driver from 4.17.14 is working flawlessly.
>
> Could you please try this patch on top of 4.18.8:
> git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/
>
> In my case the problem fixed by the above commit was only limited to
> bad TX performance but my r8169 NIC models were different from what
> you have.
>
> If this does not help then try bisecting the issue
> (maybe limited to drivers/net/ethernet/realtek/r8169.c to save time).
> If the NIC dies after a heavy load it might be possible to generate
> such load quickly by in-kernel pktgen.
>
> If that's not possible then please at least compare NIC register
> values displayed by "ethtool -d enp3s0" between working and
> non-working kernels.
>
> > Thanks in advance,
> > David Arendt
>
> Maciej
>
> >
> >
> > On 9/4/18 8:19 AM, David Arendt wrote:
> >> Hi,
> >>
> >> When using kernel 4.18.5 the Realtek 8111G network adapter stops
> >> responding under high system load.
> >>
> >> Dmesg is showing no errors.
> >>
> >> Sometimes an ifconfig enp3s0 down followed by an ifconfig enp3s0 up is
> >> enough for the network adapter to start responding again. Sometimes a
> >> reboot is necessary.
> >>
> >> When copying r8169.c from 4.17.14 to the 4.18.5 kernel, networking is
> >> perfectly stable on 4.18.5, so the problem seems to be r8169.c related.
> >>
> >> Here is the output from lshw:
> >>
> >> *-pci:2
> >>      description: PCI bridge
> >>      product: 8 Series/C220 Series Chipset Family PCI Express Root Port #3
> >>      vendor: Intel Corporation
> >>      physical id: 1c.2
> >>      bus info: pci@0000:00:1c.2
> >>      version: d5
> >>      width: 32 bits
> >>      clock: 33MHz
> >>      capabilities: pci pciexpress msi pm normal_decode bus_master cap_list
> >>      configuration: driver=pcieport
> >>      resources: irq:18 ioport:d000(size=4096) memory:f7300000-f73fffff ioport:f2100000(size=1048576)
> >>    *-network
> >>         description: Ethernet interface
> >>         product: RTL8111/8168/8411 PCI Express Gigabit Ethernet Controller
> >>         vendor: Realtek Semiconductor Co., Ltd.
> >>         physical id: 0
> >>         bus info: pci@0000:03:00.0
> >>         logical name: enp3s0
> >>         version: 0c
> >>         serial:
> >>         size: 1Gbit/s
> >>         capacity: 1Gbit/s
> >>         width: 64 bits
> >>         clock: 33MHz
> >>         capabilities: pm msi pciexpress msix vpd bus_master cap_list ethernet physical tp mii 10bt 10bt-fd 100bt 100bt-fd 1000bt 1000bt-fd autonegotiation
> >>         configuration: autonegotiation=on broadcast=yes driver=r8169 driverversion=2.3LK-NAPI duplex=full firmware=rtl8168g-2_0.0.1 02/06/13 latency=0 link=yes multicast=yes port=MII speed=1Gbit/s
> >>         resources: irq:18 ioport:d000(size=256) memory:f7300000-f7300fff memory:f2100000-f2103fff
> >>
> >> Thanks in advance for looking into this,
> >>
> >> David Arendt
> >>
> >>
> >
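
Regarding the suggestion quoted above to compare the "ethtool -d enp3s0"
register values between working and non-working kernels: the following is
only a minimal sketch in Python, assuming each dump was first saved to a
text file (for example with "ethtool -d enp3s0 > regs-good.txt"; the file
names are hypothetical). It does a plain line-by-line text comparison and
assumes nothing about the register layout that ethtool prints for r8169.

```python
import difflib
import sys


def diff_register_dumps(good_text, bad_text):
    """Return only the lines that differ between two saved register dumps.

    Lines prefixed "- " appear only in the dump from the working kernel,
    lines prefixed "+ " appear only in the dump from the non-working one.
    """
    # Strip trailing whitespace so that padding differences are ignored.
    good_lines = [line.rstrip() for line in good_text.splitlines()]
    bad_lines = [line.rstrip() for line in bad_text.splitlines()]
    # ndiff also emits unchanged ("  ") and hint ("? ") lines; drop those.
    return [
        line
        for line in difflib.ndiff(good_lines, bad_lines)
        if line.startswith(("- ", "+ "))
    ]


if __name__ == "__main__" and len(sys.argv) == 3:
    # Hypothetical usage: python3 regdiff.py regs-good.txt regs-bad.txt
    with open(sys.argv[1]) as good_file, open(sys.argv[2]) as bad_file:
        for line in diff_register_dumps(good_file.read(), bad_file.read()):
            print(line)
```

An empty result would mean the two dumps are textually identical, in which
case the register state at dump time may not be where the kernels differ.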