From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-5.8 required=3.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,HEADER_FROM_DIFFERENT_DOMAINS,MAILING_LIST_MULTI, SPF_HELO_NONE,SPF_PASS,URIBL_BLOCKED autolearn=no autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id B6C23C48BCD for ; Tue, 8 Jun 2021 17:57:00 +0000 (UTC) Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by mail.kernel.org (Postfix) with ESMTP id 9DB9161183 for ; Tue, 8 Jun 2021 17:57:00 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S233933AbhFHR6w (ORCPT ); Tue, 8 Jun 2021 13:58:52 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:49192 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S231481AbhFHR6v (ORCPT ); Tue, 8 Jun 2021 13:58:51 -0400 Received: from mail-pj1-x102f.google.com (mail-pj1-x102f.google.com [IPv6:2607:f8b0:4864:20::102f]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 1FA54C061574 for ; Tue, 8 Jun 2021 10:56:45 -0700 (PDT) Received: by mail-pj1-x102f.google.com with SMTP id h16so12388603pjv.2 for ; Tue, 08 Jun 2021 10:56:45 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=aternos.org; s=google; h=mime-version:from:date:message-id:subject:to; bh=NtFUkXFeSM5JJGxJce9fbpNpmFBiYRG+AnaGSrBu2Qo=; b=iFL6slB8LabhOSOgOCwIxpLNxap2wgwCR8d3ezoyIVOOT+V3umE3PyPDgLGs20WOud dj5EcFpaFAX25INZtWOIk+krpnXJ0rtJ/MUlO8w3KxbsvoVSD8tb0ADZlNv+4Xc0nZA1 Wk5rj1a24jy+D9jQ5Chi4BMRgRKdmF12SNYaH3F2FLUb0ksG2lEHXCtf3EFHb+u/wqBu lx520AqFBdm0i1m8c+q2ZdV6NISFSJt+k8cNowPKYiHbXUiy/W5dytXbXfBh8VonWKfx D4NInh0Drq8iQsfnom2rlePjqxVoM/0E+aht0bnafu2an0sE3AWmJsvUOcGBa5dwOxtf N6Yw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:from:date:message-id:subject:to; bh=NtFUkXFeSM5JJGxJce9fbpNpmFBiYRG+AnaGSrBu2Qo=; b=O3ZCTRFb4VHK5XF67Nl0QWwNHlGksfQzVDquhXEA0/tqxmZVMRmp6Jeyf4qf708mqc kuTKGQoZCv/0H7dEDYQXlEIul262h39mQX9o0UX+grC+HXtkeM0jo5sPi1JOwrLgZ4/V ezhvXtd+HuBe1SIgeHPRFZ0PBEB9Ah5AhTmVzqu1dXU9ylRz1BRBaOi1T2Q4wgKwVrpG zhsVZg1eo3Psy8F/7hYjVya45apPO3+GVytaCtKfL9lnfpI83LJXh9XwI2Mpg6/oQlEc zqpS4aLTcF9gtzGfv6CPKNubDQgeN51/qjB9vvwIIWMI2cBt51iTzYTyottmI0Wz7Dp2 W7ow== X-Gm-Message-State: AOAM530/qw9G0snkKj3ucBK7WhZnfRIEVGy8y9e1M41bOf/X+0n+aL/k cnEGtuLo67VsMPkXW+CE/uSEQFeM5o8DGXoljVGfvExYBnhtGA== X-Google-Smtp-Source: ABdhPJxKMNqTNwCJ7Zr4MnngsZ/J65SNxzJWpGbRCgH/7LHU+VAB/AnyqhWbXv/pX98vnMB+t4vGnqH0Z/YBY54rNjM= X-Received: by 2002:a17:90b:108f:: with SMTP id gj15mr28504137pjb.124.1623175004485; Tue, 08 Jun 2021 10:56:44 -0700 (PDT) MIME-Version: 1.0 From: Roman Steinhart Date: Tue, 8 Jun 2021 19:56:33 +0200 Message-ID: Subject: bnxt_en NIC driver crashes IO_PAGE_FAULT To: netdev@vger.kernel.org, linux-kernel@vger.kernel.org Content-Type: text/plain; charset="UTF-8" Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi all, You receive this mail because I raised a bug report against the bnxt_en driver in the Linux kernel on launchpad.net: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1931106 I was advised there to get in touch with you here. We received a bunch of new servers with a Supermicro H12SSL-NT mainboard that has an embedded Broadcom BCM57416 NIC. On all those servers we observe crashes of the NIC driver (bnxt_en) from time to time. We're not able to manually reproduce this issue, it just occurs at some point. Also our monitoring does not show any irregularities(high traffic flow or sth. like this). All servers are running with up-to-date packages: $ lsb_release -rd Description: Ubuntu 20.04.2 LTS Release: 20.04 We tested the kernel versions 5.4.0-73 back to -66, the current HWE kernel 5.8.0-55 as well as the latest mainline kernel 5.13.0-051300rc5. On those 20 servers the crash occurs like ~1-2 times a week. Just with the 5.13.0 kernel the driver crashed on all 5 servers running that version within 1-2 hours after installing that kernel version. Syslog 5.4.0-73 kernel: https://pastebin.com/yDAyjHvF Syslog 5.13-rc5 kernel: https://pastebin.com/GWqtVaA3 Apport file: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1931106/+attachment/5502930/+files/apport.linux-image-5.8.0-55-generic.cime34c6.apport related Launchpad.net Bug report: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1931106 Thanks in advance. ~ Roman