From: bugzilla-daemon@bugzilla.kernel.org
To: linuxppc-dev@lists.ozlabs.org
Subject: [Bug 207359] MegaRAID SAS 9361 controller hang/reset
Date: Thu, 06 Aug 2020 17:56:24 +0000

https://bugzilla.kernel.org/show_bug.cgi?id=207359

--- Comment #4 from Cameron (cam@neo-zeon.de) ---
I converted the box's filesystems from BTRFS to XFS, and switched the page size from 4k to 64k. The problem appears to be entirely gone now. I can now run 5.7.13 without issue; I had verified that this kernel exhibits the megaraid_sas controller hang under my previous BTRFS+4k page configuration.

Unfortunately, it took a great deal of time to perform this conversion, and I wasn't able to keep the box down even longer to test whether converting to XFS or to 64k pages individually resolved the issue. All I can say for certain is that either switching to XFS, to a 64k page size, or both has fixed the problem for me.

The backup volume is a single SATA disk that is still using BTRFS (for snapshotting), and is not giving me any trouble.
But if this has any relation to https://bugzilla.kernel.org/show_bug.cgi?id=206123, then this may not be conclusive, since SATA disks may not trigger the issue. The single disk also can't push as much IO as the RAID10 volume, so that may be another reason.

My quasi-educated, non-kernel-dev guess is that this is probably a bug relating to the 4k page size. The regular behavior of BTRFS may exacerbate this (making it easier to reproduce), but that is unknown.

Hopefully someone else encountering this issue will find this helpful.