Date: Fri, 20 Aug 2021 11:16:27 +0200
From: Christoph Hellwig
To: Oliver Sang
Cc: Christoph Hellwig, "Martin K. Petersen", LKML, Linux Memory Management List, lkp@lists.01.org, lkp@intel.com
Subject: Re: [scsi] 61b3baad24: last_state.load_disk_fail
Message-ID: <20210820091627.GA6035@lst.de>
References: <20210729082528.GA26618@xsang-OptiPlex-9020> <20210820074013.GC29369@xsang-OptiPlex-9020>
In-Reply-To: <20210820074013.GC29369@xsang-OptiPlex-9020>

On Fri, Aug 20, 2021 at 03:40:13PM +0800, Oliver Sang wrote:
> Hi, Christoph Hellwig,
>
> recently we checked this commit again, and find it has a new commit id
> as well as the parent:
> f2542a3be3277 scsi: scsi_ioctl: Move the "block layer" SCSI ioctl handling to drivers/scsi
> 7353dc06c9a8e scsi: scsi_ioctl: Simplify SCSI passthrough permission checking
>
> so we tested it again, and found the issue is still reproduced in
> our environment persistently.
>
> we also tried another platform, and could reproduce, too.
> Intel(R) Xeon(R) CPU E7-8890 v3 @ 2.50GHz
>
> in parent dmesg (attached as dmesg-7353dc06c9.xz),
> it's clear that the disk mounted without problem:
> [ 31.549031][ T1791] XFS (sda1): Mounting V5 Filesystem
> [ 31.591680][ T1791] XFS (sda1): Starting recovery (logdev: internal)
> [ 31.608990][ T1791] XFS (sda1): Ending recovery (logdev: internal)
> [ 31.625155][ T1791] xfs filesystem being mounted at /opt/rootfs supports timestamps until 2038 (0x7fffffff)
>
> but in the dmesg for commit f2542a3be3277 (attached as dmesg-f2542a3be3.xz),
> which is from the identical test environment except kernel,
> just failed like below:
> [ 62.411699][ T1661] can't load the disk /dev/disk/by-id/ata-INTEL_SSDSC2BA400G4_BTHV634503K3400NGN-part1, skip testing...

Really strange. This message is printed when wait_load_disk fails.
The kernel has probed all disks before that point, then apparently something is installed using dpkg, and then it waits for this rootfs (which obviously isn't the root at that point).

Also, at least on my Debian testing and oldstable systems, a plain blkid call never even calls SG_IO or related ioctls (which makes sense given that it looks at the file system labels).

Does this issue just show up on one particular system or on multiple different ones?
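
For reference, the kind of wait loop that would end in a "can't load the disk" message can be sketched roughly as below. This is only an illustration of the pattern under stated assumptions, not the actual wait_load_disk code from the test harness: the function names `wait_for_disk` and `load_disk_or_skip`, the timeout and the polling interval are all hypothetical, and the check is simply whether the device path resolves.

```python
import os
import time


def wait_for_disk(path, timeout=5.0, interval=0.1):
    """Poll until `path` exists or `timeout` seconds elapse.

    Hypothetical helper: returns True if the path showed up in time,
    False otherwise. os.path.exists() follows symlinks, so a dangling
    /dev/disk/by-id/... link counts as "not there yet".
    """
    deadline = time.monotonic() + timeout
    while True:
        if os.path.exists(path):
            return True
        if time.monotonic() >= deadline:
            return False
        time.sleep(interval)


def load_disk_or_skip(path, timeout=5.0):
    """Return None if the disk appeared, else a skip message.

    The message text mirrors the one in the quoted dmesg; how the real
    harness produces it is an assumption here.
    """
    if wait_for_disk(path, timeout):
        return None
    return f"can't load the disk {path}, skip testing..."
```

With a loop like this, a failure only says that the by-id path never resolved within the timeout; it does not say whether the kernel failed to probe the disk or whether udev never created the link.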