Date: Tue, 5 Apr 2022 17:35:21 -0700
From: Marc MERLIN
To: Josef Bacik
Cc: "linux-btrfs@vger.kernel.org"
Subject: Re: Rebuilding 24TB Raid5 array (was btrfs corruption: parent transid verify failed + open_ctree failed)
Message-ID: <20220406003521.GM28707@merlins.org>
References: <20220405203737.GE28707@merlins.org> <20220405211412.GF28707@merlins.org> <20220405212655.GH28707@merlins.org> <20220405214309.GI28707@merlins.org> <20220405225808.GJ28707@merlins.org>
X-Mailing-List: linux-btrfs@vger.kernel.org

On Tue, Apr 05, 2022 at 08:23:42PM -0400, Josef Bacik wrote:
> Alright it's time for the crazy. Go ahead and pull and start running.
> This is going to take a while to run, we're basically going to walk
> and check all the node pointers in the tree root, if it doesn't look
> right we're going to search the metadata for the best copy to use, and
> then update the block to point at the new block. It has to do the
> full search every time, because we don't have time for me to properly
> implement a cache, so don't be worried if it takes a while.
>
> It may print out stuff, if it looks like it's looping stop it and let
> me know, but I don't think I fucked it up. You're going to see a lot
> "fixed root ", "fixed slot ", if you see it repeating
> the same slot or root then we know we have a problem.

I assume this is not good?

ERROR: failed to write bytenr 13577814573056 length 16384 devid 1 dev_bytenr 13439981355008: Operation not permitted

I am running as root of course.

Marc
-- 
"A mouse is a device used to point at the xterm you want to type in" - A.S.R.

Home page: http://marc.merlins.org/