From mboxrd@z Thu Jan 1 00:00:00 1970
Return-Path:
X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org
Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by smtp.lore.kernel.org (Postfix) with ESMTP id 6EC17C433F5 for ; Sun, 8 May 2022 21:20:56 +0000 (UTC)
Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S232699AbiEHVYo (ORCPT ); Sun, 8 May 2022 17:24:44 -0400
Received: from lindbergh.monkeyblade.net ([23.128.96.19]:45466 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229491AbiEHVYn (ORCPT ); Sun, 8 May 2022 17:24:43 -0400
Received: from mail1.merlins.org (magic.merlins.org [209.81.13.136]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 3D089BC3C for ; Sun, 8 May 2022 14:20:51 -0700 (PDT)
Received: from c-24-5-124-255.hsd1.ca.comcast.net ([24.5.124.255]:58464 helo=sauron.svh.merlins.org) by mail1.merlins.org with esmtpsa (Cipher TLS1.3:ECDHE_SECP256R1__RSA_PSS_RSAE_SHA256__AES_256_GCM:256) (Exim 4.94.2 #2) id 1nnoKg-0002vA-QA by authid with srv_auth_plain; Sun, 08 May 2022 14:20:50 -0700
Received: from merlin by sauron.svh.merlins.org with local (Exim 4.92) (envelope-from ) id 1nnoKg-00EYwk-Jd; Sun, 08 May 2022 14:20:50 -0700
Date: Sun, 8 May 2022 14:20:50 -0700
From: Marc MERLIN
To: Josef Bacik
Cc: linux-btrfs
Subject: Re: Rebuilding 24TB Raid5 array (was btrfs corruption: parent transid verify failed + open_ctree failed)
Message-ID: <20220508212050.GR12542@merlins.org>
References: <20220506031910.GH12542@merlins.org> <20220507153921.GG1020265@merlins.org> <20220507193628.GO12542@merlins.org> <20220508194557.GP12542@merlins.org> <20220508205224.GQ12542@merlins.org>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
In-Reply-To: <20220508205224.GQ12542@merlins.org>
X-Sysadmin: BOFH
X-URL: http://marc.merlins.org/
X-SA-Exim-Connect-IP: 24.5.124.255
X-SA-Exim-Mail-From: marc@merlins.org
Precedence: bulk
List-ID:
X-Mailing-List: linux-btrfs@vger.kernel.org

On Sun, May 08, 2022 at 01:52:24PM -0700, Marc MERLIN wrote:
> On Sun, May 08, 2022 at 03:55:01PM -0400, Josef Bacik wrote:
> > > searching 165298 for bad extents
> > > processed 108756992 of 108756992 possible bytes, 100%
> > > searching 165299 for bad extents
> > > processed 75792384 of 75792384 possible bytes, 100%
> > > searching 18446744073709551607 for bad extents
> > > processed 16384 of 16384 possible bytes, 100%
> > > Recording extents for root 3
> > > Floating point exception
> >
> > Oops that's probably a divide by 0 for my fancy pct thing, I fixed it
> > up and pushed. Thanks,
> >
>
> Mmmh, got worse? Failed on the first file to delete
> processed 491520 of 108756992 possible bytes, 0%WTF???? we think we already inserted this bytenr?? [76300, 108, 0] dumping paths 10467695652864 8675328
> inode ref info failed???
> misc/add0/new/file
> Failed to find [10467695652864, 168, 8675328]
>
> Program received signal SIGSEGV, Segmentation fault.
> rb_search (root=root@entry=0x100000060, key=key@entry=0x7fffffffd540, comp=comp@entry=0x55555559afa1 ,
>     next_ret=next_ret@entry=0x7fffffffd558) at common/rbtree-utils.c:48
> 48              struct rb_node *n = root->rb_node;
> (gdb) bt
> #0  rb_search (root=root@entry=0x100000060, key=key@entry=0x7fffffffd540, comp=comp@entry=0x55555559afa1 ,
>     next_ret=next_ret@entry=0x7fffffffd558) at common/rbtree-utils.c:48
> #1  0x000055555559b1a5 in search_cache_extent (tree=tree@entry=0x100000060, start=start@entry=15645169139712)
>     at common/extent-cache.c:179
> #2  0x0000555555584d33 in set_extent_bits (tree=0x100000060, start=15645169139712, end=15645169139711, bits=bits@entry=1)
>     at kernel-shared/extent_io.c:380

I tried the old code, and here is the output:

processed 108756992 of 108756992 possible bytes, 100%
searching 165299 for bad extents
processed 75792384 of 75792384 possible bytes, 100%
searching 18446744073709551607 for bad extents
processed 16384 of 16384 possible bytes, 100%
Recording extents for root 3

Program received signal SIGFPE, Arithmetic exception.
0x00005555555e08ad in process_eb (trans=trans@entry=0x55555930f1a0, root=root@entry=0x55555564e030,
    eb=eb@entry=0x555555654c10, current=current@entry=0x7fffffffdae8) at /usr/include/x86_64-linux-gnu/bits/stdio2.h:112
112       return __printf_chk (__USE_FORTIFY_LEVEL - 1, __fmt, __va_arg_pack ());
(gdb) bt
#0  0x00005555555e08ad in process_eb (trans=trans@entry=0x55555930f1a0, root=root@entry=0x55555564e030,
    eb=eb@entry=0x555555654c10, current=current@entry=0x7fffffffdae8) at /usr/include/x86_64-linux-gnu/bits/stdio2.h:112
#1  0x00005555555e0745 in process_eb (trans=trans@entry=0x55555930f1a0, root=root@entry=0x55555564e030,
    eb=0x555555650b90, current=current@entry=0x7fffffffdae8) at cmds/rescue-init-extent-tree.c:941
#2  0x00005555555e09ea in record_root (root=0x55555564e030) at cmds/rescue-init-extent-tree.c:1014
#3  0x00005555555e0d11 in btrfs_init_extent_tree (path=path@entry=0x7fffffffe1ce "/dev/mapper/dshelf1")
    at cmds/rescue-init-extent-tree.c:1086
#4  0x00005555555d7b65 in cmd_rescue_init_extent_tree (cmd=, argc=, argv=) at cmds/rescue.c:65
#5  0x000055555556c17b in cmd_execute (argv=0x7fffffffdeb8, argc=2, cmd=0x555555643d40 ) at cmds/commands.h:125
#6  handle_command_group (cmd=, argc=2, argv=0x7fffffffdeb8) at btrfs.c:152
#7  0x000055555556c275 in cmd_execute (argv=0x7fffffffdeb0, argc=3, cmd=0x555555644cc0 ) at cmds/commands.h:125
#8  main (argc=3, argv=0x7fffffffdeb0) at btrfs.c:405

-- 
"A mouse is a device used to point at the xterm you want to type in" - A.S.R.

Home page: http://marc.merlins.org/ | PGP 7F55D5F27AAF9D08
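
A note on the "Floating point exception" / SIGFPE above: on x86-64 Linux an integer division by zero raises SIGFPE, even though no floating point is involved. The fault is reported inside the inlined printf wrapper in process_eb, which would be consistent with a percentage being computed in the printf arguments, as the "divide by 0 for my fancy pct thing" remark suggests. Below is a minimal sketch of a guarded percentage helper. The name progress_pct and its behaviour (reporting 100% when there is nothing to process) are assumptions made for illustration, not the code in btrfs-progs and not necessarily the fix that was pushed.

```c
#include <stdint.h>

/*
 * Hypothetical helper, not taken from btrfs-progs: return the
 * percentage of `done` out of `total`, in the range 0..100.
 */
static unsigned int progress_pct(uint64_t done, uint64_t total)
{
	/* Nothing to process: report 100% instead of dividing by zero. */
	if (total == 0)
		return 100;

	/* Clamp so an over-count can never print more than 100%. */
	if (done >= total)
		return 100;

	/* Here done < total; use double so done * 100 cannot overflow. */
	return (unsigned int)((double)done * 100.0 / (double)total);
}
```

With a helper like this, a progress line of the form "processed X of Y possible bytes, Z%" can be printed safely for an empty tree, since progress_pct(0, 0) returns 100 rather than trapping.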