From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by smtp.lore.kernel.org (Postfix) with ESMTP id 12295C43334 for ; Thu, 2 Jun 2022 00:06:45 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S232589AbiFBAGn (ORCPT ); Wed, 1 Jun 2022 20:06:43 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:47100 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229996AbiFBAGk (ORCPT ); Wed, 1 Jun 2022 20:06:40 -0400 Received: from mail1.merlins.org (magic.merlins.org [209.81.13.136]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 67821DF31 for ; Wed, 1 Jun 2022 17:06:39 -0700 (PDT) Received: from merlin by mail1.merlins.org with local (Exim 4.94.2 #2) id 1nwYMH-0004RW-Mz by authid ; Wed, 01 Jun 2022 17:06:37 -0700 Date: Wed, 1 Jun 2022 17:06:37 -0700 From: Marc MERLIN To: Josef Bacik Cc: linux-btrfs Subject: Re: Rebuilding 24TB Raid5 array (was btrfs corruption: parent transid verify failed + open_ctree failed) Message-ID: <20220602000637.GL22722@merlins.org> References: <20220601214054.GH22722@merlins.org> <20220601223639.GI22722@merlins.org> <20220601225643.GJ22722@merlins.org> <20220601231008.GK22722@merlins.org> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: X-Sysadmin: BOFH X-URL: http://marc.merlins.org/ User-Agent: Mutt/1.10.1 (2018-07-13) X-SA-Exim-Connect-IP: X-SA-Exim-Mail-From: marc@merlins.org Precedence: bulk List-ID: X-Mailing-List: linux-btrfs@vger.kernel.org On Wed, Jun 01, 2022 at 08:04:03PM -0400, Josef Bacik wrote: > > Found missing chunk 15772793438208-15773867180032 type 0 > > Found missing chunk 15773867180032-15774940921856 type 0 > > Found missing chunk 15774940921856-15776014663680 type 0 > > Found missing chunk 15776014663680-15777088405504 type 0 > > Found missing chunk 15777088405504-15778162147328 type 0 > > This segfault makes no sense, we check to make sure any of this stuff > is NULL. I've added some debugging, hopefully that'll shed some > light. Thanks, Found missing chunk 15486104371200-15487178113024 type 0 Found missing chunk 15487178113024-15488251854848 type 0 Found missing chunk 15488251854848-15489325596672 type 0 Found missing chunk 15671861706752-15672935448576 type 0 Found missing chunk 15672935448576-15674009190400 type 0 Found missing chunk 15772793438208-15773867180032 type 0 Found missing chunk 15773867180032-15774940921856 type 0 Found missing chunk 15774940921856-15776014663680 type 0 Found missing chunk 15776014663680-15777088405504 type 0 Found missing chunk 15777088405504-15778162147328 type 0 adding bg for 20971520 8388608 Program received signal SIGSEGV, Segmentation fault. 0x00005555555826d5 in cache_block_group (block_group=0x555555662000, root=0x0) at ./kerncompat.h:133 133 abort(); (gdb) bt #0 0x00005555555826d5 in cache_block_group (block_group=0x555555662000, root=0x0) at ./kerncompat.h:133 #1 find_search_start (profile=2, num=16384, start_ret=, cache_ret=, root=0x555555650030) at kernel-shared/extent-tree.c:280 #2 find_free_extent (profile=2, exclude_nr=0, exclude_start=0, ins=0x7fffffffd760, hint_byte=, search_end=18446744073709551615, search_start=0, empty_size=0, num_bytes=16384, root=0x555555650030, trans=0x5555556620e0) at kernel-shared/extent-tree.c:2254 #3 btrfs_reserve_extent (trans=trans@entry=0x5555556620e0, root=root@entry=0x555555650030, num_bytes=num_bytes@entry=16384, empty_size=empty_size@entry=0, hint_byte=hint_byte@entry=0, search_end=search_end@entry=18446744073709551615, ins=0x7fffffffd760, is_data=false) at kernel-shared/extent-tree.c:2385 #4 0x0000555555582f12 in alloc_tree_block (ins=0x7fffffffd760, search_end=18446744073709551615, hint_byte=0, empty_size=0, level=0, key=0x7fffffffd880, flags=0, generation=, root_objectid=3, num_bytes=16384, root=0x555555650030, trans=0x5555556620e0) at kernel-shared/extent-tree.c:2520 #5 btrfs_alloc_free_block (trans=trans@entry=0x5555556620e0, root=root@entry=0x555555650030, blocksize=16384, root_objectid=3, key=key@entry=0x7fffffffd880, level=level@entry=0, hint=0, empty_size=0) at kernel-shared/extent-tree.c:2575 #6 0x000055555557243f in __btrfs_cow_block (trans=trans@entry=0x5555556620e0, root=root@entry=0x555555650030, buf=buf@entry=0x555555652b90, parent=parent@entry=0x0, parent_slot=parent_slot@entry=0, cow_ret=cow_ret@entry=0x7fffffffdab8, search_start=0, empty_size=0) at kernel-shared/ctree.c:451 #7 0x0000555555572d2e in btrfs_cow_block (trans=trans@entry=0x5555556620e0, root=root@entry=0x555555650030, buf=0x555555652b90, parent=0x0, parent_slot=0, cow_ret=cow_ret@entry=0x7fffffffdab8) at kernel-shared/ctree.c:544 #8 0x0000555555575730 in btrfs_search_slot (trans=0x5555556620e0, root=root@entry=0x555555650030, key=key@entry=0x55555564fb70, p=p@entry=0x555555662170, ins_len=ins_len@entry=105, cow=cow@entry=1) at kernel-shared/ctree.c:1377 #9 0x0000555555576e85 in btrfs_insert_empty_items (trans=trans@entry=0x5555556620e0, root=root@entry=0x555555650030, path=path@entry=0x555555662170, cpu_key=cpu_key@entry=0x55555564fb70, data_size=data_size@entry=0x7fffffffdb9c, nr=nr@entry=1) at kernel-shared/ctree.c:2824 #10 0x0000555555577295 in btrfs_insert_empty_item (data_size=, key=0x55555564fb70, path=0x555555662170, root=0x555555650030, trans=0x5555556620e0) at ./kernel-shared/ctree.h:2780 #11 btrfs_insert_item (trans=trans@entry=0x5555556620e0, root=0x555555650030, cpu_key=cpu_key@entry=0x55555564fb70, data=0x55555565de80, data_size=80) at kernel-shared/ctree.c:2923 #12 0x00005555555e30dc in restore_missing_chunks (fs_info=0x55555564fbc0) at ./kernel-shared/ctree.h:322 #13 btrfs_find_recover_chunks (path=path@entry=0x7fffffffe1ce "/dev/mapper/dshelf1") at cmds/rescue-recover-chunks.c:458 #14 0x00005555555d7c87 in cmd_rescue_recover_chunks (cmd=, argc=, argv=) at cmds/rescue.c:65 #15 0x000055555556c17b in cmd_execute (argv=0x7fffffffdeb8, argc=2, cmd=0x555555645d40 ) at cmds/commands.h:125 #16 handle_command_group (cmd=, argc=2, argv=0x7fffffffdeb8) at btrfs.c:152 #17 0x000055555556c275 in cmd_execute (argv=0x7fffffffdeb0, argc=3, cmd=0x555555646cc0 ) at cmds/commands.h:125 #18 main (argc=3, argv=0x7fffffffdeb0) at btrfs.c:405 -- "A mouse is a device used to point at the xterm you want to type in" - A.S.R. Home page: http://marc.merlins.org/