Subject: Re: [PATCH] Btrfs: fix root drop key mismatch when drop snapshot fails
To: fdmanana@gmail.com
Cc: linux-btrfs
References: <20210802104004.733-1-robbieko@synology.com>
From: robbieko
Message-ID: <2e781b79-2b7c-f88f-17d4-9d237b65d67e@synology.com>
Date: Tue, 3 Aug 2021 15:36:50 +0800
X-Mailing-List: linux-btrfs@vger.kernel.org

On 2021/8/2 7:28 PM, Filipe Manana wrote:
> On Mon, Aug 2, 2021 at 11:41 AM robbieko wrote:
>> From: Robbie Ko
>>
>> When walk down/up tree fails, we did not abort the transaction,
>> nor did modify the root drop key, but the refs of some tree blocks
>> may have been removed in the transaction.
>>
>> Therefore, when we retry to delete subvol in the future, and
>> missing reference occurs when lookup extent info.
> This sentence is confusing, it took me some time to understand it.
>
> Something like:
>
> Therefore when we retry to delete the subvolume in the future, we will
> fail due to
> the fact that some references were deleted in the previous attempt:
>
>> ------------[ cut here ]------------
>> WARNING: at fs/btrfs/extent-tree.c:898 btrfs_lookup_extent_info+0x40a/0x4c0 [btrfs]()
>> CPU: 2 PID: 11618 Comm: btrfs-cleaner Tainted: P
>> Hardware name: Synology Inc. RS3617xs Series/Type2 - Board Product Name1, BIOS M.017 2019/10/16
>> ffffffff814c2246 ffffffff81036536 ffff88024a911d08 ffff880262de45b0
>> ffff8802448b5f20 ffff88024a9c1ad8 0000000000000000 ffffffffa08eb05a
>> 000008f84e72c000 0000000000000000 0000000000000001 0000000100000000
>> Call Trace:
>> [] ? dump_stack+0xc/0x15
>> [] ? warn_slowpath_common+0x56/0x70
>> [] ? btrfs_lookup_extent_info+0x40a/0x4c0 [btrfs]
>> [] ? do_walk_down+0x128/0x750 [btrfs]
>> [] ? walk_down_proc+0x314/0x330 [btrfs]
>> [] ? walk_down_tree+0xc2/0xf0 [btrfs]
>> [] ? btrfs_drop_snapshot+0x40e/0x9a0 [btrfs]
>> [] ? btrfs_clean_one_deleted_snapshot+0xab/0xe0 [btrfs]
>> [] ? cleaner_kthread+0x280/0x320 [btrfs]
>> [] ? kthread+0xaf/0xc0
>> [] ? kthread_create_on_node+0x110/0x110
>> [] ? ret_from_fork+0x5d/0xb0
>> [] ? kthread_create_on_node+0x110/0x110
>> ------------[ end trace ]------------
>> BTRFS error (device dm-1): Missing references.
>> BTRFS: error (device dm-1) in btrfs_drop_snapshot:9557: errno=-5 IO failure
>>
>> We fix this problem by abort trnasaction when walk down/up tree fails.
> Typo in "trnasaction". Also "by aborting the transaction".
>
> Finally you should be more explicit about the problem, something like:
>
> By not aborting the transaction, every future attempt to delete the
> subvolume fails and we
> end up never freeing all the extents used by the subvolume/snapshot.
> By aborting the transaction we have a least the possibility to
> succeeded after unmounting
> and mounting again the filesystem.
>
> Also use "btrfs: " instead of "Btrfs: " in the subject.

Aborting the transaction is the safer practice.

----------------------
If we want to guarantee that the drop progress is correct, we need to
check the error handling in several different situations, which is the
more complicated part.
For example, do_walk_down first modifies wc->drop_progress and
wc->drop_level, and only then goes on to free the extent.
If freeing the extent fails, drop_progress and drop_level are already
incorrect and must not be written back to root_item.

----------------------
In addition, I found a potential risk.
We unconditionally write wc->drop_progress and wc->drop_level back to
the root item, but both values are 0 at initialization time; they are
not initialized from root_item->drop_progress.
As a result, root_item->drop_progress can be cleared to 0 when a
subvolume delete is resumed, which leaves the drop key inconsistent.

> Now my question is, why can't the problem be solved by ensuring we
> persist a correct drop progress key?
>
> That is, if walk up or walk down fails, still try to update the drop
> progress and the root item with the new drop progress - aborting the
> transaction only if we get an error updating the root item.
>
> Is there a reason why that can't be done? If that does not work, it
> should be mentioned in the change log.
>
> Thanks.
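
The initialization risk described above (wc->drop_progress starting at 0
instead of the value saved in the root item) can be modeled in user space.
This is only a sketch under stated assumptions: the model_* structures and
model_resume_and_fail_early() are hypothetical stand-ins invented for
illustration, not the real btrfs definitions or code paths, and seeding the
walk state from the root item is an assumed remedy rather than something the
patch does.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical, simplified stand-ins; not the real btrfs definitions. */
struct model_key {
	uint64_t objectid;
	uint8_t type;
	uint64_t offset;
};

struct model_root_item {
	struct model_key drop_progress;
	int drop_level;
};

struct model_walk_control {
	struct model_key drop_progress;
	int drop_level;
};

/*
 * Simulate resuming a subvolume delete that fails before the walk records
 * any new progress, and then writes the walk state back unconditionally.
 * If seed_from_root is non-zero, the walk state is first initialized from
 * the saved root item (the assumed remedy); otherwise it stays zeroed.
 * Returns the objectid of the drop key stored in the root item afterwards.
 */
uint64_t model_resume_and_fail_early(int seed_from_root)
{
	/* Progress persisted by an earlier, partially completed attempt. */
	struct model_root_item ri = {
		.drop_progress = { .objectid = 256, .type = 1, .offset = 4096 },
		.drop_level = 1,
	};
	struct model_walk_control wc;

	memset(&wc, 0, sizeof(wc));	/* walk state is zeroed at init */
	if (seed_from_root) {
		wc.drop_progress = ri.drop_progress;
		wc.drop_level = ri.drop_level;
	}

	/* ... the walk fails here, before wc is updated ... */

	/* Unconditional write-back of the walk state to the root item. */
	ri.drop_progress = wc.drop_progress;
	ri.drop_level = wc.drop_level;
	assert(ri.drop_level == wc.drop_level);	/* write-back copied the level */

	return ri.drop_progress.objectid;
}
```

Calling model_resume_and_fail_early(0) returns 0, i.e. the saved drop key is
lost, while model_resume_and_fail_early(1) returns 256, i.e. the saved drop
key survives the failed resume.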
>
>
>> Signed-off-by: Robbie Ko
>> ---
>>  fs/btrfs/extent-tree.c | 4 +++-
>>  1 file changed, 3 insertions(+), 1 deletion(-)
>>
>> diff --git a/fs/btrfs/extent-tree.c b/fs/btrfs/extent-tree.c
>> index 268ce58d4569..49cdb7eeccb3 100644
>> --- a/fs/btrfs/extent-tree.c
>> +++ b/fs/btrfs/extent-tree.c
>> @@ -5659,8 +5659,10 @@ int btrfs_drop_snapshot(struct btrfs_root *root, int update_ref, int for_reloc)
>>  		}
>>  	}
>>  	btrfs_release_path(path);
>> -	if (err)
>> +	if (err) {
>> +		btrfs_abort_transaction(trans, err);
>>  		goto out_end_trans;
>> +	}
>>
>>  	ret = btrfs_del_root(trans, &root->root_key);
>>  	if (ret) {
>> --
>> 2.17.1
>>
>
> --
> Filipe David Manana,
>
> “Whether you think you can, or you think you can't — you're right.”