From: Anand Jain <anand.jain@oracle.com> To: dsterba@suse.cz, dsterba@suse.com Cc: Desmond Cheong Zhi Xi <desmondcheongzx@gmail.com>, clm@fb.com, josef@toxicpanda.com, linux-btrfs@vger.kernel.org, linux-kernel@vger.kernel.org, skhan@linuxfoundation.org, gregkh@linuxfoundation.org, linux-kernel-mentees@lists.linuxfoundation.org, syzbot+a70e2ad0879f160b9217@syzkaller.appspotmail.com Subject: Re: [PATCH] btrfs: fix rw device counting in __btrfs_free_extra_devids Date: Sun, 25 Jul 2021 21:49:17 +0800 [thread overview] Message-ID: <24c7304c-9a38-278b-0ae5-78edb225cb4c@oracle.com> (raw) In-Reply-To: <20210721175938.GP19710@twin.jikos.cz> On 22/07/2021 01:59, David Sterba wrote: > On Thu, Jul 15, 2021 at 06:34:03PM +0800, Desmond Cheong Zhi Xi wrote: >> Syzbot reports a warning in close_fs_devices that happens because >> fs_devices->rw_devices is not 0 after calling btrfs_close_one_device >> on each device. >> >> This happens when a writeable device is removed in >> __btrfs_free_extra_devids, but the rw device count is not decremented >> accordingly. So when close_fs_devices is called, the removed device is >> still counted and we get an off by 1 error. >> >> Here is one call trace that was observed: >> btrfs_mount_root(): >> btrfs_scan_one_device(): >> device_list_add(); <---------------- device added >> btrfs_open_devices(): >> open_fs_devices(): >> btrfs_open_one_device(); <-------- rw device count ++ >> btrfs_fill_super(): >> open_ctree(): >> btrfs_free_extra_devids(): >> __btrfs_free_extra_devids(); <--- device removed >> fail_tree_roots: >> btrfs_close_devices(): >> close_fs_devices(); <------- rw device count off by 1 >> >> Fixes: cf89af146b7e ("btrfs: dev-replace: fail mount if we don't have replace item with target device") > > What this patch did in the last hunk was the rw_devices decrement, but > conditional: > > @@ -1080,9 +1071,6 @@ static void __btrfs_free_extra_devids(struct btrfs_fs_devices *fs_devices, > if (test_bit(BTRFS_DEV_STATE_WRITEABLE, &device->dev_state)) { > list_del_init(&device->dev_alloc_list); > clear_bit(BTRFS_DEV_STATE_WRITEABLE, &device->dev_state); > - if (!test_bit(BTRFS_DEV_STATE_REPLACE_TGT, > - &device->dev_state)) This condition was wrong. The 1st roll of this patch which is here [1], has the details of why. As shown below - [1] https://patchwork.kernel.org/project/linux-btrfs/patch/b3a0a629df98bd044a1fd5c4964f381ff6e7aa05.1600777827.git.anand.jain@oracle.com/#23640775 ---- rw_devices is incremented in btrfs_open_one_device() for all write-able devices except for devid == BTRFS_DEV_REPLACE_DEVID. But while we clean up the extra devices in __btrfs_free_extra_devids() we used the BTRFS_DEV_STATE_REPLACE_TGT flag isn't set because there isn't the replace-item. So rw_devices went below zero. ---- > - fs_devices->rw_devices--; > } > list_del_init(&device->dev_list); > fs_devices->num_devices--; > --- > > >> @@ -1078,6 +1078,7 @@ static void __btrfs_free_extra_devids(struct btrfs_fs_devices *fs_devices, >> if (test_bit(BTRFS_DEV_STATE_WRITEABLE, &device->dev_state)) { >> list_del_init(&device->dev_alloc_list); >> clear_bit(BTRFS_DEV_STATE_WRITEABLE, &device->dev_state); >> + fs_devices->rw_devices--; >> } >> list_del_init(&device->dev_list); >> fs_devices->num_devices--; > > So should it be reinstated in the original form? No. The reason is the same as above. Only the rw_devices decrement line has to be restored. > The rest of > cf89af146b7e handles unexpected device replace item during mount. > Adding the decrement is correct, but right now I'm not sure about the > corner case when teh devcie has the BTRFS_DEV_STATE_REPLACE_TGT bit set. BTRFS_DEV_STATE_REPLACE_TGT is set (on BTRFS_DEV_REPLACE_DEVID) for two reasons when we call replace through ioctl or during mount upon finding a replace-device item. > The state machine of the device bits and counters is not trivial so > fixing it one way or the other could lead to further syzbot reports if > we don't understand the issue. I agree. Also, a good idea to convert this sysbot test into an xfstests case. Thanks, Anand
prev parent reply other threads:[~2021-07-25 13:49 UTC|newest] Thread overview: 10+ messages / expand[flat|nested] mbox.gz Atom feed top 2021-07-15 10:34 Desmond Cheong Zhi Xi 2021-07-15 11:23 ` Anand Jain 2021-07-15 11:55 ` Nikolay Borisov 2021-07-15 13:11 ` Desmond Cheong Zhi Xi 2021-07-21 13:34 ` David Sterba 2021-07-21 17:59 ` David Sterba 2021-07-25 6:19 ` Desmond Cheong Zhi Xi 2021-07-26 17:52 ` David Sterba 2021-07-26 23:07 ` Desmond Cheong Zhi Xi 2021-07-25 13:49 ` Anand Jain [this message]
Reply instructions: You may reply publicly to this message via plain-text email using any one of the following methods: * Save the following mbox file, import it into your mail client, and reply-to-all from there: mbox Avoid top-posting and favor interleaved quoting: https://en.wikipedia.org/wiki/Posting_style#Interleaved_style * Reply using the --to, --cc, and --in-reply-to switches of git-send-email(1): git send-email \ --in-reply-to=24c7304c-9a38-278b-0ae5-78edb225cb4c@oracle.com \ --to=anand.jain@oracle.com \ --cc=clm@fb.com \ --cc=desmondcheongzx@gmail.com \ --cc=dsterba@suse.com \ --cc=dsterba@suse.cz \ --cc=gregkh@linuxfoundation.org \ --cc=josef@toxicpanda.com \ --cc=linux-btrfs@vger.kernel.org \ --cc=linux-kernel-mentees@lists.linuxfoundation.org \ --cc=linux-kernel@vger.kernel.org \ --cc=skhan@linuxfoundation.org \ --cc=syzbot+a70e2ad0879f160b9217@syzkaller.appspotmail.com \ --subject='Re: [PATCH] btrfs: fix rw device counting in __btrfs_free_extra_devids' \ /path/to/YOUR_REPLY https://kernel.org/pub/software/scm/git/docs/git-send-email.html * If your mail client supports setting the In-Reply-To header via mailto: links, try the mailto: link
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).