All of lore.kernel.org
 help / color / mirror / Atom feed
From: Qu Wenruo <quwenruo.btrfs@gmx.com>
To: Ritesh Harjani <riteshh@linux.ibm.com>, Qu Wenruo <wqu@suse.com>
Cc: linux-btrfs@vger.kernel.org
Subject: Re: [Patch v2 41/42] btrfs: fix the use-after-free bug in writeback subpage helper
Date: Mon, 10 May 2021 16:38:08 +0800	[thread overview]
Message-ID: <e7e6ebdd-a220-e4ec-64e4-d031d7a9b181@gmx.com> (raw)
In-Reply-To: <5d406b40-23d9-8542-792a-2cd6a7d95afe@gmx.com>

Hi Ritesh,

I guess no error report so far is a good thing?

Just to report what my result is, I ran my latest github branch for the
full weekend, over 50 hours, and around 20 runs of full generic/auto
without defrag groups.

And I see no crash at all.

But there is a special note, there is a new patch, introduced just
before the weekend (Fri May 7 09:31:43 2021 +0800), titled "btrfs: fix a
possible use-after-free race in metadata read path", is a new fix for a
bug I reproduced once locally.

The bug should only happen when read is slow and only happens for
metadata read path.

The details can be found in the commit message, although it's rare to
hit, I have hit such problem around 3 times in total.

Hopes you didn't hit any crash during your test.

Thanks,
Qu


On 2021/5/7 下午1:14, Qu Wenruo wrote:
>
>
> On 2021/5/7 下午12:57, Ritesh Harjani wrote:
>> On 21/05/07 07:46AM, Qu Wenruo wrote:
>>>
>>>
>>> On 2021/4/28 上午7:03, Qu Wenruo wrote:
>>>> [BUG]
>>>> There is a possible use-after-free bug when running generic/095.
>>>>
>>>>    BUG: Unable to handle kernel data access on write at
>>>> 0x6b6b6b6b6b6b725b
>>>>    Faulting instruction address: 0xc000000000283654
>>>>    c000000000283078 do_raw_spin_unlock+0x88/0x230
>>>>    c0000000012b1e14 _raw_spin_unlock_irqrestore+0x44/0x90
>>>>    c000000000a918dc btrfs_subpage_clear_writeback+0xac/0xe0
>>>>    c0000000009e0458 end_bio_extent_writepage+0x158/0x270
>>>>    c000000000b6fd14 bio_endio+0x254/0x270
>>>>    c0000000009fc0f0 btrfs_end_bio+0x1a0/0x200
>>>>    c000000000b6fd14 bio_endio+0x254/0x270
>>>>    c000000000b781fc blk_update_request+0x46c/0x670
>>>>    c000000000b8b394 blk_mq_end_request+0x34/0x1d0
>>>>    c000000000d82d1c lo_complete_rq+0x11c/0x140
>>>>    c000000000b880a4 blk_complete_reqs+0x84/0xb0
>>>>    c0000000012b2ca4 __do_softirq+0x334/0x680
>>>>    c0000000001dd878 irq_exit+0x148/0x1d0
>>>>    c000000000016f4c do_IRQ+0x20c/0x240
>>>>    c000000000009240 hardware_interrupt_common_virt+0x1b0/0x1c0
>>>>
>>>> [CAUSE]
>>>> There is very small race window like the following in generic/095.
>>>>
>>>>     Thread 1        |        Thread 2
>>>> --------------------------------+------------------------------------
>>>>     end_bio_extent_writepage()    | btrfs_releasepage()
>>>>     |- spin_lock_irqsave()    | |
>>>>     |- end_page_writeback()    | |
>>>>     |                | |- if (PageWriteback() ||...)
>>>>     |                | |- clear_page_extent_mapped()
>>>>     |                |    |- kfree(subpage);
>>>>     |- spin_unlock_irqrestore().
>>>>
>>>> The race can also happen between writeback and btrfs_invalidatepage(),
>>>> although that would be much harder as btrfs_invalidatepage() has much
>>>> more work to do before the clear_page_extent_mapped() call.
>>>>
>>>> [FIX]
>>>> For btrfs_subpage_clear_writeback(), we don't really need to put
>>>> end_page_writepage() call into the spinlock critical section.
>>>>
>>>> By just checking the bitmap in the critical section and call
>>>> end_page_writeback() outside of the critical section, we can avoid such
>>>> use-after-free bug.
>>>>
>>>> Reported-by: Ritesh Harjani <riteshh@linux.ibm.com>
>>>> Tested-by: Ritesh Harjani <riteshh@linux.ibm.com>
>>>> Signed-off-by: Qu Wenruo <wqu@suse.com>
>>>> ---
>>>>    fs/btrfs/subpage.c | 5 ++++-
>>>>    1 file changed, 4 insertions(+), 1 deletion(-)
>>>>
>>>> diff --git a/fs/btrfs/subpage.c b/fs/btrfs/subpage.c
>>>> index 696485ab68a2..c5abf9745c10 100644
>>>> --- a/fs/btrfs/subpage.c
>>>> +++ b/fs/btrfs/subpage.c
>>>
>>> Hi Ritesh,
>>>
>>> Unfortunately I have to bother you again for testing the latest subpage
>>> branch.
>>
>> Yes, this was anyway on my mind to test the latest subpage branch.
>> Sure, I will do the testing.
>>
>>>
>>> This particular fix seems to be incomplete, as I have hit several
>>> BUG_ON()s
>>> related to end_page_writeback() called on page without writeback flag.
>>
>> ok.
>>
>>>
>>>> @@ -420,13 +420,16 @@ void btrfs_subpage_clear_writeback(const
>>>> struct btrfs_fs_info *fs_info,
>>>>    {
>>>>        struct btrfs_subpage *subpage = (struct btrfs_subpage
>>>> *)page->private;
>>>>        u16 tmp = btrfs_subpage_calc_bitmap(fs_info, page, start, len);
>>>> +    bool finished = false;
>>>>        unsigned long flags;
>>>>        spin_lock_irqsave(&subpage->lock, flags);
>>>>        subpage->writeback_bitmap &= ~tmp;
>>>>        if (subpage->writeback_bitmap == 0)
>>>> -        end_page_writeback(page);
>>>> +        finished = true;
>>>>        spin_unlock_irqrestore(&subpage->lock, flags);
>>>> +    if (finished)
>>>> +        end_page_writeback(page);
>>>
>>> The race can happen like this:
>>>
>>>                T1                  |              T2
>>> ----------------------------------+----------------------------------
>>> __extent_writepage()              |
>>> |<< The 1st sector of the page >> |
>>> |- writepage_delalloc()           |
>>> |  Now the subpage range has      |
>>> |  Writeback flag                 |
>>> |- __extent_writepage_io()        |
>>> |  |- submit_extent_page()        | << endio of the 1st sector >>
>>> |                                 | end_bio_extent_writepage()
>>> |<< The 2nd sector of the page >> | |- spin_lock_irqsave()
>>> |- writepage_delalloc()           | |- finished = true
>>> |  |- spin_lock()                 | |- spin_unlock_irqstore()
>>> |  |- set_page_writeback();       | |
>>> |  |- spin_unlock()               | |- end_page_writeback()
>>> |                                 | << Now page has no writeback >>
>>> |- __extent_writepagE_io()        |
>>>     |- submit_extent_page()        | << endio of the 2nd sector >>
>>>                                    | end_bio_extent_writepage()
>>>                                    | |- finished = true;
>>>                                    | |- end_page_writeback()
>>>                                     !!! BUG_ON() triggered !!!
>>>
>>> The reproducibility is pretty low, so far I have only hit 3 times such
>>> BUG_ON().
>>> No special test case number for it, all 3 BUG_ON() happens for different
>>> test cases.
>>>
>>> Thus newer fix will still keep the end_page_writeback() inside the
>>> spinlock,
>>> but btrfs_releasepage() and btrfs_invalidatepage() will "wait" for the
>>> spinlock to be released before detaching the subpage structure.
>>>
>>> Currently the fix runs fine, but extra test will always help.
>>
>> Sorry, just to be clear, do you mean the latest subpage branch still
>> has some issues where we can hit the BUG_ON() or have you identifed
>> and added
>> some patches to fix it?
>
> Above race is how the old fix (with end_page_writeback() called outside
> of the spinlock) could lead to a BUG_ON().
>
> I believe the new fix, with the same title, can fix the problem.
>
>>
>> Let me clone below branch and re-test xfstests on Power.
>> https://github.com/adam900710/linux/commits/subpage
>>
>> Also if you would like me to test any extra mount option or mkfs
>> option testing
>> too, then pls do let me know. For now I will be testing with default
>> options.
>
> Thanks, let's just focus on the default mount option first.
>
> Thanks for your great help!
> Qu
>
>>
>> -ritesh
>>

  reply	other threads:[~2021-05-10  8:38 UTC|newest]

Thread overview: 117+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2021-04-27 23:03 [Patch v2 00/42] btrfs: add data write support for subpage Qu Wenruo
2021-04-27 23:03 ` [Patch v2 01/42] btrfs: scrub: fix subpage scrub repair error caused by hardcoded PAGE_SIZE Qu Wenruo
2021-05-13 22:57   ` David Sterba
2021-05-13 23:32     ` Qu Wenruo
2021-04-27 23:03 ` [Patch v2 02/42] btrfs: make free space cache size consistent across different PAGE_SIZE Qu Wenruo
2021-04-27 23:03 ` [Patch v2 03/42] btrfs: remove the unused parameter @len for btrfs_bio_fits_in_stripe() Qu Wenruo
2021-05-13 22:58   ` David Sterba
2021-05-13 23:07   ` David Sterba
2021-04-27 23:03 ` [Patch v2 04/42] btrfs: allow btrfs_bio_fits_in_stripe() to accept bio without any page Qu Wenruo
2021-04-27 23:03 ` [Patch v2 05/42] btrfs: refactor submit_extent_page() to make bio and its flag tracing easier Qu Wenruo
2021-05-13 23:03   ` David Sterba
2021-05-21 11:06   ` Johannes Thumshirn
2021-05-21 11:26     ` Qu Wenruo
2021-05-21 13:30       ` David Sterba
2021-04-27 23:03 ` [Patch v2 06/42] btrfs: make subpage metadata write path to call its own endio functions Qu Wenruo
2021-04-27 23:03 ` [Patch v2 07/42] btrfs: pass btrfs_inode into btrfs_writepage_endio_finish_ordered() Qu Wenruo
2021-05-13 23:06   ` David Sterba
2021-05-13 23:35     ` Qu Wenruo
2021-05-21 14:27   ` Josef Bacik
2021-05-21 20:22     ` David Sterba
2021-05-22  0:24     ` Qu Wenruo
2021-05-23  7:40       ` Qu Wenruo
2021-05-23 13:43         ` Josef Bacik
2021-05-23 13:50           ` Qu Wenruo
2021-05-23 14:08             ` Josef Bacik
2021-04-27 23:03 ` [Patch v2 08/42] btrfs: make Private2 lifespan more consistent Qu Wenruo
2021-04-27 23:03 ` [Patch v2 09/42] btrfs: refactor how we finish ordered extent io for endio functions Qu Wenruo
2021-05-13 23:11   ` David Sterba
2021-04-27 23:03 ` [Patch v2 10/42] btrfs: update the comments in btrfs_invalidatepage() Qu Wenruo
2021-04-27 23:03 ` [Patch v2 11/42] btrfs: introduce btrfs_lookup_first_ordered_range() Qu Wenruo
2021-05-13 23:13   ` David Sterba
2021-04-27 23:03 ` [Patch v2 12/42] btrfs: refactor btrfs_invalidatepage() Qu Wenruo
2021-04-27 23:03 ` [Patch v2 13/42] btrfs: rename PagePrivate2 to PageOrdered inside btrfs Qu Wenruo
2021-04-27 23:03 ` [Patch v2 14/42] btrfs: pass bytenr directly to __process_pages_contig() Qu Wenruo
2021-04-27 23:03 ` [Patch v2 15/42] btrfs: refactor the page status update into process_one_page() Qu Wenruo
2021-04-27 23:03 ` [Patch v2 16/42] btrfs: provide btrfs_page_clamp_*() helpers Qu Wenruo
2021-04-27 23:03 ` [Patch v2 17/42] btrfs: only require sector size alignment for end_bio_extent_writepage() Qu Wenruo
2021-04-27 23:03 ` [Patch v2 18/42] btrfs: make btrfs_dirty_pages() to be subpage compatible Qu Wenruo
2021-04-27 23:03 ` [Patch v2 19/42] btrfs: make __process_pages_contig() to handle subpage dirty/error/writeback status Qu Wenruo
2021-04-27 23:03 ` [Patch v2 20/42] btrfs: make end_bio_extent_writepage() to be subpage compatible Qu Wenruo
2021-04-27 23:03 ` [Patch v2 21/42] btrfs: make process_one_page() to handle subpage locking Qu Wenruo
2021-04-27 23:03 ` [Patch v2 22/42] btrfs: introduce helpers for subpage ordered status Qu Wenruo
2021-04-27 23:03 ` [Patch v2 23/42] btrfs: make page Ordered bit to be subpage compatible Qu Wenruo
2021-04-27 23:03 ` [Patch v2 24/42] btrfs: update locked page dirty/writeback/error bits in __process_pages_contig Qu Wenruo
2021-04-27 23:03 ` [Patch v2 25/42] btrfs: prevent extent_clear_unlock_delalloc() to unlock page not locked by __process_pages_contig() Qu Wenruo
2021-04-27 23:03 ` [Patch v2 26/42] btrfs: make btrfs_set_range_writeback() subpage compatible Qu Wenruo
2021-04-27 23:03 ` [Patch v2 27/42] btrfs: make __extent_writepage_io() only submit dirty range for subpage Qu Wenruo
2021-04-27 23:03 ` [Patch v2 28/42] btrfs: make btrfs_truncate_block() to be subpage compatible Qu Wenruo
2021-04-27 23:03 ` [Patch v2 29/42] btrfs: make btrfs_page_mkwrite() " Qu Wenruo
2021-04-27 23:03 ` [Patch v2 30/42] btrfs: reflink: make copy_inline_to_page() " Qu Wenruo
2021-04-27 23:03 ` [Patch v2 31/42] btrfs: fix the filemap_range_has_page() call in btrfs_punch_hole_lock_range() Qu Wenruo
2021-04-27 23:03 ` [Patch v2 32/42] btrfs: don't clear page extent mapped if we're not invalidating the full page Qu Wenruo
2021-04-27 23:03 ` [Patch v2 33/42] btrfs: extract relocation page read and dirty part into its own function Qu Wenruo
2021-04-27 23:03 ` [Patch v2 34/42] btrfs: make relocate_one_page() to handle subpage case Qu Wenruo
2021-04-27 23:03 ` [Patch v2 35/42] btrfs: fix wild subpage writeback which does not have ordered extent Qu Wenruo
2021-04-27 23:03 ` [Patch v2 36/42] btrfs: disable inline extent creation for subpage Qu Wenruo
2021-05-04  4:28   ` Qu Wenruo
2021-04-27 23:03 ` [Patch v2 37/42] btrfs: skip validation for subpage read repair Qu Wenruo
2021-04-27 23:03 ` [Patch v2 38/42] btrfs: allow submit_extent_page() to do bio split for subpage Qu Wenruo
2021-04-27 23:03 ` [Patch v2 39/42] btrfs: reject raid5/6 fs " Qu Wenruo
2021-04-28 14:22   ` Neal Gompa
2021-04-28 23:11     ` Qu Wenruo
2021-05-12 22:04       ` David Sterba
2021-04-27 23:03 ` [Patch v2 40/42] btrfs: fix a crash caused by race between prepare_pages() and btrfs_releasepage() Qu Wenruo
2021-04-28 10:56   ` Filipe Manana
2021-04-27 23:03 ` [Patch v2 41/42] btrfs: fix the use-after-free bug in writeback subpage helper Qu Wenruo
2021-05-06 23:46   ` Qu Wenruo
2021-05-07  4:57     ` Ritesh Harjani
2021-05-07  5:14       ` Qu Wenruo
2021-05-10  8:38         ` Qu Wenruo [this message]
2021-05-10 12:29           ` Ritesh Harjani
2021-05-10 13:10             ` Qu Wenruo
2021-05-11 10:48               ` Ritesh Harjani
2021-05-11 11:15                 ` Qu Wenruo
2021-05-12  1:49                   ` Qu Wenruo
2021-05-12  7:09                     ` Ritesh Harjani
2021-05-13 16:33                       ` Ritesh Harjani
2021-05-13 21:36                         ` Ritesh Harjani
2021-05-13 23:41                           ` Qu Wenruo
2021-05-14 15:08                             ` Ritesh Harjani
2021-05-14 17:53                               ` Ritesh Harjani
2021-05-14 22:22                                 ` Qu Wenruo
2021-05-15  9:59                                   ` Ritesh Harjani
2021-05-15 10:15                                     ` Qu Wenruo
2021-05-25  4:43                                       ` Ritesh Harjani
2021-05-25  5:52                                         ` Qu Wenruo
2021-05-25  6:14                                           ` Qu Wenruo
2021-05-25  9:23                                             ` Ritesh Harjani
2021-05-25  9:45                                               ` Qu Wenruo
2021-05-25  9:49                                                 ` Qu Wenruo
2021-05-25 10:20                                                   ` Ritesh Harjani
2021-05-25 11:41                                                     ` Qu Wenruo
2021-05-25 13:02                                                       ` Ritesh Harjani
2021-05-26  5:29                                                         ` Ritesh Harjani
2021-05-26  5:58                                                           ` Qu Wenruo
2021-05-26 13:45                                                             ` Ritesh Harjani
2021-05-28  8:26                                                               ` Qu Wenruo
2021-05-28  8:59                                                                 ` Ritesh Harjani
2021-05-28 10:25                                                                   ` Qu Wenruo
2021-05-30  1:50                                                                     ` Qu Wenruo
2021-04-27 23:03 ` [Patch v2 42/42] btrfs: allow read-write for 4K sectorsize on 64K page size systems Qu Wenruo
2021-05-12 22:18 ` [Patch v2 00/42] btrfs: add data write support for subpage David Sterba
2021-05-12 23:48   ` Qu Wenruo
2021-05-13  2:21     ` Qu Wenruo
2021-05-13 22:54       ` David Sterba
2021-05-14  1:41         ` Qu Wenruo
2021-05-14  2:26           ` riteshh
2021-05-14 10:28             ` riteshh
2021-05-14 11:28               ` David Sterba
2021-05-14 14:38                 ` riteshh
2021-05-14 11:30       ` David Sterba
2021-05-14 22:25         ` David Sterba
2021-05-14 22:45         ` Qu Wenruo
2021-05-14 23:05           ` David Sterba
2021-05-14 23:17             ` Qu Wenruo
2021-05-17 13:22               ` David Sterba
2021-05-17 23:20                 ` Qu Wenruo

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=e7e6ebdd-a220-e4ec-64e4-d031d7a9b181@gmx.com \
    --to=quwenruo.btrfs@gmx.com \
    --cc=linux-btrfs@vger.kernel.org \
    --cc=riteshh@linux.ibm.com \
    --cc=wqu@suse.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.