From: Qu Wenruo <quwenruo.btrfs@gmx.com>
To: Josef Bacik <josef@toxicpanda.com>, Qu Wenruo <wqu@suse.com>,
	linux-btrfs@vger.kernel.org
Subject: Re: [PATCH 07/42] btrfs: use u32 for length related members of btrfs_ordered_extent
Date: Sat, 17 Apr 2021 07:59:17 +0800
Message-ID: <f7770543-8b6d-afd6-e6d5-6290a3f37b6e@gmx.com>
In-Reply-To: <5160a3e9-1612-45d9-a1ba-71fdcf9b8b24@toxicpanda.com>



On 2021/4/16 9:54 PM, Josef Bacik wrote:
> On 4/15/21 1:04 AM, Qu Wenruo wrote:
>> Unlike btrfs_file_extent_item, btrfs_ordered_extent has its length
>> limit (BTRFS_MAX_EXTENT_SIZE), which is far smaller than U32_MAX.
>>
>> Using u64 for those length related members is just a waste of memory.
>>
>> This patch will make the following members u32:
>> - num_bytes
>> - disk_num_bytes
>> - bytes_left
>> - truncated_len
>>
>> This will save 16 bytes for btrfs_ordered_extent structure.
>>
>> For btrfs_add_ordered_extent*() call sites, they are mostly deeply
>> inside other functions passing u64.
>> Thus this patch will keep those u64, but do internal ASSERT() to ensure
>> the correct length values are passed in.
>>
>> For btrfs_dec_test_.*_ordered_extent() call sites, length related
>> parameters are converted to u32, with extra ASSERT() added to ensure we
>> get correct values passed in.
>>
>> There is a special conversion needed in btrfs_remove_ordered_extent(), which
>> needs s64; using "-entry->num_bytes" from u32 directly will cause
>> underflow.
>>
>> Signed-off-by: Qu Wenruo <wqu@suse.com>
>> ---
>>   fs/btrfs/inode.c        | 11 ++++++++---
>>   fs/btrfs/ordered-data.c | 21 ++++++++++++++-------
>>   fs/btrfs/ordered-data.h | 25 ++++++++++++++-----------
>>   3 files changed, 36 insertions(+), 21 deletions(-)
>>
>> diff --git a/fs/btrfs/inode.c b/fs/btrfs/inode.c
>> index 74ee34fc820d..554effbf307e 100644
>> --- a/fs/btrfs/inode.c
>> +++ b/fs/btrfs/inode.c
>> @@ -3066,6 +3066,7 @@ void btrfs_writepage_endio_finish_ordered(struct page *page, u64 start,
>>       struct btrfs_ordered_extent *ordered_extent = NULL;
>>       struct btrfs_workqueue *wq;
>> +    ASSERT(end + 1 - start < U32_MAX);
>>       trace_btrfs_writepage_end_io_hook(page, start, end, uptodate);
>>       ClearPagePrivate2(page);
>> @@ -7969,6 +7970,7 @@ static void __endio_write_update_ordered(struct btrfs_inode *inode,
>>       else
>>           wq = fs_info->endio_write_workers;
>> +    ASSERT(bytes < U32_MAX);
>>       while (ordered_offset < offset + bytes) {
>>           last_offset = ordered_offset;
>>           if (btrfs_dec_test_first_ordered_pending(inode, &ordered,
>> @@ -8415,10 +8417,13 @@ static void btrfs_invalidatepage(struct page *page, unsigned int offset,
>>           if (TestClearPagePrivate2(page)) {
>>               spin_lock_irq(&inode->ordered_tree.lock);
>>               set_bit(BTRFS_ORDERED_TRUNCATED, &ordered->flags);
>> -            ordered->truncated_len = min(ordered->truncated_len,
>> -                             start - ordered->file_offset);
>> +            ASSERT(start - ordered->file_offset < U32_MAX);
>> +            ordered->truncated_len = min_t(u32,
>> +                        ordered->truncated_len,
>> +                        start - ordered->file_offset);
>>               spin_unlock_irq(&inode->ordered_tree.lock);
>> +            ASSERT(end - start + 1 < U32_MAX);
>>               if (btrfs_dec_test_ordered_pending(inode, &ordered,
>>                                  start,
>>                                  end - start + 1, 1)) {
>> @@ -8937,7 +8942,7 @@ void btrfs_destroy_inode(struct inode *vfs_inode)
>>               break;
>>           else {
>>               btrfs_err(root->fs_info,
>> -                  "found ordered extent %llu %llu on inode cleanup",
>> +                  "found ordered extent %llu %u on inode cleanup",
>>                     ordered->file_offset, ordered->num_bytes);
>>               btrfs_remove_ordered_extent(inode, ordered);
>>               btrfs_put_ordered_extent(ordered);
>> diff --git a/fs/btrfs/ordered-data.c b/fs/btrfs/ordered-data.c
>> index 07b0b4218791..8e6d9d906bdd 100644
>> --- a/fs/btrfs/ordered-data.c
>> +++ b/fs/btrfs/ordered-data.c
>> @@ -160,6 +160,12 @@ static int __btrfs_add_ordered_extent(struct btrfs_inode *inode, u64 file_offset
>>       struct btrfs_ordered_extent *entry;
>>       int ret;
>> +    /*
>> +     * Basic size check, all length related members should be smaller
>> +     * than U32_MAX.
>> +     */
>> +    ASSERT(num_bytes < U32_MAX && disk_num_bytes < U32_MAX);
>> +
>>       if (type == BTRFS_ORDERED_NOCOW || type == BTRFS_ORDERED_PREALLOC) {
>>           /* For nocow write, we can release the qgroup rsv right now */
>>           ret = btrfs_qgroup_free_data(inode, NULL, file_offset, num_bytes);
>> @@ -186,7 +192,7 @@ static int __btrfs_add_ordered_extent(struct btrfs_inode *inode, u64 file_offset
>>       entry->bytes_left = num_bytes;
>>       entry->inode = igrab(&inode->vfs_inode);
>>       entry->compress_type = compress_type;
>> -    entry->truncated_len = (u64)-1;
>> +    entry->truncated_len = (u32)-1;
>>       entry->qgroup_rsv = ret;
>>       entry->physical = (u64)-1;
>>       entry->disk = NULL;
>> @@ -320,7 +326,7 @@ void btrfs_add_ordered_sum(struct btrfs_ordered_extent *entry,
>>    */
>>   bool btrfs_dec_test_first_ordered_pending(struct btrfs_inode *inode,
>>                      struct btrfs_ordered_extent **finished_ret,
>> -                   u64 *file_offset, u64 io_size, int uptodate)
>> +                   u64 *file_offset, u32 io_size, int uptodate)
>>   {
>>       struct btrfs_fs_info *fs_info = inode->root->fs_info;
>>       struct btrfs_ordered_inode_tree *tree = &inode->ordered_tree;
>> @@ -330,7 +336,7 @@ bool btrfs_dec_test_first_ordered_pending(struct btrfs_inode *inode,
>>       unsigned long flags;
>>       u64 dec_end;
>>       u64 dec_start;
>> -    u64 to_dec;
>> +    u32 to_dec;
>>       spin_lock_irqsave(&tree->lock, flags);
>>       node = tree_search(tree, *file_offset);
>> @@ -352,7 +358,7 @@ bool btrfs_dec_test_first_ordered_pending(struct btrfs_inode *inode,
>>       to_dec = dec_end - dec_start;
>>       if (to_dec > entry->bytes_left) {
>>           btrfs_crit(fs_info,
>> -               "bad ordered accounting left %llu size %llu",
>> +               "bad ordered accounting left %u size %u",
>>                  entry->bytes_left, to_dec);
>>       }
>>       entry->bytes_left -= to_dec;
>> @@ -397,7 +403,7 @@ bool btrfs_dec_test_first_ordered_pending(struct btrfs_inode *inode,
>>    */
>>   bool btrfs_dec_test_ordered_pending(struct btrfs_inode *inode,
>>                       struct btrfs_ordered_extent **cached,
>> -                    u64 file_offset, u64 io_size, int uptodate)
>> +                    u64 file_offset, u32 io_size, int uptodate)
>>   {
>>       struct btrfs_ordered_inode_tree *tree = &inode->ordered_tree;
>>       struct rb_node *node;
>> @@ -422,7 +428,7 @@ bool btrfs_dec_test_ordered_pending(struct btrfs_inode *inode,
>>       if (io_size > entry->bytes_left)
>>           btrfs_crit(inode->root->fs_info,
>> -               "bad ordered accounting left %llu size %llu",
>> +               "bad ordered accounting left %u size %u",
>>                  entry->bytes_left, io_size);
>>       entry->bytes_left -= io_size;
>> @@ -495,7 +501,8 @@ void btrfs_remove_ordered_extent(struct btrfs_inode *btrfs_inode,
>>           btrfs_delalloc_release_metadata(btrfs_inode, entry->num_bytes,
>>                           false);
>> -    percpu_counter_add_batch(&fs_info->ordered_bytes, -entry->num_bytes,
>> +    percpu_counter_add_batch(&fs_info->ordered_bytes,
>> +                 -(s64)entry->num_bytes,
>>                    fs_info->delalloc_batch);
>>       tree = &btrfs_inode->ordered_tree;
>> diff --git a/fs/btrfs/ordered-data.h b/fs/btrfs/ordered-data.h
>> index e60c07f36427..6906df0c946c 100644
>> --- a/fs/btrfs/ordered-data.h
>> +++ b/fs/btrfs/ordered-data.h
>> @@ -83,13 +83,22 @@ struct btrfs_ordered_extent {
>>       /*
>>        * These fields directly correspond to the same fields in
>>        * btrfs_file_extent_item.
>> +     *
>> +     * But since ordered extents can't be larger than BTRFS_MAX_EXTENT_SIZE,
>> +     * for length related members, they can use u32.
>>        */
>>       u64 disk_bytenr;
>> -    u64 num_bytes;
>> -    u64 disk_num_bytes;
>> +    u32 num_bytes;
>> +    u32 disk_num_bytes;
>>       /* number of bytes that still need writing */
>> -    u64 bytes_left;
>> +    u32 bytes_left;
>> +
>> +    /*
>> +     * If we get truncated we need to adjust the file extent we enter for
>> +     * this ordered extent so that we do not expose stale data.
>> +     */
>> +    u32 truncated_len;
> 
> This is the actual logical length of the file, which could be well above 
> u32, so at the very least this needs to stay.

truncated_len is always <= num_bytes, and num_bytes is itself a logical
length that cannot exceed BTRFS_MAX_EXTENT_SIZE, so u32 is enough here.
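
To illustrate the bound, here is a minimal user-space sketch, not the
kernel code. The struct, the helper name sketch_truncate(), and the
128 MiB constant are assumptions made only for illustration (the real
limit is BTRFS_MAX_EXTENT_SIZE). It shows that the truncated length is an
offset relative to the start of the ordered extent, so it fits in 32 bits
even when file_offset itself needs 64 bits.

```c
#include <assert.h>
#include <stdint.h>

/* Assumed value: 128 MiB, the limit mentioned in this thread. */
#define MAX_EXTENT_SIZE ((uint64_t)128 * 1024 * 1024)

/* Sentinel meaning "not truncated", mirroring (u32)-1 in the patch. */
#define NOT_TRUNCATED UINT32_MAX

/* Hypothetical, trimmed-down stand-in for btrfs_ordered_extent. */
struct ordered_extent_sketch {
	uint64_t file_offset;   /* absolute offset in the file, needs 64 bits */
	uint32_t num_bytes;     /* extent length, bounded by MAX_EXTENT_SIZE */
	uint32_t truncated_len; /* length relative to file_offset */
};

/* Record a truncation at absolute file offset @start inside the extent. */
static void sketch_truncate(struct ordered_extent_sketch *oe, uint64_t start)
{
	uint64_t new_len;

	assert(start >= oe->file_offset);
	new_len = start - oe->file_offset;
	/* The truncation point lies inside the extent, so it fits in u32. */
	assert(new_len <= oe->num_bytes);

	/* Keep the smallest length seen so far, like min_t(u32, ...). */
	if ((uint32_t)new_len < oe->truncated_len)
		oe->truncated_len = (uint32_t)new_len;
}
```

With file_offset at 1 TiB and a truncation 4096 bytes into the extent,
truncated_len becomes 4096; the 1 TiB part never enters the u32 member.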

> 
> And I hate this patch in general.  Ok generally we are limited to 
> 128MiB, but we use u64 literally everywhere else for sizes, so using u64 
> here makes us consistent with the rest of how we address space and 
> lengths, which is more valuable to me than saving 16 bytes.  Thanks,

That's also one of my concerns, which is why I kept the parameters of
btrfs_add_ordered_extent*() as u64 and only used u32 in the internal structure.
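
A rough user-space sketch of that split, again not the kernel code: the
helper names below are hypothetical and assert() stands in for ASSERT().
Callers keep passing a 64-bit length, the value is checked once and then
stored as 32 bits. The second pair of helpers shows why the patch needs
the (s64) cast in btrfs_remove_ordered_extent(): negating an unsigned
32-bit value wraps around instead of going negative (this assumes a
32-bit unsigned int, as on the platforms the kernel supports).

```c
#include <assert.h>
#include <stdint.h>

/* Accept a 64-bit length at the interface, store it as 32 bits. */
static uint32_t sketch_narrow_len(uint64_t len)
{
	/* Same check as in the patch: the length must be below U32_MAX. */
	assert(len < UINT32_MAX);
	return (uint32_t)len;
}

/*
 * Wrong: the negation is done in unsigned 32-bit arithmetic, so the
 * result wraps to 2^32 - num_bytes and is then widened to a large
 * positive 64-bit value.
 */
static int64_t sketch_delta_wrong(uint32_t num_bytes)
{
	return -num_bytes;
}

/* Right: widen to a signed 64-bit type first, then negate. */
static int64_t sketch_delta_right(uint32_t num_bytes)
{
	return -(int64_t)num_bytes;
}
```

For num_bytes = 4096 the first helper returns 4294963200 and the second
returns -4096, which is what a signed per-CPU counter update expects.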

I totally get your point, and I'm also OK with dropping this patch if there
are more objections.

Thanks,
Qu

> 
> Josef
