From: Nikolay Borisov <nborisov@suse.com>
To: Josef Bacik <josef@toxicpanda.com>,
linux-btrfs@vger.kernel.org, kernel-team@fb.com
Subject: Re: [PATCH 4/8] btrfs: add ALLOC_CHUNK_FORCE to the flushing code
Date: Mon, 26 Nov 2018 13:28:32 +0200 [thread overview]
Message-ID: <30bf1159-9bbc-96fb-89a2-41cf6b22df7b@suse.com> (raw)
In-Reply-To: <20181121190313.24575-5-josef@toxicpanda.com>
On 21.11.18 г. 21:03 ч., Josef Bacik wrote:
> With my change to no longer take into account the global reserve for
> metadata allocation chunks we have this side-effect for mixed block
> group fs'es where we are no longer allocating enough chunks for the
> data/metadata requirements. To deal with this add a ALLOC_CHUNK_FORCE
> step to the flushing state machine. This will only get used if we've
> already made a full loop through the flushing machinery and tried
> committing the transaction. If we have then we can try and force a
> chunk allocation since we likely need it to make progress. This
> resolves the issues I was seeing with the mixed bg tests in xfstests
> with my previous patch.
>
> Signed-off-by: Josef Bacik <josef@toxicpanda.com>
Reviewed-by: Nikolay Borisov <nborisov@suse.com>
Still, my observation is that the metadata reclaim code is increasing in
complexity for rather niche use cases or the details become way too subtle.
> ---
> fs/btrfs/ctree.h | 3 ++-
> fs/btrfs/extent-tree.c | 18 +++++++++++++++++-
> include/trace/events/btrfs.h | 1 +
> 3 files changed, 20 insertions(+), 2 deletions(-)
>
> diff --git a/fs/btrfs/ctree.h b/fs/btrfs/ctree.h
> index 0c6d589c8ce4..8ccc5019172b 100644
> --- a/fs/btrfs/ctree.h
> +++ b/fs/btrfs/ctree.h
> @@ -2750,7 +2750,8 @@ enum btrfs_flush_state {
> FLUSH_DELALLOC = 5,
> FLUSH_DELALLOC_WAIT = 6,
> ALLOC_CHUNK = 7,
> - COMMIT_TRANS = 8,
> + ALLOC_CHUNK_FORCE = 8,
> + COMMIT_TRANS = 9,
> };
>
> int btrfs_alloc_data_chunk_ondemand(struct btrfs_inode *inode, u64 bytes);
> diff --git a/fs/btrfs/extent-tree.c b/fs/btrfs/extent-tree.c
> index a91b3183dcae..e6bb6ce23c84 100644
> --- a/fs/btrfs/extent-tree.c
> +++ b/fs/btrfs/extent-tree.c
> @@ -4927,6 +4927,7 @@ static void flush_space(struct btrfs_fs_info *fs_info,
> btrfs_end_transaction(trans);
> break;
> case ALLOC_CHUNK:
> + case ALLOC_CHUNK_FORCE:
> trans = btrfs_join_transaction(root);
> if (IS_ERR(trans)) {
> ret = PTR_ERR(trans);
> @@ -4934,7 +4935,9 @@ static void flush_space(struct btrfs_fs_info *fs_info,
> }
> ret = do_chunk_alloc(trans,
> btrfs_metadata_alloc_profile(fs_info),
> - CHUNK_ALLOC_NO_FORCE);
> + (state == ALLOC_CHUNK) ?
> + CHUNK_ALLOC_NO_FORCE :
> + CHUNK_ALLOC_FORCE);
> btrfs_end_transaction(trans);
> if (ret > 0 || ret == -ENOSPC)
> ret = 0;
> @@ -5070,6 +5073,19 @@ static void btrfs_async_reclaim_metadata_space(struct work_struct *work)
> commit_cycles--;
> }
>
> + /*
> + * We don't want to force a chunk allocation until we've tried
> + * pretty hard to reclaim space. Think of the case where we
> + * free'd up a bunch of space and so have a lot of pinned space
> + * to reclaim. We would rather use that than possibly create a
> + * underutilized metadata chunk. So if this is our first run
> + * through the flushing state machine skip ALLOC_CHUNK_FORCE and
> + * commit the transaction. If nothing has changed the next go
> + * around then we can force a chunk allocation.
> + */
> + if (flush_state == ALLOC_CHUNK_FORCE && !commit_cycles)
> + flush_state++;
> +
> if (flush_state > COMMIT_TRANS) {
> commit_cycles++;
> if (commit_cycles > 2) {
> diff --git a/include/trace/events/btrfs.h b/include/trace/events/btrfs.h
> index 63d1f9d8b8c7..dd0e6f8d6b6e 100644
> --- a/include/trace/events/btrfs.h
> +++ b/include/trace/events/btrfs.h
> @@ -1051,6 +1051,7 @@ TRACE_EVENT(btrfs_trigger_flush,
> { FLUSH_DELAYED_REFS_NR, "FLUSH_DELAYED_REFS_NR"}, \
> { FLUSH_DELAYED_REFS, "FLUSH_ELAYED_REFS"}, \
> { ALLOC_CHUNK, "ALLOC_CHUNK"}, \
> + { ALLOC_CHUNK_FORCE, "ALLOC_CHUNK_FORCE"}, \
> { COMMIT_TRANS, "COMMIT_TRANS"})
>
> TRACE_EVENT(btrfs_flush_space,
>
next prev parent reply other threads:[~2018-11-26 11:28 UTC|newest]
Thread overview: 22+ messages / expand[flat|nested] mbox.gz Atom feed top
2018-11-21 19:03 [PATCH 0/8] Enospc cleanups and fixes Josef Bacik
2018-11-21 19:03 ` [PATCH 1/8] btrfs: check if free bgs for commit Josef Bacik
2018-11-26 10:45 ` Nikolay Borisov
2018-11-21 19:03 ` [PATCH 2/8] btrfs: dump block_rsv whe dumping space info Josef Bacik
2018-11-21 19:03 ` [PATCH 3/8] btrfs: don't use global rsv for chunk allocation Josef Bacik
2018-11-26 11:25 ` Nikolay Borisov
2018-11-21 19:03 ` [PATCH 4/8] btrfs: add ALLOC_CHUNK_FORCE to the flushing code Josef Bacik
2018-11-26 11:28 ` Nikolay Borisov [this message]
2018-11-21 19:03 ` [PATCH 5/8] btrfs: don't enospc all tickets on flush failure Josef Bacik
2018-11-26 12:25 ` Nikolay Borisov
2018-11-27 19:46 ` Josef Bacik
2018-11-28 8:11 ` Nikolay Borisov
2018-11-21 19:03 ` [PATCH 6/8] btrfs: loop in inode_rsv_refill Josef Bacik
2018-11-21 19:03 ` [PATCH 7/8] btrfs: be more explicit about allowed flush states Josef Bacik
2018-11-26 12:41 ` Nikolay Borisov
2018-11-26 12:45 ` Nikolay Borisov
2018-11-21 19:03 ` [PATCH 8/8] btrfs: reserve extra space during evict() Josef Bacik
2018-12-03 15:24 [PATCH 0/8][V2] Enospc cleanups and fixeS Josef Bacik
2018-12-03 15:24 ` [PATCH 4/8] btrfs: add ALLOC_CHUNK_FORCE to the flushing code Josef Bacik
2018-12-11 10:08 ` Nikolay Borisov
2018-12-11 16:47 ` David Sterba
2018-12-11 16:51 ` Nikolay Borisov
2018-12-11 19:04 ` David Sterba
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=30bf1159-9bbc-96fb-89a2-41cf6b22df7b@suse.com \
--to=nborisov@suse.com \
--cc=josef@toxicpanda.com \
--cc=kernel-team@fb.com \
--cc=linux-btrfs@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).