linux-btrfs.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: David Sterba <dsterba@suse.cz>
To: Nikolay Borisov <nborisov@suse.com>
Cc: Josef Bacik <josef@toxicpanda.com>,
	linux-btrfs@vger.kernel.org, kernel-team@fb.com
Subject: Re: [PATCH 0/5] Deal with a few ENOSPC corner cases
Date: Wed, 11 Mar 2020 02:45:34 +0100	[thread overview]
Message-ID: <20200311014534.GG12659@twin.jikos.cz> (raw)
In-Reply-To: <0c2c685d-ce31-99a3-9d12-78725da73d63@suse.com>

On Tue, Mar 10, 2020 at 07:28:03PM +0200, Nikolay Borisov wrote:
> On 9.03.20 г. 22:23 ч., Josef Bacik wrote:
> > Nikolay has been digging into a failure of generic/320 on ppc64.  This has
> > shaken out a variety of issues, and he's done a good job at running all of the
> > weird corners down and then testing my ideas to get them all fixed.  This is the
> > series that has survived the longest, so we're declaring victory.
> > 
> > First there is the global reserve stealing logic.  The way unlink works is it
> > attempts to start a transaction with a normal reservation amount, and if this
> > fails with ENOSPC we fall back to stealing from the global reserve.  This is
> > problematic because of all the same reasons we had with previous iterations of
> > the ENOSPC handling, thundering herd.  We get a bunch of failures all at once,
> > everybody tries to allocate from the global reserve, some win and some lose, we
> > get an ENSOPC.
> > 
> > To fix this we need to integrate this logic into the normal ENOSPC
> > infrastructure.  The idea is simple, we add a new flushing state that indicates
> > we are allowed to steal from the global reserve.  We still go through all of the
> > normal flushing work, and at the moment we begin to fail all the tickets we try
> > to satisfy any tickets that are allowed to steal by stealing from the global
> > reserve.  If this works we start the flushing system over again just like we
> > would with a normal ticket satisfaction.  This serializes our global reserve
> > stealing, so we don't have the thundering herd problem
> > 
> > This isn't the only problem however.  Nikolay also noticed that we would
> > sometimes have huge amounts of space in the trans block rsv and we would ENOSPC
> > out.  This is because the may_commit_transaction() logic didn't take into
> > account the space that would be reclaimed by all of the outstanding trans
> > handles being required to stop in order to commit the transaction.
> > 
> > Another corner here was that priority tickets could race in and make
> > may_commit_transaction() think that it had no work left to do, and thus not
> > commit the transaction.
> > 
> > Those fixes all address the failures that Nikolay was seeing.  The last two
> > patches are just cleanups around how we handle priority tickets.  We shouldn't
> > even be serializing priority tickets behind normal tickets, only behind other
> > priority tickets.  And finally there would be a small window where priority
> > tickets would fail out if there were multiple priority tickets and one of them
> > failed.  This is addressed by the previous patch.
> > 
> > Nikolay has put these through many iterations of generic/320, and so far it
> > hasn't failed.  Thanks,
> > 
> > Josef
> > 
> 
> This patchset causes regressions on following tests:
> 
> btrfs/132 btrfs/170 btrfs/177 generic/102 generic/103 generic/170
> generic/172 generic/275 generic/299 generic/464 generic/551
> 
> Please don't merge for now.

Thanks for letting me know, space handling fixes could always use longer
period of testing. At this point we're getting close to pre merge window
freeze so I'd be more nervous merging it now.

  reply	other threads:[~2020-03-11  1:46 UTC|newest]

Thread overview: 18+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2020-03-09 20:23 [PATCH 0/5] Deal with a few ENOSPC corner cases Josef Bacik
2020-03-09 20:23 ` [PATCH 1/5] btrfs: Improve global reserve stealing logic Josef Bacik
2020-03-09 20:48   ` Nikolay Borisov
2020-03-10 14:27   ` Nikolay Borisov
2020-03-09 20:23 ` [PATCH 2/5] btrfs: Account for trans_block_rsv in may_commit_transaction Josef Bacik
2020-03-09 20:44   ` Nikolay Borisov
2020-03-09 20:23 ` [PATCH 3/5] btrfs: only take normal tickets into account " Josef Bacik
2020-03-09 20:51   ` Nikolay Borisov
2020-03-09 23:13   ` Nikolay Borisov
2020-03-10 10:27   ` Nikolay Borisov
2020-03-09 20:23 ` [PATCH 4/5] btrfs: only check priority tickets for priority flushing Josef Bacik
2020-03-10 10:30   ` Nikolay Borisov
2020-03-09 20:23 ` [PATCH 5/5] btrfs: run btrfs_try_granting_tickets if a priority ticket fails Josef Bacik
2020-03-10 10:32   ` Nikolay Borisov
2020-03-13 19:54     ` Josef Bacik
2020-03-10 17:28 ` [PATCH 0/5] Deal with a few ENOSPC corner cases Nikolay Borisov
2020-03-11  1:45   ` David Sterba [this message]
2020-03-13 12:37     ` Nikolay Borisov

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20200311014534.GG12659@twin.jikos.cz \
    --to=dsterba@suse.cz \
    --cc=josef@toxicpanda.com \
    --cc=kernel-team@fb.com \
    --cc=linux-btrfs@vger.kernel.org \
    --cc=nborisov@suse.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).