linux-fsdevel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Jeff Layton <jlayton@kernel.org>
To: Benjamin Coddington <bcodding@redhat.com>,
	Trond Myklebust <trondmy@hammerspace.com>
Cc: Thorsten Leemhuis <regressions@leemhuis.info>,
	Hugh Dickins <hughd@google.com>,
	Charles Edward Lever <chuck.lever@oracle.com>,
	Linux NFS Mailing List <linux-nfs@vger.kernel.org>,
	Anna Schumaker <anna@kernel.org>,
	Alexander Viro <viro@zeniv.linux.org.uk>,
	Amir Goldstein <amir73il@gmail.com>,
	linux-fsdevel@vger.kernel.org, linux-mm@kvack.org,
	Linux kernel regressions list <regressions@lists.linux.dev>
Subject: Re: git regression failures with v6.2-rc NFS client
Date: Sun, 05 Feb 2023 06:24:25 -0500	[thread overview]
Message-ID: <8f122cb0304632b391759788fe1f72ea1bab1ba0.camel@kernel.org> (raw)
In-Reply-To: <05BEEF62-46DF-4FAC-99D4-4589C294F93A@redhat.com>

On Sat, 2023-02-04 at 15:44 -0500, Benjamin Coddington wrote:
> On 4 Feb 2023, at 11:52, Trond Myklebust wrote:
> > On Feb 4, 2023, at 08:15, Benjamin Coddington <bcodding@redhat.com> wrote:
> > > Ah, thanks for explaining that.
> > > 
> > > I'd like to summarize and quantify this problem one last time for folks that
> > > don't want to read everything.  If an application wants to remove all files
> > > and the parent directory, and uses this pattern to do it:
> > > 
> > > opendir
> > > while (getdents)
> > >    unlink dents
> > > closedir
> > > rmdir
> > > 
> > > Before this commit, that would work with up to 126 dentries on NFS from
> > > tmpfs export.  If the directory had 127 or more, the rmdir would fail with
> > > ENOTEMPTY.
> > 
> > For all sizes of filenames, or just the particular set that was chosen
> > here? What about the choice of rsize? Both these values affect how many
> > entries glibc can cache before it has to issue another getdents() call
> > into the kernel. For the record, this is what glibc does in the opendir()
> > code in order to choose a buffer size for the getdents syscalls:
> > 
> >   /* The st_blksize value of the directory is used as a hint for the
> >      size of the buffer which receives struct dirent values from the
> >      kernel.  st_blksize is limited to max_buffer_size, in case the
> >      file system provides a bogus value.  */
> >   enum { max_buffer_size = 1048576 };
> > 
> >   enum { allocation_size = 32768 };
> >   _Static_assert (allocation_size >= sizeof (struct dirent64),
> >                   "allocation_size < sizeof (struct dirent64)");
> > 
> >   /* Increase allocation if requested, but not if the value appears to
> >      be bogus.  It will be between 32Kb and 1Mb.  */
> >   size_t allocation = MIN (MAX ((size_t) statp->st_blksize, (size_t)
> >                                 allocation_size), (size_t) max_buffer_size);
> > 
> >   DIR *dirp = (DIR *) malloc (sizeof (DIR) + allocation);
> 
> The behavioral complexity is even higher with glibc in the mix, but both the
> test that Chuck's using and the reproducer I've been making claims about
> use SYS_getdents directly.  I'm using a static 4k buffer size which is big
> enough to fit enough entries to prime the heuristic for a single call to
> getdents() whether or not we return early at 17 or 126.
> 
> > > After this commit, it only works with up to 17 dentries.
> > > 
> > > The argument that this is making things worse takes the position that there
> > > are more directories in the universe with >17 dentries that want to be
> > > cleaned up by this "saw off the branch you're sitting on" pattern than
> > > directories with >127.  And I guess that's true if Chuck runs that testing
> > > setup enough.  :)
> > > 
> > > We can change the optimization in the commit from
> > > NFS_READDIR_CACHE_MISS_THRESHOLD + 1
> > > to
> > > nfs_readdir_array_maxentries + 1
> > > 
> > > This would make the regression disappear, and would also keep most of the
> > > optimization.
> > > 
> > > Ben
> > > 
> > 
> > So in other words the suggestion is to optimise the number of readdir
> > records that we return from NFS to whatever value that papers over the
> > known telldir()/seekdir() tmpfs bug that is re-revealed by this particular
> > test when run under these particular conditions?
> 
> Yes.  It's a terrible suggestion.  Its only merit may be that it meets the
> letter of the no regressions law.  I hate it, and I after I started popping
> out patches that do it I've found they've all made the behavior far more
> complex due to the way we dynamically optimize dtsize.
> 
> > Anyone who tries to use tmpfs with a different number of files, different
> > file name lengths, or different mount options is still SOL because that’s
> > not a “regression"?
> 
> Right. :P
> 
> Ben
> 

I may be missing something, but would it be possible to move to a more
stable scheme for readdir cookies for tmpfs?

It is tmpfs, so we don't need to worry about persisting these values
across reboots. Could we (e.g.) hash dentry pointers to generate
cookies?
-- 
Jeff Layton <jlayton@kernel.org>

  reply	other threads:[~2023-02-05 11:24 UTC|newest]

Thread overview: 10+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
     [not found] <9A4A5673-691D-47EC-BC44-C43BE7E50A48@oracle.com>
     [not found] ` <D0404F55-2692-4DB6-8DD6-CAC004331AC1@redhat.com>
     [not found]   ` <5FF4061F-108C-4555-A32D-DDBFA80EE4E7@redhat.com>
     [not found]     ` <F1833EA0-263F-46DF-8001-747A871E5757@redhat.com>
     [not found]       ` <B90C62F2-1D3A-40E0-8E33-8C349C6FFD3D@oracle.com>
     [not found]         ` <44CB1E86-60E0-4CF0-9FD4-BB7E446542B7@redhat.com>
     [not found]           ` <1AAC6854-2591-4B21-952A-BC58180B4091@oracle.com>
     [not found]             ` <41813D21-95C8-44E3-BB97-1E9C03CE7FE5@redhat.com>
     [not found]               ` <79261B77-35D0-4E36-AA29-C7BF9FB734CC@oracle.com>
     [not found]                 ` <104B6879-5223-485F-B099-767F741EB15B@redhat.com>
     [not found]                   ` <966AEC32-A7C9-4B97-A4F7-098AF6EF0067@oracle.com>
     [not found]                     ` <545B5AB7-93A6-496E-924E-AE882BF57B72@hammerspace.com>
     [not found]                       ` <FA8392E6-DAFC-4462-BDAE-893955F9E1A4@oracle.com>
2023-02-03 23:53                         ` git regression failures with v6.2-rc NFS client Hugh Dickins
2023-02-04  0:07                           ` Trond Myklebust
2023-02-04  0:15                             ` Hugh Dickins
2023-02-04  0:59                               ` Trond Myklebust
2023-02-04 11:07                                 ` Thorsten Leemhuis
2023-02-04 13:15                                   ` Benjamin Coddington
2023-02-04 16:52                                     ` Trond Myklebust
2023-02-04 20:44                                       ` Benjamin Coddington
2023-02-05 11:24                                         ` Jeff Layton [this message]
2023-02-05 16:11                                           ` Benjamin Coddington

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=8f122cb0304632b391759788fe1f72ea1bab1ba0.camel@kernel.org \
    --to=jlayton@kernel.org \
    --cc=amir73il@gmail.com \
    --cc=anna@kernel.org \
    --cc=bcodding@redhat.com \
    --cc=chuck.lever@oracle.com \
    --cc=hughd@google.com \
    --cc=linux-fsdevel@vger.kernel.org \
    --cc=linux-mm@kvack.org \
    --cc=linux-nfs@vger.kernel.org \
    --cc=regressions@leemhuis.info \
    --cc=regressions@lists.linux.dev \
    --cc=trondmy@hammerspace.com \
    --cc=viro@zeniv.linux.org.uk \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).