All of lore.kernel.org
 help / color / mirror / Atom feed
From: Daire Byrne <daire@dneg.com>
To: NeilBrown <neilb@suse.de>
Cc: "J. Bruce Fields" <bfields@fieldses.org>,
	Patrick Goetz <pgoetz@math.utexas.edu>,
	linux-nfs <linux-nfs@vger.kernel.org>
Subject: Re: parallel file create rates (+high latency)
Date: Tue, 26 Apr 2022 13:29:20 +0100	[thread overview]
Message-ID: <CAPt2mGPVWuut=ESWicSw0Ser2PGTeuyb+ACL41N6p_FAAuOUwg@mail.gmail.com> (raw)
In-Reply-To: <165093700757.1648.16863178337904278508@noble.neil.brown.name>

On Tue, 26 Apr 2022 at 02:36, NeilBrown <neilb@suse.de> wrote:
>
> On Tue, 26 Apr 2022, Daire Byrne wrote:
> >
> > I'll stare at fs/nfsd/vfs.c for a bit but I probably lack the
> > expertise to make it work.
>
> Staring at code is good for the soul ....  but I'll try to schedule time
> to work on this patch again - make it work from nfsd and also make it
> handle rename.

Yea, I stared at it for quite a while this morning and no amount of
coffee was going to help me figure out how best to proceed.

If you are able to update this for nfsd then I'll be eternally
grateful and do my best to test it under load in an effort to get it
all merged.

The community here has been so good to us over the last couple of
years and it is very much appreciated. It has helped us deliver (oscar
winning) movies!

To give some brief context as to why this is useful to us (for anyone
interested), we utilise NFS re-export extensively to run our batch
jobs (movie frame renders) in various remote DCs. In combination with
fscache and long term attribute caching, this works great for exposing
our (read often) onprem storage to the remote DCs (with varying
latencies).

But batch jobs have a tendency to start related tasks on many clients
at the same time with their results or logs being written to big
common directories. And by writing through the re-export server, we
often hit this limitation with parallel file creates which slows
everything down. We have tried to avoid large directories where
possible, but it's hard to catch and fix all the cases.

Using an NFS re-export server works 95% of the time for our workloads
(after much help from this community), so we are just trying to pick
away at the last 5% of edge cases. One of the disadvantages of the
re-export server in the middle, is that we lose some of the natural
parallelism that directly connected clients would otherwise have. And
this becomes very noticeable once the latency goes above 20ms.

Cheers,

Daire


> > It's also not entirely clear that this parallel creates RFC patch will
> > ever make it into mainline?
>
> I hope it will eventually, but I have no idea when that might be.
>
> Thanks for your continued interest,
> NeilBrown

  reply	other threads:[~2022-04-26 12:30 UTC|newest]

Thread overview: 36+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2022-01-23 23:53 parallel file create rates (+high latency) Daire Byrne
2022-01-24 13:52 ` Daire Byrne
2022-01-24 19:37 ` J. Bruce Fields
2022-01-24 20:10   ` Daire Byrne
2022-01-24 20:50     ` J. Bruce Fields
2022-01-25 12:52       ` Daire Byrne
2022-01-25 13:59         ` J. Bruce Fields
2022-01-25 15:24           ` Daire Byrne
2022-01-25 15:30           ` Chuck Lever III
2022-01-25 21:50             ` Patrick Goetz
2022-01-25 21:58               ` Chuck Lever III
2022-01-25 21:59               ` Bruce Fields
2022-01-25 22:11                 ` Patrick Goetz
2022-01-25 22:41                   ` Daire Byrne
2022-01-25 23:01                     ` Patrick Goetz
2022-01-25 23:25                       ` Daire Byrne
2022-01-25 21:15   ` Patrick Goetz
2022-01-25 21:20     ` J. Bruce Fields
2022-01-26  0:02       ` NeilBrown
2022-01-26  0:28         ` Daire Byrne
2022-01-26  2:57         ` J. Bruce Fields
2022-02-08 18:48           ` Daire Byrne
2022-02-10 18:19             ` Daire Byrne
2022-02-11 15:59               ` J. Bruce Fields
2022-02-17 19:50                 ` Daire Byrne
2022-02-18  7:46                   ` NeilBrown
2022-02-21 13:59                     ` Daire Byrne
2022-04-25 13:00                       ` Daire Byrne
2022-04-25 13:22                         ` J. Bruce Fields
2022-04-25 15:24                           ` Daire Byrne
2022-04-25 16:02                             ` J. Bruce Fields
2022-04-25 16:47                               ` Daire Byrne
2022-04-26  1:36                                 ` NeilBrown
2022-04-26 12:29                                   ` Daire Byrne [this message]
2022-04-28  5:46                                     ` NeilBrown
2022-04-29  7:55                                       ` Daire Byrne

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to='CAPt2mGPVWuut=ESWicSw0Ser2PGTeuyb+ACL41N6p_FAAuOUwg@mail.gmail.com' \
    --to=daire@dneg.com \
    --cc=bfields@fieldses.org \
    --cc=linux-nfs@vger.kernel.org \
    --cc=neilb@suse.de \
    --cc=pgoetz@math.utexas.edu \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.