All of lore.kernel.org
 help / color / mirror / Atom feed
From: Jan Kara <jack@suse.cz>
To: Matthew Bobrowski <repnop@google.com>
Cc: Jan Kara <jack@suse.cz>,
	amir73il@gmail.com, christian.brauner@ubuntu.com,
	linux-fsdevel@vger.kernel.org, linux-api@vger.kernel.org
Subject: Re: [PATCH 0/5] Add pidfd support to the fanotify API
Date: Fri, 21 May 2021 12:40:56 +0200	[thread overview]
Message-ID: <20210521104056.GG18952@quack2.suse.cz> (raw)
In-Reply-To: <YKeIR+LiSXqUHL8Q@google.com>

On Fri 21-05-21 20:15:35, Matthew Bobrowski wrote:
> Hey Jan!
> 
> On Thu, May 20, 2021 at 03:55:27PM +0200, Jan Kara wrote:
> > On Thu 20-05-21 12:09:45, Matthew Bobrowski wrote:
> > > Hey Jan/Amir/Christian,
> > > 
> > > This is the updated patch series for adding pidfd support to the
> > > fanotify API. It incorporates all the suggestions that had come out of
> > > the initial RFC patch series [0].
> > > 
> > > The main difference with this patch series is that FAN_REPORT_PIDFD
> > > results in an additional info record object supplied alongside the
> > > generic event metadata object instead of overloading metadata->pid. If
> > > any of the fid flavoured init flags are specified, then the pidfd info
> > > record object will follow any fid info record objects.
> > > 
> > > [0] https://www.spinics.net/lists/linux-fsdevel/msg193296.html
> > 
> > Overall the series looks fine to me - modulo the problems Christian & Amir
> > found. Do you have any tests for this? Preferably for LTP so that we can
> > extend the coverage there?
> 
> Cool and thanks for glancing over this series.
> 
> I've written some simple programs to verify this functionality works in
> FID and non-FID modes. I definitely plan on writing LTP tests,
> although it's something I'll do once we've agreed on the approach and
> I've received an ACK from yourself, Amir and Christian. This series
> passes all current LTP regressions. Also, I guess I'll need to write
> some patches for man-pages given this is an ABI change.

Yes, manpage update will be necessary as well.

> There's one thing that I'd like to mention, and it's something in
> regards to the overall approach we've taken that I'm not particularly
> happy about and I'd like to hear all your thoughts. Basically, with
> this approach the pidfd creation is done only once an event has been
> queued and the notification worker wakes up and picks up the event
> from the queue processes it. There's a subtle latency introduced when
> taking such an approach which at times leads to pidfd creation
> failures. As in, by the time pidfd_create() is called the struct pid
> has already been reaped, which then results in FAN_NOPIDFD being
> returned in the pidfd info record.
> 
> Having said that, I'm wondering what the thoughts are on doing pidfd
> creation earlier on i.e. in the event allocation stages? This way, the
> struct pid is pinned earlier on and rather than FAN_NOPIDFD being
> returned in the pidfd info record because the struct pid has been
> already reaped, userspace application will atleast receive a valid
> pidfd which can be used to check whether the process still exists or
> not. I think it'll just set the expectation better from an API
> perspective.

Yes, there's this race. OTOH if FAN_NOPIDFD is returned, the listener can
be sure the original process doesn't exist anymore. So is it useful to
still receive pidfd of the dead process? Also opening pidfd in the context
of event generation is problematic for two reasons:

1) Technically, the context under which events are generated can be rather
constrained (various locks held etc.). Adding relatively complex operations
such as pidfd creation is going to introduce strange lock dependencies,
possibly deadlocks.

2) Doing pidfd generation in the context of the process generating event is
problematic - you don't know in which fd_table the fd will live. Also that
process is unfairly penalized (performance wise) because someone else is
listening. We try to keep overhead of event generation as low as possible
for this reason.

								Honza

> 
> /M
-- 
Jan Kara <jack@suse.com>
SUSE Labs, CR

  reply	other threads:[~2021-05-21 10:41 UTC|newest]

Thread overview: 38+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2021-05-20  2:09 [PATCH 0/5] Add pidfd support to the fanotify API Matthew Bobrowski
2021-05-20  2:10 ` [PATCH 1/5] kernel/pid.c: remove static qualifier from pidfd_create() Matthew Bobrowski
2021-05-20  2:10 ` [PATCH 2/5] kernel/pid.c: implement checks on parameters passed to pidfd_create() Matthew Bobrowski
2021-05-20  2:11 ` [PATCH 3/5] fanotify_user.c: minor cosmetic adjustments to fid labels Matthew Bobrowski
2021-05-20  2:11 ` [PATCH 4/5] fanotify/fanotify_user.c: introduce a generic info record copying function Matthew Bobrowski
2021-05-20 13:59   ` Amir Goldstein
2021-05-21  9:26     ` Matthew Bobrowski
2021-05-20  2:11 ` [PATCH 5/5] fanotify: Add pidfd info record support to the fanotify API Matthew Bobrowski
2021-05-20  8:17   ` Christian Brauner
2021-05-20 13:43     ` Amir Goldstein
2021-05-21  9:21       ` Matthew Bobrowski
2021-05-21  9:41         ` Amir Goldstein
2021-05-21 10:24           ` Jan Kara
2021-05-21 11:10             ` Amir Goldstein
2021-05-21 13:19               ` Jan Kara
2021-05-21 13:52                 ` Amir Goldstein
2021-05-21 15:14                   ` Jan Kara
2021-05-22  0:41                     ` Matthew Bobrowski
2021-05-22  9:01                       ` Amir Goldstein
2021-05-20 13:55 ` [PATCH 0/5] Add pidfd " Jan Kara
2021-05-21 10:15   ` Matthew Bobrowski
2021-05-21 10:40     ` Jan Kara [this message]
2021-05-21 23:32       ` Matthew Bobrowski
2021-05-24  8:47         ` Jan Kara
2021-05-25 10:31           ` Christian Brauner
2021-05-25 23:20             ` Matthew Bobrowski
2021-05-26 18:05               ` Christian Brauner
2021-05-26 22:54                 ` Matthew Bobrowski
2021-06-01 11:03                 ` Matthew Bobrowski
2021-06-01 11:46                   ` Christian Brauner
2021-06-02  6:30                     ` Matthew Bobrowski
2021-06-02  7:18                       ` Amir Goldstein
2021-06-02  8:48                         ` Christian Brauner
2021-06-02 10:56                           ` Matthew Bobrowski
2021-06-02 12:46                             ` Christian Brauner
2021-06-02 10:43                         ` Matthew Bobrowski
2021-06-02 12:18                           ` Amir Goldstein
2021-06-03  1:24                             ` Matthew Bobrowski

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20210521104056.GG18952@quack2.suse.cz \
    --to=jack@suse.cz \
    --cc=amir73il@gmail.com \
    --cc=christian.brauner@ubuntu.com \
    --cc=linux-api@vger.kernel.org \
    --cc=linux-fsdevel@vger.kernel.org \
    --cc=repnop@google.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.