From: Jan Kara <jack@suse.cz>
To: Amir Goldstein <amir73il@gmail.com>
Cc: Jan Kara <jack@suse.cz>,
linux-fsdevel <linux-fsdevel@vger.kernel.org>,
mhocko@suse.cz, Al Viro <viro@zeniv.linux.org.uk>,
Marko Rauhamaa <marko.rauhamaa@f-secure.com>
Subject: Re: fanotify permission events on virtual filesystem
Date: Mon, 1 Apr 2019 19:26:22 +0200 [thread overview]
Message-ID: <20190401172622.GB19542@quack2.suse.cz> (raw)
In-Reply-To: <CAOQ4uxjDq2ZZFwdirjrU9JG32MzQukeFr9-Ht_oQvYFo7SKQ-A@mail.gmail.com>
On Wed 20-03-19 17:02:54, Amir Goldstein wrote:
> On Wed, Mar 20, 2019 at 4:30 PM Jan Kara <jack@suse.cz> wrote:
> >
> > On Wed 20-03-19 15:46:20, Amir Goldstein wrote:
> > > On Wed, Mar 20, 2019 at 3:16 PM Jan Kara <jack@suse.cz> wrote:
> > > > recently, one of our customers has reported a deadlock with fanotify. The
> > > > analysis has shown that a process has put (likely by mistake) FAN_OPEN_PERM
> > > > mark on /proc and / filesystem. That resulted in a deadlock like follows:
> > > >
> > > > process 1: process 2: process 3:
> > > > open("/proc/process 2/maps")
> > > > - blocks waiting for response to
> > > > FAN_OPEN_PERM event
> > > >
> > > > exec(2)
> > > > __do_execve_file()
> > > > - grabs current->signal->cred_guard_mutex
> > > > do_open_execat()
> > > > - blocks waiting for response to
> > > > FAN_OPEN_PERM event
> > > >
> > > > reads fanotify event
> > > > generated by process 1
> > > > create_fd()
> > > > dentry_open()
> > > > proc_maps_open()
> > > > blocks on
> > > > cred_guard_mutex of process 2.
> > > >
> > > > Now this is not the only case where you can setup fanotify permissions
> > > > events so that your listener deadlocks but I'd argue that this case is
> > > > especially nasty and it is unrealistic to expect from userspace that it
> > > > would be able to implement fanotify listener in such a way that is
> > > > deadlock-free for proc filesystem since the lock dependencies there are
> > > > very different. So how about we just forbid placing marks with fanotify
> > > > permission events on proc? Any other virtual filesystem we should exclude?
> > > >
> > >
> > > I bet if we forbid placing marks on /proc, some apps would break.
> >
> > Well, I didn't mean all marks, just the permission ones. I'm not sure there
> > are apps that place permission events on /proc...
> >
>
> Maybe not intentionally.
> I once tested a few fanotify based AntiVirus solutions.
> In some of them, setting an "Exclude path" on some mount point
> would cause mark to not be set on that path, but for one in particular,
> the mark was still being set on the mount so path pattern filtering was
> done after receiving the events.
> I did not check whether /proc was blacklisted out of the box or if it
> could be marked/excluded from scan.
> IMO, assuming that all AntiVirus vendors blacklist all virtual filesystems
> is an assumption that we need to validate.
> [CC Marko from F-Secure for commenting on the above.]
>
> > > I always though that allowing O_PATH in event_f_flags can make
> > > sense for some apps.
> > >
> > > What if instead of forbiding marks of /proc, we would force those
> > > marks to use O_PATH for fd creation. Some of the functionality
> > > will remain. Apps are less likely to break. Deadlocks will be less
> > > likely, although maybe still possible.
> >
> > Yes, that's another option. But if this is automatic, it is going to be
> > confusing to potential users - report O_PATH fd if getting normal fd is
> > dangerous isn't great. And also the deadlocks are there only for permission
> > events so there's no strong reason to restrict normal ones.
> >
>
> Of course, we can do this hack only for reading permission events
> for virtual filesystems. The question is what would be more surprising
> to existing apps:
> - marks for permission events no longer working on /proc
> - fd from permission events no longer readable from /proc
> - something completely different
I think that option 1) is at least very explicit failure and given what
Marko wrote I'm willing to give that a try. If people start complaining,
we'll probably have to revert and try either reporting O_PATH fd or even
FAN_NOFD. It's a bit of a gamble I agree but current status quo isn't good
either.
Honza
--
Jan Kara <jack@suse.com>
SUSE Labs, CR
prev parent reply other threads:[~2019-04-01 17:52 UTC|newest]
Thread overview: 6+ messages / expand[flat|nested] mbox.gz Atom feed top
2019-03-20 13:16 fanotify permission events on virtual filesystem Jan Kara
2019-03-20 13:46 ` Amir Goldstein
2019-03-20 14:30 ` Jan Kara
2019-03-20 15:02 ` Amir Goldstein
2019-03-21 8:36 ` Marko Rauhamaa
2019-04-01 17:26 ` Jan Kara [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20190401172622.GB19542@quack2.suse.cz \
--to=jack@suse.cz \
--cc=amir73il@gmail.com \
--cc=linux-fsdevel@vger.kernel.org \
--cc=marko.rauhamaa@f-secure.com \
--cc=mhocko@suse.cz \
--cc=viro@zeniv.linux.org.uk \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).