From: Jan Kara <jack@suse.cz>
To: Joseph Qi <joseph.qi@linux.alibaba.com>
Cc: Jan Kara <jack@suse.cz>, ocfs2-devel@oss.oracle.com
Subject: Re: [Ocfs2-devel] Possible fs corruption when hole punch races with other ops
Date: Thu, 22 Apr 2021 12:44:25 +0200 [thread overview]
Message-ID: <20210422104425.GC26221@quack2.suse.cz> (raw)
In-Reply-To: <c4d119be-e263-366c-c2c2-6c58ee852977@linux.alibaba.com>
Hello!
On Thu 22-04-21 11:22:07, Joseph Qi wrote:
> Checked the code flow, it seems the race you worried truly exists.
> We have to take ip_alloc_sem before calling into ocfs2_get_block().
OK, that's what I assumed but this won't be easy. Because ocfs2_writepage()
is called with a page locked but ocfs2_remove_inode_range() gets called
under ip_alloc_sem and locks pages in ocfs2_truncate_cluster_pages() ->
truncate_inode_pages_range(). Which creates ABBA deadlock. So you'll
probably need to come up with a similar dance like in ocfs2_readpage().
But after fixing this, ip_alloc_sem should provide you with enough
protection so that ocfs2 isn't prone to races between hole punch and
readahead / fault my work will be mostly irrelevant for OCFS2.
Thanks for confirmation!
Honza
> On 4/22/21 12:29 AM, Jan Kara wrote:
> > Hello,
> >
> > I'm unifying protection various filesystems use to protect hole punch
> > operations from racing with other operations (like readahead, page fault,
> > writepage etc.). I was looking into OCFS2 and I think it is prone to a
> > following race which can possibly lead to filesystem corruption. But maybe
> > I miss something so that's why I'm writing here. The scenario I'm concerned
> > about is:
> >
> > CPU1 CPU2
> > ocfs2_remove_inode_range() ocfs2_writepage()
> > ... block_write_full_page()
> > ocfs2_remove_btree_range() ocfs2_extent_map_get_blocks()
> >
> > Now ocfs2_extent_map_get_blocks() runs without protection of ip_alloc_sem
> > AFAICT and so both these operations can be modifying extent map at the same
> > time? What am I missing?
> >
> > Honza
> >
--
Jan Kara <jack@suse.com>
SUSE Labs, CR
_______________________________________________
Ocfs2-devel mailing list
Ocfs2-devel@oss.oracle.com
https://oss.oracle.com/mailman/listinfo/ocfs2-devel
next prev parent reply other threads:[~2021-04-22 10:45 UTC|newest]
Thread overview: 7+ messages / expand[flat|nested] mbox.gz Atom feed top
2021-04-21 16:29 [Ocfs2-devel] Possible fs corruption when hole punch races with other ops Jan Kara
2021-04-22 3:22 ` Joseph Qi
2021-04-22 10:44 ` Jan Kara [this message]
2021-04-22 15:56 ` Wengang Wang
2021-04-22 16:06 ` Darrick J. Wong
2021-04-22 16:36 ` Wengang Wang
2021-04-22 16:46 ` Darrick J. Wong
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20210422104425.GC26221@quack2.suse.cz \
--to=jack@suse.cz \
--cc=joseph.qi@linux.alibaba.com \
--cc=ocfs2-devel@oss.oracle.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).