* Re: kernel BUG in ext4_writepages [not found] <000000000000c3a53d05de992007@google.com> @ 2022-11-07 12:37 ` syzbot 2022-11-07 14:08 ` Jan Kara 0 siblings, 1 reply; 8+ messages in thread From: syzbot @ 2022-11-07 12:37 UTC (permalink / raw) To: gregkh, jack, lczerner, linux-ext4, linux-fsdevel, linux-kernel, stable-commits, stable, stable, syzkaller-android-bugs, tadeusz.struk, tytso This bug is marked as fixed by commit: ext4: Avoid crash when inline data creation follows DIO write But I can't find it in any tested tree for more than 90 days. Is it a correct commit? Please update it by replying: #syz fix: exact-commit-title Until then the bug is still considered open and new crashes with the same signature are ignored. ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: kernel BUG in ext4_writepages 2022-11-07 12:37 ` kernel BUG in ext4_writepages syzbot @ 2022-11-07 14:08 ` Jan Kara 0 siblings, 0 replies; 8+ messages in thread From: Jan Kara @ 2022-11-07 14:08 UTC (permalink / raw) To: syzbot Cc: gregkh, jack, lczerner, linux-ext4, linux-fsdevel, linux-kernel, stable-commits, stable, stable, syzkaller-android-bugs, tadeusz.struk, tytso On Mon 07-11-22 04:37:28, syzbot wrote: > This bug is marked as fixed by commit: > ext4: Avoid crash when inline data creation follows DIO write > But I can't find it in any tested tree for more than 90 days. > Is it a correct commit? Please update it by replying: > #syz fix: exact-commit-title > Until then the bug is still considered open and > new crashes with the same signature are ignored. #syz fix: ext4: avoid crash when inline data creation follows DIO write Honza -- Jan Kara <jack@suse.com> SUSE Labs, CR ^ permalink raw reply [flat|nested] 8+ messages in thread
* kernel BUG in ext4_writepages @ 2022-05-10 22:28 Tadeusz Struk 2022-05-19 12:23 ` Jan Kara 0 siblings, 1 reply; 8+ messages in thread From: Tadeusz Struk @ 2022-05-10 22:28 UTC (permalink / raw) To: linux-ext4; +Cc: lkml, linux-fsdevel Hi, Syzbot found another BUG in ext4_writepages [1]. This time it complains about inode with inline data. C reproducer can be found here [2] I was able to trigger it on 5.18.0-rc6 [1] https://syzkaller.appspot.com/bug?id=a1e89d09bbbcbd5c4cb45db230ee28c822953984 [2] https://syzkaller.appspot.com/text?tag=ReproC&x=129da6caf00000 -- Thanks, Tadeusz ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: kernel BUG in ext4_writepages 2022-05-10 22:28 Tadeusz Struk @ 2022-05-19 12:23 ` Jan Kara 2022-05-19 15:40 ` Tadeusz Struk 2022-05-19 23:14 ` Tadeusz Struk 0 siblings, 2 replies; 8+ messages in thread From: Jan Kara @ 2022-05-19 12:23 UTC (permalink / raw) To: Tadeusz Struk; +Cc: linux-ext4, lkml, linux-fsdevel Hi! On Tue 10-05-22 15:28:38, Tadeusz Struk wrote: > Syzbot found another BUG in ext4_writepages [1]. > This time it complains about inode with inline data. > C reproducer can be found here [2] > I was able to trigger it on 5.18.0-rc6 > > [1] https://syzkaller.appspot.com/bug?id=a1e89d09bbbcbd5c4cb45db230ee28c822953984 > [2] https://syzkaller.appspot.com/text?tag=ReproC&x=129da6caf00000 Thanks for report. This should be fixed by: https://lore.kernel.org/all/20220516012752.17241-1-yebin10@huawei.com/ Honza -- Jan Kara <jack@suse.com> SUSE Labs, CR ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: kernel BUG in ext4_writepages 2022-05-19 12:23 ` Jan Kara @ 2022-05-19 15:40 ` Tadeusz Struk 2022-05-19 23:14 ` Tadeusz Struk 1 sibling, 0 replies; 8+ messages in thread From: Tadeusz Struk @ 2022-05-19 15:40 UTC (permalink / raw) To: Jan Kara; +Cc: linux-ext4, lkml, linux-fsdevel On 5/19/22 05:23, Jan Kara wrote: > Hi! > > On Tue 10-05-22 15:28:38, Tadeusz Struk wrote: >> Syzbot found another BUG in ext4_writepages [1]. >> This time it complains about inode with inline data. >> C reproducer can be found here [2] >> I was able to trigger it on 5.18.0-rc6 >> >> [1] https://syzkaller.appspot.com/bug?id=a1e89d09bbbcbd5c4cb45db230ee28c822953984 >> [2] https://syzkaller.appspot.com/text?tag=ReproC&x=129da6caf00000 > > Thanks for report. This should be fixed by: > > https://lore.kernel.org/all/20220516012752.17241-1-yebin10@huawei.com/ Hi, Thanks for info. I tested the patch, but it doesn't fix the issue. In this case it doesn't even call ext4_convert_inline_data() -- Thanks, Tadeusz ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: kernel BUG in ext4_writepages 2022-05-19 12:23 ` Jan Kara 2022-05-19 15:40 ` Tadeusz Struk @ 2022-05-19 23:14 ` Tadeusz Struk 2022-05-20 9:50 ` Jan Kara 1 sibling, 1 reply; 8+ messages in thread From: Tadeusz Struk @ 2022-05-19 23:14 UTC (permalink / raw) To: Jan Kara; +Cc: linux-ext4, lkml, linux-fsdevel On 5/19/22 05:23, Jan Kara wrote: > Hi! > > On Tue 10-05-22 15:28:38, Tadeusz Struk wrote: >> Syzbot found another BUG in ext4_writepages [1]. >> This time it complains about inode with inline data. >> C reproducer can be found here [2] >> I was able to trigger it on 5.18.0-rc6 >> >> [1] https://syzkaller.appspot.com/bug?id=a1e89d09bbbcbd5c4cb45db230ee28c822953984 >> [2] https://syzkaller.appspot.com/text?tag=ReproC&x=129da6caf00000 > > Thanks for report. This should be fixed by: > > https://lore.kernel.org/all/20220516012752.17241-1-yebin10@huawei.com/ In case of the syzbot bug there is something messed up with PAGE DIRTY flags and the way syzbot sets up the write. This is what triggers the crash: $ ftrace -f ./repro ... [pid 2395] open("./bus", O_RDWR|O_CREAT|O_SYNC|O_NOATIME, 000 <unfinished ...> [pid 2395] <... open resumed> ) = 6 ... [pid 2395] write(6, "\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0", 22 <unfinished ...> ... [pid 2395] <... write resumed> ) = 22 One way I could fix it was to clear the PAGECACHE_TAG_DIRTY on the mapping in ext4_try_to_write_inline_data() after the page has been updated: diff --git a/fs/ext4/inline.c b/fs/ext4/inline.c index 9c076262770d..e4bbb53fa26f 100644 --- a/fs/ext4/inline.c +++ b/fs/ext4/inline.c @@ -715,6 +715,7 @@ int ext4_try_to_write_inline_data(struct address_space *mapping, put_page(page); goto out_up_read; } + __xa_clear_mark(&mapping->i_pages, 0, PAGECACHE_TAG_DIRTY); } ret = 1; Please let me know it if makes sense any I will send a proper patch. -- Thanks, Tadeusz ^ permalink raw reply related [flat|nested] 8+ messages in thread
* Re: kernel BUG in ext4_writepages 2022-05-19 23:14 ` Tadeusz Struk @ 2022-05-20 9:50 ` Jan Kara 2022-05-20 14:50 ` Tadeusz Struk 0 siblings, 1 reply; 8+ messages in thread From: Jan Kara @ 2022-05-20 9:50 UTC (permalink / raw) To: Tadeusz Struk; +Cc: Jan Kara, linux-ext4, lkml, linux-fsdevel On Thu 19-05-22 16:14:17, Tadeusz Struk wrote: > On 5/19/22 05:23, Jan Kara wrote: > > Hi! > > > > On Tue 10-05-22 15:28:38, Tadeusz Struk wrote: > > > Syzbot found another BUG in ext4_writepages [1]. > > > This time it complains about inode with inline data. > > > C reproducer can be found here [2] > > > I was able to trigger it on 5.18.0-rc6 > > > > > > [1] https://syzkaller.appspot.com/bug?id=a1e89d09bbbcbd5c4cb45db230ee28c822953984 > > > [2] https://syzkaller.appspot.com/text?tag=ReproC&x=129da6caf00000 > > > > Thanks for report. This should be fixed by: > > > > https://lore.kernel.org/all/20220516012752.17241-1-yebin10@huawei.com/ > > > In case of the syzbot bug there is something messed up with PAGE DIRTY flags > and the way syzbot sets up the write. This is what triggers the crash: Can you tell me where exactly we hit the bug? I've now noticed that this is on 5.10 kernel and on vanilla 5.10 there's no BUG_ON on line 2753. > $ ftrace -f ./repro > ... > [pid 2395] open("./bus", O_RDWR|O_CREAT|O_SYNC|O_NOATIME, 000 <unfinished ...> > [pid 2395] <... open resumed> ) = 6 > ... > [pid 2395] write(6, "\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0", 22 <unfinished ...> > ... > [pid 2395] <... write resumed> ) = 22 > > One way I could fix it was to clear the PAGECACHE_TAG_DIRTY on the mapping in > ext4_try_to_write_inline_data() after the page has been updated: > > diff --git a/fs/ext4/inline.c b/fs/ext4/inline.c > index 9c076262770d..e4bbb53fa26f 100644 > --- a/fs/ext4/inline.c > +++ b/fs/ext4/inline.c > @@ -715,6 +715,7 @@ int ext4_try_to_write_inline_data(struct address_space *mapping, > put_page(page); > goto out_up_read; > } > + __xa_clear_mark(&mapping->i_pages, 0, PAGECACHE_TAG_DIRTY); > } > ret = 1; > > Please let me know it if makes sense any I will send a proper patch. No, this looks really wrong... We need to better understand what's going on. Honza -- Jan Kara <jack@suse.com> SUSE Labs, CR ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: kernel BUG in ext4_writepages 2022-05-20 9:50 ` Jan Kara @ 2022-05-20 14:50 ` Tadeusz Struk 0 siblings, 0 replies; 8+ messages in thread From: Tadeusz Struk @ 2022-05-20 14:50 UTC (permalink / raw) To: Jan Kara; +Cc: linux-ext4, lkml, linux-fsdevel On 5/20/22 02:50, Jan Kara wrote: > On Thu 19-05-22 16:14:17, Tadeusz Struk wrote: >> On 5/19/22 05:23, Jan Kara wrote: >>> Hi! >>> >>> On Tue 10-05-22 15:28:38, Tadeusz Struk wrote: >>>> Syzbot found another BUG in ext4_writepages [1]. >>>> This time it complains about inode with inline data. >>>> C reproducer can be found here [2] >>>> I was able to trigger it on 5.18.0-rc6 >>>> >>>> [1] https://syzkaller.appspot.com/bug?id=a1e89d09bbbcbd5c4cb45db230ee28c822953984 >>>> [2] https://syzkaller.appspot.com/text?tag=ReproC&x=129da6caf00000 >>> >>> Thanks for report. This should be fixed by: >>> >>> https://lore.kernel.org/all/20220516012752.17241-1-yebin10@huawei.com/ >> >> >> In case of the syzbot bug there is something messed up with PAGE DIRTY flags >> and the way syzbot sets up the write. This is what triggers the crash: > > Can you tell me where exactly we hit the bug? I've now noticed that this is > on 5.10 kernel and on vanilla 5.10 there's no BUG_ON on line 2753. We are hiting this bug: https://elixir.bootlin.com/linux/latest/source/fs/ext4/inode.c#L2707 Syzbot found it in v5.10, but I recreated it on 5.18-rc7, that's why the line number mismatch. But this is the same bug. On 5.10 it's in line 2739: https://elixir.bootlin.com/linux/v5.10.117/source/fs/ext4/inode.c#L2739 > >> $ ftrace -f ./repro >> ... >> [pid 2395] open("./bus", O_RDWR|O_CREAT|O_SYNC|O_NOATIME, 000 <unfinished ...> >> [pid 2395] <... open resumed> ) = 6 >> ... >> [pid 2395] write(6, "\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0", 22 <unfinished ...> >> ... >> [pid 2395] <... write resumed> ) = 22 >> >> One way I could fix it was to clear the PAGECACHE_TAG_DIRTY on the mapping in >> ext4_try_to_write_inline_data() after the page has been updated: >> >> diff --git a/fs/ext4/inline.c b/fs/ext4/inline.c >> index 9c076262770d..e4bbb53fa26f 100644 >> --- a/fs/ext4/inline.c >> +++ b/fs/ext4/inline.c >> @@ -715,6 +715,7 @@ int ext4_try_to_write_inline_data(struct address_space *mapping, >> put_page(page); >> goto out_up_read; >> } >> + __xa_clear_mark(&mapping->i_pages, 0, PAGECACHE_TAG_DIRTY); >> } >> ret = 1; >> >> Please let me know it if makes sense any I will send a proper patch. > > No, this looks really wrong... We need to better understand what's going > on. So I was afraid. I'm trying to diverge the ext4_writepages() to go to the out_writepages path before we hit this BOG_ON(). Any hints will be much appreciated. -- Thanks, Tadeusz ^ permalink raw reply [flat|nested] 8+ messages in thread
end of thread, other threads:[~2022-11-07 14:09 UTC | newest] Thread overview: 8+ messages (download: mbox.gz / follow: Atom feed) -- links below jump to the message on this page -- [not found] <000000000000c3a53d05de992007@google.com> 2022-11-07 12:37 ` kernel BUG in ext4_writepages syzbot 2022-11-07 14:08 ` Jan Kara 2022-05-10 22:28 Tadeusz Struk 2022-05-19 12:23 ` Jan Kara 2022-05-19 15:40 ` Tadeusz Struk 2022-05-19 23:14 ` Tadeusz Struk 2022-05-20 9:50 ` Jan Kara 2022-05-20 14:50 ` Tadeusz Struk
This is an external index of several public inboxes, see mirroring instructions on how to clone and mirror all data and code used by this external index.