From: Kefeng Wang <wangkefeng.wang@huawei.com>
To: Oleg Nesterov <oleg@redhat.com>
Cc: Michal Hocko <mhocko@suse.com>, linux-mm <linux-mm@kvack.org>,
"Andrea Arcangeli" <aarcange@redhat.com>,
Peter Xu <peterx@redhat.com>, Mike Rapoport <rppt@linux.ibm.com>,
Jann Horn <jannh@google.com>, Jason Gunthorpe <jgg@mellanox.com>,
Andrew Morton <akpm@linux-foundation.org>
Subject: Re: [BUG] kernel BUG at fs/userfaultfd.c:385 after 04f5866e41fb
Date: Thu, 15 Aug 2019 10:21:01 +0800 [thread overview]
Message-ID: <0cfded81-6668-905f-f2be-490bf7c750fb@huawei.com> (raw)
In-Reply-To: <20190814154101.GF11595@redhat.com>
On 2019/8/14 23:41, Oleg Nesterov wrote:
> On 08/14, Oleg Nesterov wrote:
>> I am wondering if "goto skip_mm" in userfaultfd_release() is correct...
>> shouldn't it clear VM_UFFD_* and reset vm_userfaultfd_ctx.ctx even if
>> !mmget_still_valid ?
>
> Heh, I didn't notice you too mentioned userfaultfd_release() in your email.
> can you try the patch below?
Your patch below fixes the issue, could you send a formal patch ASAP and also it
should be queued into stable, I have test lts4.4, it works too, thanks.
I built kernel with wrong gcc version, and the KASAN is not enabled, When KASAN enabled,
there is an UAF,
[ 67.393442] ==================================================================
[ 67.395531] BUG: KASAN: use-after-free in handle_userfault+0x12f/0xc70
[ 67.397001] Read of size 8 at addr ffff8883c622c160 by task syz-executor.9/5225
[ 67.398672]
[ 67.399035] CPU: 2 PID: 5225 Comm: syz-executor.9 Not tainted 5.3.0-rc4 #3
[ 67.400601] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.10.2-1ubuntu1 04/01/2014
[ 67.402818] Call Trace:
[ 67.403393] dump_stack+0x9a/0xeb
[ 67.404245] ? handle_userfault+0x12f/0xc70
[ 67.405268] print_address_description+0x79/0x440
[ 67.406411] ? handle_userfault+0x12f/0xc70
[ 67.407454] __kasan_report+0x15c/0x1df
[ 67.408418] ? rwlock_bug.part.0+0x60/0x60
[ 67.409470] ? handle_userfault+0x12f/0xc70
[ 67.410491] kasan_report+0xe/0x20
[ 67.411477] handle_userfault+0x12f/0xc70
[ 67.412610] ? __lock_acquire+0x66c/0x2420
[ 67.413728] ? userfaultfd_ioctl+0x1c30/0x1c30
[ 67.414910] ? match_held_lock+0x1b/0x250
[ 67.415993] ? check_chain_key+0x1d7/0x2d0
[ 67.417127] ? lock_downgrade+0x3a0/0x3a0
[ 67.418198] ? do_raw_spin_lock+0x10a/0x1d0
[ 67.419319] ? rwlock_bug.part.0+0x60/0x60
[ 67.420442] __handle_mm_fault+0x17e0/0x1ac0
[ 67.421618] ? check_chain_key+0x1d7/0x2d0
[ 67.422733] ? __pmd_alloc+0x260/0x260
[ 67.423824] ? mark_held_locks+0x46/0xa0
[ 67.424867] ? handle_mm_fault+0x142/0x540
[ 67.425958] handle_mm_fault+0x20c/0x540
[ 67.427044] __do_page_fault+0x3b4/0x6a0
[ 67.428144] do_page_fault+0x32/0x310
[ 67.429190] async_page_fault+0x43/0x50
[ 67.430243] RIP: 0010:copy_user_handle_tail+0x2/0x10
[ 67.431586] Code: c3 0f 1f 80 00 00 00 00 66 66 90 83 fa 40 0f 82 70 ff ff ff 89 d1 f3 a4 31 c0 66 66 90 c3 66 2e 0f 1f 84 00 00 00 00 00 89 d1 <f3> a4 89 c8 66 66 90 c3 66 0f 1f 44 00 00 66 66 90 83 fa 08 0f 82
[ 67.436978] RSP: 0018:ffff8883c4e8f908 EFLAGS: 00010246
[ 67.438743] RAX: 0000000000000001 RBX: 0000000020ffd000 RCX: 0000000000001000
[ 67.441101] RDX: 0000000000001000 RSI: 0000000020ffd000 RDI: ffff8883c0aa4000
[ 67.442865] RBP: 0000000000001000 R08: ffffed1078154a00 R09: 0000000000000000
[ 67.444534] R10: 0000000000000200 R11: ffffed10781549ff R12: ffff8883c0aa4000
[ 67.446216] R13: ffff8883c6096000 R14: ffff88837721f838 R15: ffff8883c6096000
[ 67.448388] _copy_from_user+0xa1/0xd0
[ 67.449655] mcopy_atomic+0xb3d/0x1380
[ 67.450991] ? lock_downgrade+0x3a0/0x3a0
[ 67.452337] ? mm_alloc_pmd+0x130/0x130
[ 67.453618] ? __might_fault+0x7d/0xe0
[ 67.454980] userfaultfd_ioctl+0x14a2/0x1c30
[ 67.456430] ? drop_futex_key_refs+0x25/0x70
[ 67.457873] ? __x64_sys_userfaultfd+0x200/0x200
[ 67.459420] ? futex_wait_setup+0x200/0x200
[ 67.460823] ? migrate_swap_stop+0x4e0/0x4e0
[ 67.462257] ? plist_del+0xd8/0x190
[ 67.463494] ? wake_up_q+0x59/0xa0
[ 67.464678] ? check_chain_key+0x1d7/0x2d0
[ 67.466088] ? __lock_acquire+0x66c/0x2420
[ 67.467496] ? match_held_lock+0x1b/0x250
[ 67.468937] ? do_vfs_ioctl+0x131/0x9d0
[ 67.470224] do_vfs_ioctl+0x131/0x9d0
[ 67.471473] ? match_held_lock+0x1b/0x250
[ 67.472821] ? ioctl_preallocate+0x170/0x170
[ 67.474251] ? debug_lockdep_rcu_enabled.part.4+0x16/0x30
[ 67.476046] ? selinux_file_ioctl+0x1f9/0x390
[ 67.477500] ? selinux_vm_enough_memory+0x70/0x70
[ 67.479080] ? m_next+0x33/0x70
[ 67.480184] ? do_dup2+0x2c0/0x2c0
[ 67.481475] ksys_ioctl+0x70/0x80
[ 67.482607] ? mark_held_locks+0x1c/0xa0
[ 67.483938] __x64_sys_ioctl+0x3d/0x50
[ 67.485203] do_syscall_64+0x72/0x330
[ 67.486463] entry_SYSCALL_64_after_hwframe+0x49/0xbe
[ 67.488158] RIP: 0033:0x458c59
[ 67.489166] Code: ad b8 fb ff c3 66 2e 0f 1f 84 00 00 00 00 00 66 90 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 0f 83 7b b8 fb ff c3 66 2e 0f 1f 84 00 00 00 00
[ 67.495146] RSP: 002b:00007f33a38d6c78 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
[ 67.497608] RAX: ffffffffffffffda RBX: 0000000000000003 RCX: 0000000000458c59
[ 67.499931] RDX: 0000000020000000 RSI: 00000000c028aa03 RDI: 0000000000000003
[ 67.502262] RBP: 000000000073c040 R08: 0000000000000000 R09: 0000000000000000
[ 67.504608] R10: 0000000000000000 R11: 0000000000000246 R12: 00007f33a38d76d4
[ 67.506939] R13: 00000000004c34cf R14: 00000000004d6958 R15: 00000000ffffffff
[ 67.509359]
[ 67.509906] Allocated by task 5145:
[ 67.511091] save_stack+0x19/0x80
[ 67.512209] __kasan_kmalloc.constprop.8+0xa0/0xd0
[ 67.513805] kmem_cache_alloc+0xae/0x290
[ 67.515158] __x64_sys_userfaultfd+0x70/0x200
[ 67.516617] do_syscall_64+0x72/0x330
[ 67.517843] entry_SYSCALL_64_after_hwframe+0x49/0xbe
[ 67.519509]
[ 67.520052] Freed by task 5145:
[ 67.521112] save_stack+0x19/0x80
[ 67.522224] __kasan_slab_free+0x12e/0x180
[ 67.523600] slab_free_freelist_hook+0x5d/0x160
[ 67.525090] kmem_cache_free+0xa5/0x3b0
[ 67.526366] userfaultfd_release+0x353/0x3e0
[ 67.527805] __fput+0x15f/0x390
[ 67.528866] task_work_run+0xc7/0x100
[ 67.530095] get_signal+0xfd3/0x12e0
[ 67.531305] do_signal+0x93/0xa70
[ 67.532417] exit_to_usermode_loop+0x9d/0x130
[ 67.533885] do_syscall_64+0x2be/0x330
[ 67.535139] entry_SYSCALL_64_after_hwframe+0x49/0xbe
[ 67.536788]
[ 67.537332] The buggy address belongs to the object at ffff8883c622c000
[ 67.537332] which belongs to the cache userfaultfd_ctx_cache of size 360
[ 67.541634] The buggy address is located 352 bytes inside of
[ 67.541634] 360-byte region [ffff8883c622c000, ffff8883c622c168)
[ 67.545400] The buggy address belongs to the page:
[ 67.546996] page:ffffea000f188b00 refcount:1 mapcount:0 mapping:ffff8883c73a5900 index:0x0 compound_mapcount: 0
[ 67.550241] flags: 0x2fffff80010200(slab|head)
[ 67.551730] raw: 002fffff80010200 dead000000000100 dead000000000122 ffff8883c73a5900
[ 67.554230] raw: 0000000000000000 0000000080240024 00000001ffffffff 0000000000000000
[ 67.556743] page dumped because: kasan: bad access detected
[ 67.558575]
[ 67.559105] Memory state around the buggy address:
[ 67.560722] ffff8883c622c000: fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb
[ 67.563063] ffff8883c622c080: fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb
[ 67.565406] >ffff8883c622c100: fb fb fb fb fb fb fb fb fb fb fb fb fb fc fc fc
[ 67.567760] ^
[ 67.569795] ffff8883c622c180: fc fc fc fc fc fc fc fc fb fb fb fb fb fb fb fb
[ 67.572016] ffff8883c622c200: fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb
[ 67.574231] ==================================================================
>
> --- a/fs/userfaultfd.c
> +++ b/fs/userfaultfd.c
> @@ -880,6 +880,7 @@ static int userfaultfd_release(struct inode *inode, struct file *file)
> /* len == 0 means wake all */
> struct userfaultfd_wake_range range = { .len = 0, };
> unsigned long new_flags;
> + bool xxx;
>
> WRITE_ONCE(ctx->released, true);
>
> @@ -895,8 +896,7 @@ static int userfaultfd_release(struct inode *inode, struct file *file)
> * taking the mmap_sem for writing.
> */
> down_write(&mm->mmap_sem);
> - if (!mmget_still_valid(mm))
> - goto skip_mm;
> + xxx = mmget_still_valid(mm);
> prev = NULL;
> for (vma = mm->mmap; vma; vma = vma->vm_next) {
> cond_resched();
> @@ -907,19 +907,20 @@ static int userfaultfd_release(struct inode *inode, struct file *file)
> continue;
> }
> new_flags = vma->vm_flags & ~(VM_UFFD_MISSING | VM_UFFD_WP);
> - prev = vma_merge(mm, prev, vma->vm_start, vma->vm_end,
> - new_flags, vma->anon_vma,
> - vma->vm_file, vma->vm_pgoff,
> - vma_policy(vma),
> - NULL_VM_UFFD_CTX);
> - if (prev)
> - vma = prev;
> - else
> - prev = vma;
> + if (xxx) {
> + prev = vma_merge(mm, prev, vma->vm_start, vma->vm_end,
> + new_flags, vma->anon_vma,
> + vma->vm_file, vma->vm_pgoff,
> + vma_policy(vma),
> + NULL_VM_UFFD_CTX);
> + if (prev)
> + vma = prev;
> + else
> + prev = vma;
> + }
> vma->vm_flags = new_flags;
> vma->vm_userfaultfd_ctx = NULL_VM_UFFD_CTX;
> }
> -skip_mm:
> up_write(&mm->mmap_sem);
> mmput(mm);
> wakeup:
>
>
> .
>
next prev parent reply other threads:[~2019-08-15 2:21 UTC|newest]
Thread overview: 21+ messages / expand[flat|nested] mbox.gz Atom feed top
2019-08-13 9:08 [BUG] kernel BUG at fs/userfaultfd.c:385 after 04f5866e41fb Kefeng Wang
2019-08-14 13:53 ` Michal Hocko
2019-08-14 14:45 ` Kefeng Wang
2019-08-14 15:10 ` Oleg Nesterov
2019-08-14 15:41 ` Oleg Nesterov
2019-08-15 2:21 ` Kefeng Wang [this message]
2019-08-15 9:54 ` Oleg Nesterov
2019-08-16 10:37 ` Kefeng Wang
2019-08-19 12:48 ` Oleg Nesterov
2019-08-19 16:05 ` Andrea Arcangeli
2019-08-20 15:59 ` Oleg Nesterov
2019-08-20 16:15 ` Andrea Arcangeli
[not found] ` <73d7b5b1-a88c-5fca-ba16-be214c2524a4@I-love.SAKURA.ne.jp>
2019-08-20 16:09 ` Oleg Nesterov
2019-08-20 16:02 ` [PATCH] userfaultfd_release: always remove uffd flags and clear vm_userfaultfd_ctx Oleg Nesterov
2019-08-20 16:05 ` Andrea Arcangeli
2019-08-21 0:53 ` Kefeng Wang
2019-08-27 16:33 ` [BUG] kernel BUG at fs/userfaultfd.c:385 after 04f5866e41fb Oleg Nesterov
2019-08-27 17:14 ` Andrea Arcangeli
2019-08-28 14:25 ` Oleg Nesterov
2019-08-29 12:05 ` Andrea Arcangeli
2019-08-30 16:49 ` Oleg Nesterov
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=0cfded81-6668-905f-f2be-490bf7c750fb@huawei.com \
--to=wangkefeng.wang@huawei.com \
--cc=aarcange@redhat.com \
--cc=akpm@linux-foundation.org \
--cc=jannh@google.com \
--cc=jgg@mellanox.com \
--cc=linux-mm@kvack.org \
--cc=mhocko@suse.com \
--cc=oleg@redhat.com \
--cc=peterx@redhat.com \
--cc=rppt@linux.ibm.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).