From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-1.0 required=3.0 tests=HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,SPF_PASS autolearn=ham autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id F1C06C43381 for ; Wed, 6 Mar 2019 07:41:15 +0000 (UTC) Received: from vger.kernel.org (vger.kernel.org [209.132.180.67]) by mail.kernel.org (Postfix) with ESMTP id B55DF20661 for ; Wed, 6 Mar 2019 07:41:15 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1729477AbfCFHlO (ORCPT ); Wed, 6 Mar 2019 02:41:14 -0500 Received: from szxga07-in.huawei.com ([45.249.212.35]:55410 "EHLO huawei.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1728723AbfCFHlN (ORCPT ); Wed, 6 Mar 2019 02:41:13 -0500 Received: from DGGEMS402-HUB.china.huawei.com (unknown [172.30.72.59]) by Forcepoint Email with ESMTP id 450DFBF8FE01670BA136; Wed, 6 Mar 2019 15:41:11 +0800 (CST) Received: from [127.0.0.1] (10.177.29.68) by DGGEMS402-HUB.china.huawei.com (10.3.19.202) with Microsoft SMTP Server id 14.3.408.0; Wed, 6 Mar 2019 15:41:08 +0800 Message-ID: <5C7F7992.7050806@huawei.com> Date: Wed, 6 Mar 2019 15:41:06 +0800 From: zhong jiang User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:12.0) Gecko/20120428 Thunderbird/12.0.1 MIME-Version: 1.0 To: Mike Rapoport CC: Andrea Arcangeli , Dmitry Vyukov , syzbot , Michal Hocko , , Johannes Weiner , LKML , Linux-MM , syzkaller-bugs , Vladimir Davydov , David Rientjes , Hugh Dickins , Matthew Wilcox , Mel Gorman , Vlastimil Babka , Mike Rapoport , Peter Xu Subject: Re: KASAN: use-after-free Read in get_mem_cgroup_from_mm References: <00000000000006457e057c341ff8@google.com> <5C7BFE94.6070500@huawei.com> <5C7D2F82.40907@huawei.com> <5C7D4500.3070607@huawei.com> <5C7E1A38.2060906@huawei.com> <20190306020540.GA23850@redhat.com> <5C7F6048.2050802@huawei.com> <20190306062625.GA3549@rapoport-lnx> In-Reply-To: <20190306062625.GA3549@rapoport-lnx> Content-Type: text/plain; charset="ISO-8859-1" Content-Transfer-Encoding: 7bit X-Originating-IP: [10.177.29.68] X-CFilter-Loop: Reflected Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2019/3/6 14:26, Mike Rapoport wrote: > Hi, > > On Wed, Mar 06, 2019 at 01:53:12PM +0800, zhong jiang wrote: >> On 2019/3/6 10:05, Andrea Arcangeli wrote: >>> Hello everyone, >>> >>> [ CC'ed Mike and Peter ] >>> >>> On Tue, Mar 05, 2019 at 02:42:00PM +0800, zhong jiang wrote: >>>> On 2019/3/5 14:26, Dmitry Vyukov wrote: >>>>> On Mon, Mar 4, 2019 at 4:32 PM zhong jiang wrote: >>>>>> On 2019/3/4 22:11, Dmitry Vyukov wrote: >>>>>>> On Mon, Mar 4, 2019 at 3:00 PM zhong jiang wrote: >>>>>>>> On 2019/3/4 15:40, Dmitry Vyukov wrote: >>>>>>>>> On Sun, Mar 3, 2019 at 5:19 PM zhong jiang wrote: >>>>>>>>>> Hi, guys >>>>>>>>>> >>>>>>>>>> I also hit the following issue. but it fails to reproduce the issue by the log. >>>>>>>>>> >>>>>>>>>> it seems to the case that we access the mm->owner and deference it will result in the UAF. >>>>>>>>>> But it should not be possible that we specify the incomplete process to be the mm->owner. >>>>>>>>>> >>>>>>>>>> Any thoughts? >>>>>>>>> FWIW syzbot was able to reproduce this with this reproducer. >>>>>>>>> This looks like a very subtle race (threaded reproducer that runs >>>>>>>>> repeatedly in multiple processes), so most likely we are looking for >>>>>>>>> something like few instructions inconsistency window. >>>>>>>>> >>>>>>>> I has a little doubtful about the instrustions inconsistency window. >>>>>>>> >>>>>>>> I guess that you mean some smb barriers should be taken into account.:-) >>>>>>>> >>>>>>>> Because IMO, It should not be the lock case to result in the issue. >>>>>>> Since the crash was triggered on x86 _most likley_ this is not a >>>>>>> missed barrier. What I meant is that one thread needs to executed some >>>>>>> code, while another thread is stopped within few instructions. >>>>>>> >>>>>>> >>>>>> It is weird and I can not find any relationship you had said with the issue.:-( >>>>>> >>>>>> Because It is the cause that mm->owner has been freed, whereas we still deference it. >>>>>> >>>>>> From the lastest freed task call trace, It fails to create process. >>>>>> >>>>>> Am I miss something or I misunderstand your meaning. Please correct me. >>>>> Your analysis looks correct. I am just saying that the root cause of >>>>> this use-after-free seems to be a race condition. >>>>> >>>>> >>>>> >>>> Yep, Indeed, I can not figure out how the race works. I will dig up further. >>> Yes it's a race condition. >>> >>> We were aware about the non-cooperative fork userfaultfd feature >>> creating userfaultfd file descriptor that gets reported to the parent >>> uffd, despite they belong to mm created by failed forks. >>> >>> https://www.spinics.net/lists/linux-mm/msg136357.html >>> >> Hi, Andrea >> >> I still not clear why uffd ioctl can use the incomplete process as the mm->owner. >> and how to produce the race. > There is a C reproducer in the syzcaller report: > > https://syzkaller.appspot.com/x/repro.c?x=172fa5a3400000 > >> From your above explainations, My underdtanding is that the process handling do_exexve >> will have a temporary mm, which will be used by the UUFD ioctl. > The race is between userfaultfd operation and fork() failure: > > forking thread | userfaultfd monitor thread > --------------------------------+------------------------------- > fork() | > dup_mmap() | > dup_userfaultfd() | > dup_userfaultfd_complete() | > | read(UFFD_EVENT_FORK) > | uffdio_copy() > | mmget_not_zero() > goto bad_fork_something | > ... | > bad_fork_free: | > free_task() | > | mem_cgroup_from_task() > | /* access stale mm->owner */ > Hi, Mike forking thread fails to create the process ,and then free the allocated task struct. Other userfaultfd monitor thread should not access the stale mm->owner. The parent process and child process do not share the mm struct. Userfaultfd monitor thread's mm->owner should not point to the freed child task_struct. and due to the existence of tasklist_lock, we can not specify the mm->owner to freed task_struct. I miss something,=-O Thanks, zhong jiang >> Thanks, >> zhong jiang