From: Jann Horn
Date: Fri, 15 Feb 2019 18:10:16 +0100
Subject: Re: WARNING: refcount bug in kvm_vm_ioctl
To: Dmitry Vyukov
Cc: Paolo Bonzini, syzbot, KVM list, LKML, Radim Krčmář, syzkaller-bugs, Christoffer Dall, Janosch Frank, Christian Borntraeger
References: <000000000000c13ce50577db36cc@google.com> <073dedee-62df-9c67-1742-8de1e6c9502a@redhat.com>

On Fri, Feb 15, 2019 at 5:45 PM Dmitry Vyukov wrote:
>
> On Fri, Feb 15, 2019 at 5:03 PM Jann Horn wrote:
> >
> > On Fri, Feb 15, 2019 at 4:40 PM Dmitry Vyukov wrote:
> > > On Thu, Oct 11, 2018 at 4:18 PM Paolo Bonzini wrote:
> > > > On 10/10/2018 09:58, syzbot wrote:
> > > > > do_invalid_op+0x1b/0x20 arch/x86/kernel/traps.c:316
> > > > > invalid_op+0x14/0x20 arch/x86/entry/entry_64.S:993
> > > > > RIP: 0010:refcount_inc_checked+0x5d/0x70 lib/refcount.c:153
> > > > > kvm_get_kvm arch/x86/kvm/../../../virt/kvm/kvm_main.c:766 [inline]
> > > > > kvm_ioctl_create_device arch/x86/kvm/../../../virt/kvm/kvm_main.c:2924
> > > > > kvm_vm_ioctl+0xed7/0x1d40 arch/x86/kvm/../../../virt/kvm/kvm_main.c:3114
> > > > > vfs_ioctl fs/ioctl.c:46 [inline]
> > > > > file_ioctl fs/ioctl.c:501 [inline]
> > > > > do_vfs_ioctl+0x1de/0x1720 fs/ioctl.c:685
> > > > > ksys_ioctl+0xa9/0xd0 fs/ioctl.c:702
> > > > > __do_sys_ioctl fs/ioctl.c:709 [inline]
> > > > > __se_sys_ioctl fs/ioctl.c:707 [inline]
> > > > > __x64_sys_ioctl+0x73/0xb0 fs/ioctl.c:707
> > > > > do_syscall_64+0x1b9/0x820 arch/x86/entry/common.c:290
> > > > > entry_SYSCALL_64_after_hwframe+0x49/0xbe
> > > >
> > > > The trace here is fairly simple, but I don't understand how this could
> > > > happen.
> > > >
> > > > The kvm_get_kvm is done within kvm_ioctl_create_device, which is called
> > > > from ioctl; the last reference cannot disappear inside a ioctl, because:
> > > >
> > > > 1) kvm_ioctl is called from vfs_ioctl, which does fdget and holds the fd
> > > > reference until after kvm_vm_ioctl returns
> > > >
> > > > 2) the file descriptor holds one reference to the struct kvm*, and this
> > > > reference is not released until kvm_vm_release is called by the last
> > > > fput (which could be fdput's call to fput if the process has exited in
> > > > the meanwhile)
> > > >
> > > > 3) for completeness, in case anon_inode_getfd fails, put_unused_fd will
> > > > not invoke the file descriptor's ->release callback (in this case
> > > > kvm_device_release).
> > > >
> > > > CCing some random people to get their opinion...
> > > >
> > > > Paolo
> > >
> > >
> > > Jann, is it what you fixed in "kvm: fix kvm_ioctl_create_device()
> > > reference counting (CVE-2019-6974)"?
> > > If so, we need to close the syzbot bug.
> > >
> > >
> > > > > # See https://goo.gl/kgGztJ for information about syzkaller reproducers.
> > > > > #{"threaded":true,"collide":true,"repeat":true,"procs":6,"sandbox":"none","fault_call":-1,"tun":true,"tmpdir":true,"cgroups":true,"netdev":true,"resetnet":true,"segv":true}
> > > > > r0 = openat$kvm(0xffffffffffffff9c, &(0x7f0000000380)='/dev/kvm\x00', 0x0, 0x0)
> > > > > r1 = syz_open_dev$dspn(&(0x7f0000000100)='/dev/dsp#\x00', 0x3fe, 0x400)
> > > > > r2 = ioctl$KVM_CREATE_VM(r0, 0xae01, 0x0)
> >
> > Here we create a VM fd...
> >
> > > > > perf_event_open(&(0x7f0000000040)={0x1, 0x70, 0x0, 0x0, 0x0, 0x0, 0x0, 0x50d}, 0x0, 0xffffffffffffffff, 0xffffffffffffffff, 0x0)
> > > > > mincore(&(0x7f0000ffc000/0x1000)=nil, 0x1000, &(0x7f00000003c0)=""/4096)
> > > > > setrlimit(0x0, &(0x7f0000000000))
> > > > > readahead(r1, 0x3, 0x9a6)
> > > > > ioctl$KVM_CREATE_DEVICE(r2, 0xc00caee0, &(0x7f00000002c0)={0x4})
> >
> > ... and here we do the KVM_CREATE_DEVICE ioctl with type==KVM_DEV_TYPE_VFIO.
> >
> > So that far it looks exactly like CVE-2019-6974. But CVE-2019-6974
> > also requires that someone calls close() on the file descriptor of the
> > newly created device very quickly, before the ioctl is able to
> > increment the refcount further, and I don't see anything like that
> > here. Is there a chance that syzkaller called close() on a file
> > descriptor while the ioctl() was still running without saying so here
> > (potentially through dup2() or something like that)?
>
> Yes, all fd's are closed at the end of the test:
> https://github.com/google/syzkaller/blob/master/executor/common_linux.h#L2561-L2568

Can that happen before the ioctl() has finished?