From mboxrd@z Thu Jan 1 00:00:00 1970 From: Alex Williamson Subject: Re: [RFC PATCH v4 0/3] Add Mediated device support[was: Add vGPU support] Date: Fri, 27 May 2016 08:54:43 -0600 Message-ID: <20160527085443.27f937eb@t450s.home> References: <1464119897-10844-1-git-send-email-kwankhede@nvidia.com> <20160525074356.52121ab8@ul30vt.home> Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Cc: Kirti Wankhede , "pbonzini@redhat.com" , "kraxel@redhat.com" , "cjia@nvidia.com" , "qemu-devel@nongnu.org" , "kvm@vger.kernel.org" , "Ruan, Shuai" , "Song, Jike" , "Lv, Zhiyuan" , "bjsdjshi@linux.vnet.ibm.com" To: "Tian, Kevin" Return-path: Received: from mx1.redhat.com ([209.132.183.28]:46488 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753298AbcE0Oyp (ORCPT ); Fri, 27 May 2016 10:54:45 -0400 In-Reply-To: Sender: kvm-owner@vger.kernel.org List-ID: On Fri, 27 May 2016 11:02:46 +0000 "Tian, Kevin" wrote: > > From: Alex Williamson [mailto:alex.williamson@redhat.com] > > Sent: Wednesday, May 25, 2016 9:44 PM > > > > On Wed, 25 May 2016 07:13:58 +0000 > > "Tian, Kevin" wrote: > > > > > > From: Kirti Wankhede [mailto:kwankhede@nvidia.com] > > > > Sent: Wednesday, May 25, 2016 3:58 AM > > > > > > > > This series adds Mediated device support to v4.6 Linux host kernel. Purpose > > > > of this series is to provide a common interface for mediated device > > > > management that can be used by different devices. This series introduces > > > > Mdev core module that create and manage mediated devices, VFIO based driver > > > > for mediated PCI devices that are created by Mdev core module and update > > > > VFIO type1 IOMMU module to support mediated devices. > > > > > > Thanks. "Mediated device" is more generic than previous one. :-) > > > > > > > > > > > What's new in v4? > > > > - Renamed 'vgpu' module to 'mdev' module that represent generic term > > > > 'Mediated device'. > > > > - Moved mdev directory to drivers/vfio directory as this is the extension > > > > of VFIO APIs for mediated devices. > > > > - Updated mdev driver to be flexible to register multiple types of drivers > > > > to mdev_bus_type bus. > > > > - Updated mdev core driver with mdev_put_device() and mdev_get_device() for > > > > mediated devices. > > > > > > > > > > > > > > Just curious. In this version you move the whole mdev core under > > > VFIO now. Sorry if I missed any agreement on this change. IIRC Alex > > > doesn't want VFIO to manage mdev life-cycle directly. Instead VFIO is > > > just a mdev driver on created mediated devices.... > > > > I did originally suggest keeping them separate, but as we've progressed > > through the implementation, it's become more clear that the mediated > > device interface is very much tied to the vfio interface, acting mostly > > as a passthrough. So I thought it made sense to pull them together. > > Still open to discussion of course. Thanks, > > > > The main benefit of maintaining a separate mdev framework, IMHO, is > to allow better support of both KVM and Xen. Xen doesn't work with VFIO > today, because other VM's memory is not allocated from Dom0 which > means VFIO within Dom0 doesn't has view/permission to control isolation > for other VMs. Isn't this just a matter of the vfio iommu model selected? There could be a vfio-iommu-xen that knows how to do the grant calls. > However, after some thinking I think it might not be a big problem to > combine VFIO/mdev together, if we extend Xen to just use VFIO for > resource enumeration. In such model, VFIO still behaves as a single > kernel portal to enumerate mediated devices to user space, but give up > permission control to Qemu which will request a secure agent - Xen > hypervisor - to ensure isolation of VM usage on mediated device (including > EPT/IOMMU configuration). The whole point here is to use the vfio user api and we seem to be progressing towards using vfio-core as a conduit where the mediated driver api is also fairly vfio-ish. So it seems we're really headed towards a vfio-mediated device rather than some sort generic mediated driver interface. I would object to leaving permission control to QEMU, QEMU is just a vfio user, there are others like DPDK. The kernel needs to be in charge of protecting itself and users from each other, QEMU can't do this, which is part of reason that KVM has moved to vfio rather than the pci-sysfs resource interface. > I'm not sure whether VFIO can support this usage today. It is somehow > similar to channel io passthru in s390, where we also rely on Qemu to > mediate ccw commands to ensure isolation. Maybe just some slight > extension is required (e.g. not assume some API must be invoked). Of > course Qemu side vfio code also need some change. If this can work, > at least we can first put it as the enumeration interface for mediated > device in Xen. In the future it may be extended to cover normal Xen > PCI assignment as well instead of using sysfs to read PCI resource > today. The channel io proposal doesn't rely on QEMU for security either, the mediation occurs in the host kernel, parsing the ccw command program, and doing translations to replace the guest physical addresses with verified and pinned host physical addresses before submitting the program to be run. A mediated device is policed by the mediated vendor driver in the host kernel, QEMU is untrusted, just like any other user. If xen is currently using pci-sysfs for mapping device resources, then vfio should be directly usable, which leaves the IOMMU interfaces, such as pinning and mapping user memory and making use of the IOMMU API, that part of vfio is fairly modular though IOMMU groups is a fairly fundamental concept within the core. Thanks, Alex From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from eggs.gnu.org ([2001:4830:134:3::10]:49631) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1b6JA3-0002rK-1h for qemu-devel@nongnu.org; Fri, 27 May 2016 10:54:52 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1b6J9x-00060k-WA for qemu-devel@nongnu.org; Fri, 27 May 2016 10:54:51 -0400 Received: from mx1.redhat.com ([209.132.183.28]:45454) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1b6J9x-00060Y-NK for qemu-devel@nongnu.org; Fri, 27 May 2016 10:54:45 -0400 Date: Fri, 27 May 2016 08:54:43 -0600 From: Alex Williamson Message-ID: <20160527085443.27f937eb@t450s.home> In-Reply-To: References: <1464119897-10844-1-git-send-email-kwankhede@nvidia.com> <20160525074356.52121ab8@ul30vt.home> MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Subject: Re: [Qemu-devel] [RFC PATCH v4 0/3] Add Mediated device support[was: Add vGPU support] List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , To: "Tian, Kevin" Cc: Kirti Wankhede , "pbonzini@redhat.com" , "kraxel@redhat.com" , "cjia@nvidia.com" , "qemu-devel@nongnu.org" , "kvm@vger.kernel.org" , "Ruan, Shuai" , "Song, Jike" , "Lv, Zhiyuan" , "bjsdjshi@linux.vnet.ibm.com" On Fri, 27 May 2016 11:02:46 +0000 "Tian, Kevin" wrote: > > From: Alex Williamson [mailto:alex.williamson@redhat.com] > > Sent: Wednesday, May 25, 2016 9:44 PM > > > > On Wed, 25 May 2016 07:13:58 +0000 > > "Tian, Kevin" wrote: > > > > > > From: Kirti Wankhede [mailto:kwankhede@nvidia.com] > > > > Sent: Wednesday, May 25, 2016 3:58 AM > > > > > > > > This series adds Mediated device support to v4.6 Linux host kernel. Purpose > > > > of this series is to provide a common interface for mediated device > > > > management that can be used by different devices. This series introduces > > > > Mdev core module that create and manage mediated devices, VFIO based driver > > > > for mediated PCI devices that are created by Mdev core module and update > > > > VFIO type1 IOMMU module to support mediated devices. > > > > > > Thanks. "Mediated device" is more generic than previous one. :-) > > > > > > > > > > > What's new in v4? > > > > - Renamed 'vgpu' module to 'mdev' module that represent generic term > > > > 'Mediated device'. > > > > - Moved mdev directory to drivers/vfio directory as this is the extension > > > > of VFIO APIs for mediated devices. > > > > - Updated mdev driver to be flexible to register multiple types of drivers > > > > to mdev_bus_type bus. > > > > - Updated mdev core driver with mdev_put_device() and mdev_get_device() for > > > > mediated devices. > > > > > > > > > > > > > > Just curious. In this version you move the whole mdev core under > > > VFIO now. Sorry if I missed any agreement on this change. IIRC Alex > > > doesn't want VFIO to manage mdev life-cycle directly. Instead VFIO is > > > just a mdev driver on created mediated devices.... > > > > I did originally suggest keeping them separate, but as we've progressed > > through the implementation, it's become more clear that the mediated > > device interface is very much tied to the vfio interface, acting mostly > > as a passthrough. So I thought it made sense to pull them together. > > Still open to discussion of course. Thanks, > > > > The main benefit of maintaining a separate mdev framework, IMHO, is > to allow better support of both KVM and Xen. Xen doesn't work with VFIO > today, because other VM's memory is not allocated from Dom0 which > means VFIO within Dom0 doesn't has view/permission to control isolation > for other VMs. Isn't this just a matter of the vfio iommu model selected? There could be a vfio-iommu-xen that knows how to do the grant calls. > However, after some thinking I think it might not be a big problem to > combine VFIO/mdev together, if we extend Xen to just use VFIO for > resource enumeration. In such model, VFIO still behaves as a single > kernel portal to enumerate mediated devices to user space, but give up > permission control to Qemu which will request a secure agent - Xen > hypervisor - to ensure isolation of VM usage on mediated device (including > EPT/IOMMU configuration). The whole point here is to use the vfio user api and we seem to be progressing towards using vfio-core as a conduit where the mediated driver api is also fairly vfio-ish. So it seems we're really headed towards a vfio-mediated device rather than some sort generic mediated driver interface. I would object to leaving permission control to QEMU, QEMU is just a vfio user, there are others like DPDK. The kernel needs to be in charge of protecting itself and users from each other, QEMU can't do this, which is part of reason that KVM has moved to vfio rather than the pci-sysfs resource interface. > I'm not sure whether VFIO can support this usage today. It is somehow > similar to channel io passthru in s390, where we also rely on Qemu to > mediate ccw commands to ensure isolation. Maybe just some slight > extension is required (e.g. not assume some API must be invoked). Of > course Qemu side vfio code also need some change. If this can work, > at least we can first put it as the enumeration interface for mediated > device in Xen. In the future it may be extended to cover normal Xen > PCI assignment as well instead of using sysfs to read PCI resource > today. The channel io proposal doesn't rely on QEMU for security either, the mediation occurs in the host kernel, parsing the ccw command program, and doing translations to replace the guest physical addresses with verified and pinned host physical addresses before submitting the program to be run. A mediated device is policed by the mediated vendor driver in the host kernel, QEMU is untrusted, just like any other user. If xen is currently using pci-sysfs for mapping device resources, then vfio should be directly usable, which leaves the IOMMU interfaces, such as pinning and mapping user memory and making use of the IOMMU API, that part of vfio is fairly modular though IOMMU groups is a fairly fundamental concept within the core. Thanks, Alex