From: Yan Zhao <yan.y.zhao@intel.com>
To: intel-gvt-dev@lists.freedesktop.org
Cc: cjia@nvidia.com, kvm@vger.kernel.org, aik@ozlabs.ru,
	Zhengxiao.zx@alibaba-inc.com, shuangtai.tst@alibaba-inc.com,
	qemu-devel@nongnu.org, kwankhede@nvidia.com, eauger@redhat.com,
	yi.l.liu@intel.com, eskultet@redhat.com, ziye.yang@intel.com,
	mlevitsk@redhat.com, pasic@linux.ibm.com, libvir-list@redhat.com,
	arei.gonglei@huawei.com, felipe@nutanix.com, Ken.Xue@amd.com,
	kevin.tian@intel.com, Yan Zhao <yan.y.zhao@intel.com>,
	dgilbert@redhat.com, zhenyuw@linux.intel.com,
	dinechin@redhat.com, alex.williamson@redhat.com,
	changpeng.liu@intel.com, cohuck@redhat.com,
	linux-kernel@vger.kernel.org, zhi.a.wang@intel.com,
	jonathan.davies@nutanix.com, shaopeng.he@intel.com
Subject: [Qemu-devel] [PATCH v2 1/2] vfio/mdev: add version attribute for mdev device
Date: Sun,  5 May 2019 21:49:04 -0400
Message-ID: <20190506014904.3621-1-yan.y.zhao@intel.com>
In-Reply-To: <20190506014514.3555-1-yan.y.zhao@intel.com>

The version attribute is used to check whether two mdev devices are
compatible.

The key point of this version attribute is that it is rw.
User space does not need to understand the internals of a device version
and does not need to compare versions itself.
Instead of reading version strings from both mdev devices being checked,
user space reads only one mdev device's version attribute.
After getting its version string, user space writes this string into the
other mdev device's version attribute. The vendor driver of the mdev
device whose version attribute is written then checks the compatibility
of the two mdev devices on behalf of user space and returns success if
they are compatible or an errno if they are not.
So two reads of version attributes + checking in user space are replaced
by one read + one write of version attributes + checking in the vendor
driver.
The format and length of version strings are now private to the vendor
driver, which can define them freely.

             __ user space
              /\          \
             /             \write
            / read          \
     ______/__           ___\|/___
    | version |         | version |-->check compatibility
    -----------         -----------
    mdev device A       mdev device B
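
To illustrate the check on the device B side, here is a minimal Python
model of what a vendor driver's write handler might do. It is only a
sketch: the name version_store and the exact-match rule are assumptions
made for illustration, and the "parent PCI ID + mdev type" layout is just
one possible format. A real vendor driver implements this in C and may
use any private format. The return values follow the errno description
added by this patch.

```python
import errno
from typing import Optional


def version_store(own_version: Optional[str], incoming: str) -> int:
    """Model of a vendor driver's handler for writes to 'version'.

    own_version: version string of the device owning the attribute, or
                 None if the driver declares it incompatible with all
                 other devices (hypothetical convention for this model).
    incoming:    string written by user space (may end with a newline).
    Returns len(incoming) on success, a negative errno otherwise.
    """
    if own_version is None:
        return -errno.ENODEV
    # Assumed rule: compatible only if the strings match exactly.
    if incoming.rstrip("\n") != own_version:
        return -errno.EINVAL
    return len(incoming)
```

Because only the vendor driver interprets the string, a stricter or
looser rule than exact matching could be substituted without any change
on the user space side.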

This version attribute is optional. If an mdev device does not provide
a version attribute, it is incompatible with all other mdev devices.

Live migration is able to take advantage of this version attribute.
Before user space actually starts live migration, it can first check
whether two mdev devices are compatible.
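
The user space side of this check could look roughly like the Python
sketch below. The function name mdev_versions_compatible is hypothetical,
and the sketch assumes both version attributes are reachable as files
from the same process. When source and target are on different hosts,
the string has to be transported between them, which is not shown here.

```python
import os


def mdev_versions_compatible(src_attr: str, dst_attr: str) -> bool:
    """Return True only if reading src_attr and writing its content to
    dst_attr both succeed; any error means 'not compatible'."""
    try:
        # Read the opaque version string; do not parse or modify it.
        with open(src_attr, "rb") as f:
            version = f.read()
        # No O_CREAT: a missing attribute must count as a failure.
        fd = os.open(dst_attr, os.O_WRONLY)
        try:
            written = os.write(fd, version)
        finally:
            os.close(fd)
    except OSError:
        return False
    return written == len(version)
```

A caller would pass the two sysfs paths of the form
/sys/bus/pci/devices/<parent>/<uuid>/mdev_type/version and start
migration only when the function returns True.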

v2:
1. added detailed intent and usage
2. made definition of version string completely private to vendor driver
   (Alex Williamson)
3. abandoned changes to sample mdev drivers (Alex Williamson)
4. mandatory --> optional (Cornelia Huck)
5. added description for errno (Cornelia Huck)

Cc: Alex Williamson <alex.williamson@redhat.com>
Cc: Erik Skultety <eskultet@redhat.com>
Cc: "Dr. David Alan Gilbert" <dgilbert@redhat.com>
Cc: Cornelia Huck <cohuck@redhat.com>
Cc: "Tian, Kevin" <kevin.tian@intel.com>
Cc: Zhenyu Wang <zhenyuw@linux.intel.com>
Cc: "Wang, Zhi A" <zhi.a.wang@intel.com>
Cc: Neo Jia <cjia@nvidia.com>
Cc: Kirti Wankhede <kwankhede@nvidia.com>
Cc: Daniel P. Berrangé <berrange@redhat.com>
Cc: Christophe de Dinechin <dinechin@redhat.com>

Signed-off-by: Yan Zhao <yan.y.zhao@intel.com>
---
 Documentation/vfio-mediated-device.txt | 140 +++++++++++++++++++++++++
 1 file changed, 140 insertions(+)

diff --git a/Documentation/vfio-mediated-device.txt b/Documentation/vfio-mediated-device.txt
index c3f69bcaf96e..013a764968eb 100644
--- a/Documentation/vfio-mediated-device.txt
+++ b/Documentation/vfio-mediated-device.txt
@@ -202,6 +202,7 @@ Directories and files under the sysfs for Each Physical Device
   |     |   |--- available_instances
   |     |   |--- device_api
   |     |   |--- description
+  |     |   |--- version
   |     |   |--- [devices]
   |     |--- [<type-id>]
   |     |   |--- create
@@ -209,6 +210,7 @@ Directories and files under the sysfs for Each Physical Device
   |     |   |--- available_instances
   |     |   |--- device_api
   |     |   |--- description
+  |     |   |--- version
   |     |   |--- [devices]
   |     |--- [<type-id>]
   |          |--- create
@@ -216,6 +218,7 @@ Directories and files under the sysfs for Each Physical Device
   |          |--- available_instances
   |          |--- device_api
   |          |--- description
+  |          |--- version
   |          |--- [devices]
 
 * [mdev_supported_types]
@@ -246,6 +249,143 @@ Directories and files under the sysfs for Each Physical Device
   This attribute should show the number of devices of type <type-id> that can be
   created.
 
+* version
+
+  This attribute is rw, and is optional.
+  It is used to check device compatibility between two mdev devices and is
+  accessed in pairs between the two mdev devices being checked.
+  The intent of this attribute is to make an mdev device's version opaque to
+  user space, so instead of reading two mdev devices' version strings and
+  comparing them in user space, user space should read only one mdev device's
+  version attribute and write this version string into the other mdev device's
+  version attribute. The vendor driver of the mdev device whose version
+  attribute is written then checks the incoming version string and tells user
+  space, via the return value, whether the two mdev devices are compatible.
+  That's why this attribute is writable.
+
+  When read, this attribute should show the device version string of
+  the device of type <type-id>.
+
+  This string is private to the vendor driver. The vendor driver is free to
+  define the format and length of the device version string,
+  e.g. it can use a combination of the parent device's PCI ID + mdev type.
+
+  When a string is written to this attribute, the vendor driver should parse
+  it and check whether the mdev device identified by this string is compatible
+  with the mdev device owning this attribute. The vendor driver should then
+  return the written string's length if it regards the two mdev devices as
+  compatible, or a negative errno if it regards the two mdev devices as not
+  compatible.
+
+  User space should treat ANY of the conditions below as meaning that the two
+  mdev devices are not compatible:
+  (1) either of the two mdev devices does not have a version attribute
+  (2) reading one mdev device's version attribute fails
+  (3) writing one mdev device's version string to the other mdev device's
+  version attribute fails
+
+  User space should regard two mdev devices as compatible when ALL of the
+  conditions below are met:
+  (1) reading one mdev device's version attribute succeeds
+  (2) writing one mdev device's version string to the other mdev device's
+  version attribute succeeds
+
+  Errno:
+  If a vendor driver wants to declare an mdev device incompatible with all
+  other mdev devices, it should not register a version attribute for this mdev
+  device. But if a vendor driver has already registered a version attribute and
+  wants to declare an mdev device incompatible with all other mdev devices, it
+  needs to return -ENODEV on access to this mdev device's version attribute.
+  If an mdev device is incompatible only with certain mdev devices, writing an
+  incompatible mdev device's version string to its version attribute should
+  return -EINVAL.
+
+  Live migration can take advantage of this attribute.
+  If user space finds through the version attribute that two mdev devices are
+  compatible, it can start migration between them; otherwise it should abort
+  its migration attempt between the two mdev devices.
+
+  Example Usage:
+  case 1:
+  source side mdev device is of uuid 5ac1fb20-2bbf-4842-bb7e-36c58c3be9cd,
+  its mdev type is i915-GVTg_V5_4. pci id of parent device is 8086-193b.
+  target side mdev device is of uuid 882cc4da-dede-11e7-9180-078a62063ab1,
+  its mdev type is i915-GVTg_V5_4. pci id of parent device is 8086-193b.
+
+  # readlink /sys/bus/pci/devices/0000\:00\:02.0/\
+  5ac1fb20-2bbf-4842-bb7e-36c58c3be9cd/mdev_type
+  ../mdev_supported_types/i915-GVTg_V5_4
+
+  # readlink /sys/bus/pci/devices/0000\:00\:02.0/\
+  882cc4da-dede-11e7-9180-078a62063ab1/mdev_type
+  ../mdev_supported_types/i915-GVTg_V5_4
+
+  (1) read source side mdev device's version.
+  # cat \
+    /sys/bus/pci/devices/0000\:00\:02.0/5ac1fb20-2bbf-4842-bb7e-36c58c3be9cd/\
+    mdev_type/version
+  8086-193b-i915-GVTg_V5_4
+
+  (2) write source side mdev device's version string into target side mdev
+  device's version attribute.
+  # echo 8086-193b-i915-GVTg_V5_4 > \
+   /sys/bus/pci/devices/0000\:00\:02.0/882cc4da-dede-11e7-9180-078a62063ab1/\
+  mdev_type/version
+  # echo $?
+  0
+
+  In this case, user space's write to target side mdev device's version
+  attribute returns success to indicate the two mdev devices are compatible.
+
+  case 2:
+  source side mdev device is of uuid 5ac1fb20-2bbf-4842-bb7e-36c58c3be9cd,
+  its mdev type is i915-GVTg_V5_4. pci id of parent device is 8086-193b.
+  target side mdev device is of uuid 882cc4da-dede-11e7-9180-078a62063ab1,
+  its mdev type is i915-GVTg_V5_4. pci id of parent device is 8086-191b.
+
+  # readlink /sys/bus/pci/devices/0000\:00\:02.0/\
+  5ac1fb20-2bbf-4842-bb7e-36c58c3be9cd/mdev_type
+  ../mdev_supported_types/i915-GVTg_V5_4
+
+  # readlink /sys/bus/pci/devices/0000\:00\:02.0/\
+  882cc4da-dede-11e7-9180-078a62063ab1/mdev_type
+  ../mdev_supported_types/i915-GVTg_V5_4
+
+  (1) read source side mdev device's version.
+  # cat \
+    /sys/bus/pci/devices/0000\:00\:02.0/5ac1fb20-2bbf-4842-bb7e-36c58c3be9cd/\
+    mdev_type/version
+  8086-193b-i915-GVTg_V5_4
+
+  (2) write source side mdev device's version string into target side mdev
+  device's version attribute.
+  # echo 8086-193b-i915-GVTg_V5_4 > \
+   /sys/bus/pci/devices/0000\:00\:02.0/882cc4da-dede-11e7-9180-078a62063ab1/\
+  mdev_type/version
+  -bash: echo: write error: Invalid argument
+
+  In this case, user space's write to target side mdev device's version
+  attribute returns error to indicate the two mdev devices are incompatible.
+  (incompatible because pci ids of the two mdev devices' parent devices are
+  different).
+
+  case 3:
+  source side mdev device is of uuid 5ac1fb20-2bbf-4842-bb7e-36c58c3be9cd,
+  its mdev type is i915-GVTg_V5_4. pci id of parent device is 8086-193b.
+  But vendor driver does not provide version attribute for this device.
+
+  (1) read source side mdev device's version.
+  # cat \
+    /sys/bus/pci/devices/0000\:00\:02.0/5ac1fb20-2bbf-4842-bb7e-36c58c3be9cd/\
+    mdev_type/version
+  cat: '/sys/bus/pci/devices/0000:00:02.0/5ac1fb20-2bbf-4842-bb7e-36c58c3be9cd/\
+  mdev_type/version': No such file or directory
+
+  In this case, user space tries to read the source side mdev device's version
+  attribute, which does not exist. User space regards the two mdev devices as
+  not compatible and will not start migration between the two mdev devices.
+
+
 * [device]
 
   This directory contains links to the devices of type <type-id> that have been
-- 
2.17.1


