* [Nouveau] [PATCH drm-misc-next 0/3] [RFC] DRM GPUVA Manager GPU-VM features
From: Danilo Krummrich @ 2023-08-20 21:53 UTC (permalink / raw)
  To: airlied, daniel, matthew.brost, thomas.hellstrom, sarah.walker,
	donald.robson, boris.brezillon, christian.koenig, faith.ekstrand,
	bskeggs, Liam.Howlett
  Cc: nouveau, linux-kernel, dri-devel

So far the DRM GPUVA manager offers common infrastructure to track GPU VA
allocations and mappings, generically connect GPU VA mappings to their
backing buffers and perform more complex mapping operations on the GPU VA
space.

However, there are more design patterns commonly used by drivers, which
can potentially be generalized in order to make the DRM GPUVA manager
represent a basic GPU-VM implementation. In this context, this patch series
aims at generalizing the following elements.

1) Provide a common dma-resv for GEM objects not being used outside of
   this GPU-VM.

2) Provide tracking of external GEM objects (GEM objects which are
   shared with other GPU-VMs).

3) Provide functions to efficiently lock the dma-resv of all GEM objects
   the GPU-VM contains mappings of.

4) Provide tracking of evicted GEM objects the GPU-VM contains mappings
   of, such that validation of evicted GEM objects is accelerated.

5) Provide some convenience functions for common patterns.

Rather than being designed as a "framework", the target is to make all
features appear as a collection of optional helper functions, such that
drivers are free to make use of the DRM GPUVA manager's basic
functionality and opt in to other features without setting any feature
flags, just by making use of the corresponding functions.
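
As a rough sketch of how this is intended to be used (not part of this
series; the my_* names are placeholders and struct my_job is merely assumed
to carry the job's dma_fence), a driver's submission path could look
roughly like this:

  /* Hypothetical driver submission path using the optional helpers. */
  static int my_driver_submit(struct drm_gpuva_manager *mgr,
                              struct my_job *job)
  {
          int ret;

          /* Lock the dma-resv of the VM's private object and of all
           * external GEM objects tracked by the manager.
           */
          ret = drm_gpuva_manager_lock(mgr, 1, true);
          if (ret)
                  return ret;

          /* Re-validate everything that was evicted since the last
           * submission; this calls the driver's bo_validate callback.
           */
          ret = drm_gpuva_manager_validate(mgr);
          if (ret)
                  goto out_unlock;

          /* ... create and push the job to the scheduler / HW ... */

          /* Attach the job's fence to the private and all extobj resvs. */
          drm_gpuva_manager_resv_add_fence(mgr, job->fence,
                                           DMA_RESV_USAGE_BOOKKEEP,
                                           DMA_RESV_USAGE_BOOKKEEP);

  out_unlock:
          drm_gpuva_manager_unlock(mgr);
          return ret;
  }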

The implementation introduces struct drm_gpuva_gem, which serves as an
abstraction combining a struct drm_gpuva_manager and struct drm_gem_object,
similar to what amdgpu does with struct amdgpu_bo_vm. While this adds a bit
of complexity, it improves the efficiency of tracking evicted GEM objects.
[1] provides an alternative implementation using a maple_tree, resulting in
a somewhat simpler API. [2] points to the full patch series providing the
alternative implementation. [3] points to this patch series.

[1] https://gitlab.freedesktop.org/nouvelles/kernel/-/commit/2a7e1b0ece2c3bba43376783b577d97ae6f6e54f
[2] https://gitlab.freedesktop.org/nouvelles/kernel/-/commits/gpuva-vm-resv
[3] https://gitlab.freedesktop.org/nouvelles/kernel/-/commits/gpuva-vm-resv-vm-bo
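
To illustrate the intended lifetime of struct drm_gpuva_gem described above,
the following sketch (purely illustrative, error handling trimmed, my_*
names are placeholders) shows how a driver could link a new mapping; it
assumes the GEM's GPUVA list lock (or dma-resv) is held and @va is already
initialized:

  static int my_map_bo(struct drm_gpuva_manager *mgr,
                       struct drm_gem_object *obj,
                       struct drm_gpuva *va)
  {
          struct drm_gpuva_gem *vm_bo;
          int ret;

          /* Track the GEM object as external if its dma-resv differs from
           * the GPU-VM's dma-resv; a no-op otherwise.
           */
          ret = drm_gpuva_extobj_insert(mgr, obj);
          if (ret)
                  return ret;

          /* Look up the existing VM/BO combination or allocate a new one. */
          vm_bo = drm_gpuva_gem_obtain(mgr, obj);
          if (IS_ERR(vm_bo))
                  return PTR_ERR(vm_bo);

          ret = drm_gpuva_insert(mgr, va);
          if (!ret)
                  /* Attach the mapping; takes its own vm_bo reference. */
                  drm_gpuva_link(va, vm_bo);

          /* Drop the reference obtained above; drm_gpuva_link() holds its
           * own as long as the mapping exists.
           */
          drm_gpuva_gem_put(vm_bo);
          return ret;
  }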

Danilo Krummrich (3):
  drm: drm_exec: build always builtin
  drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
  drm/nouveau: gpuva mgr dma-resv/extobj handling, GEM validation

 drivers/gpu/drm/Kconfig                 |   6 -
 drivers/gpu/drm/Makefile                |   3 +-
 drivers/gpu/drm/drm_gpuva_mgr.c         | 688 +++++++++++++++++++++++-
 drivers/gpu/drm/nouveau/Kconfig         |   1 -
 drivers/gpu/drm/nouveau/nouveau_bo.c    |   4 +-
 drivers/gpu/drm/nouveau/nouveau_exec.c  |  51 +-
 drivers/gpu/drm/nouveau/nouveau_gem.c   |   4 +-
 drivers/gpu/drm/nouveau/nouveau_sched.h |   2 -
 drivers/gpu/drm/nouveau/nouveau_uvmm.c  | 191 +++++--
 include/drm/drm_gem.h                   |  48 +-
 include/drm/drm_gpuva_mgr.h             | 302 ++++++++++-
 11 files changed, 1161 insertions(+), 139 deletions(-)


base-commit: 25205087df1ffe06ccea9302944ed1f77dc68c6f
-- 
2.41.0


* [Nouveau] [PATCH drm-misc-next 1/3] drm: drm_exec: build always builtin
From: Danilo Krummrich @ 2023-08-20 21:53 UTC (permalink / raw)
  To: airlied, daniel, matthew.brost, thomas.hellstrom, sarah.walker,
	donald.robson, boris.brezillon, christian.koenig, faith.ekstrand,
	bskeggs, Liam.Howlett
  Cc: nouveau, linux-kernel, dri-devel

drm_exec must always be builtin for the DRM GPUVA manager to depend on
it.

Signed-off-by: Danilo Krummrich <dakr@redhat.com>
---
 drivers/gpu/drm/Kconfig         | 6 ------
 drivers/gpu/drm/Makefile        | 3 +--
 drivers/gpu/drm/nouveau/Kconfig | 1 -
 3 files changed, 1 insertion(+), 9 deletions(-)

diff --git a/drivers/gpu/drm/Kconfig b/drivers/gpu/drm/Kconfig
index ab9ef1c20349..85122d4bb1e7 100644
--- a/drivers/gpu/drm/Kconfig
+++ b/drivers/gpu/drm/Kconfig
@@ -210,12 +210,6 @@ config DRM_TTM_KUNIT_TEST
 
           If in doubt, say "N".
 
-config DRM_EXEC
-	tristate
-	depends on DRM
-	help
-	  Execution context for command submissions
-
 config DRM_BUDDY
 	tristate
 	depends on DRM
diff --git a/drivers/gpu/drm/Makefile b/drivers/gpu/drm/Makefile
index 215e78e79125..388e0964a875 100644
--- a/drivers/gpu/drm/Makefile
+++ b/drivers/gpu/drm/Makefile
@@ -23,6 +23,7 @@ drm-y := \
 	drm_dumb_buffers.o \
 	drm_edid.o \
 	drm_encoder.o \
+	drm_exec.o \
 	drm_file.o \
 	drm_fourcc.o \
 	drm_framebuffer.o \
@@ -80,8 +81,6 @@ obj-$(CONFIG_DRM_PANEL_ORIENTATION_QUIRKS) += drm_panel_orientation_quirks.o
 # Memory-management helpers
 #
 #
-obj-$(CONFIG_DRM_EXEC) += drm_exec.o
-
 obj-$(CONFIG_DRM_BUDDY) += drm_buddy.o
 
 drm_dma_helper-y := drm_gem_dma_helper.o
diff --git a/drivers/gpu/drm/nouveau/Kconfig b/drivers/gpu/drm/nouveau/Kconfig
index c52e8096cca4..2dddedac125b 100644
--- a/drivers/gpu/drm/nouveau/Kconfig
+++ b/drivers/gpu/drm/nouveau/Kconfig
@@ -10,7 +10,6 @@ config DRM_NOUVEAU
 	select DRM_KMS_HELPER
 	select DRM_TTM
 	select DRM_TTM_HELPER
-	select DRM_EXEC
 	select DRM_SCHED
 	select I2C
 	select I2C_ALGOBIT
-- 
2.41.0


* [Nouveau] [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
From: Danilo Krummrich @ 2023-08-20 21:53 UTC (permalink / raw)
  To: airlied, daniel, matthew.brost, thomas.hellstrom, sarah.walker,
	donald.robson, boris.brezillon, christian.koenig, faith.ekstrand,
	bskeggs, Liam.Howlett
  Cc: nouveau, linux-kernel, dri-devel

So far the DRM GPUVA manager offers common infrastructure to track GPU VA
allocations and mappings, generically connect GPU VA mappings to their
backing buffers and perform more complex mapping operations on the GPU VA
space.

However, there are more design patterns commonly used by drivers, which
can potentially be generalized in order to make the DRM GPUVA manager
represent a basic GPU-VM implementation. In this context, this patch aims
at generalizing the following elements.

1) Provide a common dma-resv for GEM objects not being used outside of
   this GPU-VM.

2) Provide tracking of external GEM objects (GEM objects which are
   shared with other GPU-VMs).

3) Provide functions to efficiently lock the dma-resv of all GEM objects
   the GPU-VM contains mappings of.

4) Provide tracking of evicted GEM objects the GPU-VM contains mappings
   of, such that validation of evicted GEM objects is accelerated.

5) Provide some convenience functions for common patterns.

Rather than being designed as a "framework", the target is to make all
features appear as a collection of optional helper functions, such that
drivers are free to make use of the DRM GPUVA manager's basic
functionality and opt in to other features without setting any feature
flags, just by making use of the corresponding functions.
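
As a rough usage illustration (not part of this patch; the my_* structures
and callbacks are placeholders), the new *_get()/*_put() helpers could be
wired into a driver's split/merge step callbacks to keep the extobj
reference count in sync with the number of mappings:

  struct my_ctx {
          struct drm_gpuva_manager *mgr;
          struct drm_gpuva *new_va;
          struct drm_gpuva_gem *vm_bo;
  };

  static int my_sm_step_map(struct drm_gpuva_op *op, void *priv)
  {
          struct my_ctx *ctx = priv;

          /* Inserts the new mapping and takes an extobj reference. */
          drm_gpuva_map_get(ctx->mgr, ctx->new_va, &op->map);
          drm_gpuva_link(ctx->new_va, ctx->vm_bo);
          return 0;
  }

  static int my_sm_step_unmap(struct drm_gpuva_op *op, void *priv)
  {
          /* Removes the mapping and drops the extobj reference. */
          drm_gpuva_unmap_put(&op->unmap);
          drm_gpuva_unlink(op->unmap.va);
          return 0;
  }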

Signed-off-by: Danilo Krummrich <dakr@redhat.com>
---
 drivers/gpu/drm/drm_gpuva_mgr.c | 688 +++++++++++++++++++++++++++++++-
 include/drm/drm_gem.h           |  48 ++-
 include/drm/drm_gpuva_mgr.h     | 302 +++++++++++++-
 3 files changed, 1010 insertions(+), 28 deletions(-)

diff --git a/drivers/gpu/drm/drm_gpuva_mgr.c b/drivers/gpu/drm/drm_gpuva_mgr.c
index f86bfad74ff8..69872b205961 100644
--- a/drivers/gpu/drm/drm_gpuva_mgr.c
+++ b/drivers/gpu/drm/drm_gpuva_mgr.c
@@ -655,6 +655,7 @@ drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
 /**
  * drm_gpuva_manager_init() - initialize a &drm_gpuva_manager
  * @mgr: pointer to the &drm_gpuva_manager to initialize
+ * @drm: the drivers &drm_device
  * @name: the name of the GPU VA space
  * @start_offset: the start offset of the GPU VA space
  * @range: the size of the GPU VA space
@@ -669,6 +670,7 @@ drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
  */
 void
 drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
+		       struct drm_device *drm,
 		       const char *name,
 		       u64 start_offset, u64 range,
 		       u64 reserve_offset, u64 reserve_range,
@@ -677,6 +679,11 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
 	mgr->rb.tree = RB_ROOT_CACHED;
 	INIT_LIST_HEAD(&mgr->rb.list);
 
+	mt_init(&mgr->mt_ext);
+
+	INIT_LIST_HEAD(&mgr->evict.list);
+	spin_lock_init(&mgr->evict.lock);
+
 	drm_gpuva_check_overflow(start_offset, range);
 	mgr->mm_start = start_offset;
 	mgr->mm_range = range;
@@ -694,6 +701,9 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
 						     reserve_range)))
 			__drm_gpuva_insert(mgr, &mgr->kernel_alloc_node);
 	}
+
+	drm_gem_private_object_init(drm, &mgr->d_obj, 0);
+	mgr->resv = mgr->d_obj.resv;
 }
 EXPORT_SYMBOL_GPL(drm_gpuva_manager_init);
 
@@ -713,10 +723,575 @@ drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr)
 		__drm_gpuva_remove(&mgr->kernel_alloc_node);
 
 	WARN(!RB_EMPTY_ROOT(&mgr->rb.tree.rb_root),
-	     "GPUVA tree is not empty, potentially leaking memory.");
+	     "GPUVA tree is not empty, potentially leaking memory.\n");
+
+	mtree_destroy(&mgr->mt_ext);
+	WARN(!list_empty(&mgr->evict.list), "Evict list should be empty.\n");
+
+	drm_gem_private_object_fini(&mgr->d_obj);
 }
 EXPORT_SYMBOL_GPL(drm_gpuva_manager_destroy);
 
+/**
+ * drm_gpuva_manager_prepare_objects() - prepare all associated BOs
+ * @mgr: the &drm_gpuva_manager
+ * @num_fences: the amount of &dma_fences to reserve
+ *
+ * Calls drm_exec_prepare_obj() for all &drm_gem_objects the given
+ * &drm_gpuva_manager contains mappings of.
+ *
+ * Drivers can obtain the corresponding &drm_exec instance through
+ * DRM_GPUVA_EXEC(). It is the driver's responsibility to call drm_exec_init()
+ * and drm_exec_fini() accordingly.
+ *
+ * Returns: 0 on success, negative error code on failure.
+ */
+int
+drm_gpuva_manager_prepare_objects(struct drm_gpuva_manager *mgr,
+				  unsigned int num_fences)
+{
+	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
+	MA_STATE(mas, &mgr->mt_ext, 0, 0);
+	union {
+		void *ptr;
+		uintptr_t cnt;
+	} ref;
+	int ret;
+
+	ret = drm_exec_prepare_obj(exec, &mgr->d_obj, num_fences);
+	if (ret)
+		goto out;
+
+	rcu_read_lock();
+	mas_for_each(&mas, ref.ptr, ULONG_MAX) {
+		struct drm_gem_object *obj;
+
+		mas_pause(&mas);
+		rcu_read_unlock();
+
+		obj = (struct drm_gem_object *)(uintptr_t)mas.index;
+		ret = drm_exec_prepare_obj(exec, obj, num_fences);
+		if (ret)
+			goto out;
+
+		rcu_read_lock();
+	}
+	rcu_read_unlock();
+
+out:
+	return ret;
+}
+EXPORT_SYMBOL_GPL(drm_gpuva_manager_prepare_objects);
+
+/**
+ * drm_gpuva_manager_lock_extra() - lock all dma-resv of all associated BOs
+ * @mgr: the &drm_gpuva_manager
+ * @fn: callback received by the driver to lock additional dma-resv
+ * @priv: private driver data passed to @fn
+ * @num_fences: the amount of &dma_fences to reserve
+ * @interruptible: sleep interruptible if waiting
+ *
+ * Acquires all dma-resv locks of all &drm_gem_objects the given
+ * &drm_gpuva_manager contains mappings of.
+ *
+ * Additionally, when calling this function the driver receives the given @fn
+ * callback to lock additional dma-resv in the context of the
+ * &drm_gpuva_managers &drm_exec instance. Typically, drivers would call
+ * drm_exec_prepare_obj() from within this callback.
+ *
+ * Returns: 0 on success, negative error code on failure.
+ */
+int
+drm_gpuva_manager_lock_extra(struct drm_gpuva_manager *mgr,
+			     int (*fn)(struct drm_gpuva_manager *mgr,
+				       void *priv, unsigned int num_fences),
+			     void *priv,
+			     unsigned int num_fences,
+			     bool interruptible)
+{
+	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
+	uint32_t flags;
+	int ret;
+
+	flags = (interruptible ? DRM_EXEC_INTERRUPTIBLE_WAIT : 0) |
+		DRM_EXEC_IGNORE_DUPLICATES;
+
+	drm_exec_init(exec, flags);
+
+	drm_exec_until_all_locked(exec) {
+		ret = drm_gpuva_manager_prepare_objects(mgr, num_fences);
+		drm_exec_retry_on_contention(exec);
+		if (ret)
+			goto err;
+
+		if (fn) {
+			ret = fn(mgr, priv, num_fences);
+			drm_exec_retry_on_contention(exec);
+			if (ret)
+				goto err;
+		}
+	}
+
+	return 0;
+
+err:
+	drm_exec_fini(exec);
+	return ret;
+}
+EXPORT_SYMBOL_GPL(drm_gpuva_manager_lock_extra);
+
+static int
+fn_lock_array(struct drm_gpuva_manager *mgr, void *priv,
+				unsigned int num_fences)
+{
+	struct {
+		struct drm_gem_object **objs;
+		unsigned int num_objs;
+	} *args = priv;
+
+	return drm_exec_prepare_array(DRM_GPUVA_EXEC(mgr), args->objs,
+				      args->num_objs, num_fences);
+}
+
+/**
+ * drm_gpuva_manager_lock_array() - lock all dma-resv of all associated BOs
+ * @mgr: the &drm_gpuva_manager
+ * @objs: additional &drm_gem_objects to lock
+ * @num_objs: the number of additional &drm_gem_objects to lock
+ * @num_fences: the amount of &dma_fences to reserve
+ * @interruptible: sleep interruptible if waiting
+ *
+ * Acquires all dma-resv locks of all &drm_gem_objects the given
+ * &drm_gpuva_manager contains mappings of, plus the ones given through @objs.
+ *
+ * Returns: 0 on success, negative error code on failure.
+ */
+int
+drm_gpuva_manager_lock_array(struct drm_gpuva_manager *mgr,
+			     struct drm_gem_object **objs,
+			     unsigned int num_objs,
+			     unsigned int num_fences,
+			     bool interruptible)
+{
+	struct {
+		struct drm_gem_object **objs;
+		unsigned int num_objs;
+	} args;
+
+	args.objs = objs;
+	args.num_objs = num_objs;
+
+	return drm_gpuva_manager_lock_extra(mgr, fn_lock_array, &args,
+					    num_fences, interruptible);
+}
+EXPORT_SYMBOL_GPL(drm_gpuva_manager_lock_array);
+
+/**
+ * drm_gpuva_manager_validate() - validate all BOs marked as evicted
+ * @mgr: the &drm_gpuva_manager to validate evicted BOs
+ *
+ * Calls the &drm_gpuva_fn_ops.bo_validate callback for all evicted buffer
+ * objects being mapped in the given &drm_gpuva_manager.
+ *
+ * Returns: 0 on success, negative error code on failure.
+ */
+int
+drm_gpuva_manager_validate(struct drm_gpuva_manager *mgr)
+{
+	const struct drm_gpuva_fn_ops *ops = mgr->ops;
+	struct drm_gpuva_gem *vm_bo;
+	int ret;
+
+	if (unlikely(!ops || !ops->bo_validate))
+		return -ENOTSUPP;
+
+	/* At this point we should hold all dma-resv locks of all GEM objects
+	 * associated with this GPU-VM, hence it is safe to walk the list.
+	 */
+	list_for_each_entry(vm_bo, &mgr->evict.list, list.entry.evict) {
+		dma_resv_assert_held(vm_bo->obj->resv);
+
+		ret = ops->bo_validate(vm_bo->obj);
+		if (ret)
+			return ret;
+	}
+
+	return 0;
+}
+EXPORT_SYMBOL_GPL(drm_gpuva_manager_validate);
+
+/**
+ * drm_gpuva_manager_resv_add_fence - add fence to private and all extobj
+ * dma-resv
+ * @mgr: the &drm_gpuva_manager to add a fence to
+ * @fence: fence to add
+ * @private_usage: private dma-resv usage
+ * @extobj_usage: extobj dma-resv usage
+ */
+void
+drm_gpuva_manager_resv_add_fence(struct drm_gpuva_manager *mgr,
+				 struct dma_fence *fence,
+				 enum dma_resv_usage private_usage,
+				 enum dma_resv_usage extobj_usage)
+{
+	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
+	struct drm_gem_object *obj;
+	unsigned long index;
+
+	drm_exec_for_each_locked_object(exec, index, obj) {
+			dma_resv_assert_held(obj->resv);
+			dma_resv_add_fence(obj->resv, fence,
+					   drm_gpuva_is_extobj(mgr, obj) ?
+					   extobj_usage : private_usage);
+	}
+}
+EXPORT_SYMBOL_GPL(drm_gpuva_manager_resv_add_fence);
+
+static struct drm_gpuva_gem *
+__drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
+		     struct drm_gem_object *obj)
+{
+	struct drm_gpuva_gem *vm_bo;
+
+	drm_gem_gpuva_assert_lock_held(obj);
+
+	drm_gem_for_each_gpuva_gem(vm_bo, obj)
+		if (vm_bo->mgr == mgr)
+			return vm_bo;
+
+	return NULL;
+}
+
+/**
+ * drm_gpuva_gem_create() - create a new instance of struct drm_gpuva_gem
+ * @mgr: The &drm_gpuva_manager the @obj is mapped in.
+ * @obj: The &drm_gem_object being mapped in the @mgr.
+ *
+ * If provided by the driver, this function uses the &drm_gpuva_fn_ops
+ * vm_bo_alloc() callback to allocate.
+ *
+ * Returns: a pointer to the &drm_gpuva_gem on success, NULL on failure
+ */
+struct drm_gpuva_gem *
+drm_gpuva_gem_create(struct drm_gpuva_manager *mgr,
+		     struct drm_gem_object *obj)
+{
+	const struct drm_gpuva_fn_ops *ops = mgr->ops;
+	struct drm_gpuva_gem *vm_bo;
+
+	if (ops && ops->vm_bo_alloc)
+		vm_bo = ops->vm_bo_alloc();
+	else
+		vm_bo = kzalloc(sizeof(*vm_bo), GFP_KERNEL);
+
+	if (unlikely(!vm_bo))
+		return NULL;
+
+	vm_bo->mgr = mgr;
+	vm_bo->obj = obj;
+
+	kref_init(&vm_bo->kref);
+	INIT_LIST_HEAD(&vm_bo->list.gpuva);
+	INIT_LIST_HEAD(&vm_bo->list.entry.gem);
+	INIT_LIST_HEAD(&vm_bo->list.entry.evict);
+
+	drm_gem_object_get(obj);
+
+	return vm_bo;
+}
+EXPORT_SYMBOL_GPL(drm_gpuva_gem_create);
+
+void
+drm_gpuva_gem_destroy(struct kref *kref)
+{
+	struct drm_gpuva_gem *vm_bo = container_of(kref, struct drm_gpuva_gem,
+						   kref);
+	const struct drm_gpuva_fn_ops *ops = vm_bo->mgr->ops;
+
+	drm_gem_object_put(vm_bo->obj);
+
+	if (ops && ops->vm_bo_free)
+		ops->vm_bo_free(vm_bo);
+	else
+		kfree(vm_bo);
+}
+EXPORT_SYMBOL_GPL(drm_gpuva_gem_destroy);
+
+/**
+ * drm_gpuva_gem_find() - find the &drm_gpuva_gem for the given
+ * &drm_gpuva_manager and &drm_gem_object
+ * @mgr: The &drm_gpuva_manager the @obj is mapped in.
+ * @obj: The &drm_gem_object being mapped in the @mgr.
+ *
+ * Find the &drm_gpuva_gem representing the combination of the given
+ * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
+ * count of the &drm_gpuva_gem accordingly.
+ *
+ * Returns: a pointer to the &drm_gpuva_gem on success, NULL on failure
+ */
+struct drm_gpuva_gem *
+drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
+		   struct drm_gem_object *obj)
+{
+	struct drm_gpuva_gem *vm_bo = __drm_gpuva_gem_find(mgr, obj);
+
+	return vm_bo ? drm_gpuva_gem_get(vm_bo) : NULL;
+}
+EXPORT_SYMBOL_GPL(drm_gpuva_gem_find);
+
+/**
+ * drm_gpuva_gem_obtain() - obtains an instance of the &drm_gpuva_gem for the
+ * given &drm_gpuva_manager and &drm_gem_object
+ * @mgr: The &drm_gpuva_manager the @obj is mapped in.
+ * @obj: The &drm_gem_object being mapped in the @mgr.
+ *
+ * Find the &drm_gpuva_gem representing the combination of the given
+ * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
+ * count of the &drm_gpuva_gem accordingly. If not found, allocates a new
+ * &drm_gpuva_gem.
+ *
+ * Returns: a pointer to the &drm_gpuva_gem on success, an ERR_PTR on failure
+ */
+struct drm_gpuva_gem *
+drm_gpuva_gem_obtain(struct drm_gpuva_manager *mgr,
+		     struct drm_gem_object *obj)
+{
+	struct drm_gpuva_gem *vm_bo;
+
+	vm_bo = drm_gpuva_gem_find(mgr, obj);
+	if (vm_bo)
+		return vm_bo;
+
+	vm_bo = drm_gpuva_gem_create(mgr, obj);
+	if (!vm_bo)
+		return ERR_PTR(-ENOMEM);
+
+	return vm_bo;
+}
+EXPORT_SYMBOL_GPL(drm_gpuva_gem_obtain);
+
+/**
+ * drm_gpuva_gem_obtain_prealloc() - obtains an instance of the &drm_gpuva_gem
+ * for the given &drm_gpuva_manager and &drm_gem_object
+ * @mgr: The &drm_gpuva_manager the @obj is mapped in.
+ * @obj: The &drm_gem_object being mapped in the @mgr.
+ *
+ * Find the &drm_gpuva_gem representing the combination of the given
+ * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
+ * count of the found &drm_gpuva_gem accordingly, while the @__vm_bo reference
+ * count is decreased. If not found @__vm_bo is returned.
+ *
+ * Returns: a pointer to the found &drm_gpuva_gem or @__vm_bo if no existing
+ * &drm_gpuva_gem was found
+ */
+struct drm_gpuva_gem *
+drm_gpuva_gem_obtain_prealloc(struct drm_gpuva_manager *mgr,
+			      struct drm_gem_object *obj,
+			      struct drm_gpuva_gem *__vm_bo)
+{
+	struct drm_gpuva_gem *vm_bo;
+
+	vm_bo = drm_gpuva_gem_find(mgr, obj);
+	if (vm_bo) {
+		drm_gpuva_gem_put(__vm_bo);
+		return vm_bo;
+	}
+
+	return __vm_bo;
+}
+EXPORT_SYMBOL_GPL(drm_gpuva_gem_obtain_prealloc);
+
+static int
+__drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
+			  struct drm_gem_object *obj,
+			  gfp_t gfp)
+{
+	MA_STATE(mas, &mgr->mt_ext, 0, 0);
+	union {
+		struct drm_gem_object *obj;
+		uintptr_t index;
+	} gem;
+	union {
+		void *ptr;
+		uintptr_t cnt;
+	} ref;
+	int ret = 0;
+
+	gem.obj = obj;
+	mas_set(&mas, gem.index);
+
+	mas_lock(&mas);
+	ref.ptr = mas_walk(&mas);
+	if (ref.ptr) {
+		++ref.cnt;
+		mas_store(&mas, ref.ptr);
+	} else {
+		if (unlikely(!gfp)) {
+			ret = -EINVAL;
+			goto out;
+		}
+
+		mas_set(&mas, gem.index);
+		ref.cnt = 1;
+		ret = mas_store_gfp(&mas, ref.ptr, gfp);
+		if (likely(!ret))
+			drm_gem_object_get(obj);
+	}
+out:
+	mas_unlock(&mas);
+	return ret;
+}
+
+static void
+__drm_gpuva_extobj_remove(struct drm_gpuva_manager *mgr,
+			  struct drm_gem_object *obj)
+{
+	MA_STATE(mas, &mgr->mt_ext, 0, 0);
+	union {
+		struct drm_gem_object *obj;
+		uintptr_t index;
+	} gem;
+	union {
+		void *ptr;
+		uintptr_t cnt;
+	} ref;
+
+	gem.obj = obj;
+	mas_set(&mas, gem.index);
+
+	mas_lock(&mas);
+	if (unlikely(!(ref.ptr = mas_walk(&mas))))
+		goto out;
+
+	if (!--ref.cnt) {
+		mas_erase(&mas);
+		drm_gem_object_put(obj);
+	} else {
+		mas_store(&mas, ref.ptr);
+	}
+out:
+	mas_unlock(&mas);
+}
+
+/**
+ * drm_gpuva_extobj_insert - insert an external &drm_gem_object
+ * @mgr: the &drm_gpuva_manager to insert into
+ * @obj: the &drm_gem_object to insert as extobj
+ *
+ * Insert a &drm_gem_object into the &drm_gpuva_managers external object tree.
+ * If the &drm_gem_object already exists in the tree, the reference counter
+ * of this external object is increased by one.
+ *
+ * Drivers should insert the external &drm_gem_object before the dma-fence
+ * signalling critical section, e.g. when submitting the job, and before
+ * locking all &drm_gem_objects of a GPU-VM, e.g. with drm_gpuva_manager_lock()
+ * or its derivatives.
+ *
+ * Returns: 0 on success, negative error code on failure.
+ */
+int
+drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
+			struct drm_gem_object *obj)
+{
+	return drm_gpuva_is_extobj(mgr, obj) ?
+		__drm_gpuva_extobj_insert(mgr, obj, GFP_KERNEL) : 0;
+
+}
+EXPORT_SYMBOL_GPL(drm_gpuva_extobj_insert);
+
+/**
+ * drm_gpuva_extobj_get - increase the reference count of an external
+ * &drm_gem_object
+ * @mgr: the &drm_gpuva_manager storing the extobj
+ * @obj: the &drm_gem_object representing the extobj
+ *
+ * Increases the reference count of the extobj represented by @obj.
+ *
+ * Drivers should call this for every &drm_gpuva backed by a &drm_gem_object
+ * being inserted.
+ *
+ * For &drm_gpuva_op_remap operations drivers should make sure to only take an
+ * additional reference if the re-map operation splits an existing &drm_gpuva
+ * into two separate ones.
+ *
+ * See also drm_gpuva_map_get() and drm_gpuva_remap_get().
+ *
+ * Returns: 0 on success, negative error code on failure.
+ */
+void
+drm_gpuva_extobj_get(struct drm_gpuva_manager *mgr,
+		     struct drm_gem_object *obj)
+{
+	if (drm_gpuva_is_extobj(mgr, obj))
+		WARN(__drm_gpuva_extobj_insert(mgr, obj, 0),
+		     "Can't increase ref-count of non-existent extobj.");
+}
+EXPORT_SYMBOL_GPL(drm_gpuva_extobj_get);
+
+/**
+ * drm_gpuva_extobj_put - decrease the reference count of an external
+ * &drm_gem_object
+ * @mgr: the &drm_gpuva_manager storing the extobj
+ * @obj: the &drm_gem_object representing the extobj
+ *
+ * Decreases the reference count of the extobj represented by @obj.
+ *
+ * Drivers should call this for every &drm_gpuva backed by a &drm_gem_object
+ * being removed from the GPU VA space.
+ *
+ * See also drm_gpuva_unmap_put().
+ *
+ * Returns: 0 on success, negative error code on failure.
+ */
+void
+drm_gpuva_extobj_put(struct drm_gpuva_manager *mgr,
+		     struct drm_gem_object *obj)
+{
+	if (drm_gpuva_is_extobj(mgr, obj))
+		__drm_gpuva_extobj_remove(mgr, obj);
+}
+EXPORT_SYMBOL_GPL(drm_gpuva_extobj_put);
+
+/**
+ * drm_gpuva_gem_evict() - add / remove a &drm_gem_object to / from a
+ * &drm_gpuva_managers evicted list
+ * @obj: the &drm_gem_object to add or remove
+ * @evict: indicates whether the object is evicted
+ *
+ * Adds a &drm_gem_object to or removes it from all &drm_gpuva_managers evicted
+ * list containing a mapping of this &drm_gem_object.
+ */
+void
+drm_gpuva_gem_evict(struct drm_gem_object *obj, bool evict)
+{
+	struct drm_gpuva_gem *vm_bo;
+
+	/* Required for iterating the GEMs GPUVA GEM list. If no driver specific
+	 * lock has been set, the list is protected with the GEMs dma-resv lock.
+	 */
+	drm_gem_gpuva_assert_lock_held(obj);
+
+	/* Required to protect the GPUVA managers evict list against concurrent
+	 * access through drm_gpuva_manager_validate(). Concurrent insertions to
+	 * the evict list through different GEM object evictions are protected
+	 * by the GPUVA managers evict lock.
+	 */
+	dma_resv_assert_held(obj->resv);
+
+	drm_gem_for_each_gpuva_gem(vm_bo, obj) {
+		struct drm_gpuva_manager *mgr = vm_bo->mgr;
+
+		spin_lock(&mgr->evict.lock);
+		if (evict)
+			list_add_tail(&vm_bo->list.entry.evict,
+				      &mgr->evict.list);
+		else
+			list_del_init(&vm_bo->list.entry.evict);
+		spin_unlock(&mgr->evict.lock);
+	}
+}
+EXPORT_SYMBOL_GPL(drm_gpuva_gem_evict);
+
 static int
 __drm_gpuva_insert(struct drm_gpuva_manager *mgr,
 		   struct drm_gpuva *va)
@@ -806,15 +1381,20 @@ EXPORT_SYMBOL_GPL(drm_gpuva_remove);
 /**
  * drm_gpuva_link() - link a &drm_gpuva
  * @va: the &drm_gpuva to link
+ * @vm_bo: the &drm_gpuva_gem to add the &drm_gpuva to
  *
- * This adds the given &va to the GPU VA list of the &drm_gem_object it is
- * associated with.
+ * This adds the given &va to the GPU VA list of the &drm_gpuva_gem and the
+ * &drm_gpuva_gem to the &drm_gem_object it is associated with.
+ *
+ * For every &drm_gpuva entry added to the &drm_gpuva_gem an additional
+ * reference of the latter is taken.
  *
  * This function expects the caller to protect the GEM's GPUVA list against
- * concurrent access using the GEMs dma_resv lock.
+ * concurrent access using either the GEMs dma_resv lock or a driver specific
+ * lock set through drm_gem_gpuva_set_lock().
  */
 void
-drm_gpuva_link(struct drm_gpuva *va)
+drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo)
 {
 	struct drm_gem_object *obj = va->gem.obj;
 
@@ -823,7 +1403,10 @@ drm_gpuva_link(struct drm_gpuva *va)
 
 	drm_gem_gpuva_assert_lock_held(obj);
 
-	list_add_tail(&va->gem.entry, &obj->gpuva.list);
+	drm_gpuva_gem_get(vm_bo);
+	list_add_tail(&va->gem.entry, &vm_bo->list.gpuva);
+	if (list_empty(&vm_bo->list.entry.gem))
+		list_add_tail(&vm_bo->list.entry.gem, &obj->gpuva.list);
 }
 EXPORT_SYMBOL_GPL(drm_gpuva_link);
 
@@ -834,20 +1417,39 @@ EXPORT_SYMBOL_GPL(drm_gpuva_link);
  * This removes the given &va from the GPU VA list of the &drm_gem_object it is
  * associated with.
  *
+ * This removes the given &va from the GPU VA list of the &drm_gpuva_gem and
+ * the &drm_gpuva_gem from the &drm_gem_object it is associated with in case
+ * this call unlinks the last &drm_gpuva from the &drm_gpuva_gem.
+ *
+ * For every &drm_gpuva entry removed from the &drm_gpuva_gem a reference of
+ * the latter is dropped.
+ *
  * This function expects the caller to protect the GEM's GPUVA list against
- * concurrent access using the GEMs dma_resv lock.
+ * concurrent access using either the GEMs dma_resv lock or a driver specific
+ * lock set through drm_gem_gpuva_set_lock().
  */
 void
 drm_gpuva_unlink(struct drm_gpuva *va)
 {
 	struct drm_gem_object *obj = va->gem.obj;
+	struct drm_gpuva_gem *vm_bo;
 
 	if (unlikely(!obj))
 		return;
 
 	drm_gem_gpuva_assert_lock_held(obj);
 
+	vm_bo = __drm_gpuva_gem_find(va->mgr, obj);
+	if (WARN(!vm_bo, "GPUVA doesn't seem to be linked.\n"))
+		return;
+
 	list_del_init(&va->gem.entry);
+
+	if (list_empty(&vm_bo->list.gpuva)) {
+		list_del_init(&vm_bo->list.entry.gem);
+		list_del_init(&vm_bo->list.entry.evict);
+	}
+	drm_gpuva_gem_put(vm_bo);
 }
 EXPORT_SYMBOL_GPL(drm_gpuva_unlink);
 
@@ -977,6 +1579,26 @@ drm_gpuva_map(struct drm_gpuva_manager *mgr,
 }
 EXPORT_SYMBOL_GPL(drm_gpuva_map);
 
+/**
+ * drm_gpuva_map_get() - helper to insert a &drm_gpuva according to a
+ * &drm_gpuva_op_map
+ * @mgr: the &drm_gpuva_manager
+ * @va: the &drm_gpuva to insert
+ * @op: the &drm_gpuva_op_map to initialize @va with
+ *
+ * Initializes the @va from the @op and inserts it into the given @mgr and
+ * increases the reference count of the corresponding extobj.
+ */
+void
+drm_gpuva_map_get(struct drm_gpuva_manager *mgr,
+		  struct drm_gpuva *va,
+		  struct drm_gpuva_op_map *op)
+{
+	drm_gpuva_map(mgr, va, op);
+	drm_gpuva_extobj_get(mgr, va->gem.obj);
+}
+EXPORT_SYMBOL_GPL(drm_gpuva_map_get);
+
 /**
  * drm_gpuva_remap() - helper to remap a &drm_gpuva according to a
  * &drm_gpuva_op_remap
@@ -992,10 +1614,10 @@ drm_gpuva_remap(struct drm_gpuva *prev,
 		struct drm_gpuva *next,
 		struct drm_gpuva_op_remap *op)
 {
-	struct drm_gpuva *curr = op->unmap->va;
-	struct drm_gpuva_manager *mgr = curr->mgr;
+	struct drm_gpuva *va = op->unmap->va;
+	struct drm_gpuva_manager *mgr = va->mgr;
 
-	drm_gpuva_remove(curr);
+	drm_gpuva_remove(va);
 
 	if (op->prev) {
 		drm_gpuva_init_from_op(prev, op->prev);
@@ -1009,6 +1631,31 @@ drm_gpuva_remap(struct drm_gpuva *prev,
 }
 EXPORT_SYMBOL_GPL(drm_gpuva_remap);
 
+/**
+ * drm_gpuva_remap_get() - helper to remap a &drm_gpuva according to a
+ * &drm_gpuva_op_remap
+ * @prev: the &drm_gpuva to remap when keeping the start of a mapping
+ * @next: the &drm_gpuva to remap when keeping the end of a mapping
+ * @op: the &drm_gpuva_op_remap to initialize @prev and @next with
+ *
+ * Removes the currently mapped &drm_gpuva and remaps it using @prev and/or
+ * @next. Additionally, if the re-map splits the existing &drm_gpuva into two
+ * separate mappings, increases the reference count of the corresponding extobj.
+ */
+void
+drm_gpuva_remap_get(struct drm_gpuva *prev,
+		    struct drm_gpuva *next,
+		    struct drm_gpuva_op_remap *op)
+{
+	struct drm_gpuva *va = op->unmap->va;
+	struct drm_gpuva_manager *mgr = va->mgr;
+
+	drm_gpuva_remap(prev, next, op);
+	if (op->prev && op->next)
+		drm_gpuva_extobj_get(mgr, va->gem.obj);
+}
+EXPORT_SYMBOL_GPL(drm_gpuva_remap_get);
+
 /**
  * drm_gpuva_unmap() - helper to remove a &drm_gpuva according to a
  * &drm_gpuva_op_unmap
@@ -1023,6 +1670,24 @@ drm_gpuva_unmap(struct drm_gpuva_op_unmap *op)
 }
 EXPORT_SYMBOL_GPL(drm_gpuva_unmap);
 
+/**
+ * drm_gpuva_unmap_put() - helper to remove a &drm_gpuva according to a
+ * &drm_gpuva_op_unmap
+ * @op: the &drm_gpuva_op_unmap specifying the &drm_gpuva to remove
+ *
+ * Removes the &drm_gpuva associated with the &drm_gpuva_op_unmap and decreases
+ * the reference count of the corresponding extobj.
+ */
+void
+drm_gpuva_unmap_put(struct drm_gpuva_op_unmap *op)
+{
+	struct drm_gpuva *va = op->va;
+
+	drm_gpuva_unmap(op);
+	drm_gpuva_extobj_put(va->mgr, va->gem.obj);
+}
+EXPORT_SYMBOL_GPL(drm_gpuva_unmap_put);
+
 static int
 op_map_cb(const struct drm_gpuva_fn_ops *fn, void *priv,
 	  u64 addr, u64 range,
@@ -1663,6 +2328,7 @@ drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
 {
 	struct drm_gpuva_ops *ops;
 	struct drm_gpuva_op *op;
+	struct drm_gpuva_gem *vm_bo;
 	struct drm_gpuva *va;
 	int ret;
 
@@ -1674,7 +2340,7 @@ drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
 
 	INIT_LIST_HEAD(&ops->list);
 
-	drm_gem_for_each_gpuva(va, obj) {
+	drm_gem_for_each_gpuva(va, vm_bo, mgr, obj) {
 		op = gpuva_op_alloc(mgr);
 		if (!op) {
 			ret = -ENOMEM;
diff --git a/include/drm/drm_gem.h b/include/drm/drm_gem.h
index bc9f6aa2f3fe..783ed3ab440d 100644
--- a/include/drm/drm_gem.h
+++ b/include/drm/drm_gem.h
@@ -571,7 +571,7 @@ int drm_gem_evict(struct drm_gem_object *obj);
  * drm_gem_gpuva_init() - initialize the gpuva list of a GEM object
  * @obj: the &drm_gem_object
  *
- * This initializes the &drm_gem_object's &drm_gpuva list.
+ * This initializes the &drm_gem_object's &drm_gpuva_gem list.
  *
  * Calling this function is only necessary for drivers intending to support the
  * &drm_driver_feature DRIVER_GEM_GPUVA.
@@ -584,28 +584,44 @@ static inline void drm_gem_gpuva_init(struct drm_gem_object *obj)
 }
 
 /**
- * drm_gem_for_each_gpuva() - iternator to walk over a list of gpuvas
- * @entry__: &drm_gpuva structure to assign to in each iteration step
- * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
+ * drm_gem_for_each_gpuva_gem() - iterator to walk over a list of &drm_gpuva_gem
+ * @entry__: &drm_gpuva_gem structure to assign to in each iteration step
+ * @obj__: the &drm_gem_object the &drm_gpuva_gem to walk are associated with
  *
- * This iterator walks over all &drm_gpuva structures associated with the
- * &drm_gpuva_manager.
+ * This iterator walks over all &drm_gpuva_gem structures associated with the
+ * &drm_gem_object.
  */
-#define drm_gem_for_each_gpuva(entry__, obj__) \
-	list_for_each_entry(entry__, &(obj__)->gpuva.list, gem.entry)
+#define drm_gem_for_each_gpuva_gem(entry__, obj__) \
+	list_for_each_entry(entry__, &(obj__)->gpuva.list, list.entry.gem)
 
 /**
- * drm_gem_for_each_gpuva_safe() - iternator to safely walk over a list of
- * gpuvas
- * @entry__: &drm_gpuva structure to assign to in each iteration step
- * @next__: &next &drm_gpuva to store the next step
- * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
+ * drm_gem_for_each_gpuva_gem_safe() - iterator to safely walk over a list of
+ * &drm_gpuva_gem
+ * @entry__: &drm_gpuva_gem structure to assign to in each iteration step
+ * @next__: &next &drm_gpuva_gem to store the next step
+ * @obj__: the &drm_gem_object the &drm_gpuva_gem to walk are associated with
  *
- * This iterator walks over all &drm_gpuva structures associated with the
+ * This iterator walks over all &drm_gpuva_gem structures associated with the
  * &drm_gem_object. It is implemented with list_for_each_entry_safe(), hence
  * it is save against removal of elements.
  */
-#define drm_gem_for_each_gpuva_safe(entry__, next__, obj__) \
-	list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list, gem.entry)
+#define drm_gem_for_each_gpuva_gem_safe(entry__, next__, obj__) \
+	list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list, list.entry.gem)
+
+/**
+ * drm_gem_for_each_gpuva() - iterator to walk over a list of &drm_gpuva
+ * @va__: &drm_gpuva structure to assign to in each iteration step
+ * @vm_bo__: the &drm_gpuva_gem representing the @mgr__ and @obj__ combination
+ * @mgr__: the &drm_gpuva_manager the &drm_gpuvas to walk are associated with
+ * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
+ *
+ * This iterator walks over all &drm_gpuva structures associated with the
+ * &drm_gpuva_manager and &drm_gem_object.
+ */
+#define drm_gem_for_each_gpuva(va__, vm_bo__, mgr__, obj__) \
+	for (vm_bo__ = drm_gpuva_gem_find(mgr__, obj__), \
+	     va__ = vm_bo__ ? list_first_entry(&vm_bo__->list.gpuva, typeof(*va__), gem.entry) : NULL; \
+	     va__ && !list_entry_is_head(va__, &vm_bo__->list.gpuva, gem.entry); \
+	     va__ = list_next_entry(va__, gem.entry))
 
 #endif /* __DRM_GEM_H__ */
diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h
index ed8d50200cc3..693e2da3f425 100644
--- a/include/drm/drm_gpuva_mgr.h
+++ b/include/drm/drm_gpuva_mgr.h
@@ -26,12 +26,16 @@
  */
 
 #include <linux/list.h>
+#include <linux/dma-resv.h>
+#include <linux/maple_tree.h>
 #include <linux/rbtree.h>
 #include <linux/types.h>
 
 #include <drm/drm_gem.h>
+#include <drm/drm_exec.h>
 
 struct drm_gpuva_manager;
+struct drm_gpuva_gem;
 struct drm_gpuva_fn_ops;
 
 /**
@@ -140,7 +144,7 @@ struct drm_gpuva {
 int drm_gpuva_insert(struct drm_gpuva_manager *mgr, struct drm_gpuva *va);
 void drm_gpuva_remove(struct drm_gpuva *va);
 
-void drm_gpuva_link(struct drm_gpuva *va);
+void drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo);
 void drm_gpuva_unlink(struct drm_gpuva *va);
 
 struct drm_gpuva *drm_gpuva_find(struct drm_gpuva_manager *mgr,
@@ -240,15 +244,137 @@ struct drm_gpuva_manager {
 	 * @ops: &drm_gpuva_fn_ops providing the split/merge steps to drivers
 	 */
 	const struct drm_gpuva_fn_ops *ops;
+
+	/**
+	 * @d_obj: Dummy GEM object; used internally to pass the GPU VMs
+	 * dma-resv to &drm_exec.
+	 */
+	struct drm_gem_object d_obj;
+
+	/**
+	 * @resv: the &dma_resv for &drm_gem_objects mapped in this GPU VA
+	 * space
+	 */
+	struct dma_resv *resv;
+
+	/**
+	 * @exec: the &drm_exec helper to lock external &drm_gem_objects
+	 */
+	struct drm_exec exec;
+
+	/**
+	 * @mt_ext: &maple_tree storing external &drm_gem_objects
+	 */
+	struct maple_tree mt_ext;
+
+	/**
+	 * @evict: structure holding the evict list and evict list lock
+	 */
+	struct {
+		/**
+		 * @list: &list_head storing &drm_gem_objects currently being
+		 * evicted
+		 */
+		struct list_head list;
+
+		/**
+		 * @lock: spinlock to protect the evict list against concurrent
+		 * insertion / removal of different &drm_gpuva_gems
+		 */
+		spinlock_t lock;
+	} evict;
 };
 
 void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
+			    struct drm_device *drm,
 			    const char *name,
 			    u64 start_offset, u64 range,
 			    u64 reserve_offset, u64 reserve_range,
 			    const struct drm_gpuva_fn_ops *ops);
 void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr);
 
+/**
+ * DRM_GPUVA_EXEC - returns the &drm_gpuva_managers &drm_exec instance
+ * @mgr: the &drm_gpuva_managers to return the &drm_exec instance for
+ */
+#define DRM_GPUVA_EXEC(mgr)	&(mgr)->exec
+
+int drm_gpuva_manager_lock_extra(struct drm_gpuva_manager *mgr,
+				 int (*fn)(struct drm_gpuva_manager *mgr,
+					   void *priv, unsigned int num_fences),
+				 void *priv,
+				 unsigned int num_fences,
+				 bool interruptible);
+
+int drm_gpuva_manager_lock_array(struct drm_gpuva_manager *mgr,
+				 struct drm_gem_object **objs,
+				 unsigned int num_objs,
+				 unsigned int num_fences,
+				 bool interruptible);
+
+/**
+ * drm_gpuva_manager_lock() - lock all dma-resv of all associated BOs
+ * @mgr: the &drm_gpuva_manager
+ * @num_fences: the amount of &dma_fences to reserve
+ * @interruptible: sleep interruptible if waiting
+ *
+ * Acquires all dma-resv locks of all &drm_gem_objects the given
+ * &drm_gpuva_manager contains mappings of.
+ *
+ * Returns: 0 on success, negative error code on failure.
+ */
+static inline int
+drm_gpuva_manager_lock(struct drm_gpuva_manager *mgr,
+		       unsigned int num_fences,
+		       bool interruptible)
+{
+	return drm_gpuva_manager_lock_extra(mgr, NULL, NULL, num_fences,
+					    interruptible);
+}
+
+/**
+ * drm_gpuva_manager_unlock() - unlock all dma-resv of all associated BOs
+ * @mgr: the &drm_gpuva_manager
+ *
+ * Releases all dma-resv locks of all &drm_gem_objects previously acquired
+ * through drm_gpuva_manager_lock() or its variants.
+ *
+ * Returns: 0 on success, negative error code on failure.
+ */
+static inline void
+drm_gpuva_manager_unlock(struct drm_gpuva_manager *mgr)
+{
+	drm_exec_fini(&mgr->exec);
+}
+
+int drm_gpuva_manager_validate(struct drm_gpuva_manager *mgr);
+void drm_gpuva_manager_resv_add_fence(struct drm_gpuva_manager *mgr,
+				      struct dma_fence *fence,
+				      enum dma_resv_usage private_usage,
+				      enum dma_resv_usage extobj_usage);
+
+int drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
+			    struct drm_gem_object *obj);
+void drm_gpuva_extobj_get(struct drm_gpuva_manager *mgr,
+			  struct drm_gem_object *obj);
+void drm_gpuva_extobj_put(struct drm_gpuva_manager *mgr,
+			  struct drm_gem_object *obj);
+
+/**
+ * drm_gpuva_is_extobj() - indicates whether the given &drm_gem_object is an
+ * external object
+ * @mgr: the &drm_gpuva_manager to check
+ * @obj: the &drm_gem_object to check
+ *
+ * Returns: true if the &drm_gem_object &dma_resv differs from the
+ * &drm_gpuva_managers &dma_resv, false otherwise
+ */
+static inline bool drm_gpuva_is_extobj(struct drm_gpuva_manager *mgr,
+				       struct drm_gem_object *obj)
+{
+	return obj && obj->resv != mgr->resv;
+}
+
 static inline struct drm_gpuva *
 __drm_gpuva_next(struct drm_gpuva *va)
 {
@@ -327,6 +453,138 @@ __drm_gpuva_next(struct drm_gpuva *va)
 #define drm_gpuva_for_each_va_safe(va__, next__, mgr__) \
 	list_for_each_entry_safe(va__, next__, &(mgr__)->rb.list, rb.entry)
 
+/**
+ * struct drm_gpuva_gem - structure representing a &drm_gpuva_manager and
+ * &drm_gem_object combination
+ *
+ * This structure is an abstraction representing a &drm_gpuva_manager and
+ * &drm_gem_object combination. It serves as an indirection to accelerate
+ * iterating all &drm_gpuvas within a &drm_gpuva_manager backed by the same
+ * &drm_gem_object.
+ *
+ * Furthermore it is used to cache evicted GEM objects for a certain GPU-VM to
+ * accelerate validation.
+ *
+ * Typically, drivers want to create an instance of a struct drm_gpuva_gem once
+ * a GEM object is mapped first in a GPU-VM and release the instance once the
+ * last mapping of the GEM object in this GPU-VM is unmapped.
+ */
+struct drm_gpuva_gem {
+
+	/**
+	 * @mgr: The &drm_gpuva_manager the @obj is mapped in.
+	 */
+	struct drm_gpuva_manager *mgr;
+
+	/**
+	 * @obj: The &drm_gem_object being mapped in the @mgr.
+	 */
+	struct drm_gem_object *obj;
+
+	/**
+	 * @kref: The reference count for this &drm_gpuva_gem.
+	 */
+	struct kref kref;
+
+	/**
+	 * @list: Structure containing all &list_heads.
+	 */
+	struct {
+		/**
+		 * @gpuva: The list of linked &drm_gpuvas.
+		 */
+		struct list_head gpuva;
+
+		/**
+		 * @entry: Structure containing all &list_heads serving as
+		 * entry.
+		 */
+		struct {
+			/**
+			 * @gem: List entry to attach to the &drm_gem_objects
+			 * gpuva list.
+			 */
+			struct list_head gem;
+
+			/**
+			 * @evict: List entry to attach to the
+			 * &drm_gpuva_managers evict list.
+			 */
+			struct list_head evict;
+		} entry;
+	} list;
+};
+
+struct drm_gpuva_gem *
+drm_gpuva_gem_obtain(struct drm_gpuva_manager *mgr,
+		     struct drm_gem_object *obj);
+struct drm_gpuva_gem *
+drm_gpuva_gem_obtain_prealloc(struct drm_gpuva_manager *mgr,
+			      struct drm_gem_object *obj,
+			      struct drm_gpuva_gem *__vm_bo);
+
+struct drm_gpuva_gem *
+drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
+		   struct drm_gem_object *obj);
+
+void drm_gpuva_gem_evict(struct drm_gem_object *obj, bool evict);
+
+struct drm_gpuva_gem *
+drm_gpuva_gem_create(struct drm_gpuva_manager *mgr,
+		     struct drm_gem_object *obj);
+void drm_gpuva_gem_destroy(struct kref *kref);
+
+/**
+ * drm_gpuva_gem_get() - acquire a struct drm_gpuva_gem reference
+ * @vm_bo: the &drm_gpuva_gem to acquire the reference of
+ *
+ * This function acquires an additional reference to @vm_bo. It is illegal to
+ * call this without already holding a reference. No locks required.
+ */
+static inline struct drm_gpuva_gem *
+drm_gpuva_gem_get(struct drm_gpuva_gem *vm_bo)
+{
+	kref_get(&vm_bo->kref);
+	return vm_bo;
+}
+
+/**
+ * drm_gpuva_gem_put() - drop a struct drm_gpuva_gem reference
+ * @vm_bo: the &drm_gpuva_gem to release the reference of
+ *
+ * This releases a reference to @vm_bo.
+ */
+static inline void
+drm_gpuva_gem_put(struct drm_gpuva_gem *vm_bo)
+{
+	kref_put(&vm_bo->kref, drm_gpuva_gem_destroy);
+}
+
+/**
+ * drm_gpuva_gem_for_each_va() - iterator to walk over a list of &drm_gpuva
+ * @va__: &drm_gpuva structure to assign to in each iteration step
+ * @vm_bo__: the &drm_gpuva_gem the &drm_gpuva to walk are associated with
+ *
+ * This iterator walks over all &drm_gpuva structures associated with the
+ * &drm_gpuva_gem.
+ */
+#define drm_gpuva_gem_for_each_va(va__, vm_bo__) \
+	list_for_each_entry(va__, &(vm_bo__)->list.gpuva, gem.entry)
+
+/**
+ * drm_gpuva_gem_for_each_va_safe() - iterator to safely walk over a list of
+ * &drm_gpuva
+ * @va__: &drm_gpuva structure to assign to in each iteration step
+ * @next__: &next &drm_gpuva to store the next step
+ * @vm_bo__: the &drm_gpuva_gem the &drm_gpuva to walk are associated with
+ *
+ * This iterator walks over all &drm_gpuva structures associated with the
+ * &drm_gpuva_gem. It is implemented with list_for_each_entry_safe(), hence
+ * it is save against removal of elements.
+ */
+#define drm_gpuva_gem_for_each_va_safe(va__, next__, vm_bo__) \
+	list_for_each_entry_safe(va__, next__, &(vm_bo__)->list.gpuva, gem.entry)
+
 /**
  * enum drm_gpuva_op_type - GPU VA operation type
  *
@@ -641,6 +899,30 @@ struct drm_gpuva_fn_ops {
 	 */
 	void (*op_free)(struct drm_gpuva_op *op);
 
+	/**
+	 * @vm_bo_alloc: called when the &drm_gpuva_manager allocates
+	 * a struct drm_gpuva_gem
+	 *
+	 * Some drivers may want to embed struct drm_gpuva_gem into driver
+	 * specific structures. By implementing this callback drivers can
+	 * allocate memory accordingly.
+	 *
+	 * This callback is optional.
+	 */
+	struct drm_gpuva_gem *(*vm_bo_alloc)(void);
+
+	/**
+	 * @vm_bo_free: called when the &drm_gpuva_manager frees a
+	 * struct drm_gpuva_gem
+	 *
+	 * Some drivers may want to embed struct drm_gpuva_gem into driver
+	 * specific structures. By implementing this callback drivers can
+	 * free the previously allocated memory accordingly.
+	 *
+	 * This callback is optional.
+	 */
+	void (*vm_bo_free)(struct drm_gpuva_gem *vm_bo);
+
 	/**
 	 * @sm_step_map: called from &drm_gpuva_sm_map to finally insert the
 	 * mapping once all previous steps were completed
@@ -684,6 +966,17 @@ struct drm_gpuva_fn_ops {
 	 * used.
 	 */
 	int (*sm_step_unmap)(struct drm_gpuva_op *op, void *priv);
+
+	/**
+	 * @bo_validate: called from drm_gpuva_manager_validate()
+	 *
+	 * Drivers receive this callback for every evicted &drm_gem_object being
+	 * mapped in the corresponding &drm_gpuva_manager.
+	 *
+	 * Typically, drivers would call their driver specific variant of
+	 * ttm_bo_validate() from within this callback.
+	 */
+	int (*bo_validate)(struct drm_gem_object *obj);
 };
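A driver embedding struct drm_gpuva_gem into its own per VM/BO structure could
wire up the @vm_bo_alloc and @vm_bo_free callbacks above roughly as follows; the
"my_" names are made up for illustration and all other callbacks are omitted:

struct my_vm_bo {
	struct drm_gpuva_gem base;
	/* driver specific per VM/BO state */
};

static struct drm_gpuva_gem *my_vm_bo_alloc(void)
{
	struct my_vm_bo *vm_bo = kzalloc(sizeof(*vm_bo), GFP_KERNEL);

	return vm_bo ? &vm_bo->base : NULL;
}

static void my_vm_bo_free(struct drm_gpuva_gem *vm_bo)
{
	kfree(container_of(vm_bo, struct my_vm_bo, base));
}

static const struct drm_gpuva_fn_ops my_gpuva_ops = {
	.vm_bo_alloc	= my_vm_bo_alloc,
	.vm_bo_free	= my_vm_bo_free,
	/* .sm_step_*, .bo_validate, etc. as needed */
};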
 
 int drm_gpuva_sm_map(struct drm_gpuva_manager *mgr, void *priv,
@@ -696,11 +989,18 @@ int drm_gpuva_sm_unmap(struct drm_gpuva_manager *mgr, void *priv,
 void drm_gpuva_map(struct drm_gpuva_manager *mgr,
 		   struct drm_gpuva *va,
 		   struct drm_gpuva_op_map *op);
+void drm_gpuva_map_get(struct drm_gpuva_manager *mgr,
+		       struct drm_gpuva *va,
+		       struct drm_gpuva_op_map *op);
 
 void drm_gpuva_remap(struct drm_gpuva *prev,
 		     struct drm_gpuva *next,
 		     struct drm_gpuva_op_remap *op);
+void drm_gpuva_remap_get(struct drm_gpuva *prev,
+			 struct drm_gpuva *next,
+			 struct drm_gpuva_op_remap *op);
 
 void drm_gpuva_unmap(struct drm_gpuva_op_unmap *op);
+void drm_gpuva_unmap_put(struct drm_gpuva_op_unmap *op);
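A sketch of how the _get()/_put() variants are meant to be used from a driver's
&drm_gpuva_fn_ops sm_step callbacks; the "my_bind_ctx" context is hypothetical:

static int my_sm_step_map(struct drm_gpuva_op *op, void *priv)
{
	struct my_bind_ctx *ctx = priv;

	/* Insert the new mapping and take an extobj reference for it. */
	drm_gpuva_map_get(ctx->mgr, ctx->new_va, &op->map);
	drm_gpuva_link(ctx->new_va, ctx->vm_bo);

	return 0;
}

static int my_sm_step_unmap(struct drm_gpuva_op *op, void *priv)
{
	struct drm_gpuva *va = op->unmap.va;

	/* Remove the mapping and drop the extobj reference it held. */
	drm_gpuva_unmap_put(&op->unmap);
	drm_gpuva_unlink(va);

	return 0;
}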
 
 #endif /* __DRM_GPUVA_MGR_H__ */
-- 
2.41.0


^ permalink raw reply related	[flat|nested] 88+ messages in thread

* [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
@ 2023-08-20 21:53   ` Danilo Krummrich
  0 siblings, 0 replies; 88+ messages in thread
From: Danilo Krummrich @ 2023-08-20 21:53 UTC (permalink / raw)
  To: airlied, daniel, matthew.brost, thomas.hellstrom, sarah.walker,
	donald.robson, boris.brezillon, christian.koenig, faith.ekstrand,
	bskeggs, Liam.Howlett
  Cc: dri-devel, nouveau, linux-kernel, Danilo Krummrich

So far the DRM GPUVA manager offers common infrastructure to track GPU VA
allocations and mappings, generically connect GPU VA mappings to their
backing buffers and perform more complex mapping operations on the GPU VA
space.

However, there are more design patterns commonly used by drivers, which
can potentially be generalized in order to make the DRM GPUVA manager
represent a basic GPU-VM implementation. In this context, this patch aims
at generalizing the following elements.

1) Provide a common dma-resv for GEM objects not being used outside of
   this GPU-VM.

2) Provide tracking of external GEM objects (GEM objects which are
   shared with other GPU-VMs).

3) Provide functions to efficiently lock all GEM objects dma-resv the
   GPU-VM contains mappings of.

4) Provide tracking of evicted GEM objects the GPU-VM contains mappings
   of, such that validation of evicted GEM objects is accelerated.

5) Provide some convenience functions for common patterns.

Rather than being designed as a "framework", the target is to make all
features appear as a collection of optional helper functions, such that
drivers are free to make use of the DRM GPUVA manager's basic
functionality and opt-in for other features without setting any feature
flags, just by making use of the corresponding functions.
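Taken together, the helpers are intended to be used roughly like the following
sketch from a driver's job submission path; "obj", "job_fence" and "num_fences"
are placeholders and most error handling is trimmed:

	int ret;

	/* Once per external BO handed in by userspace, before locking. */
	ret = drm_gpuva_extobj_insert(mgr, obj);
	if (ret)
		return ret;

	/* Lock the VM's common dma-resv plus all extobj dma-resvs. */
	ret = drm_gpuva_manager_lock(mgr, num_fences, true);
	if (ret)
		return ret;

	/* Re-validate all BOs evicted since the last submission. */
	ret = drm_gpuva_manager_validate(mgr);
	if (ret)
		goto err_unlock;

	/* ... update mappings, push the job ... */

	drm_gpuva_manager_resv_add_fence(mgr, job_fence,
					 DMA_RESV_USAGE_BOOKKEEP,
					 DMA_RESV_USAGE_BOOKKEEP);
	drm_gpuva_manager_unlock(mgr);

	return 0;

err_unlock:
	drm_gpuva_manager_unlock(mgr);
	return ret;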

Signed-off-by: Danilo Krummrich <dakr@redhat.com>
---
 drivers/gpu/drm/drm_gpuva_mgr.c | 688 +++++++++++++++++++++++++++++++-
 include/drm/drm_gem.h           |  48 ++-
 include/drm/drm_gpuva_mgr.h     | 302 +++++++++++++-
 3 files changed, 1010 insertions(+), 28 deletions(-)

diff --git a/drivers/gpu/drm/drm_gpuva_mgr.c b/drivers/gpu/drm/drm_gpuva_mgr.c
index f86bfad74ff8..69872b205961 100644
--- a/drivers/gpu/drm/drm_gpuva_mgr.c
+++ b/drivers/gpu/drm/drm_gpuva_mgr.c
@@ -655,6 +655,7 @@ drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
 /**
  * drm_gpuva_manager_init() - initialize a &drm_gpuva_manager
  * @mgr: pointer to the &drm_gpuva_manager to initialize
+ * @drm: the driver's &drm_device
  * @name: the name of the GPU VA space
  * @start_offset: the start offset of the GPU VA space
  * @range: the size of the GPU VA space
@@ -669,6 +670,7 @@ drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
  */
 void
 drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
+		       struct drm_device *drm,
 		       const char *name,
 		       u64 start_offset, u64 range,
 		       u64 reserve_offset, u64 reserve_range,
@@ -677,6 +679,11 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
 	mgr->rb.tree = RB_ROOT_CACHED;
 	INIT_LIST_HEAD(&mgr->rb.list);
 
+	mt_init(&mgr->mt_ext);
+
+	INIT_LIST_HEAD(&mgr->evict.list);
+	spin_lock_init(&mgr->evict.lock);
+
 	drm_gpuva_check_overflow(start_offset, range);
 	mgr->mm_start = start_offset;
 	mgr->mm_range = range;
@@ -694,6 +701,9 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
 						     reserve_range)))
 			__drm_gpuva_insert(mgr, &mgr->kernel_alloc_node);
 	}
+
+	drm_gem_private_object_init(drm, &mgr->d_obj, 0);
+	mgr->resv = mgr->d_obj.resv;
 }
 EXPORT_SYMBOL_GPL(drm_gpuva_manager_init);
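For example, a driver could now initialize its VM roughly like this; the "uvmm"
wrapper, the VA range and "my_gpuva_ops" are made-up values for illustration:

	drm_gpuva_manager_init(&uvmm->umgr, drm, "example-vm",
			       0, 1ull << 47,	/* managed VA space */
			       0, SZ_4K,	/* kernel reserved node */
			       &my_gpuva_ops);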
 
@@ -713,10 +723,575 @@ drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr)
 		__drm_gpuva_remove(&mgr->kernel_alloc_node);
 
 	WARN(!RB_EMPTY_ROOT(&mgr->rb.tree.rb_root),
-	     "GPUVA tree is not empty, potentially leaking memory.");
+	     "GPUVA tree is not empty, potentially leaking memory.\n");
+
+	mtree_destroy(&mgr->mt_ext);
+	WARN(!list_empty(&mgr->evict.list), "Evict list should be empty.\n");
+
+	drm_gem_private_object_fini(&mgr->d_obj);
 }
 EXPORT_SYMBOL_GPL(drm_gpuva_manager_destroy);
 
+/**
+ * drm_gpuva_manager_prepare_objects() - prepare all associated BOs
+ * @mgr: the &drm_gpuva_manager
+ * @num_fences: the amount of &dma_fences to reserve
+ *
+ * Calls drm_exec_prepare_obj() for all &drm_gem_objects the given
+ * &drm_gpuva_manager contains mappings of.
+ *
+ * Drivers can obtain the corresponding &drm_exec instance through
+ * DRM_GPUVA_EXEC(). It is the driver's responsibility to call drm_exec_init()
+ * and drm_exec_fini() accordingly.
+ *
+ * Returns: 0 on success, negative error code on failure.
+ */
+int
+drm_gpuva_manager_prepare_objects(struct drm_gpuva_manager *mgr,
+				  unsigned int num_fences)
+{
+	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
+	MA_STATE(mas, &mgr->mt_ext, 0, 0);
+	union {
+		void *ptr;
+		uintptr_t cnt;
+	} ref;
+	int ret;
+
+	ret = drm_exec_prepare_obj(exec, &mgr->d_obj, num_fences);
+	if (ret)
+		goto out;
+
+	rcu_read_lock();
+	mas_for_each(&mas, ref.ptr, ULONG_MAX) {
+		struct drm_gem_object *obj;
+
+		mas_pause(&mas);
+		rcu_read_unlock();
+
+		obj = (struct drm_gem_object *)(uintptr_t)mas.index;
+		ret = drm_exec_prepare_obj(exec, obj, num_fences);
+		if (ret)
+			goto out;
+
+		rcu_read_lock();
+	}
+	rcu_read_unlock();
+
+out:
+	return ret;
+}
+EXPORT_SYMBOL_GPL(drm_gpuva_manager_prepare_objects);
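Used on its own (i.e. without drm_gpuva_manager_lock()), driving the &drm_exec
loop could look roughly like this sketch, where "num_fences" is a placeholder:

	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
	int ret = 0;

	drm_exec_init(exec, DRM_EXEC_INTERRUPTIBLE_WAIT);
	drm_exec_until_all_locked(exec) {
		ret = drm_gpuva_manager_prepare_objects(mgr, num_fences);
		drm_exec_retry_on_contention(exec);
		if (ret)
			break;
	}
	if (ret)
		drm_exec_fini(exec);
	/* On success the dma-resv locks stay held until drm_exec_fini(). */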
+
+/**
+ * drm_gpuva_manager_lock_extra() - lock all dma-resv of all associated BOs
+ * @mgr: the &drm_gpuva_manager
+ * @fn: callback received by the driver to lock additional dma-resv
+ * @priv: private driver data passed to @fn
+ * @num_fences: the amount of &dma_fences to reserve
+ * @interruptible: sleep interruptible if waiting
+ *
+ * Acquires all dma-resv locks of all &drm_gem_objects the given
+ * &drm_gpuva_manager contains mappings of.
+ *
+ * Additionally, when calling this function the driver receives the given @fn
+ * callback to lock additional dma-resv in the context of the
+ * &drm_gpuva_managers &drm_exec instance. Typically, drivers would call
+ * drm_exec_prepare_obj() from within this callback.
+ *
+ * Returns: 0 on success, negative error code on failure.
+ */
+int
+drm_gpuva_manager_lock_extra(struct drm_gpuva_manager *mgr,
+			     int (*fn)(struct drm_gpuva_manager *mgr,
+				       void *priv, unsigned int num_fences),
+			     void *priv,
+			     unsigned int num_fences,
+			     bool interruptible)
+{
+	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
+	uint32_t flags;
+	int ret;
+
+	flags = (interruptible ? DRM_EXEC_INTERRUPTIBLE_WAIT : 0) |
+		DRM_EXEC_IGNORE_DUPLICATES;
+
+	drm_exec_init(exec, flags);
+
+	drm_exec_until_all_locked(exec) {
+		ret = drm_gpuva_manager_prepare_objects(mgr, num_fences);
+		drm_exec_retry_on_contention(exec);
+		if (ret)
+			goto err;
+
+		if (fn) {
+			ret = fn(mgr, priv, num_fences);
+			drm_exec_retry_on_contention(exec);
+			if (ret)
+				goto err;
+		}
+	}
+
+	return 0;
+
+err:
+	drm_exec_fini(exec);
+	return ret;
+}
+EXPORT_SYMBOL_GPL(drm_gpuva_manager_lock_extra);
+
+static int
+fn_lock_array(struct drm_gpuva_manager *mgr, void *priv,
+				unsigned int num_fences)
+{
+	struct {
+		struct drm_gem_object **objs;
+		unsigned int num_objs;
+	} *args = priv;
+
+	return drm_exec_prepare_array(DRM_GPUVA_EXEC(mgr), args->objs,
+				      args->num_objs, num_fences);
+}
+
+/**
+ * drm_gpuva_manager_lock_array() - lock all dma-resv of all associated BOs
+ * @mgr: the &drm_gpuva_manager
+ * @objs: additional &drm_gem_objects to lock
+ * @num_objs: the number of additional &drm_gem_objects to lock
+ * @num_fences: the amount of &dma_fences to reserve
+ * @interruptible: sleep interruptible if waiting
+ *
+ * Acquires all dma-resv locks of all &drm_gem_objects the given
+ * &drm_gpuva_manager contains mappings of, plus the ones given through @objs.
+ *
+ * Returns: 0 on success, negative error code on failure.
+ */
+int
+drm_gpuva_manager_lock_array(struct drm_gpuva_manager *mgr,
+			     struct drm_gem_object **objs,
+			     unsigned int num_objs,
+			     unsigned int num_fences,
+			     bool interruptible)
+{
+	struct {
+		struct drm_gem_object **objs;
+		unsigned int num_objs;
+	} args;
+
+	args.objs = objs;
+	args.num_objs = num_objs;
+
+	return drm_gpuva_manager_lock_extra(mgr, fn_lock_array, &args,
+					    num_fences, interruptible);
+}
+EXPORT_SYMBOL_GPL(drm_gpuva_manager_lock_array);
+
+/**
+ * drm_gpuva_manager_validate() - validate all BOs marked as evicted
+ * @mgr: the &drm_gpuva_manager to validate evicted BOs
+ *
+ * Calls the &drm_gpuva_fn_ops.bo_validate callback for all evicted buffer
+ * objects being mapped in the given &drm_gpuva_manager.
+ *
+ * Returns: 0 on success, negative error code on failure.
+ */
+int
+drm_gpuva_manager_validate(struct drm_gpuva_manager *mgr)
+{
+	const struct drm_gpuva_fn_ops *ops = mgr->ops;
+	struct drm_gpuva_gem *vm_bo;
+	int ret;
+
+	if (unlikely(!ops || !ops->bo_validate))
+		return -ENOTSUPP;
+
+	/* At this point we should hold all dma-resv locks of all GEM objects
+	 * associated with this GPU-VM, hence it is safe to walk the list.
+	 */
+	list_for_each_entry(vm_bo, &mgr->evict.list, list.entry.evict) {
+		dma_resv_assert_held(vm_bo->obj->resv);
+
+		ret = ops->bo_validate(vm_bo->obj);
+		if (ret)
+			return ret;
+	}
+
+	return 0;
+}
+EXPORT_SYMBOL_GPL(drm_gpuva_manager_validate);
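A TTM based driver's &drm_gpuva_fn_ops.bo_validate implementation could look
roughly like the sketch below, with "my_placement" being driver specific:

static int my_bo_validate(struct drm_gem_object *obj)
{
	struct ttm_buffer_object *bo =
		container_of(obj, struct ttm_buffer_object, base);
	struct ttm_operation_ctx ctx = { .interruptible = true };

	return ttm_bo_validate(bo, &my_placement, &ctx);
}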
+
+/**
+ * drm_gpuva_manager_resv_add_fence - add fence to private and all extobj
+ * dma-resv
+ * @mgr: the &drm_gpuva_manager to add a fence to
+ * @fence: fence to add
+ * @private_usage: private dma-resv usage
+ * @extobj_usage: extobj dma-resv usage
+ */
+void
+drm_gpuva_manager_resv_add_fence(struct drm_gpuva_manager *mgr,
+				 struct dma_fence *fence,
+				 enum dma_resv_usage private_usage,
+				 enum dma_resv_usage extobj_usage)
+{
+	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
+	struct drm_gem_object *obj;
+	unsigned long index;
+
+	drm_exec_for_each_locked_object(exec, index, obj) {
+		dma_resv_assert_held(obj->resv);
+		dma_resv_add_fence(obj->resv, fence,
+				   drm_gpuva_is_extobj(mgr, obj) ?
+				   extobj_usage : private_usage);
+	}
+}
+EXPORT_SYMBOL_GPL(drm_gpuva_manager_resv_add_fence);
+
+static struct drm_gpuva_gem *
+__drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
+		     struct drm_gem_object *obj)
+{
+	struct drm_gpuva_gem *vm_bo;
+
+	drm_gem_gpuva_assert_lock_held(obj);
+
+	drm_gem_for_each_gpuva_gem(vm_bo, obj)
+		if (vm_bo->mgr == mgr)
+			return vm_bo;
+
+	return NULL;
+}
+
+/**
+ * drm_gpuva_gem_create() - create a new instance of struct drm_gpuva_gem
+ * @mgr: The &drm_gpuva_manager the @obj is mapped in.
+ * @obj: The &drm_gem_object being mapped in the @mgr.
+ *
+ * If provided by the driver, this function uses the &drm_gpuva_fn_ops
+ * vm_bo_alloc() callback to allocate.
+ *
+ * Returns: a pointer to the &drm_gpuva_gem on success, NULL on failure
+ */
+struct drm_gpuva_gem *
+drm_gpuva_gem_create(struct drm_gpuva_manager *mgr,
+		     struct drm_gem_object *obj)
+{
+	const struct drm_gpuva_fn_ops *ops = mgr->ops;
+	struct drm_gpuva_gem *vm_bo;
+
+	if (ops && ops->vm_bo_alloc)
+		vm_bo = ops->vm_bo_alloc();
+	else
+		vm_bo = kzalloc(sizeof(*vm_bo), GFP_KERNEL);
+
+	if (unlikely(!vm_bo))
+		return NULL;
+
+	vm_bo->mgr = mgr;
+	vm_bo->obj = obj;
+
+	kref_init(&vm_bo->kref);
+	INIT_LIST_HEAD(&vm_bo->list.gpuva);
+	INIT_LIST_HEAD(&vm_bo->list.entry.gem);
+	INIT_LIST_HEAD(&vm_bo->list.entry.evict);
+
+	drm_gem_object_get(obj);
+
+	return vm_bo;
+}
+EXPORT_SYMBOL_GPL(drm_gpuva_gem_create);
+
+void
+drm_gpuva_gem_destroy(struct kref *kref)
+{
+	struct drm_gpuva_gem *vm_bo = container_of(kref, struct drm_gpuva_gem,
+						   kref);
+	const struct drm_gpuva_fn_ops *ops = vm_bo->mgr->ops;
+
+	drm_gem_object_put(vm_bo->obj);
+
+	if (ops && ops->vm_bo_free)
+		ops->vm_bo_free(vm_bo);
+	else
+		kfree(vm_bo);
+}
+EXPORT_SYMBOL_GPL(drm_gpuva_gem_destroy);
+
+/**
+ * drm_gpuva_gem_find() - find the &drm_gpuva_gem for the given
+ * &drm_gpuva_manager and &drm_gem_object
+ * @mgr: The &drm_gpuva_manager the @obj is mapped in.
+ * @obj: The &drm_gem_object being mapped in the @mgr.
+ *
+ * Find the &drm_gpuva_gem representing the combination of the given
+ * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
+ * count of the &drm_gpuva_gem accordingly.
+ *
+ * Returns: a pointer to the &drm_gpuva_gem on success, NULL on failure
+ */
+struct drm_gpuva_gem *
+drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
+		   struct drm_gem_object *obj)
+{
+	struct drm_gpuva_gem *vm_bo = __drm_gpuva_gem_find(mgr, obj);
+
+	return vm_bo ? drm_gpuva_gem_get(vm_bo) : NULL;
+}
+EXPORT_SYMBOL_GPL(drm_gpuva_gem_find);
+
+/**
+ * drm_gpuva_gem_obtain() - obtains an instance of the &drm_gpuva_gem for the
+ * given &drm_gpuva_manager and &drm_gem_object
+ * @mgr: The &drm_gpuva_manager the @obj is mapped in.
+ * @obj: The &drm_gem_object being mapped in the @mgr.
+ *
+ * Find the &drm_gpuva_gem representing the combination of the given
+ * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
+ * count of the &drm_gpuva_gem accordingly. If not found, allocates a new
+ * &drm_gpuva_gem.
+ *
+ * Returns: a pointer to the &drm_gpuva_gem on success, an ERR_PTR on failure
+ */
+struct drm_gpuva_gem *
+drm_gpuva_gem_obtain(struct drm_gpuva_manager *mgr,
+		     struct drm_gem_object *obj)
+{
+	struct drm_gpuva_gem *vm_bo;
+
+	vm_bo = drm_gpuva_gem_find(mgr, obj);
+	if (vm_bo)
+		return vm_bo;
+
+	vm_bo = drm_gpuva_gem_create(mgr, obj);
+	if (!vm_bo)
+		return ERR_PTR(-ENOMEM);
+
+	return vm_bo;
+}
+EXPORT_SYMBOL_GPL(drm_gpuva_gem_obtain);
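In a driver's map path this typically pairs with drm_gpuva_link(); a minimal
sketch, assuming the GEM's gpuva lock is held:

	struct drm_gpuva_gem *vm_bo;

	vm_bo = drm_gpuva_gem_obtain(mgr, obj);
	if (IS_ERR(vm_bo))
		return PTR_ERR(vm_bo);

	drm_gpuva_link(va, vm_bo);

	/* drm_gpuva_link() takes its own reference, drop the local one. */
	drm_gpuva_gem_put(vm_bo);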
+
+/**
+ * drm_gpuva_gem_obtain_prealloc() - obtains an instance of the &drm_gpuva_gem
+ * for the given &drm_gpuva_manager and &drm_gem_object
+ * @mgr: The &drm_gpuva_manager the @obj is mapped in.
+ * @obj: The &drm_gem_object being mapped in the @mgr.
+ * @__vm_bo: A pre-allocated &drm_gpuva_gem, e.g. obtained through
+ * drm_gpuva_gem_create().
+ *
+ * Find the &drm_gpuva_gem representing the combination of the given
+ * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
+ * count of the found &drm_gpuva_gem accordingly, while the @__vm_bo reference
+ * count is decreased. If not found, @__vm_bo is returned.
+ *
+ * Returns: a pointer to the found &drm_gpuva_gem or @__vm_bo if no existing
+ * &drm_gpuva_gem was found
+ */
+struct drm_gpuva_gem *
+drm_gpuva_gem_obtain_prealloc(struct drm_gpuva_manager *mgr,
+			      struct drm_gem_object *obj,
+			      struct drm_gpuva_gem *__vm_bo)
+{
+	struct drm_gpuva_gem *vm_bo;
+
+	vm_bo = drm_gpuva_gem_find(mgr, obj);
+	if (vm_bo) {
+		drm_gpuva_gem_put(__vm_bo);
+		return vm_bo;
+	}
+
+	return __vm_bo;
+}
+EXPORT_SYMBOL_GPL(drm_gpuva_gem_obtain_prealloc);
+
+static int
+__drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
+			  struct drm_gem_object *obj,
+			  gfp_t gfp)
+{
+	MA_STATE(mas, &mgr->mt_ext, 0, 0);
+	union {
+		struct drm_gem_object *obj;
+		uintptr_t index;
+	} gem;
+	union {
+		void *ptr;
+		uintptr_t cnt;
+	} ref;
+	int ret = 0;
+
+	gem.obj = obj;
+	mas_set(&mas, gem.index);
+
+	mas_lock(&mas);
+	ref.ptr = mas_walk(&mas);
+	if (ref.ptr) {
+		++ref.cnt;
+		mas_store(&mas, ref.ptr);
+	} else {
+		if (unlikely(!gfp)) {
+			ret = -EINVAL;
+			goto out;
+		}
+
+		mas_set(&mas, gem.index);
+		ref.cnt = 1;
+		ret = mas_store_gfp(&mas, ref.ptr, gfp);
+		if (likely(!ret))
+			drm_gem_object_get(obj);
+	}
+out:
+	mas_unlock(&mas);
+	return ret;
+}
+
+static void
+__drm_gpuva_extobj_remove(struct drm_gpuva_manager *mgr,
+			  struct drm_gem_object *obj)
+{
+	MA_STATE(mas, &mgr->mt_ext, 0, 0);
+	union {
+		struct drm_gem_object *obj;
+		uintptr_t index;
+	} gem;
+	union {
+		void *ptr;
+		uintptr_t cnt;
+	} ref;
+
+	gem.obj = obj;
+	mas_set(&mas, gem.index);
+
+	mas_lock(&mas);
+	if (unlikely(!(ref.ptr = mas_walk(&mas))))
+		goto out;
+
+	if (!--ref.cnt) {
+		mas_erase(&mas);
+		drm_gem_object_put(obj);
+	} else {
+		mas_store(&mas, ref.ptr);
+	}
+out:
+	mas_unlock(&mas);
+}
+
+/**
+ * drm_gpuva_extobj_insert() - insert an external &drm_gem_object
+ * @mgr: the &drm_gpuva_manager to insert into
+ * @obj: the &drm_gem_object to insert as extobj
+ *
+ * Insert a &drm_gem_object into the &drm_gpuva_managers external object tree.
+ * If the &drm_gem_object already exists in the tree, the reference counter
+ * of this external object is increased by one.
+ *
+ * Drivers should insert the external &drm_gem_object before the dma-fence
+ * signalling critical section, e.g. when submitting the job, and before
+ * locking all &drm_gem_objects of a GPU-VM, e.g. with drm_gpuva_manager_lock()
+ * or its variants.
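+ *
+ * A rough ordering sketch for a driver's job submission path (illustrative
+ * only, driver specific details omitted):
+ *
+ *	drm_gpuva_extobj_insert(mgr, obj);
+ *	drm_gpuva_manager_lock(mgr, num_fences, interruptible);
+ *	drm_gpuva_manager_validate(mgr);
+ *	... queue the job ...
+ *	drm_gpuva_manager_resv_add_fence(mgr, fence, private_usage, extobj_usage);
+ *	drm_gpuva_manager_unlock(mgr);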
+ *
+ * Returns: 0 on success, negative error code on failure.
+ */
+int
+drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
+			struct drm_gem_object *obj)
+{
+	return drm_gpuva_is_extobj(mgr, obj) ?
+		__drm_gpuva_extobj_insert(mgr, obj, GFP_KERNEL) : 0;
+}
+EXPORT_SYMBOL_GPL(drm_gpuva_extobj_insert);
+
+/**
+ * drm_gpuva_extobj_get() - increase the reference count of an external
+ * &drm_gem_object
+ * @mgr: the &drm_gpuva_manager storing the extobj
+ * @obj: the &drm_gem_object representing the extobj
+ *
+ * Increases the reference count of the extobj represented by @obj.
+ *
+ * Drivers should call this for every &drm_gpuva backed by a &drm_gem_object
+ * being inserted.
+ *
+ * For &drm_gpuva_op_remap operations drivers should make sure to only take an
+ * additional reference if the re-map operation splits an existing &drm_gpuva
+ * into two separate ones.
+ *
+ * See also drm_gpuva_map_get() and drm_gpuva_remap_get().
+ */
+void
+drm_gpuva_extobj_get(struct drm_gpuva_manager *mgr,
+		     struct drm_gem_object *obj)
+{
+	if (drm_gpuva_is_extobj(mgr, obj))
+		WARN(__drm_gpuva_extobj_insert(mgr, obj, 0),
+		     "Can't increase ref-count of non-existent extobj.");
+}
+EXPORT_SYMBOL_GPL(drm_gpuva_extobj_get);
+
+/**
+ * drm_gpuva_extobj_put() - decrease the reference count of an external
+ * &drm_gem_object
+ * @mgr: the &drm_gpuva_manager storing the extobj
+ * @obj: the &drm_gem_object representing the extobj
+ *
+ * Decreases the reference count of the extobj represented by @obj.
+ *
+ * Drivers should call this for every &drm_gpuva backed by a &drm_gem_object
+ * being removed from the GPU VA space.
+ *
+ * See also drm_gpuva_unmap_put().
+ */
+void
+drm_gpuva_extobj_put(struct drm_gpuva_manager *mgr,
+		     struct drm_gem_object *obj)
+{
+	if (drm_gpuva_is_extobj(mgr, obj))
+		__drm_gpuva_extobj_remove(mgr, obj);
+}
+EXPORT_SYMBOL_GPL(drm_gpuva_extobj_put);
+
+/**
+ * drm_gpuva_gem_evict() - add / remove a &drm_gem_object to / from a
+ * &drm_gpuva_managers evicted list
+ * @obj: the &drm_gem_object to add or remove
+ * @evict: indicates whether the object is evicted
+ *
+ * Adds a &drm_gem_object to or removes it from all &drm_gpuva_managers evicted
+ * list containing a mapping of this &drm_gem_object.
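+ *
+ * Drivers would typically call this from their TTM move callback, e.g.
+ * (sketch):
+ *
+ *	drm_gpuva_gem_evict(&bo->base, evict);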
+ */
+void
+drm_gpuva_gem_evict(struct drm_gem_object *obj, bool evict)
+{
+	struct drm_gpuva_gem *vm_bo;
+
+	/* Required for iterating the GEMs GPUVA GEM list. If no driver specific
+	 * lock has been set, the list is protected with the GEMs dma-resv lock.
+	 */
+	drm_gem_gpuva_assert_lock_held(obj);
+
+	/* Required to protect the GPUVA managers evict list against concurrent
+	 * access through drm_gpuva_manager_validate(). Concurrent insertions to
+	 * the evict list through different GEM object evictions are protected
+	 * by the GPUVA managers evict lock.
+	 */
+	dma_resv_assert_held(obj->resv);
+
+	drm_gem_for_each_gpuva_gem(vm_bo, obj) {
+		struct drm_gpuva_manager *mgr = vm_bo->mgr;
+
+		spin_lock(&mgr->evict.lock);
+		if (evict)
+			list_add_tail(&vm_bo->list.entry.evict,
+				      &mgr->evict.list);
+		else
+			list_del_init(&vm_bo->list.entry.evict);
+		spin_unlock(&mgr->evict.lock);
+	}
+}
+EXPORT_SYMBOL_GPL(drm_gpuva_gem_evict);
+
 static int
 __drm_gpuva_insert(struct drm_gpuva_manager *mgr,
 		   struct drm_gpuva *va)
@@ -806,15 +1381,20 @@ EXPORT_SYMBOL_GPL(drm_gpuva_remove);
 /**
  * drm_gpuva_link() - link a &drm_gpuva
  * @va: the &drm_gpuva to link
+ * @vm_bo: the &drm_gpuva_gem to add the &drm_gpuva to
  *
- * This adds the given &va to the GPU VA list of the &drm_gem_object it is
- * associated with.
+ * This adds the given &va to the GPU VA list of the &drm_gpuva_gem and the
+ * &drm_gpuva_gem to the &drm_gem_object it is associated with.
+ *
+ * For every &drm_gpuva entry added to the &drm_gpuva_gem an additional
+ * reference of the latter is taken.
  *
  * This function expects the caller to protect the GEM's GPUVA list against
- * concurrent access using the GEMs dma_resv lock.
+ * concurrent access using either the GEMs dma_resv lock or a driver specific
+ * lock set through drm_gem_gpuva_set_lock().
  */
 void
-drm_gpuva_link(struct drm_gpuva *va)
+drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo)
 {
 	struct drm_gem_object *obj = va->gem.obj;
 
@@ -823,7 +1403,10 @@ drm_gpuva_link(struct drm_gpuva *va)
 
 	drm_gem_gpuva_assert_lock_held(obj);
 
-	list_add_tail(&va->gem.entry, &obj->gpuva.list);
+	drm_gpuva_gem_get(vm_bo);
+	list_add_tail(&va->gem.entry, &vm_bo->list.gpuva);
+	if (list_empty(&vm_bo->list.entry.gem))
+		list_add_tail(&vm_bo->list.entry.gem, &obj->gpuva.list);
 }
 EXPORT_SYMBOL_GPL(drm_gpuva_link);
 
@@ -834,20 +1417,39 @@ EXPORT_SYMBOL_GPL(drm_gpuva_link);
  * This removes the given &va from the GPU VA list of the &drm_gem_object it is
  * associated with.
  *
+ * This removes the given &va from the GPU VA list of the &drm_gpuva_gem and
+ * the &drm_gpuva_gem from the &drm_gem_object it is associated with in case
+ * this call unlinks the last &drm_gpuva from the &drm_gpuva_gem.
+ *
+ * For every &drm_gpuva entry removed from the &drm_gpuva_gem a reference of
+ * the latter is dropped.
+ *
  * This function expects the caller to protect the GEM's GPUVA list against
- * concurrent access using the GEMs dma_resv lock.
+ * concurrent access using either the GEMs dma_resv lock or a driver specific
+ * lock set through drm_gem_gpuva_set_lock().
  */
 void
 drm_gpuva_unlink(struct drm_gpuva *va)
 {
 	struct drm_gem_object *obj = va->gem.obj;
+	struct drm_gpuva_gem *vm_bo;
 
 	if (unlikely(!obj))
 		return;
 
 	drm_gem_gpuva_assert_lock_held(obj);
 
+	vm_bo = __drm_gpuva_gem_find(va->mgr, obj);
+	if (WARN(!vm_bo, "GPUVA doesn't seem to be linked.\n"))
+		return;
+
 	list_del_init(&va->gem.entry);
+
+	if (list_empty(&vm_bo->list.gpuva)) {
+		list_del_init(&vm_bo->list.entry.gem);
+		list_del_init(&vm_bo->list.entry.evict);
+	}
+	drm_gpuva_gem_put(vm_bo);
 }
 EXPORT_SYMBOL_GPL(drm_gpuva_unlink);
 
@@ -977,6 +1579,26 @@ drm_gpuva_map(struct drm_gpuva_manager *mgr,
 }
 EXPORT_SYMBOL_GPL(drm_gpuva_map);
 
+/**
+ * drm_gpuva_map_get() - helper to insert a &drm_gpuva according to a
+ * &drm_gpuva_op_map
+ * @mgr: the &drm_gpuva_manager
+ * @va: the &drm_gpuva to insert
+ * @op: the &drm_gpuva_op_map to initialize @va with
+ *
+ * Initializes the @va from the @op and inserts it into the given @mgr and
+ * increases the reference count of the corresponding extobj.
+ */
+void
+drm_gpuva_map_get(struct drm_gpuva_manager *mgr,
+		  struct drm_gpuva *va,
+		  struct drm_gpuva_op_map *op)
+{
+	drm_gpuva_map(mgr, va, op);
+	drm_gpuva_extobj_get(mgr, va->gem.obj);
+}
+EXPORT_SYMBOL_GPL(drm_gpuva_map_get);
+
 /**
  * drm_gpuva_remap() - helper to remap a &drm_gpuva according to a
  * &drm_gpuva_op_remap
@@ -992,10 +1614,10 @@ drm_gpuva_remap(struct drm_gpuva *prev,
 		struct drm_gpuva *next,
 		struct drm_gpuva_op_remap *op)
 {
-	struct drm_gpuva *curr = op->unmap->va;
-	struct drm_gpuva_manager *mgr = curr->mgr;
+	struct drm_gpuva *va = op->unmap->va;
+	struct drm_gpuva_manager *mgr = va->mgr;
 
-	drm_gpuva_remove(curr);
+	drm_gpuva_remove(va);
 
 	if (op->prev) {
 		drm_gpuva_init_from_op(prev, op->prev);
@@ -1009,6 +1631,31 @@ drm_gpuva_remap(struct drm_gpuva *prev,
 }
 EXPORT_SYMBOL_GPL(drm_gpuva_remap);
 
+/**
+ * drm_gpuva_remap_get() - helper to remap a &drm_gpuva according to a
+ * &drm_gpuva_op_remap
+ * @prev: the &drm_gpuva to remap when keeping the start of a mapping
+ * @next: the &drm_gpuva to remap when keeping the end of a mapping
+ * @op: the &drm_gpuva_op_remap to initialize @prev and @next with
+ *
+ * Removes the currently mapped &drm_gpuva and remaps it using @prev and/or
+ * @next. Additionally, if the re-map splits the existing &drm_gpuva into two
+ * separate mappings, increases the reference count of the corresponding extobj.
+ */
+void
+drm_gpuva_remap_get(struct drm_gpuva *prev,
+		    struct drm_gpuva *next,
+		    struct drm_gpuva_op_remap *op)
+{
+	struct drm_gpuva *va = op->unmap->va;
+	struct drm_gpuva_manager *mgr = va->mgr;
+
+	drm_gpuva_remap(prev, next, op);
+	if (op->prev && op->next)
+		drm_gpuva_extobj_get(mgr, va->gem.obj);
+}
+EXPORT_SYMBOL_GPL(drm_gpuva_remap_get);
+
 /**
  * drm_gpuva_unmap() - helper to remove a &drm_gpuva according to a
  * &drm_gpuva_op_unmap
@@ -1023,6 +1670,24 @@ drm_gpuva_unmap(struct drm_gpuva_op_unmap *op)
 }
 EXPORT_SYMBOL_GPL(drm_gpuva_unmap);
 
+/**
+ * drm_gpuva_unmap_put() - helper to remove a &drm_gpuva according to a
+ * &drm_gpuva_op_unmap
+ * @op: the &drm_gpuva_op_unmap specifying the &drm_gpuva to remove
+ *
+ * Removes the &drm_gpuva associated with the &drm_gpuva_op_unmap and decreases
+ * the reference count of the corresponding extobj.
+ */
+void
+drm_gpuva_unmap_put(struct drm_gpuva_op_unmap *op)
+{
+	struct drm_gpuva *va = op->va;
+
+	drm_gpuva_unmap(op);
+	drm_gpuva_extobj_put(va->mgr, va->gem.obj);
+}
+EXPORT_SYMBOL_GPL(drm_gpuva_unmap_put);
+
 static int
 op_map_cb(const struct drm_gpuva_fn_ops *fn, void *priv,
 	  u64 addr, u64 range,
@@ -1663,6 +2328,7 @@ drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
 {
 	struct drm_gpuva_ops *ops;
 	struct drm_gpuva_op *op;
+	struct drm_gpuva_gem *vm_bo;
 	struct drm_gpuva *va;
 	int ret;
 
@@ -1674,7 +2340,7 @@ drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
 
 	INIT_LIST_HEAD(&ops->list);
 
-	drm_gem_for_each_gpuva(va, obj) {
+	drm_gem_for_each_gpuva(va, vm_bo, mgr, obj) {
 		op = gpuva_op_alloc(mgr);
 		if (!op) {
 			ret = -ENOMEM;
diff --git a/include/drm/drm_gem.h b/include/drm/drm_gem.h
index bc9f6aa2f3fe..783ed3ab440d 100644
--- a/include/drm/drm_gem.h
+++ b/include/drm/drm_gem.h
@@ -571,7 +571,7 @@ int drm_gem_evict(struct drm_gem_object *obj);
  * drm_gem_gpuva_init() - initialize the gpuva list of a GEM object
  * @obj: the &drm_gem_object
  *
- * This initializes the &drm_gem_object's &drm_gpuva list.
+ * This initializes the &drm_gem_object's &drm_gpuva_gem list.
  *
  * Calling this function is only necessary for drivers intending to support the
  * &drm_driver_feature DRIVER_GEM_GPUVA.
@@ -584,28 +584,44 @@ static inline void drm_gem_gpuva_init(struct drm_gem_object *obj)
 }
 
 /**
- * drm_gem_for_each_gpuva() - iternator to walk over a list of gpuvas
- * @entry__: &drm_gpuva structure to assign to in each iteration step
- * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
+ * drm_gem_for_each_gpuva_gem() - iterator to walk over a list of &drm_gpuva_gem
+ * @entry__: &drm_gpuva_gem structure to assign to in each iteration step
+ * @obj__: the &drm_gem_object the &drm_gpuva_gem to walk are associated with
  *
- * This iterator walks over all &drm_gpuva structures associated with the
- * &drm_gpuva_manager.
+ * This iterator walks over all &drm_gpuva_gem structures associated with the
+ * &drm_gem_object.
  */
-#define drm_gem_for_each_gpuva(entry__, obj__) \
-	list_for_each_entry(entry__, &(obj__)->gpuva.list, gem.entry)
+#define drm_gem_for_each_gpuva_gem(entry__, obj__) \
+	list_for_each_entry(entry__, &(obj__)->gpuva.list, list.entry.gem)
 
 /**
- * drm_gem_for_each_gpuva_safe() - iternator to safely walk over a list of
- * gpuvas
- * @entry__: &drm_gpuva structure to assign to in each iteration step
- * @next__: &next &drm_gpuva to store the next step
- * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
+ * drm_gem_for_each_gpuva_gem_safe() - iterator to safely walk over a list of
+ * &drm_gpuva_gem
+ * @entry__: &drm_gpuva_gem structure to assign to in each iteration step
+ * @next__: &next &drm_gpuva_gem to store the next step
+ * @obj__: the &drm_gem_object the &drm_gpuva_gem to walk are associated with
  *
- * This iterator walks over all &drm_gpuva structures associated with the
+ * This iterator walks over all &drm_gpuva_gem structures associated with the
  * &drm_gem_object. It is implemented with list_for_each_entry_safe(), hence
  * it is save against removal of elements.
  */
-#define drm_gem_for_each_gpuva_safe(entry__, next__, obj__) \
-	list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list, gem.entry)
+#define drm_gem_for_each_gpuva_gem_safe(entry__, next__, obj__) \
+	list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list, list.entry.gem)
+
+/**
+ * drm_gem_for_each_gpuva() - iterator to walk over a list of &drm_gpuva
+ * @va__: &drm_gpuva structure to assign to in each iteration step
+ * @vm_bo__: the &drm_gpuva_gem representing the @mgr__ and @obj__ combination
+ * @mgr__: the &drm_gpuva_manager the &drm_gpuvas to walk are associated with
+ * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
+ *
+ * This iterator walks over all &drm_gpuva structures associated with the
+ * &drm_gpuva_manager and &drm_gem_object.
+ */
+#define drm_gem_for_each_gpuva(va__, vm_bo__, mgr__, obj__) \
+	for (vm_bo__ = drm_gpuva_gem_find(mgr__, obj__), \
+	     va__ = vm_bo__ ? list_first_entry(&vm_bo__->list.gpuva, typeof(*va__), gem.entry) : NULL; \
+	     va__ && !list_entry_is_head(va__, &vm_bo__->list.gpuva, gem.entry); \
+	     va__ = list_next_entry(va__, gem.entry))
 
 #endif /* __DRM_GEM_H__ */
diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h
index ed8d50200cc3..693e2da3f425 100644
--- a/include/drm/drm_gpuva_mgr.h
+++ b/include/drm/drm_gpuva_mgr.h
@@ -26,12 +26,16 @@
  */
 
 #include <linux/list.h>
+#include <linux/dma-resv.h>
+#include <linux/maple_tree.h>
 #include <linux/rbtree.h>
 #include <linux/types.h>
 
 #include <drm/drm_gem.h>
+#include <drm/drm_exec.h>
 
 struct drm_gpuva_manager;
+struct drm_gpuva_gem;
 struct drm_gpuva_fn_ops;
 
 /**
@@ -140,7 +144,7 @@ struct drm_gpuva {
 int drm_gpuva_insert(struct drm_gpuva_manager *mgr, struct drm_gpuva *va);
 void drm_gpuva_remove(struct drm_gpuva *va);
 
-void drm_gpuva_link(struct drm_gpuva *va);
+void drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo);
 void drm_gpuva_unlink(struct drm_gpuva *va);
 
 struct drm_gpuva *drm_gpuva_find(struct drm_gpuva_manager *mgr,
@@ -240,15 +244,137 @@ struct drm_gpuva_manager {
 	 * @ops: &drm_gpuva_fn_ops providing the split/merge steps to drivers
 	 */
 	const struct drm_gpuva_fn_ops *ops;
+
+	/**
+	 * @d_obj: Dummy GEM object; used internally to pass the GPU VMs
+	 * dma-resv to &drm_exec.
+	 */
+	struct drm_gem_object d_obj;
+
+	/**
+	 * @resv: the &dma_resv for &drm_gem_objects mapped in this GPU VA
+	 * space
+	 */
+	struct dma_resv *resv;
+
+	/**
+	 * @exec: the &drm_exec helper to lock external &drm_gem_objects
+	 */
+	struct drm_exec exec;
+
+	/**
+	 * @mt_ext: &maple_tree storing external &drm_gem_objects
+	 */
+	struct maple_tree mt_ext;
+
+	/**
+	 * @evict: structure holding the evict list and evict list lock
+	 */
+	struct {
+		/**
+		 * @list: &list_head storing &drm_gem_objects currently being
+		 * evicted
+		 */
+		struct list_head list;
+
+		/**
+		 * @lock: spinlock to protect the evict list against concurrent
+		 * insertion / removal of different &drm_gpuva_gems
+		 */
+		spinlock_t lock;
+	} evict;
 };
 
 void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
+			    struct drm_device *drm,
 			    const char *name,
 			    u64 start_offset, u64 range,
 			    u64 reserve_offset, u64 reserve_range,
 			    const struct drm_gpuva_fn_ops *ops);
 void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr);
 
+/**
+ * DRM_GPUVA_EXEC - returns the &drm_gpuva_managers &drm_exec instance
+ * @mgr: the &drm_gpuva_manager to return the &drm_exec instance for
+ */
+#define DRM_GPUVA_EXEC(mgr)	(&(mgr)->exec)
+
+int drm_gpuva_manager_lock_extra(struct drm_gpuva_manager *mgr,
+				 int (*fn)(struct drm_gpuva_manager *mgr,
+					   void *priv, unsigned int num_fences),
+				 void *priv,
+				 unsigned int num_fences,
+				 bool interruptible);
+
+int drm_gpuva_manager_lock_array(struct drm_gpuva_manager *mgr,
+				 struct drm_gem_object **objs,
+				 unsigned int num_objs,
+				 unsigned int num_fences,
+				 bool interruptible);
+
+/**
+ * drm_gpuva_manager_lock() - lock all dma-resv of all associated BOs
+ * @mgr: the &drm_gpuva_manager
+ * @num_fences: the amount of &dma_fences to reserve
+ * @interruptible: sleep interruptible if waiting
+ *
+ * Acquires all dma-resv locks of all &drm_gem_objects the given
+ * &drm_gpuva_manager contains mappings of.
+ *
+ * Returns: 0 on success, negative error code on failure.
+ */
+static inline int
+drm_gpuva_manager_lock(struct drm_gpuva_manager *mgr,
+		       unsigned int num_fences,
+		       bool interruptible)
+{
+	return drm_gpuva_manager_lock_extra(mgr, NULL, NULL, num_fences,
+					    interruptible);
+}
+
+/**
+ * drm_gpuva_manager_unlock() - unlock all dma-resv of all associated BOs
+ * @mgr: the &drm_gpuva_manager
+ *
+ * Releases all dma-resv locks of all &drm_gem_objects previously acquired
+ * through drm_gpuva_manager_lock() or its variants.
+ */
+static inline void
+drm_gpuva_manager_unlock(struct drm_gpuva_manager *mgr)
+{
+	drm_exec_fini(&mgr->exec);
+}
+
+int drm_gpuva_manager_validate(struct drm_gpuva_manager *mgr);
+void drm_gpuva_manager_resv_add_fence(struct drm_gpuva_manager *mgr,
+				      struct dma_fence *fence,
+				      enum dma_resv_usage private_usage,
+				      enum dma_resv_usage extobj_usage);
+
+int drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
+			    struct drm_gem_object *obj);
+void drm_gpuva_extobj_get(struct drm_gpuva_manager *mgr,
+			  struct drm_gem_object *obj);
+void drm_gpuva_extobj_put(struct drm_gpuva_manager *mgr,
+			  struct drm_gem_object *obj);
+
+/**
+ * drm_gpuva_is_extobj() - indicates whether the given &drm_gem_object is an
+ * external object
+ * @mgr: the &drm_gpuva_manager to check
+ * @obj: the &drm_gem_object to check
+ *
+ * Returns: true if the &drm_gem_object's &dma_resv differs from the
+ * &drm_gpuva_manager's &dma_resv, false otherwise.
+ */
+static inline bool drm_gpuva_is_extobj(struct drm_gpuva_manager *mgr,
+				       struct drm_gem_object *obj)
+{
+	return obj && obj->resv != mgr->resv;
+}
+
 static inline struct drm_gpuva *
 __drm_gpuva_next(struct drm_gpuva *va)
 {
@@ -327,6 +453,138 @@ __drm_gpuva_next(struct drm_gpuva *va)
 #define drm_gpuva_for_each_va_safe(va__, next__, mgr__) \
 	list_for_each_entry_safe(va__, next__, &(mgr__)->rb.list, rb.entry)
 
+/**
+ * struct drm_gpuva_gem - structure representing a &drm_gpuva_manager and
+ * &drm_gem_object combination
+ *
+ * This structure is an abstraction representing a &drm_gpuva_manager and
+ * &drm_gem_object combination. It serves as an indirection to accelerate
+ * iterating all &drm_gpuvas within a &drm_gpuva_manager backed by the same
+ * &drm_gem_object.
+ *
+ * Furthermore, it is used to cache evicted GEM objects for a certain GPU-VM to
+ * accelerate validation.
+ *
+ * Typically, drivers want to create an instance of a struct drm_gpuva_gem once
+ * a GEM object is first mapped in a GPU-VM and release the instance once the
+ * last mapping of the GEM object in this GPU-VM is unmapped.
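+ *
+ * A rough lifecycle sketch (illustrative only):
+ *
+ *	vm_bo = drm_gpuva_gem_obtain(mgr, obj);	// first mapping of obj
+ *	drm_gpuva_link(va, vm_bo);		// for every new &drm_gpuva
+ *	...
+ *	drm_gpuva_unlink(va);			// drops the link's reference
+ *	drm_gpuva_gem_put(vm_bo);		// drop the obtain() reference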
+ */
+struct drm_gpuva_gem {
+
+	/**
+	 * @mgr: The &drm_gpuva_manager the @obj is mapped in.
+	 */
+	struct drm_gpuva_manager *mgr;
+
+	/**
+	 * @obj: The &drm_gem_object being mapped in the @mgr.
+	 */
+	struct drm_gem_object *obj;
+
+	/**
+	 * @kref: The reference count for this &drm_gpuva_gem.
+	 */
+	struct kref kref;
+
+	/**
+	 * @list: Structure containing all &list_heads.
+	 */
+	struct {
+		/**
+		 * @gpuva: The list of linked &drm_gpuvas.
+		 */
+		struct list_head gpuva;
+
+		/**
+		 * @entry: Structure containing all &list_heads serving as
+		 * entry.
+		 */
+		struct {
+			/**
+			 * @gem: List entry to attach to the &drm_gem_objects
+			 * gpuva list.
+			 */
+			struct list_head gem;
+
+			/**
+			 * @evict: List entry to attach to the
+			 * &drm_gpuva_managers evict list.
+			 */
+			struct list_head evict;
+		} entry;
+	} list;
+};
+
+struct drm_gpuva_gem *
+drm_gpuva_gem_obtain(struct drm_gpuva_manager *mgr,
+		     struct drm_gem_object *obj);
+struct drm_gpuva_gem *
+drm_gpuva_gem_obtain_prealloc(struct drm_gpuva_manager *mgr,
+			      struct drm_gem_object *obj,
+			      struct drm_gpuva_gem *__vm_bo);
+
+struct drm_gpuva_gem *
+drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
+		   struct drm_gem_object *obj);
+
+void drm_gpuva_gem_evict(struct drm_gem_object *obj, bool evict);
+
+struct drm_gpuva_gem *
+drm_gpuva_gem_create(struct drm_gpuva_manager *mgr,
+		     struct drm_gem_object *obj);
+void drm_gpuva_gem_destroy(struct kref *kref);
+
+/**
+ * drm_gpuva_gem_get() - acquire a struct drm_gpuva_gem reference
+ * @vm_bo: the &drm_gpuva_gem to acquire the reference of
+ *
+ * This function acquires an additional reference to @vm_bo. It is illegal to
+ * call this without already holding a reference. No locks required.
+ */
+static inline struct drm_gpuva_gem *
+drm_gpuva_gem_get(struct drm_gpuva_gem *vm_bo)
+{
+	kref_get(&vm_bo->kref);
+	return vm_bo;
+}
+
+/**
+ * drm_gpuva_gem_put() - drop a struct drm_gpuva_gem reference
+ * @vm_bo: the &drm_gpuva_gem to release the reference of
+ *
+ * This releases a reference to @vm_bo.
+ */
+static inline void
+drm_gpuva_gem_put(struct drm_gpuva_gem *vm_bo)
+{
+	kref_put(&vm_bo->kref, drm_gpuva_gem_destroy);
+}
+
+/**
+ * drm_gpuva_gem_for_each_va() - iterator to walk over a list of &drm_gpuva
+ * @va__: &drm_gpuva structure to assign to in each iteration step
+ * @vm_bo__: the &drm_gpuva_gem the &drm_gpuva to walk are associated with
+ *
+ * This iterator walks over all &drm_gpuva structures associated with the
+ * &drm_gpuva_gem.
+ */
+#define drm_gpuva_gem_for_each_va(va__, vm_bo__) \
+	list_for_each_entry(va__, &(vm_bo__)->list.gpuva, gem.entry)
+
+/**
+ * drm_gpuva_gem_for_each_va_safe() - iterator to safely walk over a list of
+ * &drm_gpuva
+ * @va__: &drm_gpuva structure to assign to in each iteration step
+ * @next__: &next &drm_gpuva to store the next step
+ * @vm_bo__: the &drm_gpuva_gem the &drm_gpuva to walk are associated with
+ *
+ * This iterator walks over all &drm_gpuva structures associated with the
+ * &drm_gpuva_gem. It is implemented with list_for_each_entry_safe(), hence
+ * it is safe against removal of elements.
+ */
+#define drm_gpuva_gem_for_each_va_safe(va__, next__, vm_bo__) \
+	list_for_each_entry_safe(va__, next__, &(vm_bo__)->list.gpuva, gem.entry)
+
 /**
  * enum drm_gpuva_op_type - GPU VA operation type
  *
@@ -641,6 +899,30 @@ struct drm_gpuva_fn_ops {
 	 */
 	void (*op_free)(struct drm_gpuva_op *op);
 
+	/**
+	 * @vm_bo_alloc: called when the &drm_gpuva_manager allocates
+	 * a struct drm_gpuva_gem
+	 *
+	 * Some drivers may want to embed struct drm_gpuva_gem into driver
+	 * specific structures. By implementing this callback drivers can
+	 * allocate memory accordingly.
+	 *
+	 * This callback is optional.
+	 */
+	struct drm_gpuva_gem *(*vm_bo_alloc)(void);
+
+	/**
+	 * @vm_bo_free: called when the &drm_gpuva_manager frees a
+	 * struct drm_gpuva_gem
+	 *
+	 * Some drivers may want to embed struct drm_gpuva_gem into driver
+	 * specific structures. By implementing this callback drivers can
+	 * free the previously allocated memory accordingly.
+	 *
+	 * This callback is optional.
+	 */
+	void (*vm_bo_free)(struct drm_gpuva_gem *vm_bo);
+
 	/**
 	 * @sm_step_map: called from &drm_gpuva_sm_map to finally insert the
 	 * mapping once all previous steps were completed
@@ -684,6 +966,17 @@ struct drm_gpuva_fn_ops {
 	 * used.
 	 */
 	int (*sm_step_unmap)(struct drm_gpuva_op *op, void *priv);
+
+	/**
+	 * @bo_validate: called from drm_gpuva_manager_validate()
+	 *
+	 * Drivers receive this callback for every evicted &drm_gem_object being
+	 * mapped in the corresponding &drm_gpuva_manager.
+	 *
+	 * Typically, drivers would call their driver specific variant of
+	 * ttm_bo_validate() from within this callback.
+	 */
+	int (*bo_validate)(struct drm_gem_object *obj);
 };
 
 int drm_gpuva_sm_map(struct drm_gpuva_manager *mgr, void *priv,
@@ -696,11 +989,18 @@ int drm_gpuva_sm_unmap(struct drm_gpuva_manager *mgr, void *priv,
 void drm_gpuva_map(struct drm_gpuva_manager *mgr,
 		   struct drm_gpuva *va,
 		   struct drm_gpuva_op_map *op);
+void drm_gpuva_map_get(struct drm_gpuva_manager *mgr,
+		       struct drm_gpuva *va,
+		       struct drm_gpuva_op_map *op);
 
 void drm_gpuva_remap(struct drm_gpuva *prev,
 		     struct drm_gpuva *next,
 		     struct drm_gpuva_op_remap *op);
+void drm_gpuva_remap_get(struct drm_gpuva *prev,
+			 struct drm_gpuva *next,
+			 struct drm_gpuva_op_remap *op);
 
 void drm_gpuva_unmap(struct drm_gpuva_op_unmap *op);
+void drm_gpuva_unmap_put(struct drm_gpuva_op_unmap *op);
 
 #endif /* __DRM_GPUVA_MGR_H__ */
-- 
2.41.0


^ permalink raw reply related	[flat|nested] 88+ messages in thread

* [Nouveau] [PATCH drm-misc-next 3/3] drm/nouveau: gpuva mgr dma-resv/extobj handling, GEM validation
  2023-08-20 21:53 ` Danilo Krummrich
  (?)
@ 2023-08-20 21:53   ` Danilo Krummrich
  -1 siblings, 0 replies; 88+ messages in thread
From: Danilo Krummrich @ 2023-08-20 21:53 UTC (permalink / raw)
  To: airlied, daniel, matthew.brost, thomas.hellstrom, sarah.walker,
	donald.robson, boris.brezillon, christian.koenig, faith.ekstrand,
	bskeggs, Liam.Howlett
  Cc: nouveau, linux-kernel, dri-devel

Make use of the DRM GPUVA managers GPU-VM common dma-resv, external GEM
object tracking, dma-resv locking, evicted GEM object tracking and
validation features.
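
With this, the exec and bind job submission paths boil down to roughly the
following pattern (simplified sketch, error handling omitted):

	drm_gpuva_manager_lock(&uvmm->umgr, 1, false);
	drm_gpuva_manager_validate(&uvmm->umgr);
	/* queue the job */
	drm_gpuva_manager_resv_add_fence(&uvmm->umgr, job->done_fence,
					 job->resv_usage, job->resv_usage);
	drm_gpuva_manager_unlock(&uvmm->umgr);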

Signed-off-by: Danilo Krummrich <dakr@redhat.com>
---
 drivers/gpu/drm/nouveau/nouveau_bo.c    |   4 +-
 drivers/gpu/drm/nouveau/nouveau_exec.c  |  51 ++-----
 drivers/gpu/drm/nouveau/nouveau_gem.c   |   4 +-
 drivers/gpu/drm/nouveau/nouveau_sched.h |   2 -
 drivers/gpu/drm/nouveau/nouveau_uvmm.c  | 191 +++++++++++++++++-------
 5 files changed, 150 insertions(+), 102 deletions(-)

diff --git a/drivers/gpu/drm/nouveau/nouveau_bo.c b/drivers/gpu/drm/nouveau/nouveau_bo.c
index 19cab37ac69c..64f50adb2856 100644
--- a/drivers/gpu/drm/nouveau/nouveau_bo.c
+++ b/drivers/gpu/drm/nouveau/nouveau_bo.c
@@ -1060,17 +1060,18 @@ nouveau_bo_move(struct ttm_buffer_object *bo, bool evict,
 {
 	struct nouveau_drm *drm = nouveau_bdev(bo->bdev);
 	struct nouveau_bo *nvbo = nouveau_bo(bo);
+	struct drm_gem_object *obj = &bo->base;
 	struct ttm_resource *old_reg = bo->resource;
 	struct nouveau_drm_tile *new_tile = NULL;
 	int ret = 0;
 
-
 	if (new_reg->mem_type == TTM_PL_TT) {
 		ret = nouveau_ttm_tt_bind(bo->bdev, bo->ttm, new_reg);
 		if (ret)
 			return ret;
 	}
 
+	drm_gpuva_gem_evict(obj, evict);
 	nouveau_bo_move_ntfy(bo, new_reg);
 	ret = ttm_bo_wait_ctx(bo, ctx);
 	if (ret)
@@ -1135,6 +1136,7 @@ nouveau_bo_move(struct ttm_buffer_object *bo, bool evict,
 out_ntfy:
 	if (ret) {
 		nouveau_bo_move_ntfy(bo, bo->resource);
+		drm_gpuva_gem_evict(obj, !evict);
 	}
 	return ret;
 }
diff --git a/drivers/gpu/drm/nouveau/nouveau_exec.c b/drivers/gpu/drm/nouveau/nouveau_exec.c
index 0f927adda4ed..fadb20824b26 100644
--- a/drivers/gpu/drm/nouveau/nouveau_exec.c
+++ b/drivers/gpu/drm/nouveau/nouveau_exec.c
@@ -1,7 +1,5 @@
 // SPDX-License-Identifier: MIT
 
-#include <drm/drm_exec.h>
-
 #include "nouveau_drv.h"
 #include "nouveau_gem.h"
 #include "nouveau_mem.h"
@@ -91,9 +89,6 @@ nouveau_exec_job_submit(struct nouveau_job *job)
 	struct nouveau_exec_job *exec_job = to_nouveau_exec_job(job);
 	struct nouveau_cli *cli = job->cli;
 	struct nouveau_uvmm *uvmm = nouveau_cli_uvmm(cli);
-	struct drm_exec *exec = &job->exec;
-	struct drm_gem_object *obj;
-	unsigned long index;
 	int ret;
 
 	ret = nouveau_fence_new(&exec_job->fence);
@@ -101,52 +96,30 @@ nouveau_exec_job_submit(struct nouveau_job *job)
 		return ret;
 
 	nouveau_uvmm_lock(uvmm);
-	drm_exec_init(exec, DRM_EXEC_INTERRUPTIBLE_WAIT |
-			    DRM_EXEC_IGNORE_DUPLICATES);
-	drm_exec_until_all_locked(exec) {
-		struct drm_gpuva *va;
-
-		drm_gpuva_for_each_va(va, &uvmm->umgr) {
-			if (unlikely(va == &uvmm->umgr.kernel_alloc_node))
-				continue;
-
-			ret = drm_exec_prepare_obj(exec, va->gem.obj, 1);
-			drm_exec_retry_on_contention(exec);
-			if (ret)
-				goto err_uvmm_unlock;
-		}
+	ret = drm_gpuva_manager_lock(&uvmm->umgr, 1, false);
+	if (ret) {
+		nouveau_uvmm_unlock(uvmm);
+		return ret;
 	}
 	nouveau_uvmm_unlock(uvmm);
 
-	drm_exec_for_each_locked_object(exec, index, obj) {
-		struct nouveau_bo *nvbo = nouveau_gem_object(obj);
-
-		ret = nouveau_bo_validate(nvbo, true, false);
-		if (ret)
-			goto err_exec_fini;
+	ret = drm_gpuva_manager_validate(&uvmm->umgr);
+	if (ret) {
+		drm_gpuva_manager_unlock(&uvmm->umgr);
+		return ret;
 	}
 
 	return 0;
-
-err_uvmm_unlock:
-	nouveau_uvmm_unlock(uvmm);
-err_exec_fini:
-	drm_exec_fini(exec);
-	return ret;
-
 }
 
 static void
 nouveau_exec_job_armed_submit(struct nouveau_job *job)
 {
-	struct drm_exec *exec = &job->exec;
-	struct drm_gem_object *obj;
-	unsigned long index;
-
-	drm_exec_for_each_locked_object(exec, index, obj)
-		dma_resv_add_fence(obj->resv, job->done_fence, job->resv_usage);
+	struct nouveau_uvmm *uvmm = nouveau_cli_uvmm(job->cli);
 
-	drm_exec_fini(exec);
+	drm_gpuva_manager_resv_add_fence(&uvmm->umgr, job->done_fence,
+					 job->resv_usage, job->resv_usage);
+	drm_gpuva_manager_unlock(&uvmm->umgr);
 }
 
 static struct dma_fence *
diff --git a/drivers/gpu/drm/nouveau/nouveau_gem.c b/drivers/gpu/drm/nouveau/nouveau_gem.c
index f39360870c70..dec34a88f8b2 100644
--- a/drivers/gpu/drm/nouveau/nouveau_gem.c
+++ b/drivers/gpu/drm/nouveau/nouveau_gem.c
@@ -111,7 +111,7 @@ nouveau_gem_object_open(struct drm_gem_object *gem, struct drm_file *file_priv)
 	if (vmm->vmm.object.oclass < NVIF_CLASS_VMM_NV50)
 		return 0;
 
-	if (nvbo->no_share && uvmm && &uvmm->resv != nvbo->bo.base.resv)
+	if (nvbo->no_share && uvmm && uvmm->umgr.resv != nvbo->bo.base.resv)
 		return -EPERM;
 
 	ret = ttm_bo_reserve(&nvbo->bo, false, false, NULL);
@@ -245,7 +245,7 @@ nouveau_gem_new(struct nouveau_cli *cli, u64 size, int align, uint32_t domain,
 		if (unlikely(!uvmm))
 			return -EINVAL;
 
-		resv = &uvmm->resv;
+		resv = uvmm->umgr.resv;
 	}
 
 	if (!(domain & (NOUVEAU_GEM_DOMAIN_VRAM | NOUVEAU_GEM_DOMAIN_GART)))
diff --git a/drivers/gpu/drm/nouveau/nouveau_sched.h b/drivers/gpu/drm/nouveau/nouveau_sched.h
index 27ac19792597..ccedc80685b3 100644
--- a/drivers/gpu/drm/nouveau/nouveau_sched.h
+++ b/drivers/gpu/drm/nouveau/nouveau_sched.h
@@ -5,7 +5,6 @@
 
 #include <linux/types.h>
 
-#include <drm/drm_exec.h>
 #include <drm/gpu_scheduler.h>
 
 #include "nouveau_drv.h"
@@ -54,7 +53,6 @@ struct nouveau_job {
 	struct drm_file *file_priv;
 	struct nouveau_cli *cli;
 
-	struct drm_exec exec;
 	enum dma_resv_usage resv_usage;
 	struct dma_fence *done_fence;
 
diff --git a/drivers/gpu/drm/nouveau/nouveau_uvmm.c b/drivers/gpu/drm/nouveau/nouveau_uvmm.c
index 3a1e8538f205..ce1975cca8a9 100644
--- a/drivers/gpu/drm/nouveau/nouveau_uvmm.c
+++ b/drivers/gpu/drm/nouveau/nouveau_uvmm.c
@@ -71,6 +71,7 @@ struct bind_job_op {
 		u32 handle;
 		u64 offset;
 		struct drm_gem_object *obj;
+		struct drm_gpuva_gem *vm_bo;
 	} gem;
 
 	struct nouveau_uvma_region *reg;
@@ -436,8 +437,10 @@ nouveau_uvma_region_complete(struct nouveau_uvma_region *reg)
 static void
 op_map_prepare_unwind(struct nouveau_uvma *uvma)
 {
+	struct drm_gpuva *va = &uvma->va;
 	nouveau_uvma_gem_put(uvma);
-	drm_gpuva_remove(&uvma->va);
+	drm_gpuva_remove(va);
+	drm_gpuva_extobj_put(va->mgr, va->gem.obj);
 	nouveau_uvma_free(uvma);
 }
 
@@ -445,6 +448,7 @@ static void
 op_unmap_prepare_unwind(struct drm_gpuva *va)
 {
 	drm_gpuva_insert(va->mgr, va);
+	drm_gpuva_extobj_get(va->mgr, va->gem.obj);
 }
 
 static void
@@ -466,14 +470,17 @@ nouveau_uvmm_sm_prepare_unwind(struct nouveau_uvmm *uvmm,
 			break;
 		case DRM_GPUVA_OP_REMAP: {
 			struct drm_gpuva_op_remap *r = &op->remap;
+			struct drm_gpuva *va = r->unmap->va;
 
+			drm_gpuva_extobj_get(va->mgr, va->gem.obj);
 			if (r->next)
 				op_map_prepare_unwind(new->next);
 
 			if (r->prev)
 				op_map_prepare_unwind(new->prev);
 
-			op_unmap_prepare_unwind(r->unmap->va);
+			op_unmap_prepare_unwind(va);
+			drm_gpuva_extobj_put(va->mgr, va->gem.obj);
 			break;
 		}
 		case DRM_GPUVA_OP_UNMAP:
@@ -589,7 +596,7 @@ op_map_prepare(struct nouveau_uvmm *uvmm,
 	uvma->region = args->region;
 	uvma->kind = args->kind;
 
-	drm_gpuva_map(&uvmm->umgr, &uvma->va, op);
+	drm_gpuva_map_get(&uvmm->umgr, &uvma->va, op);
 
 	/* Keep a reference until this uvma is destroyed. */
 	nouveau_uvma_gem_get(uvma);
@@ -601,7 +608,7 @@ op_map_prepare(struct nouveau_uvmm *uvmm,
 static void
 op_unmap_prepare(struct drm_gpuva_op_unmap *u)
 {
-	drm_gpuva_unmap(u);
+	drm_gpuva_unmap_put(u);
 }
 
 static int
@@ -632,6 +639,7 @@ nouveau_uvmm_sm_prepare(struct nouveau_uvmm *uvmm,
 					goto unwind;
 				}
 			}
+
 			break;
 		}
 		case DRM_GPUVA_OP_REMAP: {
@@ -644,6 +652,7 @@ nouveau_uvmm_sm_prepare(struct nouveau_uvmm *uvmm,
 			u64 urange = va->va.range;
 			u64 uend = ustart + urange;
 
+			drm_gpuva_extobj_get(va->mgr, va->gem.obj);
 			op_unmap_prepare(r->unmap);
 
 			if (r->prev) {
@@ -668,6 +677,7 @@ nouveau_uvmm_sm_prepare(struct nouveau_uvmm *uvmm,
 				if (args)
 					vmm_get_end = ustart;
 			}
+			drm_gpuva_extobj_put(va->mgr, va->gem.obj);
 
 			if (args && (r->prev && r->next))
 				vmm_get_start = vmm_get_end = 0;
@@ -1112,22 +1122,34 @@ bind_validate_region(struct nouveau_job *job)
 }
 
 static void
-bind_link_gpuvas(struct drm_gpuva_ops *ops, struct nouveau_uvma_prealloc *new)
+bind_link_gpuvas(struct bind_job_op *bop)
 {
+	struct nouveau_uvma_prealloc *new = &bop->new;
+	struct drm_gpuva_gem *vm_bo = bop->gem.vm_bo;
+	struct drm_gpuva_ops *ops = bop->ops;
 	struct drm_gpuva_op *op;
 
 	drm_gpuva_for_each_op(op, ops) {
 		switch (op->op) {
 		case DRM_GPUVA_OP_MAP:
-			drm_gpuva_link(&new->map->va);
+			drm_gpuva_link(&new->map->va, vm_bo);
 			break;
-		case DRM_GPUVA_OP_REMAP:
+		case DRM_GPUVA_OP_REMAP: {
+			struct drm_gpuva *va = op->remap.unmap->va;
+			struct drm_gpuva_gem *vm_bo;
+
+			vm_bo = drm_gpuva_gem_find(va->mgr, va->gem.obj);
+			BUG_ON(!vm_bo);
+
 			if (op->remap.prev)
-				drm_gpuva_link(&new->prev->va);
+				drm_gpuva_link(&new->prev->va, vm_bo);
 			if (op->remap.next)
-				drm_gpuva_link(&new->next->va);
-			drm_gpuva_unlink(op->remap.unmap->va);
+				drm_gpuva_link(&new->next->va, vm_bo);
+			drm_gpuva_unlink(va);
+
+			drm_gpuva_gem_put(vm_bo);
 			break;
+		}
 		case DRM_GPUVA_OP_UNMAP:
 			drm_gpuva_unlink(op->unmap.va);
 			break;
@@ -1137,22 +1159,72 @@ bind_link_gpuvas(struct drm_gpuva_ops *ops, struct nouveau_uvma_prealloc *new)
 	}
 }
 
+static int
+bind_lock_extra(struct drm_gpuva_manager *mgr, void *priv,
+		unsigned int num_fences)
+{
+	struct nouveau_uvmm_bind_job *bind_job = priv;
+	struct bind_job_op *op;
+	int ret;
+
+	list_for_each_op(op, &bind_job->ops) {
+		struct drm_gpuva_op *va_op;
+
+		if (IS_ERR_OR_NULL(op->ops))
+			continue;
+
+		drm_gpuva_for_each_op(va_op, op->ops) {
+			struct drm_gem_object *obj = op_gem_obj(va_op);
+
+			if (unlikely(!obj))
+				continue;
+
+			if (va_op->op != DRM_GPUVA_OP_UNMAP)
+				continue;
+
+			ret = drm_exec_prepare_obj(DRM_GPUVA_EXEC(mgr), obj,
+						   num_fences);
+			if (ret)
+				return ret;
+		}
+	}
+
+	return 0;
+}
+
 static int
 nouveau_uvmm_bind_job_submit(struct nouveau_job *job)
 {
 	struct nouveau_uvmm *uvmm = nouveau_cli_uvmm(job->cli);
 	struct nouveau_uvmm_bind_job *bind_job = to_uvmm_bind_job(job);
 	struct nouveau_sched_entity *entity = job->entity;
-	struct drm_exec *exec = &job->exec;
 	struct bind_job_op *op;
 	int ret;
 
 	list_for_each_op(op, &bind_job->ops) {
 		if (op->op == OP_MAP) {
-			op->gem.obj = drm_gem_object_lookup(job->file_priv,
-							    op->gem.handle);
-			if (!op->gem.obj)
+			struct drm_gem_object *obj;
+
+			obj = drm_gem_object_lookup(job->file_priv,
+						    op->gem.handle);
+			if (!obj)
 				return -ENOENT;
+
+			dma_resv_lock(obj->resv, NULL);
+			op->gem.vm_bo = drm_gpuva_gem_obtain(&uvmm->umgr, obj);
+			dma_resv_unlock(obj->resv);
+			if (IS_ERR(op->gem.vm_bo)) {
+				drm_gem_object_put(obj);
+				return PTR_ERR(op->gem.vm_bo);
+			}
+
+			ret = drm_gpuva_extobj_insert(&uvmm->umgr, obj);
+			if (ret) {
+				drm_gem_object_put(obj);
+				return ret;
+			}
+
+			op->gem.obj = obj;
 		}
 
 		ret = bind_validate_op(job, op);
@@ -1286,30 +1358,10 @@ nouveau_uvmm_bind_job_submit(struct nouveau_job *job)
 		}
 	}
 
-	drm_exec_init(exec, DRM_EXEC_INTERRUPTIBLE_WAIT |
-			    DRM_EXEC_IGNORE_DUPLICATES);
-	drm_exec_until_all_locked(exec) {
-		list_for_each_op(op, &bind_job->ops) {
-			struct drm_gpuva_op *va_op;
-
-			if (IS_ERR_OR_NULL(op->ops))
-				continue;
-
-			drm_gpuva_for_each_op(va_op, op->ops) {
-				struct drm_gem_object *obj = op_gem_obj(va_op);
-
-				if (unlikely(!obj))
-					continue;
-
-				ret = drm_exec_prepare_obj(exec, obj, 1);
-				drm_exec_retry_on_contention(exec);
-				if (ret) {
-					op = list_last_op(&bind_job->ops);
-					goto unwind;
-				}
-			}
-		}
-	}
+	ret = drm_gpuva_manager_lock_extra(&uvmm->umgr, bind_lock_extra,
+					   bind_job, 1, false);
+	if (ret)
+		goto unwind_continue;
 
 	list_for_each_op(op, &bind_job->ops) {
 		struct drm_gpuva_op *va_op;
@@ -1363,7 +1415,7 @@ nouveau_uvmm_bind_job_submit(struct nouveau_job *job)
 		case OP_UNMAP_SPARSE:
 		case OP_MAP:
 		case OP_UNMAP:
-			bind_link_gpuvas(op->ops, &op->new);
+			bind_link_gpuvas(op);
 			break;
 		default:
 			break;
@@ -1409,21 +1461,18 @@ nouveau_uvmm_bind_job_submit(struct nouveau_job *job)
 	}
 
 	nouveau_uvmm_unlock(uvmm);
-	drm_exec_fini(exec);
+	drm_gpuva_manager_unlock(&uvmm->umgr);
 	return ret;
 }
 
 static void
 nouveau_uvmm_bind_job_armed_submit(struct nouveau_job *job)
 {
-	struct drm_exec *exec = &job->exec;
-	struct drm_gem_object *obj;
-	unsigned long index;
-
-	drm_exec_for_each_locked_object(exec, index, obj)
-		dma_resv_add_fence(obj->resv, job->done_fence, job->resv_usage);
+	struct nouveau_uvmm *uvmm = nouveau_cli_uvmm(job->cli);
 
-	drm_exec_fini(exec);
+	drm_gpuva_manager_resv_add_fence(&uvmm->umgr, job->done_fence,
+					 job->resv_usage, job->resv_usage);
+	drm_gpuva_manager_unlock(&uvmm->umgr);
 }
 
 static struct dma_fence *
@@ -1510,8 +1559,16 @@ nouveau_uvmm_bind_job_free_work_fn(struct work_struct *work)
 		if (!IS_ERR_OR_NULL(op->ops))
 			drm_gpuva_ops_free(&uvmm->umgr, op->ops);
 
-		if (obj)
+		if (!IS_ERR_OR_NULL(op->gem.vm_bo)) {
+			dma_resv_lock(obj->resv, NULL);
+			drm_gpuva_gem_put(op->gem.vm_bo);
+			dma_resv_unlock(obj->resv);
+		}
+
+		if (obj) {
+			drm_gpuva_extobj_put(&uvmm->umgr, obj);
 			drm_gem_object_put(obj);
+		}
 	}
 
 	spin_lock(&entity->job.list.lock);
@@ -1775,15 +1832,18 @@ void
 nouveau_uvmm_bo_map_all(struct nouveau_bo *nvbo, struct nouveau_mem *mem)
 {
 	struct drm_gem_object *obj = &nvbo->bo.base;
+	struct drm_gpuva_gem *vm_bo;
 	struct drm_gpuva *va;
 
 	dma_resv_assert_held(obj->resv);
 
-	drm_gem_for_each_gpuva(va, obj) {
-		struct nouveau_uvma *uvma = uvma_from_va(va);
+	drm_gem_for_each_gpuva_gem(vm_bo, obj) {
+		drm_gpuva_gem_for_each_va(va, vm_bo) {
+			struct nouveau_uvma *uvma = uvma_from_va(va);
 
-		nouveau_uvma_map(uvma, mem);
-		drm_gpuva_invalidate(va, false);
+			nouveau_uvma_map(uvma, mem);
+			drm_gpuva_invalidate(va, false);
+		}
 	}
 }
 
@@ -1791,18 +1851,33 @@ void
 nouveau_uvmm_bo_unmap_all(struct nouveau_bo *nvbo)
 {
 	struct drm_gem_object *obj = &nvbo->bo.base;
+	struct drm_gpuva_gem *vm_bo;
 	struct drm_gpuva *va;
 
 	dma_resv_assert_held(obj->resv);
 
-	drm_gem_for_each_gpuva(va, obj) {
-		struct nouveau_uvma *uvma = uvma_from_va(va);
+	drm_gem_for_each_gpuva_gem(vm_bo, obj) {
+		drm_gpuva_gem_for_each_va(va, vm_bo) {
+			struct nouveau_uvma *uvma = uvma_from_va(va);
 
-		nouveau_uvma_unmap(uvma);
-		drm_gpuva_invalidate(va, true);
+			nouveau_uvma_unmap(uvma);
+			drm_gpuva_invalidate(va, true);
+		}
 	}
 }
 
+static int
+nouveau_uvmm_bo_validate(struct drm_gem_object *obj)
+{
+	struct nouveau_bo *nvbo = nouveau_gem_object(obj);
+
+	return nouveau_bo_validate(nvbo, true, false);
+}
+
+static const struct drm_gpuva_fn_ops nouveau_uvmm_gpuva_ops = {
+	.bo_validate = nouveau_uvmm_bo_validate,
+};
+
 int
 nouveau_uvmm_init(struct nouveau_uvmm *uvmm, struct nouveau_cli *cli,
 		  u64 kernel_managed_addr, u64 kernel_managed_size)
@@ -1835,11 +1910,11 @@ nouveau_uvmm_init(struct nouveau_uvmm *uvmm, struct nouveau_cli *cli,
 	uvmm->kernel_managed_addr = kernel_managed_addr;
 	uvmm->kernel_managed_size = kernel_managed_size;
 
-	drm_gpuva_manager_init(&uvmm->umgr, cli->name,
+	drm_gpuva_manager_init(&uvmm->umgr, cli->drm->dev, cli->name,
 			       NOUVEAU_VA_SPACE_START,
 			       NOUVEAU_VA_SPACE_END,
 			       kernel_managed_addr, kernel_managed_size,
-			       NULL);
+			       &nouveau_uvmm_gpuva_ops);
 
 	ret = nvif_vmm_ctor(&cli->mmu, "uvmm",
 			    cli->vmm.vmm.object.oclass, RAW,
-- 
2.41.0


^ permalink raw reply related	[flat|nested] 88+ messages in thread

* [PATCH drm-misc-next 3/3] drm/nouveau: gpuva mgr dma-resv/extobj handling, GEM validation
@ 2023-08-20 21:53   ` Danilo Krummrich
  0 siblings, 0 replies; 88+ messages in thread
From: Danilo Krummrich @ 2023-08-20 21:53 UTC (permalink / raw)
  To: airlied, daniel, matthew.brost, thomas.hellstrom, sarah.walker,
	donald.robson, boris.brezillon, christian.koenig, faith.ekstrand,
	bskeggs, Liam.Howlett
  Cc: nouveau, Danilo Krummrich, linux-kernel, dri-devel

Make use of the DRM GPUVA managers GPU-VM common dma-resv, external GEM
object tracking, dma-resv locking, evicted GEM object tracking and
validation features.

Signed-off-by: Danilo Krummrich <dakr@redhat.com>
---
 drivers/gpu/drm/nouveau/nouveau_bo.c    |   4 +-
 drivers/gpu/drm/nouveau/nouveau_exec.c  |  51 ++-----
 drivers/gpu/drm/nouveau/nouveau_gem.c   |   4 +-
 drivers/gpu/drm/nouveau/nouveau_sched.h |   2 -
 drivers/gpu/drm/nouveau/nouveau_uvmm.c  | 191 +++++++++++++++++-------
 5 files changed, 150 insertions(+), 102 deletions(-)

diff --git a/drivers/gpu/drm/nouveau/nouveau_bo.c b/drivers/gpu/drm/nouveau/nouveau_bo.c
index 19cab37ac69c..64f50adb2856 100644
--- a/drivers/gpu/drm/nouveau/nouveau_bo.c
+++ b/drivers/gpu/drm/nouveau/nouveau_bo.c
@@ -1060,17 +1060,18 @@ nouveau_bo_move(struct ttm_buffer_object *bo, bool evict,
 {
 	struct nouveau_drm *drm = nouveau_bdev(bo->bdev);
 	struct nouveau_bo *nvbo = nouveau_bo(bo);
+	struct drm_gem_object *obj = &bo->base;
 	struct ttm_resource *old_reg = bo->resource;
 	struct nouveau_drm_tile *new_tile = NULL;
 	int ret = 0;
 
-
 	if (new_reg->mem_type == TTM_PL_TT) {
 		ret = nouveau_ttm_tt_bind(bo->bdev, bo->ttm, new_reg);
 		if (ret)
 			return ret;
 	}
 
+	drm_gpuva_gem_evict(obj, evict);
 	nouveau_bo_move_ntfy(bo, new_reg);
 	ret = ttm_bo_wait_ctx(bo, ctx);
 	if (ret)
@@ -1135,6 +1136,7 @@ nouveau_bo_move(struct ttm_buffer_object *bo, bool evict,
 out_ntfy:
 	if (ret) {
 		nouveau_bo_move_ntfy(bo, bo->resource);
+		drm_gpuva_gem_evict(obj, !evict);
 	}
 	return ret;
 }
diff --git a/drivers/gpu/drm/nouveau/nouveau_exec.c b/drivers/gpu/drm/nouveau/nouveau_exec.c
index 0f927adda4ed..fadb20824b26 100644
--- a/drivers/gpu/drm/nouveau/nouveau_exec.c
+++ b/drivers/gpu/drm/nouveau/nouveau_exec.c
@@ -1,7 +1,5 @@
 // SPDX-License-Identifier: MIT
 
-#include <drm/drm_exec.h>
-
 #include "nouveau_drv.h"
 #include "nouveau_gem.h"
 #include "nouveau_mem.h"
@@ -91,9 +89,6 @@ nouveau_exec_job_submit(struct nouveau_job *job)
 	struct nouveau_exec_job *exec_job = to_nouveau_exec_job(job);
 	struct nouveau_cli *cli = job->cli;
 	struct nouveau_uvmm *uvmm = nouveau_cli_uvmm(cli);
-	struct drm_exec *exec = &job->exec;
-	struct drm_gem_object *obj;
-	unsigned long index;
 	int ret;
 
 	ret = nouveau_fence_new(&exec_job->fence);
@@ -101,52 +96,30 @@ nouveau_exec_job_submit(struct nouveau_job *job)
 		return ret;
 
 	nouveau_uvmm_lock(uvmm);
-	drm_exec_init(exec, DRM_EXEC_INTERRUPTIBLE_WAIT |
-			    DRM_EXEC_IGNORE_DUPLICATES);
-	drm_exec_until_all_locked(exec) {
-		struct drm_gpuva *va;
-
-		drm_gpuva_for_each_va(va, &uvmm->umgr) {
-			if (unlikely(va == &uvmm->umgr.kernel_alloc_node))
-				continue;
-
-			ret = drm_exec_prepare_obj(exec, va->gem.obj, 1);
-			drm_exec_retry_on_contention(exec);
-			if (ret)
-				goto err_uvmm_unlock;
-		}
+	ret = drm_gpuva_manager_lock(&uvmm->umgr, 1, false);
+	if (ret) {
+		nouveau_uvmm_unlock(uvmm);
+		return ret;
 	}
 	nouveau_uvmm_unlock(uvmm);
 
-	drm_exec_for_each_locked_object(exec, index, obj) {
-		struct nouveau_bo *nvbo = nouveau_gem_object(obj);
-
-		ret = nouveau_bo_validate(nvbo, true, false);
-		if (ret)
-			goto err_exec_fini;
+	ret = drm_gpuva_manager_validate(&uvmm->umgr);
+	if (ret) {
+		drm_gpuva_manager_unlock(&uvmm->umgr);
+		return ret;
 	}
 
 	return 0;
-
-err_uvmm_unlock:
-	nouveau_uvmm_unlock(uvmm);
-err_exec_fini:
-	drm_exec_fini(exec);
-	return ret;
-
 }
 
 static void
 nouveau_exec_job_armed_submit(struct nouveau_job *job)
 {
-	struct drm_exec *exec = &job->exec;
-	struct drm_gem_object *obj;
-	unsigned long index;
-
-	drm_exec_for_each_locked_object(exec, index, obj)
-		dma_resv_add_fence(obj->resv, job->done_fence, job->resv_usage);
+	struct nouveau_uvmm *uvmm = nouveau_cli_uvmm(job->cli);
 
-	drm_exec_fini(exec);
+	drm_gpuva_manager_resv_add_fence(&uvmm->umgr, job->done_fence,
+					 job->resv_usage, job->resv_usage);
+	drm_gpuva_manager_unlock(&uvmm->umgr);
 }
 
 static struct dma_fence *
diff --git a/drivers/gpu/drm/nouveau/nouveau_gem.c b/drivers/gpu/drm/nouveau/nouveau_gem.c
index f39360870c70..dec34a88f8b2 100644
--- a/drivers/gpu/drm/nouveau/nouveau_gem.c
+++ b/drivers/gpu/drm/nouveau/nouveau_gem.c
@@ -111,7 +111,7 @@ nouveau_gem_object_open(struct drm_gem_object *gem, struct drm_file *file_priv)
 	if (vmm->vmm.object.oclass < NVIF_CLASS_VMM_NV50)
 		return 0;
 
-	if (nvbo->no_share && uvmm && &uvmm->resv != nvbo->bo.base.resv)
+	if (nvbo->no_share && uvmm && uvmm->umgr.resv != nvbo->bo.base.resv)
 		return -EPERM;
 
 	ret = ttm_bo_reserve(&nvbo->bo, false, false, NULL);
@@ -245,7 +245,7 @@ nouveau_gem_new(struct nouveau_cli *cli, u64 size, int align, uint32_t domain,
 		if (unlikely(!uvmm))
 			return -EINVAL;
 
-		resv = &uvmm->resv;
+		resv = uvmm->umgr.resv;
 	}
 
 	if (!(domain & (NOUVEAU_GEM_DOMAIN_VRAM | NOUVEAU_GEM_DOMAIN_GART)))
diff --git a/drivers/gpu/drm/nouveau/nouveau_sched.h b/drivers/gpu/drm/nouveau/nouveau_sched.h
index 27ac19792597..ccedc80685b3 100644
--- a/drivers/gpu/drm/nouveau/nouveau_sched.h
+++ b/drivers/gpu/drm/nouveau/nouveau_sched.h
@@ -5,7 +5,6 @@
 
 #include <linux/types.h>
 
-#include <drm/drm_exec.h>
 #include <drm/gpu_scheduler.h>
 
 #include "nouveau_drv.h"
@@ -54,7 +53,6 @@ struct nouveau_job {
 	struct drm_file *file_priv;
 	struct nouveau_cli *cli;
 
-	struct drm_exec exec;
 	enum dma_resv_usage resv_usage;
 	struct dma_fence *done_fence;
 
diff --git a/drivers/gpu/drm/nouveau/nouveau_uvmm.c b/drivers/gpu/drm/nouveau/nouveau_uvmm.c
index 3a1e8538f205..ce1975cca8a9 100644
--- a/drivers/gpu/drm/nouveau/nouveau_uvmm.c
+++ b/drivers/gpu/drm/nouveau/nouveau_uvmm.c
@@ -71,6 +71,7 @@ struct bind_job_op {
 		u32 handle;
 		u64 offset;
 		struct drm_gem_object *obj;
+		struct drm_gpuva_gem *vm_bo;
 	} gem;
 
 	struct nouveau_uvma_region *reg;
@@ -436,8 +437,10 @@ nouveau_uvma_region_complete(struct nouveau_uvma_region *reg)
 static void
 op_map_prepare_unwind(struct nouveau_uvma *uvma)
 {
+	struct drm_gpuva *va = &uvma->va;
 	nouveau_uvma_gem_put(uvma);
-	drm_gpuva_remove(&uvma->va);
+	drm_gpuva_remove(va);
+	drm_gpuva_extobj_put(va->mgr, va->gem.obj);
 	nouveau_uvma_free(uvma);
 }
 
@@ -445,6 +448,7 @@ static void
 op_unmap_prepare_unwind(struct drm_gpuva *va)
 {
 	drm_gpuva_insert(va->mgr, va);
+	drm_gpuva_extobj_get(va->mgr, va->gem.obj);
 }
 
 static void
@@ -466,14 +470,17 @@ nouveau_uvmm_sm_prepare_unwind(struct nouveau_uvmm *uvmm,
 			break;
 		case DRM_GPUVA_OP_REMAP: {
 			struct drm_gpuva_op_remap *r = &op->remap;
+			struct drm_gpuva *va = r->unmap->va;
 
+			drm_gpuva_extobj_get(va->mgr, va->gem.obj);
 			if (r->next)
 				op_map_prepare_unwind(new->next);
 
 			if (r->prev)
 				op_map_prepare_unwind(new->prev);
 
-			op_unmap_prepare_unwind(r->unmap->va);
+			op_unmap_prepare_unwind(va);
+			drm_gpuva_extobj_put(va->mgr, va->gem.obj);
 			break;
 		}
 		case DRM_GPUVA_OP_UNMAP:
@@ -589,7 +596,7 @@ op_map_prepare(struct nouveau_uvmm *uvmm,
 	uvma->region = args->region;
 	uvma->kind = args->kind;
 
-	drm_gpuva_map(&uvmm->umgr, &uvma->va, op);
+	drm_gpuva_map_get(&uvmm->umgr, &uvma->va, op);
 
 	/* Keep a reference until this uvma is destroyed. */
 	nouveau_uvma_gem_get(uvma);
@@ -601,7 +608,7 @@ op_map_prepare(struct nouveau_uvmm *uvmm,
 static void
 op_unmap_prepare(struct drm_gpuva_op_unmap *u)
 {
-	drm_gpuva_unmap(u);
+	drm_gpuva_unmap_put(u);
 }
 
 static int
@@ -632,6 +639,7 @@ nouveau_uvmm_sm_prepare(struct nouveau_uvmm *uvmm,
 					goto unwind;
 				}
 			}
+
 			break;
 		}
 		case DRM_GPUVA_OP_REMAP: {
@@ -644,6 +652,7 @@ nouveau_uvmm_sm_prepare(struct nouveau_uvmm *uvmm,
 			u64 urange = va->va.range;
 			u64 uend = ustart + urange;
 
+			drm_gpuva_extobj_get(va->mgr, va->gem.obj);
 			op_unmap_prepare(r->unmap);
 
 			if (r->prev) {
@@ -668,6 +677,7 @@ nouveau_uvmm_sm_prepare(struct nouveau_uvmm *uvmm,
 				if (args)
 					vmm_get_end = ustart;
 			}
+			drm_gpuva_extobj_put(va->mgr, va->gem.obj);
 
 			if (args && (r->prev && r->next))
 				vmm_get_start = vmm_get_end = 0;
@@ -1112,22 +1122,34 @@ bind_validate_region(struct nouveau_job *job)
 }
 
 static void
-bind_link_gpuvas(struct drm_gpuva_ops *ops, struct nouveau_uvma_prealloc *new)
+bind_link_gpuvas(struct bind_job_op *bop)
 {
+	struct nouveau_uvma_prealloc *new = &bop->new;
+	struct drm_gpuva_gem *vm_bo = bop->gem.vm_bo;
+	struct drm_gpuva_ops *ops = bop->ops;
 	struct drm_gpuva_op *op;
 
 	drm_gpuva_for_each_op(op, ops) {
 		switch (op->op) {
 		case DRM_GPUVA_OP_MAP:
-			drm_gpuva_link(&new->map->va);
+			drm_gpuva_link(&new->map->va, vm_bo);
 			break;
-		case DRM_GPUVA_OP_REMAP:
+		case DRM_GPUVA_OP_REMAP: {
+			struct drm_gpuva *va = op->remap.unmap->va;
+			struct drm_gpuva_gem *vm_bo;
+
+			vm_bo = drm_gpuva_gem_find(va->mgr, va->gem.obj);
+			BUG_ON(!vm_bo);
+
 			if (op->remap.prev)
-				drm_gpuva_link(&new->prev->va);
+				drm_gpuva_link(&new->prev->va, vm_bo);
 			if (op->remap.next)
-				drm_gpuva_link(&new->next->va);
-			drm_gpuva_unlink(op->remap.unmap->va);
+				drm_gpuva_link(&new->next->va, vm_bo);
+			drm_gpuva_unlink(va);
+
+			drm_gpuva_gem_put(vm_bo);
 			break;
+		}
 		case DRM_GPUVA_OP_UNMAP:
 			drm_gpuva_unlink(op->unmap.va);
 			break;
@@ -1137,22 +1159,72 @@ bind_link_gpuvas(struct drm_gpuva_ops *ops, struct nouveau_uvma_prealloc *new)
 	}
 }
 
+static int
+bind_lock_extra(struct drm_gpuva_manager *mgr, void *priv,
+		unsigned int num_fences)
+{
+	struct nouveau_uvmm_bind_job *bind_job = priv;
+	struct bind_job_op *op;
+	int ret;
+
+	list_for_each_op(op, &bind_job->ops) {
+		struct drm_gpuva_op *va_op;
+
+		if (IS_ERR_OR_NULL(op->ops))
+			continue;
+
+		drm_gpuva_for_each_op(va_op, op->ops) {
+			struct drm_gem_object *obj = op_gem_obj(va_op);
+
+			if (unlikely(!obj))
+				continue;
+
+			if (va_op->op != DRM_GPUVA_OP_UNMAP)
+				continue;
+
+			ret = drm_exec_prepare_obj(DRM_GPUVA_EXEC(mgr), obj,
+						   num_fences);
+			if (ret)
+				return ret;
+		}
+	}
+
+	return 0;
+}
+
 static int
 nouveau_uvmm_bind_job_submit(struct nouveau_job *job)
 {
 	struct nouveau_uvmm *uvmm = nouveau_cli_uvmm(job->cli);
 	struct nouveau_uvmm_bind_job *bind_job = to_uvmm_bind_job(job);
 	struct nouveau_sched_entity *entity = job->entity;
-	struct drm_exec *exec = &job->exec;
 	struct bind_job_op *op;
 	int ret;
 
 	list_for_each_op(op, &bind_job->ops) {
 		if (op->op == OP_MAP) {
-			op->gem.obj = drm_gem_object_lookup(job->file_priv,
-							    op->gem.handle);
-			if (!op->gem.obj)
+			struct drm_gem_object *obj;
+
+			obj = drm_gem_object_lookup(job->file_priv,
+						    op->gem.handle);
+			if (!obj)
 				return -ENOENT;
+
+			dma_resv_lock(obj->resv, NULL);
+			op->gem.vm_bo = drm_gpuva_gem_obtain(&uvmm->umgr, obj);
+			dma_resv_unlock(obj->resv);
+			if (IS_ERR(op->gem.vm_bo)) {
+				drm_gem_object_put(obj);
+				return PTR_ERR(op->gem.vm_bo);
+			}
+
+			ret = drm_gpuva_extobj_insert(&uvmm->umgr, obj);
+			if (ret) {
+				drm_gem_object_put(obj);
+				return ret;
+			}
+
+			op->gem.obj = obj;
 		}
 
 		ret = bind_validate_op(job, op);
@@ -1286,30 +1358,10 @@ nouveau_uvmm_bind_job_submit(struct nouveau_job *job)
 		}
 	}
 
-	drm_exec_init(exec, DRM_EXEC_INTERRUPTIBLE_WAIT |
-			    DRM_EXEC_IGNORE_DUPLICATES);
-	drm_exec_until_all_locked(exec) {
-		list_for_each_op(op, &bind_job->ops) {
-			struct drm_gpuva_op *va_op;
-
-			if (IS_ERR_OR_NULL(op->ops))
-				continue;
-
-			drm_gpuva_for_each_op(va_op, op->ops) {
-				struct drm_gem_object *obj = op_gem_obj(va_op);
-
-				if (unlikely(!obj))
-					continue;
-
-				ret = drm_exec_prepare_obj(exec, obj, 1);
-				drm_exec_retry_on_contention(exec);
-				if (ret) {
-					op = list_last_op(&bind_job->ops);
-					goto unwind;
-				}
-			}
-		}
-	}
+	ret = drm_gpuva_manager_lock_extra(&uvmm->umgr, bind_lock_extra,
+					   bind_job, 1, false);
+	if (ret)
+		goto unwind_continue;
 
 	list_for_each_op(op, &bind_job->ops) {
 		struct drm_gpuva_op *va_op;
@@ -1363,7 +1415,7 @@ nouveau_uvmm_bind_job_submit(struct nouveau_job *job)
 		case OP_UNMAP_SPARSE:
 		case OP_MAP:
 		case OP_UNMAP:
-			bind_link_gpuvas(op->ops, &op->new);
+			bind_link_gpuvas(op);
 			break;
 		default:
 			break;
@@ -1409,21 +1461,18 @@ nouveau_uvmm_bind_job_submit(struct nouveau_job *job)
 	}
 
 	nouveau_uvmm_unlock(uvmm);
-	drm_exec_fini(exec);
+	drm_gpuva_manager_unlock(&uvmm->umgr);
 	return ret;
 }
 
 static void
 nouveau_uvmm_bind_job_armed_submit(struct nouveau_job *job)
 {
-	struct drm_exec *exec = &job->exec;
-	struct drm_gem_object *obj;
-	unsigned long index;
-
-	drm_exec_for_each_locked_object(exec, index, obj)
-		dma_resv_add_fence(obj->resv, job->done_fence, job->resv_usage);
+	struct nouveau_uvmm *uvmm = nouveau_cli_uvmm(job->cli);
 
-	drm_exec_fini(exec);
+	drm_gpuva_manager_resv_add_fence(&uvmm->umgr, job->done_fence,
+					 job->resv_usage, job->resv_usage);
+	drm_gpuva_manager_unlock(&uvmm->umgr);
 }
 
 static struct dma_fence *
@@ -1510,8 +1559,16 @@ nouveau_uvmm_bind_job_free_work_fn(struct work_struct *work)
 		if (!IS_ERR_OR_NULL(op->ops))
 			drm_gpuva_ops_free(&uvmm->umgr, op->ops);
 
-		if (obj)
+		if (!IS_ERR_OR_NULL(op->gem.vm_bo)) {
+			dma_resv_lock(obj->resv, NULL);
+			drm_gpuva_gem_put(op->gem.vm_bo);
+			dma_resv_unlock(obj->resv);
+		}
+
+		if (obj) {
+			drm_gpuva_extobj_put(&uvmm->umgr, obj);
 			drm_gem_object_put(obj);
+		}
 	}
 
 	spin_lock(&entity->job.list.lock);
@@ -1775,15 +1832,18 @@ void
 nouveau_uvmm_bo_map_all(struct nouveau_bo *nvbo, struct nouveau_mem *mem)
 {
 	struct drm_gem_object *obj = &nvbo->bo.base;
+	struct drm_gpuva_gem *vm_bo;
 	struct drm_gpuva *va;
 
 	dma_resv_assert_held(obj->resv);
 
-	drm_gem_for_each_gpuva(va, obj) {
-		struct nouveau_uvma *uvma = uvma_from_va(va);
+	drm_gem_for_each_gpuva_gem(vm_bo, obj) {
+		drm_gpuva_gem_for_each_va(va, vm_bo) {
+			struct nouveau_uvma *uvma = uvma_from_va(va);
 
-		nouveau_uvma_map(uvma, mem);
-		drm_gpuva_invalidate(va, false);
+			nouveau_uvma_map(uvma, mem);
+			drm_gpuva_invalidate(va, false);
+		}
 	}
 }
 
@@ -1791,18 +1851,33 @@ void
 nouveau_uvmm_bo_unmap_all(struct nouveau_bo *nvbo)
 {
 	struct drm_gem_object *obj = &nvbo->bo.base;
+	struct drm_gpuva_gem *vm_bo;
 	struct drm_gpuva *va;
 
 	dma_resv_assert_held(obj->resv);
 
-	drm_gem_for_each_gpuva(va, obj) {
-		struct nouveau_uvma *uvma = uvma_from_va(va);
+	drm_gem_for_each_gpuva_gem(vm_bo, obj) {
+		drm_gpuva_gem_for_each_va(va, vm_bo) {
+			struct nouveau_uvma *uvma = uvma_from_va(va);
 
-		nouveau_uvma_unmap(uvma);
-		drm_gpuva_invalidate(va, true);
+			nouveau_uvma_unmap(uvma);
+			drm_gpuva_invalidate(va, true);
+		}
 	}
 }
 
+static int
+nouveau_uvmm_bo_validate(struct drm_gem_object *obj)
+{
+	struct nouveau_bo *nvbo = nouveau_gem_object(obj);
+
+	return nouveau_bo_validate(nvbo, true, false);
+}
+
+static const struct drm_gpuva_fn_ops nouveau_uvmm_gpuva_ops = {
+	.bo_validate = nouveau_uvmm_bo_validate,
+};
+
 int
 nouveau_uvmm_init(struct nouveau_uvmm *uvmm, struct nouveau_cli *cli,
 		  u64 kernel_managed_addr, u64 kernel_managed_size)
@@ -1835,11 +1910,11 @@ nouveau_uvmm_init(struct nouveau_uvmm *uvmm, struct nouveau_cli *cli,
 	uvmm->kernel_managed_addr = kernel_managed_addr;
 	uvmm->kernel_managed_size = kernel_managed_size;
 
-	drm_gpuva_manager_init(&uvmm->umgr, cli->name,
+	drm_gpuva_manager_init(&uvmm->umgr, cli->drm->dev, cli->name,
 			       NOUVEAU_VA_SPACE_START,
 			       NOUVEAU_VA_SPACE_END,
 			       kernel_managed_addr, kernel_managed_size,
-			       NULL);
+			       &nouveau_uvmm_gpuva_ops);
 
 	ret = nvif_vmm_ctor(&cli->mmu, "uvmm",
 			    cli->vmm.vmm.object.oclass, RAW,
-- 
2.41.0


^ permalink raw reply related	[flat|nested] 88+ messages in thread
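For reference, the conversion in this patch boils down to the pattern below. This is a condensed sketch of the exec submit paths as they look after the series (fence allocation, error unwinding and the bind-specific lock_extra callback are omitted); it is not additional code on top of the patch.

  /* Condensed sketch: lock, validate, run, then add the done fence and unlock. */
  static int nouveau_exec_job_submit(struct nouveau_job *job)	/* condensed */
  {
  	struct nouveau_uvmm *uvmm = nouveau_cli_uvmm(job->cli);
  	int ret;

  	nouveau_uvmm_lock(uvmm);
  	/* Locks the VM's common dma-resv plus the resvs of all external GEM objects. */
  	ret = drm_gpuva_manager_lock(&uvmm->umgr, 1, false);
  	nouveau_uvmm_unlock(uvmm);
  	if (ret)
  		return ret;

  	/* Calls back into ops->bo_validate() for every evicted GEM object. */
  	ret = drm_gpuva_manager_validate(&uvmm->umgr);
  	if (ret) {
  		drm_gpuva_manager_unlock(&uvmm->umgr);
  		return ret;
  	}

  	return 0;
  }

  static void nouveau_exec_job_armed_submit(struct nouveau_job *job)	/* condensed */
  {
  	struct nouveau_uvmm *uvmm = nouveau_cli_uvmm(job->cli);

  	/* Attach the job's done fence to every locked dma-resv, then unlock. */
  	drm_gpuva_manager_resv_add_fence(&uvmm->umgr, job->done_fence,
  					 job->resv_usage, job->resv_usage);
  	drm_gpuva_manager_unlock(&uvmm->umgr);
  }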

* Re: [PATCH drm-misc-next 1/3] drm: drm_exec: build always builtin
  2023-08-20 21:53   ` Danilo Krummrich
  (?)
@ 2023-08-21  9:49     ` Christian König
  -1 siblings, 0 replies; 88+ messages in thread
From: Christian König @ 2023-08-21  9:49 UTC (permalink / raw)
  To: Danilo Krummrich, airlied, daniel, matthew.brost,
	thomas.hellstrom, sarah.walker, donald.robson, boris.brezillon,
	faith.ekstrand, bskeggs, Liam.Howlett
  Cc: dri-devel, nouveau, linux-kernel

On 8/20/23 23:53, Danilo Krummrich wrote:
> drm_exec must always be builtin for the DRM GPUVA manager to depend on
> it.

You should probably go the other way around and not always build in the 
GPUVA manager.

We have intentionally and with quite a bit of work moved the DRM_EXEC 
and DRM_BUDDY into separate modules.

Regards,
Christian.

>
> Signed-off-by: Danilo Krummrich <dakr@redhat.com>
> ---
>   drivers/gpu/drm/Kconfig         | 6 ------
>   drivers/gpu/drm/Makefile        | 3 +--
>   drivers/gpu/drm/nouveau/Kconfig | 1 -
>   3 files changed, 1 insertion(+), 9 deletions(-)
>
> diff --git a/drivers/gpu/drm/Kconfig b/drivers/gpu/drm/Kconfig
> index ab9ef1c20349..85122d4bb1e7 100644
> --- a/drivers/gpu/drm/Kconfig
> +++ b/drivers/gpu/drm/Kconfig
> @@ -210,12 +210,6 @@ config DRM_TTM_KUNIT_TEST
>   
>             If in doubt, say "N".
>   
> -config DRM_EXEC
> -	tristate
> -	depends on DRM
> -	help
> -	  Execution context for command submissions
> -
>   config DRM_BUDDY
>   	tristate
>   	depends on DRM
> diff --git a/drivers/gpu/drm/Makefile b/drivers/gpu/drm/Makefile
> index 215e78e79125..388e0964a875 100644
> --- a/drivers/gpu/drm/Makefile
> +++ b/drivers/gpu/drm/Makefile
> @@ -23,6 +23,7 @@ drm-y := \
>   	drm_dumb_buffers.o \
>   	drm_edid.o \
>   	drm_encoder.o \
> +	drm_exec.o \
>   	drm_file.o \
>   	drm_fourcc.o \
>   	drm_framebuffer.o \
> @@ -80,8 +81,6 @@ obj-$(CONFIG_DRM_PANEL_ORIENTATION_QUIRKS) += drm_panel_orientation_quirks.o
>   # Memory-management helpers
>   #
>   #
> -obj-$(CONFIG_DRM_EXEC) += drm_exec.o
> -
>   obj-$(CONFIG_DRM_BUDDY) += drm_buddy.o
>   
>   drm_dma_helper-y := drm_gem_dma_helper.o
> diff --git a/drivers/gpu/drm/nouveau/Kconfig b/drivers/gpu/drm/nouveau/Kconfig
> index c52e8096cca4..2dddedac125b 100644
> --- a/drivers/gpu/drm/nouveau/Kconfig
> +++ b/drivers/gpu/drm/nouveau/Kconfig
> @@ -10,7 +10,6 @@ config DRM_NOUVEAU
>   	select DRM_KMS_HELPER
>   	select DRM_TTM
>   	select DRM_TTM_HELPER
> -	select DRM_EXEC
>   	select DRM_SCHED
>   	select I2C
>   	select I2C_ALGOBIT


^ permalink raw reply	[flat|nested] 88+ messages in thread
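A rough sketch of the alternative suggested here: leave DRM_EXEC as the separate tristate it already is and make the GPUVA manager itself an optional module that selects it, instead of pulling drm_exec.o into the drm core. The DRM_GPUVA_MGR symbol and the Makefile line are illustrative assumptions; neither exists in this series or in the tree at this point.

  # drivers/gpu/drm/Kconfig: DRM_EXEC stays as it is today
  config DRM_EXEC
  	tristate
  	depends on DRM
  	help
  	  Execution context for command submissions

  # Hypothetical new symbol for an opt-in GPUVA manager module
  config DRM_GPUVA_MGR
  	tristate
  	depends on DRM
  	select DRM_EXEC
  	help
  	  GPU VA space tracking and GPU-VM helpers

  # drivers/gpu/drm/Makefile
  obj-$(CONFIG_DRM_GPUVA_MGR) += drm_gpuva_mgr.o

  # drivers/gpu/drm/nouveau/Kconfig (fragment): drivers opt in explicitly
  config DRM_NOUVEAU
  	select DRM_GPUVA_MGR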

* Re: [Nouveau] [PATCH drm-misc-next 1/3] drm: drm_exec: build always builtin
  2023-08-21  9:49     ` [Nouveau] " Christian König
  (?)
@ 2023-08-21 19:14       ` Danilo Krummrich
  -1 siblings, 0 replies; 88+ messages in thread
From: Danilo Krummrich @ 2023-08-21 19:14 UTC (permalink / raw)
  To: Christian König, airlied, daniel, matthew.brost,
	thomas.hellstrom, sarah.walker, donald.robson, boris.brezillon,
	faith.ekstrand, bskeggs, Liam.Howlett
  Cc: nouveau, linux-kernel, dri-devel

On 8/21/23 11:49, Christian König wrote:
> On 8/20/23 23:53, Danilo Krummrich wrote:
>> drm_exec must always be builtin for the DRM GPUVA manager to depend on
>> it.
> 
> You should probably go the other way around and not always build in the 
> GPUVA manager.

Yes, I think that's reasonable. Currently, I don't see any core 
dependencies preventing that.

> 
> We have intentionally and with quite a bit of work moved the DRM_EXEC 
> and DRM_BUDDY into separate modules.
> 
> Regards,
> Christian.
> 
>>
>> Signed-off-by: Danilo Krummrich <dakr@redhat.com>
>> ---
>>   drivers/gpu/drm/Kconfig         | 6 ------
>>   drivers/gpu/drm/Makefile        | 3 +--
>>   drivers/gpu/drm/nouveau/Kconfig | 1 -
>>   3 files changed, 1 insertion(+), 9 deletions(-)
>>
>> diff --git a/drivers/gpu/drm/Kconfig b/drivers/gpu/drm/Kconfig
>> index ab9ef1c20349..85122d4bb1e7 100644
>> --- a/drivers/gpu/drm/Kconfig
>> +++ b/drivers/gpu/drm/Kconfig
>> @@ -210,12 +210,6 @@ config DRM_TTM_KUNIT_TEST
>>             If in doubt, say "N".
>> -config DRM_EXEC
>> -    tristate
>> -    depends on DRM
>> -    help
>> -      Execution context for command submissions
>> -
>>   config DRM_BUDDY
>>       tristate
>>       depends on DRM
>> diff --git a/drivers/gpu/drm/Makefile b/drivers/gpu/drm/Makefile
>> index 215e78e79125..388e0964a875 100644
>> --- a/drivers/gpu/drm/Makefile
>> +++ b/drivers/gpu/drm/Makefile
>> @@ -23,6 +23,7 @@ drm-y := \
>>       drm_dumb_buffers.o \
>>       drm_edid.o \
>>       drm_encoder.o \
>> +    drm_exec.o \
>>       drm_file.o \
>>       drm_fourcc.o \
>>       drm_framebuffer.o \
>> @@ -80,8 +81,6 @@ obj-$(CONFIG_DRM_PANEL_ORIENTATION_QUIRKS) += 
>> drm_panel_orientation_quirks.o
>>   # Memory-management helpers
>>   #
>>   #
>> -obj-$(CONFIG_DRM_EXEC) += drm_exec.o
>> -
>>   obj-$(CONFIG_DRM_BUDDY) += drm_buddy.o
>>   drm_dma_helper-y := drm_gem_dma_helper.o
>> diff --git a/drivers/gpu/drm/nouveau/Kconfig 
>> b/drivers/gpu/drm/nouveau/Kconfig
>> index c52e8096cca4..2dddedac125b 100644
>> --- a/drivers/gpu/drm/nouveau/Kconfig
>> +++ b/drivers/gpu/drm/nouveau/Kconfig
>> @@ -10,7 +10,6 @@ config DRM_NOUVEAU
>>       select DRM_KMS_HELPER
>>       select DRM_TTM
>>       select DRM_TTM_HELPER
>> -    select DRM_EXEC
>>       select DRM_SCHED
>>       select I2C
>>       select I2C_ALGOBIT
> 


^ permalink raw reply	[flat|nested] 88+ messages in thread

* Re: [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
  2023-08-20 21:53   ` Danilo Krummrich
  (?)
@ 2023-08-22  1:31     ` kernel test robot
  -1 siblings, 0 replies; 88+ messages in thread
From: kernel test robot @ 2023-08-22  1:31 UTC (permalink / raw)
  To: Danilo Krummrich, airlied, daniel, matthew.brost,
	thomas.hellstrom, sarah.walker, donald.robson, boris.brezillon,
	christian.koenig, faith.ekstrand, bskeggs, Liam.Howlett
  Cc: llvm, oe-kbuild-all, nouveau, Danilo Krummrich, linux-kernel, dri-devel

Hi Danilo,

kernel test robot noticed the following build warnings:

[auto build test WARNING on 25205087df1ffe06ccea9302944ed1f77dc68c6f]

url:    https://github.com/intel-lab-lkp/linux/commits/Danilo-Krummrich/drm-drm_exec-build-always-builtin/20230821-123143
base:   25205087df1ffe06ccea9302944ed1f77dc68c6f
patch link:    https://lore.kernel.org/r/20230820215320.4187-3-dakr%40redhat.com
patch subject: [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
config: i386-randconfig-r024-20230822 (https://download.01.org/0day-ci/archive/20230822/202308220935.ik8QPkf4-lkp@intel.com/config)
compiler: clang version 16.0.4 (https://github.com/llvm/llvm-project.git ae42196bc493ffe877a7e3dff8be32035dea4d07)
reproduce: (https://download.01.org/0day-ci/archive/20230822/202308220935.ik8QPkf4-lkp@intel.com/reproduce)

If you fix the issue in a separate patch/commit (i.e. not just a new version of
the same patch/commit), kindly add following tags
| Reported-by: kernel test robot <lkp@intel.com>
| Closes: https://lore.kernel.org/oe-kbuild-all/202308220935.ik8QPkf4-lkp@intel.com/

All warnings (new ones prefixed by >>):

>> drivers/gpu/drm/drm_gpuva_mgr.c:750:1: warning: no previous prototype for function 'drm_gpuva_manager_prepare_objects' [-Wmissing-prototypes]
   drm_gpuva_manager_prepare_objects(struct drm_gpuva_manager *mgr,
   ^
   drivers/gpu/drm/drm_gpuva_mgr.c:749:1: note: declare 'static' if the function is not intended to be used outside of this translation unit
   int
   ^
   static 
   drivers/gpu/drm/drm_gpuva_mgr.c:1744:32: warning: variable 'prev' set but not used [-Wunused-but-set-variable]
           struct drm_gpuva *va, *next, *prev = NULL;
                                         ^
   2 warnings generated.
--
>> drivers/gpu/drm/drm_gpuva_mgr.c:1091: warning: Function parameter or member '__vm_bo' not described in 'drm_gpuva_gem_obtain_prealloc'


vim +/drm_gpuva_manager_prepare_objects +750 drivers/gpu/drm/drm_gpuva_mgr.c

   734	
   735	/**
   736	 * drm_gpuva_manager_prepare_objects() - prepare all assoiciated BOs
   737	 * @mgr: the &drm_gpuva_manager
   738	 * @num_fences: the amount of &dma_fences to reserve
   739	 *
   740	 * Calls drm_exec_prepare_obj() for all &drm_gem_objects the given
   741	 * &drm_gpuva_manager contains mappings of.
   742	 *
   743	 * Drivers can obtain the corresponding &drm_exec instance through
   744	 * DRM_GPUVA_EXEC(). It is the drivers responsibility to call drm_exec_init()
   745	 * and drm_exec_fini() accordingly.
   746	 *
   747	 * Returns: 0 on success, negative error code on failure.
   748	 */
   749	int
 > 750	drm_gpuva_manager_prepare_objects(struct drm_gpuva_manager *mgr,
   751					  unsigned int num_fences)
   752	{
   753		struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
   754		MA_STATE(mas, &mgr->mt_ext, 0, 0);
   755		union {
   756			void *ptr;
   757			uintptr_t cnt;
   758		} ref;
   759		int ret;
   760	
   761		ret = drm_exec_prepare_obj(exec, &mgr->d_obj, num_fences);
   762		if (ret)
   763			goto out;
   764	
   765		rcu_read_lock();
   766		mas_for_each(&mas, ref.ptr, ULONG_MAX) {
   767			struct drm_gem_object *obj;
   768	
   769			mas_pause(&mas);
   770			rcu_read_unlock();
   771	
   772			obj = (struct drm_gem_object *)(uintptr_t)mas.index;
   773			ret = drm_exec_prepare_obj(exec, obj, num_fences);
   774			if (ret)
   775				goto out;
   776	
   777			rcu_read_lock();
   778		}
   779		rcu_read_unlock();
   780	
   781	out:
   782		return ret;
   783	}
   784	EXPORT_SYMBOL_GPL(drm_gpuva_manager_prepare_objects);
   785	

-- 
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki

^ permalink raw reply	[flat|nested] 88+ messages in thread
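drm_gpuva_manager_prepare_objects() is exported via EXPORT_SYMBOL_GPL() and documented for driver use, so the -Wmissing-prototypes warning points at a missing declaration rather than a missing 'static'. A minimal sketch of the likely fix, assuming the declaration belongs in include/drm/drm_gpuva_mgr.h next to the other manager helpers:

  /* include/drm/drm_gpuva_mgr.h; the exact placement is an assumption */
  int drm_gpuva_manager_prepare_objects(struct drm_gpuva_manager *mgr,
  				       unsigned int num_fences);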

* Re: [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
  2023-08-20 21:53   ` Danilo Krummrich
  (?)
@ 2023-08-22  2:18     ` kernel test robot
  -1 siblings, 0 replies; 88+ messages in thread
From: kernel test robot @ 2023-08-22  2:18 UTC (permalink / raw)
  To: Danilo Krummrich, airlied, daniel, matthew.brost,
	thomas.hellstrom, sarah.walker, donald.robson, boris.brezillon,
	christian.koenig, faith.ekstrand, bskeggs, Liam.Howlett
  Cc: oe-kbuild-all, nouveau, Danilo Krummrich, linux-kernel, dri-devel

Hi Danilo,

kernel test robot noticed the following build warnings:

[auto build test WARNING on 25205087df1ffe06ccea9302944ed1f77dc68c6f]

url:    https://github.com/intel-lab-lkp/linux/commits/Danilo-Krummrich/drm-drm_exec-build-always-builtin/20230821-123143
base:   25205087df1ffe06ccea9302944ed1f77dc68c6f
patch link:    https://lore.kernel.org/r/20230820215320.4187-3-dakr%40redhat.com
patch subject: [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
config: sparc-randconfig-r022-20230822 (https://download.01.org/0day-ci/archive/20230822/202308221021.jCZejWoy-lkp@intel.com/config)
compiler: sparc64-linux-gcc (GCC) 12.3.0
reproduce: (https://download.01.org/0day-ci/archive/20230822/202308221021.jCZejWoy-lkp@intel.com/reproduce)

If you fix the issue in a separate patch/commit (i.e. not just a new version of
the same patch/commit), kindly add following tags
| Reported-by: kernel test robot <lkp@intel.com>
| Closes: https://lore.kernel.org/oe-kbuild-all/202308221021.jCZejWoy-lkp@intel.com/

All warnings (new ones prefixed by >>):

>> drivers/gpu/drm/drm_gpuva_mgr.c:750:1: warning: no previous prototype for 'drm_gpuva_manager_prepare_objects' [-Wmissing-prototypes]
     750 | drm_gpuva_manager_prepare_objects(struct drm_gpuva_manager *mgr,
         | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
   drivers/gpu/drm/drm_gpuva_mgr.c: In function '__drm_gpuva_sm_map':
   drivers/gpu/drm/drm_gpuva_mgr.c:1744:39: warning: variable 'prev' set but not used [-Wunused-but-set-variable]
    1744 |         struct drm_gpuva *va, *next, *prev = NULL;
         |                                       ^~~~


vim +/drm_gpuva_manager_prepare_objects +750 drivers/gpu/drm/drm_gpuva_mgr.c

   734	
   735	/**
   736	 * drm_gpuva_manager_prepare_objects() - prepare all assoiciated BOs
   737	 * @mgr: the &drm_gpuva_manager
   738	 * @num_fences: the amount of &dma_fences to reserve
   739	 *
   740	 * Calls drm_exec_prepare_obj() for all &drm_gem_objects the given
   741	 * &drm_gpuva_manager contains mappings of.
   742	 *
   743	 * Drivers can obtain the corresponding &drm_exec instance through
   744	 * DRM_GPUVA_EXEC(). It is the drivers responsibility to call drm_exec_init()
   745	 * and drm_exec_fini() accordingly.
   746	 *
   747	 * Returns: 0 on success, negative error code on failure.
   748	 */
   749	int
 > 750	drm_gpuva_manager_prepare_objects(struct drm_gpuva_manager *mgr,
   751					  unsigned int num_fences)
   752	{
   753		struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
   754		MA_STATE(mas, &mgr->mt_ext, 0, 0);
   755		union {
   756			void *ptr;
   757			uintptr_t cnt;
   758		} ref;
   759		int ret;
   760	
   761		ret = drm_exec_prepare_obj(exec, &mgr->d_obj, num_fences);
   762		if (ret)
   763			goto out;
   764	
   765		rcu_read_lock();
   766		mas_for_each(&mas, ref.ptr, ULONG_MAX) {
   767			struct drm_gem_object *obj;
   768	
   769			mas_pause(&mas);
   770			rcu_read_unlock();
   771	
   772			obj = (struct drm_gem_object *)(uintptr_t)mas.index;
   773			ret = drm_exec_prepare_obj(exec, obj, num_fences);
   774			if (ret)
   775				goto out;
   776	
   777			rcu_read_lock();
   778		}
   779		rcu_read_unlock();
   780	
   781	out:
   782		return ret;
   783	}
   784	EXPORT_SYMBOL_GPL(drm_gpuva_manager_prepare_objects);
   785	

-- 
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki

^ permalink raw reply	[flat|nested] 88+ messages in thread
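gcc flags the same two issues as clang above. For the second one, the set-but-unused 'prev' in __drm_gpuva_sm_map(), the snippet below only illustrates the general pattern the warning complains about; it is not the actual function body, which the report does not quote. The fix is either to drop the variable or to add the code that was meant to consume it.

  /* Illustrative only; not taken from drm_gpuva_mgr.c. */
  int count_values(const int *vals, int n)
  {
  	int i, prev = 0;

  	for (i = 0; i < n; i++)
  		prev = vals[i];	/* assigned on every iteration ... */

  	/* ... but never read again: -Wunused-but-set-variable */
  	return n;
  }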

* Re: [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
  2023-08-20 21:53   ` Danilo Krummrich
@ 2023-08-22  3:01     ` kernel test robot
  -1 siblings, 0 replies; 88+ messages in thread
From: kernel test robot @ 2023-08-22  3:01 UTC (permalink / raw)
  To: Danilo Krummrich, airlied, daniel, matthew.brost,
	thomas.hellstrom, sarah.walker, donald.robson, boris.brezillon,
	christian.koenig, faith.ekstrand, bskeggs, Liam.Howlett
  Cc: oe-kbuild-all, nouveau, Danilo Krummrich, linux-kernel, dri-devel

Hi Danilo,

kernel test robot noticed the following build warnings:

[auto build test WARNING on 25205087df1ffe06ccea9302944ed1f77dc68c6f]

url:    https://github.com/intel-lab-lkp/linux/commits/Danilo-Krummrich/drm-drm_exec-build-always-builtin/20230821-123143
base:   25205087df1ffe06ccea9302944ed1f77dc68c6f
patch link:    https://lore.kernel.org/r/20230820215320.4187-3-dakr%40redhat.com
patch subject: [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
config: arm-randconfig-r014-20230822 (https://download.01.org/0day-ci/archive/20230822/202308221050.kTj8uFMA-lkp@intel.com/config)
compiler: arm-linux-gnueabi-gcc (GCC) 12.3.0
reproduce: (https://download.01.org/0day-ci/archive/20230822/202308221050.kTj8uFMA-lkp@intel.com/reproduce)

If you fix the issue in a separate patch/commit (i.e. not just a new version of
the same patch/commit), kindly add following tags
| Reported-by: kernel test robot <lkp@intel.com>
| Closes: https://lore.kernel.org/oe-kbuild-all/202308221050.kTj8uFMA-lkp@intel.com/

All warnings (new ones prefixed by >>):

>> drivers/gpu/drm/drm_gpuva_mgr.c:750:1: warning: no previous prototype for 'drm_gpuva_manager_prepare_objects' [-Wmissing-prototypes]
     750 | drm_gpuva_manager_prepare_objects(struct drm_gpuva_manager *mgr,
         | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
   drivers/gpu/drm/drm_gpuva_mgr.c: In function '__drm_gpuva_sm_map':
   drivers/gpu/drm/drm_gpuva_mgr.c:1744:39: warning: variable 'prev' set but not used [-Wunused-but-set-variable]
    1744 |         struct drm_gpuva *va, *next, *prev = NULL;
         |                                       ^~~~
--
>> drivers/gpu/drm/drm_gpuva_mgr.c:1091: warning: Function parameter or member '__vm_bo' not described in 'drm_gpuva_gem_obtain_prealloc'


vim +/drm_gpuva_manager_prepare_objects +750 drivers/gpu/drm/drm_gpuva_mgr.c

   734	
   735	/**
   736	 * drm_gpuva_manager_prepare_objects() - prepare all assoiciated BOs
   737	 * @mgr: the &drm_gpuva_manager
   738	 * @num_fences: the amount of &dma_fences to reserve
   739	 *
   740	 * Calls drm_exec_prepare_obj() for all &drm_gem_objects the given
   741	 * &drm_gpuva_manager contains mappings of.
   742	 *
   743	 * Drivers can obtain the corresponding &drm_exec instance through
   744	 * DRM_GPUVA_EXEC(). It is the drivers responsibility to call drm_exec_init()
   745	 * and drm_exec_fini() accordingly.
   746	 *
   747	 * Returns: 0 on success, negative error code on failure.
   748	 */
   749	int
 > 750	drm_gpuva_manager_prepare_objects(struct drm_gpuva_manager *mgr,
   751					  unsigned int num_fences)
   752	{
   753		struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
   754		MA_STATE(mas, &mgr->mt_ext, 0, 0);
   755		union {
   756			void *ptr;
   757			uintptr_t cnt;
   758		} ref;
   759		int ret;
   760	
   761		ret = drm_exec_prepare_obj(exec, &mgr->d_obj, num_fences);
   762		if (ret)
   763			goto out;
   764	
   765		rcu_read_lock();
   766		mas_for_each(&mas, ref.ptr, ULONG_MAX) {
   767			struct drm_gem_object *obj;
   768	
   769			mas_pause(&mas);
   770			rcu_read_unlock();
   771	
   772			obj = (struct drm_gem_object *)(uintptr_t)mas.index;
   773			ret = drm_exec_prepare_obj(exec, obj, num_fences);
   774			if (ret)
   775				goto out;
   776	
   777			rcu_read_lock();
   778		}
   779		rcu_read_unlock();
   780	
   781	out:
   782		return ret;
   783	}
   784	EXPORT_SYMBOL_GPL(drm_gpuva_manager_prepare_objects);
   785	

-- 
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki

^ permalink raw reply	[flat|nested] 88+ messages in thread

* Re: [Nouveau] [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
  2023-08-20 21:53   ` Danilo Krummrich
@ 2023-08-30  7:27     ` Thomas Hellström (Intel)
  -1 siblings, 0 replies; 88+ messages in thread
From: Thomas Hellström (Intel) @ 2023-08-30  7:27 UTC (permalink / raw)
  To: Danilo Krummrich, airlied, daniel, matthew.brost,
	thomas.hellstrom, sarah.walker, donald.robson, boris.brezillon,
	christian.koenig, faith.ekstrand, bskeggs, Liam.Howlett
  Cc: nouveau, linux-kernel, dri-devel

Hi, Danilo.

Some quick comments since I'm doing some Xe work in this area. Will 
probably get back with more.

On 8/20/23 23:53, Danilo Krummrich wrote:
> So far the DRM GPUVA manager offers common infrastructure to track GPU VA
> allocations and mappings, generically connect GPU VA mappings to their
> backing buffers and perform more complex mapping operations on the GPU VA
> space.
>
> However, there are more design patterns commonly used by drivers, which
> can potentially be generalized in order to make the DRM GPUVA manager
> represent a basic GPU-VM implementation. In this context, this patch aims
> at generalizing the following elements.
>
> 1) Provide a common dma-resv for GEM objects not being used outside of
>     this GPU-VM.
>
> 2) Provide tracking of external GEM objects (GEM objects which are
>     shared with other GPU-VMs).
>
> 3) Provide functions to efficiently lock all GEM objects dma-resv the
>     GPU-VM contains mappings of.
>
> 4) Provide tracking of evicted GEM objects the GPU-VM contains mappings
>     of, such that validation of evicted GEM objects is accelerated.
>
> 5) Provide some convinience functions for common patterns.
>
> Rather than being designed as a "framework", the target is to make all
> features appear as a collection of optional helper functions, such that
> drivers are free to make use of the DRM GPUVA managers basic
> functionality and opt-in for other features without setting any feature
> flags, just by making use of the corresponding functions.
>
> Signed-off-by: Danilo Krummrich <dakr@redhat.com>
> ---
>   drivers/gpu/drm/drm_gpuva_mgr.c | 688 +++++++++++++++++++++++++++++++-
>   include/drm/drm_gem.h           |  48 ++-
>   include/drm/drm_gpuva_mgr.h     | 302 +++++++++++++-
>   3 files changed, 1010 insertions(+), 28 deletions(-)
>
> diff --git a/drivers/gpu/drm/drm_gpuva_mgr.c b/drivers/gpu/drm/drm_gpuva_mgr.c
> index f86bfad74ff8..69872b205961 100644
> --- a/drivers/gpu/drm/drm_gpuva_mgr.c
> +++ b/drivers/gpu/drm/drm_gpuva_mgr.c
> @@ -655,6 +655,7 @@ drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
>   /**
>    * drm_gpuva_manager_init() - initialize a &drm_gpuva_manager
>    * @mgr: pointer to the &drm_gpuva_manager to initialize
> + * @drm: the drivers &drm_device
>    * @name: the name of the GPU VA space
>    * @start_offset: the start offset of the GPU VA space
>    * @range: the size of the GPU VA space
> @@ -669,6 +670,7 @@ drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
>    */
>   void
>   drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> +		       struct drm_device *drm,
>   		       const char *name,
>   		       u64 start_offset, u64 range,
>   		       u64 reserve_offset, u64 reserve_range,
> @@ -677,6 +679,11 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
>   	mgr->rb.tree = RB_ROOT_CACHED;
>   	INIT_LIST_HEAD(&mgr->rb.list);
>   
> +	mt_init(&mgr->mt_ext);
> +
> +	INIT_LIST_HEAD(&mgr->evict.list);
> +	spin_lock_init(&mgr->evict.lock);
> +
>   	drm_gpuva_check_overflow(start_offset, range);
>   	mgr->mm_start = start_offset;
>   	mgr->mm_range = range;
> @@ -694,6 +701,9 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
>   						     reserve_range)))
>   			__drm_gpuva_insert(mgr, &mgr->kernel_alloc_node);
>   	}
> +
> +	drm_gem_private_object_init(drm, &mgr->d_obj, 0);
> +	mgr->resv = mgr->d_obj.resv;
>   }
>   EXPORT_SYMBOL_GPL(drm_gpuva_manager_init);
>   
> @@ -713,10 +723,575 @@ drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr)
>   		__drm_gpuva_remove(&mgr->kernel_alloc_node);
>   
>   	WARN(!RB_EMPTY_ROOT(&mgr->rb.tree.rb_root),
> -	     "GPUVA tree is not empty, potentially leaking memory.");
> +	     "GPUVA tree is not empty, potentially leaking memory.\n");
> +
> +	mtree_destroy(&mgr->mt_ext);
> +	WARN(!list_empty(&mgr->evict.list), "Evict list should be empty.\n");
> +
> +	drm_gem_private_object_fini(&mgr->d_obj);
>   }
>   EXPORT_SYMBOL_GPL(drm_gpuva_manager_destroy);
>   
> +/**
> + * drm_gpuva_manager_prepare_objects() - prepare all assoiciated BOs
> + * @mgr: the &drm_gpuva_manager
> + * @num_fences: the amount of &dma_fences to reserve
> + *
> + * Calls drm_exec_prepare_obj() for all &drm_gem_objects the given
> + * &drm_gpuva_manager contains mappings of.
> + *
> + * Drivers can obtain the corresponding &drm_exec instance through
> + * DRM_GPUVA_EXEC(). It is the drivers responsibility to call drm_exec_init()
> + * and drm_exec_fini() accordingly.
> + *
> + * Returns: 0 on success, negative error code on failure.
> + */
> +int
> +drm_gpuva_manager_prepare_objects(struct drm_gpuva_manager *mgr,
> +				  unsigned int num_fences)
> +{
> +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
> +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
> +	union {
> +		void *ptr;
> +		uintptr_t cnt;
> +	} ref;
> +	int ret;
> +
> +	ret = drm_exec_prepare_obj(exec, &mgr->d_obj, num_fences);
> +	if (ret)
> +		goto out;
> +
> +	rcu_read_lock();
In xe we're protecting the external object list with an outer lock 
(the same lock that protects the mgr itself). Do we need a separate lock 
for this? In theory, as outlined in the VM_BIND locking document draft, 
one could probably even use the mgr resv for this, though at the cost of 
somewhat more complicated code. Also see the comment below about the data 
structure chosen.
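
To illustrate the resv variant, here's a rough and untested sketch of what 
the prepare loop could look like if the extobjs sat on a plain list 
protected that way. Note that mgr->extobj_list and the extobj_entry member 
are made-up names, not something this patch provides:

int
drm_gpuva_manager_prepare_objects(struct drm_gpuva_manager *mgr,
				  unsigned int num_fences)
{
	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
	struct drm_gpuva_gem *vm_bo;
	int ret;

	/* Locking &mgr->d_obj takes the VM's dma-resv, which in this
	 * variant also protects the (hypothetical) extobj list, so the
	 * walk below needs neither RCU nor a tree-internal lock.
	 */
	ret = drm_exec_prepare_obj(exec, &mgr->d_obj, num_fences);
	if (ret)
		return ret;

	dma_resv_assert_held(mgr->resv);
	list_for_each_entry(vm_bo, &mgr->extobj_list, extobj_entry) {
		ret = drm_exec_prepare_obj(exec, vm_bo->obj, num_fences);
		if (ret)
			return ret;
	}

	return 0;
}

The same walk works with any other outer lock; the point is just that the 
container wouldn't need its own locking scheme.
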
> +	mas_for_each(&mas, ref.ptr, ULONG_MAX) {
> +		struct drm_gem_object *obj;
> +
> +		mas_pause(&mas);
> +		rcu_read_unlock();
> +
> +		obj = (struct drm_gem_object *)(uintptr_t)mas.index;
> +		ret = drm_exec_prepare_obj(exec, obj, num_fences);
> +		if (ret)
> +			goto out;
> +
> +		rcu_read_lock();
> +	}
> +	rcu_read_unlock();
> +
> +out:
> +	return ret;
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_prepare_objects);
> +
> +/**
> + * drm_gpuva_manager_lock_extra() - lock all dma-resv of all assoiciated BOs
> + * @mgr: the &drm_gpuva_manager
> + * @fn: callback received by the driver to lock additional dma-resv
> + * @priv: private driver data passed to @fn
> + * @num_fences: the amount of &dma_fences to reserve
> + * @interruptible: sleep interruptible if waiting
> + *
> + * Acquires all dma-resv locks of all &drm_gem_objects the given
> + * &drm_gpuva_manager contains mappings of.
> + *
> + * Addionally, when calling this function the driver receives the given @fn
> + * callback to lock additional dma-resv in the context of the
> + * &drm_gpuva_managers &drm_exec instance. Typically, drivers would call
> + * drm_exec_prepare_obj() from within this callback.
> + *
> + * Returns: 0 on success, negative error code on failure.
> + */
> +int
> +drm_gpuva_manager_lock_extra(struct drm_gpuva_manager *mgr,
> +			     int (*fn)(struct drm_gpuva_manager *mgr,
> +				       void *priv, unsigned int num_fences),
> +			     void *priv,
> +			     unsigned int num_fences,
> +			     bool interruptible)
> +{
> +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
> +	uint32_t flags;
> +	int ret;
> +
> +	flags = interruptible ? DRM_EXEC_INTERRUPTIBLE_WAIT : 0 |
> +		DRM_EXEC_IGNORE_DUPLICATES;
> +
> +	drm_exec_init(exec, flags);
> +
> +	drm_exec_until_all_locked(exec) {
> +		ret = drm_gpuva_manager_prepare_objects(mgr, num_fences);
> +		drm_exec_retry_on_contention(exec);
> +		if (ret)
> +			goto err;
> +
> +		if (fn) {
> +			ret = fn(mgr, priv, num_fences);
> +			drm_exec_retry_on_contention(exec);
> +			if (ret)
> +				goto err;
> +		}
> +	}
> +
> +	return 0;
> +
> +err:
> +	drm_exec_fini(exec);
> +	return ret;
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_lock_extra);
> +
> +static int
> +fn_lock_array(struct drm_gpuva_manager *mgr, void *priv,
> +				unsigned int num_fences)
> +{
> +	struct {
> +		struct drm_gem_object **objs;
> +		unsigned int num_objs;
> +	} *args = priv;
> +
> +	return drm_exec_prepare_array(DRM_GPUVA_EXEC(mgr), args->objs,
> +				      args->num_objs, num_fences);
> +}
> +
> +/**
> + * drm_gpuva_manager_lock_array() - lock all dma-resv of all assoiciated BOs
> + * @mgr: the &drm_gpuva_manager
> + * @objs: additional &drm_gem_objects to lock
> + * @num_objs: the number of additional &drm_gem_objects to lock
> + * @num_fences: the amount of &dma_fences to reserve
> + * @interruptible: sleep interruptible if waiting
> + *
> + * Acquires all dma-resv locks of all &drm_gem_objects the given
> + * &drm_gpuva_manager contains mappings of, plus the ones given through @objs.
> + *
> + * Returns: 0 on success, negative error code on failure.
> + */
> +int
> +drm_gpuva_manager_lock_array(struct drm_gpuva_manager *mgr,
> +			     struct drm_gem_object **objs,
> +			     unsigned int num_objs,
> +			     unsigned int num_fences,
> +			     bool interruptible)
> +{
> +	struct {
> +		struct drm_gem_object **objs;
> +		unsigned int num_objs;
> +	} args;
> +
> +	args.objs = objs;
> +	args.num_objs = num_objs;
> +
> +	return drm_gpuva_manager_lock_extra(mgr, fn_lock_array, &args,
> +					    num_fences, interruptible);
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_lock_array);
> +
> +/**
> + * drm_gpuva_manager_validate() - validate all BOs marked as evicted
> + * @mgr: the &drm_gpuva_manager to validate evicted BOs
> + *
> + * Calls the &drm_gpuva_fn_ops.bo_validate callback for all evicted buffer
> + * objects being mapped in the given &drm_gpuva_manager.
> + *
> + * Returns: 0 on success, negative error code on failure.
> + */
> +int
> +drm_gpuva_manager_validate(struct drm_gpuva_manager *mgr)
> +{
> +	const struct drm_gpuva_fn_ops *ops = mgr->ops;
> +	struct drm_gpuva_gem *vm_bo;
> +	int ret;
> +
> +	if (unlikely(!ops || !ops->bo_validate))
> +		return -ENOTSUPP;
> +
> +	/* At this point we should hold all dma-resv locks of all GEM objects
> +	 * associated with this GPU-VM, hence it is safe to walk the list.
> +	 */
> +	list_for_each_entry(vm_bo, &mgr->evict.list, list.entry.evict) {
> +		dma_resv_assert_held(vm_bo->obj->resv);
> +
> +		ret = ops->bo_validate(vm_bo->obj);
> +		if (ret)
> +			return ret;
> +	}
> +
> +	return 0;
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_validate);
> +
> +/**
> + * drm_gpuva_manager_resv_add_fence - add fence to private and all extobj
> + * dma-resv
> + * @mgr: the &drm_gpuva_manager to add a fence to
> + * @fence: fence to add
> + * @private_usage: private dma-resv usage
> + * @extobj_usage: extobj dma-resv usage
> + */
> +void
> +drm_gpuva_manager_resv_add_fence(struct drm_gpuva_manager *mgr,
> +				 struct dma_fence *fence,
> +				 enum dma_resv_usage private_usage,
> +				 enum dma_resv_usage extobj_usage)
> +{
> +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
> +	struct drm_gem_object *obj;
> +	unsigned long index;
> +
> +	drm_exec_for_each_locked_object(exec, index, obj) {
> +			dma_resv_assert_held(obj->resv);
> +			dma_resv_add_fence(obj->resv, fence,
> +					   drm_gpuva_is_extobj(mgr, obj) ?
> +					   private_usage : extobj_usage);
> +	}
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_resv_add_fence);
> +
> +static struct drm_gpuva_gem *
> +__drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
> +		     struct drm_gem_object *obj)
> +{
> +	struct drm_gpuva_gem *vm_bo;
> +
> +	drm_gem_gpuva_assert_lock_held(obj);
> +
> +	drm_gem_for_each_gpuva_gem(vm_bo, obj)
> +		if (vm_bo->mgr == mgr)
> +			return vm_bo;
> +
> +	return NULL;
> +}
> +
> +/**
> + * drm_gpuva_gem_create() - create a new instance of struct drm_gpuva_gem
> + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> + * @obj: The &drm_gem_object being mapped in the @mgr.
> + *
> + * If provided by the driver, this function uses the &drm_gpuva_fn_ops
> + * vm_bo_alloc() callback to allocate.
> + *
> + * Returns: a pointer to the &drm_gpuva_gem on success, NULL on failure
> + */
> +struct drm_gpuva_gem *
> +drm_gpuva_gem_create(struct drm_gpuva_manager *mgr,
> +		     struct drm_gem_object *obj)
> +{
> +	const struct drm_gpuva_fn_ops *ops = mgr->ops;
> +	struct drm_gpuva_gem *vm_bo;
> +
> +	if (ops && ops->vm_bo_alloc)
> +		vm_bo = ops->vm_bo_alloc();
> +	else
> +		vm_bo = kzalloc(sizeof(*vm_bo), GFP_KERNEL);
> +
> +	if (unlikely(!vm_bo))
> +		return NULL;
> +
> +	vm_bo->mgr = mgr;
> +	vm_bo->obj = obj;
> +
> +	kref_init(&vm_bo->kref);
> +	INIT_LIST_HEAD(&vm_bo->list.gpuva);
> +	INIT_LIST_HEAD(&vm_bo->list.entry.gem);
> +	INIT_LIST_HEAD(&vm_bo->list.entry.evict);
> +
> +	drm_gem_object_get(obj);
> +
> +	return vm_bo;
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_create);
> +
> +void
> +drm_gpuva_gem_destroy(struct kref *kref)
> +{
> +	struct drm_gpuva_gem *vm_bo = container_of(kref, struct drm_gpuva_gem,
> +						   kref);
> +	const struct drm_gpuva_fn_ops *ops = vm_bo->mgr->ops;
> +
> +	drm_gem_object_put(vm_bo->obj);
> +
> +	if (ops && ops->vm_bo_free)
> +		ops->vm_bo_free(vm_bo);
> +	else
> +		kfree(vm_bo);
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_destroy);
> +
> +/**
> + * drm_gpuva_gem_find() - find the &drm_gpuva_gem for the given
> + * &drm_gpuva_manager and &drm_gem_object
> + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> + * @obj: The &drm_gem_object being mapped in the @mgr.
> + *
> + * Find the &drm_gpuva_gem representing the combination of the given
> + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
> + * count of the &drm_gpuva_gem accordingly.
> + *
> + * Returns: a pointer to the &drm_gpuva_gem on success, NULL on failure
> + */
> +struct drm_gpuva_gem *
> +drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
> +		   struct drm_gem_object *obj)
> +{
> +	struct drm_gpuva_gem *vm_bo = __drm_gpuva_gem_find(mgr, obj);
> +
> +	return vm_bo ? drm_gpuva_gem_get(vm_bo) : NULL;
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_find);
> +
> +/**
> + * drm_gpuva_gem_obtain() - obtains and instance of the &drm_gpuva_gem for the
> + * given &drm_gpuva_manager and &drm_gem_object
> + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> + * @obj: The &drm_gem_object being mapped in the @mgr.
> + *
> + * Find the &drm_gpuva_gem representing the combination of the given
> + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
> + * count of the &drm_gpuva_gem accordingly. If not found, allsocates a new
> + * &drm_gpuva_gem.
> + *
> + * Returns: a pointer to the &drm_gpuva_gem on success, an ERR_PTR on failure
> + */
> +struct drm_gpuva_gem *
> +drm_gpuva_gem_obtain(struct drm_gpuva_manager *mgr,
> +		     struct drm_gem_object *obj)
> +{
> +	struct drm_gpuva_gem *vm_bo;
> +
> +	vm_bo = drm_gpuva_gem_find(mgr, obj);
> +	if (vm_bo)
> +		return vm_bo;
> +
> +	vm_bo = drm_gpuva_gem_create(mgr, obj);
> +	if (!vm_bo)
> +		return ERR_PTR(-ENOMEM);
> +
> +	return vm_bo;
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_obtain);
> +
> +/**
> + * drm_gpuva_gem_obtain_prealloc() - obtains and instance of the &drm_gpuva_gem
> + * for the given &drm_gpuva_manager and &drm_gem_object
> + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> + * @obj: The &drm_gem_object being mapped in the @mgr.
> + *
> + * Find the &drm_gpuva_gem representing the combination of the given
> + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
> + * count of the found &drm_gpuva_gem accordingly, while the @__vm_bo reference
> + * count is decreased. If not found @__vm_bo is returned.
> + *
> + * Returns: a pointer to the found &drm_gpuva_gem or @__vm_bo if no existing
> + * &drm_gpuva_gem was found
> + */
> +struct drm_gpuva_gem *
> +drm_gpuva_gem_obtain_prealloc(struct drm_gpuva_manager *mgr,
> +			      struct drm_gem_object *obj,
> +			      struct drm_gpuva_gem *__vm_bo)
> +{
> +	struct drm_gpuva_gem *vm_bo;
> +
> +	vm_bo = drm_gpuva_gem_find(mgr, obj);
> +	if (vm_bo) {
> +		drm_gpuva_gem_put(__vm_bo);
> +		return vm_bo;
> +	}
> +
> +	return __vm_bo;
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_obtain_prealloc);
> +
> +static int
> +__drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
> +			  struct drm_gem_object *obj,
> +			  gfp_t gfp)
> +{
> +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
> +	union {
> +		struct drm_gem_object *obj;
> +		uintptr_t index;
> +	} gem;
> +	union {
> +		void *ptr;
> +		uintptr_t cnt;
> +	} ref;
> +	int ret = 0;
> +
> +	gem.obj = obj;
> +	mas_set(&mas, gem.index);
> +
> +	mas_lock(&mas);
> +	ref.ptr = mas_walk(&mas);
> +	if (ref.ptr) {
> +		++ref.cnt;
> +		mas_store(&mas, ref.ptr);
> +	} else {
> +		if (unlikely(!gfp)) {
> +			ret = -EINVAL;
> +			goto out;
> +		}
> +
> +		mas_set(&mas, gem.index);
> +		ref.cnt = 1;
> +		ret = mas_store_gfp(&mas, ref.ptr, gfp);
> +		if (likely(!ret))
> +			drm_gem_object_get(obj);
> +	}
> +out:
> +	mas_unlock(&mas);
> +	return ret;
> +}
> +
> +static void
> +__drm_gpuva_extobj_remove(struct drm_gpuva_manager *mgr,
> +			  struct drm_gem_object *obj)
> +{
> +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
> +	union {
> +		struct drm_gem_object *obj;
> +		uintptr_t index;
> +	} gem;
> +	union {
> +		void *ptr;
> +		uintptr_t cnt;
> +	} ref;
> +
> +	gem.obj = obj;
> +	mas_set(&mas, gem.index);
> +
> +	mas_lock(&mas);
> +	if (unlikely(!(ref.ptr = mas_walk(&mas))))
> +		goto out;
> +
> +	if (!--ref.cnt) {
> +		mas_erase(&mas);
> +		drm_gem_object_put(obj);
> +	} else {
> +		mas_store(&mas, ref.ptr);
> +	}
> +out:
> +	mas_unlock(&mas);
> +}
> +
> +/**
> + * drm_gpuva_extobj_insert - insert an external &drm_gem_object
> + * @mgr: the &drm_gpuva_manager to insert into
> + * @obj: the &drm_gem_object to insert as extobj
> + *
> + * Insert a &drm_gem_object into the &drm_gpuva_managers external object tree.
> + * If the &drm_gem_object already exists in the tree, the reference counter
> + * of this external object is increased by one.
> + *
> + * Drivers should insert the external &drm_gem_object before the dma-fence
> + * signalling critical section, e.g. when submitting the job, and before
> + * locking all &drm_gem_objects of a GPU-VM, e.g. with drm_gpuva_manager_lock()
> + * or its dervates.
> + *
> + * Returns: 0 on success, negative error code on failure.
> + */
> +int
> +drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
> +			struct drm_gem_object *obj)
> +{
> +	return drm_gpuva_is_extobj(mgr, obj) ?
> +		__drm_gpuva_extobj_insert(mgr, obj, GFP_KERNEL) : 0;
> +
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_insert);
> +
> +/**
> + * drm_gpuva_extobj_get - increase the referecne count of an external
> + * &drm_gem_object
> + * @mgr: the &drm_gpuva_manager storing the extobj
> + * @obj: the &drm_gem_object to representing the extobj
> + *
> + * Increases the reference count of the extobj represented by @obj.
> + *
> + * Drivers should call this for every &drm_gpuva backed by a &drm_gem_object
> + * being inserted.
> + *
> + * For &drm_gpuva_op_remap operations drivers should make sure to only take an
> + * additional reference if the re-map operation splits an existing &drm_gpuva
> + * into two separate ones.
> + *
> + * See also drm_gpuva_map_get() and drm_gpuva_remap_get().
> + *
> + * Returns: 0 on success, negative error code on failure.
> + */
> +void
> +drm_gpuva_extobj_get(struct drm_gpuva_manager *mgr,
> +		     struct drm_gem_object *obj)
> +{
> +	if (drm_gpuva_is_extobj(mgr, obj))
> +		WARN(__drm_gpuva_extobj_insert(mgr, obj, 0),
> +		     "Can't increase ref-count of non-existent extobj.");
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_get);
> +
> +/**
> + * drm_gpuva_extobj_put - decrease the referecne count of an external
> + * &drm_gem_object
> + * @mgr: the &drm_gpuva_manager storing the extobj
> + * @obj: the &drm_gem_object to representing the extobj
> + *
> + * Decreases the reference count of the extobj represented by @obj.
> + *
> + * Drivers should call this for every &drm_gpuva backed by a &drm_gem_object
> + * being removed from the GPU VA space.
> + *
> + * See also drm_gpuva_unmap_put().
> + *
> + * Returns: 0 on success, negative error code on failure.
> + */
> +void
> +drm_gpuva_extobj_put(struct drm_gpuva_manager *mgr,
> +		     struct drm_gem_object *obj)
> +{
> +	if (drm_gpuva_is_extobj(mgr, obj))
> +		__drm_gpuva_extobj_remove(mgr, obj);
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_put);
> +
> +/**
> + * drm_gpuva_gem_evict() - add / remove a &drm_gem_object to / from a
> + * &drm_gpuva_managers evicted list
> + * @obj: the &drm_gem_object to add or remove
> + * @evict: indicates whether the object is evicted
> + *
> + * Adds a &drm_gem_object to or removes it from all &drm_gpuva_managers evicted
> + * list containing a mapping of this &drm_gem_object.
> + */
> +void
> +drm_gpuva_gem_evict(struct drm_gem_object *obj, bool evict)
> +{
> +	struct drm_gpuva_gem *vm_bo;
> +
> +	/* Required for iterating the GEMs GPUVA GEM list. If no driver specific
> +	 * lock has been set, the list is protected with the GEMs dma-resv lock.
> +	 */
> +	drm_gem_gpuva_assert_lock_held(obj);
> +
> +	/* Required to protect the GPUVA managers evict list against concurrent
> +	 * access through drm_gpuva_manager_validate(). Concurrent insertions to
> +	 * the evict list through different GEM object evictions are protected
> +	 * by the GPUVA managers evict lock.
> +	 */
> +	dma_resv_assert_held(obj->resv);
> +
> +	drm_gem_for_each_gpuva_gem(vm_bo, obj) {
> +		struct drm_gpuva_manager *mgr = vm_bo->mgr;
> +
> +		spin_lock(&mgr->evict.lock);
> +		if (evict)
> +			list_add_tail(&vm_bo->list.entry.evict,
> +				      &mgr->evict.list);
> +		else
> +			list_del_init(&vm_bo->list.entry.evict);
> +		spin_unlock(&mgr->evict.lock);
> +	}
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_evict);
> +
>   static int
>   __drm_gpuva_insert(struct drm_gpuva_manager *mgr,
>   		   struct drm_gpuva *va)
> @@ -806,15 +1381,20 @@ EXPORT_SYMBOL_GPL(drm_gpuva_remove);
>   /**
>    * drm_gpuva_link() - link a &drm_gpuva
>    * @va: the &drm_gpuva to link
> + * @vm_bo: the &drm_gpuva_gem to add the &drm_gpuva to
>    *
> - * This adds the given &va to the GPU VA list of the &drm_gem_object it is
> - * associated with.
> + * This adds the given &va to the GPU VA list of the &drm_gpuva_gem and the
> + * &drm_gpuva_gem to the &drm_gem_object it is associated with.
> + *
> + * For every &drm_gpuva entry added to the &drm_gpuva_gem an additional
> + * reference of the latter is taken.
>    *
>    * This function expects the caller to protect the GEM's GPUVA list against
> - * concurrent access using the GEMs dma_resv lock.
> + * concurrent access using either the GEMs dma_resv lock or a driver specific
> + * lock set through drm_gem_gpuva_set_lock().
>    */
>   void
> -drm_gpuva_link(struct drm_gpuva *va)
> +drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo)
>   {
>   	struct drm_gem_object *obj = va->gem.obj;
>   
> @@ -823,7 +1403,10 @@ drm_gpuva_link(struct drm_gpuva *va)
>   
>   	drm_gem_gpuva_assert_lock_held(obj);
>   
> -	list_add_tail(&va->gem.entry, &obj->gpuva.list);
> +	drm_gpuva_gem_get(vm_bo);
> +	list_add_tail(&va->gem.entry, &vm_bo->list.gpuva);
> +	if (list_empty(&vm_bo->list.entry.gem))
> +		list_add_tail(&vm_bo->list.entry.gem, &obj->gpuva.list);
>   }
>   EXPORT_SYMBOL_GPL(drm_gpuva_link);
>   
> @@ -834,20 +1417,39 @@ EXPORT_SYMBOL_GPL(drm_gpuva_link);
>    * This removes the given &va from the GPU VA list of the &drm_gem_object it is
>    * associated with.
>    *
> + * This removes the given &va from the GPU VA list of the &drm_gpuva_gem and
> + * the &drm_gpuva_gem from the &drm_gem_object it is associated with in case
> + * this call unlinks the last &drm_gpuva from the &drm_gpuva_gem.
> + *
> + * For every &drm_gpuva entry removed from the &drm_gpuva_gem a reference of
> + * the latter is dropped.
> + *
>    * This function expects the caller to protect the GEM's GPUVA list against
> - * concurrent access using the GEMs dma_resv lock.
> + * concurrent access using either the GEMs dma_resv lock or a driver specific
> + * lock set through drm_gem_gpuva_set_lock().
>    */
>   void
>   drm_gpuva_unlink(struct drm_gpuva *va)
>   {
>   	struct drm_gem_object *obj = va->gem.obj;
> +	struct drm_gpuva_gem *vm_bo;
>   
>   	if (unlikely(!obj))
>   		return;
>   
>   	drm_gem_gpuva_assert_lock_held(obj);
>   
> +	vm_bo = __drm_gpuva_gem_find(va->mgr, obj);
> +	if (WARN(!vm_bo, "GPUVA doesn't seem to be linked.\n"))
> +		return;
> +
>   	list_del_init(&va->gem.entry);
> +
> +	if (list_empty(&vm_bo->list.gpuva)) {
> +		list_del_init(&vm_bo->list.entry.gem);
> +		list_del_init(&vm_bo->list.entry.evict);
> +	}
> +	drm_gpuva_gem_put(vm_bo);
>   }
>   EXPORT_SYMBOL_GPL(drm_gpuva_unlink);
>   
> @@ -977,6 +1579,26 @@ drm_gpuva_map(struct drm_gpuva_manager *mgr,
>   }
>   EXPORT_SYMBOL_GPL(drm_gpuva_map);
>   
> +/**
> + * drm_gpuva_map_get() - helper to insert a &drm_gpuva according to a
> + * &drm_gpuva_op_map
> + * @mgr: the &drm_gpuva_manager
> + * @va: the &drm_gpuva to insert
> + * @op: the &drm_gpuva_op_map to initialize @va with
> + *
> + * Initializes the @va from the @op and inserts it into the given @mgr and
> + * increases the reference count of the corresponding extobj.
> + */
> +void
> +drm_gpuva_map_get(struct drm_gpuva_manager *mgr,
> +		  struct drm_gpuva *va,
> +		  struct drm_gpuva_op_map *op)
> +{
> +	drm_gpuva_map(mgr, va, op);
> +	drm_gpuva_extobj_get(mgr, va->gem.obj);
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_map_get);
> +
>   /**
>    * drm_gpuva_remap() - helper to remap a &drm_gpuva according to a
>    * &drm_gpuva_op_remap
> @@ -992,10 +1614,10 @@ drm_gpuva_remap(struct drm_gpuva *prev,
>   		struct drm_gpuva *next,
>   		struct drm_gpuva_op_remap *op)
>   {
> -	struct drm_gpuva *curr = op->unmap->va;
> -	struct drm_gpuva_manager *mgr = curr->mgr;
> +	struct drm_gpuva *va = op->unmap->va;
> +	struct drm_gpuva_manager *mgr = va->mgr;
>   
> -	drm_gpuva_remove(curr);
> +	drm_gpuva_remove(va);
>   
>   	if (op->prev) {
>   		drm_gpuva_init_from_op(prev, op->prev);
> @@ -1009,6 +1631,31 @@ drm_gpuva_remap(struct drm_gpuva *prev,
>   }
>   EXPORT_SYMBOL_GPL(drm_gpuva_remap);
>   
> +/**
> + * drm_gpuva_remap_get() - helper to remap a &drm_gpuva according to a
> + * &drm_gpuva_op_remap
> + * @prev: the &drm_gpuva to remap when keeping the start of a mapping
> + * @next: the &drm_gpuva to remap when keeping the end of a mapping
> + * @op: the &drm_gpuva_op_remap to initialize @prev and @next with
> + *
> + * Removes the currently mapped &drm_gpuva and remaps it using @prev and/or
> + * @next. Additionally, if the re-map splits the existing &drm_gpuva into two
> + * separate mappings, increases the reference count of the corresponding extobj.
> + */
> +void
> +drm_gpuva_remap_get(struct drm_gpuva *prev,
> +		    struct drm_gpuva *next,
> +		    struct drm_gpuva_op_remap *op)
> +{
> +	struct drm_gpuva *va = op->unmap->va;
> +	struct drm_gpuva_manager *mgr = va->mgr;
> +
> +	drm_gpuva_remap(prev, next, op);
> +	if (op->prev && op->next)
> +		drm_gpuva_extobj_get(mgr, va->gem.obj);
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_remap_get);
> +
>   /**
>    * drm_gpuva_unmap() - helper to remove a &drm_gpuva according to a
>    * &drm_gpuva_op_unmap
> @@ -1023,6 +1670,24 @@ drm_gpuva_unmap(struct drm_gpuva_op_unmap *op)
>   }
>   EXPORT_SYMBOL_GPL(drm_gpuva_unmap);
>   
> +/**
> + * drm_gpuva_unmap_put() - helper to remove a &drm_gpuva according to a
> + * &drm_gpuva_op_unmap
> + * @op: the &drm_gpuva_op_unmap specifying the &drm_gpuva to remove
> + *
> + * Removes the &drm_gpuva associated with the &drm_gpuva_op_unmap and decreases
> + * the reference count of the corresponding extobj.
> + */
> +void
> +drm_gpuva_unmap_put(struct drm_gpuva_op_unmap *op)
> +{
> +	struct drm_gpuva *va = op->va;
> +
> +	drm_gpuva_unmap(op);
> +	drm_gpuva_extobj_put(va->mgr, va->gem.obj);
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_unmap_put);
> +
>   static int
>   op_map_cb(const struct drm_gpuva_fn_ops *fn, void *priv,
>   	  u64 addr, u64 range,
> @@ -1663,6 +2328,7 @@ drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
>   {
>   	struct drm_gpuva_ops *ops;
>   	struct drm_gpuva_op *op;
> +	struct drm_gpuva_gem *vm_bo;
>   	struct drm_gpuva *va;
>   	int ret;
>   
> @@ -1674,7 +2340,7 @@ drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
>   
>   	INIT_LIST_HEAD(&ops->list);
>   
> -	drm_gem_for_each_gpuva(va, obj) {
> +	drm_gem_for_each_gpuva(va, vm_bo, mgr, obj) {
>   		op = gpuva_op_alloc(mgr);
>   		if (!op) {
>   			ret = -ENOMEM;
> diff --git a/include/drm/drm_gem.h b/include/drm/drm_gem.h
> index bc9f6aa2f3fe..783ed3ab440d 100644
> --- a/include/drm/drm_gem.h
> +++ b/include/drm/drm_gem.h
> @@ -571,7 +571,7 @@ int drm_gem_evict(struct drm_gem_object *obj);
>    * drm_gem_gpuva_init() - initialize the gpuva list of a GEM object
>    * @obj: the &drm_gem_object
>    *
> - * This initializes the &drm_gem_object's &drm_gpuva list.
> + * This initializes the &drm_gem_object's &drm_gpuva_gem list.
>    *
>    * Calling this function is only necessary for drivers intending to support the
>    * &drm_driver_feature DRIVER_GEM_GPUVA.
> @@ -584,28 +584,44 @@ static inline void drm_gem_gpuva_init(struct drm_gem_object *obj)
>   }
>   
>   /**
> - * drm_gem_for_each_gpuva() - iternator to walk over a list of gpuvas
> - * @entry__: &drm_gpuva structure to assign to in each iteration step
> - * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
> + * drm_gem_for_each_gpuva_gem() - iterator to walk over a list of &drm_gpuva_gem
> + * @entry__: &drm_gpuva_gem structure to assign to in each iteration step
> + * @obj__: the &drm_gem_object the &drm_gpuva_gem to walk are associated with
>    *
> - * This iterator walks over all &drm_gpuva structures associated with the
> - * &drm_gpuva_manager.
> + * This iterator walks over all &drm_gpuva_gem structures associated with the
> + * &drm_gem_object.
>    */
> -#define drm_gem_for_each_gpuva(entry__, obj__) \
> -	list_for_each_entry(entry__, &(obj__)->gpuva.list, gem.entry)
> +#define drm_gem_for_each_gpuva_gem(entry__, obj__) \
> +	list_for_each_entry(entry__, &(obj__)->gpuva.list, list.entry.gem)
>   
>   /**
> - * drm_gem_for_each_gpuva_safe() - iternator to safely walk over a list of
> - * gpuvas
> - * @entry__: &drm_gpuva structure to assign to in each iteration step
> - * @next__: &next &drm_gpuva to store the next step
> - * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
> + * drm_gem_for_each_gpuva_gem_safe() - iterator to safely walk over a list of
> + * &drm_gpuva_gem
> + * @entry__: &drm_gpuva_gemstructure to assign to in each iteration step
> + * @next__: &next &drm_gpuva_gem to store the next step
> + * @obj__: the &drm_gem_object the &drm_gpuva_gem to walk are associated with
>    *
> - * This iterator walks over all &drm_gpuva structures associated with the
> + * This iterator walks over all &drm_gpuva_gem structures associated with the
>    * &drm_gem_object. It is implemented with list_for_each_entry_safe(), hence
>    * it is save against removal of elements.
>    */
> -#define drm_gem_for_each_gpuva_safe(entry__, next__, obj__) \
> -	list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list, gem.entry)
> +#define drm_gem_for_each_gpuva_gem_safe(entry__, next__, obj__) \
> +	list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list, list.entry.gem)
> +
> +/**
> + * drm_gem_for_each_gpuva() - iterator to walk over a list of &drm_gpuva
> + * @va__: &drm_gpuva structure to assign to in each iteration step
> + * @vm_bo__: the &drm_gpuva_gem representing the @mgr__ and @obj__ combination
> + * @mgr__: the &drm_gpuva_manager the &drm_gpuvas to walk are associated with
> + * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
> + *
> + * This iterator walks over all &drm_gpuva structures associated with the
> + * &drm_gpuva_manager and &drm_gem_object.
> + */
> +#define drm_gem_for_each_gpuva(va__, vm_bo__, mgr__, obj__) \
> +	for (vm_bo__ = drm_gpuva_gem_find(mgr__, obj__), \
> +	     va__ = vm_bo__ ? list_first_entry(&vm_bo__->list.gpuva, typeof(*va__), gem.entry) : NULL; \
> +	     va__ && !list_entry_is_head(va__, &vm_bo__->list.gpuva, gem.entry); \
> +	     va__ = list_next_entry(va__, gem.entry))
>   
>   #endif /* __DRM_GEM_H__ */
> diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h
> index ed8d50200cc3..693e2da3f425 100644
> --- a/include/drm/drm_gpuva_mgr.h
> +++ b/include/drm/drm_gpuva_mgr.h
> @@ -26,12 +26,16 @@
>    */
>   
>   #include <linux/list.h>
> +#include <linux/dma-resv.h>
> +#include <linux/maple_tree.h>
>   #include <linux/rbtree.h>
>   #include <linux/types.h>
>   
>   #include <drm/drm_gem.h>
> +#include <drm/drm_exec.h>
>   
>   struct drm_gpuva_manager;
> +struct drm_gpuva_gem;
>   struct drm_gpuva_fn_ops;
>   
>   /**
> @@ -140,7 +144,7 @@ struct drm_gpuva {
>   int drm_gpuva_insert(struct drm_gpuva_manager *mgr, struct drm_gpuva *va);
>   void drm_gpuva_remove(struct drm_gpuva *va);
>   
> -void drm_gpuva_link(struct drm_gpuva *va);
> +void drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo);
>   void drm_gpuva_unlink(struct drm_gpuva *va);
>   
>   struct drm_gpuva *drm_gpuva_find(struct drm_gpuva_manager *mgr,
> @@ -240,15 +244,137 @@ struct drm_gpuva_manager {
>   	 * @ops: &drm_gpuva_fn_ops providing the split/merge steps to drivers
>   	 */
>   	const struct drm_gpuva_fn_ops *ops;
> +
> +	/**
> +	 * @d_obj: Dummy GEM object; used internally to pass the GPU VMs
> +	 * dma-resv to &drm_exec.
> +	 */
> +	struct drm_gem_object d_obj;
> +
> +	/**
> +	 * @resv: the &dma_resv for &drm_gem_objects mapped in this GPU VA
> +	 * space
> +	 */
> +	struct dma_resv *resv;
> +
> +	/**
> +	 * @exec: the &drm_exec helper to lock external &drm_gem_objects
> +	 */
> +	struct drm_exec exec;
> +
> +	/**
> +	 * @mt_ext: &maple_tree storing external &drm_gem_objects
> +	 */
> +	struct maple_tree mt_ext;

Why are you using a maple tree here? Insertion and removal are 
O(log(n)), whereas a list would be O(1).
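
As a sketch of what I mean (untested; extobj_entry and extobj_list are 
made-up names, and this assumes extobjs get tracked per drm_gpuva_gem 
rather than per raw GEM pointer):

/* Hypothetically:
 *	struct list_head extobj_entry;	in struct drm_gpuva_gem
 *	struct list_head extobj_list;	in struct drm_gpuva_manager
 *
 * Add/remove is O(1), and since the vm_bo is already refcounted the
 * separate per-GEM counter stored in the maple tree might not be needed.
 */
static void drm_gpuva_gem_extobj_add(struct drm_gpuva_gem *vm_bo)
{
	struct drm_gpuva_manager *mgr = vm_bo->mgr;

	if (!drm_gpuva_is_extobj(mgr, vm_bo->obj))
		return;

	/* protected by whatever outer lock / VM resv is chosen above */
	if (list_empty(&vm_bo->extobj_entry))
		list_add_tail(&vm_bo->extobj_entry, &mgr->extobj_list);
}

static void drm_gpuva_gem_extobj_del(struct drm_gpuva_gem *vm_bo)
{
	list_del_init(&vm_bo->extobj_entry);
}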

> +
> +	/**
> +	 * @evict: structure holding the evict list and evict list lock
> +	 */
> +	struct {
> +		/**
> +		 * @list: &list_head storing &drm_gem_objects currently being
> +		 * evicted
> +		 */
> +		struct list_head list;
> +
> +		/**
> +		 * @lock: spinlock to protect the evict list against concurrent
> +		 * insertion / removal of different &drm_gpuva_gems
> +		 */
> +		spinlock_t lock;
> +	} evict;
>   };
>   
>   void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> +			    struct drm_device *drm,
>   			    const char *name,
>   			    u64 start_offset, u64 range,
>   			    u64 reserve_offset, u64 reserve_range,
>   			    const struct drm_gpuva_fn_ops *ops);
>   void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr);
>   
> +/**
> + * DRM_GPUVA_EXEC - returns the &drm_gpuva_managers &drm_exec instance
> + * @mgr: the &drm_gpuva_managers to return the &drm_exec instance for
> + */
> +#define DRM_GPUVA_EXEC(mgr)	&(mgr)->exec

A struct ww_acquire_ctx, and thus a drm_exec, is fundamentally per-task 
and should typically be allocated on the stack. Otherwise you'd need to 
protect the mgr->exec member with an exclusive lock throughout the 
locking process, and that's not what we want.

Did you consider subclassing a drm_exec for drm_gpuva purposes and adding 
the needed ops to it? Like so:

struct drm_gpuva_exec_ops {
     int (*fn) (struct drm_gpuva_exec *exec, int num_fences);
     int (*bo_validate) (struct drm_gpuva_exec *exec,
                         struct drm_gem_object *obj);
};

struct drm_gpuva_exec {
     const struct drm_gpuva_exec_ops *ops;
     struct drm_exec exec;
     struct drm_gpuva_manager *mgr;
};

I'd actually expect bo_validate to be part of fn in the typical case, 
though. The drm_gpuva_exec would then be allocated by the caller on the 
stack.
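
A rough usage sketch of that (untested; drm_gpuva_exec_prepare_objects and 
driver_gpuva_exec_ops are made-up names, the drm_exec calls are the ones 
this patch already uses):

static int driver_lock_vm(struct drm_gpuva_manager *mgr)
{
	struct drm_gpuva_exec vm_exec = {
		.mgr = mgr,
		.ops = &driver_gpuva_exec_ops,	/* driver-provided ops */
	};
	int ret;

	drm_exec_init(&vm_exec.exec, DRM_EXEC_INTERRUPTIBLE_WAIT);

	drm_exec_until_all_locked(&vm_exec.exec) {
		/* walks the VM's objects, calling vm_exec.ops->fn() etc. */
		ret = drm_gpuva_exec_prepare_objects(&vm_exec, 1);
		drm_exec_retry_on_contention(&vm_exec.exec);
		if (ret)
			goto err;
	}

	/* ... use the locked objects, add fences ... */

	drm_exec_fini(&vm_exec.exec);
	return 0;

err:
	drm_exec_fini(&vm_exec.exec);
	return ret;
}

Since the drm_gpuva_exec then lives on each caller's stack, concurrent 
tasks can lock the same VM without serializing on an mgr-embedded drm_exec.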


> +
> +int drm_gpuva_manager_lock_extra(struct drm_gpuva_manager *mgr,
> +				 int (*fn)(struct drm_gpuva_manager *mgr,
> +					   void *priv, unsigned int num_fences),
> +				 void *priv,
> +				 unsigned int num_fences,
> +				 bool interruptible);
> +
> +int drm_gpuva_manager_lock_array(struct drm_gpuva_manager *mgr,
> +				 struct drm_gem_object **objs,
> +				 unsigned int num_objs,
> +				 unsigned int num_fences,
> +				 bool interruptible);
> +
> +/**
> + * drm_gpuva_manager_lock() - lock all dma-resv of all assoiciated BOs
> + * @mgr: the &drm_gpuva_manager
> + * @num_fences: the amount of &dma_fences to reserve
> + * @interruptible: sleep interruptible if waiting
> + *
> + * Acquires all dma-resv locks of all &drm_gem_objects the given
> + * &drm_gpuva_manager contains mappings of.
> + *
> + * Returns: 0 on success, negative error code on failure.
> + */
> +static inline int
> +drm_gpuva_manager_lock(struct drm_gpuva_manager *mgr,
> +		       unsigned int num_fences,
> +		       bool interruptible)
> +{
> +	return drm_gpuva_manager_lock_extra(mgr, NULL, NULL, num_fences,
> +					    interruptible);
> +}
> +
> +/**
> + * drm_gpuva_manager_lock() - lock all dma-resv of all assoiciated BOs
> + * @mgr: the &drm_gpuva_manager
> + *
> + * Releases all dma-resv locks of all &drm_gem_objects previously acquired
> + * through drm_gpuva_manager_lock() or its variants.
> + *
> + * Returns: 0 on success, negative error code on failure.
> + */
> +static inline void
> +drm_gpuva_manager_unlock(struct drm_gpuva_manager *mgr)
> +{
> +	drm_exec_fini(&mgr->exec);
> +}
> +
> +int drm_gpuva_manager_validate(struct drm_gpuva_manager *mgr);
> +void drm_gpuva_manager_resv_add_fence(struct drm_gpuva_manager *mgr,
> +				      struct dma_fence *fence,
> +				      enum dma_resv_usage private_usage,
> +				      enum dma_resv_usage extobj_usage);
> +
> +int drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
> +			    struct drm_gem_object *obj);
> +void drm_gpuva_extobj_get(struct drm_gpuva_manager *mgr,
> +			  struct drm_gem_object *obj);
> +void drm_gpuva_extobj_put(struct drm_gpuva_manager *mgr,
> +			  struct drm_gem_object *obj);
> +
> +/**
> + * drm_gpuva_is_extobj() - indicates whether the given &drm_gem_object is an
> + * external object
> + * @mgr: the &drm_gpuva_manager to check
> + * @obj: the &drm_gem_object to check
> + *
> + * Returns: true if the &drm_gem_object &dma_resv differs from the
> + * &drm_gpuva_managers &dma_resv, false otherwise
> + */
> +static inline bool drm_gpuva_is_extobj(struct drm_gpuva_manager *mgr,
> +				       struct drm_gem_object *obj)
> +{
> +	return obj && obj->resv != mgr->resv;
> +}
> +
>   static inline struct drm_gpuva *
>   __drm_gpuva_next(struct drm_gpuva *va)
>   {
> @@ -327,6 +453,138 @@ __drm_gpuva_next(struct drm_gpuva *va)
>   #define drm_gpuva_for_each_va_safe(va__, next__, mgr__) \
>   	list_for_each_entry_safe(va__, next__, &(mgr__)->rb.list, rb.entry)
>   
> +/**
> + * struct drm_gpuva_gem - structure representing a &drm_gpuva_manager and
> + * &drm_gem_object combination
> + *
> + * This structure is an abstraction representing a &drm_gpuva_manager and
> + * &drm_gem_object combination. It serves as an indirection to accelerate
> + * iterating all &drm_gpuvas within a &drm_gpuva_manager backed by the same
> + * &drm_gem_object.
> + *
> + * Furthermore it is used cache evicted GEM objects for a certain GPU-VM to
> + * accelerate validation.
> + *
> + * Typically, drivers want to create an instance of a struct drm_gpuva_gem once
> + * a GEM object is mapped first in a GPU-VM and release the instance once the
> + * last mapping of the GEM object in this GPU-VM is unmapped.
> + */
> +struct drm_gpuva_gem {
> +
> +	/**
> +	 * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> +	 */
> +	struct drm_gpuva_manager *mgr;
> +
> +	/**
> +	 * @obj: The &drm_gem_object being mapped in the @mgr.
> +	 */
> +	struct drm_gem_object *obj;
> +
> +	/**
> +	 * @kref: The reference count for this &drm_gpuva_gem.
> +	 */
> +	struct kref kref;
> +
> +	/**
> +	 * @list: Structure containing all &list_heads.
> +	 */
> +	struct {
> +		/**
> +		 * @gpuva: The list of linked &drm_gpuvas.
> +		 */
> +		struct list_head gpuva;
> +
> +		/**
> +		 * @entry: Structure containing all &list_heads serving as
> +		 * entry.
> +		 */
> +		struct {
> +			/**
> +			 * @gem: List entry to attach to the &drm_gem_objects
> +			 * gpuva list.
> +			 */
> +			struct list_head gem;
> +
> +			/**
> +			 * @evict: List entry to attach to the
> +			 * &drm_gpuva_managers evict list.
> +			 */
> +			struct list_head evict;
> +		} entry;
> +	} list;
> +};
> +
> +struct drm_gpuva_gem *
> +drm_gpuva_gem_obtain(struct drm_gpuva_manager *mgr,
> +		     struct drm_gem_object *obj);
> +struct drm_gpuva_gem *
> +drm_gpuva_gem_obtain_prealloc(struct drm_gpuva_manager *mgr,
> +			      struct drm_gem_object *obj,
> +			      struct drm_gpuva_gem *__vm_bo);
> +
> +struct drm_gpuva_gem *
> +drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
> +		   struct drm_gem_object *obj);
> +
> +void drm_gpuva_gem_evict(struct drm_gem_object *obj, bool evict);
> +
> +struct drm_gpuva_gem *
> +drm_gpuva_gem_create(struct drm_gpuva_manager *mgr,
> +		     struct drm_gem_object *obj);
> +void drm_gpuva_gem_destroy(struct kref *kref);
> +
> +/**
> + * drm_gpuva_gem_get() - acquire a struct drm_gpuva_gem reference
> + * @vm_bo: the &drm_gpuva_gem to acquire the reference of
> + *
> + * This function acquires an additional reference to @vm_bo. It is illegal to
> + * call this without already holding a reference. No locks required.
> + */
> +static inline struct drm_gpuva_gem *
> +drm_gpuva_gem_get(struct drm_gpuva_gem *vm_bo)
> +{
> +	kref_get(&vm_bo->kref);
> +	return vm_bo;
> +}
> +
> +/**
> + * drm_gpuva_gem_put() - drop a struct drm_gpuva_gem reference
> + * @vm_bo: the &drm_gpuva_gem to release the reference of
> + *
> + * This releases a reference to @vm_bo.
> + */
> +static inline void
> +drm_gpuva_gem_put(struct drm_gpuva_gem *vm_bo)
> +{
> +	kref_put(&vm_bo->kref, drm_gpuva_gem_destroy);
> +}
> +
> +/**
> + * drm_gpuva_gem_for_each_va() - iterator to walk over a list of &drm_gpuva
> + * @va__: &drm_gpuva structure to assign to in each iteration step
> + * @vm_bo__: the &drm_gpuva_gem the &drm_gpuva to walk are associated with
> + *
> + * This iterator walks over all &drm_gpuva structures associated with the
> + * &drm_gpuva_gem.
> + */
> +#define drm_gpuva_gem_for_each_va(va__, vm_bo__) \
> +	list_for_each_entry(va__, &(vm_bo__)->list.gpuva, gem.entry)
> +
> +/**
> + * drm_gpuva_gem_for_each_va_safe() - iterator to safely walk over a list of
> + * &drm_gpuva
> + * @va__: &drm_gpuva structure to assign to in each iteration step
> + * @next__: &next &drm_gpuva to store the next step
> + * @vm_bo__: the &drm_gpuva_gem the &drm_gpuva to walk are associated with
> + *
> + * This iterator walks over all &drm_gpuva structures associated with the
> + * &drm_gpuva_gem. It is implemented with list_for_each_entry_safe(), hence
> + * it is safe against removal of elements.
> + */
> +#define drm_gpuva_gem_for_each_va_safe(va__, next__, vm_bo__) \
> +	list_for_each_entry_safe(va__, next__, &(vm_bo__)->list.gpuva, gem.entry)
> +
>   /**
>    * enum drm_gpuva_op_type - GPU VA operation type
>    *
> @@ -641,6 +899,30 @@ struct drm_gpuva_fn_ops {
>   	 */
>   	void (*op_free)(struct drm_gpuva_op *op);
>   
> +	/**
> +	 * @vm_bo_alloc: called when the &drm_gpuva_manager allocates
> +	 * a struct drm_gpuva_gem
> +	 *
> +	 * Some drivers may want to embed struct drm_gpuva_gem into driver
> +	 * specific structures. By implementing this callback drivers can
> +	 * allocate memory accordingly.
> +	 *
> +	 * This callback is optional.
> +	 */
> +	struct drm_gpuva_gem *(*vm_bo_alloc)(void);
> +
> +	/**
> +	 * @vm_bo_free: called when the &drm_gpuva_manager frees a
> +	 * struct drm_gpuva_gem
> +	 *
> +	 * Some drivers may want to embed struct drm_gpuva_gem into driver
> +	 * specific structures. By implementing this callback drivers can
> +	 * free the previously allocated memory accordingly.
> +	 *
> +	 * This callback is optional.
> +	 */
> +	void (*vm_bo_free)(struct drm_gpuva_gem *vm_bo);
> +
>   	/**
>   	 * @sm_step_map: called from &drm_gpuva_sm_map to finally insert the
>   	 * mapping once all previous steps were completed
> @@ -684,6 +966,17 @@ struct drm_gpuva_fn_ops {
>   	 * used.
>   	 */
>   	int (*sm_step_unmap)(struct drm_gpuva_op *op, void *priv);
> +
> +	/**
> +	 * @bo_validate: called from drm_gpuva_manager_validate()
> +	 *
> +	 * Drivers receive this callback for every evicted &drm_gem_object being
> +	 * mapped in the corresponding &drm_gpuva_manager.
> +	 *
> +	 * Typically, drivers would call their driver specific variant of
> +	 * ttm_bo_validate() from within this callback.
> +	 */
> +	int (*bo_validate)(struct drm_gem_object *obj);
>   };
>   
>   int drm_gpuva_sm_map(struct drm_gpuva_manager *mgr, void *priv,
> @@ -696,11 +989,18 @@ int drm_gpuva_sm_unmap(struct drm_gpuva_manager *mgr, void *priv,
>   void drm_gpuva_map(struct drm_gpuva_manager *mgr,
>   		   struct drm_gpuva *va,
>   		   struct drm_gpuva_op_map *op);
> +void drm_gpuva_map_get(struct drm_gpuva_manager *mgr,
> +		       struct drm_gpuva *va,
> +		       struct drm_gpuva_op_map *op);
>   
>   void drm_gpuva_remap(struct drm_gpuva *prev,
>   		     struct drm_gpuva *next,
>   		     struct drm_gpuva_op_remap *op);
> +void drm_gpuva_remap_get(struct drm_gpuva *prev,
> +			 struct drm_gpuva *next,
> +			 struct drm_gpuva_op_remap *op);
>   
>   void drm_gpuva_unmap(struct drm_gpuva_op_unmap *op);
> +void drm_gpuva_unmap_put(struct drm_gpuva_op_unmap *op);
>   
>   #endif /* __DRM_GPUVA_MGR_H__ */

^ permalink raw reply	[flat|nested] 88+ messages in thread

* Re: [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
@ 2023-08-30  7:27     ` Thomas Hellström (Intel)
  0 siblings, 0 replies; 88+ messages in thread
From: Thomas Hellström (Intel) @ 2023-08-30  7:27 UTC (permalink / raw)
  To: Danilo Krummrich, airlied, daniel, matthew.brost,
	thomas.hellstrom, sarah.walker, donald.robson, boris.brezillon,
	christian.koenig, faith.ekstrand, bskeggs, Liam.Howlett
  Cc: nouveau, linux-kernel, dri-devel

Hi, Danilo.

Some quick comments since I'm doing some Xe work in this area. Will 
probably get back with more.

On 8/20/23 23:53, Danilo Krummrich wrote:
> So far the DRM GPUVA manager offers common infrastructure to track GPU VA
> allocations and mappings, generically connect GPU VA mappings to their
> backing buffers and perform more complex mapping operations on the GPU VA
> space.
>
> However, there are more design patterns commonly used by drivers, which
> can potentially be generalized in order to make the DRM GPUVA manager
> represent a basic GPU-VM implementation. In this context, this patch aims
> at generalizing the following elements.
>
> 1) Provide a common dma-resv for GEM objects not being used outside of
>     this GPU-VM.
>
> 2) Provide tracking of external GEM objects (GEM objects which are
>     shared with other GPU-VMs).
>
> 3) Provide functions to efficiently lock all GEM objects dma-resv the
>     GPU-VM contains mappings of.
>
> 4) Provide tracking of evicted GEM objects the GPU-VM contains mappings
>     of, such that validation of evicted GEM objects is accelerated.
>
> 5) Provide some convenience functions for common patterns.
>
> Rather than being designed as a "framework", the target is to make all
> features appear as a collection of optional helper functions, such that
> drivers are free to make use of the DRM GPUVA managers basic
> functionality and opt-in for other features without setting any feature
> flags, just by making use of the corresponding functions.
>
> Signed-off-by: Danilo Krummrich <dakr@redhat.com>
> ---
>   drivers/gpu/drm/drm_gpuva_mgr.c | 688 +++++++++++++++++++++++++++++++-
>   include/drm/drm_gem.h           |  48 ++-
>   include/drm/drm_gpuva_mgr.h     | 302 +++++++++++++-
>   3 files changed, 1010 insertions(+), 28 deletions(-)
>
> diff --git a/drivers/gpu/drm/drm_gpuva_mgr.c b/drivers/gpu/drm/drm_gpuva_mgr.c
> index f86bfad74ff8..69872b205961 100644
> --- a/drivers/gpu/drm/drm_gpuva_mgr.c
> +++ b/drivers/gpu/drm/drm_gpuva_mgr.c
> @@ -655,6 +655,7 @@ drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
>   /**
>    * drm_gpuva_manager_init() - initialize a &drm_gpuva_manager
>    * @mgr: pointer to the &drm_gpuva_manager to initialize
> + * @drm: the drivers &drm_device
>    * @name: the name of the GPU VA space
>    * @start_offset: the start offset of the GPU VA space
>    * @range: the size of the GPU VA space
> @@ -669,6 +670,7 @@ drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
>    */
>   void
>   drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> +		       struct drm_device *drm,
>   		       const char *name,
>   		       u64 start_offset, u64 range,
>   		       u64 reserve_offset, u64 reserve_range,
> @@ -677,6 +679,11 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
>   	mgr->rb.tree = RB_ROOT_CACHED;
>   	INIT_LIST_HEAD(&mgr->rb.list);
>   
> +	mt_init(&mgr->mt_ext);
> +
> +	INIT_LIST_HEAD(&mgr->evict.list);
> +	spin_lock_init(&mgr->evict.lock);
> +
>   	drm_gpuva_check_overflow(start_offset, range);
>   	mgr->mm_start = start_offset;
>   	mgr->mm_range = range;
> @@ -694,6 +701,9 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
>   						     reserve_range)))
>   			__drm_gpuva_insert(mgr, &mgr->kernel_alloc_node);
>   	}
> +
> +	drm_gem_private_object_init(drm, &mgr->d_obj, 0);
> +	mgr->resv = mgr->d_obj.resv;
>   }
>   EXPORT_SYMBOL_GPL(drm_gpuva_manager_init);
>   
> @@ -713,10 +723,575 @@ drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr)
>   		__drm_gpuva_remove(&mgr->kernel_alloc_node);
>   
>   	WARN(!RB_EMPTY_ROOT(&mgr->rb.tree.rb_root),
> -	     "GPUVA tree is not empty, potentially leaking memory.");
> +	     "GPUVA tree is not empty, potentially leaking memory.\n");
> +
> +	mtree_destroy(&mgr->mt_ext);
> +	WARN(!list_empty(&mgr->evict.list), "Evict list should be empty.\n");
> +
> +	drm_gem_private_object_fini(&mgr->d_obj);
>   }
>   EXPORT_SYMBOL_GPL(drm_gpuva_manager_destroy);
>   
> +/**
> + * drm_gpuva_manager_prepare_objects() - prepare all associated BOs
> + * @mgr: the &drm_gpuva_manager
> + * @num_fences: the amount of &dma_fences to reserve
> + *
> + * Calls drm_exec_prepare_obj() for all &drm_gem_objects the given
> + * &drm_gpuva_manager contains mappings of.
> + *
> + * Drivers can obtain the corresponding &drm_exec instance through
> + * DRM_GPUVA_EXEC(). It is the drivers responsibility to call drm_exec_init()
> + * and drm_exec_fini() accordingly.
> + *
> + * Returns: 0 on success, negative error code on failure.
> + */
> +int
> +drm_gpuva_manager_prepare_objects(struct drm_gpuva_manager *mgr,
> +				  unsigned int num_fences)
> +{
> +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
> +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
> +	union {
> +		void *ptr;
> +		uintptr_t cnt;
> +	} ref;
> +	int ret;
> +
> +	ret = drm_exec_prepare_obj(exec, &mgr->d_obj, num_fences);
> +	if (ret)
> +		goto out;
> +
> +	rcu_read_lock();
In xe we're protecting the external object list with an outer lock 
(same as protecting the mgr itself). Do we need a separate lock for 
this? In theory, as outlined in the VM_BIND locking document draft, one 
could probably even use the mgr resv for this, but with more complicated 
code I guess. Also see the comment below about the data structure chosen.
> +	mas_for_each(&mas, ref.ptr, ULONG_MAX) {
> +		struct drm_gem_object *obj;
> +
> +		mas_pause(&mas);
> +		rcu_read_unlock();
> +
> +		obj = (struct drm_gem_object *)(uintptr_t)mas.index;
> +		ret = drm_exec_prepare_obj(exec, obj, num_fences);
> +		if (ret)
> +			goto out;
> +
> +		rcu_read_lock();
> +	}
> +	rcu_read_unlock();
> +
> +out:
> +	return ret;
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_prepare_objects);
> +
> +/**
> + * drm_gpuva_manager_lock_extra() - lock all dma-resv of all associated BOs
> + * @mgr: the &drm_gpuva_manager
> + * @fn: callback received by the driver to lock additional dma-resv
> + * @priv: private driver data passed to @fn
> + * @num_fences: the amount of &dma_fences to reserve
> + * @interruptible: sleep interruptible if waiting
> + *
> + * Acquires all dma-resv locks of all &drm_gem_objects the given
> + * &drm_gpuva_manager contains mappings of.
> + *
> + * Additionally, when calling this function the driver receives the given @fn
> + * callback to lock additional dma-resv in the context of the
> + * &drm_gpuva_managers &drm_exec instance. Typically, drivers would call
> + * drm_exec_prepare_obj() from within this callback.
> + *
> + * Returns: 0 on success, negative error code on failure.
> + */
> +int
> +drm_gpuva_manager_lock_extra(struct drm_gpuva_manager *mgr,
> +			     int (*fn)(struct drm_gpuva_manager *mgr,
> +				       void *priv, unsigned int num_fences),
> +			     void *priv,
> +			     unsigned int num_fences,
> +			     bool interruptible)
> +{
> +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
> +	uint32_t flags;
> +	int ret;
> +
> +	flags = (interruptible ? DRM_EXEC_INTERRUPTIBLE_WAIT : 0) |
> +		DRM_EXEC_IGNORE_DUPLICATES;
> +
> +	drm_exec_init(exec, flags);
> +
> +	drm_exec_until_all_locked(exec) {
> +		ret = drm_gpuva_manager_prepare_objects(mgr, num_fences);
> +		drm_exec_retry_on_contention(exec);
> +		if (ret)
> +			goto err;
> +
> +		if (fn) {
> +			ret = fn(mgr, priv, num_fences);
> +			drm_exec_retry_on_contention(exec);
> +			if (ret)
> +				goto err;
> +		}
> +	}
> +
> +	return 0;
> +
> +err:
> +	drm_exec_fini(exec);
> +	return ret;
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_lock_extra);
> +
> +static int
> +fn_lock_array(struct drm_gpuva_manager *mgr, void *priv,
> +				unsigned int num_fences)
> +{
> +	struct {
> +		struct drm_gem_object **objs;
> +		unsigned int num_objs;
> +	} *args = priv;
> +
> +	return drm_exec_prepare_array(DRM_GPUVA_EXEC(mgr), args->objs,
> +				      args->num_objs, num_fences);
> +}
> +
> +/**
> + * drm_gpuva_manager_lock_array() - lock all dma-resv of all associated BOs
> + * @mgr: the &drm_gpuva_manager
> + * @objs: additional &drm_gem_objects to lock
> + * @num_objs: the number of additional &drm_gem_objects to lock
> + * @num_fences: the amount of &dma_fences to reserve
> + * @interruptible: sleep interruptible if waiting
> + *
> + * Acquires all dma-resv locks of all &drm_gem_objects the given
> + * &drm_gpuva_manager contains mappings of, plus the ones given through @objs.
> + *
> + * Returns: 0 on success, negative error code on failure.
> + */
> +int
> +drm_gpuva_manager_lock_array(struct drm_gpuva_manager *mgr,
> +			     struct drm_gem_object **objs,
> +			     unsigned int num_objs,
> +			     unsigned int num_fences,
> +			     bool interruptible)
> +{
> +	struct {
> +		struct drm_gem_object **objs;
> +		unsigned int num_objs;
> +	} args;
> +
> +	args.objs = objs;
> +	args.num_objs = num_objs;
> +
> +	return drm_gpuva_manager_lock_extra(mgr, fn_lock_array, &args,
> +					    num_fences, interruptible);
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_lock_array);
> +
> +/**
> + * drm_gpuva_manager_validate() - validate all BOs marked as evicted
> + * @mgr: the &drm_gpuva_manager to validate evicted BOs
> + *
> + * Calls the &drm_gpuva_fn_ops.bo_validate callback for all evicted buffer
> + * objects being mapped in the given &drm_gpuva_manager.
> + *
> + * Returns: 0 on success, negative error code on failure.
> + */
> +int
> +drm_gpuva_manager_validate(struct drm_gpuva_manager *mgr)
> +{
> +	const struct drm_gpuva_fn_ops *ops = mgr->ops;
> +	struct drm_gpuva_gem *vm_bo;
> +	int ret;
> +
> +	if (unlikely(!ops || !ops->bo_validate))
> +		return -ENOTSUPP;
> +
> +	/* At this point we should hold all dma-resv locks of all GEM objects
> +	 * associated with this GPU-VM, hence it is safe to walk the list.
> +	 */
> +	list_for_each_entry(vm_bo, &mgr->evict.list, list.entry.evict) {
> +		dma_resv_assert_held(vm_bo->obj->resv);
> +
> +		ret = ops->bo_validate(vm_bo->obj);
> +		if (ret)
> +			return ret;
> +	}
> +
> +	return 0;
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_validate);
> +
> +/**
> + * drm_gpuva_manager_resv_add_fence - add fence to private and all extobj
> + * dma-resv
> + * @mgr: the &drm_gpuva_manager to add a fence to
> + * @fence: fence to add
> + * @private_usage: private dma-resv usage
> + * @extobj_usage: extobj dma-resv usage
> + */
> +void
> +drm_gpuva_manager_resv_add_fence(struct drm_gpuva_manager *mgr,
> +				 struct dma_fence *fence,
> +				 enum dma_resv_usage private_usage,
> +				 enum dma_resv_usage extobj_usage)
> +{
> +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
> +	struct drm_gem_object *obj;
> +	unsigned long index;
> +
> +	drm_exec_for_each_locked_object(exec, index, obj) {
> +			dma_resv_assert_held(obj->resv);
> +			dma_resv_add_fence(obj->resv, fence,
> +					   drm_gpuva_is_extobj(mgr, obj) ?
> +					   extobj_usage : private_usage);
> +	}
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_resv_add_fence);
> +
> +static struct drm_gpuva_gem *
> +__drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
> +		     struct drm_gem_object *obj)
> +{
> +	struct drm_gpuva_gem *vm_bo;
> +
> +	drm_gem_gpuva_assert_lock_held(obj);
> +
> +	drm_gem_for_each_gpuva_gem(vm_bo, obj)
> +		if (vm_bo->mgr == mgr)
> +			return vm_bo;
> +
> +	return NULL;
> +}
> +
> +/**
> + * drm_gpuva_gem_create() - create a new instance of struct drm_gpuva_gem
> + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> + * @obj: The &drm_gem_object being mapped in the @mgr.
> + *
> + * If provided by the driver, this function uses the &drm_gpuva_fn_ops
> + * vm_bo_alloc() callback to allocate.
> + *
> + * Returns: a pointer to the &drm_gpuva_gem on success, NULL on failure
> + */
> +struct drm_gpuva_gem *
> +drm_gpuva_gem_create(struct drm_gpuva_manager *mgr,
> +		     struct drm_gem_object *obj)
> +{
> +	const struct drm_gpuva_fn_ops *ops = mgr->ops;
> +	struct drm_gpuva_gem *vm_bo;
> +
> +	if (ops && ops->vm_bo_alloc)
> +		vm_bo = ops->vm_bo_alloc();
> +	else
> +		vm_bo = kzalloc(sizeof(*vm_bo), GFP_KERNEL);
> +
> +	if (unlikely(!vm_bo))
> +		return NULL;
> +
> +	vm_bo->mgr = mgr;
> +	vm_bo->obj = obj;
> +
> +	kref_init(&vm_bo->kref);
> +	INIT_LIST_HEAD(&vm_bo->list.gpuva);
> +	INIT_LIST_HEAD(&vm_bo->list.entry.gem);
> +	INIT_LIST_HEAD(&vm_bo->list.entry.evict);
> +
> +	drm_gem_object_get(obj);
> +
> +	return vm_bo;
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_create);
> +
> +void
> +drm_gpuva_gem_destroy(struct kref *kref)
> +{
> +	struct drm_gpuva_gem *vm_bo = container_of(kref, struct drm_gpuva_gem,
> +						   kref);
> +	const struct drm_gpuva_fn_ops *ops = vm_bo->mgr->ops;
> +
> +	drm_gem_object_put(vm_bo->obj);
> +
> +	if (ops && ops->vm_bo_free)
> +		ops->vm_bo_free(vm_bo);
> +	else
> +		kfree(vm_bo);
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_destroy);
> +
> +/**
> + * drm_gpuva_gem_find() - find the &drm_gpuva_gem for the given
> + * &drm_gpuva_manager and &drm_gem_object
> + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> + * @obj: The &drm_gem_object being mapped in the @mgr.
> + *
> + * Find the &drm_gpuva_gem representing the combination of the given
> + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
> + * count of the &drm_gpuva_gem accordingly.
> + *
> + * Returns: a pointer to the &drm_gpuva_gem on success, NULL on failure
> + */
> +struct drm_gpuva_gem *
> +drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
> +		   struct drm_gem_object *obj)
> +{
> +	struct drm_gpuva_gem *vm_bo = __drm_gpuva_gem_find(mgr, obj);
> +
> +	return vm_bo ? drm_gpuva_gem_get(vm_bo) : NULL;
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_find);
> +
> +/**
> + * drm_gpuva_gem_obtain() - obtains an instance of the &drm_gpuva_gem for the
> + * given &drm_gpuva_manager and &drm_gem_object
> + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> + * @obj: The &drm_gem_object being mapped in the @mgr.
> + *
> + * Find the &drm_gpuva_gem representing the combination of the given
> + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
> + * count of the &drm_gpuva_gem accordingly. If not found, allocates a new
> + * &drm_gpuva_gem.
> + *
> + * Returns: a pointer to the &drm_gpuva_gem on success, an ERR_PTR on failure
> + */
> +struct drm_gpuva_gem *
> +drm_gpuva_gem_obtain(struct drm_gpuva_manager *mgr,
> +		     struct drm_gem_object *obj)
> +{
> +	struct drm_gpuva_gem *vm_bo;
> +
> +	vm_bo = drm_gpuva_gem_find(mgr, obj);
> +	if (vm_bo)
> +		return vm_bo;
> +
> +	vm_bo = drm_gpuva_gem_create(mgr, obj);
> +	if (!vm_bo)
> +		return ERR_PTR(-ENOMEM);
> +
> +	return vm_bo;
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_obtain);
> +
> +/**
> + * drm_gpuva_gem_obtain_prealloc() - obtains an instance of the &drm_gpuva_gem
> + * for the given &drm_gpuva_manager and &drm_gem_object
> + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> + * @obj: The &drm_gem_object being mapped in the @mgr.
> + * @__vm_bo: A pre-allocated &drm_gpuva_gem to use if no existing one is found.
> + *
> + * Find the &drm_gpuva_gem representing the combination of the given
> + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
> + * count of the found &drm_gpuva_gem accordingly, while the @__vm_bo reference
> + * count is decreased. If not found @__vm_bo is returned.
> + *
> + * Returns: a pointer to the found &drm_gpuva_gem or @__vm_bo if no existing
> + * &drm_gpuva_gem was found
> + */
> +struct drm_gpuva_gem *
> +drm_gpuva_gem_obtain_prealloc(struct drm_gpuva_manager *mgr,
> +			      struct drm_gem_object *obj,
> +			      struct drm_gpuva_gem *__vm_bo)
> +{
> +	struct drm_gpuva_gem *vm_bo;
> +
> +	vm_bo = drm_gpuva_gem_find(mgr, obj);
> +	if (vm_bo) {
> +		drm_gpuva_gem_put(__vm_bo);
> +		return vm_bo;
> +	}
> +
> +	return __vm_bo;
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_obtain_prealloc);
> +
> +static int
> +__drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
> +			  struct drm_gem_object *obj,
> +			  gfp_t gfp)
> +{
> +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
> +	union {
> +		struct drm_gem_object *obj;
> +		uintptr_t index;
> +	} gem;
> +	union {
> +		void *ptr;
> +		uintptr_t cnt;
> +	} ref;
> +	int ret = 0;
> +
> +	gem.obj = obj;
> +	mas_set(&mas, gem.index);
> +
> +	mas_lock(&mas);
> +	ref.ptr = mas_walk(&mas);
> +	if (ref.ptr) {
> +		++ref.cnt;
> +		mas_store(&mas, ref.ptr);
> +	} else {
> +		if (unlikely(!gfp)) {
> +			ret = -EINVAL;
> +			goto out;
> +		}
> +
> +		mas_set(&mas, gem.index);
> +		ref.cnt = 1;
> +		ret = mas_store_gfp(&mas, ref.ptr, gfp);
> +		if (likely(!ret))
> +			drm_gem_object_get(obj);
> +	}
> +out:
> +	mas_unlock(&mas);
> +	return ret;
> +}
> +
> +static void
> +__drm_gpuva_extobj_remove(struct drm_gpuva_manager *mgr,
> +			  struct drm_gem_object *obj)
> +{
> +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
> +	union {
> +		struct drm_gem_object *obj;
> +		uintptr_t index;
> +	} gem;
> +	union {
> +		void *ptr;
> +		uintptr_t cnt;
> +	} ref;
> +
> +	gem.obj = obj;
> +	mas_set(&mas, gem.index);
> +
> +	mas_lock(&mas);
> +	if (unlikely(!(ref.ptr = mas_walk(&mas))))
> +		goto out;
> +
> +	if (!--ref.cnt) {
> +		mas_erase(&mas);
> +		drm_gem_object_put(obj);
> +	} else {
> +		mas_store(&mas, ref.ptr);
> +	}
> +out:
> +	mas_unlock(&mas);
> +}
> +
> +/**
> + * drm_gpuva_extobj_insert - insert an external &drm_gem_object
> + * @mgr: the &drm_gpuva_manager to insert into
> + * @obj: the &drm_gem_object to insert as extobj
> + *
> + * Insert a &drm_gem_object into the &drm_gpuva_managers external object tree.
> + * If the &drm_gem_object already exists in the tree, the reference counter
> + * of this external object is increased by one.
> + *
> + * Drivers should insert the external &drm_gem_object before the dma-fence
> + * signalling critical section, e.g. when submitting the job, and before
> + * locking all &drm_gem_objects of a GPU-VM, e.g. with drm_gpuva_manager_lock()
> + * or its derivatives.
> + *
> + * Returns: 0 on success, negative error code on failure.
> + */
> +int
> +drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
> +			struct drm_gem_object *obj)
> +{
> +	return drm_gpuva_is_extobj(mgr, obj) ?
> +		__drm_gpuva_extobj_insert(mgr, obj, GFP_KERNEL) : 0;
> +
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_insert);
> +
> +/**
> + * drm_gpuva_extobj_get - increase the reference count of an external
> + * &drm_gem_object
> + * @mgr: the &drm_gpuva_manager storing the extobj
> + * @obj: the &drm_gem_object representing the extobj
> + *
> + * Increases the reference count of the extobj represented by @obj.
> + *
> + * Drivers should call this for every &drm_gpuva backed by a &drm_gem_object
> + * being inserted.
> + *
> + * For &drm_gpuva_op_remap operations drivers should make sure to only take an
> + * additional reference if the re-map operation splits an existing &drm_gpuva
> + * into two separate ones.
> + *
> + * See also drm_gpuva_map_get() and drm_gpuva_remap_get().
> + */
> +void
> +drm_gpuva_extobj_get(struct drm_gpuva_manager *mgr,
> +		     struct drm_gem_object *obj)
> +{
> +	if (drm_gpuva_is_extobj(mgr, obj))
> +		WARN(__drm_gpuva_extobj_insert(mgr, obj, 0),
> +		     "Can't increase ref-count of non-existent extobj.");
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_get);
> +
> +/**
> + * drm_gpuva_extobj_put - decrease the reference count of an external
> + * &drm_gem_object
> + * @mgr: the &drm_gpuva_manager storing the extobj
> + * @obj: the &drm_gem_object representing the extobj
> + *
> + * Decreases the reference count of the extobj represented by @obj.
> + *
> + * Drivers should call this for every &drm_gpuva backed by a &drm_gem_object
> + * being removed from the GPU VA space.
> + *
> + * See also drm_gpuva_unmap_put().
> + */
> +void
> +drm_gpuva_extobj_put(struct drm_gpuva_manager *mgr,
> +		     struct drm_gem_object *obj)
> +{
> +	if (drm_gpuva_is_extobj(mgr, obj))
> +		__drm_gpuva_extobj_remove(mgr, obj);
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_put);
> +
> +/**
> + * drm_gpuva_gem_evict() - add / remove a &drm_gem_object to / from a
> + * &drm_gpuva_managers evicted list
> + * @obj: the &drm_gem_object to add or remove
> + * @evict: indicates whether the object is evicted
> + *
> + * Adds a &drm_gem_object to or removes it from all &drm_gpuva_managers evicted
> + * list containing a mapping of this &drm_gem_object.
> + */
> +void
> +drm_gpuva_gem_evict(struct drm_gem_object *obj, bool evict)
> +{
> +	struct drm_gpuva_gem *vm_bo;
> +
> +	/* Required for iterating the GEMs GPUVA GEM list. If no driver specific
> +	 * lock has been set, the list is protected with the GEMs dma-resv lock.
> +	 */
> +	drm_gem_gpuva_assert_lock_held(obj);
> +
> +	/* Required to protect the GPUVA managers evict list against concurrent
> +	 * access through drm_gpuva_manager_validate(). Concurrent insertions to
> +	 * the evict list through different GEM object evictions are protected
> +	 * by the GPUVA managers evict lock.
> +	 */
> +	dma_resv_assert_held(obj->resv);
> +
> +	drm_gem_for_each_gpuva_gem(vm_bo, obj) {
> +		struct drm_gpuva_manager *mgr = vm_bo->mgr;
> +
> +		spin_lock(&mgr->evict.lock);
> +		if (evict)
> +			list_add_tail(&vm_bo->list.entry.evict,
> +				      &mgr->evict.list);
> +		else
> +			list_del_init(&vm_bo->list.entry.evict);
> +		spin_unlock(&mgr->evict.lock);
> +	}
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_evict);
> +
>   static int
>   __drm_gpuva_insert(struct drm_gpuva_manager *mgr,
>   		   struct drm_gpuva *va)
> @@ -806,15 +1381,20 @@ EXPORT_SYMBOL_GPL(drm_gpuva_remove);
>   /**
>    * drm_gpuva_link() - link a &drm_gpuva
>    * @va: the &drm_gpuva to link
> + * @vm_bo: the &drm_gpuva_gem to add the &drm_gpuva to
>    *
> - * This adds the given &va to the GPU VA list of the &drm_gem_object it is
> - * associated with.
> + * This adds the given &va to the GPU VA list of the &drm_gpuva_gem and the
> + * &drm_gpuva_gem to the &drm_gem_object it is associated with.
> + *
> + * For every &drm_gpuva entry added to the &drm_gpuva_gem an additional
> + * reference of the latter is taken.
>    *
>    * This function expects the caller to protect the GEM's GPUVA list against
> - * concurrent access using the GEMs dma_resv lock.
> + * concurrent access using either the GEMs dma_resv lock or a driver specific
> + * lock set through drm_gem_gpuva_set_lock().
>    */
>   void
> -drm_gpuva_link(struct drm_gpuva *va)
> +drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo)
>   {
>   	struct drm_gem_object *obj = va->gem.obj;
>   
> @@ -823,7 +1403,10 @@ drm_gpuva_link(struct drm_gpuva *va)
>   
>   	drm_gem_gpuva_assert_lock_held(obj);
>   
> -	list_add_tail(&va->gem.entry, &obj->gpuva.list);
> +	drm_gpuva_gem_get(vm_bo);
> +	list_add_tail(&va->gem.entry, &vm_bo->list.gpuva);
> +	if (list_empty(&vm_bo->list.entry.gem))
> +		list_add_tail(&vm_bo->list.entry.gem, &obj->gpuva.list);
>   }
>   EXPORT_SYMBOL_GPL(drm_gpuva_link);
>   
> @@ -834,20 +1417,39 @@ EXPORT_SYMBOL_GPL(drm_gpuva_link);
>    * This removes the given &va from the GPU VA list of the &drm_gem_object it is
>    * associated with.
>    *
> + * This removes the given &va from the GPU VA list of the &drm_gpuva_gem and
> + * the &drm_gpuva_gem from the &drm_gem_object it is associated with in case
> + * this call unlinks the last &drm_gpuva from the &drm_gpuva_gem.
> + *
> + * For every &drm_gpuva entry removed from the &drm_gpuva_gem a reference of
> + * the latter is dropped.
> + *
>    * This function expects the caller to protect the GEM's GPUVA list against
> - * concurrent access using the GEMs dma_resv lock.
> + * concurrent access using either the GEMs dma_resv lock or a driver specific
> + * lock set through drm_gem_gpuva_set_lock().
>    */
>   void
>   drm_gpuva_unlink(struct drm_gpuva *va)
>   {
>   	struct drm_gem_object *obj = va->gem.obj;
> +	struct drm_gpuva_gem *vm_bo;
>   
>   	if (unlikely(!obj))
>   		return;
>   
>   	drm_gem_gpuva_assert_lock_held(obj);
>   
> +	vm_bo = __drm_gpuva_gem_find(va->mgr, obj);
> +	if (WARN(!vm_bo, "GPUVA doesn't seem to be linked.\n"))
> +		return;
> +
>   	list_del_init(&va->gem.entry);
> +
> +	if (list_empty(&vm_bo->list.gpuva)) {
> +		list_del_init(&vm_bo->list.entry.gem);
> +		list_del_init(&vm_bo->list.entry.evict);
> +	}
> +	drm_gpuva_gem_put(vm_bo);
>   }
>   EXPORT_SYMBOL_GPL(drm_gpuva_unlink);
>   
> @@ -977,6 +1579,26 @@ drm_gpuva_map(struct drm_gpuva_manager *mgr,
>   }
>   EXPORT_SYMBOL_GPL(drm_gpuva_map);
>   
> +/**
> + * drm_gpuva_map_get() - helper to insert a &drm_gpuva according to a
> + * &drm_gpuva_op_map
> + * @mgr: the &drm_gpuva_manager
> + * @va: the &drm_gpuva to insert
> + * @op: the &drm_gpuva_op_map to initialize @va with
> + *
> + * Initializes the @va from the @op and inserts it into the given @mgr and
> + * increases the reference count of the corresponding extobj.
> + */
> +void
> +drm_gpuva_map_get(struct drm_gpuva_manager *mgr,
> +		  struct drm_gpuva *va,
> +		  struct drm_gpuva_op_map *op)
> +{
> +	drm_gpuva_map(mgr, va, op);
> +	drm_gpuva_extobj_get(mgr, va->gem.obj);
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_map_get);
> +
>   /**
>    * drm_gpuva_remap() - helper to remap a &drm_gpuva according to a
>    * &drm_gpuva_op_remap
> @@ -992,10 +1614,10 @@ drm_gpuva_remap(struct drm_gpuva *prev,
>   		struct drm_gpuva *next,
>   		struct drm_gpuva_op_remap *op)
>   {
> -	struct drm_gpuva *curr = op->unmap->va;
> -	struct drm_gpuva_manager *mgr = curr->mgr;
> +	struct drm_gpuva *va = op->unmap->va;
> +	struct drm_gpuva_manager *mgr = va->mgr;
>   
> -	drm_gpuva_remove(curr);
> +	drm_gpuva_remove(va);
>   
>   	if (op->prev) {
>   		drm_gpuva_init_from_op(prev, op->prev);
> @@ -1009,6 +1631,31 @@ drm_gpuva_remap(struct drm_gpuva *prev,
>   }
>   EXPORT_SYMBOL_GPL(drm_gpuva_remap);
>   
> +/**
> + * drm_gpuva_remap_get() - helper to remap a &drm_gpuva according to a
> + * &drm_gpuva_op_remap
> + * @prev: the &drm_gpuva to remap when keeping the start of a mapping
> + * @next: the &drm_gpuva to remap when keeping the end of a mapping
> + * @op: the &drm_gpuva_op_remap to initialize @prev and @next with
> + *
> + * Removes the currently mapped &drm_gpuva and remaps it using @prev and/or
> + * @next. Additionally, if the re-map splits the existing &drm_gpuva into two
> + * separate mappings, increases the reference count of the corresponding extobj.
> + */
> +void
> +drm_gpuva_remap_get(struct drm_gpuva *prev,
> +		    struct drm_gpuva *next,
> +		    struct drm_gpuva_op_remap *op)
> +{
> +	struct drm_gpuva *va = op->unmap->va;
> +	struct drm_gpuva_manager *mgr = va->mgr;
> +
> +	drm_gpuva_remap(prev, next, op);
> +	if (op->prev && op->next)
> +		drm_gpuva_extobj_get(mgr, va->gem.obj);
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_remap_get);
> +
>   /**
>    * drm_gpuva_unmap() - helper to remove a &drm_gpuva according to a
>    * &drm_gpuva_op_unmap
> @@ -1023,6 +1670,24 @@ drm_gpuva_unmap(struct drm_gpuva_op_unmap *op)
>   }
>   EXPORT_SYMBOL_GPL(drm_gpuva_unmap);
>   
> +/**
> + * drm_gpuva_unmap_put() - helper to remove a &drm_gpuva according to a
> + * &drm_gpuva_op_unmap
> + * @op: the &drm_gpuva_op_unmap specifying the &drm_gpuva to remove
> + *
> + * Removes the &drm_gpuva associated with the &drm_gpuva_op_unmap and decreases
> + * the reference count of the corresponding extobj.
> + */
> +void
> +drm_gpuva_unmap_put(struct drm_gpuva_op_unmap *op)
> +{
> +	struct drm_gpuva *va = op->va;
> +
> +	drm_gpuva_unmap(op);
> +	drm_gpuva_extobj_put(va->mgr, va->gem.obj);
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_unmap_put);
> +
>   static int
>   op_map_cb(const struct drm_gpuva_fn_ops *fn, void *priv,
>   	  u64 addr, u64 range,
> @@ -1663,6 +2328,7 @@ drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
>   {
>   	struct drm_gpuva_ops *ops;
>   	struct drm_gpuva_op *op;
> +	struct drm_gpuva_gem *vm_bo;
>   	struct drm_gpuva *va;
>   	int ret;
>   
> @@ -1674,7 +2340,7 @@ drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
>   
>   	INIT_LIST_HEAD(&ops->list);
>   
> -	drm_gem_for_each_gpuva(va, obj) {
> +	drm_gem_for_each_gpuva(va, vm_bo, mgr, obj) {
>   		op = gpuva_op_alloc(mgr);
>   		if (!op) {
>   			ret = -ENOMEM;
> diff --git a/include/drm/drm_gem.h b/include/drm/drm_gem.h
> index bc9f6aa2f3fe..783ed3ab440d 100644
> --- a/include/drm/drm_gem.h
> +++ b/include/drm/drm_gem.h
> @@ -571,7 +571,7 @@ int drm_gem_evict(struct drm_gem_object *obj);
>    * drm_gem_gpuva_init() - initialize the gpuva list of a GEM object
>    * @obj: the &drm_gem_object
>    *
> - * This initializes the &drm_gem_object's &drm_gpuva list.
> + * This initializes the &drm_gem_object's &drm_gpuva_gem list.
>    *
>    * Calling this function is only necessary for drivers intending to support the
>    * &drm_driver_feature DRIVER_GEM_GPUVA.
> @@ -584,28 +584,44 @@ static inline void drm_gem_gpuva_init(struct drm_gem_object *obj)
>   }
>   
>   /**
> - * drm_gem_for_each_gpuva() - iternator to walk over a list of gpuvas
> - * @entry__: &drm_gpuva structure to assign to in each iteration step
> - * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
> + * drm_gem_for_each_gpuva_gem() - iterator to walk over a list of &drm_gpuva_gem
> + * @entry__: &drm_gpuva_gem structure to assign to in each iteration step
> + * @obj__: the &drm_gem_object the &drm_gpuva_gem to walk are associated with
>    *
> - * This iterator walks over all &drm_gpuva structures associated with the
> - * &drm_gpuva_manager.
> + * This iterator walks over all &drm_gpuva_gem structures associated with the
> + * &drm_gem_object.
>    */
> -#define drm_gem_for_each_gpuva(entry__, obj__) \
> -	list_for_each_entry(entry__, &(obj__)->gpuva.list, gem.entry)
> +#define drm_gem_for_each_gpuva_gem(entry__, obj__) \
> +	list_for_each_entry(entry__, &(obj__)->gpuva.list, list.entry.gem)
>   
>   /**
> - * drm_gem_for_each_gpuva_safe() - iternator to safely walk over a list of
> - * gpuvas
> - * @entry__: &drm_gpuva structure to assign to in each iteration step
> - * @next__: &next &drm_gpuva to store the next step
> - * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
> + * drm_gem_for_each_gpuva_gem_safe() - iterator to safely walk over a list of
> + * &drm_gpuva_gem
> + * @entry__: &drm_gpuva_gem structure to assign to in each iteration step
> + * @next__: &next &drm_gpuva_gem to store the next step
> + * @obj__: the &drm_gem_object the &drm_gpuva_gem to walk are associated with
>    *
> - * This iterator walks over all &drm_gpuva structures associated with the
> + * This iterator walks over all &drm_gpuva_gem structures associated with the
>    * &drm_gem_object. It is implemented with list_for_each_entry_safe(), hence
>    * it is save against removal of elements.
>    */
> -#define drm_gem_for_each_gpuva_safe(entry__, next__, obj__) \
> -	list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list, gem.entry)
> +#define drm_gem_for_each_gpuva_gem_safe(entry__, next__, obj__) \
> +	list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list, list.entry.gem)
> +
> +/**
> + * drm_gem_for_each_gpuva() - iterator to walk over a list of &drm_gpuva
> + * @va__: &drm_gpuva structure to assign to in each iteration step
> + * @vm_bo__: the &drm_gpuva_gem representing the @mgr__ and @obj__ combination
> + * @mgr__: the &drm_gpuva_manager the &drm_gpuvas to walk are associated with
> + * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
> + *
> + * This iterator walks over all &drm_gpuva structures associated with the
> + * &drm_gpuva_manager and &drm_gem_object.
> + */
> +#define drm_gem_for_each_gpuva(va__, vm_bo__, mgr__, obj__) \
> +	for (vm_bo__ = drm_gpuva_gem_find(mgr__, obj__), \
> +	     va__ = vm_bo__ ? list_first_entry(&vm_bo__->list.gpuva, typeof(*va__), gem.entry) : NULL; \
> +	     va__ && !list_entry_is_head(va__, &vm_bo__->list.gpuva, gem.entry); \
> +	     va__ = list_next_entry(va__, gem.entry))
>   
>   #endif /* __DRM_GEM_H__ */
> diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h
> index ed8d50200cc3..693e2da3f425 100644
> --- a/include/drm/drm_gpuva_mgr.h
> +++ b/include/drm/drm_gpuva_mgr.h
> @@ -26,12 +26,16 @@
>    */
>   
>   #include <linux/list.h>
> +#include <linux/dma-resv.h>
> +#include <linux/maple_tree.h>
>   #include <linux/rbtree.h>
>   #include <linux/types.h>
>   
>   #include <drm/drm_gem.h>
> +#include <drm/drm_exec.h>
>   
>   struct drm_gpuva_manager;
> +struct drm_gpuva_gem;
>   struct drm_gpuva_fn_ops;
>   
>   /**
> @@ -140,7 +144,7 @@ struct drm_gpuva {
>   int drm_gpuva_insert(struct drm_gpuva_manager *mgr, struct drm_gpuva *va);
>   void drm_gpuva_remove(struct drm_gpuva *va);
>   
> -void drm_gpuva_link(struct drm_gpuva *va);
> +void drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo);
>   void drm_gpuva_unlink(struct drm_gpuva *va);
>   
>   struct drm_gpuva *drm_gpuva_find(struct drm_gpuva_manager *mgr,
> @@ -240,15 +244,137 @@ struct drm_gpuva_manager {
>   	 * @ops: &drm_gpuva_fn_ops providing the split/merge steps to drivers
>   	 */
>   	const struct drm_gpuva_fn_ops *ops;
> +
> +	/**
> +	 * @d_obj: Dummy GEM object; used internally to pass the GPU VMs
> +	 * dma-resv to &drm_exec.
> +	 */
> +	struct drm_gem_object d_obj;
> +
> +	/**
> +	 * @resv: the &dma_resv for &drm_gem_objects mapped in this GPU VA
> +	 * space
> +	 */
> +	struct dma_resv *resv;
> +
> +	/**
> +	 * @exec: the &drm_exec helper to lock external &drm_gem_objects
> +	 */
> +	struct drm_exec exec;
> +
> +	/**
> +	 * @mt_ext: &maple_tree storing external &drm_gem_objects
> +	 */
> +	struct maple_tree mt_ext;

Why are you using a maple tree here? Insertion and removal are O(log(n)) 
instead of O(1) for a list.
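
E.g. a rough sketch of the list-based alternative (hypothetical only, the
extobj list / entry members below don't exist in this patch; they would sit
next to the evict list and use the same kind of locking):

static void drm_gpuva_extobj_link(struct drm_gpuva_manager *mgr,
				  struct drm_gpuva_gem *vm_bo)
{
	if (!drm_gpuva_is_extobj(mgr, vm_bo->obj))
		return;

	spin_lock(&mgr->extobj.lock);
	if (list_empty(&vm_bo->list.entry.extobj))
		list_add_tail(&vm_bo->list.entry.extobj, &mgr->extobj.list);
	spin_unlock(&mgr->extobj.lock);
}

Insertion and removal are then O(1), and drm_gpuva_manager_prepare_objects()
could simply walk mgr->extobj.list instead of doing the RCU / mas_pause()
dance above.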

> +
> +	/**
> +	 * @evict: structure holding the evict list and evict list lock
> +	 */
> +	struct {
> +		/**
> +		 * @list: &list_head storing &drm_gem_objects currently being
> +		 * evicted
> +		 */
> +		struct list_head list;
> +
> +		/**
> +		 * @lock: spinlock to protect the evict list against concurrent
> +		 * insertion / removal of different &drm_gpuva_gems
> +		 */
> +		spinlock_t lock;
> +	} evict;
>   };
>   
>   void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> +			    struct drm_device *drm,
>   			    const char *name,
>   			    u64 start_offset, u64 range,
>   			    u64 reserve_offset, u64 reserve_range,
>   			    const struct drm_gpuva_fn_ops *ops);
>   void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr);
>   
> +/**
> + * DRM_GPUVA_EXEC - returns the &drm_gpuva_managers &drm_exec instance
> + * @mgr: the &drm_gpuva_managers to return the &drm_exec instance for
> + */
> +#define DRM_GPUVA_EXEC(mgr)	&(mgr)->exec

A struct ww_acquire_ctx and thus a drm_exec is fundamentally per task 
and should typically be allocated on the stack. Otherwise you'd need to 
protect the mgr->exec member with an exclusive lock throughout the 
locking process, and that's not what we want.

Did you consider subclassing a drm_exec for drm_gpuva purposes and add 
needed ops to it: Like so:

struct drm_gpuva_exec_ops {
     int (*fn) (struct drm_gpuva_exec *exec, int num_fences);
     int (*bo_validate) (struct drm_gpuva_exec *exec, struct 
drm_gem_object *obj);
};

struct drm_gpuva_exec {
     const struct drm_gpuva_exec_ops *ops;
     struct drm_exec exec;
     struct drm_gpuva_manager *mgr;
};

Although I'd actually expect bo_validate to be part of fn in the typical 
case. The drm_gpuva_exec would then be allocated by the caller on the stack.
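Usage on the driver side could then look roughly like this (sketch only;
drm_gpuva_manager_prepare_objects() would have to take the drm_gpuva_exec
instead of using an embedded mgr->exec, and the helper name is made up):

static int driver_vm_lock(struct drm_gpuva_manager *mgr,
			  const struct drm_gpuva_exec_ops *ops,
			  unsigned int num_fences)
{
	struct drm_gpuva_exec vm_exec = {
		.ops = ops,
		.mgr = mgr,
	};
	int ret = 0;

	drm_exec_init(&vm_exec.exec, DRM_EXEC_INTERRUPTIBLE_WAIT);
	drm_exec_until_all_locked(&vm_exec.exec) {
		/* fn() prepares / validates whatever BOs it needs */
		ret = vm_exec.ops->fn(&vm_exec, num_fences);
		drm_exec_retry_on_contention(&vm_exec.exec);
		if (ret)
			break;
	}
	if (ret)
		drm_exec_fini(&vm_exec.exec);

	return ret;
}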


> +
> +int drm_gpuva_manager_lock_extra(struct drm_gpuva_manager *mgr,
> +				 int (*fn)(struct drm_gpuva_manager *mgr,
> +					   void *priv, unsigned int num_fences),
> +				 void *priv,
> +				 unsigned int num_fences,
> +				 bool interruptible);
> +
> +int drm_gpuva_manager_lock_array(struct drm_gpuva_manager *mgr,
> +				 struct drm_gem_object **objs,
> +				 unsigned int num_objs,
> +				 unsigned int num_fences,
> +				 bool interruptible);
> +
> +/**
> + * drm_gpuva_manager_lock() - lock all dma-resv of all associated BOs
> + * @mgr: the &drm_gpuva_manager
> + * @num_fences: the amount of &dma_fences to reserve
> + * @interruptible: sleep interruptible if waiting
> + *
> + * Acquires all dma-resv locks of all &drm_gem_objects the given
> + * &drm_gpuva_manager contains mappings of.
> + *
> + * Returns: 0 on success, negative error code on failure.
> + */
> +static inline int
> +drm_gpuva_manager_lock(struct drm_gpuva_manager *mgr,
> +		       unsigned int num_fences,
> +		       bool interruptible)
> +{
> +	return drm_gpuva_manager_lock_extra(mgr, NULL, NULL, num_fences,
> +					    interruptible);
> +}
> +
> +/**
> + * drm_gpuva_manager_unlock() - unlock all dma-resv of all associated BOs
> + * @mgr: the &drm_gpuva_manager
> + *
> + * Releases all dma-resv locks of all &drm_gem_objects previously acquired
> + * through drm_gpuva_manager_lock() or its variants.
> + */
> +static inline void
> +drm_gpuva_manager_unlock(struct drm_gpuva_manager *mgr)
> +{
> +	drm_exec_fini(&mgr->exec);
> +}
> +
> +int drm_gpuva_manager_validate(struct drm_gpuva_manager *mgr);
> +void drm_gpuva_manager_resv_add_fence(struct drm_gpuva_manager *mgr,
> +				      struct dma_fence *fence,
> +				      enum dma_resv_usage private_usage,
> +				      enum dma_resv_usage extobj_usage);
> +
> +int drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
> +			    struct drm_gem_object *obj);
> +void drm_gpuva_extobj_get(struct drm_gpuva_manager *mgr,
> +			  struct drm_gem_object *obj);
> +void drm_gpuva_extobj_put(struct drm_gpuva_manager *mgr,
> +			  struct drm_gem_object *obj);
> +
> +/**
> + * drm_gpuva_is_extobj() - indicates whether the given &drm_gem_object is an
> + * external object
> + * @mgr: the &drm_gpuva_manager to check
> + * @obj: the &drm_gem_object to check
> + *
> + * Returns: true if the &drm_gem_object &dma_resv differs from the
> + * &drm_gpuva_managers &dma_resv, false otherwise
> + */
> +static inline bool drm_gpuva_is_extobj(struct drm_gpuva_manager *mgr,
> +				       struct drm_gem_object *obj)
> +{
> +	return obj && obj->resv != mgr->resv;
> +}
> +
>   static inline struct drm_gpuva *
>   __drm_gpuva_next(struct drm_gpuva *va)
>   {
> @@ -327,6 +453,138 @@ __drm_gpuva_next(struct drm_gpuva *va)
>   #define drm_gpuva_for_each_va_safe(va__, next__, mgr__) \
>   	list_for_each_entry_safe(va__, next__, &(mgr__)->rb.list, rb.entry)
>   
> +/**
> + * struct drm_gpuva_gem - structure representing a &drm_gpuva_manager and
> + * &drm_gem_object combination
> + *
> + * This structure is an abstraction representing a &drm_gpuva_manager and
> + * &drm_gem_object combination. It serves as an indirection to accelerate
> + * iterating all &drm_gpuvas within a &drm_gpuva_manager backed by the same
> + * &drm_gem_object.
> + *
> + * Furthermore it is used to cache evicted GEM objects for a certain GPU-VM to
> + * accelerate validation.
> + *
> + * Typically, drivers want to create an instance of a struct drm_gpuva_gem once
> + * a GEM object is mapped first in a GPU-VM and release the instance once the
> + * last mapping of the GEM object in this GPU-VM is unmapped.
> + */
> +struct drm_gpuva_gem {
> +
> +	/**
> +	 * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> +	 */
> +	struct drm_gpuva_manager *mgr;
> +
> +	/**
> +	 * @obj: The &drm_gem_object being mapped in the @mgr.
> +	 */
> +	struct drm_gem_object *obj;
> +
> +	/**
> +	 * @kref: The reference count for this &drm_gpuva_gem.
> +	 */
> +	struct kref kref;
> +
> +	/**
> +	 * @list: Structure containing all &list_heads.
> +	 */
> +	struct {
> +		/**
> +		 * @gpuva: The list of linked &drm_gpuvas.
> +		 */
> +		struct list_head gpuva;
> +
> +		/**
> +		 * @entry: Structure containing all &list_heads serving as
> +		 * entry.
> +		 */
> +		struct {
> +			/**
> +			 * @gem: List entry to attach to the &drm_gem_objects
> +			 * gpuva list.
> +			 */
> +			struct list_head gem;
> +
> +			/**
> +			 * @evict: List entry to attach to the
> +			 * &drm_gpuva_managers evict list.
> +			 */
> +			struct list_head evict;
> +		} entry;
> +	} list;
> +};
> +
> +struct drm_gpuva_gem *
> +drm_gpuva_gem_obtain(struct drm_gpuva_manager *mgr,
> +		     struct drm_gem_object *obj);
> +struct drm_gpuva_gem *
> +drm_gpuva_gem_obtain_prealloc(struct drm_gpuva_manager *mgr,
> +			      struct drm_gem_object *obj,
> +			      struct drm_gpuva_gem *__vm_bo);
> +
> +struct drm_gpuva_gem *
> +drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
> +		   struct drm_gem_object *obj);
> +
> +void drm_gpuva_gem_evict(struct drm_gem_object *obj, bool evict);
> +
> +struct drm_gpuva_gem *
> +drm_gpuva_gem_create(struct drm_gpuva_manager *mgr,
> +		     struct drm_gem_object *obj);
> +void drm_gpuva_gem_destroy(struct kref *kref);
> +
> +/**
> + * drm_gpuva_gem_get() - acquire a struct drm_gpuva_gem reference
> + * @vm_bo: the &drm_gpuva_gem to acquire the reference of
> + *
> + * This function acquires an additional reference to @vm_bo. It is illegal to
> + * call this without already holding a reference. No locks required.
> + */
> +static inline struct drm_gpuva_gem *
> +drm_gpuva_gem_get(struct drm_gpuva_gem *vm_bo)
> +{
> +	kref_get(&vm_bo->kref);
> +	return vm_bo;
> +}
> +
> +/**
> + * drm_gpuva_gem_put() - drop a struct drm_gpuva_gem reference
> + * @vm_bo: the &drm_gpuva_gem to release the reference of
> + *
> + * This releases a reference to @vm_bo.
> + */
> +static inline void
> +drm_gpuva_gem_put(struct drm_gpuva_gem *vm_bo)
> +{
> +	kref_put(&vm_bo->kref, drm_gpuva_gem_destroy);
> +}
> +
> +/**
> + * drm_gpuva_gem_for_each_va() - iterator to walk over a list of &drm_gpuva
> + * @va__: &drm_gpuva structure to assign to in each iteration step
> + * @vm_bo__: the &drm_gpuva_gem the &drm_gpuva to walk are associated with
> + *
> + * This iterator walks over all &drm_gpuva structures associated with the
> + * &drm_gpuva_gem.
> + */
> +#define drm_gpuva_gem_for_each_va(va__, vm_bo__) \
> +	list_for_each_entry(va__, &(vm_bo__)->list.gpuva, gem.entry)
> +
> +/**
> + * drm_gpuva_gem_for_each_va_safe() - iterator to safely walk over a list of
> + * &drm_gpuva
> + * @va__: &drm_gpuva structure to assign to in each iteration step
> + * @next__: &next &drm_gpuva to store the next step
> + * @vm_bo__: the &drm_gpuva_gem the &drm_gpuva to walk are associated with
> + *
> + * This iterator walks over all &drm_gpuva structures associated with the
> + * &drm_gpuva_gem. It is implemented with list_for_each_entry_safe(), hence
> + * it is safe against removal of elements.
> + */
> +#define drm_gpuva_gem_for_each_va_safe(va__, next__, vm_bo__) \
> +	list_for_each_entry_safe(va__, next__, &(vm_bo__)->list.gpuva, gem.entry)
> +
>   /**
>    * enum drm_gpuva_op_type - GPU VA operation type
>    *
> @@ -641,6 +899,30 @@ struct drm_gpuva_fn_ops {
>   	 */
>   	void (*op_free)(struct drm_gpuva_op *op);
>   
> +	/**
> +	 * @vm_bo_alloc: called when the &drm_gpuva_manager allocates
> +	 * a struct drm_gpuva_gem
> +	 *
> +	 * Some drivers may want to embed struct drm_gpuva_gem into driver
> +	 * specific structures. By implementing this callback drivers can
> +	 * allocate memory accordingly.
> +	 *
> +	 * This callback is optional.
> +	 */
> +	struct drm_gpuva_gem *(*vm_bo_alloc)(void);
> +
> +	/**
> +	 * @vm_bo_free: called when the &drm_gpuva_manager frees a
> +	 * struct drm_gpuva_gem
> +	 *
> +	 * Some drivers may want to embed struct drm_gpuva_gem into driver
> +	 * specific structures. By implementing this callback drivers can
> +	 * free the previously allocated memory accordingly.
> +	 *
> +	 * This callback is optional.
> +	 */
> +	void (*vm_bo_free)(struct drm_gpuva_gem *vm_bo);
> +
>   	/**
>   	 * @sm_step_map: called from &drm_gpuva_sm_map to finally insert the
>   	 * mapping once all previous steps were completed
> @@ -684,6 +966,17 @@ struct drm_gpuva_fn_ops {
>   	 * used.
>   	 */
>   	int (*sm_step_unmap)(struct drm_gpuva_op *op, void *priv);
> +
> +	/**
> +	 * @bo_validate: called from drm_gpuva_manager_validate()
> +	 *
> +	 * Drivers receive this callback for every evicted &drm_gem_object being
> +	 * mapped in the corresponding &drm_gpuva_manager.
> +	 *
> +	 * Typically, drivers would call their driver specific variant of
> +	 * ttm_bo_validate() from within this callback.
> +	 */
> +	int (*bo_validate)(struct drm_gem_object *obj);
>   };
>   
>   int drm_gpuva_sm_map(struct drm_gpuva_manager *mgr, void *priv,
> @@ -696,11 +989,18 @@ int drm_gpuva_sm_unmap(struct drm_gpuva_manager *mgr, void *priv,
>   void drm_gpuva_map(struct drm_gpuva_manager *mgr,
>   		   struct drm_gpuva *va,
>   		   struct drm_gpuva_op_map *op);
> +void drm_gpuva_map_get(struct drm_gpuva_manager *mgr,
> +		       struct drm_gpuva *va,
> +		       struct drm_gpuva_op_map *op);
>   
>   void drm_gpuva_remap(struct drm_gpuva *prev,
>   		     struct drm_gpuva *next,
>   		     struct drm_gpuva_op_remap *op);
> +void drm_gpuva_remap_get(struct drm_gpuva *prev,
> +			 struct drm_gpuva *next,
> +			 struct drm_gpuva_op_remap *op);
>   
>   void drm_gpuva_unmap(struct drm_gpuva_op_unmap *op);
> +void drm_gpuva_unmap_put(struct drm_gpuva_op_unmap *op);
>   
>   #endif /* __DRM_GPUVA_MGR_H__ */

^ permalink raw reply	[flat|nested] 88+ messages in thread

* Re: [Nouveau] [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
  2023-08-20 21:53   ` Danilo Krummrich
  (?)
@ 2023-08-30  7:48     ` Christian König
  -1 siblings, 0 replies; 88+ messages in thread
From: Christian König @ 2023-08-30  7:48 UTC (permalink / raw)
  To: Danilo Krummrich, airlied, daniel, matthew.brost,
	thomas.hellstrom, sarah.walker, donald.robson, boris.brezillon,
	faith.ekstrand, bskeggs, Liam.Howlett
  Cc: nouveau, linux-kernel, dri-devel



Am 20.08.23 um 23:53 schrieb Danilo Krummrich:
> So far the DRM GPUVA manager offers common infrastructure to track GPU VA
> allocations and mappings, generically connect GPU VA mappings to their
> backing buffers and perform more complex mapping operations on the GPU VA
> space.
>
> However, there are more design patterns commonly used by drivers, which
> can potentially be generalized in order to make the DRM GPUVA manager
> represent a basic GPU-VM implementation. In this context, this patch aims
> at generalizing the following elements.
>
> 1) Provide a common dma-resv for GEM objects not being used outside of
>     this GPU-VM.
>
> 2) Provide tracking of external GEM objects (GEM objects which are
>     shared with other GPU-VMs).
>
> 3) Provide functions to efficiently lock all GEM objects dma-resv the
>     GPU-VM contains mappings of.
>
> 4) Provide tracking of evicted GEM objects the GPU-VM contains mappings
>     of, such that validation of evicted GEM objects is accelerated.
>
> 5) Provide some convenience functions for common patterns.

Interesting work.

You basically implement a bunch of the ideas I came up with to improve the 
amdgpu performance in the common manager now. That was one of the 
remaining blockers I had for using this in amdgpu.

The question is, for example, how do you track evictions? E.g. we don't have a 
common concept of eviction in GEM as far as I know. Or is the driver 
responsible for giving those notifications to the GPUVA manager?
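
From a quick look it seems to be the latter: the driver is expected to call
drm_gpuva_gem_evict() with the dma-resv held, e.g. from its TTM move /
eviction path, roughly like this (just a sketch, the wrapper name is made up)?

static void driver_bo_move_notify(struct drm_gem_object *obj, bool evicted)
{
	dma_resv_assert_held(obj->resv);
	drm_gpuva_gem_evict(obj, evicted);
}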

And would it be possible to lock only a specific area of the VM, e.g. 
every BO mapped in the interval X..Y?
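
E.g. something on top of the existing VA iterators with an on-stack drm_exec,
roughly like below (sketch only, assuming drm_gpuva_for_each_va_range() and
drm_exec_init()/drm_exec_fini() in the caller):

static int driver_lock_range(struct drm_gpuva_manager *mgr, struct drm_exec *exec,
			     u64 start, u64 end, unsigned int num_fences)
{
	struct drm_gpuva *va;
	int ret = 0;

	drm_exec_until_all_locked(exec) {
		drm_gpuva_for_each_va_range(va, mgr, start, end) {
			if (!va->gem.obj) /* sparse mappings have no BO */
				continue;

			ret = drm_exec_prepare_obj(exec, va->gem.obj, num_fences);
			drm_exec_retry_on_contention(exec);
			if (ret)
				break;
		}
	}

	return ret;
}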

Regards,
Christian.

>
> Rather than being designed as a "framework", the target is to make all
> features appear as a collection of optional helper functions, such that
> drivers are free to make use of the DRM GPUVA managers basic
> functionality and opt-in for other features without setting any feature
> flags, just by making use of the corresponding functions.
>
> Signed-off-by: Danilo Krummrich <dakr@redhat.com>
> ---
>   drivers/gpu/drm/drm_gpuva_mgr.c | 688 +++++++++++++++++++++++++++++++-
>   include/drm/drm_gem.h           |  48 ++-
>   include/drm/drm_gpuva_mgr.h     | 302 +++++++++++++-
>   3 files changed, 1010 insertions(+), 28 deletions(-)
>
> diff --git a/drivers/gpu/drm/drm_gpuva_mgr.c b/drivers/gpu/drm/drm_gpuva_mgr.c
> index f86bfad74ff8..69872b205961 100644
> --- a/drivers/gpu/drm/drm_gpuva_mgr.c
> +++ b/drivers/gpu/drm/drm_gpuva_mgr.c
> @@ -655,6 +655,7 @@ drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
>   /**
>    * drm_gpuva_manager_init() - initialize a &drm_gpuva_manager
>    * @mgr: pointer to the &drm_gpuva_manager to initialize
> + * @drm: the drivers &drm_device
>    * @name: the name of the GPU VA space
>    * @start_offset: the start offset of the GPU VA space
>    * @range: the size of the GPU VA space
> @@ -669,6 +670,7 @@ drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
>    */
>   void
>   drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> +		       struct drm_device *drm,
>   		       const char *name,
>   		       u64 start_offset, u64 range,
>   		       u64 reserve_offset, u64 reserve_range,
> @@ -677,6 +679,11 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
>   	mgr->rb.tree = RB_ROOT_CACHED;
>   	INIT_LIST_HEAD(&mgr->rb.list);
>   
> +	mt_init(&mgr->mt_ext);
> +
> +	INIT_LIST_HEAD(&mgr->evict.list);
> +	spin_lock_init(&mgr->evict.lock);
> +
>   	drm_gpuva_check_overflow(start_offset, range);
>   	mgr->mm_start = start_offset;
>   	mgr->mm_range = range;
> @@ -694,6 +701,9 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
>   						     reserve_range)))
>   			__drm_gpuva_insert(mgr, &mgr->kernel_alloc_node);
>   	}
> +
> +	drm_gem_private_object_init(drm, &mgr->d_obj, 0);
> +	mgr->resv = mgr->d_obj.resv;
>   }
>   EXPORT_SYMBOL_GPL(drm_gpuva_manager_init);
>   
> @@ -713,10 +723,575 @@ drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr)
>   		__drm_gpuva_remove(&mgr->kernel_alloc_node);
>   
>   	WARN(!RB_EMPTY_ROOT(&mgr->rb.tree.rb_root),
> -	     "GPUVA tree is not empty, potentially leaking memory.");
> +	     "GPUVA tree is not empty, potentially leaking memory.\n");
> +
> +	mtree_destroy(&mgr->mt_ext);
> +	WARN(!list_empty(&mgr->evict.list), "Evict list should be empty.\n");
> +
> +	drm_gem_private_object_fini(&mgr->d_obj);
>   }
>   EXPORT_SYMBOL_GPL(drm_gpuva_manager_destroy);
>   
> +/**
> + * drm_gpuva_manager_prepare_objects() - prepare all associated BOs
> + * @mgr: the &drm_gpuva_manager
> + * @num_fences: the amount of &dma_fences to reserve
> + *
> + * Calls drm_exec_prepare_obj() for all &drm_gem_objects the given
> + * &drm_gpuva_manager contains mappings of.
> + *
> + * Drivers can obtain the corresponding &drm_exec instance through
> + * DRM_GPUVA_EXEC(). It is the drivers responsibility to call drm_exec_init()
> + * and drm_exec_fini() accordingly.
> + *
> + * Returns: 0 on success, negative error code on failure.
> + */
> +int
> +drm_gpuva_manager_prepare_objects(struct drm_gpuva_manager *mgr,
> +				  unsigned int num_fences)
> +{
> +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
> +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
> +	union {
> +		void *ptr;
> +		uintptr_t cnt;
> +	} ref;
> +	int ret;
> +
> +	ret = drm_exec_prepare_obj(exec, &mgr->d_obj, num_fences);
> +	if (ret)
> +		goto out;
> +
> +	rcu_read_lock();
> +	mas_for_each(&mas, ref.ptr, ULONG_MAX) {
> +		struct drm_gem_object *obj;
> +
> +		mas_pause(&mas);
> +		rcu_read_unlock();
> +
> +		obj = (struct drm_gem_object *)(uintptr_t)mas.index;
> +		ret = drm_exec_prepare_obj(exec, obj, num_fences);
> +		if (ret)
> +			goto out;
> +
> +		rcu_read_lock();
> +	}
> +	rcu_read_unlock();
> +
> +out:
> +	return ret;
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_prepare_objects);
> +
> +/**
> + * drm_gpuva_manager_lock_extra() - lock all dma-resv of all associated BOs
> + * @mgr: the &drm_gpuva_manager
> + * @fn: callback received by the driver to lock additional dma-resv
> + * @priv: private driver data passed to @fn
> + * @num_fences: the amount of &dma_fences to reserve
> + * @interruptible: sleep interruptible if waiting
> + *
> + * Acquires all dma-resv locks of all &drm_gem_objects the given
> + * &drm_gpuva_manager contains mappings of.
> + *
> + * Additionally, when calling this function the driver receives the given @fn
> + * callback to lock additional dma-resv in the context of the
> + * &drm_gpuva_managers &drm_exec instance. Typically, drivers would call
> + * drm_exec_prepare_obj() from within this callback.
> + *
> + * Returns: 0 on success, negative error code on failure.
> + */
> +int
> +drm_gpuva_manager_lock_extra(struct drm_gpuva_manager *mgr,
> +			     int (*fn)(struct drm_gpuva_manager *mgr,
> +				       void *priv, unsigned int num_fences),
> +			     void *priv,
> +			     unsigned int num_fences,
> +			     bool interruptible)
> +{
> +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
> +	uint32_t flags;
> +	int ret;
> +
> +	flags = (interruptible ? DRM_EXEC_INTERRUPTIBLE_WAIT : 0) |
> +		DRM_EXEC_IGNORE_DUPLICATES;
> +
> +	drm_exec_init(exec, flags);
> +
> +	drm_exec_until_all_locked(exec) {
> +		ret = drm_gpuva_manager_prepare_objects(mgr, num_fences);
> +		drm_exec_retry_on_contention(exec);
> +		if (ret)
> +			goto err;
> +
> +		if (fn) {
> +			ret = fn(mgr, priv, num_fences);
> +			drm_exec_retry_on_contention(exec);
> +			if (ret)
> +				goto err;
> +		}
> +	}
> +
> +	return 0;
> +
> +err:
> +	drm_exec_fini(exec);
> +	return ret;
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_lock_extra);
> +
> +static int
> +fn_lock_array(struct drm_gpuva_manager *mgr, void *priv,
> +				unsigned int num_fences)
> +{
> +	struct {
> +		struct drm_gem_object **objs;
> +		unsigned int num_objs;
> +	} *args = priv;
> +
> +	return drm_exec_prepare_array(DRM_GPUVA_EXEC(mgr), args->objs,
> +				      args->num_objs, num_fences);
> +}
> +
> +/**
> + * drm_gpuva_manager_lock_array() - lock all dma-resv of all associated BOs
> + * @mgr: the &drm_gpuva_manager
> + * @objs: additional &drm_gem_objects to lock
> + * @num_objs: the number of additional &drm_gem_objects to lock
> + * @num_fences: the amount of &dma_fences to reserve
> + * @interruptible: sleep interruptible if waiting
> + *
> + * Acquires all dma-resv locks of all &drm_gem_objects the given
> + * &drm_gpuva_manager contains mappings of, plus the ones given through @objs.
> + *
> + * Returns: 0 on success, negative error code on failure.
> + */
> +int
> +drm_gpuva_manager_lock_array(struct drm_gpuva_manager *mgr,
> +			     struct drm_gem_object **objs,
> +			     unsigned int num_objs,
> +			     unsigned int num_fences,
> +			     bool interruptible)
> +{
> +	struct {
> +		struct drm_gem_object **objs;
> +		unsigned int num_objs;
> +	} args;
> +
> +	args.objs = objs;
> +	args.num_objs = num_objs;
> +
> +	return drm_gpuva_manager_lock_extra(mgr, fn_lock_array, &args,
> +					    num_fences, interruptible);
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_lock_array);
> +
> +/**
> + * drm_gpuva_manager_validate() - validate all BOs marked as evicted
> + * @mgr: the &drm_gpuva_manager to validate evicted BOs
> + *
> + * Calls the &drm_gpuva_fn_ops.bo_validate callback for all evicted buffer
> + * objects being mapped in the given &drm_gpuva_manager.
> + *
> + * Returns: 0 on success, negative error code on failure.
> + */
> +int
> +drm_gpuva_manager_validate(struct drm_gpuva_manager *mgr)
> +{
> +	const struct drm_gpuva_fn_ops *ops = mgr->ops;
> +	struct drm_gpuva_gem *vm_bo;
> +	int ret;
> +
> +	if (unlikely(!ops || !ops->bo_validate))
> +		return -ENOTSUPP;
> +
> +	/* At this point we should hold all dma-resv locks of all GEM objects
> +	 * associated with this GPU-VM, hence it is safe to walk the list.
> +	 */
> +	list_for_each_entry(vm_bo, &mgr->evict.list, list.entry.evict) {
> +		dma_resv_assert_held(vm_bo->obj->resv);
> +
> +		ret = ops->bo_validate(vm_bo->obj);
> +		if (ret)
> +			return ret;
> +	}
> +
> +	return 0;
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_validate);
> +
> +/**
> + * drm_gpuva_manager_resv_add_fence - add fence to private and all extobj
> + * dma-resv
> + * @mgr: the &drm_gpuva_manager to add a fence to
> + * @fence: fence to add
> + * @private_usage: private dma-resv usage
> + * @extobj_usage: extobj dma-resv usage
> + */
> +void
> +drm_gpuva_manager_resv_add_fence(struct drm_gpuva_manager *mgr,
> +				 struct dma_fence *fence,
> +				 enum dma_resv_usage private_usage,
> +				 enum dma_resv_usage extobj_usage)
> +{
> +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
> +	struct drm_gem_object *obj;
> +	unsigned long index;
> +
> +	drm_exec_for_each_locked_object(exec, index, obj) {
> +		dma_resv_assert_held(obj->resv);
> +		dma_resv_add_fence(obj->resv, fence,
> +				   drm_gpuva_is_extobj(mgr, obj) ?
> +				   extobj_usage : private_usage);
> +	}
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_resv_add_fence);
> +
> +static struct drm_gpuva_gem *
> +__drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
> +		     struct drm_gem_object *obj)
> +{
> +	struct drm_gpuva_gem *vm_bo;
> +
> +	drm_gem_gpuva_assert_lock_held(obj);
> +
> +	drm_gem_for_each_gpuva_gem(vm_bo, obj)
> +		if (vm_bo->mgr == mgr)
> +			return vm_bo;
> +
> +	return NULL;
> +}
> +
> +/**
> + * drm_gpuva_gem_create() - create a new instance of struct drm_gpuva_gem
> + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> + * @obj: The &drm_gem_object being mapped in the @mgr.
> + *
> + * If provided by the driver, this function uses the &drm_gpuva_fn_ops
> + * vm_bo_alloc() callback to allocate.
> + *
> + * Returns: a pointer to the &drm_gpuva_gem on success, NULL on failure
> + */
> +struct drm_gpuva_gem *
> +drm_gpuva_gem_create(struct drm_gpuva_manager *mgr,
> +		     struct drm_gem_object *obj)
> +{
> +	const struct drm_gpuva_fn_ops *ops = mgr->ops;
> +	struct drm_gpuva_gem *vm_bo;
> +
> +	if (ops && ops->vm_bo_alloc)
> +		vm_bo = ops->vm_bo_alloc();
> +	else
> +		vm_bo = kzalloc(sizeof(*vm_bo), GFP_KERNEL);
> +
> +	if (unlikely(!vm_bo))
> +		return NULL;
> +
> +	vm_bo->mgr = mgr;
> +	vm_bo->obj = obj;
> +
> +	kref_init(&vm_bo->kref);
> +	INIT_LIST_HEAD(&vm_bo->list.gpuva);
> +	INIT_LIST_HEAD(&vm_bo->list.entry.gem);
> +	INIT_LIST_HEAD(&vm_bo->list.entry.evict);
> +
> +	drm_gem_object_get(obj);
> +
> +	return vm_bo;
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_create);
> +
> +void
> +drm_gpuva_gem_destroy(struct kref *kref)
> +{
> +	struct drm_gpuva_gem *vm_bo = container_of(kref, struct drm_gpuva_gem,
> +						   kref);
> +	const struct drm_gpuva_fn_ops *ops = vm_bo->mgr->ops;
> +
> +	drm_gem_object_put(vm_bo->obj);
> +
> +	if (ops && ops->vm_bo_free)
> +		ops->vm_bo_free(vm_bo);
> +	else
> +		kfree(vm_bo);
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_destroy);
> +
> +/**
> + * drm_gpuva_gem_find() - find the &drm_gpuva_gem for the given
> + * &drm_gpuva_manager and &drm_gem_object
> + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> + * @obj: The &drm_gem_object being mapped in the @mgr.
> + *
> + * Find the &drm_gpuva_gem representing the combination of the given
> + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
> + * count of the &drm_gpuva_gem accordingly.
> + *
> + * Returns: a pointer to the &drm_gpuva_gem on success, NULL on failure
> + */
> +struct drm_gpuva_gem *
> +drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
> +		   struct drm_gem_object *obj)
> +{
> +	struct drm_gpuva_gem *vm_bo = __drm_gpuva_gem_find(mgr, obj);
> +
> +	return vm_bo ? drm_gpuva_gem_get(vm_bo) : NULL;
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_find);
> +
> +/**
> + * drm_gpuva_gem_obtain() - obtains an instance of the &drm_gpuva_gem for the
> + * given &drm_gpuva_manager and &drm_gem_object
> + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> + * @obj: The &drm_gem_object being mapped in the @mgr.
> + *
> + * Find the &drm_gpuva_gem representing the combination of the given
> + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
> + * count of the &drm_gpuva_gem accordingly. If not found, allocates a new
> + * &drm_gpuva_gem.
> + *
> + * Returns: a pointer to the &drm_gpuva_gem on success, an ERR_PTR on failure
> + */
> +struct drm_gpuva_gem *
> +drm_gpuva_gem_obtain(struct drm_gpuva_manager *mgr,
> +		     struct drm_gem_object *obj)
> +{
> +	struct drm_gpuva_gem *vm_bo;
> +
> +	vm_bo = drm_gpuva_gem_find(mgr, obj);
> +	if (vm_bo)
> +		return vm_bo;
> +
> +	vm_bo = drm_gpuva_gem_create(mgr, obj);
> +	if (!vm_bo)
> +		return ERR_PTR(-ENOMEM);
> +
> +	return vm_bo;
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_obtain);
> +
> +/**
> + * drm_gpuva_gem_obtain_prealloc() - obtains an instance of the &drm_gpuva_gem
> + * for the given &drm_gpuva_manager and &drm_gem_object
> + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> + * @obj: The &drm_gem_object being mapped in the @mgr.
> + * @__vm_bo: A pre-allocated &drm_gpuva_gem to return if no existing one is found.
> + *
> + * Find the &drm_gpuva_gem representing the combination of the given
> + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
> + * count of the found &drm_gpuva_gem accordingly, while the @__vm_bo reference
> + * count is decreased. If not found @__vm_bo is returned.
> + *
> + * Returns: a pointer to the found &drm_gpuva_gem or @__vm_bo if no existing
> + * &drm_gpuva_gem was found
> + */
> +struct drm_gpuva_gem *
> +drm_gpuva_gem_obtain_prealloc(struct drm_gpuva_manager *mgr,
> +			      struct drm_gem_object *obj,
> +			      struct drm_gpuva_gem *__vm_bo)
> +{
> +	struct drm_gpuva_gem *vm_bo;
> +
> +	vm_bo = drm_gpuva_gem_find(mgr, obj);
> +	if (vm_bo) {
> +		drm_gpuva_gem_put(__vm_bo);
> +		return vm_bo;
> +	}
> +
> +	return __vm_bo;
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_obtain_prealloc);
> +
> +static int
> +__drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
> +			  struct drm_gem_object *obj,
> +			  gfp_t gfp)
> +{
> +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
> +	union {
> +		struct drm_gem_object *obj;
> +		uintptr_t index;
> +	} gem;
> +	union {
> +		void *ptr;
> +		uintptr_t cnt;
> +	} ref;
> +	int ret = 0;
> +
> +	gem.obj = obj;
> +	mas_set(&mas, gem.index);
> +
> +	mas_lock(&mas);
> +	ref.ptr = mas_walk(&mas);
> +	if (ref.ptr) {
> +		++ref.cnt;
> +		mas_store(&mas, ref.ptr);
> +	} else {
> +		if (unlikely(!gfp)) {
> +			ret = -EINVAL;
> +			goto out;
> +		}
> +
> +		mas_set(&mas, gem.index);
> +		ref.cnt = 1;
> +		ret = mas_store_gfp(&mas, ref.ptr, gfp);
> +		if (likely(!ret))
> +			drm_gem_object_get(obj);
> +	}
> +out:
> +	mas_unlock(&mas);
> +	return ret;
> +}
> +
> +static void
> +__drm_gpuva_extobj_remove(struct drm_gpuva_manager *mgr,
> +			  struct drm_gem_object *obj)
> +{
> +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
> +	union {
> +		struct drm_gem_object *obj;
> +		uintptr_t index;
> +	} gem;
> +	union {
> +		void *ptr;
> +		uintptr_t cnt;
> +	} ref;
> +
> +	gem.obj = obj;
> +	mas_set(&mas, gem.index);
> +
> +	mas_lock(&mas);
> +	if (unlikely(!(ref.ptr = mas_walk(&mas))))
> +		goto out;
> +
> +	if (!--ref.cnt) {
> +		mas_erase(&mas);
> +		drm_gem_object_put(obj);
> +	} else {
> +		mas_store(&mas, ref.ptr);
> +	}
> +out:
> +	mas_unlock(&mas);
> +}
> +
> +/**
> + * drm_gpuva_extobj_insert - insert an external &drm_gem_object
> + * @mgr: the &drm_gpuva_manager to insert into
> + * @obj: the &drm_gem_object to insert as extobj
> + *
> + * Insert a &drm_gem_object into the &drm_gpuva_managers external object tree.
> + * If the &drm_gem_object already exists in the tree, the reference counter
> + * of this external object is increased by one.
> + *
> + * Drivers should insert the external &drm_gem_object before the dma-fence
> + * signalling critical section, e.g. when submitting the job, and before
> + * locking all &drm_gem_objects of a GPU-VM, e.g. with drm_gpuva_manager_lock()
> + * or its derivatives.
> + *
> + * Returns: 0 on success, negative error code on failure.
> + */
> +int
> +drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
> +			struct drm_gem_object *obj)
> +{
> +	return drm_gpuva_is_extobj(mgr, obj) ?
> +		__drm_gpuva_extobj_insert(mgr, obj, GFP_KERNEL) : 0;
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_insert);
> +
> +/**
> + * drm_gpuva_extobj_get - increase the reference count of an external
> + * &drm_gem_object
> + * @mgr: the &drm_gpuva_manager storing the extobj
> + * @obj: the &drm_gem_object representing the extobj
> + *
> + * Increases the reference count of the extobj represented by @obj.
> + *
> + * Drivers should call this for every &drm_gpuva backed by a &drm_gem_object
> + * being inserted.
> + *
> + * For &drm_gpuva_op_remap operations drivers should make sure to only take an
> + * additional reference if the re-map operation splits an existing &drm_gpuva
> + * into two separate ones.
> + *
> + * See also drm_gpuva_map_get() and drm_gpuva_remap_get().
> + */
> +void
> +drm_gpuva_extobj_get(struct drm_gpuva_manager *mgr,
> +		     struct drm_gem_object *obj)
> +{
> +	if (drm_gpuva_is_extobj(mgr, obj))
> +		WARN(__drm_gpuva_extobj_insert(mgr, obj, 0),
> +		     "Can't increase ref-count of non-existent extobj.\n");
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_get);
> +
> +/**
> + * drm_gpuva_extobj_put - decrease the reference count of an external
> + * &drm_gem_object
> + * @mgr: the &drm_gpuva_manager storing the extobj
> + * @obj: the &drm_gem_object representing the extobj
> + *
> + * Decreases the reference count of the extobj represented by @obj.
> + *
> + * Drivers should call this for every &drm_gpuva backed by a &drm_gem_object
> + * being removed from the GPU VA space.
> + *
> + * See also drm_gpuva_unmap_put().
> + */
> +void
> +drm_gpuva_extobj_put(struct drm_gpuva_manager *mgr,
> +		     struct drm_gem_object *obj)
> +{
> +	if (drm_gpuva_is_extobj(mgr, obj))
> +		__drm_gpuva_extobj_remove(mgr, obj);
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_put);
> +
> +/**
> + * drm_gpuva_gem_evict() - add / remove a &drm_gem_object to / from a
> + * &drm_gpuva_managers evicted list
> + * @obj: the &drm_gem_object to add or remove
> + * @evict: indicates whether the object is evicted
> + *
> + * Adds a &drm_gem_object to or removes it from all &drm_gpuva_managers evicted
> + * list containing a mapping of this &drm_gem_object.
> + */
> +void
> +drm_gpuva_gem_evict(struct drm_gem_object *obj, bool evict)
> +{
> +	struct drm_gpuva_gem *vm_bo;
> +
> +	/* Required for iterating the GEMs GPUVA GEM list. If no driver specific
> +	 * lock has been set, the list is protected with the GEMs dma-resv lock.
> +	 */
> +	drm_gem_gpuva_assert_lock_held(obj);
> +
> +	/* Required to protect the GPUVA managers evict list against concurrent
> +	 * access through drm_gpuva_manager_validate(). Concurrent insertions to
> +	 * the evict list through different GEM object evictions are protected
> +	 * by the GPUVA managers evict lock.
> +	 */
> +	dma_resv_assert_held(obj->resv);
> +
> +	drm_gem_for_each_gpuva_gem(vm_bo, obj) {
> +		struct drm_gpuva_manager *mgr = vm_bo->mgr;
> +
> +		spin_lock(&mgr->evict.lock);
> +		if (evict)
> +			list_add_tail(&vm_bo->list.entry.evict,
> +				      &mgr->evict.list);
> +		else
> +			list_del_init(&vm_bo->list.entry.evict);
> +		spin_unlock(&mgr->evict.lock);
> +	}
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_evict);
> +
>   static int
>   __drm_gpuva_insert(struct drm_gpuva_manager *mgr,
>   		   struct drm_gpuva *va)
> @@ -806,15 +1381,20 @@ EXPORT_SYMBOL_GPL(drm_gpuva_remove);
>   /**
>    * drm_gpuva_link() - link a &drm_gpuva
>    * @va: the &drm_gpuva to link
> + * @vm_bo: the &drm_gpuva_gem to add the &drm_gpuva to
>    *
> - * This adds the given &va to the GPU VA list of the &drm_gem_object it is
> - * associated with.
> + * This adds the given &va to the GPU VA list of the &drm_gpuva_gem and the
> + * &drm_gpuva_gem to the &drm_gem_object it is associated with.
> + *
> + * For every &drm_gpuva entry added to the &drm_gpuva_gem an additional
> + * reference of the latter is taken.
>    *
>    * This function expects the caller to protect the GEM's GPUVA list against
> - * concurrent access using the GEMs dma_resv lock.
> + * concurrent access using either the GEMs dma_resv lock or a driver specific
> + * lock set through drm_gem_gpuva_set_lock().
>    */
>   void
> -drm_gpuva_link(struct drm_gpuva *va)
> +drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo)
>   {
>   	struct drm_gem_object *obj = va->gem.obj;
>   
> @@ -823,7 +1403,10 @@ drm_gpuva_link(struct drm_gpuva *va)
>   
>   	drm_gem_gpuva_assert_lock_held(obj);
>   
> -	list_add_tail(&va->gem.entry, &obj->gpuva.list);
> +	drm_gpuva_gem_get(vm_bo);
> +	list_add_tail(&va->gem.entry, &vm_bo->list.gpuva);
> +	if (list_empty(&vm_bo->list.entry.gem))
> +		list_add_tail(&vm_bo->list.entry.gem, &obj->gpuva.list);
>   }
>   EXPORT_SYMBOL_GPL(drm_gpuva_link);
>   
> @@ -834,20 +1417,39 @@ EXPORT_SYMBOL_GPL(drm_gpuva_link);
>    * This removes the given &va from the GPU VA list of the &drm_gem_object it is
>    * associated with.
>    *
> + * This removes the given &va from the GPU VA list of the &drm_gpuva_gem and
> + * the &drm_gpuva_gem from the &drm_gem_object it is associated with in case
> + * this call unlinks the last &drm_gpuva from the &drm_gpuva_gem.
> + *
> + * For every &drm_gpuva entry removed from the &drm_gpuva_gem a reference of
> + * the latter is dropped.
> + *
>    * This function expects the caller to protect the GEM's GPUVA list against
> - * concurrent access using the GEMs dma_resv lock.
> + * concurrent access using either the GEMs dma_resv lock or a driver specific
> + * lock set through drm_gem_gpuva_set_lock().
>    */
>   void
>   drm_gpuva_unlink(struct drm_gpuva *va)
>   {
>   	struct drm_gem_object *obj = va->gem.obj;
> +	struct drm_gpuva_gem *vm_bo;
>   
>   	if (unlikely(!obj))
>   		return;
>   
>   	drm_gem_gpuva_assert_lock_held(obj);
>   
> +	vm_bo = __drm_gpuva_gem_find(va->mgr, obj);
> +	if (WARN(!vm_bo, "GPUVA doesn't seem to be linked.\n"))
> +		return;
> +
>   	list_del_init(&va->gem.entry);
> +
> +	if (list_empty(&vm_bo->list.gpuva)) {
> +		list_del_init(&vm_bo->list.entry.gem);
> +		list_del_init(&vm_bo->list.entry.evict);
> +	}
> +	drm_gpuva_gem_put(vm_bo);
>   }
>   EXPORT_SYMBOL_GPL(drm_gpuva_unlink);
>   
> @@ -977,6 +1579,26 @@ drm_gpuva_map(struct drm_gpuva_manager *mgr,
>   }
>   EXPORT_SYMBOL_GPL(drm_gpuva_map);
>   
> +/**
> + * drm_gpuva_map_get() - helper to insert a &drm_gpuva according to a
> + * &drm_gpuva_op_map
> + * @mgr: the &drm_gpuva_manager
> + * @va: the &drm_gpuva to insert
> + * @op: the &drm_gpuva_op_map to initialize @va with
> + *
> + * Initializes the @va from the @op and inserts it into the given @mgr and
> + * increases the reference count of the corresponding extobj.
> + */
> +void
> +drm_gpuva_map_get(struct drm_gpuva_manager *mgr,
> +		  struct drm_gpuva *va,
> +		  struct drm_gpuva_op_map *op)
> +{
> +	drm_gpuva_map(mgr, va, op);
> +	drm_gpuva_extobj_get(mgr, va->gem.obj);
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_map_get);
> +
>   /**
>    * drm_gpuva_remap() - helper to remap a &drm_gpuva according to a
>    * &drm_gpuva_op_remap
> @@ -992,10 +1614,10 @@ drm_gpuva_remap(struct drm_gpuva *prev,
>   		struct drm_gpuva *next,
>   		struct drm_gpuva_op_remap *op)
>   {
> -	struct drm_gpuva *curr = op->unmap->va;
> -	struct drm_gpuva_manager *mgr = curr->mgr;
> +	struct drm_gpuva *va = op->unmap->va;
> +	struct drm_gpuva_manager *mgr = va->mgr;
>   
> -	drm_gpuva_remove(curr);
> +	drm_gpuva_remove(va);
>   
>   	if (op->prev) {
>   		drm_gpuva_init_from_op(prev, op->prev);
> @@ -1009,6 +1631,31 @@ drm_gpuva_remap(struct drm_gpuva *prev,
>   }
>   EXPORT_SYMBOL_GPL(drm_gpuva_remap);
>   
> +/**
> + * drm_gpuva_remap_get() - helper to remap a &drm_gpuva according to a
> + * &drm_gpuva_op_remap
> + * @prev: the &drm_gpuva to remap when keeping the start of a mapping
> + * @next: the &drm_gpuva to remap when keeping the end of a mapping
> + * @op: the &drm_gpuva_op_remap to initialize @prev and @next with
> + *
> + * Removes the currently mapped &drm_gpuva and remaps it using @prev and/or
> + * @next. Additionally, if the re-map splits the existing &drm_gpuva into two
> + * separate mappings, increases the reference count of the corresponding extobj.
> + */
> +void
> +drm_gpuva_remap_get(struct drm_gpuva *prev,
> +		    struct drm_gpuva *next,
> +		    struct drm_gpuva_op_remap *op)
> +{
> +	struct drm_gpuva *va = op->unmap->va;
> +	struct drm_gpuva_manager *mgr = va->mgr;
> +
> +	drm_gpuva_remap(prev, next, op);
> +	if (op->prev && op->next)
> +		drm_gpuva_extobj_get(mgr, va->gem.obj);
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_remap_get);
> +
>   /**
>    * drm_gpuva_unmap() - helper to remove a &drm_gpuva according to a
>    * &drm_gpuva_op_unmap
> @@ -1023,6 +1670,24 @@ drm_gpuva_unmap(struct drm_gpuva_op_unmap *op)
>   }
>   EXPORT_SYMBOL_GPL(drm_gpuva_unmap);
>   
> +/**
> + * drm_gpuva_unmap_put() - helper to remove a &drm_gpuva according to a
> + * &drm_gpuva_op_unmap
> + * @op: the &drm_gpuva_op_unmap specifying the &drm_gpuva to remove
> + *
> + * Removes the &drm_gpuva associated with the &drm_gpuva_op_unmap and decreases
> + * the reference count of the corresponding extobj.
> + */
> +void
> +drm_gpuva_unmap_put(struct drm_gpuva_op_unmap *op)
> +{
> +	struct drm_gpuva *va = op->va;
> +
> +	drm_gpuva_unmap(op);
> +	drm_gpuva_extobj_put(va->mgr, va->gem.obj);
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_unmap_put);
> +
>   static int
>   op_map_cb(const struct drm_gpuva_fn_ops *fn, void *priv,
>   	  u64 addr, u64 range,
> @@ -1663,6 +2328,7 @@ drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
>   {
>   	struct drm_gpuva_ops *ops;
>   	struct drm_gpuva_op *op;
> +	struct drm_gpuva_gem *vm_bo;
>   	struct drm_gpuva *va;
>   	int ret;
>   
> @@ -1674,7 +2340,7 @@ drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
>   
>   	INIT_LIST_HEAD(&ops->list);
>   
> -	drm_gem_for_each_gpuva(va, obj) {
> +	drm_gem_for_each_gpuva(va, vm_bo, mgr, obj) {
>   		op = gpuva_op_alloc(mgr);
>   		if (!op) {
>   			ret = -ENOMEM;
> diff --git a/include/drm/drm_gem.h b/include/drm/drm_gem.h
> index bc9f6aa2f3fe..783ed3ab440d 100644
> --- a/include/drm/drm_gem.h
> +++ b/include/drm/drm_gem.h
> @@ -571,7 +571,7 @@ int drm_gem_evict(struct drm_gem_object *obj);
>    * drm_gem_gpuva_init() - initialize the gpuva list of a GEM object
>    * @obj: the &drm_gem_object
>    *
> - * This initializes the &drm_gem_object's &drm_gpuva list.
> + * This initializes the &drm_gem_object's &drm_gpuva_gem list.
>    *
>    * Calling this function is only necessary for drivers intending to support the
>    * &drm_driver_feature DRIVER_GEM_GPUVA.
> @@ -584,28 +584,44 @@ static inline void drm_gem_gpuva_init(struct drm_gem_object *obj)
>   }
>   
>   /**
> - * drm_gem_for_each_gpuva() - iternator to walk over a list of gpuvas
> - * @entry__: &drm_gpuva structure to assign to in each iteration step
> - * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
> + * drm_gem_for_each_gpuva_gem() - iterator to walk over a list of &drm_gpuva_gem
> + * @entry__: &drm_gpuva_gem structure to assign to in each iteration step
> + * @obj__: the &drm_gem_object the &drm_gpuva_gem entries to walk are associated with
>    *
> - * This iterator walks over all &drm_gpuva structures associated with the
> - * &drm_gpuva_manager.
> + * This iterator walks over all &drm_gpuva_gem structures associated with the
> + * &drm_gem_object.
>    */
> -#define drm_gem_for_each_gpuva(entry__, obj__) \
> -	list_for_each_entry(entry__, &(obj__)->gpuva.list, gem.entry)
> +#define drm_gem_for_each_gpuva_gem(entry__, obj__) \
> +	list_for_each_entry(entry__, &(obj__)->gpuva.list, list.entry.gem)
>   
>   /**
> - * drm_gem_for_each_gpuva_safe() - iternator to safely walk over a list of
> - * gpuvas
> - * @entry__: &drm_gpuva structure to assign to in each iteration step
> - * @next__: &next &drm_gpuva to store the next step
> - * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
> + * drm_gem_for_each_gpuva_gem_safe() - iterator to safely walk over a list of
> + * &drm_gpuva_gem
> + * @entry__: &drm_gpuva_gem structure to assign to in each iteration step
> + * @next__: &next &drm_gpuva_gem to store the next step
> + * @obj__: the &drm_gem_object the &drm_gpuva_gem entries to walk are associated with
>    *
> - * This iterator walks over all &drm_gpuva structures associated with the
> + * This iterator walks over all &drm_gpuva_gem structures associated with the
>    * &drm_gem_object. It is implemented with list_for_each_entry_safe(), hence
>    * it is save against removal of elements.
>    */
> -#define drm_gem_for_each_gpuva_safe(entry__, next__, obj__) \
> -	list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list, gem.entry)
> +#define drm_gem_for_each_gpuva_gem_safe(entry__, next__, obj__) \
> +	list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list, list.entry.gem)
> +
> +/**
> + * drm_gem_for_each_gpuva() - iterator to walk over a list of &drm_gpuva
> + * @va__: &drm_gpuva structure to assign to in each iteration step
> + * @vm_bo__: the &drm_gpuva_gem representing the @mgr__ and @obj__ combination
> + * @mgr__: the &drm_gpuva_manager the &drm_gpuvas to walk are associated with
> + * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
> + *
> + * This iterator walks over all &drm_gpuva structures associated with the
> + * &drm_gpuva_manager and &drm_gem_object.
> + */
> +#define drm_gem_for_each_gpuva(va__, vm_bo__, mgr__, obj__) \
> +	for (vm_bo__ = drm_gpuva_gem_find(mgr__, obj__), \
> +	     va__ = vm_bo__ ? list_first_entry(&vm_bo__->list.gpuva, typeof(*va__), gem.entry) : NULL; \
> +	     va__ && !list_entry_is_head(va__, &vm_bo__->list.gpuva, gem.entry); \
> +	     va__ = list_next_entry(va__, gem.entry))
>   
>   #endif /* __DRM_GEM_H__ */
> diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h
> index ed8d50200cc3..693e2da3f425 100644
> --- a/include/drm/drm_gpuva_mgr.h
> +++ b/include/drm/drm_gpuva_mgr.h
> @@ -26,12 +26,16 @@
>    */
>   
>   #include <linux/list.h>
> +#include <linux/dma-resv.h>
> +#include <linux/maple_tree.h>
>   #include <linux/rbtree.h>
>   #include <linux/types.h>
>   
>   #include <drm/drm_gem.h>
> +#include <drm/drm_exec.h>
>   
>   struct drm_gpuva_manager;
> +struct drm_gpuva_gem;
>   struct drm_gpuva_fn_ops;
>   
>   /**
> @@ -140,7 +144,7 @@ struct drm_gpuva {
>   int drm_gpuva_insert(struct drm_gpuva_manager *mgr, struct drm_gpuva *va);
>   void drm_gpuva_remove(struct drm_gpuva *va);
>   
> -void drm_gpuva_link(struct drm_gpuva *va);
> +void drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo);
>   void drm_gpuva_unlink(struct drm_gpuva *va);
>   
>   struct drm_gpuva *drm_gpuva_find(struct drm_gpuva_manager *mgr,
> @@ -240,15 +244,137 @@ struct drm_gpuva_manager {
>   	 * @ops: &drm_gpuva_fn_ops providing the split/merge steps to drivers
>   	 */
>   	const struct drm_gpuva_fn_ops *ops;
> +
> +	/**
> +	 * @d_obj: Dummy GEM object; used internally to pass the GPU VMs
> +	 * dma-resv to &drm_exec.
> +	 */
> +	struct drm_gem_object d_obj;
> +
> +	/**
> +	 * @resv: the &dma_resv for &drm_gem_objects mapped in this GPU VA
> +	 * space
> +	 */
> +	struct dma_resv *resv;
> +
> +	/**
> +	 * @exec: the &drm_exec helper to lock external &drm_gem_objects
> +	 */
> +	struct drm_exec exec;
> +
> +	/**
> +	 * @mt_ext: &maple_tree storing external &drm_gem_objects
> +	 */
> +	struct maple_tree mt_ext;
> +
> +	/**
> +	 * @evict: structure holding the evict list and evict list lock
> +	 */
> +	struct {
> +		/**
> +		 * @list: &list_head storing the &drm_gpuva_gems currently
> +		 * being evicted
> +		 */
> +		struct list_head list;
> +
> +		/**
> +		 * @lock: spinlock to protect the evict list against concurrent
> +		 * insertion / removal of different &drm_gpuva_gems
> +		 */
> +		spinlock_t lock;
> +	} evict;
>   };
>   
>   void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> +			    struct drm_device *drm,
>   			    const char *name,
>   			    u64 start_offset, u64 range,
>   			    u64 reserve_offset, u64 reserve_range,
>   			    const struct drm_gpuva_fn_ops *ops);
>   void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr);
>   
> +/**
> + * DRM_GPUVA_EXEC - returns the &drm_gpuva_managers &drm_exec instance
> + * @mgr: the &drm_gpuva_managers to return the &drm_exec instance for
> + */
> +#define DRM_GPUVA_EXEC(mgr)	(&(mgr)->exec)
> +
> +int drm_gpuva_manager_lock_extra(struct drm_gpuva_manager *mgr,
> +				 int (*fn)(struct drm_gpuva_manager *mgr,
> +					   void *priv, unsigned int num_fences),
> +				 void *priv,
> +				 unsigned int num_fences,
> +				 bool interruptible);
> +
> +int drm_gpuva_manager_lock_array(struct drm_gpuva_manager *mgr,
> +				 struct drm_gem_object **objs,
> +				 unsigned int num_objs,
> +				 unsigned int num_fences,
> +				 bool interruptible);
> +
> +/**
> + * drm_gpuva_manager_lock() - lock all dma-resv of all associated BOs
> + * @mgr: the &drm_gpuva_manager
> + * @num_fences: the amount of &dma_fences to reserve
> + * @interruptible: sleep interruptible if waiting
> + *
> + * Acquires all dma-resv locks of all &drm_gem_objects the given
> + * &drm_gpuva_manager contains mappings of.
> + *
> + * Returns: 0 on success, negative error code on failure.
> + */
> +static inline int
> +drm_gpuva_manager_lock(struct drm_gpuva_manager *mgr,
> +		       unsigned int num_fences,
> +		       bool interruptible)
> +{
> +	return drm_gpuva_manager_lock_extra(mgr, NULL, NULL, num_fences,
> +					    interruptible);
> +}
> +
> +/**
> + * drm_gpuva_manager_unlock() - unlock all dma-resv of all associated BOs
> + * @mgr: the &drm_gpuva_manager
> + *
> + * Releases all dma-resv locks of all &drm_gem_objects previously acquired
> + * through drm_gpuva_manager_lock() or its variants.
> + */
> +static inline void
> +drm_gpuva_manager_unlock(struct drm_gpuva_manager *mgr)
> +{
> +	drm_exec_fini(&mgr->exec);
> +}
> +
> +int drm_gpuva_manager_validate(struct drm_gpuva_manager *mgr);
> +void drm_gpuva_manager_resv_add_fence(struct drm_gpuva_manager *mgr,
> +				      struct dma_fence *fence,
> +				      enum dma_resv_usage private_usage,
> +				      enum dma_resv_usage extobj_usage);
> +
> +int drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
> +			    struct drm_gem_object *obj);
> +void drm_gpuva_extobj_get(struct drm_gpuva_manager *mgr,
> +			  struct drm_gem_object *obj);
> +void drm_gpuva_extobj_put(struct drm_gpuva_manager *mgr,
> +			  struct drm_gem_object *obj);
> +
> +/**
> + * drm_gpuva_is_extobj() - indicates whether the given &drm_gem_object is an
> + * external object
> + * @mgr: the &drm_gpuva_manager to check
> + * @obj: the &drm_gem_object to check
> + *
> + * Returns: true if the &drm_gem_object &dma_resv differs from the
> + * &drm_gpuva_managers &dma_resv, false otherwise
> + */
> +static inline bool drm_gpuva_is_extobj(struct drm_gpuva_manager *mgr,
> +				       struct drm_gem_object *obj)
> +{
> +	return obj && obj->resv != mgr->resv;
> +}
> +
>   static inline struct drm_gpuva *
>   __drm_gpuva_next(struct drm_gpuva *va)
>   {
> @@ -327,6 +453,138 @@ __drm_gpuva_next(struct drm_gpuva *va)
>   #define drm_gpuva_for_each_va_safe(va__, next__, mgr__) \
>   	list_for_each_entry_safe(va__, next__, &(mgr__)->rb.list, rb.entry)
>   
> +/**
> + * struct drm_gpuva_gem - structure representing a &drm_gpuva_manager and
> + * &drm_gem_object combination
> + *
> + * This structure is an abstraction representing a &drm_gpuva_manager and
> + * &drm_gem_object combination. It serves as an indirection to accelerate
> + * iterating all &drm_gpuvas within a &drm_gpuva_manager backed by the same
> + * &drm_gem_object.
> + *
> + * Furthermore, it is used to cache evicted GEM objects for a certain GPU-VM to
> + * accelerate validation.
> + *
> + * Typically, drivers want to create an instance of a struct drm_gpuva_gem once
> + * a GEM object is first mapped in a GPU-VM and release the instance once the
> + * last mapping of the GEM object in this GPU-VM is unmapped.
> + */
> +struct drm_gpuva_gem {
> +
> +	/**
> +	 * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> +	 */
> +	struct drm_gpuva_manager *mgr;
> +
> +	/**
> +	 * @obj: The &drm_gem_object being mapped in the @mgr.
> +	 */
> +	struct drm_gem_object *obj;
> +
> +	/**
> +	 * @kref: The reference count for this &drm_gpuva_gem.
> +	 */
> +	struct kref kref;
> +
> +	/**
> +	 * @list: Structure containing all &list_heads.
> +	 */
> +	struct {
> +		/**
> +		 * @gpuva: The list of linked &drm_gpuvas.
> +		 */
> +		struct list_head gpuva;
> +
> +		/**
> +		 * @entry: Structure containing all &list_heads serving as
> +		 * entry.
> +		 */
> +		struct {
> +			/**
> +			 * @gem: List entry to attach to the &drm_gem_objects
> +			 * gpuva list.
> +			 */
> +			struct list_head gem;
> +
> +			/**
> +			 * @evict: List entry to attach to the
> +			 * &drm_gpuva_managers evict list.
> +			 */
> +			struct list_head evict;
> +		} entry;
> +	} list;
> +};
> +
> +struct drm_gpuva_gem *
> +drm_gpuva_gem_obtain(struct drm_gpuva_manager *mgr,
> +		     struct drm_gem_object *obj);
> +struct drm_gpuva_gem *
> +drm_gpuva_gem_obtain_prealloc(struct drm_gpuva_manager *mgr,
> +			      struct drm_gem_object *obj,
> +			      struct drm_gpuva_gem *__vm_bo);
> +
> +struct drm_gpuva_gem *
> +drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
> +		   struct drm_gem_object *obj);
> +
> +void drm_gpuva_gem_evict(struct drm_gem_object *obj, bool evict);
> +
> +struct drm_gpuva_gem *
> +drm_gpuva_gem_create(struct drm_gpuva_manager *mgr,
> +		     struct drm_gem_object *obj);
> +void drm_gpuva_gem_destroy(struct kref *kref);
> +
> +/**
> + * drm_gpuva_gem_get() - acquire a struct drm_gpuva_gem reference
> + * @vm_bo: the &drm_gpuva_gem to acquire the reference of
> + *
> + * This function acquires an additional reference to @vm_bo. It is illegal to
> + * call this without already holding a reference. No locks required.
> + */
> +static inline struct drm_gpuva_gem *
> +drm_gpuva_gem_get(struct drm_gpuva_gem *vm_bo)
> +{
> +	kref_get(&vm_bo->kref);
> +	return vm_bo;
> +}
> +
> +/**
> + * drm_gpuva_gem_put() - drop a struct drm_gpuva_gem reference
> + * @vm_bo: the &drm_gpuva_gem to release the reference of
> + *
> + * This releases a reference to @vm_bo.
> + */
> +static inline void
> +drm_gpuva_gem_put(struct drm_gpuva_gem *vm_bo)
> +{
> +	kref_put(&vm_bo->kref, drm_gpuva_gem_destroy);
> +}
> +
> +/**
> + * drm_gpuva_gem_for_each_va() - iterator to walk over a list of &drm_gpuva
> + * @va__: &drm_gpuva structure to assign to in each iteration step
> + * @vm_bo__: the &drm_gpuva_gem the &drm_gpuvas to walk are associated with
> + *
> + * This iterator walks over all &drm_gpuva structures associated with the
> + * &drm_gpuva_gem.
> + */
> +#define drm_gpuva_gem_for_each_va(va__, vm_bo__) \
> +	list_for_each_entry(va__, &(vm_bo__)->list.gpuva, gem.entry)
> +
> +/**
> + * drm_gpuva_gem_for_each_va_safe() - iterator to safely walk over a list of
> + * &drm_gpuva
> + * @va__: &drm_gpuva structure to assign to in each iteration step
> + * @next__: &next &drm_gpuva to store the next step
> + * @vm_bo__: the &drm_gpuva_gem the &drm_gpuvas to walk are associated with
> + *
> + * This iterator walks over all &drm_gpuva structures associated with the
> + * &drm_gpuva_gem. It is implemented with list_for_each_entry_safe(), hence
> + * it is safe against removal of elements.
> + */
> +#define drm_gpuva_gem_for_each_va_safe(va__, next__, vm_bo__) \
> +	list_for_each_entry_safe(va__, next__, &(vm_bo__)->list.gpuva, gem.entry)
> +
>   /**
>    * enum drm_gpuva_op_type - GPU VA operation type
>    *
> @@ -641,6 +899,30 @@ struct drm_gpuva_fn_ops {
>   	 */
>   	void (*op_free)(struct drm_gpuva_op *op);
>   
> +	/**
> +	 * @vm_bo_alloc: called when the &drm_gpuva_manager allocates
> +	 * a struct drm_gpuva_gem
> +	 *
> +	 * Some drivers may want to embed struct drm_gpuva_gem into driver
> +	 * specific structures. By implementing this callback drivers can
> +	 * allocate memory accordingly.
> +	 *
> +	 * This callback is optional.
> +	 */
> +	struct drm_gpuva_gem *(*vm_bo_alloc)(void);
> +
> +	/**
> +	 * @vm_bo_free: called when the &drm_gpuva_manager frees a
> +	 * struct drm_gpuva_gem
> +	 *
> +	 * Some drivers may want to embed struct drm_gpuva_gem into driver
> +	 * specific structures. By implementing this callback drivers can
> +	 * free the previously allocated memory accordingly.
> +	 *
> +	 * This callback is optional.
> +	 */
> +	void (*vm_bo_free)(struct drm_gpuva_gem *vm_bo);
> +
>   	/**
>   	 * @sm_step_map: called from &drm_gpuva_sm_map to finally insert the
>   	 * mapping once all previous steps were completed
> @@ -684,6 +966,17 @@ struct drm_gpuva_fn_ops {
>   	 * used.
>   	 */
>   	int (*sm_step_unmap)(struct drm_gpuva_op *op, void *priv);
> +
> +	/**
> +	 * @bo_validate: called from drm_gpuva_manager_validate()
> +	 *
> +	 * Drivers receive this callback for every evicted &drm_gem_object being
> +	 * mapped in the corresponding &drm_gpuva_manager.
> +	 *
> +	 * Typically, drivers would call their driver specific variant of
> +	 * ttm_bo_validate() from within this callback.
> +	 */
> +	int (*bo_validate)(struct drm_gem_object *obj);
>   };
>   
>   int drm_gpuva_sm_map(struct drm_gpuva_manager *mgr, void *priv,
> @@ -696,11 +989,18 @@ int drm_gpuva_sm_unmap(struct drm_gpuva_manager *mgr, void *priv,
>   void drm_gpuva_map(struct drm_gpuva_manager *mgr,
>   		   struct drm_gpuva *va,
>   		   struct drm_gpuva_op_map *op);
> +void drm_gpuva_map_get(struct drm_gpuva_manager *mgr,
> +		       struct drm_gpuva *va,
> +		       struct drm_gpuva_op_map *op);
>   
>   void drm_gpuva_remap(struct drm_gpuva *prev,
>   		     struct drm_gpuva *next,
>   		     struct drm_gpuva_op_remap *op);
> +void drm_gpuva_remap_get(struct drm_gpuva *prev,
> +			 struct drm_gpuva *next,
> +			 struct drm_gpuva_op_remap *op);
>   
>   void drm_gpuva_unmap(struct drm_gpuva_op_unmap *op);
> +void drm_gpuva_unmap_put(struct drm_gpuva_op_unmap *op);
>   
>   #endif /* __DRM_GPUVA_MGR_H__ */
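Not part of the patch, just to illustrate the intended flow: pieced together
from the helpers above, a driver's submission path would roughly look like the
sketch below. struct drv_job and drv_job_push() are made-up placeholder names;
only the drm_gpuva_*(), drm_exec and dma-resv calls are the ones the patch
introduces or relies on.

        #include <drm/drm_gpuva_mgr.h>
        #include <linux/dma-fence.h>
        #include <linux/dma-resv.h>
        #include <linux/err.h>

        struct drv_job;						/* hypothetical */
        static struct dma_fence *drv_job_push(struct drv_job *job);	/* hypothetical */

        static int drv_job_submit(struct drm_gpuva_manager *mgr,
                                  struct drv_job *job,
                                  struct drm_gem_object **ext_objs,
                                  unsigned int num_ext_objs)
        {
                struct dma_fence *fence;
                unsigned int i;
                int ret;

                /* Track shared BOs before entering the dma-fence signalling
                 * critical section. */
                for (i = 0; i < num_ext_objs; i++) {
                        ret = drm_gpuva_extobj_insert(mgr, ext_objs[i]);
                        if (ret)
                                return ret;
                }

                /* Lock the VM's private dma-resv plus every extobj dma-resv,
                 * reserving one fence slot per object. */
                ret = drm_gpuva_manager_lock(mgr, 1, true);
                if (ret)
                        return ret;

                /* Re-validate whatever was evicted since the last submission. */
                ret = drm_gpuva_manager_validate(mgr);
                if (ret)
                        goto out_unlock;

                fence = drv_job_push(job);	/* hypothetical: queue the job */
                if (IS_ERR(fence)) {
                        ret = PTR_ERR(fence);
                        goto out_unlock;
                }

                /* Publish the job fence to the private resv and all extobj resvs. */
                drm_gpuva_manager_resv_add_fence(mgr, fence,
                                                 DMA_RESV_USAGE_BOOKKEEP,
                                                 DMA_RESV_USAGE_BOOKKEEP);
                dma_fence_put(fence);

        out_unlock:
                drm_gpuva_manager_unlock(mgr);
                return ret;
        }

The num_fences argument of drm_gpuva_manager_lock() reserves that many fence
slots per locked object, so the single drm_gpuva_manager_resv_add_fence() call
at the end is covered.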


^ permalink raw reply	[flat|nested] 88+ messages in thread

* Re: [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
@ 2023-08-30  7:48     ` Christian König
  0 siblings, 0 replies; 88+ messages in thread
From: Christian König @ 2023-08-30  7:48 UTC (permalink / raw)
  To: Danilo Krummrich, airlied, daniel, matthew.brost,
	thomas.hellstrom, sarah.walker, donald.robson, boris.brezillon,
	faith.ekstrand, bskeggs, Liam.Howlett
  Cc: nouveau, linux-kernel, dri-devel



Am 20.08.23 um 23:53 schrieb Danilo Krummrich:
> So far the DRM GPUVA manager offers common infrastructure to track GPU VA
> allocations and mappings, generically connect GPU VA mappings to their
> backing buffers and perform more complex mapping operations on the GPU VA
> space.
>
> However, there are more design patterns commonly used by drivers, which
> can potentially be generalized in order to make the DRM GPUVA manager
> represent a basic GPU-VM implementation. In this context, this patch aims
> at generalizing the following elements.
>
> 1) Provide a common dma-resv for GEM objects not being used outside of
>     this GPU-VM.
>
> 2) Provide tracking of external GEM objects (GEM objects which are
>     shared with other GPU-VMs).
>
> 3) Provide functions to efficiently lock all GEM objects dma-resv the
>     GPU-VM contains mappings of.
>
> 4) Provide tracking of evicted GEM objects the GPU-VM contains mappings
>     of, such that validation of evicted GEM objects is accelerated.
>
> 5) Provide some convinience functions for common patterns.

Interesting work.

You basically implement a bunch of the ideas I came up with to improve
the amdgpu performance in the common manager now. That was one of the
remaining blockers I had for using this in amdgpu.

The question is, for example, how do you track evictions? We don't have
a common concept of eviction in GEM as far as I know. Or is the driver
responsible for giving those notifications to the GPUVA manager?

And would it be possible to lock only a specific area of the VM, e.g. 
every BO mapped in the interval X..Y?

Regards,
Christian.
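
Going by the kernel-doc of drm_gpuva_gem_evict() in the patch, the driver does
seem to be responsible for the notification: it would call
drm_gpuva_gem_evict() from its own eviction path while holding the BO's
dma-resv, and the evicted BOs then come back through the ->bo_validate()
callback once drm_gpuva_manager_validate() runs. A minimal sketch of that
wiring, with made-up drv_* names (only the drm_gpuva_* pieces are from the
patch):

        #include <drm/drm_gpuva_mgr.h>
        #include <linux/dma-resv.h>

        static int drv_bo_validate_placement(struct drm_gem_object *obj);	/* hypothetical */

        /* Hypothetical hook called from the driver's eviction/move path. */
        static void drv_bo_evict_notify(struct drm_gem_object *obj, bool evicted)
        {
                /* The patch expects the BO's dma-resv (or the driver GPUVA
                 * lock, if one was set) to be held here. */
                dma_resv_assert_held(obj->resv);
                drm_gpuva_gem_evict(obj, evicted);
        }

        /* Callback invoked by drm_gpuva_manager_validate() for each evicted BO;
         * drv_bo_validate_placement() stands in for a ttm_bo_validate() wrapper. */
        static int drv_gpuva_bo_validate(struct drm_gem_object *obj)
        {
                return drv_bo_validate_placement(obj);
        }

        /* Passed to drm_gpuva_manager_init() alongside the sm_step_* callbacks. */
        static const struct drm_gpuva_fn_ops drv_gpuva_ops = {
                .bo_validate = drv_gpuva_bo_validate,
                /* ... sm_step_map / sm_step_remap / sm_step_unmap ... */
        };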

>
> Rather than being designed as a "framework", the target is to make all
> features appear as a collection of optional helper functions, such that
> drivers are free to make use of the DRM GPUVA managers basic
> functionality and opt-in for other features without setting any feature
> flags, just by making use of the corresponding functions.
>
> Signed-off-by: Danilo Krummrich <dakr@redhat.com>
> ---
>   drivers/gpu/drm/drm_gpuva_mgr.c | 688 +++++++++++++++++++++++++++++++-
>   include/drm/drm_gem.h           |  48 ++-
>   include/drm/drm_gpuva_mgr.h     | 302 +++++++++++++-
>   3 files changed, 1010 insertions(+), 28 deletions(-)
>
> diff --git a/drivers/gpu/drm/drm_gpuva_mgr.c b/drivers/gpu/drm/drm_gpuva_mgr.c
> index f86bfad74ff8..69872b205961 100644
> --- a/drivers/gpu/drm/drm_gpuva_mgr.c
> +++ b/drivers/gpu/drm/drm_gpuva_mgr.c
> @@ -655,6 +655,7 @@ drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
>   /**
>    * drm_gpuva_manager_init() - initialize a &drm_gpuva_manager
>    * @mgr: pointer to the &drm_gpuva_manager to initialize
> + * @drm: the drivers &drm_device
>    * @name: the name of the GPU VA space
>    * @start_offset: the start offset of the GPU VA space
>    * @range: the size of the GPU VA space
> @@ -669,6 +670,7 @@ drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
>    */
>   void
>   drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> +		       struct drm_device *drm,
>   		       const char *name,
>   		       u64 start_offset, u64 range,
>   		       u64 reserve_offset, u64 reserve_range,
> @@ -677,6 +679,11 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
>   	mgr->rb.tree = RB_ROOT_CACHED;
>   	INIT_LIST_HEAD(&mgr->rb.list);
>   
> +	mt_init(&mgr->mt_ext);
> +
> +	INIT_LIST_HEAD(&mgr->evict.list);
> +	spin_lock_init(&mgr->evict.lock);
> +
>   	drm_gpuva_check_overflow(start_offset, range);
>   	mgr->mm_start = start_offset;
>   	mgr->mm_range = range;
> @@ -694,6 +701,9 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
>   						     reserve_range)))
>   			__drm_gpuva_insert(mgr, &mgr->kernel_alloc_node);
>   	}
> +
> +	drm_gem_private_object_init(drm, &mgr->d_obj, 0);
> +	mgr->resv = mgr->d_obj.resv;
>   }
>   EXPORT_SYMBOL_GPL(drm_gpuva_manager_init);
>   
> @@ -713,10 +723,575 @@ drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr)
>   		__drm_gpuva_remove(&mgr->kernel_alloc_node);
>   
>   	WARN(!RB_EMPTY_ROOT(&mgr->rb.tree.rb_root),
> -	     "GPUVA tree is not empty, potentially leaking memory.");
> +	     "GPUVA tree is not empty, potentially leaking memory.\n");
> +
> +	mtree_destroy(&mgr->mt_ext);
> +	WARN(!list_empty(&mgr->evict.list), "Evict list should be empty.\n");
> +
> +	drm_gem_private_object_fini(&mgr->d_obj);
>   }
>   EXPORT_SYMBOL_GPL(drm_gpuva_manager_destroy);
>   
> +/**
> + * drm_gpuva_manager_prepare_objects() - prepare all assoiciated BOs
> + * @mgr: the &drm_gpuva_manager
> + * @num_fences: the amount of &dma_fences to reserve
> + *
> + * Calls drm_exec_prepare_obj() for all &drm_gem_objects the given
> + * &drm_gpuva_manager contains mappings of.
> + *
> + * Drivers can obtain the corresponding &drm_exec instance through
> + * DRM_GPUVA_EXEC(). It is the drivers responsibility to call drm_exec_init()
> + * and drm_exec_fini() accordingly.
> + *
> + * Returns: 0 on success, negative error code on failure.
> + */
> +int
> +drm_gpuva_manager_prepare_objects(struct drm_gpuva_manager *mgr,
> +				  unsigned int num_fences)
> +{
> +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
> +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
> +	union {
> +		void *ptr;
> +		uintptr_t cnt;
> +	} ref;
> +	int ret;
> +
> +	ret = drm_exec_prepare_obj(exec, &mgr->d_obj, num_fences);
> +	if (ret)
> +		goto out;
> +
> +	rcu_read_lock();
> +	mas_for_each(&mas, ref.ptr, ULONG_MAX) {
> +		struct drm_gem_object *obj;
> +
> +		mas_pause(&mas);
> +		rcu_read_unlock();
> +
> +		obj = (struct drm_gem_object *)(uintptr_t)mas.index;
> +		ret = drm_exec_prepare_obj(exec, obj, num_fences);
> +		if (ret)
> +			goto out;
> +
> +		rcu_read_lock();
> +	}
> +	rcu_read_unlock();
> +
> +out:
> +	return ret;
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_prepare_objects);
> +
> +/**
> + * drm_gpuva_manager_lock_extra() - lock all dma-resv of all assoiciated BOs
> + * @mgr: the &drm_gpuva_manager
> + * @fn: callback received by the driver to lock additional dma-resv
> + * @priv: private driver data passed to @fn
> + * @num_fences: the amount of &dma_fences to reserve
> + * @interruptible: sleep interruptible if waiting
> + *
> + * Acquires all dma-resv locks of all &drm_gem_objects the given
> + * &drm_gpuva_manager contains mappings of.
> + *
> + * Addionally, when calling this function the driver receives the given @fn
> + * callback to lock additional dma-resv in the context of the
> + * &drm_gpuva_managers &drm_exec instance. Typically, drivers would call
> + * drm_exec_prepare_obj() from within this callback.
> + *
> + * Returns: 0 on success, negative error code on failure.
> + */
> +int
> +drm_gpuva_manager_lock_extra(struct drm_gpuva_manager *mgr,
> +			     int (*fn)(struct drm_gpuva_manager *mgr,
> +				       void *priv, unsigned int num_fences),
> +			     void *priv,
> +			     unsigned int num_fences,
> +			     bool interruptible)
> +{
> +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
> +	uint32_t flags;
> +	int ret;
> +
> +	flags = interruptible ? DRM_EXEC_INTERRUPTIBLE_WAIT : 0 |
> +		DRM_EXEC_IGNORE_DUPLICATES;
> +
> +	drm_exec_init(exec, flags);
> +
> +	drm_exec_until_all_locked(exec) {
> +		ret = drm_gpuva_manager_prepare_objects(mgr, num_fences);
> +		drm_exec_retry_on_contention(exec);
> +		if (ret)
> +			goto err;
> +
> +		if (fn) {
> +			ret = fn(mgr, priv, num_fences);
> +			drm_exec_retry_on_contention(exec);
> +			if (ret)
> +				goto err;
> +		}
> +	}
> +
> +	return 0;
> +
> +err:
> +	drm_exec_fini(exec);
> +	return ret;
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_lock_extra);
> +
> +static int
> +fn_lock_array(struct drm_gpuva_manager *mgr, void *priv,
> +				unsigned int num_fences)
> +{
> +	struct {
> +		struct drm_gem_object **objs;
> +		unsigned int num_objs;
> +	} *args = priv;
> +
> +	return drm_exec_prepare_array(DRM_GPUVA_EXEC(mgr), args->objs,
> +				      args->num_objs, num_fences);
> +}
> +
> +/**
> + * drm_gpuva_manager_lock_array() - lock all dma-resv of all assoiciated BOs
> + * @mgr: the &drm_gpuva_manager
> + * @objs: additional &drm_gem_objects to lock
> + * @num_objs: the number of additional &drm_gem_objects to lock
> + * @num_fences: the amount of &dma_fences to reserve
> + * @interruptible: sleep interruptible if waiting
> + *
> + * Acquires all dma-resv locks of all &drm_gem_objects the given
> + * &drm_gpuva_manager contains mappings of, plus the ones given through @objs.
> + *
> + * Returns: 0 on success, negative error code on failure.
> + */
> +int
> +drm_gpuva_manager_lock_array(struct drm_gpuva_manager *mgr,
> +			     struct drm_gem_object **objs,
> +			     unsigned int num_objs,
> +			     unsigned int num_fences,
> +			     bool interruptible)
> +{
> +	struct {
> +		struct drm_gem_object **objs;
> +		unsigned int num_objs;
> +	} args;
> +
> +	args.objs = objs;
> +	args.num_objs = num_objs;
> +
> +	return drm_gpuva_manager_lock_extra(mgr, fn_lock_array, &args,
> +					    num_fences, interruptible);
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_lock_array);
> +
> +/**
> + * drm_gpuva_manager_validate() - validate all BOs marked as evicted
> + * @mgr: the &drm_gpuva_manager to validate evicted BOs
> + *
> + * Calls the &drm_gpuva_fn_ops.bo_validate callback for all evicted buffer
> + * objects being mapped in the given &drm_gpuva_manager.
> + *
> + * Returns: 0 on success, negative error code on failure.
> + */
> +int
> +drm_gpuva_manager_validate(struct drm_gpuva_manager *mgr)
> +{
> +	const struct drm_gpuva_fn_ops *ops = mgr->ops;
> +	struct drm_gpuva_gem *vm_bo;
> +	int ret;
> +
> +	if (unlikely(!ops || !ops->bo_validate))
> +		return -ENOTSUPP;
> +
> +	/* At this point we should hold all dma-resv locks of all GEM objects
> +	 * associated with this GPU-VM, hence it is safe to walk the list.
> +	 */
> +	list_for_each_entry(vm_bo, &mgr->evict.list, list.entry.evict) {
> +		dma_resv_assert_held(vm_bo->obj->resv);
> +
> +		ret = ops->bo_validate(vm_bo->obj);
> +		if (ret)
> +			return ret;
> +	}
> +
> +	return 0;
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_validate);
> +
> +/**
> + * drm_gpuva_manager_resv_add_fence - add fence to private and all extobj
> + * dma-resv
> + * @mgr: the &drm_gpuva_manager to add a fence to
> + * @fence: fence to add
> + * @private_usage: private dma-resv usage
> + * @extobj_usage: extobj dma-resv usage
> + */
> +void
> +drm_gpuva_manager_resv_add_fence(struct drm_gpuva_manager *mgr,
> +				 struct dma_fence *fence,
> +				 enum dma_resv_usage private_usage,
> +				 enum dma_resv_usage extobj_usage)
> +{
> +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
> +	struct drm_gem_object *obj;
> +	unsigned long index;
> +
> +	drm_exec_for_each_locked_object(exec, index, obj) {
> +			dma_resv_assert_held(obj->resv);
> +			dma_resv_add_fence(obj->resv, fence,
> +					   drm_gpuva_is_extobj(mgr, obj) ?
> +					   private_usage : extobj_usage);
> +	}
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_resv_add_fence);
> +
> +static struct drm_gpuva_gem *
> +__drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
> +		     struct drm_gem_object *obj)
> +{
> +	struct drm_gpuva_gem *vm_bo;
> +
> +	drm_gem_gpuva_assert_lock_held(obj);
> +
> +	drm_gem_for_each_gpuva_gem(vm_bo, obj)
> +		if (vm_bo->mgr == mgr)
> +			return vm_bo;
> +
> +	return NULL;
> +}
> +
> +/**
> + * drm_gpuva_gem_create() - create a new instance of struct drm_gpuva_gem
> + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> + * @obj: The &drm_gem_object being mapped in the @mgr.
> + *
> + * If provided by the driver, this function uses the &drm_gpuva_fn_ops
> + * vm_bo_alloc() callback to allocate.
> + *
> + * Returns: a pointer to the &drm_gpuva_gem on success, NULL on failure
> + */
> +struct drm_gpuva_gem *
> +drm_gpuva_gem_create(struct drm_gpuva_manager *mgr,
> +		     struct drm_gem_object *obj)
> +{
> +	const struct drm_gpuva_fn_ops *ops = mgr->ops;
> +	struct drm_gpuva_gem *vm_bo;
> +
> +	if (ops && ops->vm_bo_alloc)
> +		vm_bo = ops->vm_bo_alloc();
> +	else
> +		vm_bo = kzalloc(sizeof(*vm_bo), GFP_KERNEL);
> +
> +	if (unlikely(!vm_bo))
> +		return NULL;
> +
> +	vm_bo->mgr = mgr;
> +	vm_bo->obj = obj;
> +
> +	kref_init(&vm_bo->kref);
> +	INIT_LIST_HEAD(&vm_bo->list.gpuva);
> +	INIT_LIST_HEAD(&vm_bo->list.entry.gem);
> +	INIT_LIST_HEAD(&vm_bo->list.entry.evict);
> +
> +	drm_gem_object_get(obj);
> +
> +	return vm_bo;
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_create);
> +
> +void
> +drm_gpuva_gem_destroy(struct kref *kref)
> +{
> +	struct drm_gpuva_gem *vm_bo = container_of(kref, struct drm_gpuva_gem,
> +						   kref);
> +	const struct drm_gpuva_fn_ops *ops = vm_bo->mgr->ops;
> +
> +	drm_gem_object_put(vm_bo->obj);
> +
> +	if (ops && ops->vm_bo_free)
> +		ops->vm_bo_free(vm_bo);
> +	else
> +		kfree(vm_bo);
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_destroy);
> +
> +/**
> + * drm_gpuva_gem_find() - find the &drm_gpuva_gem for the given
> + * &drm_gpuva_manager and &drm_gem_object
> + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> + * @obj: The &drm_gem_object being mapped in the @mgr.
> + *
> + * Find the &drm_gpuva_gem representing the combination of the given
> + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
> + * count of the &drm_gpuva_gem accordingly.
> + *
> + * Returns: a pointer to the &drm_gpuva_gem on success, NULL on failure
> + */
> +struct drm_gpuva_gem *
> +drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
> +		   struct drm_gem_object *obj)
> +{
> +	struct drm_gpuva_gem *vm_bo = __drm_gpuva_gem_find(mgr, obj);
> +
> +	return vm_bo ? drm_gpuva_gem_get(vm_bo) : NULL;
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_find);
> +
> +/**
> + * drm_gpuva_gem_obtain() - obtains and instance of the &drm_gpuva_gem for the
> + * given &drm_gpuva_manager and &drm_gem_object
> + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> + * @obj: The &drm_gem_object being mapped in the @mgr.
> + *
> + * Find the &drm_gpuva_gem representing the combination of the given
> + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
> + * count of the &drm_gpuva_gem accordingly. If not found, allsocates a new
> + * &drm_gpuva_gem.
> + *
> + * Returns: a pointer to the &drm_gpuva_gem on success, an ERR_PTR on failure
> + */
> +struct drm_gpuva_gem *
> +drm_gpuva_gem_obtain(struct drm_gpuva_manager *mgr,
> +		     struct drm_gem_object *obj)
> +{
> +	struct drm_gpuva_gem *vm_bo;
> +
> +	vm_bo = drm_gpuva_gem_find(mgr, obj);
> +	if (vm_bo)
> +		return vm_bo;
> +
> +	vm_bo = drm_gpuva_gem_create(mgr, obj);
> +	if (!vm_bo)
> +		return ERR_PTR(-ENOMEM);
> +
> +	return vm_bo;
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_obtain);
> +
> +/**
> + * drm_gpuva_gem_obtain_prealloc() - obtains and instance of the &drm_gpuva_gem
> + * for the given &drm_gpuva_manager and &drm_gem_object
> + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> + * @obj: The &drm_gem_object being mapped in the @mgr.
> + *
> + * Find the &drm_gpuva_gem representing the combination of the given
> + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
> + * count of the found &drm_gpuva_gem accordingly, while the @__vm_bo reference
> + * count is decreased. If not found @__vm_bo is returned.
> + *
> + * Returns: a pointer to the found &drm_gpuva_gem or @__vm_bo if no existing
> + * &drm_gpuva_gem was found
> + */
> +struct drm_gpuva_gem *
> +drm_gpuva_gem_obtain_prealloc(struct drm_gpuva_manager *mgr,
> +			      struct drm_gem_object *obj,
> +			      struct drm_gpuva_gem *__vm_bo)
> +{
> +	struct drm_gpuva_gem *vm_bo;
> +
> +	vm_bo = drm_gpuva_gem_find(mgr, obj);
> +	if (vm_bo) {
> +		drm_gpuva_gem_put(__vm_bo);
> +		return vm_bo;
> +	}
> +
> +	return __vm_bo;
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_obtain_prealloc);
> +
> +static int
> +__drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
> +			  struct drm_gem_object *obj,
> +			  gfp_t gfp)
> +{
> +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
> +	union {
> +		struct drm_gem_object *obj;
> +		uintptr_t index;
> +	} gem;
> +	union {
> +		void *ptr;
> +		uintptr_t cnt;
> +	} ref;
> +	int ret = 0;
> +
> +	gem.obj = obj;
> +	mas_set(&mas, gem.index);
> +
> +	mas_lock(&mas);
> +	ref.ptr = mas_walk(&mas);
> +	if (ref.ptr) {
> +		++ref.cnt;
> +		mas_store(&mas, ref.ptr);
> +	} else {
> +		if (unlikely(!gfp)) {
> +			ret = -EINVAL;
> +			goto out;
> +		}
> +
> +		mas_set(&mas, gem.index);
> +		ref.cnt = 1;
> +		ret = mas_store_gfp(&mas, ref.ptr, gfp);
> +		if (likely(!ret))
> +			drm_gem_object_get(obj);
> +	}
> +out:
> +	mas_unlock(&mas);
> +	return ret;
> +}
> +
> +static void
> +__drm_gpuva_extobj_remove(struct drm_gpuva_manager *mgr,
> +			  struct drm_gem_object *obj)
> +{
> +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
> +	union {
> +		struct drm_gem_object *obj;
> +		uintptr_t index;
> +	} gem;
> +	union {
> +		void *ptr;
> +		uintptr_t cnt;
> +	} ref;
> +
> +	gem.obj = obj;
> +	mas_set(&mas, gem.index);
> +
> +	mas_lock(&mas);
> +	if (unlikely(!(ref.ptr = mas_walk(&mas))))
> +		goto out;
> +
> +	if (!--ref.cnt) {
> +		mas_erase(&mas);
> +		drm_gem_object_put(obj);
> +	} else {
> +		mas_store(&mas, ref.ptr);
> +	}
> +out:
> +	mas_unlock(&mas);
> +}
> +
> +/**
> + * drm_gpuva_extobj_insert - insert an external &drm_gem_object
> + * @mgr: the &drm_gpuva_manager to insert into
> + * @obj: the &drm_gem_object to insert as extobj
> + *
> + * Insert a &drm_gem_object into the &drm_gpuva_managers external object tree.
> + * If the &drm_gem_object already exists in the tree, the reference counter
> + * of this external object is increased by one.
> + *
> + * Drivers should insert the external &drm_gem_object before the dma-fence
> + * signalling critical section, e.g. when submitting the job, and before
> + * locking all &drm_gem_objects of a GPU-VM, e.g. with drm_gpuva_manager_lock()
> + * or its dervates.
> + *
> + * Returns: 0 on success, negative error code on failure.
> + */
> +int
> +drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
> +			struct drm_gem_object *obj)
> +{
> +	return drm_gpuva_is_extobj(mgr, obj) ?
> +		__drm_gpuva_extobj_insert(mgr, obj, GFP_KERNEL) : 0;
> +
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_insert);
> +
> +/**
> + * drm_gpuva_extobj_get - increase the referecne count of an external
> + * &drm_gem_object
> + * @mgr: the &drm_gpuva_manager storing the extobj
> + * @obj: the &drm_gem_object to representing the extobj
> + *
> + * Increases the reference count of the extobj represented by @obj.
> + *
> + * Drivers should call this for every &drm_gpuva backed by a &drm_gem_object
> + * being inserted.
> + *
> + * For &drm_gpuva_op_remap operations drivers should make sure to only take an
> + * additional reference if the re-map operation splits an existing &drm_gpuva
> + * into two separate ones.
> + *
> + * See also drm_gpuva_map_get() and drm_gpuva_remap_get().
> + *
> + * Returns: 0 on success, negative error code on failure.
> + */
> +void
> +drm_gpuva_extobj_get(struct drm_gpuva_manager *mgr,
> +		     struct drm_gem_object *obj)
> +{
> +	if (drm_gpuva_is_extobj(mgr, obj))
> +		WARN(__drm_gpuva_extobj_insert(mgr, obj, 0),
> +		     "Can't increase ref-count of non-existent extobj.");
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_get);
> +
> +/**
> + * drm_gpuva_extobj_put - decrease the referecne count of an external
> + * &drm_gem_object
> + * @mgr: the &drm_gpuva_manager storing the extobj
> + * @obj: the &drm_gem_object representing the extobj
> + *
> + * Decreases the reference count of the extobj represented by @obj.
> + *
> + * Drivers should call this for every &drm_gpuva backed by a &drm_gem_object
> + * being removed from the GPU VA space.
> + *
> + * See also drm_gpuva_unmap_put().
> + */
> +void
> +drm_gpuva_extobj_put(struct drm_gpuva_manager *mgr,
> +		     struct drm_gem_object *obj)
> +{
> +	if (drm_gpuva_is_extobj(mgr, obj))
> +		__drm_gpuva_extobj_remove(mgr, obj);
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_put);
> +
> +/**
> + * drm_gpuva_gem_evict() - add / remove a &drm_gem_object to / from a
> + * &drm_gpuva_managers evicted list
> + * @obj: the &drm_gem_object to add or remove
> + * @evict: indicates whether the object is evicted
> + *
> + * Adds the given &drm_gem_object to, or removes it from, the evicted list of
> + * all &drm_gpuva_managers containing a mapping of this &drm_gem_object.
> + */
> +void
> +drm_gpuva_gem_evict(struct drm_gem_object *obj, bool evict)
> +{
> +	struct drm_gpuva_gem *vm_bo;
> +
> +	/* Required for iterating the GEMs GPUVA GEM list. If no driver specific
> +	 * lock has been set, the list is protected with the GEMs dma-resv lock.
> +	 */
> +	drm_gem_gpuva_assert_lock_held(obj);
> +
> +	/* Required to protect the GPUVA managers evict list against concurrent
> +	 * access through drm_gpuva_manager_validate(). Concurrent insertions to
> +	 * the evict list through different GEM object evictions are protected
> +	 * by the GPUVA managers evict lock.
> +	 */
> +	dma_resv_assert_held(obj->resv);
> +
> +	drm_gem_for_each_gpuva_gem(vm_bo, obj) {
> +		struct drm_gpuva_manager *mgr = vm_bo->mgr;
> +
> +		spin_lock(&mgr->evict.lock);
> +		if (evict)
> +			list_add_tail(&vm_bo->list.entry.evict,
> +				      &mgr->evict.list);
> +		else
> +			list_del_init(&vm_bo->list.entry.evict);
> +		spin_unlock(&mgr->evict.lock);
> +	}
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_evict);
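
For illustration only (not part of the diff; bo_to_gem() and the exact callback
placement are assumptions), a TTM based driver could forward its move
notification roughly like this, given that the BO's dma-resv is held at that
point:

static void my_bo_move_notify(struct ttm_buffer_object *bo, bool evict)
{
	struct drm_gem_object *obj = bo_to_gem(bo);	/* hypothetical helper */

	/* The dma-resv of @obj is held here, which also protects its GPUVA
	 * list when no driver specific lock has been set.
	 */
	drm_gpuva_gem_evict(obj, evict);
}
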
> +
>   static int
>   __drm_gpuva_insert(struct drm_gpuva_manager *mgr,
>   		   struct drm_gpuva *va)
> @@ -806,15 +1381,20 @@ EXPORT_SYMBOL_GPL(drm_gpuva_remove);
>   /**
>    * drm_gpuva_link() - link a &drm_gpuva
>    * @va: the &drm_gpuva to link
> + * @vm_bo: the &drm_gpuva_gem to add the &drm_gpuva to
>    *
> - * This adds the given &va to the GPU VA list of the &drm_gem_object it is
> - * associated with.
> + * This adds the given &va to the GPU VA list of the &drm_gpuva_gem and the
> + * &drm_gpuva_gem to the &drm_gem_object it is associated with.
> + *
> + * For every &drm_gpuva entry added to the &drm_gpuva_gem an additional
> + * reference of the latter is taken.
>    *
>    * This function expects the caller to protect the GEM's GPUVA list against
> - * concurrent access using the GEMs dma_resv lock.
> + * concurrent access using either the GEMs dma_resv lock or a driver specific
> + * lock set through drm_gem_gpuva_set_lock().
>    */
>   void
> -drm_gpuva_link(struct drm_gpuva *va)
> +drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo)
>   {
>   	struct drm_gem_object *obj = va->gem.obj;
>   
> @@ -823,7 +1403,10 @@ drm_gpuva_link(struct drm_gpuva *va)
>   
>   	drm_gem_gpuva_assert_lock_held(obj);
>   
> -	list_add_tail(&va->gem.entry, &obj->gpuva.list);
> +	drm_gpuva_gem_get(vm_bo);
> +	list_add_tail(&va->gem.entry, &vm_bo->list.gpuva);
> +	if (list_empty(&vm_bo->list.entry.gem))
> +		list_add_tail(&vm_bo->list.entry.gem, &obj->gpuva.list);
>   }
>   EXPORT_SYMBOL_GPL(drm_gpuva_link);
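
A small sketch of the intended calling convention (illustration only; mgr, va
and the surrounding error handling come from hypothetical driver code): the
vm_bo is obtained first, the new mapping is linked to it, and the local
reference is dropped again since drm_gpuva_link() takes its own:

	struct drm_gpuva_gem *vm_bo;

	vm_bo = drm_gpuva_gem_obtain(mgr, va->gem.obj);
	if (IS_ERR(vm_bo))
		return PTR_ERR(vm_bo);

	drm_gpuva_link(va, vm_bo);
	drm_gpuva_gem_put(vm_bo);	/* drm_gpuva_link() holds its own reference */
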
>   
> @@ -834,20 +1417,39 @@ EXPORT_SYMBOL_GPL(drm_gpuva_link);
>    * This removes the given &va from the GPU VA list of the &drm_gem_object it is
>    * associated with.
>    *
> + * This removes the given &va from the GPU VA list of the &drm_gpuva_gem and
> + * the &drm_gpuva_gem from the &drm_gem_object it is associated with in case
> + * this call unlinks the last &drm_gpuva from the &drm_gpuva_gem.
> + *
> + * For every &drm_gpuva entry removed from the &drm_gpuva_gem a reference of
> + * the latter is dropped.
> + *
>    * This function expects the caller to protect the GEM's GPUVA list against
> - * concurrent access using the GEMs dma_resv lock.
> + * concurrent access using either the GEMs dma_resv lock or a driver specific
> + * lock set through drm_gem_gpuva_set_lock().
>    */
>   void
>   drm_gpuva_unlink(struct drm_gpuva *va)
>   {
>   	struct drm_gem_object *obj = va->gem.obj;
> +	struct drm_gpuva_gem *vm_bo;
>   
>   	if (unlikely(!obj))
>   		return;
>   
>   	drm_gem_gpuva_assert_lock_held(obj);
>   
> +	vm_bo = __drm_gpuva_gem_find(va->mgr, obj);
> +	if (WARN(!vm_bo, "GPUVA doesn't seem to be linked.\n"))
> +		return;
> +
>   	list_del_init(&va->gem.entry);
> +
> +	if (list_empty(&vm_bo->list.gpuva)) {
> +		list_del_init(&vm_bo->list.entry.gem);
> +		list_del_init(&vm_bo->list.entry.evict);
> +	}
> +	drm_gpuva_gem_put(vm_bo);
>   }
>   EXPORT_SYMBOL_GPL(drm_gpuva_unlink);
>   
> @@ -977,6 +1579,26 @@ drm_gpuva_map(struct drm_gpuva_manager *mgr,
>   }
>   EXPORT_SYMBOL_GPL(drm_gpuva_map);
>   
> +/**
> + * drm_gpuva_map_get() - helper to insert a &drm_gpuva according to a
> + * &drm_gpuva_op_map
> + * @mgr: the &drm_gpuva_manager
> + * @va: the &drm_gpuva to insert
> + * @op: the &drm_gpuva_op_map to initialize @va with
> + *
> + * Initializes the @va from the @op, inserts it into the given @mgr and
> + * increases the reference count of the corresponding extobj.
> + */
> +void
> +drm_gpuva_map_get(struct drm_gpuva_manager *mgr,
> +		  struct drm_gpuva *va,
> +		  struct drm_gpuva_op_map *op)
> +{
> +	drm_gpuva_map(mgr, va, op);
> +	drm_gpuva_extobj_get(mgr, va->gem.obj);
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_map_get);
> +
>   /**
>    * drm_gpuva_remap() - helper to remap a &drm_gpuva according to a
>    * &drm_gpuva_op_remap
> @@ -992,10 +1614,10 @@ drm_gpuva_remap(struct drm_gpuva *prev,
>   		struct drm_gpuva *next,
>   		struct drm_gpuva_op_remap *op)
>   {
> -	struct drm_gpuva *curr = op->unmap->va;
> -	struct drm_gpuva_manager *mgr = curr->mgr;
> +	struct drm_gpuva *va = op->unmap->va;
> +	struct drm_gpuva_manager *mgr = va->mgr;
>   
> -	drm_gpuva_remove(curr);
> +	drm_gpuva_remove(va);
>   
>   	if (op->prev) {
>   		drm_gpuva_init_from_op(prev, op->prev);
> @@ -1009,6 +1631,31 @@ drm_gpuva_remap(struct drm_gpuva *prev,
>   }
>   EXPORT_SYMBOL_GPL(drm_gpuva_remap);
>   
> +/**
> + * drm_gpuva_remap_get() - helper to remap a &drm_gpuva according to a
> + * &drm_gpuva_op_remap
> + * @prev: the &drm_gpuva to remap when keeping the start of a mapping
> + * @next: the &drm_gpuva to remap when keeping the end of a mapping
> + * @op: the &drm_gpuva_op_remap to initialize @prev and @next with
> + *
> + * Removes the currently mapped &drm_gpuva and remaps it using @prev and/or
> + * @next. Additionally, if the re-map splits the existing &drm_gpuva into two
> + * separate mappings, increases the reference count of the corresponding extobj.
> + */
> +void
> +drm_gpuva_remap_get(struct drm_gpuva *prev,
> +		    struct drm_gpuva *next,
> +		    struct drm_gpuva_op_remap *op)
> +{
> +	struct drm_gpuva *va = op->unmap->va;
> +	struct drm_gpuva_manager *mgr = va->mgr;
> +
> +	drm_gpuva_remap(prev, next, op);
> +	if (op->prev && op->next)
> +		drm_gpuva_extobj_get(mgr, va->gem.obj);
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_remap_get);
> +
>   /**
>    * drm_gpuva_unmap() - helper to remove a &drm_gpuva according to a
>    * &drm_gpuva_op_unmap
> @@ -1023,6 +1670,24 @@ drm_gpuva_unmap(struct drm_gpuva_op_unmap *op)
>   }
>   EXPORT_SYMBOL_GPL(drm_gpuva_unmap);
>   
> +/**
> + * drm_gpuva_unmap_put() - helper to remove a &drm_gpuva according to a
> + * &drm_gpuva_op_unmap
> + * @op: the &drm_gpuva_op_unmap specifying the &drm_gpuva to remove
> + *
> + * Removes the &drm_gpuva associated with the &drm_gpuva_op_unmap and decreases
> + * the reference count of the corresponding extobj.
> + */
> +void
> +drm_gpuva_unmap_put(struct drm_gpuva_op_unmap *op)
> +{
> +	struct drm_gpuva *va = op->va;
> +
> +	drm_gpuva_unmap(op);
> +	drm_gpuva_extobj_put(va->mgr, va->gem.obj);
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_unmap_put);
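
An illustrative pair of &drm_gpuva_fn_ops steps using the new _get()/_put()
helpers (struct my_ctx is made up; linking/unlinking is omitted for brevity):

static int my_sm_step_map(struct drm_gpuva_op *op, void *priv)
{
	struct my_ctx *ctx = priv;	/* hypothetical driver context */

	/* Inserts the VA and takes an extobj reference in one go. */
	drm_gpuva_map_get(ctx->mgr, ctx->new_va, &op->map);
	return 0;
}

static int my_sm_step_unmap(struct drm_gpuva_op *op, void *priv)
{
	/* Drops the extobj reference along with the mapping. */
	drm_gpuva_unmap_put(&op->unmap);
	return 0;
}
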
> +
>   static int
>   op_map_cb(const struct drm_gpuva_fn_ops *fn, void *priv,
>   	  u64 addr, u64 range,
> @@ -1663,6 +2328,7 @@ drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
>   {
>   	struct drm_gpuva_ops *ops;
>   	struct drm_gpuva_op *op;
> +	struct drm_gpuva_gem *vm_bo;
>   	struct drm_gpuva *va;
>   	int ret;
>   
> @@ -1674,7 +2340,7 @@ drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
>   
>   	INIT_LIST_HEAD(&ops->list);
>   
> -	drm_gem_for_each_gpuva(va, obj) {
> +	drm_gem_for_each_gpuva(va, vm_bo, mgr, obj) {
>   		op = gpuva_op_alloc(mgr);
>   		if (!op) {
>   			ret = -ENOMEM;
> diff --git a/include/drm/drm_gem.h b/include/drm/drm_gem.h
> index bc9f6aa2f3fe..783ed3ab440d 100644
> --- a/include/drm/drm_gem.h
> +++ b/include/drm/drm_gem.h
> @@ -571,7 +571,7 @@ int drm_gem_evict(struct drm_gem_object *obj);
>    * drm_gem_gpuva_init() - initialize the gpuva list of a GEM object
>    * @obj: the &drm_gem_object
>    *
> - * This initializes the &drm_gem_object's &drm_gpuva list.
> + * This initializes the &drm_gem_object's &drm_gpuva_gem list.
>    *
>    * Calling this function is only necessary for drivers intending to support the
>    * &drm_driver_feature DRIVER_GEM_GPUVA.
> @@ -584,28 +584,44 @@ static inline void drm_gem_gpuva_init(struct drm_gem_object *obj)
>   }
>   
>   /**
> - * drm_gem_for_each_gpuva() - iternator to walk over a list of gpuvas
> - * @entry__: &drm_gpuva structure to assign to in each iteration step
> - * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
> + * drm_gem_for_each_gpuva_gem() - iterator to walk over a list of &drm_gpuva_gem
> + * @entry__: &drm_gpuva_gem structure to assign to in each iteration step
> + * @obj__: the &drm_gem_object the &drm_gpuva_gem to walk are associated with
>    *
> - * This iterator walks over all &drm_gpuva structures associated with the
> - * &drm_gpuva_manager.
> + * This iterator walks over all &drm_gpuva_gem structures associated with the
> + * &drm_gem_object.
>    */
> -#define drm_gem_for_each_gpuva(entry__, obj__) \
> -	list_for_each_entry(entry__, &(obj__)->gpuva.list, gem.entry)
> +#define drm_gem_for_each_gpuva_gem(entry__, obj__) \
> +	list_for_each_entry(entry__, &(obj__)->gpuva.list, list.entry.gem)
>   
>   /**
> - * drm_gem_for_each_gpuva_safe() - iternator to safely walk over a list of
> - * gpuvas
> - * @entry__: &drm_gpuva structure to assign to in each iteration step
> - * @next__: &next &drm_gpuva to store the next step
> - * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
> + * drm_gem_for_each_gpuva_gem_safe() - iterator to safely walk over a list of
> + * &drm_gpuva_gem
> + * @entry__: &drm_gpuva_gem structure to assign to in each iteration step
> + * @next__: another &drm_gpuva_gem to store the next step
> + * @obj__: the &drm_gem_object the &drm_gpuva_gem to walk are associated with
>    *
> - * This iterator walks over all &drm_gpuva structures associated with the
> + * This iterator walks over all &drm_gpuva_gem structures associated with the
>    * &drm_gem_object. It is implemented with list_for_each_entry_safe(), hence
>    * it is save against removal of elements.
>    */
> -#define drm_gem_for_each_gpuva_safe(entry__, next__, obj__) \
> -	list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list, gem.entry)
> +#define drm_gem_for_each_gpuva_gem_safe(entry__, next__, obj__) \
> +	list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list, list.entry.gem)
> +
> +/**
> + * drm_gem_for_each_gpuva() - iterator to walk over a list of &drm_gpuva
> + * @va__: &drm_gpuva structure to assign to in each iteration step
> + * @vm_bo__: the &drm_gpuva_gem representing the @mgr__ and @obj__ combination
> + * @mgr__: the &drm_gpuva_manager the &drm_gpuvas to walk are associated with
> + * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
> + *
> + * This iterator walks over all &drm_gpuva structures associated with the
> + * &drm_gpuva_manager and &drm_gem_object.
> + */
> +#define drm_gem_for_each_gpuva(va__, vm_bo__, mgr__, obj__) \
> +	for (vm_bo__ = drm_gpuva_gem_find(mgr__, obj__), \
> +	     va__ = vm_bo__ ? list_first_entry(&vm_bo__->list.gpuva, typeof(*va__), gem.entry) : NULL; \
> +	     va__ && !list_entry_is_head(va__, &vm_bo__->list.gpuva, gem.entry); \
> +	     va__ = list_next_entry(va__, gem.entry))
>   
>   #endif /* __DRM_GEM_H__ */
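
A short usage sketch for the reworked iterator (illustration only; mgr and obj
come from hypothetical driver code and the required GPUVA list lock is assumed
to be held):

	struct drm_gpuva_gem *vm_bo;
	struct drm_gpuva *va;

	drm_gem_gpuva_assert_lock_held(obj);

	/* Walk only the mappings this GPU-VM has for @obj. */
	drm_gem_for_each_gpuva(va, vm_bo, mgr, obj)
		pr_debug("mapping: addr=0x%llx range=0x%llx\n",
			 va->va.addr, va->va.range);
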
> diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h
> index ed8d50200cc3..693e2da3f425 100644
> --- a/include/drm/drm_gpuva_mgr.h
> +++ b/include/drm/drm_gpuva_mgr.h
> @@ -26,12 +26,16 @@
>    */
>   
>   #include <linux/list.h>
> +#include <linux/dma-resv.h>
> +#include <linux/maple_tree.h>
>   #include <linux/rbtree.h>
>   #include <linux/types.h>
>   
>   #include <drm/drm_gem.h>
> +#include <drm/drm_exec.h>
>   
>   struct drm_gpuva_manager;
> +struct drm_gpuva_gem;
>   struct drm_gpuva_fn_ops;
>   
>   /**
> @@ -140,7 +144,7 @@ struct drm_gpuva {
>   int drm_gpuva_insert(struct drm_gpuva_manager *mgr, struct drm_gpuva *va);
>   void drm_gpuva_remove(struct drm_gpuva *va);
>   
> -void drm_gpuva_link(struct drm_gpuva *va);
> +void drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo);
>   void drm_gpuva_unlink(struct drm_gpuva *va);
>   
>   struct drm_gpuva *drm_gpuva_find(struct drm_gpuva_manager *mgr,
> @@ -240,15 +244,137 @@ struct drm_gpuva_manager {
>   	 * @ops: &drm_gpuva_fn_ops providing the split/merge steps to drivers
>   	 */
>   	const struct drm_gpuva_fn_ops *ops;
> +
> +	/**
> +	 * @d_obj: Dummy GEM object; used internally to pass the GPU VMs
> +	 * dma-resv to &drm_exec.
> +	 */
> +	struct drm_gem_object d_obj;
> +
> +	/**
> +	 * @resv: the &dma_resv for &drm_gem_objects mapped in this GPU VA
> +	 * space
> +	 */
> +	struct dma_resv *resv;
> +
> +	/**
> +	 * @exec: the &drm_exec helper to lock external &drm_gem_objects
> +	 */
> +	struct drm_exec exec;
> +
> +	/**
> +	 * @mt_ext: &maple_tree storing external &drm_gem_objects
> +	 */
> +	struct maple_tree mt_ext;
> +
> +	/**
> +	 * @evict: structure holding the evict list and evict list lock
> +	 */
> +	struct {
> +		/**
> +		 * @list: &list_head storing &drm_gem_objects currently being
> +		 * evicted
> +		 */
> +		struct list_head list;
> +
> +		/**
> +		 * @lock: spinlock to protect the evict list against concurrent
> +		 * insertion / removal of different &drm_gpuva_gems
> +		 */
> +		spinlock_t lock;
> +	} evict;
>   };
>   
>   void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> +			    struct drm_device *drm,
>   			    const char *name,
>   			    u64 start_offset, u64 range,
>   			    u64 reserve_offset, u64 reserve_range,
>   			    const struct drm_gpuva_fn_ops *ops);
>   void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr);
>   
> +/**
> + * DRM_GPUVA_EXEC - returns the &drm_gpuva_managers &drm_exec instance
> + * @mgr: the &drm_gpuva_manager to return the &drm_exec instance for
> + */
> +#define DRM_GPUVA_EXEC(mgr)	&(mgr)->exec
> +
> +int drm_gpuva_manager_lock_extra(struct drm_gpuva_manager *mgr,
> +				 int (*fn)(struct drm_gpuva_manager *mgr,
> +					   void *priv, unsigned int num_fences),
> +				 void *priv,
> +				 unsigned int num_fences,
> +				 bool interruptible);
> +
> +int drm_gpuva_manager_lock_array(struct drm_gpuva_manager *mgr,
> +				 struct drm_gem_object **objs,
> +				 unsigned int num_objs,
> +				 unsigned int num_fences,
> +				 bool interruptible);
> +
> +/**
> + * drm_gpuva_manager_lock() - lock all dma-resv of all associated BOs
> + * @mgr: the &drm_gpuva_manager
> + * @num_fences: the amount of &dma_fences to reserve
> + * @interruptible: sleep interruptible if waiting
> + *
> + * Acquires all dma-resv locks of all &drm_gem_objects the given
> + * &drm_gpuva_manager contains mappings of.
> + *
> + * Returns: 0 on success, negative error code on failure.
> + */
> +static inline int
> +drm_gpuva_manager_lock(struct drm_gpuva_manager *mgr,
> +		       unsigned int num_fences,
> +		       bool interruptible)
> +{
> +	return drm_gpuva_manager_lock_extra(mgr, NULL, NULL, num_fences,
> +					    interruptible);
> +}
> +
> +/**
> + * drm_gpuva_manager_unlock() - unlock dma-resv of all associated BOs
> + * @mgr: the &drm_gpuva_manager
> + *
> + * Releases all dma-resv locks of all &drm_gem_objects previously acquired
> + * through drm_gpuva_manager_lock() or its variants.
> + */
> +static inline void
> +drm_gpuva_manager_unlock(struct drm_gpuva_manager *mgr)
> +{
> +	drm_exec_fini(&mgr->exec);
> +}
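
Taken together with drm_gpuva_manager_validate() and
drm_gpuva_manager_resv_add_fence() declared below, a driver's job submission
could use these helpers roughly as follows (sketch only; my_run_job() and the
chosen dma-resv usages are assumptions, not part of the patch):

static int my_exec_job(struct drm_gpuva_manager *mgr, struct my_job *job)
{
	struct dma_fence *fence;
	int ret;

	ret = drm_gpuva_manager_lock(mgr, 1, true);
	if (ret)
		return ret;

	/* Re-validate everything that was evicted since the last submission. */
	ret = drm_gpuva_manager_validate(mgr);
	if (ret)
		goto out_unlock;

	fence = my_run_job(job);	/* hypothetical */
	/* The usage split between private and external BOs is driver specific. */
	drm_gpuva_manager_resv_add_fence(mgr, fence,
					 DMA_RESV_USAGE_BOOKKEEP,
					 DMA_RESV_USAGE_WRITE);
	dma_fence_put(fence);

out_unlock:
	drm_gpuva_manager_unlock(mgr);
	return ret;
}
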
> +
> +int drm_gpuva_manager_validate(struct drm_gpuva_manager *mgr);
> +void drm_gpuva_manager_resv_add_fence(struct drm_gpuva_manager *mgr,
> +				      struct dma_fence *fence,
> +				      enum dma_resv_usage private_usage,
> +				      enum dma_resv_usage extobj_usage);
> +
> +int drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
> +			    struct drm_gem_object *obj);
> +void drm_gpuva_extobj_get(struct drm_gpuva_manager *mgr,
> +			  struct drm_gem_object *obj);
> +void drm_gpuva_extobj_put(struct drm_gpuva_manager *mgr,
> +			  struct drm_gem_object *obj);
> +
> +/**
> + * drm_gpuva_is_extobj() - indicates whether the given &drm_gem_object is an
> + * external object
> + * @mgr: the &drm_gpuva_manager to check
> + * @obj: the &drm_gem_object to check
> + *
> + * Returns: true if the &drm_gem_object &dma_resv differs from the
> + * &drm_gpuva_managers &dma_resv, false otherwise
> + */
> +static inline bool drm_gpuva_is_extobj(struct drm_gpuva_manager *mgr,
> +				       struct drm_gem_object *obj)
> +{
> +	return obj && obj->resv != mgr->resv;
> +}
> +
>   static inline struct drm_gpuva *
>   __drm_gpuva_next(struct drm_gpuva *va)
>   {
> @@ -327,6 +453,138 @@ __drm_gpuva_next(struct drm_gpuva *va)
>   #define drm_gpuva_for_each_va_safe(va__, next__, mgr__) \
>   	list_for_each_entry_safe(va__, next__, &(mgr__)->rb.list, rb.entry)
>   
> +/**
> + * struct drm_gpuva_gem - structure representing a &drm_gpuva_manager and
> + * &drm_gem_object combination
> + *
> + * This structure is an abstraction representing a &drm_gpuva_manager and
> + * &drm_gem_object combination. It serves as an indirection to accelerate
> + * iterating all &drm_gpuvas within a &drm_gpuva_manager backed by the same
> + * &drm_gem_object.
> + *
> + * Furthermore it is used to cache evicted GEM objects for a certain GPU-VM to
> + * accelerate validation.
> + *
> + * Typically, drivers want to create an instance of a struct drm_gpuva_gem once
> + * a GEM object is first mapped in a GPU-VM and release the instance once the
> + * last mapping of the GEM object in this GPU-VM is unmapped.
> + */
> +struct drm_gpuva_gem {
> +
> +	/**
> +	 * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> +	 */
> +	struct drm_gpuva_manager *mgr;
> +
> +	/**
> +	 * @obj: The &drm_gem_object being mapped in the @mgr.
> +	 */
> +	struct drm_gem_object *obj;
> +
> +	/**
> +	 * @kref: The reference count for this &drm_gpuva_gem.
> +	 */
> +	struct kref kref;
> +
> +	/**
> +	 * @list: Structure containing all &list_heads.
> +	 */
> +	struct {
> +		/**
> +		 * @gpuva: The list of linked &drm_gpuvas.
> +		 */
> +		struct list_head gpuva;
> +
> +		/**
> +		 * @entry: Structure containing all &list_heads serving as
> +		 * entry.
> +		 */
> +		struct {
> +			/**
> +			 * @gem: List entry to attach to the &drm_gem_objects
> +			 * gpuva list.
> +			 */
> +			struct list_head gem;
> +
> +			/**
> +			 * @evict: List entry to attach to the
> +			 * &drm_gpuva_managers evict list.
> +			 */
> +			struct list_head evict;
> +		} entry;
> +	} list;
> +};
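
To illustrate the embedding pattern mentioned above (struct my_vm_bo is a made
up driver structure; only the allocation side is shown):

struct my_vm_bo {
	struct drm_gpuva_gem base;
	struct list_head bind_list;	/* hypothetical driver state */
};

static struct drm_gpuva_gem *my_vm_bo_alloc(void)
{
	struct my_vm_bo *vm_bo = kzalloc(sizeof(*vm_bo), GFP_KERNEL);

	return vm_bo ? &vm_bo->base : NULL;
}

static void my_vm_bo_free(struct drm_gpuva_gem *vm_bo)
{
	kfree(container_of(vm_bo, struct my_vm_bo, base));
}

Both helpers would be hooked up through the vm_bo_alloc()/vm_bo_free()
callbacks of &drm_gpuva_fn_ops further below.
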
> +
> +struct drm_gpuva_gem *
> +drm_gpuva_gem_obtain(struct drm_gpuva_manager *mgr,
> +		     struct drm_gem_object *obj);
> +struct drm_gpuva_gem *
> +drm_gpuva_gem_obtain_prealloc(struct drm_gpuva_manager *mgr,
> +			      struct drm_gem_object *obj,
> +			      struct drm_gpuva_gem *__vm_bo);
> +
> +struct drm_gpuva_gem *
> +drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
> +		   struct drm_gem_object *obj);
> +
> +void drm_gpuva_gem_evict(struct drm_gem_object *obj, bool evict);
> +
> +struct drm_gpuva_gem *
> +drm_gpuva_gem_create(struct drm_gpuva_manager *mgr,
> +		     struct drm_gem_object *obj);
> +void drm_gpuva_gem_destroy(struct kref *kref);
> +
> +/**
> + * drm_gpuva_gem_get() - acquire a struct drm_gpuva_gem reference
> + * @vm_bo: the &drm_gpuva_gem to acquire the reference of
> + *
> + * This function acquires an additional reference to @vm_bo. It is illegal to
> + * call this without already holding a reference. No locks required.
> + */
> +static inline struct drm_gpuva_gem *
> +drm_gpuva_gem_get(struct drm_gpuva_gem *vm_bo)
> +{
> +	kref_get(&vm_bo->kref);
> +	return vm_bo;
> +}
> +
> +/**
> + * drm_gpuva_gem_put() - drop a struct drm_gpuva_gem reference
> + * @vm_bo: the &drm_gpuva_gem to release the reference of
> + *
> + * This releases a reference to @vm_bo.
> + */
> +static inline void
> +drm_gpuva_gem_put(struct drm_gpuva_gem *vm_bo)
> +{
> +	kref_put(&vm_bo->kref, drm_gpuva_gem_destroy);
> +}
> +
> +/**
> + * drm_gpuva_gem_for_each_va() - iterator to walk over a list of &drm_gpuva
> + * @va__: &drm_gpuva structure to assign to in each iteration step
> + * @vm_bo__: the &drm_gpuva_gem the &drm_gpuvas to walk are associated with
> + *
> + * This iterator walks over all &drm_gpuva structures associated with the
> + * &drm_gpuva_gem.
> + */
> +#define drm_gpuva_gem_for_each_va(va__, vm_bo__) \
> +	list_for_each_entry(va__, &(vm_bo__)->list.gpuva, gem.entry)
> +
> +/**
> + * drm_gpuva_gem_for_each_va_safe() - iterator to safely walk over a list of
> + * &drm_gpuva
> + * @va__: &drm_gpuva structure to assign to in each iteration step
> + * @next__: &next &drm_gpuva to store the next step
> + * @vm_bo__: the &drm_gpuva_gem the &drm_gpuvas to walk are associated with
> + *
> + * This iterator walks over all &drm_gpuva structures associated with the
> + * &drm_gpuva_gem. It is implemented with list_for_each_entry_safe(), hence
> + * it is save against removal of elements.
> + */
> +#define drm_gpuva_gem_for_each_va_safe(va__, next__, vm_bo__) \
> +	list_for_each_entry_safe(va__, next__, &(vm_bo__)->list.gpuva, gem.entry)
> +
>   /**
>    * enum drm_gpuva_op_type - GPU VA operation type
>    *
> @@ -641,6 +899,30 @@ struct drm_gpuva_fn_ops {
>   	 */
>   	void (*op_free)(struct drm_gpuva_op *op);
>   
> +	/**
> +	 * @vm_bo_alloc: called when the &drm_gpuva_manager allocates
> +	 * a struct drm_gpuva_gem
> +	 *
> +	 * Some drivers may want to embed struct drm_gpuva_gem into driver
> +	 * specific structures. By implementing this callback drivers can
> +	 * allocate memory accordingly.
> +	 *
> +	 * This callback is optional.
> +	 */
> +	struct drm_gpuva_gem *(*vm_bo_alloc)(void);
> +
> +	/**
> +	 * @vm_bo_free: called when the &drm_gpuva_manager frees a
> +	 * struct drm_gpuva_gem
> +	 *
> +	 * Some drivers may want to embed struct drm_gpuva_gem into driver
> +	 * specific structures. By implementing this callback drivers can
> +	 * free the previously allocated memory accordingly.
> +	 *
> +	 * This callback is optional.
> +	 */
> +	void (*vm_bo_free)(struct drm_gpuva_gem *vm_bo);
> +
>   	/**
>   	 * @sm_step_map: called from &drm_gpuva_sm_map to finally insert the
>   	 * mapping once all previous steps were completed
> @@ -684,6 +966,17 @@ struct drm_gpuva_fn_ops {
>   	 * used.
>   	 */
>   	int (*sm_step_unmap)(struct drm_gpuva_op *op, void *priv);
> +
> +	/**
> +	 * @bo_validate: called from drm_gpuva_manager_validate()
> +	 *
> +	 * Drivers receive this callback for every evicted &drm_gem_object being
> +	 * mapped in the corresponding &drm_gpuva_manager.
> +	 *
> +	 * Typically, drivers would call their driver specific variant of
> +	 * ttm_bo_validate() from within this callback.
> +	 */
> +	int (*bo_validate)(struct drm_gem_object *obj);
>   };
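
A hedged sketch of how a driver might wire up the new callbacks
(my_vm_bo_alloc()/my_vm_bo_free() are the hypothetical helpers sketched
earlier, my_validate_bo() stands in for the driver's ttm_bo_validate()
wrapper):

static int my_bo_validate(struct drm_gem_object *obj)
{
	return my_validate_bo(obj);	/* hypothetical driver helper */
}

static const struct drm_gpuva_fn_ops my_gpuva_ops = {
	.vm_bo_alloc	= my_vm_bo_alloc,
	.vm_bo_free	= my_vm_bo_free,
	.bo_validate	= my_bo_validate,
	/* op_alloc/op_free and the sm_step_* callbacks omitted for brevity */
};
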
>   
>   int drm_gpuva_sm_map(struct drm_gpuva_manager *mgr, void *priv,
> @@ -696,11 +989,18 @@ int drm_gpuva_sm_unmap(struct drm_gpuva_manager *mgr, void *priv,
>   void drm_gpuva_map(struct drm_gpuva_manager *mgr,
>   		   struct drm_gpuva *va,
>   		   struct drm_gpuva_op_map *op);
> +void drm_gpuva_map_get(struct drm_gpuva_manager *mgr,
> +		       struct drm_gpuva *va,
> +		       struct drm_gpuva_op_map *op);
>   
>   void drm_gpuva_remap(struct drm_gpuva *prev,
>   		     struct drm_gpuva *next,
>   		     struct drm_gpuva_op_remap *op);
> +void drm_gpuva_remap_get(struct drm_gpuva *prev,
> +			 struct drm_gpuva *next,
> +			 struct drm_gpuva_op_remap *op);
>   
>   void drm_gpuva_unmap(struct drm_gpuva_op_unmap *op);
> +void drm_gpuva_unmap_put(struct drm_gpuva_op_unmap *op);
>   
>   #endif /* __DRM_GPUVA_MGR_H__ */


^ permalink raw reply	[flat|nested] 88+ messages in thread

* Re: [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
@ 2023-08-30  7:48     ` Christian König
  0 siblings, 0 replies; 88+ messages in thread
From: Christian König @ 2023-08-30  7:48 UTC (permalink / raw)
  To: Danilo Krummrich, airlied, daniel, matthew.brost,
	thomas.hellstrom, sarah.walker, donald.robson, boris.brezillon,
	faith.ekstrand, bskeggs, Liam.Howlett
  Cc: dri-devel, nouveau, linux-kernel



On 20.08.23 at 23:53, Danilo Krummrich wrote:
> So far the DRM GPUVA manager offers common infrastructure to track GPU VA
> allocations and mappings, generically connect GPU VA mappings to their
> backing buffers and perform more complex mapping operations on the GPU VA
> space.
>
> However, there are more design patterns commonly used by drivers, which
> can potentially be generalized in order to make the DRM GPUVA manager
> represent a basic GPU-VM implementation. In this context, this patch aims
> at generalizing the following elements.
>
> 1) Provide a common dma-resv for GEM objects not being used outside of
>     this GPU-VM.
>
> 2) Provide tracking of external GEM objects (GEM objects which are
>     shared with other GPU-VMs).
>
> 3) Provide functions to efficiently lock all GEM objects dma-resv the
>     GPU-VM contains mappings of.
>
> 4) Provide tracking of evicted GEM objects the GPU-VM contains mappings
>     of, such that validation of evicted GEM objects is accelerated.
>
> 5) Provide some convenience functions for common patterns.

Interesting work.

You basically implement a bunch of the ideas I came up with to improve the
amdgpu performance in the common manager now. That was one of the
remaining blockers I had for using this in amdgpu.

The question is, for example, how do you track evictions? E.g. we don't have
a common concept of eviction in GEM as far as I know. Or is the driver
responsible for giving those notifications to the GPUVA manager?

And would it be possible to lock only a specific area of the VM, e.g. 
every BO mapped in the interval X..Y?

Regards,
Christian.

>
> Rather than being designed as a "framework", the target is to make all
> features appear as a collection of optional helper functions, such that
> drivers are free to make use of the DRM GPUVA managers basic
> functionality and opt-in for other features without setting any feature
> flags, just by making use of the corresponding functions.
>
> Signed-off-by: Danilo Krummrich <dakr@redhat.com>
> ---
>   drivers/gpu/drm/drm_gpuva_mgr.c | 688 +++++++++++++++++++++++++++++++-
>   include/drm/drm_gem.h           |  48 ++-
>   include/drm/drm_gpuva_mgr.h     | 302 +++++++++++++-
>   3 files changed, 1010 insertions(+), 28 deletions(-)
>
> diff --git a/drivers/gpu/drm/drm_gpuva_mgr.c b/drivers/gpu/drm/drm_gpuva_mgr.c
> index f86bfad74ff8..69872b205961 100644
> --- a/drivers/gpu/drm/drm_gpuva_mgr.c
> +++ b/drivers/gpu/drm/drm_gpuva_mgr.c
> @@ -655,6 +655,7 @@ drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
>   /**
>    * drm_gpuva_manager_init() - initialize a &drm_gpuva_manager
>    * @mgr: pointer to the &drm_gpuva_manager to initialize
> + * @drm: the drivers &drm_device
>    * @name: the name of the GPU VA space
>    * @start_offset: the start offset of the GPU VA space
>    * @range: the size of the GPU VA space
> @@ -669,6 +670,7 @@ drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
>    */
>   void
>   drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> +		       struct drm_device *drm,
>   		       const char *name,
>   		       u64 start_offset, u64 range,
>   		       u64 reserve_offset, u64 reserve_range,
> @@ -677,6 +679,11 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
>   	mgr->rb.tree = RB_ROOT_CACHED;
>   	INIT_LIST_HEAD(&mgr->rb.list);
>   
> +	mt_init(&mgr->mt_ext);
> +
> +	INIT_LIST_HEAD(&mgr->evict.list);
> +	spin_lock_init(&mgr->evict.lock);
> +
>   	drm_gpuva_check_overflow(start_offset, range);
>   	mgr->mm_start = start_offset;
>   	mgr->mm_range = range;
> @@ -694,6 +701,9 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
>   						     reserve_range)))
>   			__drm_gpuva_insert(mgr, &mgr->kernel_alloc_node);
>   	}
> +
> +	drm_gem_private_object_init(drm, &mgr->d_obj, 0);
> +	mgr->resv = mgr->d_obj.resv;
>   }
>   EXPORT_SYMBOL_GPL(drm_gpuva_manager_init);
>   
> @@ -713,10 +723,575 @@ drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr)
>   		__drm_gpuva_remove(&mgr->kernel_alloc_node);
>   
>   	WARN(!RB_EMPTY_ROOT(&mgr->rb.tree.rb_root),
> -	     "GPUVA tree is not empty, potentially leaking memory.");
> +	     "GPUVA tree is not empty, potentially leaking memory.\n");
> +
> +	mtree_destroy(&mgr->mt_ext);
> +	WARN(!list_empty(&mgr->evict.list), "Evict list should be empty.\n");
> +
> +	drm_gem_private_object_fini(&mgr->d_obj);
>   }
>   EXPORT_SYMBOL_GPL(drm_gpuva_manager_destroy);
>   
> +/**
> + * drm_gpuva_manager_prepare_objects() - prepare all associated BOs
> + * @mgr: the &drm_gpuva_manager
> + * @num_fences: the amount of &dma_fences to reserve
> + *
> + * Calls drm_exec_prepare_obj() for all &drm_gem_objects the given
> + * &drm_gpuva_manager contains mappings of.
> + *
> + * Drivers can obtain the corresponding &drm_exec instance through
> + * DRM_GPUVA_EXEC(). It is the drivers responsibility to call drm_exec_init()
> + * and drm_exec_fini() accordingly.
> + *
> + * Returns: 0 on success, negative error code on failure.
> + */
> +int
> +drm_gpuva_manager_prepare_objects(struct drm_gpuva_manager *mgr,
> +				  unsigned int num_fences)
> +{
> +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
> +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
> +	union {
> +		void *ptr;
> +		uintptr_t cnt;
> +	} ref;
> +	int ret;
> +
> +	ret = drm_exec_prepare_obj(exec, &mgr->d_obj, num_fences);
> +	if (ret)
> +		goto out;
> +
> +	rcu_read_lock();
> +	mas_for_each(&mas, ref.ptr, ULONG_MAX) {
> +		struct drm_gem_object *obj;
> +
> +		mas_pause(&mas);
> +		rcu_read_unlock();
> +
> +		obj = (struct drm_gem_object *)(uintptr_t)mas.index;
> +		ret = drm_exec_prepare_obj(exec, obj, num_fences);
> +		if (ret)
> +			goto out;
> +
> +		rcu_read_lock();
> +	}
> +	rcu_read_unlock();
> +
> +out:
> +	return ret;
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_prepare_objects);
> +
> +/**
> + * drm_gpuva_manager_lock_extra() - lock all dma-resv of all associated BOs
> + * @mgr: the &drm_gpuva_manager
> + * @fn: callback received by the driver to lock additional dma-resv
> + * @priv: private driver data passed to @fn
> + * @num_fences: the amount of &dma_fences to reserve
> + * @interruptible: sleep interruptible if waiting
> + *
> + * Acquires all dma-resv locks of all &drm_gem_objects the given
> + * &drm_gpuva_manager contains mappings of.
> + *
> + * Additionally, when calling this function the driver receives the given @fn
> + * callback to lock additional dma-resv in the context of the
> + * &drm_gpuva_managers &drm_exec instance. Typically, drivers would call
> + * drm_exec_prepare_obj() from within this callback.
> + *
> + * Returns: 0 on success, negative error code on failure.
> + */
> +int
> +drm_gpuva_manager_lock_extra(struct drm_gpuva_manager *mgr,
> +			     int (*fn)(struct drm_gpuva_manager *mgr,
> +				       void *priv, unsigned int num_fences),
> +			     void *priv,
> +			     unsigned int num_fences,
> +			     bool interruptible)
> +{
> +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
> +	uint32_t flags;
> +	int ret;
> +
> +	flags = interruptible ? DRM_EXEC_INTERRUPTIBLE_WAIT : 0 |
> +		DRM_EXEC_IGNORE_DUPLICATES;
> +
> +	drm_exec_init(exec, flags);
> +
> +	drm_exec_until_all_locked(exec) {
> +		ret = drm_gpuva_manager_prepare_objects(mgr, num_fences);
> +		drm_exec_retry_on_contention(exec);
> +		if (ret)
> +			goto err;
> +
> +		if (fn) {
> +			ret = fn(mgr, priv, num_fences);
> +			drm_exec_retry_on_contention(exec);
> +			if (ret)
> +				goto err;
> +		}
> +	}
> +
> +	return 0;
> +
> +err:
> +	drm_exec_fini(exec);
> +	return ret;
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_lock_extra);
> +
> +static int
> +fn_lock_array(struct drm_gpuva_manager *mgr, void *priv,
> +				unsigned int num_fences)
> +{
> +	struct {
> +		struct drm_gem_object **objs;
> +		unsigned int num_objs;
> +	} *args = priv;
> +
> +	return drm_exec_prepare_array(DRM_GPUVA_EXEC(mgr), args->objs,
> +				      args->num_objs, num_fences);
> +}
> +
> +/**
> + * drm_gpuva_manager_lock_array() - lock all dma-resv of all associated BOs
> + * @mgr: the &drm_gpuva_manager
> + * @objs: additional &drm_gem_objects to lock
> + * @num_objs: the number of additional &drm_gem_objects to lock
> + * @num_fences: the amount of &dma_fences to reserve
> + * @interruptible: sleep interruptible if waiting
> + *
> + * Acquires all dma-resv locks of all &drm_gem_objects the given
> + * &drm_gpuva_manager contains mappings of, plus the ones given through @objs.
> + *
> + * Returns: 0 on success, negative error code on failure.
> + */
> +int
> +drm_gpuva_manager_lock_array(struct drm_gpuva_manager *mgr,
> +			     struct drm_gem_object **objs,
> +			     unsigned int num_objs,
> +			     unsigned int num_fences,
> +			     bool interruptible)
> +{
> +	struct {
> +		struct drm_gem_object **objs;
> +		unsigned int num_objs;
> +	} args;
> +
> +	args.objs = objs;
> +	args.num_objs = num_objs;
> +
> +	return drm_gpuva_manager_lock_extra(mgr, fn_lock_array, &args,
> +					    num_fences, interruptible);
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_lock_array);
> +
> +/**
> + * drm_gpuva_manager_validate() - validate all BOs marked as evicted
> + * @mgr: the &drm_gpuva_manager to validate evicted BOs
> + *
> + * Calls the &drm_gpuva_fn_ops.bo_validate callback for all evicted buffer
> + * objects being mapped in the given &drm_gpuva_manager.
> + *
> + * Returns: 0 on success, negative error code on failure.
> + */
> +int
> +drm_gpuva_manager_validate(struct drm_gpuva_manager *mgr)
> +{
> +	const struct drm_gpuva_fn_ops *ops = mgr->ops;
> +	struct drm_gpuva_gem *vm_bo;
> +	int ret;
> +
> +	if (unlikely(!ops || !ops->bo_validate))
> +		return -ENOTSUPP;
> +
> +	/* At this point we should hold all dma-resv locks of all GEM objects
> +	 * associated with this GPU-VM, hence it is safe to walk the list.
> +	 */
> +	list_for_each_entry(vm_bo, &mgr->evict.list, list.entry.evict) {
> +		dma_resv_assert_held(vm_bo->obj->resv);
> +
> +		ret = ops->bo_validate(vm_bo->obj);
> +		if (ret)
> +			return ret;
> +	}
> +
> +	return 0;
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_validate);
> +
> +/**
> + * drm_gpuva_manager_resv_add_fence - add fence to private and all extobj
> + * dma-resv
> + * @mgr: the &drm_gpuva_manager to add a fence to
> + * @fence: fence to add
> + * @private_usage: private dma-resv usage
> + * @extobj_usage: extobj dma-resv usage
> + */
> +void
> +drm_gpuva_manager_resv_add_fence(struct drm_gpuva_manager *mgr,
> +				 struct dma_fence *fence,
> +				 enum dma_resv_usage private_usage,
> +				 enum dma_resv_usage extobj_usage)
> +{
> +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
> +	struct drm_gem_object *obj;
> +	unsigned long index;
> +
> +	drm_exec_for_each_locked_object(exec, index, obj) {
> +			dma_resv_assert_held(obj->resv);
> +			dma_resv_add_fence(obj->resv, fence,
> +					   drm_gpuva_is_extobj(mgr, obj) ?
> +					   extobj_usage : private_usage);
> +	}
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_resv_add_fence);
> +
> +static struct drm_gpuva_gem *
> +__drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
> +		     struct drm_gem_object *obj)
> +{
> +	struct drm_gpuva_gem *vm_bo;
> +
> +	drm_gem_gpuva_assert_lock_held(obj);
> +
> +	drm_gem_for_each_gpuva_gem(vm_bo, obj)
> +		if (vm_bo->mgr == mgr)
> +			return vm_bo;
> +
> +	return NULL;
> +}
> +
> +/**
> + * drm_gpuva_gem_create() - create a new instance of struct drm_gpuva_gem
> + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> + * @obj: The &drm_gem_object being mapped in the @mgr.
> + *
> + * If provided by the driver, this function uses the &drm_gpuva_fn_ops
> + * vm_bo_alloc() callback to allocate.
> + *
> + * Returns: a pointer to the &drm_gpuva_gem on success, NULL on failure
> + */
> +struct drm_gpuva_gem *
> +drm_gpuva_gem_create(struct drm_gpuva_manager *mgr,
> +		     struct drm_gem_object *obj)
> +{
> +	const struct drm_gpuva_fn_ops *ops = mgr->ops;
> +	struct drm_gpuva_gem *vm_bo;
> +
> +	if (ops && ops->vm_bo_alloc)
> +		vm_bo = ops->vm_bo_alloc();
> +	else
> +		vm_bo = kzalloc(sizeof(*vm_bo), GFP_KERNEL);
> +
> +	if (unlikely(!vm_bo))
> +		return NULL;
> +
> +	vm_bo->mgr = mgr;
> +	vm_bo->obj = obj;
> +
> +	kref_init(&vm_bo->kref);
> +	INIT_LIST_HEAD(&vm_bo->list.gpuva);
> +	INIT_LIST_HEAD(&vm_bo->list.entry.gem);
> +	INIT_LIST_HEAD(&vm_bo->list.entry.evict);
> +
> +	drm_gem_object_get(obj);
> +
> +	return vm_bo;
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_create);
> +
> +void
> +drm_gpuva_gem_destroy(struct kref *kref)
> +{
> +	struct drm_gpuva_gem *vm_bo = container_of(kref, struct drm_gpuva_gem,
> +						   kref);
> +	const struct drm_gpuva_fn_ops *ops = vm_bo->mgr->ops;
> +
> +	drm_gem_object_put(vm_bo->obj);
> +
> +	if (ops && ops->vm_bo_free)
> +		ops->vm_bo_free(vm_bo);
> +	else
> +		kfree(vm_bo);
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_destroy);
> +
> +/**
> + * drm_gpuva_gem_find() - find the &drm_gpuva_gem for the given
> + * &drm_gpuva_manager and &drm_gem_object
> + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> + * @obj: The &drm_gem_object being mapped in the @mgr.
> + *
> + * Find the &drm_gpuva_gem representing the combination of the given
> + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
> + * count of the &drm_gpuva_gem accordingly.
> + *
> + * Returns: a pointer to the &drm_gpuva_gem on success, NULL on failure
> + */
> +struct drm_gpuva_gem *
> +drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
> +		   struct drm_gem_object *obj)
> +{
> +	struct drm_gpuva_gem *vm_bo = __drm_gpuva_gem_find(mgr, obj);
> +
> +	return vm_bo ? drm_gpuva_gem_get(vm_bo) : NULL;
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_find);
> +
> +/**
> + * drm_gpuva_gem_obtain() - obtains an instance of the &drm_gpuva_gem for the
> + * given &drm_gpuva_manager and &drm_gem_object
> + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> + * @obj: The &drm_gem_object being mapped in the @mgr.
> + *
> + * Find the &drm_gpuva_gem representing the combination of the given
> + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
> + * count of the &drm_gpuva_gem accordingly. If not found, allocates a new
> + * &drm_gpuva_gem.
> + *
> + * Returns: a pointer to the &drm_gpuva_gem on success, an ERR_PTR on failure
> + */
> +struct drm_gpuva_gem *
> +drm_gpuva_gem_obtain(struct drm_gpuva_manager *mgr,
> +		     struct drm_gem_object *obj)
> +{
> +	struct drm_gpuva_gem *vm_bo;
> +
> +	vm_bo = drm_gpuva_gem_find(mgr, obj);
> +	if (vm_bo)
> +		return vm_bo;
> +
> +	vm_bo = drm_gpuva_gem_create(mgr, obj);
> +	if (!vm_bo)
> +		return ERR_PTR(-ENOMEM);
> +
> +	return vm_bo;
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_obtain);
> +
> +/**
> + * drm_gpuva_gem_obtain_prealloc() - obtains an instance of the &drm_gpuva_gem
> + * for the given &drm_gpuva_manager and &drm_gem_object
> + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> + * @obj: The &drm_gem_object being mapped in the @mgr.
> + *
> + * Find the &drm_gpuva_gem representing the combination of the given
> + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
> + * count of the found &drm_gpuva_gem accordingly, while the @__vm_bo reference
> + * count is decreased. If not found @__vm_bo is returned.
> + *
> + * Returns: a pointer to the found &drm_gpuva_gem or @__vm_bo if no existing
> + * &drm_gpuva_gem was found
> + */
> +struct drm_gpuva_gem *
> +drm_gpuva_gem_obtain_prealloc(struct drm_gpuva_manager *mgr,
> +			      struct drm_gem_object *obj,
> +			      struct drm_gpuva_gem *__vm_bo)
> +{
> +	struct drm_gpuva_gem *vm_bo;
> +
> +	vm_bo = drm_gpuva_gem_find(mgr, obj);
> +	if (vm_bo) {
> +		drm_gpuva_gem_put(__vm_bo);
> +		return vm_bo;
> +	}
> +
> +	return __vm_bo;
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_obtain_prealloc);
> +
> +static int
> +__drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
> +			  struct drm_gem_object *obj,
> +			  gfp_t gfp)
> +{
> +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
> +	union {
> +		struct drm_gem_object *obj;
> +		uintptr_t index;
> +	} gem;
> +	union {
> +		void *ptr;
> +		uintptr_t cnt;
> +	} ref;
> +	int ret = 0;
> +
> +	gem.obj = obj;
> +	mas_set(&mas, gem.index);
> +
> +	mas_lock(&mas);
> +	ref.ptr = mas_walk(&mas);
> +	if (ref.ptr) {
> +		++ref.cnt;
> +		mas_store(&mas, ref.ptr);
> +	} else {
> +		if (unlikely(!gfp)) {
> +			ret = -EINVAL;
> +			goto out;
> +		}
> +
> +		mas_set(&mas, gem.index);
> +		ref.cnt = 1;
> +		ret = mas_store_gfp(&mas, ref.ptr, gfp);
> +		if (likely(!ret))
> +			drm_gem_object_get(obj);
> +	}
> +out:
> +	mas_unlock(&mas);
> +	return ret;
> +}
> +
> +static void
> +__drm_gpuva_extobj_remove(struct drm_gpuva_manager *mgr,
> +			  struct drm_gem_object *obj)
> +{
> +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
> +	union {
> +		struct drm_gem_object *obj;
> +		uintptr_t index;
> +	} gem;
> +	union {
> +		void *ptr;
> +		uintptr_t cnt;
> +	} ref;
> +
> +	gem.obj = obj;
> +	mas_set(&mas, gem.index);
> +
> +	mas_lock(&mas);
> +	if (unlikely(!(ref.ptr = mas_walk(&mas))))
> +		goto out;
> +
> +	if (!--ref.cnt) {
> +		mas_erase(&mas);
> +		drm_gem_object_put(obj);
> +	} else {
> +		mas_store(&mas, ref.ptr);
> +	}
> +out:
> +	mas_unlock(&mas);
> +}
> +
> +/**
> + * drm_gpuva_extobj_insert - insert an external &drm_gem_object
> + * @mgr: the &drm_gpuva_manager to insert into
> + * @obj: the &drm_gem_object to insert as extobj
> + *
> + * Insert a &drm_gem_object into the &drm_gpuva_managers external object tree.
> + * If the &drm_gem_object already exists in the tree, the reference counter
> + * of this external object is increased by one.
> + *
> + * Drivers should insert the external &drm_gem_object before the dma-fence
> + * signalling critical section, e.g. when submitting the job, and before
> + * locking all &drm_gem_objects of a GPU-VM, e.g. with drm_gpuva_manager_lock()
> + * or its derivatives.
> + *
> + * Returns: 0 on success, negative error code on failure.
> + */
> +int
> +drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
> +			struct drm_gem_object *obj)
> +{
> +	return drm_gpuva_is_extobj(mgr, obj) ?
> +		__drm_gpuva_extobj_insert(mgr, obj, GFP_KERNEL) : 0;
> +
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_insert);
> +
> +/**
> + * drm_gpuva_extobj_get - increase the reference count of an external
> + * &drm_gem_object
> + * @mgr: the &drm_gpuva_manager storing the extobj
> + * @obj: the &drm_gem_object representing the extobj
> + *
> + * Increases the reference count of the extobj represented by @obj.
> + *
> + * Drivers should call this for every &drm_gpuva backed by a &drm_gem_object
> + * being inserted.
> + *
> + * For &drm_gpuva_op_remap operations drivers should make sure to only take an
> + * additional reference if the re-map operation splits an existing &drm_gpuva
> + * into two separate ones.
> + *
> + * See also drm_gpuva_map_get() and drm_gpuva_remap_get().
> + *
> + * Returns: 0 on success, negative error code on failure.
> + */
> +void
> +drm_gpuva_extobj_get(struct drm_gpuva_manager *mgr,
> +		     struct drm_gem_object *obj)
> +{
> +	if (drm_gpuva_is_extobj(mgr, obj))
> +		WARN(__drm_gpuva_extobj_insert(mgr, obj, 0),
> +		     "Can't increase ref-count of non-existent extobj.");
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_get);
> +
> +/**
> + * drm_gpuva_extobj_put - decrease the reference count of an external
> + * &drm_gem_object
> + * @mgr: the &drm_gpuva_manager storing the extobj
> + * @obj: the &drm_gem_object representing the extobj
> + *
> + * Decreases the reference count of the extobj represented by @obj.
> + *
> + * Drivers should call this for every &drm_gpuva backed by a &drm_gem_object
> + * being removed from the GPU VA space.
> + *
> + * See also drm_gpuva_unmap_put().
> + *
> + * Returns: 0 on success, negative error code on failure.
> + */
> +void
> +drm_gpuva_extobj_put(struct drm_gpuva_manager *mgr,
> +		     struct drm_gem_object *obj)
> +{
> +	if (drm_gpuva_is_extobj(mgr, obj))
> +		__drm_gpuva_extobj_remove(mgr, obj);
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_put);
> +
> +/**
> + * drm_gpuva_gem_evict() - add / remove a &drm_gem_object to / from a
> + * &drm_gpuva_managers evicted list
> + * @obj: the &drm_gem_object to add or remove
> + * @evict: indicates whether the object is evicted
> + *
> + * Adds a &drm_gem_object to or removes it from all &drm_gpuva_managers evicted
> + * list containing a mapping of this &drm_gem_object.
> + */
> +void
> +drm_gpuva_gem_evict(struct drm_gem_object *obj, bool evict)
> +{
> +	struct drm_gpuva_gem *vm_bo;
> +
> +	/* Required for iterating the GEMs GPUVA GEM list. If no driver specific
> +	 * lock has been set, the list is protected with the GEMs dma-resv lock.
> +	 */
> +	drm_gem_gpuva_assert_lock_held(obj);
> +
> +	/* Required to protect the GPUVA managers evict list against concurrent
> +	 * access through drm_gpuva_manager_validate(). Concurrent insertions to
> +	 * the evict list through different GEM object evictions are protected
> +	 * by the GPUVA managers evict lock.
> +	 */
> +	dma_resv_assert_held(obj->resv);
> +
> +	drm_gem_for_each_gpuva_gem(vm_bo, obj) {
> +		struct drm_gpuva_manager *mgr = vm_bo->mgr;
> +
> +		spin_lock(&mgr->evict.lock);
> +		if (evict)
> +			list_add_tail(&vm_bo->list.entry.evict,
> +				      &mgr->evict.list);
> +		else
> +			list_del_init(&vm_bo->list.entry.evict);
> +		spin_unlock(&mgr->evict.lock);
> +	}
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_evict);
> +
>   static int
>   __drm_gpuva_insert(struct drm_gpuva_manager *mgr,
>   		   struct drm_gpuva *va)
> @@ -806,15 +1381,20 @@ EXPORT_SYMBOL_GPL(drm_gpuva_remove);
>   /**
>    * drm_gpuva_link() - link a &drm_gpuva
>    * @va: the &drm_gpuva to link
> + * @vm_bo: the &drm_gpuva_gem to add the &drm_gpuva to
>    *
> - * This adds the given &va to the GPU VA list of the &drm_gem_object it is
> - * associated with.
> + * This adds the given &va to the GPU VA list of the &drm_gpuva_gem and the
> + * &drm_gpuva_gem to the &drm_gem_object it is associated with.
> + *
> + * For every &drm_gpuva entry added to the &drm_gpuva_gem an additional
> + * reference of the latter is taken.
>    *
>    * This function expects the caller to protect the GEM's GPUVA list against
> - * concurrent access using the GEMs dma_resv lock.
> + * concurrent access using either the GEMs dma_resv lock or a driver specific
> + * lock set through drm_gem_gpuva_set_lock().
>    */
>   void
> -drm_gpuva_link(struct drm_gpuva *va)
> +drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo)
>   {
>   	struct drm_gem_object *obj = va->gem.obj;
>   
> @@ -823,7 +1403,10 @@ drm_gpuva_link(struct drm_gpuva *va)
>   
>   	drm_gem_gpuva_assert_lock_held(obj);
>   
> -	list_add_tail(&va->gem.entry, &obj->gpuva.list);
> +	drm_gpuva_gem_get(vm_bo);
> +	list_add_tail(&va->gem.entry, &vm_bo->list.gpuva);
> +	if (list_empty(&vm_bo->list.entry.gem))
> +		list_add_tail(&vm_bo->list.entry.gem, &obj->gpuva.list);
>   }
>   EXPORT_SYMBOL_GPL(drm_gpuva_link);
>   
> @@ -834,20 +1417,39 @@ EXPORT_SYMBOL_GPL(drm_gpuva_link);
>    * This removes the given &va from the GPU VA list of the &drm_gem_object it is
>    * associated with.
>    *
> + * This removes the given &va from the GPU VA list of the &drm_gpuva_gem and
> + * the &drm_gpuva_gem from the &drm_gem_object it is associated with in case
> + * this call unlinks the last &drm_gpuva from the &drm_gpuva_gem.
> + *
> + * For every &drm_gpuva entry removed from the &drm_gpuva_gem a reference of
> + * the latter is dropped.
> + *
>    * This function expects the caller to protect the GEM's GPUVA list against
> - * concurrent access using the GEMs dma_resv lock.
> + * concurrent access using either the GEMs dma_resv lock or a driver specific
> + * lock set through drm_gem_gpuva_set_lock().
>    */
>   void
>   drm_gpuva_unlink(struct drm_gpuva *va)
>   {
>   	struct drm_gem_object *obj = va->gem.obj;
> +	struct drm_gpuva_gem *vm_bo;
>   
>   	if (unlikely(!obj))
>   		return;
>   
>   	drm_gem_gpuva_assert_lock_held(obj);
>   
> +	vm_bo = __drm_gpuva_gem_find(va->mgr, obj);
> +	if (WARN(!vm_bo, "GPUVA doesn't seem to be linked.\n"))
> +		return;
> +
>   	list_del_init(&va->gem.entry);
> +
> +	if (list_empty(&vm_bo->list.gpuva)) {
> +		list_del_init(&vm_bo->list.entry.gem);
> +		list_del_init(&vm_bo->list.entry.evict);
> +	}
> +	drm_gpuva_gem_put(vm_bo);
>   }
>   EXPORT_SYMBOL_GPL(drm_gpuva_unlink);
>   
> @@ -977,6 +1579,26 @@ drm_gpuva_map(struct drm_gpuva_manager *mgr,
>   }
>   EXPORT_SYMBOL_GPL(drm_gpuva_map);
>   
> +/**
> + * drm_gpuva_map_get() - helper to insert a &drm_gpuva according to a
> + * &drm_gpuva_op_map
> + * @mgr: the &drm_gpuva_manager
> + * @va: the &drm_gpuva to insert
> + * @op: the &drm_gpuva_op_map to initialize @va with
> + *
> + * Initializes the @va from the @op and inserts it into the given @mgr and
> + * increases the reference count of the corresponding extobj.
> + */
> +void
> +drm_gpuva_map_get(struct drm_gpuva_manager *mgr,
> +		  struct drm_gpuva *va,
> +		  struct drm_gpuva_op_map *op)
> +{
> +	drm_gpuva_map(mgr, va, op);
> +	drm_gpuva_extobj_get(mgr, va->gem.obj);
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_map_get);
> +
>   /**
>    * drm_gpuva_remap() - helper to remap a &drm_gpuva according to a
>    * &drm_gpuva_op_remap
> @@ -992,10 +1614,10 @@ drm_gpuva_remap(struct drm_gpuva *prev,
>   		struct drm_gpuva *next,
>   		struct drm_gpuva_op_remap *op)
>   {
> -	struct drm_gpuva *curr = op->unmap->va;
> -	struct drm_gpuva_manager *mgr = curr->mgr;
> +	struct drm_gpuva *va = op->unmap->va;
> +	struct drm_gpuva_manager *mgr = va->mgr;
>   
> -	drm_gpuva_remove(curr);
> +	drm_gpuva_remove(va);
>   
>   	if (op->prev) {
>   		drm_gpuva_init_from_op(prev, op->prev);
> @@ -1009,6 +1631,31 @@ drm_gpuva_remap(struct drm_gpuva *prev,
>   }
>   EXPORT_SYMBOL_GPL(drm_gpuva_remap);
>   
> +/**
> + * drm_gpuva_remap_get() - helper to remap a &drm_gpuva according to a
> + * &drm_gpuva_op_remap
> + * @prev: the &drm_gpuva to remap when keeping the start of a mapping
> + * @next: the &drm_gpuva to remap when keeping the end of a mapping
> + * @op: the &drm_gpuva_op_remap to initialize @prev and @next with
> + *
> + * Removes the currently mapped &drm_gpuva and remaps it using @prev and/or
> + * @next. Additionally, if the re-map splits the existing &drm_gpuva into two
> + * separate mappings, increases the reference count of the corresponding extobj.
> + */
> +void
> +drm_gpuva_remap_get(struct drm_gpuva *prev,
> +		    struct drm_gpuva *next,
> +		    struct drm_gpuva_op_remap *op)
> +{
> +	struct drm_gpuva *va = op->unmap->va;
> +	struct drm_gpuva_manager *mgr = va->mgr;
> +
> +	drm_gpuva_remap(prev, next, op);
> +	if (op->prev && op->next)
> +		drm_gpuva_extobj_get(mgr, va->gem.obj);
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_remap_get);
> +
>   /**
>    * drm_gpuva_unmap() - helper to remove a &drm_gpuva according to a
>    * &drm_gpuva_op_unmap
> @@ -1023,6 +1670,24 @@ drm_gpuva_unmap(struct drm_gpuva_op_unmap *op)
>   }
>   EXPORT_SYMBOL_GPL(drm_gpuva_unmap);
>   
> +/**
> + * drm_gpuva_unmap_put() - helper to remove a &drm_gpuva according to a
> + * &drm_gpuva_op_unmap
> + * @op: the &drm_gpuva_op_unmap specifying the &drm_gpuva to remove
> + *
> + * Removes the &drm_gpuva associated with the &drm_gpuva_op_unmap and decreases
> + * the reference count of the corresponding extobj.
> + */
> +void
> +drm_gpuva_unmap_put(struct drm_gpuva_op_unmap *op)
> +{
> +	struct drm_gpuva *va = op->va;
> +
> +	drm_gpuva_unmap(op);
> +	drm_gpuva_extobj_put(va->mgr, va->gem.obj);
> +}
> +EXPORT_SYMBOL_GPL(drm_gpuva_unmap_put);
> +
>   static int
>   op_map_cb(const struct drm_gpuva_fn_ops *fn, void *priv,
>   	  u64 addr, u64 range,
> @@ -1663,6 +2328,7 @@ drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
>   {
>   	struct drm_gpuva_ops *ops;
>   	struct drm_gpuva_op *op;
> +	struct drm_gpuva_gem *vm_bo;
>   	struct drm_gpuva *va;
>   	int ret;
>   
> @@ -1674,7 +2340,7 @@ drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
>   
>   	INIT_LIST_HEAD(&ops->list);
>   
> -	drm_gem_for_each_gpuva(va, obj) {
> +	drm_gem_for_each_gpuva(va, vm_bo, mgr, obj) {
>   		op = gpuva_op_alloc(mgr);
>   		if (!op) {
>   			ret = -ENOMEM;
> diff --git a/include/drm/drm_gem.h b/include/drm/drm_gem.h
> index bc9f6aa2f3fe..783ed3ab440d 100644
> --- a/include/drm/drm_gem.h
> +++ b/include/drm/drm_gem.h
> @@ -571,7 +571,7 @@ int drm_gem_evict(struct drm_gem_object *obj);
>    * drm_gem_gpuva_init() - initialize the gpuva list of a GEM object
>    * @obj: the &drm_gem_object
>    *
> - * This initializes the &drm_gem_object's &drm_gpuva list.
> + * This initializes the &drm_gem_object's &drm_gpuva_gem list.
>    *
>    * Calling this function is only necessary for drivers intending to support the
>    * &drm_driver_feature DRIVER_GEM_GPUVA.
> @@ -584,28 +584,44 @@ static inline void drm_gem_gpuva_init(struct drm_gem_object *obj)
>   }
>   
>   /**
> - * drm_gem_for_each_gpuva() - iternator to walk over a list of gpuvas
> - * @entry__: &drm_gpuva structure to assign to in each iteration step
> - * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
> + * drm_gem_for_each_gpuva_gem() - iterator to walk over a list of &drm_gpuva_gem
> + * @entry__: &drm_gpuva_gem structure to assign to in each iteration step
> + * @obj__: the &drm_gem_object the &drm_gpuva_gem to walk are associated with
>    *
> - * This iterator walks over all &drm_gpuva structures associated with the
> - * &drm_gpuva_manager.
> + * This iterator walks over all &drm_gpuva_gem structures associated with the
> + * &drm_gem_object.
>    */
> -#define drm_gem_for_each_gpuva(entry__, obj__) \
> -	list_for_each_entry(entry__, &(obj__)->gpuva.list, gem.entry)
> +#define drm_gem_for_each_gpuva_gem(entry__, obj__) \
> +	list_for_each_entry(entry__, &(obj__)->gpuva.list, list.entry.gem)
>   
>   /**
> - * drm_gem_for_each_gpuva_safe() - iternator to safely walk over a list of
> - * gpuvas
> - * @entry__: &drm_gpuva structure to assign to in each iteration step
> - * @next__: &next &drm_gpuva to store the next step
> - * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
> + * drm_gem_for_each_gpuva_gem_safe() - iterator to safely walk over a list of
> + * &drm_gpuva_gem
> + * @entry__: &drm_gpuva_gem structure to assign to in each iteration step
> + * @next__: &next &drm_gpuva_gem to store the next step
> + * @obj__: the &drm_gem_object the &drm_gpuva_gem to walk are associated with
>    *
> - * This iterator walks over all &drm_gpuva structures associated with the
> + * This iterator walks over all &drm_gpuva_gem structures associated with the
>    * &drm_gem_object. It is implemented with list_for_each_entry_safe(), hence
>    * it is save against removal of elements.
>    */
> -#define drm_gem_for_each_gpuva_safe(entry__, next__, obj__) \
> -	list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list, gem.entry)
> +#define drm_gem_for_each_gpuva_gem_safe(entry__, next__, obj__) \
> +	list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list, list.entry.gem)
> +
> +/**
> + * drm_gem_for_each_gpuva() - iterator to walk over a list of &drm_gpuva
> + * @va__: &drm_gpuva structure to assign to in each iteration step
> + * @vm_bo__: the &drm_gpuva_gem representing the @mgr__ and @obj__ combination
> + * @mgr__: the &drm_gpuva_manager the &drm_gpuvas to walk are associated with
> + * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
> + *
> + * This iterator walks over all &drm_gpuva structures associated with the
> + * &drm_gpuva_manager and &drm_gem_object.
> + */
> +#define drm_gem_for_each_gpuva(va__, vm_bo__, mgr__, obj__) \
> +	for (vm_bo__ = drm_gpuva_gem_find(mgr__, obj__), \
> +	     va__ = vm_bo__ ? list_first_entry(&vm_bo__->list.gpuva, typeof(*va__), gem.entry) : NULL; \
> +	     va__ && !list_entry_is_head(va__, &vm_bo__->list.gpuva, gem.entry); \
> +	     va__ = list_next_entry(va__, gem.entry))
>   
>   #endif /* __DRM_GEM_H__ */
> diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h
> index ed8d50200cc3..693e2da3f425 100644
> --- a/include/drm/drm_gpuva_mgr.h
> +++ b/include/drm/drm_gpuva_mgr.h
> @@ -26,12 +26,16 @@
>    */
>   
>   #include <linux/list.h>
> +#include <linux/dma-resv.h>
> +#include <linux/maple_tree.h>
>   #include <linux/rbtree.h>
>   #include <linux/types.h>
>   
>   #include <drm/drm_gem.h>
> +#include <drm/drm_exec.h>
>   
>   struct drm_gpuva_manager;
> +struct drm_gpuva_gem;
>   struct drm_gpuva_fn_ops;
>   
>   /**
> @@ -140,7 +144,7 @@ struct drm_gpuva {
>   int drm_gpuva_insert(struct drm_gpuva_manager *mgr, struct drm_gpuva *va);
>   void drm_gpuva_remove(struct drm_gpuva *va);
>   
> -void drm_gpuva_link(struct drm_gpuva *va);
> +void drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo);
>   void drm_gpuva_unlink(struct drm_gpuva *va);
>   
>   struct drm_gpuva *drm_gpuva_find(struct drm_gpuva_manager *mgr,
> @@ -240,15 +244,137 @@ struct drm_gpuva_manager {
>   	 * @ops: &drm_gpuva_fn_ops providing the split/merge steps to drivers
>   	 */
>   	const struct drm_gpuva_fn_ops *ops;
> +
> +	/**
> +	 * @d_obj: Dummy GEM object; used internally to pass the GPU VMs
> +	 * dma-resv to &drm_exec.
> +	 */
> +	struct drm_gem_object d_obj;
> +
> +	/**
> +	 * @resv: the &dma_resv for &drm_gem_objects mapped in this GPU VA
> +	 * space
> +	 */
> +	struct dma_resv *resv;
> +
> +	/**
> +	 * @exec: the &drm_exec helper to lock external &drm_gem_objects
> +	 */
> +	struct drm_exec exec;
> +
> +	/**
> +	 * @mt_ext: &maple_tree storing external &drm_gem_objects
> +	 */
> +	struct maple_tree mt_ext;
> +
> +	/**
> +	 * @evict: structure holding the evict list and evict list lock
> +	 */
> +	struct {
> +		/**
> +		 * @list: &list_head storing &drm_gem_objects currently being
> +		 * evicted
> +		 */
> +		struct list_head list;
> +
> +		/**
> +		 * @lock: spinlock to protect the evict list against concurrent
> +		 * insertion / removal of different &drm_gpuva_gems
> +		 */
> +		spinlock_t lock;
> +	} evict;
>   };
>   
>   void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> +			    struct drm_device *drm,
>   			    const char *name,
>   			    u64 start_offset, u64 range,
>   			    u64 reserve_offset, u64 reserve_range,
>   			    const struct drm_gpuva_fn_ops *ops);
>   void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr);
>   
> +/**
> + * DRM_GPUVA_EXEC - returns the &drm_gpuva_managers &drm_exec instance
> + * @mgr: the &drm_gpuva_managers to return the &drm_exec instance for
> + */
> +#define DRM_GPUVA_EXEC(mgr)	&(mgr)->exec
> +
> +int drm_gpuva_manager_lock_extra(struct drm_gpuva_manager *mgr,
> +				 int (*fn)(struct drm_gpuva_manager *mgr,
> +					   void *priv, unsigned int num_fences),
> +				 void *priv,
> +				 unsigned int num_fences,
> +				 bool interruptible);
> +
> +int drm_gpuva_manager_lock_array(struct drm_gpuva_manager *mgr,
> +				 struct drm_gem_object **objs,
> +				 unsigned int num_objs,
> +				 unsigned int num_fences,
> +				 bool interruptible);
> +
> +/**
> + * drm_gpuva_manager_lock() - lock all dma-resv of all associated BOs
> + * @mgr: the &drm_gpuva_manager
> + * @num_fences: the amount of &dma_fences to reserve
> + * @interruptible: sleep interruptible if waiting
> + *
> + * Acquires all dma-resv locks of all &drm_gem_objects the given
> + * &drm_gpuva_manager contains mappings of.
> + *
> + * Returns: 0 on success, negative error code on failure.
> + */
> +static inline int
> +drm_gpuva_manager_lock(struct drm_gpuva_manager *mgr,
> +		       unsigned int num_fences,
> +		       bool interruptible)
> +{
> +	return drm_gpuva_manager_lock_extra(mgr, NULL, NULL, num_fences,
> +					    interruptible);
> +}
> +
> +/**
> + * drm_gpuva_manager_unlock() - unlock all dma-resv of all associated BOs
> + * @mgr: the &drm_gpuva_manager
> + *
> + * Releases all dma-resv locks of all &drm_gem_objects previously acquired
> + * through drm_gpuva_manager_lock() or its variants.
> + *
> + * Returns: 0 on success, negative error code on failure.
> + */
> +static inline void
> +drm_gpuva_manager_unlock(struct drm_gpuva_manager *mgr)
> +{
> +	drm_exec_fini(&mgr->exec);
> +}
> +
> +int drm_gpuva_manager_validate(struct drm_gpuva_manager *mgr);
> +void drm_gpuva_manager_resv_add_fence(struct drm_gpuva_manager *mgr,
> +				      struct dma_fence *fence,
> +				      enum dma_resv_usage private_usage,
> +				      enum dma_resv_usage extobj_usage);
> +
> +int drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
> +			    struct drm_gem_object *obj);
> +void drm_gpuva_extobj_get(struct drm_gpuva_manager *mgr,
> +			  struct drm_gem_object *obj);
> +void drm_gpuva_extobj_put(struct drm_gpuva_manager *mgr,
> +			  struct drm_gem_object *obj);
> +
> +/**
> + * drm_gpuva_is_extobj() - indicates whether the given &drm_gem_object is an
> + * external object
> + * @mgr: the &drm_gpuva_manager to check
> + * @obj: the &drm_gem_object to check
> + *
> + * Returns: true if the &drm_gem_object &dma_resv differs from the
> + * &drm_gpuva_managers &dma_resv, false otherwise
> + */
> +static inline bool drm_gpuva_is_extobj(struct drm_gpuva_manager *mgr,
> +				       struct drm_gem_object *obj)
> +{
> +	return obj && obj->resv != mgr->resv;
> +}
> +
>   static inline struct drm_gpuva *
>   __drm_gpuva_next(struct drm_gpuva *va)
>   {
> @@ -327,6 +453,138 @@ __drm_gpuva_next(struct drm_gpuva *va)
>   #define drm_gpuva_for_each_va_safe(va__, next__, mgr__) \
>   	list_for_each_entry_safe(va__, next__, &(mgr__)->rb.list, rb.entry)
>   
> +/**
> + * struct drm_gpuva_gem - structure representing a &drm_gpuva_manager and
> + * &drm_gem_object combination
> + *
> + * This structure is an abstraction representing a &drm_gpuva_manager and
> + * &drm_gem_object combination. It serves as an indirection to accelerate
> + * iterating all &drm_gpuvas within a &drm_gpuva_manager backed by the same
> + * &drm_gem_object.
> + *
> + * Furthermore it is used to cache evicted GEM objects for a certain GPU-VM to
> + * accelerate validation.
> + *
> + * Typically, drivers want to create an instance of a struct drm_gpuva_gem once
> + * a GEM object is mapped first in a GPU-VM and release the instance once the
> + * last mapping of the GEM object in this GPU-VM is unmapped.
> + */
> +struct drm_gpuva_gem {
> +
> +	/**
> +	 * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> +	 */
> +	struct drm_gpuva_manager *mgr;
> +
> +	/**
> +	 * @obj: The &drm_gem_object being mapped in the @mgr.
> +	 */
> +	struct drm_gem_object *obj;
> +
> +	/**
> +	 * @kref: The reference count for this &drm_gpuva_gem.
> +	 */
> +	struct kref kref;
> +
> +	/**
> +	 * @list: Structure containing all &list_heads.
> +	 */
> +	struct {
> +		/**
> +		 * @gpuva: The list of linked &drm_gpuvas.
> +		 */
> +		struct list_head gpuva;
> +
> +		/**
> +		 * @entry: Structure containing all &list_heads serving as
> +		 * entry.
> +		 */
> +		struct {
> +			/**
> +			 * @gem: List entry to attach to the &drm_gem_objects
> +			 * gpuva list.
> +			 */
> +			struct list_head gem;
> +
> +			/**
> +			 * @evict: List entry to attach to the
> +			 * &drm_gpuva_managers evict list.
> +			 */
> +			struct list_head evict;
> +		} entry;
> +	} list;
> +};
> +
> +struct drm_gpuva_gem *
> +drm_gpuva_gem_obtain(struct drm_gpuva_manager *mgr,
> +		     struct drm_gem_object *obj);
> +struct drm_gpuva_gem *
> +drm_gpuva_gem_obtain_prealloc(struct drm_gpuva_manager *mgr,
> +			      struct drm_gem_object *obj,
> +			      struct drm_gpuva_gem *__vm_bo);
> +
> +struct drm_gpuva_gem *
> +drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
> +		   struct drm_gem_object *obj);
> +
> +void drm_gpuva_gem_evict(struct drm_gem_object *obj, bool evict);
> +
> +struct drm_gpuva_gem *
> +drm_gpuva_gem_create(struct drm_gpuva_manager *mgr,
> +		     struct drm_gem_object *obj);
> +void drm_gpuva_gem_destroy(struct kref *kref);
> +
> +/**
> + * drm_gpuva_gem_get() - acquire a struct drm_gpuva_gem reference
> + * @vm_bo: the &drm_gpuva_gem to acquire the reference of
> + *
> + * This function acquires an additional reference to @vm_bo. It is illegal to
> + * call this without already holding a reference. No locks required.
> + */
> +static inline struct drm_gpuva_gem *
> +drm_gpuva_gem_get(struct drm_gpuva_gem *vm_bo)
> +{
> +	kref_get(&vm_bo->kref);
> +	return vm_bo;
> +}
> +
> +/**
> + * drm_gpuva_gem_put() - drop a struct drm_gpuva_gem reference
> + * @vm_bo: the &drm_gpuva_gem to release the reference of
> + *
> + * This releases a reference to @vm_bo.
> + */
> +static inline void
> +drm_gpuva_gem_put(struct drm_gpuva_gem *vm_bo)
> +{
> +	kref_put(&vm_bo->kref, drm_gpuva_gem_destroy);
> +}
> +
> +/**
> + * drm_gpuva_gem_for_each_va() - iterator to walk over a list of &drm_gpuva
> + * @va__: &drm_gpuva structure to assign to in each iteration step
> + * @vm_bo__: the &drm_gpuva_gem the &drm_gpuva to walk are associated with
> + *
> + * This iterator walks over all &drm_gpuva structures associated with the
> + * &drm_gpuva_gem.
> + */
> +#define drm_gpuva_gem_for_each_va(va__, vm_bo__) \
> +	list_for_each_entry(va__, &(vm_bo__)->list.gpuva, gem.entry)
> +
> +/**
> + * drm_gpuva_gem_for_each_va_safe() - iterator to safely walk over a list of
> + * &drm_gpuva
> + * @va__: &drm_gpuva structure to assign to in each iteration step
> + * @next__: &next &drm_gpuva to store the next step
> + * @vm_bo__: the &drm_gpuva_gem the &drm_gpuva to walk are associated with
> + *
> + * This iterator walks over all &drm_gpuva structures associated with the
> + * &drm_gpuva_gem. It is implemented with list_for_each_entry_safe(), hence
> + * it is safe against removal of elements.
> + */
> +#define drm_gpuva_gem_for_each_va_safe(va__, next__, vm_bo__) \
> +	list_for_each_entry_safe(va__, next__, &(vm_bo__)->list.gpuva, gem.entry)
> +
>   /**
>    * enum drm_gpuva_op_type - GPU VA operation type
>    *
> @@ -641,6 +899,30 @@ struct drm_gpuva_fn_ops {
>   	 */
>   	void (*op_free)(struct drm_gpuva_op *op);
>   
> +	/**
> +	 * @vm_bo_alloc: called when the &drm_gpuva_manager allocates
> +	 * a struct drm_gpuva_gem
> +	 *
> +	 * Some drivers may want to embed struct drm_gpuva_gem into driver
> +	 * specific structures. By implementing this callback drivers can
> +	 * allocate memory accordingly.
> +	 *
> +	 * This callback is optional.
> +	 */
> +	struct drm_gpuva_gem *(*vm_bo_alloc)(void);
> +
> +	/**
> +	 * @vm_bo_free: called when the &drm_gpuva_manager frees a
> +	 * struct drm_gpuva_gem
> +	 *
> +	 * Some drivers may want to embed struct drm_gpuva_gem into driver
> +	 * specific structures. By implementing this callback drivers can
> +	 * free the previously allocated memory accordingly.
> +	 *
> +	 * This callback is optional.
> +	 */
> +	void (*vm_bo_free)(struct drm_gpuva_gem *vm_bo);
> +
>   	/**
>   	 * @sm_step_map: called from &drm_gpuva_sm_map to finally insert the
>   	 * mapping once all previous steps were completed
> @@ -684,6 +966,17 @@ struct drm_gpuva_fn_ops {
>   	 * used.
>   	 */
>   	int (*sm_step_unmap)(struct drm_gpuva_op *op, void *priv);
> +
> +	/**
> +	 * @bo_validate: called from drm_gpuva_manager_validate()
> +	 *
> +	 * Drivers receive this callback for every evicted &drm_gem_object being
> +	 * mapped in the corresponding &drm_gpuva_manager.
> +	 *
> +	 * Typically, drivers would call their driver specific variant of
> +	 * ttm_bo_validate() from within this callback.
> +	 */
> +	int (*bo_validate)(struct drm_gem_object *obj);
>   };
>   
>   int drm_gpuva_sm_map(struct drm_gpuva_manager *mgr, void *priv,
> @@ -696,11 +989,18 @@ int drm_gpuva_sm_unmap(struct drm_gpuva_manager *mgr, void *priv,
>   void drm_gpuva_map(struct drm_gpuva_manager *mgr,
>   		   struct drm_gpuva *va,
>   		   struct drm_gpuva_op_map *op);
> +void drm_gpuva_map_get(struct drm_gpuva_manager *mgr,
> +		       struct drm_gpuva *va,
> +		       struct drm_gpuva_op_map *op);
>   
>   void drm_gpuva_remap(struct drm_gpuva *prev,
>   		     struct drm_gpuva *next,
>   		     struct drm_gpuva_op_remap *op);
> +void drm_gpuva_remap_get(struct drm_gpuva *prev,
> +			 struct drm_gpuva *next,
> +			 struct drm_gpuva_op_remap *op);
>   
>   void drm_gpuva_unmap(struct drm_gpuva_op_unmap *op);
> +void drm_gpuva_unmap_put(struct drm_gpuva_op_unmap *op);
>   
>   #endif /* __DRM_GPUVA_MGR_H__ */


^ permalink raw reply	[flat|nested] 88+ messages in thread

* Re: [Nouveau] [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
  2023-08-30  7:27     ` Thomas Hellström (Intel)
  (?)
@ 2023-08-30 12:49       ` Danilo Krummrich
  -1 siblings, 0 replies; 88+ messages in thread
From: Danilo Krummrich @ 2023-08-30 12:49 UTC (permalink / raw)
  To: Thomas Hellström (Intel)
  Cc: matthew.brost, thomas.hellstrom, sarah.walker, nouveau,
	dri-devel, linux-kernel, Liam.Howlett, boris.brezillon,
	donald.robson, daniel, christian.koenig, faith.ekstrand, bskeggs

Hi Thomas,

thanks for having a look!

On Wed, Aug 30, 2023 at 09:27:45AM +0200, Thomas Hellström (Intel) wrote:
> Hi, Danilo.
> 
> Some quick comments since I'm doing some Xe work in this area. Will probably
> get back with more.
> 
> On 8/20/23 23:53, Danilo Krummrich wrote:
> > So far the DRM GPUVA manager offers common infrastructure to track GPU VA
> > allocations and mappings, generically connect GPU VA mappings to their
> > backing buffers and perform more complex mapping operations on the GPU VA
> > space.
> > 
> > However, there are more design patterns commonly used by drivers, which
> > can potentially be generalized in order to make the DRM GPUVA manager
> > represent a basic GPU-VM implementation. In this context, this patch aims
> > at generalizing the following elements.
> > 
> > 1) Provide a common dma-resv for GEM objects not being used outside of
> >     this GPU-VM.
> > 
> > 2) Provide tracking of external GEM objects (GEM objects which are
> >     shared with other GPU-VMs).
> > 
> > 3) Provide functions to efficiently lock all GEM objects dma-resv the
> >     GPU-VM contains mappings of.
> > 
> > 4) Provide tracking of evicted GEM objects the GPU-VM contains mappings
> >     of, such that validation of evicted GEM objects is accelerated.
> > 
> > 5) Provide some convinience functions for common patterns.
> > 
> > Rather than being designed as a "framework", the target is to make all
> > features appear as a collection of optional helper functions, such that
> > drivers are free to make use of the DRM GPUVA managers basic
> > functionality and opt-in for other features without setting any feature
> > flags, just by making use of the corresponding functions.
> > 
> > Signed-off-by: Danilo Krummrich <dakr@redhat.com>
> > ---
> >   drivers/gpu/drm/drm_gpuva_mgr.c | 688 +++++++++++++++++++++++++++++++-
> >   include/drm/drm_gem.h           |  48 ++-
> >   include/drm/drm_gpuva_mgr.h     | 302 +++++++++++++-
> >   3 files changed, 1010 insertions(+), 28 deletions(-)
> > 
> > diff --git a/drivers/gpu/drm/drm_gpuva_mgr.c b/drivers/gpu/drm/drm_gpuva_mgr.c
> > index f86bfad74ff8..69872b205961 100644
> > --- a/drivers/gpu/drm/drm_gpuva_mgr.c
> > +++ b/drivers/gpu/drm/drm_gpuva_mgr.c
> > @@ -655,6 +655,7 @@ drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
> >   /**
> >    * drm_gpuva_manager_init() - initialize a &drm_gpuva_manager
> >    * @mgr: pointer to the &drm_gpuva_manager to initialize
> > + * @drm: the drivers &drm_device
> >    * @name: the name of the GPU VA space
> >    * @start_offset: the start offset of the GPU VA space
> >    * @range: the size of the GPU VA space
> > @@ -669,6 +670,7 @@ drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
> >    */
> >   void
> >   drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> > +		       struct drm_device *drm,
> >   		       const char *name,
> >   		       u64 start_offset, u64 range,
> >   		       u64 reserve_offset, u64 reserve_range,
> > @@ -677,6 +679,11 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> >   	mgr->rb.tree = RB_ROOT_CACHED;
> >   	INIT_LIST_HEAD(&mgr->rb.list);
> > +	mt_init(&mgr->mt_ext);
> > +
> > +	INIT_LIST_HEAD(&mgr->evict.list);
> > +	spin_lock_init(&mgr->evict.lock);
> > +
> >   	drm_gpuva_check_overflow(start_offset, range);
> >   	mgr->mm_start = start_offset;
> >   	mgr->mm_range = range;
> > @@ -694,6 +701,9 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> >   						     reserve_range)))
> >   			__drm_gpuva_insert(mgr, &mgr->kernel_alloc_node);
> >   	}
> > +
> > +	drm_gem_private_object_init(drm, &mgr->d_obj, 0);
> > +	mgr->resv = mgr->d_obj.resv;
> >   }
> >   EXPORT_SYMBOL_GPL(drm_gpuva_manager_init);
> > @@ -713,10 +723,575 @@ drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr)
> >   		__drm_gpuva_remove(&mgr->kernel_alloc_node);
> >   	WARN(!RB_EMPTY_ROOT(&mgr->rb.tree.rb_root),
> > -	     "GPUVA tree is not empty, potentially leaking memory.");
> > +	     "GPUVA tree is not empty, potentially leaking memory.\n");
> > +
> > +	mtree_destroy(&mgr->mt_ext);
> > +	WARN(!list_empty(&mgr->evict.list), "Evict list should be empty.\n");
> > +
> > +	drm_gem_private_object_fini(&mgr->d_obj);
> >   }
> >   EXPORT_SYMBOL_GPL(drm_gpuva_manager_destroy);
> > +/**
> > + * drm_gpuva_manager_prepare_objects() - prepare all associated BOs
> > + * @mgr: the &drm_gpuva_manager
> > + * @num_fences: the amount of &dma_fences to reserve
> > + *
> > + * Calls drm_exec_prepare_obj() for all &drm_gem_objects the given
> > + * &drm_gpuva_manager contains mappings of.
> > + *
> > + * Drivers can obtain the corresponding &drm_exec instance through
> > + * DRM_GPUVA_EXEC(). It is the drivers responsibility to call drm_exec_init()
> > + * and drm_exec_fini() accordingly.
> > + *
> > + * Returns: 0 on success, negative error code on failure.
> > + */
> > +int
> > +drm_gpuva_manager_prepare_objects(struct drm_gpuva_manager *mgr,
> > +				  unsigned int num_fences)
> > +{
> > +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
> > +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
> > +	union {
> > +		void *ptr;
> > +		uintptr_t cnt;
> > +	} ref;
> > +	int ret;
> > +
> > +	ret = drm_exec_prepare_obj(exec, &mgr->d_obj, num_fences);
> > +	if (ret)
> > +		goto out;
> > +
> > +	rcu_read_lock();
> In xe we're protecting the external object list with an outer lock, (same as
> protecting the mgr itself). Do we need a separate lock for this? In theory
> as  outlined in the VM_BIND locking document draft, one could probably even
> use the mgr resv for this, but with more complicated code I guess. Also see
> the comment below about the data structure chosen.

The idea is to protect this list with the GPU-VM lock. The locking here is more
of a consequence of using the maple tree: either you use the maple tree's
internal lock (or RCU, respectively), or you give the maple tree an external
lock to perform lockdep checks on (mt_set_external_lock()). Basically the same
as here:

https://elixir.bootlin.com/linux/latest/source/drivers/base/regmap/regcache-maple.c#L124
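A rough sketch of what the external lock variant would look like (the GPU-VM
lock used here is just a placeholder for whatever outer lock the driver has,
it's not something this series adds):

	mt_init_flags(&mgr->mt_ext, MT_FLAGS_LOCK_EXTERN);
	mt_set_external_lock(&mgr->mt_ext, &gpuvm_lock);

Lockdep would then check the external lock on maple tree operations instead of
the tree's internal spinlock.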

> > +	mas_for_each(&mas, ref.ptr, ULONG_MAX) {
> > +		struct drm_gem_object *obj;
> > +
> > +		mas_pause(&mas);
> > +		rcu_read_unlock();
> > +
> > +		obj = (struct drm_gem_object *)(uintptr_t)mas.index;
> > +		ret = drm_exec_prepare_obj(exec, obj, num_fences);
> > +		if (ret)
> > +			goto out;
> > +
> > +		rcu_read_lock();
> > +	}
> > +	rcu_read_unlock();
> > +
> > +out:
> > +	return ret;
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_manager_prepare_objects);
> > +
> > +/**
> > + * drm_gpuva_manager_lock_extra() - lock all dma-resv of all associated BOs
> > + * @mgr: the &drm_gpuva_manager
> > + * @fn: callback received by the driver to lock additional dma-resv
> > + * @priv: private driver data passed to @fn
> > + * @num_fences: the amount of &dma_fences to reserve
> > + * @interruptible: sleep interruptible if waiting
> > + *
> > + * Acquires all dma-resv locks of all &drm_gem_objects the given
> > + * &drm_gpuva_manager contains mappings of.
> > + *
> > + * Additionally, when calling this function the driver receives the given @fn
> > + * callback to lock additional dma-resv in the context of the
> > + * &drm_gpuva_managers &drm_exec instance. Typically, drivers would call
> > + * drm_exec_prepare_obj() from within this callback.
> > + *
> > + * Returns: 0 on success, negative error code on failure.
> > + */
> > +int
> > +drm_gpuva_manager_lock_extra(struct drm_gpuva_manager *mgr,
> > +			     int (*fn)(struct drm_gpuva_manager *mgr,
> > +				       void *priv, unsigned int num_fences),
> > +			     void *priv,
> > +			     unsigned int num_fences,
> > +			     bool interruptible)
> > +{
> > +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
> > +	uint32_t flags;
> > +	int ret;
> > +
> > +	flags = (interruptible ? DRM_EXEC_INTERRUPTIBLE_WAIT : 0) |
> > +		DRM_EXEC_IGNORE_DUPLICATES;
> > +
> > +	drm_exec_init(exec, flags);
> > +
> > +	drm_exec_until_all_locked(exec) {
> > +		ret = drm_gpuva_manager_prepare_objects(mgr, num_fences);
> > +		drm_exec_retry_on_contention(exec);
> > +		if (ret)
> > +			goto err;
> > +
> > +		if (fn) {
> > +			ret = fn(mgr, priv, num_fences);
> > +			drm_exec_retry_on_contention(exec);
> > +			if (ret)
> > +				goto err;
> > +		}
> > +	}
> > +
> > +	return 0;
> > +
> > +err:
> > +	drm_exec_fini(exec);
> > +	return ret;
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_manager_lock_extra);
> > +
> > +static int
> > +fn_lock_array(struct drm_gpuva_manager *mgr, void *priv,
> > +				unsigned int num_fences)
> > +{
> > +	struct {
> > +		struct drm_gem_object **objs;
> > +		unsigned int num_objs;
> > +	} *args = priv;
> > +
> > +	return drm_exec_prepare_array(DRM_GPUVA_EXEC(mgr), args->objs,
> > +				      args->num_objs, num_fences);
> > +}
> > +
> > +/**
> > + * drm_gpuva_manager_lock_array() - lock all dma-resv of all associated BOs
> > + * @mgr: the &drm_gpuva_manager
> > + * @objs: additional &drm_gem_objects to lock
> > + * @num_objs: the number of additional &drm_gem_objects to lock
> > + * @num_fences: the amount of &dma_fences to reserve
> > + * @interruptible: sleep interruptible if waiting
> > + *
> > + * Acquires all dma-resv locks of all &drm_gem_objects the given
> > + * &drm_gpuva_manager contains mappings of, plus the ones given through @objs.
> > + *
> > + * Returns: 0 on success, negative error code on failure.
> > + */
> > +int
> > +drm_gpuva_manager_lock_array(struct drm_gpuva_manager *mgr,
> > +			     struct drm_gem_object **objs,
> > +			     unsigned int num_objs,
> > +			     unsigned int num_fences,
> > +			     bool interruptible)
> > +{
> > +	struct {
> > +		struct drm_gem_object **objs;
> > +		unsigned int num_objs;
> > +	} args;
> > +
> > +	args.objs = objs;
> > +	args.num_objs = num_objs;
> > +
> > +	return drm_gpuva_manager_lock_extra(mgr, fn_lock_array, &args,
> > +					    num_fences, interruptible);
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_manager_lock_array);
> > +
> > +/**
> > + * drm_gpuva_manager_validate() - validate all BOs marked as evicted
> > + * @mgr: the &drm_gpuva_manager to validate evicted BOs
> > + *
> > + * Calls the &drm_gpuva_fn_ops.bo_validate callback for all evicted buffer
> > + * objects being mapped in the given &drm_gpuva_manager.
> > + *
> > + * Returns: 0 on success, negative error code on failure.
> > + */
> > +int
> > +drm_gpuva_manager_validate(struct drm_gpuva_manager *mgr)
> > +{
> > +	const struct drm_gpuva_fn_ops *ops = mgr->ops;
> > +	struct drm_gpuva_gem *vm_bo;
> > +	int ret;
> > +
> > +	if (unlikely(!ops || !ops->bo_validate))
> > +		return -ENOTSUPP;
> > +
> > +	/* At this point we should hold all dma-resv locks of all GEM objects
> > +	 * associated with this GPU-VM, hence it is safe to walk the list.
> > +	 */
> > +	list_for_each_entry(vm_bo, &mgr->evict.list, list.entry.evict) {
> > +		dma_resv_assert_held(vm_bo->obj->resv);
> > +
> > +		ret = ops->bo_validate(vm_bo->obj);
> > +		if (ret)
> > +			return ret;
> > +	}
> > +
> > +	return 0;
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_manager_validate);
> > +
> > +/**
> > + * drm_gpuva_manager_resv_add_fence - add fence to private and all extobj
> > + * dma-resv
> > + * @mgr: the &drm_gpuva_manager to add a fence to
> > + * @fence: fence to add
> > + * @private_usage: private dma-resv usage
> > + * @extobj_usage: extobj dma-resv usage
> > + */
> > +void
> > +drm_gpuva_manager_resv_add_fence(struct drm_gpuva_manager *mgr,
> > +				 struct dma_fence *fence,
> > +				 enum dma_resv_usage private_usage,
> > +				 enum dma_resv_usage extobj_usage)
> > +{
> > +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
> > +	struct drm_gem_object *obj;
> > +	unsigned long index;
> > +
> > +	drm_exec_for_each_locked_object(exec, index, obj) {
> > +			dma_resv_assert_held(obj->resv);
> > +			dma_resv_add_fence(obj->resv, fence,
> > +					   drm_gpuva_is_extobj(mgr, obj) ?
> > +					   extobj_usage : private_usage);
> > +	}
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_manager_resv_add_fence);
> > +
> > +static struct drm_gpuva_gem *
> > +__drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
> > +		     struct drm_gem_object *obj)
> > +{
> > +	struct drm_gpuva_gem *vm_bo;
> > +
> > +	drm_gem_gpuva_assert_lock_held(obj);
> > +
> > +	drm_gem_for_each_gpuva_gem(vm_bo, obj)
> > +		if (vm_bo->mgr == mgr)
> > +			return vm_bo;
> > +
> > +	return NULL;
> > +}
> > +
> > +/**
> > + * drm_gpuva_gem_create() - create a new instance of struct drm_gpuva_gem
> > + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> > + * @obj: The &drm_gem_object being mapped in the @mgr.
> > + *
> > + * If provided by the driver, this function uses the &drm_gpuva_fn_ops
> > + * vm_bo_alloc() callback to allocate.
> > + *
> > + * Returns: a pointer to the &drm_gpuva_gem on success, NULL on failure
> > + */
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_create(struct drm_gpuva_manager *mgr,
> > +		     struct drm_gem_object *obj)
> > +{
> > +	const struct drm_gpuva_fn_ops *ops = mgr->ops;
> > +	struct drm_gpuva_gem *vm_bo;
> > +
> > +	if (ops && ops->vm_bo_alloc)
> > +		vm_bo = ops->vm_bo_alloc();
> > +	else
> > +		vm_bo = kzalloc(sizeof(*vm_bo), GFP_KERNEL);
> > +
> > +	if (unlikely(!vm_bo))
> > +		return NULL;
> > +
> > +	vm_bo->mgr = mgr;
> > +	vm_bo->obj = obj;
> > +
> > +	kref_init(&vm_bo->kref);
> > +	INIT_LIST_HEAD(&vm_bo->list.gpuva);
> > +	INIT_LIST_HEAD(&vm_bo->list.entry.gem);
> > +	INIT_LIST_HEAD(&vm_bo->list.entry.evict);
> > +
> > +	drm_gem_object_get(obj);
> > +
> > +	return vm_bo;
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_create);
> > +
> > +void
> > +drm_gpuva_gem_destroy(struct kref *kref)
> > +{
> > +	struct drm_gpuva_gem *vm_bo = container_of(kref, struct drm_gpuva_gem,
> > +						   kref);
> > +	const struct drm_gpuva_fn_ops *ops = vm_bo->mgr->ops;
> > +
> > +	drm_gem_object_put(vm_bo->obj);
> > +
> > +	if (ops && ops->vm_bo_free)
> > +		ops->vm_bo_free(vm_bo);
> > +	else
> > +		kfree(vm_bo);
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_destroy);
> > +
> > +/**
> > + * drm_gpuva_gem_find() - find the &drm_gpuva_gem for the given
> > + * &drm_gpuva_manager and &drm_gem_object
> > + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> > + * @obj: The &drm_gem_object being mapped in the @mgr.
> > + *
> > + * Find the &drm_gpuva_gem representing the combination of the given
> > + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
> > + * count of the &drm_gpuva_gem accordingly.
> > + *
> > + * Returns: a pointer to the &drm_gpuva_gem on success, NULL on failure
> > + */
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
> > +		   struct drm_gem_object *obj)
> > +{
> > +	struct drm_gpuva_gem *vm_bo = __drm_gpuva_gem_find(mgr, obj);
> > +
> > +	return vm_bo ? drm_gpuva_gem_get(vm_bo) : NULL;
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_find);
> > +
> > +/**
> > + * drm_gpuva_gem_obtain() - obtains an instance of the &drm_gpuva_gem for the
> > + * given &drm_gpuva_manager and &drm_gem_object
> > + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> > + * @obj: The &drm_gem_object being mapped in the @mgr.
> > + *
> > + * Find the &drm_gpuva_gem representing the combination of the given
> > + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
> > + * count of the &drm_gpuva_gem accordingly. If not found, allocates a new
> > + * &drm_gpuva_gem.
> > + *
> > + * Returns: a pointer to the &drm_gpuva_gem on success, an ERR_PTR on failure
> > + */
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_obtain(struct drm_gpuva_manager *mgr,
> > +		     struct drm_gem_object *obj)
> > +{
> > +	struct drm_gpuva_gem *vm_bo;
> > +
> > +	vm_bo = drm_gpuva_gem_find(mgr, obj);
> > +	if (vm_bo)
> > +		return vm_bo;
> > +
> > +	vm_bo = drm_gpuva_gem_create(mgr, obj);
> > +	if (!vm_bo)
> > +		return ERR_PTR(-ENOMEM);
> > +
> > +	return vm_bo;
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_obtain);
> > +
> > +/**
> > + * drm_gpuva_gem_obtain_prealloc() - obtains an instance of the &drm_gpuva_gem
> > + * for the given &drm_gpuva_manager and &drm_gem_object
> > + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> > + * @obj: The &drm_gem_object being mapped in the @mgr.
> > + *
> > + * Find the &drm_gpuva_gem representing the combination of the given
> > + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
> > + * count of the found &drm_gpuva_gem accordingly, while the @__vm_bo reference
> > + * count is decreased. If not found @__vm_bo is returned.
> > + *
> > + * Returns: a pointer to the found &drm_gpuva_gem or @__vm_bo if no existing
> > + * &drm_gpuva_gem was found
> > + */
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_obtain_prealloc(struct drm_gpuva_manager *mgr,
> > +			      struct drm_gem_object *obj,
> > +			      struct drm_gpuva_gem *__vm_bo)
> > +{
> > +	struct drm_gpuva_gem *vm_bo;
> > +
> > +	vm_bo = drm_gpuva_gem_find(mgr, obj);
> > +	if (vm_bo) {
> > +		drm_gpuva_gem_put(__vm_bo);
> > +		return vm_bo;
> > +	}
> > +
> > +	return __vm_bo;
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_obtain_prealloc);
> > +
> > +static int
> > +__drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
> > +			  struct drm_gem_object *obj,
> > +			  gfp_t gfp)
> > +{
> > +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
> > +	union {
> > +		struct drm_gem_object *obj;
> > +		uintptr_t index;
> > +	} gem;
> > +	union {
> > +		void *ptr;
> > +		uintptr_t cnt;
> > +	} ref;
> > +	int ret = 0;
> > +
> > +	gem.obj = obj;
> > +	mas_set(&mas, gem.index);
> > +
> > +	mas_lock(&mas);
> > +	ref.ptr = mas_walk(&mas);
> > +	if (ref.ptr) {
> > +		++ref.cnt;
> > +		mas_store(&mas, ref.ptr);
> > +	} else {
> > +		if (unlikely(!gfp)) {
> > +			ret = -EINVAL;
> > +			goto out;
> > +		}
> > +
> > +		mas_set(&mas, gem.index);
> > +		ref.cnt = 1;
> > +		ret = mas_store_gfp(&mas, ref.ptr, gfp);
> > +		if (likely(!ret))
> > +			drm_gem_object_get(obj);
> > +	}
> > +out:
> > +	mas_unlock(&mas);
> > +	return ret;
> > +}
> > +
> > +static void
> > +__drm_gpuva_extobj_remove(struct drm_gpuva_manager *mgr,
> > +			  struct drm_gem_object *obj)
> > +{
> > +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
> > +	union {
> > +		struct drm_gem_object *obj;
> > +		uintptr_t index;
> > +	} gem;
> > +	union {
> > +		void *ptr;
> > +		uintptr_t cnt;
> > +	} ref;
> > +
> > +	gem.obj = obj;
> > +	mas_set(&mas, gem.index);
> > +
> > +	mas_lock(&mas);
> > +	if (unlikely(!(ref.ptr = mas_walk(&mas))))
> > +		goto out;
> > +
> > +	if (!--ref.cnt) {
> > +		mas_erase(&mas);
> > +		drm_gem_object_put(obj);
> > +	} else {
> > +		mas_store(&mas, ref.ptr);
> > +	}
> > +out:
> > +	mas_unlock(&mas);
> > +}
> > +
> > +/**
> > + * drm_gpuva_extobj_insert - insert an external &drm_gem_object
> > + * @mgr: the &drm_gpuva_manager to insert into
> > + * @obj: the &drm_gem_object to insert as extobj
> > + *
> > + * Insert a &drm_gem_object into the &drm_gpuva_managers external object tree.
> > + * If the &drm_gem_object already exists in the tree, the reference counter
> > + * of this external object is increased by one.
> > + *
> > + * Drivers should insert the external &drm_gem_object before the dma-fence
> > + * signalling critical section, e.g. when submitting the job, and before
> > + * locking all &drm_gem_objects of a GPU-VM, e.g. with drm_gpuva_manager_lock()
> > + * or its derivatives.
> > + *
> > + * Returns: 0 on success, negative error code on failure.
> > + */
> > +int
> > +drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
> > +			struct drm_gem_object *obj)
> > +{
> > +	return drm_gpuva_is_extobj(mgr, obj) ?
> > +		__drm_gpuva_extobj_insert(mgr, obj, GFP_KERNEL) : 0;
> > +
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_insert);
> > +
> > +/**
> > + * drm_gpuva_extobj_get - increase the reference count of an external
> > + * &drm_gem_object
> > + * @mgr: the &drm_gpuva_manager storing the extobj
> > + * @obj: the &drm_gem_object representing the extobj
> > + *
> > + * Increases the reference count of the extobj represented by @obj.
> > + *
> > + * Drivers should call this for every &drm_gpuva backed by a &drm_gem_object
> > + * being inserted.
> > + *
> > + * For &drm_gpuva_op_remap operations drivers should make sure to only take an
> > + * additional reference if the re-map operation splits an existing &drm_gpuva
> > + * into two separate ones.
> > + *
> > + * See also drm_gpuva_map_get() and drm_gpuva_remap_get().
> > + *
> > + * Returns: 0 on success, negative error code on failure.
> > + */
> > +void
> > +drm_gpuva_extobj_get(struct drm_gpuva_manager *mgr,
> > +		     struct drm_gem_object *obj)
> > +{
> > +	if (drm_gpuva_is_extobj(mgr, obj))
> > +		WARN(__drm_gpuva_extobj_insert(mgr, obj, 0),
> > +		     "Can't increase ref-count of non-existent extobj.");
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_get);
> > +
> > +/**
> > + * drm_gpuva_extobj_put - decrease the reference count of an external
> > + * &drm_gem_object
> > + * @mgr: the &drm_gpuva_manager storing the extobj
> > + * @obj: the &drm_gem_object representing the extobj
> > + *
> > + * Decreases the reference count of the extobj represented by @obj.
> > + *
> > + * Drivers should call this for every &drm_gpuva backed by a &drm_gem_object
> > + * being removed from the GPU VA space.
> > + *
> > + * See also drm_gpuva_unmap_put().
> > + *
> > + * Returns: 0 on success, negative error code on failure.
> > + */
> > +void
> > +drm_gpuva_extobj_put(struct drm_gpuva_manager *mgr,
> > +		     struct drm_gem_object *obj)
> > +{
> > +	if (drm_gpuva_is_extobj(mgr, obj))
> > +		__drm_gpuva_extobj_remove(mgr, obj);
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_put);
> > +
> > +/**
> > + * drm_gpuva_gem_evict() - add / remove a &drm_gem_object to / from a
> > + * &drm_gpuva_managers evicted list
> > + * @obj: the &drm_gem_object to add or remove
> > + * @evict: indicates whether the object is evicted
> > + *
> > + * Adds a &drm_gem_object to or removes it from all &drm_gpuva_managers evicted
> > + * list containing a mapping of this &drm_gem_object.
> > + */
> > +void
> > +drm_gpuva_gem_evict(struct drm_gem_object *obj, bool evict)
> > +{
> > +	struct drm_gpuva_gem *vm_bo;
> > +
> > +	/* Required for iterating the GEMs GPUVA GEM list. If no driver specific
> > +	 * lock has been set, the list is protected with the GEMs dma-resv lock.
> > +	 */
> > +	drm_gem_gpuva_assert_lock_held(obj);
> > +
> > +	/* Required to protect the GPUVA managers evict list against concurrent
> > +	 * access through drm_gpuva_manager_validate(). Concurrent insertions to
> > +	 * the evict list through different GEM object evictions are protected
> > +	 * by the GPUVA managers evict lock.
> > +	 */
> > +	dma_resv_assert_held(obj->resv);
> > +
> > +	drm_gem_for_each_gpuva_gem(vm_bo, obj) {
> > +		struct drm_gpuva_manager *mgr = vm_bo->mgr;
> > +
> > +		spin_lock(&mgr->evict.lock);
> > +		if (evict)
> > +			list_add_tail(&vm_bo->list.entry.evict,
> > +				      &mgr->evict.list);
> > +		else
> > +			list_del_init(&vm_bo->list.entry.evict);
> > +		spin_unlock(&mgr->evict.lock);
> > +	}
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_evict);
> > +
> >   static int
> >   __drm_gpuva_insert(struct drm_gpuva_manager *mgr,
> >   		   struct drm_gpuva *va)
> > @@ -806,15 +1381,20 @@ EXPORT_SYMBOL_GPL(drm_gpuva_remove);
> >   /**
> >    * drm_gpuva_link() - link a &drm_gpuva
> >    * @va: the &drm_gpuva to link
> > + * @vm_bo: the &drm_gpuva_gem to add the &drm_gpuva to
> >    *
> > - * This adds the given &va to the GPU VA list of the &drm_gem_object it is
> > - * associated with.
> > + * This adds the given &va to the GPU VA list of the &drm_gpuva_gem and the
> > + * &drm_gpuva_gem to the &drm_gem_object it is associated with.
> > + *
> > + * For every &drm_gpuva entry added to the &drm_gpuva_gem an additional
> > + * reference of the latter is taken.
> >    *
> >    * This function expects the caller to protect the GEM's GPUVA list against
> > - * concurrent access using the GEMs dma_resv lock.
> > + * concurrent access using either the GEMs dma_resv lock or a driver specific
> > + * lock set through drm_gem_gpuva_set_lock().
> >    */
> >   void
> > -drm_gpuva_link(struct drm_gpuva *va)
> > +drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo)
> >   {
> >   	struct drm_gem_object *obj = va->gem.obj;
> > @@ -823,7 +1403,10 @@ drm_gpuva_link(struct drm_gpuva *va)
> >   	drm_gem_gpuva_assert_lock_held(obj);
> > -	list_add_tail(&va->gem.entry, &obj->gpuva.list);
> > +	drm_gpuva_gem_get(vm_bo);
> > +	list_add_tail(&va->gem.entry, &vm_bo->list.gpuva);
> > +	if (list_empty(&vm_bo->list.entry.gem))
> > +		list_add_tail(&vm_bo->list.entry.gem, &obj->gpuva.list);
> >   }
> >   EXPORT_SYMBOL_GPL(drm_gpuva_link);
> > @@ -834,20 +1417,39 @@ EXPORT_SYMBOL_GPL(drm_gpuva_link);
> >    * This removes the given &va from the GPU VA list of the &drm_gem_object it is
> >    * associated with.
> >    *
> > + * This removes the given &va from the GPU VA list of the &drm_gpuva_gem and
> > + * the &drm_gpuva_gem from the &drm_gem_object it is associated with in case
> > + * this call unlinks the last &drm_gpuva from the &drm_gpuva_gem.
> > + *
> > + * For every &drm_gpuva entry removed from the &drm_gpuva_gem a reference of
> > + * the latter is dropped.
> > + *
> >    * This function expects the caller to protect the GEM's GPUVA list against
> > - * concurrent access using the GEMs dma_resv lock.
> > + * concurrent access using either the GEMs dma_resv lock or a driver specific
> > + * lock set through drm_gem_gpuva_set_lock().
> >    */
> >   void
> >   drm_gpuva_unlink(struct drm_gpuva *va)
> >   {
> >   	struct drm_gem_object *obj = va->gem.obj;
> > +	struct drm_gpuva_gem *vm_bo;
> >   	if (unlikely(!obj))
> >   		return;
> >   	drm_gem_gpuva_assert_lock_held(obj);
> > +	vm_bo = __drm_gpuva_gem_find(va->mgr, obj);
> > +	if (WARN(!vm_bo, "GPUVA doesn't seem to be linked.\n"))
> > +		return;
> > +
> >   	list_del_init(&va->gem.entry);
> > +
> > +	if (list_empty(&vm_bo->list.gpuva)) {
> > +		list_del_init(&vm_bo->list.entry.gem);
> > +		list_del_init(&vm_bo->list.entry.evict);
> > +	}
> > +	drm_gpuva_gem_put(vm_bo);
> >   }
> >   EXPORT_SYMBOL_GPL(drm_gpuva_unlink);
> > @@ -977,6 +1579,26 @@ drm_gpuva_map(struct drm_gpuva_manager *mgr,
> >   }
> >   EXPORT_SYMBOL_GPL(drm_gpuva_map);
> > +/**
> > + * drm_gpuva_map_get() - helper to insert a &drm_gpuva according to a
> > + * &drm_gpuva_op_map
> > + * @mgr: the &drm_gpuva_manager
> > + * @va: the &drm_gpuva to insert
> > + * @op: the &drm_gpuva_op_map to initialize @va with
> > + *
> > + * Initializes the @va from the @op and inserts it into the given @mgr and
> > + * increases the reference count of the corresponding extobj.
> > + */
> > +void
> > +drm_gpuva_map_get(struct drm_gpuva_manager *mgr,
> > +		  struct drm_gpuva *va,
> > +		  struct drm_gpuva_op_map *op)
> > +{
> > +	drm_gpuva_map(mgr, va, op);
> > +	drm_gpuva_extobj_get(mgr, va->gem.obj);
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_map_get);
> > +
> >   /**
> >    * drm_gpuva_remap() - helper to remap a &drm_gpuva according to a
> >    * &drm_gpuva_op_remap
> > @@ -992,10 +1614,10 @@ drm_gpuva_remap(struct drm_gpuva *prev,
> >   		struct drm_gpuva *next,
> >   		struct drm_gpuva_op_remap *op)
> >   {
> > -	struct drm_gpuva *curr = op->unmap->va;
> > -	struct drm_gpuva_manager *mgr = curr->mgr;
> > +	struct drm_gpuva *va = op->unmap->va;
> > +	struct drm_gpuva_manager *mgr = va->mgr;
> > -	drm_gpuva_remove(curr);
> > +	drm_gpuva_remove(va);
> >   	if (op->prev) {
> >   		drm_gpuva_init_from_op(prev, op->prev);
> > @@ -1009,6 +1631,31 @@ drm_gpuva_remap(struct drm_gpuva *prev,
> >   }
> >   EXPORT_SYMBOL_GPL(drm_gpuva_remap);
> > +/**
> > + * drm_gpuva_remap_get() - helper to remap a &drm_gpuva according to a
> > + * &drm_gpuva_op_remap
> > + * @prev: the &drm_gpuva to remap when keeping the start of a mapping
> > + * @next: the &drm_gpuva to remap when keeping the end of a mapping
> > + * @op: the &drm_gpuva_op_remap to initialize @prev and @next with
> > + *
> > + * Removes the currently mapped &drm_gpuva and remaps it using @prev and/or
> > + * @next. Additionally, if the re-map splits the existing &drm_gpuva into two
> > + * separate mappings, increases the reference count of the corresponding extobj.
> > + */
> > +void
> > +drm_gpuva_remap_get(struct drm_gpuva *prev,
> > +		    struct drm_gpuva *next,
> > +		    struct drm_gpuva_op_remap *op)
> > +{
> > +	struct drm_gpuva *va = op->unmap->va;
> > +	struct drm_gpuva_manager *mgr = va->mgr;
> > +
> > +	drm_gpuva_remap(prev, next, op);
> > +	if (op->prev && op->next)
> > +		drm_gpuva_extobj_get(mgr, va->gem.obj);
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_remap_get);
> > +
> >   /**
> >    * drm_gpuva_unmap() - helper to remove a &drm_gpuva according to a
> >    * &drm_gpuva_op_unmap
> > @@ -1023,6 +1670,24 @@ drm_gpuva_unmap(struct drm_gpuva_op_unmap *op)
> >   }
> >   EXPORT_SYMBOL_GPL(drm_gpuva_unmap);
> > +/**
> > + * drm_gpuva_unmap_put() - helper to remove a &drm_gpuva according to a
> > + * &drm_gpuva_op_unmap
> > + * @op: the &drm_gpuva_op_unmap specifying the &drm_gpuva to remove
> > + *
> > + * Removes the &drm_gpuva associated with the &drm_gpuva_op_unmap and decreases
> > + * the reference count of the corresponding extobj.
> > + */
> > +void
> > +drm_gpuva_unmap_put(struct drm_gpuva_op_unmap *op)
> > +{
> > +	struct drm_gpuva *va = op->va;
> > +
> > +	drm_gpuva_unmap(op);
> > +	drm_gpuva_extobj_put(va->mgr, va->gem.obj);
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_unmap_put);
> > +
> >   static int
> >   op_map_cb(const struct drm_gpuva_fn_ops *fn, void *priv,
> >   	  u64 addr, u64 range,
> > @@ -1663,6 +2328,7 @@ drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
> >   {
> >   	struct drm_gpuva_ops *ops;
> >   	struct drm_gpuva_op *op;
> > +	struct drm_gpuva_gem *vm_bo;
> >   	struct drm_gpuva *va;
> >   	int ret;
> > @@ -1674,7 +2340,7 @@ drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
> >   	INIT_LIST_HEAD(&ops->list);
> > -	drm_gem_for_each_gpuva(va, obj) {
> > +	drm_gem_for_each_gpuva(va, vm_bo, mgr, obj) {
> >   		op = gpuva_op_alloc(mgr);
> >   		if (!op) {
> >   			ret = -ENOMEM;
> > diff --git a/include/drm/drm_gem.h b/include/drm/drm_gem.h
> > index bc9f6aa2f3fe..783ed3ab440d 100644
> > --- a/include/drm/drm_gem.h
> > +++ b/include/drm/drm_gem.h
> > @@ -571,7 +571,7 @@ int drm_gem_evict(struct drm_gem_object *obj);
> >    * drm_gem_gpuva_init() - initialize the gpuva list of a GEM object
> >    * @obj: the &drm_gem_object
> >    *
> > - * This initializes the &drm_gem_object's &drm_gpuva list.
> > + * This initializes the &drm_gem_object's &drm_gpuva_gem list.
> >    *
> >    * Calling this function is only necessary for drivers intending to support the
> >    * &drm_driver_feature DRIVER_GEM_GPUVA.
> > @@ -584,28 +584,44 @@ static inline void drm_gem_gpuva_init(struct drm_gem_object *obj)
> >   }
> >   /**
> > - * drm_gem_for_each_gpuva() - iternator to walk over a list of gpuvas
> > - * @entry__: &drm_gpuva structure to assign to in each iteration step
> > - * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
> > + * drm_gem_for_each_gpuva_gem() - iterator to walk over a list of &drm_gpuva_gem
> > + * @entry__: &drm_gpuva_gem structure to assign to in each iteration step
> > + * @obj__: the &drm_gem_object the &drm_gpuva_gem to walk are associated with
> >    *
> > - * This iterator walks over all &drm_gpuva structures associated with the
> > - * &drm_gpuva_manager.
> > + * This iterator walks over all &drm_gpuva_gem structures associated with the
> > + * &drm_gem_object.
> >    */
> > -#define drm_gem_for_each_gpuva(entry__, obj__) \
> > -	list_for_each_entry(entry__, &(obj__)->gpuva.list, gem.entry)
> > +#define drm_gem_for_each_gpuva_gem(entry__, obj__) \
> > +	list_for_each_entry(entry__, &(obj__)->gpuva.list, list.entry.gem)
> >   /**
> > - * drm_gem_for_each_gpuva_safe() - iternator to safely walk over a list of
> > - * gpuvas
> > - * @entry__: &drm_gpuva structure to assign to in each iteration step
> > - * @next__: &next &drm_gpuva to store the next step
> > - * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
> > + * drm_gem_for_each_gpuva_gem_safe() - iterator to safely walk over a list of
> > + * &drm_gpuva_gem
> > + * @entry__: &drm_gpuva_gem structure to assign to in each iteration step
> > + * @next__: &next &drm_gpuva_gem to store the next step
> > + * @obj__: the &drm_gem_object the &drm_gpuva_gem to walk are associated with
> >    *
> > - * This iterator walks over all &drm_gpuva structures associated with the
> > + * This iterator walks over all &drm_gpuva_gem structures associated with the
> >    * &drm_gem_object. It is implemented with list_for_each_entry_safe(), hence
> >    * it is save against removal of elements.
> >    */
> > -#define drm_gem_for_each_gpuva_safe(entry__, next__, obj__) \
> > -	list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list, gem.entry)
> > +#define drm_gem_for_each_gpuva_gem_safe(entry__, next__, obj__) \
> > +	list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list, list.entry.gem)
> > +
> > +/**
> > + * drm_gem_for_each_gpuva() - iterator to walk over a list of &drm_gpuva
> > + * @va__: &drm_gpuva structure to assign to in each iteration step
> > + * @vm_bo__: the &drm_gpuva_gem representing the @mgr__ and @obj__ combination
> > + * @mgr__: the &drm_gpuva_manager the &drm_gpuvas to walk are associated with
> > + * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
> > + *
> > + * This iterator walks over all &drm_gpuva structures associated with the
> > + * &drm_gpuva_manager and &drm_gem_object.
> > + */
> > +#define drm_gem_for_each_gpuva(va__, vm_bo__, mgr__, obj__) \
> > +	for (vm_bo__ = drm_gpuva_gem_find(mgr__, obj__), \
> > +	     va__ = vm_bo__ ? list_first_entry(&vm_bo__->list.gpuva, typeof(*va__), gem.entry) : NULL; \
> > +	     va__ && !list_entry_is_head(va__, &vm_bo__->list.gpuva, gem.entry); \
> > +	     va__ = list_next_entry(va__, gem.entry))
> >   #endif /* __DRM_GEM_H__ */
> > diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h
> > index ed8d50200cc3..693e2da3f425 100644
> > --- a/include/drm/drm_gpuva_mgr.h
> > +++ b/include/drm/drm_gpuva_mgr.h
> > @@ -26,12 +26,16 @@
> >    */
> >   #include <linux/list.h>
> > +#include <linux/dma-resv.h>
> > +#include <linux/maple_tree.h>
> >   #include <linux/rbtree.h>
> >   #include <linux/types.h>
> >   #include <drm/drm_gem.h>
> > +#include <drm/drm_exec.h>
> >   struct drm_gpuva_manager;
> > +struct drm_gpuva_gem;
> >   struct drm_gpuva_fn_ops;
> >   /**
> > @@ -140,7 +144,7 @@ struct drm_gpuva {
> >   int drm_gpuva_insert(struct drm_gpuva_manager *mgr, struct drm_gpuva *va);
> >   void drm_gpuva_remove(struct drm_gpuva *va);
> > -void drm_gpuva_link(struct drm_gpuva *va);
> > +void drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo);
> >   void drm_gpuva_unlink(struct drm_gpuva *va);
> >   struct drm_gpuva *drm_gpuva_find(struct drm_gpuva_manager *mgr,
> > @@ -240,15 +244,137 @@ struct drm_gpuva_manager {
> >   	 * @ops: &drm_gpuva_fn_ops providing the split/merge steps to drivers
> >   	 */
> >   	const struct drm_gpuva_fn_ops *ops;
> > +
> > +	/**
> > +	 * @d_obj: Dummy GEM object; used internally to pass the GPU VMs
> > +	 * dma-resv to &drm_exec.
> > +	 */
> > +	struct drm_gem_object d_obj;
> > +
> > +	/**
> > +	 * @resv: the &dma_resv for &drm_gem_objects mapped in this GPU VA
> > +	 * space
> > +	 */
> > +	struct dma_resv *resv;
> > +
> > +	/**
> > +	 * @exec: the &drm_exec helper to lock external &drm_gem_objects
> > +	 */
> > +	struct drm_exec exec;
> > +
> > +	/**
> > +	 * @mt_ext: &maple_tree storing external &drm_gem_objects
> > +	 */
> > +	struct maple_tree mt_ext;
> 
> Why are you using a maple tree here? Insertion and removal is O(log(n))
> instead of O(1) for a list?
>

Having a list of drm_gem_objects directly wouldn't work, as multiple GPU-VMs
could have mappings of the same extobj.

I considered using the VM_BO abstraction (struct drm_gpuva_gem) as list entry
instead, which also seems to be the obvious choice. However, there is a locking
conflict.

A drm_gem_object keeps a list of drm_gpuva_gems, while each drm_gpuva_gem keeps
a list of drm_gpuvas. Both lists are either protected with the dma-resv lock of
the corresponding drm_gem_object, or with an external lock provided by the
driver (see drm_gem_gpuva_set_lock()). The latter is used by drivers performing
changes on the GPUVA space directly from the fence signalling path.

Now, similar to what drm_gpuva_link() and drm_gpuva_unlink() are doing already,
we'd want to add a drm_gpuva_gem to the extobj list for the first mapping being
linked and we'd want to remove it for the last one being unlinked.

(Actually we'd want to add the drm_gpuva_gem object to the extobj list even
earlier, because otherwise we wouldn't acquire this GEM object's dma-resv lock
through drm_gpuva_manager_lock(). But that's trivial, we could do that when we
create the drm_gpuva_gem, which we need to do anyways.)

Anyway, we'd probably want to keep removing the drm_gpuva_gem from the extobj
list from drm_gpuva_unlink() when the last mapping of this BO is unlinked. In
order to do so, we'd (as discussed above) either need to hold the outer GPU-VM
lock or the GPU-VMs dma-resv lock. Both would be illegal in the case
drm_gpuva_unlink() is called from within the fence signalling path. For drivers
like XE or Nouveau, we'd at least need to make sure to not mess up the locking
hierarchy of GPU-VM lock and dma-resv lock of the corresponding BO.

Considering all that, I thought it's probably better to track extobjs separate
from the drm_gpuva_gem, hence the maple tree choice.
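Condensed, the extobj bookkeeping boils down to this (simplified from the patch
above; error handling and the gfp == 0 special case are omitted):

	/* GEM object pointer is the index, the stored value is the refcount. */
	MA_STATE(mas, &mgr->mt_ext, (uintptr_t)obj, (uintptr_t)obj);
	uintptr_t cnt;

	mas_lock(&mas);
	cnt = (uintptr_t)mas_walk(&mas);
	if (cnt) {
		mas_store(&mas, (void *)(cnt + 1));
	} else {
		mas_set(&mas, (uintptr_t)obj);
		if (!mas_store_gfp(&mas, (void *)1UL, GFP_KERNEL))
			drm_gem_object_get(obj);
	}
	mas_unlock(&mas);

Removal is the inverse: decrement the counter and mas_erase() plus
drm_gem_object_put() once it drops to zero.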

> > +
> > +	/**
> > +	 * @evict: structure holding the evict list and evict list lock
> > +	 */
> > +	struct {
> > +		/**
> > +		 * @list: &list_head storing &drm_gem_objects currently being
> > +		 * evicted
> > +		 */
> > +		struct list_head list;
> > +
> > +		/**
> > +		 * @lock: spinlock to protect the evict list against concurrent
> > +		 * insertion / removal of different &drm_gpuva_gems
> > +		 */
> > +		spinlock_t lock;
> > +	} evict;
> >   };
> >   void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> > +			    struct drm_device *drm,
> >   			    const char *name,
> >   			    u64 start_offset, u64 range,
> >   			    u64 reserve_offset, u64 reserve_range,
> >   			    const struct drm_gpuva_fn_ops *ops);
> >   void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr);
> > +/**
> > + * DRM_GPUVA_EXEC - returns the &drm_gpuva_managers &drm_exec instance
> > + * @mgr: the &drm_gpuva_managers to return the &drm_exec instance for
> > + */
> > +#define DRM_GPUVA_EXEC(mgr)	&(mgr)->exec
> 
> A struct ww_acquire_ctx and thus a drm_exec is fundamentally per task and
> should typically be allocated on the stack. Otherwise you'd need to protect
> the mgr->exec member with an exclusive lock throughout the locking process,
> and that's not what we want.

Oh, good point. I think it works in Nouveau, because there it's implicitly
protected with the job submission lock.
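Roughly, the pattern is (purely illustrative; "uvmm->submit_lock" stands in for
whatever outer lock the driver uses to serialize submissions, and the dma-resv
usages are exemplary):

	mutex_lock(&uvmm->submit_lock);

	ret = drm_gpuva_manager_lock(mgr, num_fences, true);
	if (!ret) {
		ret = drm_gpuva_manager_validate(mgr);
		if (!ret) {
			/* ... push the job and obtain its fence ... */
			drm_gpuva_manager_resv_add_fence(mgr, fence,
							 DMA_RESV_USAGE_BOOKKEEP,
							 DMA_RESV_USAGE_BOOKKEEP);
		}
		drm_gpuva_manager_unlock(mgr);
	}

	mutex_unlock(&uvmm->submit_lock);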

> 
> Did you consider subclassing a drm_exec for drm_gpuva purposes and add
> needed ops to it: Like so:

That's a good idea, will take this into V2.

> 
> struct drm_gpuva_exec_ops {
>     int (*fn) (struct drm_gpuva_exec *exec, int num_fences);

Is this the fn argument from drm_gpuva_manager_lock_extra()?

>     int (*bo_validate) (struct drm_gpuva_exec *exec, struct drm_gem_object
> *obj);

I guess we could also keep that within the drm_gpuva_fn_ops? This should always
be the same callback, right?

> };
> 
> struct drm_gpuva_exec {
>     const struct drm_gpuva_exec_ops *ops;
>     struct drm_exec exec;
>     struct drm_gpuva_manager *mgr;
> };
> 
> Although I'd actually expect bo_validate to be part of fn in the typical
> case. The drm_gpuva_exec would then be allocated by the caller on the stack.

This doesn't sound like my assumption about fn() above is correct.
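Just to make sure I understand the proposal: usage on the driver side would
then look roughly like this (based entirely on your sketch above, the lock
helper name is made up)?

	struct drm_gpuva_exec vm_exec = {
		.mgr = mgr,
		.ops = &driver_gpuva_exec_ops,
	};
	struct drm_exec *exec = &vm_exec.exec;
	int ret;

	drm_exec_init(exec, DRM_EXEC_INTERRUPTIBLE_WAIT);
	drm_exec_until_all_locked(exec) {
		/* hypothetical helper calling ops->fn() / preparing the VM's objects */
		ret = drm_gpuva_exec_lock(&vm_exec, num_fences);
		drm_exec_retry_on_contention(exec);
		if (ret)
			break;
	}
	if (ret)
		drm_exec_fini(exec);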

> 
> 
> > +
> > +int drm_gpuva_manager_lock_extra(struct drm_gpuva_manager *mgr,
> > +				 int (*fn)(struct drm_gpuva_manager *mgr,
> > +					   void *priv, unsigned int num_fences),
> > +				 void *priv,
> > +				 unsigned int num_fences,
> > +				 bool interruptible);
> > +
> > +int drm_gpuva_manager_lock_array(struct drm_gpuva_manager *mgr,
> > +				 struct drm_gem_object **objs,
> > +				 unsigned int num_objs,
> > +				 unsigned int num_fences,
> > +				 bool interruptible);
> > +
> > +/**
> > + * drm_gpuva_manager_lock() - lock all dma-resv of all associated BOs
> > + * @mgr: the &drm_gpuva_manager
> > + * @num_fences: the amount of &dma_fences to reserve
> > + * @interruptible: sleep interruptible if waiting
> > + *
> > + * Acquires all dma-resv locks of all &drm_gem_objects the given
> > + * &drm_gpuva_manager contains mappings of.
> > + *
> > + * Returns: 0 on success, negative error code on failure.
> > + */
> > +static inline int
> > +drm_gpuva_manager_lock(struct drm_gpuva_manager *mgr,
> > +		       unsigned int num_fences,
> > +		       bool interruptible)
> > +{
> > +	return drm_gpuva_manager_lock_extra(mgr, NULL, NULL, num_fences,
> > +					    interruptible);
> > +}
> > +
> > +/**
> > + * drm_gpuva_manager_unlock() - unlock all dma-resv of all associated BOs
> > + * @mgr: the &drm_gpuva_manager
> > + *
> > + * Releases all dma-resv locks of all &drm_gem_objects previously acquired
> > + * through drm_gpuva_manager_lock() or its variants.
> > + *
> > + * Returns: 0 on success, negative error code on failure.
> > + */
> > +static inline void
> > +drm_gpuva_manager_unlock(struct drm_gpuva_manager *mgr)
> > +{
> > +	drm_exec_fini(&mgr->exec);
> > +}
> > +
> > +int drm_gpuva_manager_validate(struct drm_gpuva_manager *mgr);
> > +void drm_gpuva_manager_resv_add_fence(struct drm_gpuva_manager *mgr,
> > +				      struct dma_fence *fence,
> > +				      enum dma_resv_usage private_usage,
> > +				      enum dma_resv_usage extobj_usage);
> > +
> > +int drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
> > +			    struct drm_gem_object *obj);
> > +void drm_gpuva_extobj_get(struct drm_gpuva_manager *mgr,
> > +			  struct drm_gem_object *obj);
> > +void drm_gpuva_extobj_put(struct drm_gpuva_manager *mgr,
> > +			  struct drm_gem_object *obj);
> > +
> > +/**
> > + * drm_gpuva_is_extobj() - indicates whether the given &drm_gem_object is an
> > + * external object
> > + * @mgr: the &drm_gpuva_manager to check
> > + * @obj: the &drm_gem_object to check
> > + *
> > + * Returns: true if the &drm_gem_object &dma_resv differs from the
> > + * &drm_gpuva_managers &dma_resv, false otherwise
> > + */
> > +static inline bool drm_gpuva_is_extobj(struct drm_gpuva_manager *mgr,
> > +				       struct drm_gem_object *obj)
> > +{
> > +	return obj && obj->resv != mgr->resv;
> > +}
> > +
> >   static inline struct drm_gpuva *
> >   __drm_gpuva_next(struct drm_gpuva *va)
> >   {
> > @@ -327,6 +453,138 @@ __drm_gpuva_next(struct drm_gpuva *va)
> >   #define drm_gpuva_for_each_va_safe(va__, next__, mgr__) \
> >   	list_for_each_entry_safe(va__, next__, &(mgr__)->rb.list, rb.entry)
> > +/**
> > + * struct drm_gpuva_gem - structure representing a &drm_gpuva_manager and
> > + * &drm_gem_object combination
> > + *
> > + * This structure is an abstraction representing a &drm_gpuva_manager and
> > + * &drm_gem_object combination. It serves as an indirection to accelerate
> > + * iterating all &drm_gpuvas within a &drm_gpuva_manager backed by the same
> > + * &drm_gem_object.
> > + *
> > + * Furthermore it is used to cache evicted GEM objects for a certain GPU-VM to
> > + * accelerate validation.
> > + *
> > + * Typically, drivers want to create an instance of a struct drm_gpuva_gem once
> > + * a GEM object is mapped first in a GPU-VM and release the instance once the
> > + * last mapping of the GEM object in this GPU-VM is unmapped.
> > + */
> > +struct drm_gpuva_gem {
> > +
> > +	/**
> > +	 * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> > +	 */
> > +	struct drm_gpuva_manager *mgr;
> > +
> > +	/**
> > +	 * @obj: The &drm_gem_object being mapped in the @mgr.
> > +	 */
> > +	struct drm_gem_object *obj;
> > +
> > +	/**
> > +	 * @kref: The reference count for this &drm_gpuva_gem.
> > +	 */
> > +	struct kref kref;
> > +
> > +	/**
> > +	 * @list: Structure containing all &list_heads.
> > +	 */
> > +	struct {
> > +		/**
> > +		 * @gpuva: The list of linked &drm_gpuvas.
> > +		 */
> > +		struct list_head gpuva;
> > +
> > +		/**
> > +		 * @entry: Structure containing all &list_heads serving as
> > +		 * entry.
> > +		 */
> > +		struct {
> > +			/**
> > +			 * @gem: List entry to attach to the &drm_gem_objects
> > +			 * gpuva list.
> > +			 */
> > +			struct list_head gem;
> > +
> > +			/**
> > +			 * @evict: List entry to attach to the
> > +			 * &drm_gpuva_managers evict list.
> > +			 */
> > +			struct list_head evict;
> > +		} entry;
> > +	} list;
> > +};
> > +
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_obtain(struct drm_gpuva_manager *mgr,
> > +		     struct drm_gem_object *obj);
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_obtain_prealloc(struct drm_gpuva_manager *mgr,
> > +			      struct drm_gem_object *obj,
> > +			      struct drm_gpuva_gem *__vm_bo);
> > +
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
> > +		   struct drm_gem_object *obj);
> > +
> > +void drm_gpuva_gem_evict(struct drm_gem_object *obj, bool evict);
> > +
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_create(struct drm_gpuva_manager *mgr,
> > +		     struct drm_gem_object *obj);
> > +void drm_gpuva_gem_destroy(struct kref *kref);
> > +
> > +/**
> > + * drm_gpuva_gem_get() - acquire a struct drm_gpuva_gem reference
> > + * @vm_bo: the &drm_gpuva_gem to acquire the reference of
> > + *
> > + * This function acquires an additional reference to @vm_bo. It is illegal to
> > + * call this without already holding a reference. No locks required.
> > + */
> > +static inline struct drm_gpuva_gem *
> > +drm_gpuva_gem_get(struct drm_gpuva_gem *vm_bo)
> > +{
> > +	kref_get(&vm_bo->kref);
> > +	return vm_bo;
> > +}
> > +
> > +/**
> > + * drm_gpuva_gem_put() - drop a struct drm_gpuva_gem reference
> > + * @vm_bo: the &drm_gpuva_gem to release the reference of
> > + *
> > + * This releases a reference to @vm_bo.
> > + */
> > +static inline void
> > +drm_gpuva_gem_put(struct drm_gpuva_gem *vm_bo)
> > +{
> > +	kref_put(&vm_bo->kref, drm_gpuva_gem_destroy);
> > +}
> > +
> > +/**
> > + * drm_gpuva_gem_for_each_va() - iterator to walk over a list of &drm_gpuva
> > + * @va__: &drm_gpuva structure to assign to in each iteration step
> > + * @vm_bo__: the &drm_gpuva_gem the &drm_gpuva to walk are associated with
> > + *
> > + * This iterator walks over all &drm_gpuva structures associated with the
> > + * &drm_gpuva_gem.
> > + */
> > +#define drm_gpuva_gem_for_each_va(va__, vm_bo__) \
> > +	list_for_each_entry(va__, &(vm_bo__)->list.gpuva, gem.entry)
> > +
> > +/**
> > + * drm_gpuva_gem_for_each_va_safe() - iterator to safely walk over a list of
> > + * &drm_gpuva
> > + * @va__: &drm_gpuva structure to assign to in each iteration step
> > + * @next__: &next &drm_gpuva to store the next step
> > + * @vm_bo__: the &drm_gpuva_gem the &drm_gpuva to walk are associated with
> > + *
> > + * This iterator walks over all &drm_gpuva structures associated with the
> > + * &drm_gpuva_gem. It is implemented with list_for_each_entry_safe(), hence
> > + * it is safe against removal of elements.
> > + */
> > +#define drm_gpuva_gem_for_each_va_safe(va__, next__, vm_bo__) \
> > +	list_for_each_entry_safe(va__, next__, &(vm_bo__)->list.gpuva, gem.entry)
> > +
> >   /**
> >    * enum drm_gpuva_op_type - GPU VA operation type
> >    *
> > @@ -641,6 +899,30 @@ struct drm_gpuva_fn_ops {
> >   	 */
> >   	void (*op_free)(struct drm_gpuva_op *op);
> > +	/**
> > +	 * @vm_bo_alloc: called when the &drm_gpuva_manager allocates
> > +	 * a struct drm_gpuva_gem
> > +	 *
> > +	 * Some drivers may want to embed struct drm_gpuva_gem into driver
> > +	 * specific structures. By implementing this callback drivers can
> > +	 * allocate memory accordingly.
> > +	 *
> > +	 * This callback is optional.
> > +	 */
> > +	struct drm_gpuva_gem *(*vm_bo_alloc)(void);
> > +
> > +	/**
> > +	 * @vm_bo_free: called when the &drm_gpuva_manager frees a
> > +	 * struct drm_gpuva_gem
> > +	 *
> > +	 * Some drivers may want to embed struct drm_gpuva_gem into driver
> > +	 * specific structures. By implementing this callback drivers can
> > +	 * free the previously allocated memory accordingly.
> > +	 *
> > +	 * This callback is optional.
> > +	 */
> > +	void (*vm_bo_free)(struct drm_gpuva_gem *vm_bo);
> > +
> >   	/**
> >   	 * @sm_step_map: called from &drm_gpuva_sm_map to finally insert the
> >   	 * mapping once all previous steps were completed
> > @@ -684,6 +966,17 @@ struct drm_gpuva_fn_ops {
> >   	 * used.
> >   	 */
> >   	int (*sm_step_unmap)(struct drm_gpuva_op *op, void *priv);
> > +
> > +	/**
> > +	 * @bo_validate: called from drm_gpuva_manager_validate()
> > +	 *
> > +	 * Drivers receive this callback for every evicted &drm_gem_object being
> > +	 * mapped in the corresponding &drm_gpuva_manager.
> > +	 *
> > +	 * Typically, drivers would call their driver specific variant of
> > +	 * ttm_bo_validate() from within this callback.
> > +	 */
> > +	int (*bo_validate)(struct drm_gem_object *obj);
> >   };
> >   int drm_gpuva_sm_map(struct drm_gpuva_manager *mgr, void *priv,
> > @@ -696,11 +989,18 @@ int drm_gpuva_sm_unmap(struct drm_gpuva_manager *mgr, void *priv,
> >   void drm_gpuva_map(struct drm_gpuva_manager *mgr,
> >   		   struct drm_gpuva *va,
> >   		   struct drm_gpuva_op_map *op);
> > +void drm_gpuva_map_get(struct drm_gpuva_manager *mgr,
> > +		       struct drm_gpuva *va,
> > +		       struct drm_gpuva_op_map *op);
> >   void drm_gpuva_remap(struct drm_gpuva *prev,
> >   		     struct drm_gpuva *next,
> >   		     struct drm_gpuva_op_remap *op);
> > +void drm_gpuva_remap_get(struct drm_gpuva *prev,
> > +			 struct drm_gpuva *next,
> > +			 struct drm_gpuva_op_remap *op);
> >   void drm_gpuva_unmap(struct drm_gpuva_op_unmap *op);
> > +void drm_gpuva_unmap_put(struct drm_gpuva_op_unmap *op);
> >   #endif /* __DRM_GPUVA_MGR_H__ */
> 


^ permalink raw reply	[flat|nested] 88+ messages in thread

* Re: [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
@ 2023-08-30 12:49       ` Danilo Krummrich
  0 siblings, 0 replies; 88+ messages in thread
From: Danilo Krummrich @ 2023-08-30 12:49 UTC (permalink / raw)
  To: Thomas Hellström (Intel)
  Cc: matthew.brost, thomas.hellstrom, sarah.walker, nouveau,
	dri-devel, linux-kernel, Liam.Howlett, boris.brezillon,
	donald.robson, christian.koenig, faith.ekstrand, bskeggs

Hi Thomas,

thanks for having a look!

On Wed, Aug 30, 2023 at 09:27:45AM +0200, Thomas Hellström (Intel) wrote:
> Hi, Danilo.
> 
> Some quick comments since I'm doing some Xe work in this area. Will probably
> get back with more.
> 
> On 8/20/23 23:53, Danilo Krummrich wrote:
> > So far the DRM GPUVA manager offers common infrastructure to track GPU VA
> > allocations and mappings, generically connect GPU VA mappings to their
> > backing buffers and perform more complex mapping operations on the GPU VA
> > space.
> > 
> > However, there are more design patterns commonly used by drivers, which
> > can potentially be generalized in order to make the DRM GPUVA manager
> > represent a basic GPU-VM implementation. In this context, this patch aims
> > at generalizing the following elements.
> > 
> > 1) Provide a common dma-resv for GEM objects not being used outside of
> >     this GPU-VM.
> > 
> > 2) Provide tracking of external GEM objects (GEM objects which are
> >     shared with other GPU-VMs).
> > 
> > 3) Provide functions to efficiently lock all GEM objects dma-resv the
> >     GPU-VM contains mappings of.
> > 
> > 4) Provide tracking of evicted GEM objects the GPU-VM contains mappings
> >     of, such that validation of evicted GEM objects is accelerated.
> > 
> > 5) Provide some convenience functions for common patterns.
> > 
> > Rather than being designed as a "framework", the target is to make all
> > features appear as a collection of optional helper functions, such that
> > drivers are free to make use of the DRM GPUVA managers basic
> > functionality and opt-in for other features without setting any feature
> > flags, just by making use of the corresponding functions.
> > 
> > Signed-off-by: Danilo Krummrich <dakr@redhat.com>
> > ---
> >   drivers/gpu/drm/drm_gpuva_mgr.c | 688 +++++++++++++++++++++++++++++++-
> >   include/drm/drm_gem.h           |  48 ++-
> >   include/drm/drm_gpuva_mgr.h     | 302 +++++++++++++-
> >   3 files changed, 1010 insertions(+), 28 deletions(-)
> > 
> > diff --git a/drivers/gpu/drm/drm_gpuva_mgr.c b/drivers/gpu/drm/drm_gpuva_mgr.c
> > index f86bfad74ff8..69872b205961 100644
> > --- a/drivers/gpu/drm/drm_gpuva_mgr.c
> > +++ b/drivers/gpu/drm/drm_gpuva_mgr.c
> > @@ -655,6 +655,7 @@ drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
> >   /**
> >    * drm_gpuva_manager_init() - initialize a &drm_gpuva_manager
> >    * @mgr: pointer to the &drm_gpuva_manager to initialize
> > + * @drm: the driver's &drm_device
> >    * @name: the name of the GPU VA space
> >    * @start_offset: the start offset of the GPU VA space
> >    * @range: the size of the GPU VA space
> > @@ -669,6 +670,7 @@ drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
> >    */
> >   void
> >   drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> > +		       struct drm_device *drm,
> >   		       const char *name,
> >   		       u64 start_offset, u64 range,
> >   		       u64 reserve_offset, u64 reserve_range,
> > @@ -677,6 +679,11 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> >   	mgr->rb.tree = RB_ROOT_CACHED;
> >   	INIT_LIST_HEAD(&mgr->rb.list);
> > +	mt_init(&mgr->mt_ext);
> > +
> > +	INIT_LIST_HEAD(&mgr->evict.list);
> > +	spin_lock_init(&mgr->evict.lock);
> > +
> >   	drm_gpuva_check_overflow(start_offset, range);
> >   	mgr->mm_start = start_offset;
> >   	mgr->mm_range = range;
> > @@ -694,6 +701,9 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> >   						     reserve_range)))
> >   			__drm_gpuva_insert(mgr, &mgr->kernel_alloc_node);
> >   	}
> > +
> > +	drm_gem_private_object_init(drm, &mgr->d_obj, 0);
> > +	mgr->resv = mgr->d_obj.resv;
> >   }
> >   EXPORT_SYMBOL_GPL(drm_gpuva_manager_init);
> > @@ -713,10 +723,575 @@ drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr)
> >   		__drm_gpuva_remove(&mgr->kernel_alloc_node);
> >   	WARN(!RB_EMPTY_ROOT(&mgr->rb.tree.rb_root),
> > -	     "GPUVA tree is not empty, potentially leaking memory.");
> > +	     "GPUVA tree is not empty, potentially leaking memory.\n");
> > +
> > +	mtree_destroy(&mgr->mt_ext);
> > +	WARN(!list_empty(&mgr->evict.list), "Evict list should be empty.\n");
> > +
> > +	drm_gem_private_object_fini(&mgr->d_obj);
> >   }
> >   EXPORT_SYMBOL_GPL(drm_gpuva_manager_destroy);
> > +/**
> > + * drm_gpuva_manager_prepare_objects() - prepare all associated BOs
> > + * @mgr: the &drm_gpuva_manager
> > + * @num_fences: the amount of &dma_fences to reserve
> > + *
> > + * Calls drm_exec_prepare_obj() for all &drm_gem_objects the given
> > + * &drm_gpuva_manager contains mappings of.
> > + *
> > + * Drivers can obtain the corresponding &drm_exec instance through
> > + * DRM_GPUVA_EXEC(). It is the driver's responsibility to call drm_exec_init()
> > + * and drm_exec_fini() accordingly.
> > + *
> > + * Returns: 0 on success, negative error code on failure.
> > + */
> > +int
> > +drm_gpuva_manager_prepare_objects(struct drm_gpuva_manager *mgr,
> > +				  unsigned int num_fences)
> > +{
> > +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
> > +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
> > +	union {
> > +		void *ptr;
> > +		uintptr_t cnt;
> > +	} ref;
> > +	int ret;
> > +
> > +	ret = drm_exec_prepare_obj(exec, &mgr->d_obj, num_fences);
> > +	if (ret)
> > +		goto out;
> > +
> > +	rcu_read_lock();
> In xe we're protecting the external object list with an outer lock, (same as
> protecting the mgr itself). Do we need a separate lock for this? In theory
> as  outlined in the VM_BIND locking document draft, one could probably even
> use the mgr resv for this, but with more complicated code I guess. Also see
> the comment below about the data structure chosen.

The idea is to protect this list with the GPU-VM lock. The locking here is more
of an implication of the maple tree. Either you use the internal lock of the
maple tree or RCU respectively, or you give the maple tree an external lock to
perform lockdep checks on (mt_set_external_lock()). Basically same as here:

https://elixir.bootlin.com/linux/latest/source/drivers/base/regmap/regcache-maple.c#L124
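
Just to illustrate the external lock option (rough sketch only; "vm->lock" stands
in for whatever outer GPU-VM lock a driver uses and index/entry are placeholders -
none of this is part of the series as posted):

	/* lockdep-only annotation, no-op without CONFIG_LOCKDEP */
	mt_init_flags(&mgr->mt_ext, MT_FLAGS_LOCK_EXTERN);
	mt_set_external_lock(&mgr->mt_ext, &vm->lock);

	/* modifications are then expected to run under that GPU-VM lock */
	MA_STATE(mas, &mgr->mt_ext, index, index);

	mutex_lock(&vm->lock);
	mas_store_gfp(&mas, entry, GFP_KERNEL);
	mutex_unlock(&vm->lock);

Functionally that'd be the same as the internal lock / RCU variant above, just
with lockdep checking against the GPU-VM lock instead.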

> > +	mas_for_each(&mas, ref.ptr, ULONG_MAX) {
> > +		struct drm_gem_object *obj;
> > +
> > +		mas_pause(&mas);
> > +		rcu_read_unlock();
> > +
> > +		obj = (struct drm_gem_object *)(uintptr_t)mas.index;
> > +		ret = drm_exec_prepare_obj(exec, obj, num_fences);
> > +		if (ret)
> > +			goto out;
> > +
> > +		rcu_read_lock();
> > +	}
> > +	rcu_read_unlock();
> > +
> > +out:
> > +	return ret;
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_manager_prepare_objects);
> > +
> > +/**
> > + * drm_gpuva_manager_lock_extra() - lock all dma-resv of all associated BOs
> > + * @mgr: the &drm_gpuva_manager
> > + * @fn: callback received by the driver to lock additional dma-resv
> > + * @priv: private driver data passed to @fn
> > + * @num_fences: the amount of &dma_fences to reserve
> > + * @interruptible: sleep interruptible if waiting
> > + *
> > + * Acquires all dma-resv locks of all &drm_gem_objects the given
> > + * &drm_gpuva_manager contains mappings of.
> > + *
> > + * Additionally, when calling this function the driver receives the given @fn
> > + * callback to lock additional dma-resv in the context of the
> > + * &drm_gpuva_managers &drm_exec instance. Typically, drivers would call
> > + * drm_exec_prepare_obj() from within this callback.
> > + *
> > + * Returns: 0 on success, negative error code on failure.
> > + */
> > +int
> > +drm_gpuva_manager_lock_extra(struct drm_gpuva_manager *mgr,
> > +			     int (*fn)(struct drm_gpuva_manager *mgr,
> > +				       void *priv, unsigned int num_fences),
> > +			     void *priv,
> > +			     unsigned int num_fences,
> > +			     bool interruptible)
> > +{
> > +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
> > +	uint32_t flags;
> > +	int ret;
> > +
> > +	flags = interruptible ? DRM_EXEC_INTERRUPTIBLE_WAIT : 0 |
> > +		DRM_EXEC_IGNORE_DUPLICATES;
> > +
> > +	drm_exec_init(exec, flags);
> > +
> > +	drm_exec_until_all_locked(exec) {
> > +		ret = drm_gpuva_manager_prepare_objects(mgr, num_fences);
> > +		drm_exec_retry_on_contention(exec);
> > +		if (ret)
> > +			goto err;
> > +
> > +		if (fn) {
> > +			ret = fn(mgr, priv, num_fences);
> > +			drm_exec_retry_on_contention(exec);
> > +			if (ret)
> > +				goto err;
> > +		}
> > +	}
> > +
> > +	return 0;
> > +
> > +err:
> > +	drm_exec_fini(exec);
> > +	return ret;
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_manager_lock_extra);
> > +
> > +static int
> > +fn_lock_array(struct drm_gpuva_manager *mgr, void *priv,
> > +				unsigned int num_fences)
> > +{
> > +	struct {
> > +		struct drm_gem_object **objs;
> > +		unsigned int num_objs;
> > +	} *args = priv;
> > +
> > +	return drm_exec_prepare_array(DRM_GPUVA_EXEC(mgr), args->objs,
> > +				      args->num_objs, num_fences);
> > +}
> > +
> > +/**
> > + * drm_gpuva_manager_lock_array() - lock all dma-resv of all associated BOs
> > + * @mgr: the &drm_gpuva_manager
> > + * @objs: additional &drm_gem_objects to lock
> > + * @num_objs: the number of additional &drm_gem_objects to lock
> > + * @num_fences: the amount of &dma_fences to reserve
> > + * @interruptible: sleep interruptible if waiting
> > + *
> > + * Acquires all dma-resv locks of all &drm_gem_objects the given
> > + * &drm_gpuva_manager contains mappings of, plus the ones given through @objs.
> > + *
> > + * Returns: 0 on success, negative error code on failure.
> > + */
> > +int
> > +drm_gpuva_manager_lock_array(struct drm_gpuva_manager *mgr,
> > +			     struct drm_gem_object **objs,
> > +			     unsigned int num_objs,
> > +			     unsigned int num_fences,
> > +			     bool interruptible)
> > +{
> > +	struct {
> > +		struct drm_gem_object **objs;
> > +		unsigned int num_objs;
> > +	} args;
> > +
> > +	args.objs = objs;
> > +	args.num_objs = num_objs;
> > +
> > +	return drm_gpuva_manager_lock_extra(mgr, fn_lock_array, &args,
> > +					    num_fences, interruptible);
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_manager_lock_array);
> > +
> > +/**
> > + * drm_gpuva_manager_validate() - validate all BOs marked as evicted
> > + * @mgr: the &drm_gpuva_manager to validate evicted BOs
> > + *
> > + * Calls the &drm_gpuva_fn_ops.bo_validate callback for all evicted buffer
> > + * objects being mapped in the given &drm_gpuva_manager.
> > + *
> > + * Returns: 0 on success, negative error code on failure.
> > + */
> > +int
> > +drm_gpuva_manager_validate(struct drm_gpuva_manager *mgr)
> > +{
> > +	const struct drm_gpuva_fn_ops *ops = mgr->ops;
> > +	struct drm_gpuva_gem *vm_bo;
> > +	int ret;
> > +
> > +	if (unlikely(!ops || !ops->bo_validate))
> > +		return -ENOTSUPP;
> > +
> > +	/* At this point we should hold all dma-resv locks of all GEM objects
> > +	 * associated with this GPU-VM, hence it is safe to walk the list.
> > +	 */
> > +	list_for_each_entry(vm_bo, &mgr->evict.list, list.entry.evict) {
> > +		dma_resv_assert_held(vm_bo->obj->resv);
> > +
> > +		ret = ops->bo_validate(vm_bo->obj);
> > +		if (ret)
> > +			return ret;
> > +	}
> > +
> > +	return 0;
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_manager_validate);
> > +
> > +/**
> > + * drm_gpuva_manager_resv_add_fence - add fence to private and all extobj
> > + * dma-resv
> > + * @mgr: the &drm_gpuva_manager to add a fence to
> > + * @fence: fence to add
> > + * @private_usage: private dma-resv usage
> > + * @extobj_usage: extobj dma-resv usage
> > + */
> > +void
> > +drm_gpuva_manager_resv_add_fence(struct drm_gpuva_manager *mgr,
> > +				 struct dma_fence *fence,
> > +				 enum dma_resv_usage private_usage,
> > +				 enum dma_resv_usage extobj_usage)
> > +{
> > +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
> > +	struct drm_gem_object *obj;
> > +	unsigned long index;
> > +
> > +	drm_exec_for_each_locked_object(exec, index, obj) {
> > +			dma_resv_assert_held(obj->resv);
> > +			dma_resv_add_fence(obj->resv, fence,
> > +					   drm_gpuva_is_extobj(mgr, obj) ?
> > +					   extobj_usage : private_usage);
> > +	}
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_manager_resv_add_fence);
> > +
> > +static struct drm_gpuva_gem *
> > +__drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
> > +		     struct drm_gem_object *obj)
> > +{
> > +	struct drm_gpuva_gem *vm_bo;
> > +
> > +	drm_gem_gpuva_assert_lock_held(obj);
> > +
> > +	drm_gem_for_each_gpuva_gem(vm_bo, obj)
> > +		if (vm_bo->mgr == mgr)
> > +			return vm_bo;
> > +
> > +	return NULL;
> > +}
> > +
> > +/**
> > + * drm_gpuva_gem_create() - create a new instance of struct drm_gpuva_gem
> > + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> > + * @obj: The &drm_gem_object being mapped in the @mgr.
> > + *
> > + * If provided by the driver, this function uses the &drm_gpuva_fn_ops
> > + * vm_bo_alloc() callback to allocate.
> > + *
> > + * Returns: a pointer to the &drm_gpuva_gem on success, NULL on failure
> > + */
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_create(struct drm_gpuva_manager *mgr,
> > +		     struct drm_gem_object *obj)
> > +{
> > +	const struct drm_gpuva_fn_ops *ops = mgr->ops;
> > +	struct drm_gpuva_gem *vm_bo;
> > +
> > +	if (ops && ops->vm_bo_alloc)
> > +		vm_bo = ops->vm_bo_alloc();
> > +	else
> > +		vm_bo = kzalloc(sizeof(*vm_bo), GFP_KERNEL);
> > +
> > +	if (unlikely(!vm_bo))
> > +		return NULL;
> > +
> > +	vm_bo->mgr = mgr;
> > +	vm_bo->obj = obj;
> > +
> > +	kref_init(&vm_bo->kref);
> > +	INIT_LIST_HEAD(&vm_bo->list.gpuva);
> > +	INIT_LIST_HEAD(&vm_bo->list.entry.gem);
> > +	INIT_LIST_HEAD(&vm_bo->list.entry.evict);
> > +
> > +	drm_gem_object_get(obj);
> > +
> > +	return vm_bo;
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_create);
> > +
> > +void
> > +drm_gpuva_gem_destroy(struct kref *kref)
> > +{
> > +	struct drm_gpuva_gem *vm_bo = container_of(kref, struct drm_gpuva_gem,
> > +						   kref);
> > +	const struct drm_gpuva_fn_ops *ops = vm_bo->mgr->ops;
> > +
> > +	drm_gem_object_put(vm_bo->obj);
> > +
> > +	if (ops && ops->vm_bo_free)
> > +		ops->vm_bo_free(vm_bo);
> > +	else
> > +		kfree(vm_bo);
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_destroy);
> > +
> > +/**
> > + * drm_gpuva_gem_find() - find the &drm_gpuva_gem for the given
> > + * &drm_gpuva_manager and &drm_gem_object
> > + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> > + * @obj: The &drm_gem_object being mapped in the @mgr.
> > + *
> > + * Find the &drm_gpuva_gem representing the combination of the given
> > + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
> > + * count of the &drm_gpuva_gem accordingly.
> > + *
> > + * Returns: a pointer to the &drm_gpuva_gem on success, NULL on failure
> > + */
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
> > +		   struct drm_gem_object *obj)
> > +{
> > +	struct drm_gpuva_gem *vm_bo = __drm_gpuva_gem_find(mgr, obj);
> > +
> > +	return vm_bo ? drm_gpuva_gem_get(vm_bo) : NULL;
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_find);
> > +
> > +/**
> > + * drm_gpuva_gem_obtain() - obtains an instance of the &drm_gpuva_gem for the
> > + * given &drm_gpuva_manager and &drm_gem_object
> > + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> > + * @obj: The &drm_gem_object being mapped in the @mgr.
> > + *
> > + * Find the &drm_gpuva_gem representing the combination of the given
> > + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
> > + * count of the &drm_gpuva_gem accordingly. If not found, allocates a new
> > + * &drm_gpuva_gem.
> > + *
> > + * Returns: a pointer to the &drm_gpuva_gem on success, an ERR_PTR on failure
> > + */
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_obtain(struct drm_gpuva_manager *mgr,
> > +		     struct drm_gem_object *obj)
> > +{
> > +	struct drm_gpuva_gem *vm_bo;
> > +
> > +	vm_bo = drm_gpuva_gem_find(mgr, obj);
> > +	if (vm_bo)
> > +		return vm_bo;
> > +
> > +	vm_bo = drm_gpuva_gem_create(mgr, obj);
> > +	if (!vm_bo)
> > +		return ERR_PTR(-ENOMEM);
> > +
> > +	return vm_bo;
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_obtain);
> > +
> > +/**
> > + * drm_gpuva_gem_obtain_prealloc() - obtains an instance of the &drm_gpuva_gem
> > + * for the given &drm_gpuva_manager and &drm_gem_object
> > + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> > + * @obj: The &drm_gem_object being mapped in the @mgr.
> > + *
> > + * Find the &drm_gpuva_gem representing the combination of the given
> > + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
> > + * count of the found &drm_gpuva_gem accordingly, while the @__vm_bo reference
> > + * count is decreased. If not found @__vm_bo is returned.
> > + *
> > + * Returns: a pointer to the found &drm_gpuva_gem or @__vm_bo if no existing
> > + * &drm_gpuva_gem was found
> > + */
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_obtain_prealloc(struct drm_gpuva_manager *mgr,
> > +			      struct drm_gem_object *obj,
> > +			      struct drm_gpuva_gem *__vm_bo)
> > +{
> > +	struct drm_gpuva_gem *vm_bo;
> > +
> > +	vm_bo = drm_gpuva_gem_find(mgr, obj);
> > +	if (vm_bo) {
> > +		drm_gpuva_gem_put(__vm_bo);
> > +		return vm_bo;
> > +	}
> > +
> > +	return __vm_bo;
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_obtain_prealloc);
> > +
> > +static int
> > +__drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
> > +			  struct drm_gem_object *obj,
> > +			  gfp_t gfp)
> > +{
> > +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
> > +	union {
> > +		struct drm_gem_object *obj;
> > +		uintptr_t index;
> > +	} gem;
> > +	union {
> > +		void *ptr;
> > +		uintptr_t cnt;
> > +	} ref;
> > +	int ret = 0;
> > +
> > +	gem.obj = obj;
> > +	mas_set(&mas, gem.index);
> > +
> > +	mas_lock(&mas);
> > +	ref.ptr = mas_walk(&mas);
> > +	if (ref.ptr) {
> > +		++ref.cnt;
> > +		mas_store(&mas, ref.ptr);
> > +	} else {
> > +		if (unlikely(!gfp)) {
> > +			ret = -EINVAL;
> > +			goto out;
> > +		}
> > +
> > +		mas_set(&mas, gem.index);
> > +		ref.cnt = 1;
> > +		ret = mas_store_gfp(&mas, ref.ptr, gfp);
> > +		if (likely(!ret))
> > +			drm_gem_object_get(obj);
> > +	}
> > +out:
> > +	mas_unlock(&mas);
> > +	return ret;
> > +}
> > +
> > +static void
> > +__drm_gpuva_extobj_remove(struct drm_gpuva_manager *mgr,
> > +			  struct drm_gem_object *obj)
> > +{
> > +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
> > +	union {
> > +		struct drm_gem_object *obj;
> > +		uintptr_t index;
> > +	} gem;
> > +	union {
> > +		void *ptr;
> > +		uintptr_t cnt;
> > +	} ref;
> > +
> > +	gem.obj = obj;
> > +	mas_set(&mas, gem.index);
> > +
> > +	mas_lock(&mas);
> > +	if (unlikely(!(ref.ptr = mas_walk(&mas))))
> > +		goto out;
> > +
> > +	if (!--ref.cnt) {
> > +		mas_erase(&mas);
> > +		drm_gem_object_put(obj);
> > +	} else {
> > +		mas_store(&mas, ref.ptr);
> > +	}
> > +out:
> > +	mas_unlock(&mas);
> > +}
> > +
> > +/**
> > + * drm_gpuva_extobj_insert - insert an external &drm_gem_object
> > + * @mgr: the &drm_gpuva_manager to insert into
> > + * @obj: the &drm_gem_object to insert as extobj
> > + *
> > + * Insert a &drm_gem_object into the &drm_gpuva_managers external object tree.
> > + * If the &drm_gem_object already exists in the tree, the reference counter
> > + * of this external object is increased by one.
> > + *
> > + * Drivers should insert the external &drm_gem_object before the dma-fence
> > + * signalling critical section, e.g. when submitting the job, and before
> > + * locking all &drm_gem_objects of a GPU-VM, e.g. with drm_gpuva_manager_lock()
> > + * or its variants.
> > + *
> > + * Returns: 0 on success, negative error code on failure.
> > + */
> > +int
> > +drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
> > +			struct drm_gem_object *obj)
> > +{
> > +	return drm_gpuva_is_extobj(mgr, obj) ?
> > +		__drm_gpuva_extobj_insert(mgr, obj, GFP_KERNEL) : 0;
> > +
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_insert);
> > +
> > +/**
> > + * drm_gpuva_extobj_get - increase the reference count of an external
> > + * &drm_gem_object
> > + * @mgr: the &drm_gpuva_manager storing the extobj
> > + * @obj: the &drm_gem_object representing the extobj
> > + *
> > + * Increases the reference count of the extobj represented by @obj.
> > + *
> > + * Drivers should call this for every &drm_gpuva backed by a &drm_gem_object
> > + * being inserted.
> > + *
> > + * For &drm_gpuva_op_remap operations drivers should make sure to only take an
> > + * additional reference if the re-map operation splits an existing &drm_gpuva
> > + * into two separate ones.
> > + *
> > + * See also drm_gpuva_map_get() and drm_gpuva_remap_get().
> > + */
> > +void
> > +drm_gpuva_extobj_get(struct drm_gpuva_manager *mgr,
> > +		     struct drm_gem_object *obj)
> > +{
> > +	if (drm_gpuva_is_extobj(mgr, obj))
> > +		WARN(__drm_gpuva_extobj_insert(mgr, obj, 0),
> > +		     "Can't increase ref-count of non-existent extobj.");
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_get);
> > +
> > +/**
> > + * drm_gpuva_extobj_put - decrease the reference count of an external
> > + * &drm_gem_object
> > + * @mgr: the &drm_gpuva_manager storing the extobj
> > + * @obj: the &drm_gem_object representing the extobj
> > + *
> > + * Decreases the reference count of the extobj represented by @obj.
> > + *
> > + * Drivers should call this for every &drm_gpuva backed by a &drm_gem_object
> > + * being removed from the GPU VA space.
> > + *
> > + * See also drm_gpuva_unmap_put().
> > + */
> > +void
> > +drm_gpuva_extobj_put(struct drm_gpuva_manager *mgr,
> > +		     struct drm_gem_object *obj)
> > +{
> > +	if (drm_gpuva_is_extobj(mgr, obj))
> > +		__drm_gpuva_extobj_remove(mgr, obj);
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_put);
> > +
> > +/**
> > + * drm_gpuva_gem_evict() - add / remove a &drm_gem_object to / from a
> > + * &drm_gpuva_managers evicted list
> > + * @obj: the &drm_gem_object to add or remove
> > + * @evict: indicates whether the object is evicted
> > + *
> > + * Adds a &drm_gem_object to or removes it from all &drm_gpuva_managers evicted
> > + * list containing a mapping of this &drm_gem_object.
> > + */
> > +void
> > +drm_gpuva_gem_evict(struct drm_gem_object *obj, bool evict)
> > +{
> > +	struct drm_gpuva_gem *vm_bo;
> > +
> > +	/* Required for iterating the GEMs GPUVA GEM list. If no driver specific
> > +	 * lock has been set, the list is protected with the GEMs dma-resv lock.
> > +	 */
> > +	drm_gem_gpuva_assert_lock_held(obj);
> > +
> > +	/* Required to protect the GPUVA managers evict list against concurrent
> > +	 * access through drm_gpuva_manager_validate(). Concurrent insertions to
> > +	 * the evict list through different GEM object evictions are protected
> > +	 * by the GPUVA managers evict lock.
> > +	 */
> > +	dma_resv_assert_held(obj->resv);
> > +
> > +	drm_gem_for_each_gpuva_gem(vm_bo, obj) {
> > +		struct drm_gpuva_manager *mgr = vm_bo->mgr;
> > +
> > +		spin_lock(&mgr->evict.lock);
> > +		if (evict)
> > +			list_add_tail(&vm_bo->list.entry.evict,
> > +				      &mgr->evict.list);
> > +		else
> > +			list_del_init(&vm_bo->list.entry.evict);
> > +		spin_unlock(&mgr->evict.lock);
> > +	}
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_evict);
> > +
> >   static int
> >   __drm_gpuva_insert(struct drm_gpuva_manager *mgr,
> >   		   struct drm_gpuva *va)
> > @@ -806,15 +1381,20 @@ EXPORT_SYMBOL_GPL(drm_gpuva_remove);
> >   /**
> >    * drm_gpuva_link() - link a &drm_gpuva
> >    * @va: the &drm_gpuva to link
> > + * @vm_bo: the &drm_gpuva_gem to add the &drm_gpuva to
> >    *
> > - * This adds the given &va to the GPU VA list of the &drm_gem_object it is
> > - * associated with.
> > + * This adds the given &va to the GPU VA list of the &drm_gpuva_gem and the
> > + * &drm_gpuva_gem to the &drm_gem_object it is associated with.
> > + *
> > + * For every &drm_gpuva entry added to the &drm_gpuva_gem an additional
> > + * reference of the latter is taken.
> >    *
> >    * This function expects the caller to protect the GEM's GPUVA list against
> > - * concurrent access using the GEMs dma_resv lock.
> > + * concurrent access using either the GEMs dma_resv lock or a driver specific
> > + * lock set through drm_gem_gpuva_set_lock().
> >    */
> >   void
> > -drm_gpuva_link(struct drm_gpuva *va)
> > +drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo)
> >   {
> >   	struct drm_gem_object *obj = va->gem.obj;
> > @@ -823,7 +1403,10 @@ drm_gpuva_link(struct drm_gpuva *va)
> >   	drm_gem_gpuva_assert_lock_held(obj);
> > -	list_add_tail(&va->gem.entry, &obj->gpuva.list);
> > +	drm_gpuva_gem_get(vm_bo);
> > +	list_add_tail(&va->gem.entry, &vm_bo->list.gpuva);
> > +	if (list_empty(&vm_bo->list.entry.gem))
> > +		list_add_tail(&vm_bo->list.entry.gem, &obj->gpuva.list);
> >   }
> >   EXPORT_SYMBOL_GPL(drm_gpuva_link);
> > @@ -834,20 +1417,39 @@ EXPORT_SYMBOL_GPL(drm_gpuva_link);
> >    * This removes the given &va from the GPU VA list of the &drm_gem_object it is
> >    * associated with.
> >    *
> > + * This removes the given &va from the GPU VA list of the &drm_gpuva_gem and
> > + * the &drm_gpuva_gem from the &drm_gem_object it is associated with in case
> > + * this call unlinks the last &drm_gpuva from the &drm_gpuva_gem.
> > + *
> > + * For every &drm_gpuva entry removed from the &drm_gpuva_gem a reference of
> > + * the latter is dropped.
> > + *
> >    * This function expects the caller to protect the GEM's GPUVA list against
> > - * concurrent access using the GEMs dma_resv lock.
> > + * concurrent access using either the GEMs dma_resv lock or a driver specific
> > + * lock set through drm_gem_gpuva_set_lock().
> >    */
> >   void
> >   drm_gpuva_unlink(struct drm_gpuva *va)
> >   {
> >   	struct drm_gem_object *obj = va->gem.obj;
> > +	struct drm_gpuva_gem *vm_bo;
> >   	if (unlikely(!obj))
> >   		return;
> >   	drm_gem_gpuva_assert_lock_held(obj);
> > +	vm_bo = __drm_gpuva_gem_find(va->mgr, obj);
> > +	if (WARN(!vm_bo, "GPUVA doesn't seem to be linked.\n"))
> > +		return;
> > +
> >   	list_del_init(&va->gem.entry);
> > +
> > +	if (list_empty(&vm_bo->list.gpuva)) {
> > +		list_del_init(&vm_bo->list.entry.gem);
> > +		list_del_init(&vm_bo->list.entry.evict);
> > +	}
> > +	drm_gpuva_gem_put(vm_bo);
> >   }
> >   EXPORT_SYMBOL_GPL(drm_gpuva_unlink);
> > @@ -977,6 +1579,26 @@ drm_gpuva_map(struct drm_gpuva_manager *mgr,
> >   }
> >   EXPORT_SYMBOL_GPL(drm_gpuva_map);
> > +/**
> > + * drm_gpuva_map_get() - helper to insert a &drm_gpuva according to a
> > + * &drm_gpuva_op_map
> > + * @mgr: the &drm_gpuva_manager
> > + * @va: the &drm_gpuva to insert
> > + * @op: the &drm_gpuva_op_map to initialize @va with
> > + *
> > + * Initializes the @va from the @op and inserts it into the given @mgr and
> > + * increases the reference count of the corresponding extobj.
> > + */
> > +void
> > +drm_gpuva_map_get(struct drm_gpuva_manager *mgr,
> > +		  struct drm_gpuva *va,
> > +		  struct drm_gpuva_op_map *op)
> > +{
> > +	drm_gpuva_map(mgr, va, op);
> > +	drm_gpuva_extobj_get(mgr, va->gem.obj);
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_map_get);
> > +
> >   /**
> >    * drm_gpuva_remap() - helper to remap a &drm_gpuva according to a
> >    * &drm_gpuva_op_remap
> > @@ -992,10 +1614,10 @@ drm_gpuva_remap(struct drm_gpuva *prev,
> >   		struct drm_gpuva *next,
> >   		struct drm_gpuva_op_remap *op)
> >   {
> > -	struct drm_gpuva *curr = op->unmap->va;
> > -	struct drm_gpuva_manager *mgr = curr->mgr;
> > +	struct drm_gpuva *va = op->unmap->va;
> > +	struct drm_gpuva_manager *mgr = va->mgr;
> > -	drm_gpuva_remove(curr);
> > +	drm_gpuva_remove(va);
> >   	if (op->prev) {
> >   		drm_gpuva_init_from_op(prev, op->prev);
> > @@ -1009,6 +1631,31 @@ drm_gpuva_remap(struct drm_gpuva *prev,
> >   }
> >   EXPORT_SYMBOL_GPL(drm_gpuva_remap);
> > +/**
> > + * drm_gpuva_remap_get() - helper to remap a &drm_gpuva according to a
> > + * &drm_gpuva_op_remap
> > + * @prev: the &drm_gpuva to remap when keeping the start of a mapping
> > + * @next: the &drm_gpuva to remap when keeping the end of a mapping
> > + * @op: the &drm_gpuva_op_remap to initialize @prev and @next with
> > + *
> > + * Removes the currently mapped &drm_gpuva and remaps it using @prev and/or
> > + * @next. Additionally, if the re-map splits the existing &drm_gpuva into two
> > + * separate mappings, increases the reference count of the corresponding extobj.
> > + */
> > +void
> > +drm_gpuva_remap_get(struct drm_gpuva *prev,
> > +		    struct drm_gpuva *next,
> > +		    struct drm_gpuva_op_remap *op)
> > +{
> > +	struct drm_gpuva *va = op->unmap->va;
> > +	struct drm_gpuva_manager *mgr = va->mgr;
> > +
> > +	drm_gpuva_remap(prev, next, op);
> > +	if (op->prev && op->next)
> > +		drm_gpuva_extobj_get(mgr, va->gem.obj);
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_remap_get);
> > +
> >   /**
> >    * drm_gpuva_unmap() - helper to remove a &drm_gpuva according to a
> >    * &drm_gpuva_op_unmap
> > @@ -1023,6 +1670,24 @@ drm_gpuva_unmap(struct drm_gpuva_op_unmap *op)
> >   }
> >   EXPORT_SYMBOL_GPL(drm_gpuva_unmap);
> > +/**
> > + * drm_gpuva_unmap_put() - helper to remove a &drm_gpuva according to a
> > + * &drm_gpuva_op_unmap
> > + * @op: the &drm_gpuva_op_unmap specifying the &drm_gpuva to remove
> > + *
> > + * Removes the &drm_gpuva associated with the &drm_gpuva_op_unmap and decreases
> > + * the reference count of the corresponding extobj.
> > + */
> > +void
> > +drm_gpuva_unmap_put(struct drm_gpuva_op_unmap *op)
> > +{
> > +	struct drm_gpuva *va = op->va;
> > +
> > +	drm_gpuva_unmap(op);
> > +	drm_gpuva_extobj_put(va->mgr, va->gem.obj);
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_unmap_put);
> > +
> >   static int
> >   op_map_cb(const struct drm_gpuva_fn_ops *fn, void *priv,
> >   	  u64 addr, u64 range,
> > @@ -1663,6 +2328,7 @@ drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
> >   {
> >   	struct drm_gpuva_ops *ops;
> >   	struct drm_gpuva_op *op;
> > +	struct drm_gpuva_gem *vm_bo;
> >   	struct drm_gpuva *va;
> >   	int ret;
> > @@ -1674,7 +2340,7 @@ drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
> >   	INIT_LIST_HEAD(&ops->list);
> > -	drm_gem_for_each_gpuva(va, obj) {
> > +	drm_gem_for_each_gpuva(va, vm_bo, mgr, obj) {
> >   		op = gpuva_op_alloc(mgr);
> >   		if (!op) {
> >   			ret = -ENOMEM;
> > diff --git a/include/drm/drm_gem.h b/include/drm/drm_gem.h
> > index bc9f6aa2f3fe..783ed3ab440d 100644
> > --- a/include/drm/drm_gem.h
> > +++ b/include/drm/drm_gem.h
> > @@ -571,7 +571,7 @@ int drm_gem_evict(struct drm_gem_object *obj);
> >    * drm_gem_gpuva_init() - initialize the gpuva list of a GEM object
> >    * @obj: the &drm_gem_object
> >    *
> > - * This initializes the &drm_gem_object's &drm_gpuva list.
> > + * This initializes the &drm_gem_object's &drm_gpuva_gem list.
> >    *
> >    * Calling this function is only necessary for drivers intending to support the
> >    * &drm_driver_feature DRIVER_GEM_GPUVA.
> > @@ -584,28 +584,44 @@ static inline void drm_gem_gpuva_init(struct drm_gem_object *obj)
> >   }
> >   /**
> > - * drm_gem_for_each_gpuva() - iternator to walk over a list of gpuvas
> > - * @entry__: &drm_gpuva structure to assign to in each iteration step
> > - * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
> > + * drm_gem_for_each_gpuva_gem() - iterator to walk over a list of &drm_gpuva_gem
> > + * @entry__: &drm_gpuva_gem structure to assign to in each iteration step
> > + * @obj__: the &drm_gem_object the &drm_gpuva_gem to walk are associated with
> >    *
> > - * This iterator walks over all &drm_gpuva structures associated with the
> > - * &drm_gpuva_manager.
> > + * This iterator walks over all &drm_gpuva_gem structures associated with the
> > + * &drm_gem_object.
> >    */
> > -#define drm_gem_for_each_gpuva(entry__, obj__) \
> > -	list_for_each_entry(entry__, &(obj__)->gpuva.list, gem.entry)
> > +#define drm_gem_for_each_gpuva_gem(entry__, obj__) \
> > +	list_for_each_entry(entry__, &(obj__)->gpuva.list, list.entry.gem)
> >   /**
> > - * drm_gem_for_each_gpuva_safe() - iternator to safely walk over a list of
> > - * gpuvas
> > - * @entry__: &drm_gpuva structure to assign to in each iteration step
> > - * @next__: &next &drm_gpuva to store the next step
> > - * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
> > + * drm_gem_for_each_gpuva_gem_safe() - iterator to safely walk over a list of
> > + * &drm_gpuva_gem
> > + * @entry__: &drm_gpuva_gem structure to assign to in each iteration step
> > + * @next__: &next &drm_gpuva_gem to store the next step
> > + * @obj__: the &drm_gem_object the &drm_gpuva_gem to walk are associated with
> >    *
> > - * This iterator walks over all &drm_gpuva structures associated with the
> > + * This iterator walks over all &drm_gpuva_gem structures associated with the
> >    * &drm_gem_object. It is implemented with list_for_each_entry_safe(), hence
> >    * it is save against removal of elements.
> >    */
> > -#define drm_gem_for_each_gpuva_safe(entry__, next__, obj__) \
> > -	list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list, gem.entry)
> > +#define drm_gem_for_each_gpuva_gem_safe(entry__, next__, obj__) \
> > +	list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list, list.entry.gem)
> > +
> > +/**
> > + * drm_gem_for_each_gpuva() - iterator to walk over a list of &drm_gpuva
> > + * @va__: &drm_gpuva structure to assign to in each iteration step
> > + * @vm_bo__: the &drm_gpuva_gem representing the @mgr__ and @obj__ combination
> > + * @mgr__: the &drm_gpuva_manager the &drm_gpuvas to walk are associated with
> > + * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
> > + *
> > + * This iterator walks over all &drm_gpuva structures associated with the
> > + * &drm_gpuva_manager and &drm_gem_object.
> > + */
> > +#define drm_gem_for_each_gpuva(va__, vm_bo__, mgr__, obj__) \
> > +	for (vm_bo__ = drm_gpuva_gem_find(mgr__, obj__), \
> > +	     va__ = vm_bo__ ? list_first_entry(&vm_bo__->list.gpuva, typeof(*va__), gem.entry) : NULL; \
> > +	     va__ && !list_entry_is_head(va__, &vm_bo__->list.gpuva, gem.entry); \
> > +	     va__ = list_next_entry(va__, gem.entry))
> >   #endif /* __DRM_GEM_H__ */
> > diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h
> > index ed8d50200cc3..693e2da3f425 100644
> > --- a/include/drm/drm_gpuva_mgr.h
> > +++ b/include/drm/drm_gpuva_mgr.h
> > @@ -26,12 +26,16 @@
> >    */
> >   #include <linux/list.h>
> > +#include <linux/dma-resv.h>
> > +#include <linux/maple_tree.h>
> >   #include <linux/rbtree.h>
> >   #include <linux/types.h>
> >   #include <drm/drm_gem.h>
> > +#include <drm/drm_exec.h>
> >   struct drm_gpuva_manager;
> > +struct drm_gpuva_gem;
> >   struct drm_gpuva_fn_ops;
> >   /**
> > @@ -140,7 +144,7 @@ struct drm_gpuva {
> >   int drm_gpuva_insert(struct drm_gpuva_manager *mgr, struct drm_gpuva *va);
> >   void drm_gpuva_remove(struct drm_gpuva *va);
> > -void drm_gpuva_link(struct drm_gpuva *va);
> > +void drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo);
> >   void drm_gpuva_unlink(struct drm_gpuva *va);
> >   struct drm_gpuva *drm_gpuva_find(struct drm_gpuva_manager *mgr,
> > @@ -240,15 +244,137 @@ struct drm_gpuva_manager {
> >   	 * @ops: &drm_gpuva_fn_ops providing the split/merge steps to drivers
> >   	 */
> >   	const struct drm_gpuva_fn_ops *ops;
> > +
> > +	/**
> > +	 * @d_obj: Dummy GEM object; used internally to pass the GPU VMs
> > +	 * dma-resv to &drm_exec.
> > +	 */
> > +	struct drm_gem_object d_obj;
> > +
> > +	/**
> > +	 * @resv: the &dma_resv for &drm_gem_objects mapped in this GPU VA
> > +	 * space
> > +	 */
> > +	struct dma_resv *resv;
> > +
> > +	/**
> > +	 * @exec: the &drm_exec helper to lock external &drm_gem_objects
> > +	 */
> > +	struct drm_exec exec;
> > +
> > +	/**
> > +	 * @mt_ext: &maple_tree storing external &drm_gem_objects
> > +	 */
> > +	struct maple_tree mt_ext;
> 
> Why are you using a maple tree here? Insertion and removal is O(log(n))
> instead of O(1) for a list?
>

Having a list of drm_gem_objects directly wouldn't work, as multiple GPU-VMs
could have mappings of the same extobj.

I considered using the VM_BO abstraction (struct drm_gpuva_gem) as list entry
instead, which also seems to be the obvious choice. However, there is a locking
conflict.

A drm_gem_object keeps a list of drm_gpuva_gems, while each drm_gpuva_gem keeps
a list of drm_gpuvas. Both lists are either protected with the dma-resv lock of
the corresponding drm_gem_object, or with an external lock provided by the
driver (see drm_gem_gpuva_set_lock()). The latter is used by drivers performing
changes on the GPUVA space directly from the fence signalling path.
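
For the dma-resv protected case, the link step on the driver side would look
roughly like this (sketch, error handling omitted):

	dma_resv_lock(obj->resv, NULL);

	vm_bo = drm_gpuva_gem_obtain(mgr, obj);
	if (!IS_ERR(vm_bo))
		drm_gpuva_link(va, vm_bo);

	dma_resv_unlock(obj->resv);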

Now, similar to what drm_gpuva_link() and drm_gpuva_unlink() are doing already,
we'd want to add a drm_gpuva_gem to the extobj list for the first mapping being
linked and we'd want to remove it for the last one being unlinked.

(Actually we'd want to add the drm_gpuva_gem object to the extobj list even
before, because otherwise we wouldn't acquire the dma-resv lock of this GEM object
through drm_gpuva_manager_lock(). But that's trivial, we could do that when we
create the drm_gpuva_gem, which we need to do anyway.)

Anyway, we'd probably want to keep removing the drm_gpuva_gem from the extobj
list from drm_gpuva_unlink() when the last mapping of this BO is unlinked. In
order to do so, we'd (as discussed above) either need to hold the outer GPU-VM
lock or the GPU-VMs dma-resv lock. Both would be illegal in the case
drm_gpuva_unlink() is called from within the fence signalling path. For drivers
like XE or Nouveau, we'd at least need to make sure to not mess up the locking
hierarchy of GPU-VM lock and dma-resv lock of the corresponding BO.

Considering all that, I thought it's probably better to track extobjs separate
from the drm_gpuva_gem, hence the maple tree choice.
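
For reference, with the extobj insertion happening before taking any locks (as
mentioned above), the driver-side ordering would roughly end up as (sketch only,
names are placeholders, error handling mostly omitted):

	/* outside the dma-fence signalling critical section, e.g. at submit time */
	ret = drm_gpuva_extobj_insert(mgr, obj); /* no-op for non-extobjs */
	if (ret)
		return ret;

	ret = drm_gpuva_manager_lock(mgr, num_fences, true);
	if (ret)
		return ret;

	ret = drm_gpuva_manager_validate(mgr); /* re-validate evicted BOs */
	if (ret)
		goto err_unlock;

	/* run the bind job / update page tables, then publish the fence */
	drm_gpuva_manager_resv_add_fence(mgr, fence, priv_usage, extobj_usage);

	drm_gpuva_manager_unlock(mgr);
	return 0;

err_unlock:
	drm_gpuva_manager_unlock(mgr);
	return ret;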

> > +
> > +	/**
> > +	 * @evict: structure holding the evict list and evict list lock
> > +	 */
> > +	struct {
> > +		/**
> > +		 * @list: &list_head storing &drm_gem_objects currently being
> > +		 * evicted
> > +		 */
> > +		struct list_head list;
> > +
> > +		/**
> > +		 * @lock: spinlock to protect the evict list against concurrent
> > +		 * insertion / removal of different &drm_gpuva_gems
> > +		 */
> > +		spinlock_t lock;
> > +	} evict;
> >   };
> >   void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> > +			    struct drm_device *drm,
> >   			    const char *name,
> >   			    u64 start_offset, u64 range,
> >   			    u64 reserve_offset, u64 reserve_range,
> >   			    const struct drm_gpuva_fn_ops *ops);
> >   void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr);
> > +/**
> > + * DRM_GPUVA_EXEC - returns the &drm_gpuva_managers &drm_exec instance
> > + * @mgr: the &drm_gpuva_managers to return the &drm_exec instance for
> > + */
> > +#define DRM_GPUVA_EXEC(mgr)	&(mgr)->exec
> 
> A struct ww_acquire_ctx and thus a drm_exec is fundamentally per task and
> should typically be allocated on the stack. Otherwise you'd need to protect
> the mgr->exec member with an exclusive lock throughout the locking process,
> and that's not what we want.

Oh, good point. I think it works in Nouveau, because there it's implicitly
protected with the job submission lock.
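
For a stack-allocated, per-task drm_exec the flow would presumably end up looking
something like this (sketch only, mirroring what lock_extra() currently does):

	struct drm_exec exec;
	int ret;

	drm_exec_init(&exec, DRM_EXEC_INTERRUPTIBLE_WAIT |
			     DRM_EXEC_IGNORE_DUPLICATES);

	drm_exec_until_all_locked(&exec) {
		ret = drm_exec_prepare_obj(&exec, &mgr->d_obj, num_fences);
		drm_exec_retry_on_contention(&exec);
		if (ret)
			goto err;

		/* ... prepare extobjs and additional driver objects ... */
	}

	/* ... */

	drm_exec_fini(&exec);
	return 0;

err:
	drm_exec_fini(&exec);
	return ret;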

> 
> Did you consider subclassing a drm_exec for drm_gpuva purposes and add
> needed ops to it: Like so:

That's a good idea, will take this into V2.

> 
> struct drm_gpuva_exec_ops {
>     int (*fn) (struct drm_gpuva_exec *exec, int num_fences);

Is this the fn argument from drm_gpuva_manager_lock_extra()?

>     int (*bo_validate) (struct drm_gpuva_exec *exec, struct drm_gem_object
> *obj);

I guess we could also keep that within the drm_gpuva_fn_ops? This should always
be the same callback, right?

> };
> 
> struct drm_gpuva_exec {
>     const struct drm_gpuva_exec_ops *ops;
>     struct drm_exec exec;
>     struct drm_gpuva_manager *mgr;
> };
> 
> Although I'd actually expect bo_validate to be part of fn in the typical
> case. The drm_gpuva_exec would then be allocated by the caller on the stack.

This doesn't sound like my assumption about fn() above is correct.

> 
> 
> > +
> > +int drm_gpuva_manager_lock_extra(struct drm_gpuva_manager *mgr,
> > +				 int (*fn)(struct drm_gpuva_manager *mgr,
> > +					   void *priv, unsigned int num_fences),
> > +				 void *priv,
> > +				 unsigned int num_fences,
> > +				 bool interruptible);
> > +
> > +int drm_gpuva_manager_lock_array(struct drm_gpuva_manager *mgr,
> > +				 struct drm_gem_object **objs,
> > +				 unsigned int num_objs,
> > +				 unsigned int num_fences,
> > +				 bool interruptible);
> > +
> > +/**
> > + * drm_gpuva_manager_lock() - lock all dma-resv of all associated BOs
> > + * @mgr: the &drm_gpuva_manager
> > + * @num_fences: the amount of &dma_fences to reserve
> > + * @interruptible: sleep interruptible if waiting
> > + *
> > + * Acquires all dma-resv locks of all &drm_gem_objects the given
> > + * &drm_gpuva_manager contains mappings of.
> > + *
> > + * Returns: 0 on success, negative error code on failure.
> > + */
> > +static inline int
> > +drm_gpuva_manager_lock(struct drm_gpuva_manager *mgr,
> > +		       unsigned int num_fences,
> > +		       bool interruptible)
> > +{
> > +	return drm_gpuva_manager_lock_extra(mgr, NULL, NULL, num_fences,
> > +					    interruptible);
> > +}
> > +
> > +/**
> > + * drm_gpuva_manager_unlock() - unlock all dma-resv of all associated BOs
> > + * @mgr: the &drm_gpuva_manager
> > + *
> > + * Releases all dma-resv locks of all &drm_gem_objects previously acquired
> > + * through drm_gpuva_manager_lock() or its variants.
> > + */
> > +static inline void
> > +drm_gpuva_manager_unlock(struct drm_gpuva_manager *mgr)
> > +{
> > +	drm_exec_fini(&mgr->exec);
> > +}
> > +
> > +int drm_gpuva_manager_validate(struct drm_gpuva_manager *mgr);
> > +void drm_gpuva_manager_resv_add_fence(struct drm_gpuva_manager *mgr,
> > +				      struct dma_fence *fence,
> > +				      enum dma_resv_usage private_usage,
> > +				      enum dma_resv_usage extobj_usage);
> > +
> > +int drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
> > +			    struct drm_gem_object *obj);
> > +void drm_gpuva_extobj_get(struct drm_gpuva_manager *mgr,
> > +			  struct drm_gem_object *obj);
> > +void drm_gpuva_extobj_put(struct drm_gpuva_manager *mgr,
> > +			  struct drm_gem_object *obj);
> > +
> > +/**
> > + * drm_gpuva_is_extobj() - indicates whether the given &drm_gem_object is an
> > + * external object
> > + * @mgr: the &drm_gpuva_manager to check
> > + * @obj: the &drm_gem_object to check
> > + *
> > + * Returns: true if the &drm_gem_object's &dma_resv differs from the
> > + * &drm_gpuva_manager's &dma_resv, false otherwise
> > + */
> > +static inline bool drm_gpuva_is_extobj(struct drm_gpuva_manager *mgr,
> > +				       struct drm_gem_object *obj)
> > +{
> > +	return obj && obj->resv != mgr->resv;
> > +}
> > +
> >   static inline struct drm_gpuva *
> >   __drm_gpuva_next(struct drm_gpuva *va)
> >   {
> > @@ -327,6 +453,138 @@ __drm_gpuva_next(struct drm_gpuva *va)
> >   #define drm_gpuva_for_each_va_safe(va__, next__, mgr__) \
> >   	list_for_each_entry_safe(va__, next__, &(mgr__)->rb.list, rb.entry)
> > +/**
> > + * struct drm_gpuva_gem - structure representing a &drm_gpuva_manager and
> > + * &drm_gem_object combination
> > + *
> > + * This structure is an abstraction representing a &drm_gpuva_manager and
> > + * &drm_gem_object combination. It serves as an indirection to accelerate
> > + * iterating all &drm_gpuvas within a &drm_gpuva_manager backed by the same
> > + * &drm_gem_object.
> > + *
> > + * Furthermore it is used to cache evicted GEM objects for a certain GPU-VM to
> > + * accelerate validation.
> > + *
> > + * Typically, drivers want to create an instance of a struct drm_gpuva_gem once
> > + * a GEM object is mapped first in a GPU-VM and release the instance once the
> > + * last mapping of the GEM object in this GPU-VM is unmapped.
> > + */
> > +struct drm_gpuva_gem {
> > +
> > +	/**
> > +	 * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> > +	 */
> > +	struct drm_gpuva_manager *mgr;
> > +
> > +	/**
> > +	 * @obj: The &drm_gem_object being mapped in the @mgr.
> > +	 */
> > +	struct drm_gem_object *obj;
> > +
> > +	/**
> > +	 * @kref: The reference count for this &drm_gpuva_gem.
> > +	 */
> > +	struct kref kref;
> > +
> > +	/**
> > +	 * @list: Structure containing all &list_heads.
> > +	 */
> > +	struct {
> > +		/**
> > +		 * @gpuva: The list of linked &drm_gpuvas.
> > +		 */
> > +		struct list_head gpuva;
> > +
> > +		/**
> > +		 * @entry: Structure containing all &list_heads serving as
> > +		 * entry.
> > +		 */
> > +		struct {
> > +			/**
> > +			 * @gem: List entry to attach to the &drm_gem_objects
> > +			 * gpuva list.
> > +			 */
> > +			struct list_head gem;
> > +
> > +			/**
> > +			 * @evict: List entry to attach to the
> > +			 * &drm_gpuva_managers evict list.
> > +			 */
> > +			struct list_head evict;
> > +		} entry;
> > +	} list;
> > +};
> > +
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_obtain(struct drm_gpuva_manager *mgr,
> > +		     struct drm_gem_object *obj);
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_obtain_prealloc(struct drm_gpuva_manager *mgr,
> > +			      struct drm_gem_object *obj,
> > +			      struct drm_gpuva_gem *__vm_bo);
> > +
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
> > +		   struct drm_gem_object *obj);
> > +
> > +void drm_gpuva_gem_evict(struct drm_gem_object *obj, bool evict);
> > +
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_create(struct drm_gpuva_manager *mgr,
> > +		     struct drm_gem_object *obj);
> > +void drm_gpuva_gem_destroy(struct kref *kref);
> > +
> > +/**
> > + * drm_gpuva_gem_get() - acquire a struct drm_gpuva_gem reference
> > + * @vm_bo: the &drm_gpuva_gem to acquire the reference of
> > + *
> > + * This function acquires an additional reference to @vm_bo. It is illegal to
> > + * call this without already holding a reference. No locks required.
> > + */
> > +static inline struct drm_gpuva_gem *
> > +drm_gpuva_gem_get(struct drm_gpuva_gem *vm_bo)
> > +{
> > +	kref_get(&vm_bo->kref);
> > +	return vm_bo;
> > +}
> > +
> > +/**
> > + * drm_gpuva_gem_put() - drop a struct drm_gpuva_gem reference
> > + * @vm_bo: the &drm_gpuva_gem to release the reference of
> > + *
> > + * This releases a reference to @vm_bo.
> > + */
> > +static inline void
> > +drm_gpuva_gem_put(struct drm_gpuva_gem *vm_bo)
> > +{
> > +	kref_put(&vm_bo->kref, drm_gpuva_gem_destroy);
> > +}
> > +
> > +/**
> > + * drm_gpuva_gem_for_each_va() - iterator to walk over a list of &drm_gpuva
> > + * @va__: &drm_gpuva structure to assign to in each iteration step
> > + * @vm_bo__: the &drm_gpuva_gem the &drm_gpuva to walk are associated with
> > + *
> > + * This iterator walks over all &drm_gpuva structures associated with the
> > + * &drm_gpuva_gem.
> > + */
> > +#define drm_gpuva_gem_for_each_va(va__, vm_bo__) \
> > +	list_for_each_entry(va__, &(vm_bo__)->list.gpuva, gem.entry)
> > +
> > +/**
> > + * drm_gpuva_gem_for_each_va_safe() - iterator to safely walk over a list of
> > + * &drm_gpuva
> > + * @va__: &drm_gpuva structure to assign to in each iteration step
> > + * @next__: &next &drm_gpuva to store the next step
> > + * @vm_bo__: the &drm_gpuva_gem the &drm_gpuva to walk are associated with
> > + *
> > + * This iterator walks over all &drm_gpuva structures associated with the
> > + * &drm_gpuva_gem. It is implemented with list_for_each_entry_safe(), hence
> > + * it is safe against removal of elements.
> > + */
> > +#define drm_gpuva_gem_for_each_va_safe(va__, next__, vm_bo__) \
> > +	list_for_each_entry_safe(va__, next__, &(vm_bo__)->list.gpuva, gem.entry)
> > +
> >   /**
> >    * enum drm_gpuva_op_type - GPU VA operation type
> >    *
> > @@ -641,6 +899,30 @@ struct drm_gpuva_fn_ops {
> >   	 */
> >   	void (*op_free)(struct drm_gpuva_op *op);
> > +	/**
> > +	 * @vm_bo_alloc: called when the &drm_gpuva_manager allocates
> > +	 * a struct drm_gpuva_gem
> > +	 *
> > +	 * Some drivers may want to embed struct drm_gpuva_gem into driver
> > +	 * specific structures. By implementing this callback drivers can
> > +	 * allocate memory accordingly.
> > +	 *
> > +	 * This callback is optional.
> > +	 */
> > +	struct drm_gpuva_gem *(*vm_bo_alloc)(void);
> > +
> > +	/**
> > +	 * @vm_bo_free: called when the &drm_gpuva_manager frees a
> > +	 * struct drm_gpuva_gem
> > +	 *
> > +	 * Some drivers may want to embed struct drm_gpuva_gem into driver
> > +	 * specific structures. By implementing this callback drivers can
> > +	 * free the previously allocated memory accordingly.
> > +	 *
> > +	 * This callback is optional.
> > +	 */
> > +	void (*vm_bo_free)(struct drm_gpuva_gem *vm_bo);
> > +
> >   	/**
> >   	 * @sm_step_map: called from &drm_gpuva_sm_map to finally insert the
> >   	 * mapping once all previous steps were completed
> > @@ -684,6 +966,17 @@ struct drm_gpuva_fn_ops {
> >   	 * used.
> >   	 */
> >   	int (*sm_step_unmap)(struct drm_gpuva_op *op, void *priv);
> > +
> > +	/**
> > +	 * @bo_validate: called from drm_gpuva_manager_validate()
> > +	 *
> > +	 * Drivers receive this callback for every evicted &drm_gem_object being
> > +	 * mapped in the corresponding &drm_gpuva_manager.
> > +	 *
> > +	 * Typically, drivers would call their driver specific variant of
> > +	 * ttm_bo_validate() from within this callback.
> > +	 */
> > +	int (*bo_validate)(struct drm_gem_object *obj);
> >   };
> >   int drm_gpuva_sm_map(struct drm_gpuva_manager *mgr, void *priv,
> > @@ -696,11 +989,18 @@ int drm_gpuva_sm_unmap(struct drm_gpuva_manager *mgr, void *priv,
> >   void drm_gpuva_map(struct drm_gpuva_manager *mgr,
> >   		   struct drm_gpuva *va,
> >   		   struct drm_gpuva_op_map *op);
> > +void drm_gpuva_map_get(struct drm_gpuva_manager *mgr,
> > +		       struct drm_gpuva *va,
> > +		       struct drm_gpuva_op_map *op);
> >   void drm_gpuva_remap(struct drm_gpuva *prev,
> >   		     struct drm_gpuva *next,
> >   		     struct drm_gpuva_op_remap *op);
> > +void drm_gpuva_remap_get(struct drm_gpuva *prev,
> > +			 struct drm_gpuva *next,
> > +			 struct drm_gpuva_op_remap *op);
> >   void drm_gpuva_unmap(struct drm_gpuva_op_unmap *op);
> > +void drm_gpuva_unmap_put(struct drm_gpuva_op_unmap *op);
> >   #endif /* __DRM_GPUVA_MGR_H__ */
> 


^ permalink raw reply	[flat|nested] 88+ messages in thread

* Re: [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
@ 2023-08-30 12:49       ` Danilo Krummrich
  0 siblings, 0 replies; 88+ messages in thread
From: Danilo Krummrich @ 2023-08-30 12:49 UTC (permalink / raw)
  To: Thomas Hellström (Intel)
  Cc: airlied, daniel, matthew.brost, thomas.hellstrom, sarah.walker,
	donald.robson, boris.brezillon, christian.koenig, faith.ekstrand,
	bskeggs, Liam.Howlett, nouveau, linux-kernel, dri-devel

Hi Thomas,

thanks for having a look!

On Wed, Aug 30, 2023 at 09:27:45AM +0200, Thomas Hellström (Intel) wrote:
> Hi, Danilo.
> 
> Some quick comments since I'm doing some Xe work in this area. Will probably
> get back with more.
> 
> On 8/20/23 23:53, Danilo Krummrich wrote:
> > So far the DRM GPUVA manager offers common infrastructure to track GPU VA
> > allocations and mappings, generically connect GPU VA mappings to their
> > backing buffers and perform more complex mapping operations on the GPU VA
> > space.
> > 
> > However, there are more design patterns commonly used by drivers, which
> > can potentially be generalized in order to make the DRM GPUVA manager
> > represent a basic GPU-VM implementation. In this context, this patch aims
> > at generalizing the following elements.
> > 
> > 1) Provide a common dma-resv for GEM objects not being used outside of
> >     this GPU-VM.
> > 
> > 2) Provide tracking of external GEM objects (GEM objects which are
> >     shared with other GPU-VMs).
> > 
> > 3) Provide functions to efficiently lock all GEM objects dma-resv the
> >     GPU-VM contains mappings of.
> > 
> > 4) Provide tracking of evicted GEM objects the GPU-VM contains mappings
> >     of, such that validation of evicted GEM objects is accelerated.
> > 
> > 5) Provide some convenience functions for common patterns.
> > 
> > Rather than being designed as a "framework", the target is to make all
> > features appear as a collection of optional helper functions, such that
> > drivers are free to make use of the DRM GPUVA managers basic
> > functionality and opt-in for other features without setting any feature
> > flags, just by making use of the corresponding functions.
> > 
> > Signed-off-by: Danilo Krummrich <dakr@redhat.com>
> > ---
> >   drivers/gpu/drm/drm_gpuva_mgr.c | 688 +++++++++++++++++++++++++++++++-
> >   include/drm/drm_gem.h           |  48 ++-
> >   include/drm/drm_gpuva_mgr.h     | 302 +++++++++++++-
> >   3 files changed, 1010 insertions(+), 28 deletions(-)
> > 
> > diff --git a/drivers/gpu/drm/drm_gpuva_mgr.c b/drivers/gpu/drm/drm_gpuva_mgr.c
> > index f86bfad74ff8..69872b205961 100644
> > --- a/drivers/gpu/drm/drm_gpuva_mgr.c
> > +++ b/drivers/gpu/drm/drm_gpuva_mgr.c
> > @@ -655,6 +655,7 @@ drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
> >   /**
> >    * drm_gpuva_manager_init() - initialize a &drm_gpuva_manager
> >    * @mgr: pointer to the &drm_gpuva_manager to initialize
> > + * @drm: the drivers &drm_device
> >    * @name: the name of the GPU VA space
> >    * @start_offset: the start offset of the GPU VA space
> >    * @range: the size of the GPU VA space
> > @@ -669,6 +670,7 @@ drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
> >    */
> >   void
> >   drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> > +		       struct drm_device *drm,
> >   		       const char *name,
> >   		       u64 start_offset, u64 range,
> >   		       u64 reserve_offset, u64 reserve_range,
> > @@ -677,6 +679,11 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> >   	mgr->rb.tree = RB_ROOT_CACHED;
> >   	INIT_LIST_HEAD(&mgr->rb.list);
> > +	mt_init(&mgr->mt_ext);
> > +
> > +	INIT_LIST_HEAD(&mgr->evict.list);
> > +	spin_lock_init(&mgr->evict.lock);
> > +
> >   	drm_gpuva_check_overflow(start_offset, range);
> >   	mgr->mm_start = start_offset;
> >   	mgr->mm_range = range;
> > @@ -694,6 +701,9 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> >   						     reserve_range)))
> >   			__drm_gpuva_insert(mgr, &mgr->kernel_alloc_node);
> >   	}
> > +
> > +	drm_gem_private_object_init(drm, &mgr->d_obj, 0);
> > +	mgr->resv = mgr->d_obj.resv;
> >   }
> >   EXPORT_SYMBOL_GPL(drm_gpuva_manager_init);
> > @@ -713,10 +723,575 @@ drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr)
> >   		__drm_gpuva_remove(&mgr->kernel_alloc_node);
> >   	WARN(!RB_EMPTY_ROOT(&mgr->rb.tree.rb_root),
> > -	     "GPUVA tree is not empty, potentially leaking memory.");
> > +	     "GPUVA tree is not empty, potentially leaking memory.\n");
> > +
> > +	mtree_destroy(&mgr->mt_ext);
> > +	WARN(!list_empty(&mgr->evict.list), "Evict list should be empty.\n");
> > +
> > +	drm_gem_private_object_fini(&mgr->d_obj);
> >   }
> >   EXPORT_SYMBOL_GPL(drm_gpuva_manager_destroy);
> > +/**
> > + * drm_gpuva_manager_prepare_objects() - prepare all associated BOs
> > + * @mgr: the &drm_gpuva_manager
> > + * @num_fences: the amount of &dma_fences to reserve
> > + *
> > + * Calls drm_exec_prepare_obj() for all &drm_gem_objects the given
> > + * &drm_gpuva_manager contains mappings of.
> > + *
> > + * Drivers can obtain the corresponding &drm_exec instance through
> > + * DRM_GPUVA_EXEC(). It is the drivers responsibility to call drm_exec_init()
> > + * and drm_exec_fini() accordingly.
> > + *
> > + * Returns: 0 on success, negative error code on failure.
> > + */
> > +int
> > +drm_gpuva_manager_prepare_objects(struct drm_gpuva_manager *mgr,
> > +				  unsigned int num_fences)
> > +{
> > +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
> > +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
> > +	union {
> > +		void *ptr;
> > +		uintptr_t cnt;
> > +	} ref;
> > +	int ret;
> > +
> > +	ret = drm_exec_prepare_obj(exec, &mgr->d_obj, num_fences);
> > +	if (ret)
> > +		goto out;
> > +
> > +	rcu_read_lock();
> In xe we're protecting the external object list with an outer lock, (same as
> protecting the mgr itself). Do we need a separate lock for this? In theory
> as  outlined in the VM_BIND locking document draft, one could probably even
> use the mgr resv for this, but with more complicated code I guess. Also see
> the comment below about the data structure chosen.

The idea is to protect this list with the GPU-VM lock. The locking here is more
a consequence of using the maple tree. Either you use the internal lock of the
maple tree or RCU respectively, or you give the maple tree an external lock to
perform lockdep checks on (mt_set_external_lock()). Basically same as here:

https://elixir.bootlin.com/linux/latest/source/drivers/base/regmap/regcache-maple.c#L124
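
Just for illustration, the external lock variant would be set up roughly like
this (untested sketch; gpuvm_lock is a made-up placeholder for the drivers
outer GPU-VM lock):

	/* at init time, instead of a plain mt_init() */
	mt_init_flags(&mgr->mt_ext, MT_FLAGS_LOCK_EXTERN);
	mt_set_external_lock(&mgr->mt_ext, &gpuvm_lock);

Note that mt_set_external_lock() only tells lockdep which lock to check for;
it's still the drivers responsibility to actually hold that lock around all
operations on the tree.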

> > +	mas_for_each(&mas, ref.ptr, ULONG_MAX) {
> > +		struct drm_gem_object *obj;
> > +
> > +		mas_pause(&mas);
> > +		rcu_read_unlock();
> > +
> > +		obj = (struct drm_gem_object *)(uintptr_t)mas.index;
> > +		ret = drm_exec_prepare_obj(exec, obj, num_fences);
> > +		if (ret)
> > +			goto out;
> > +
> > +		rcu_read_lock();
> > +	}
> > +	rcu_read_unlock();
> > +
> > +out:
> > +	return ret;
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_manager_prepare_objects);
> > +
> > +/**
> > + * drm_gpuva_manager_lock_extra() - lock all dma-resv of all associated BOs
> > + * @mgr: the &drm_gpuva_manager
> > + * @fn: callback received by the driver to lock additional dma-resv
> > + * @priv: private driver data passed to @fn
> > + * @num_fences: the amount of &dma_fences to reserve
> > + * @interruptible: sleep interruptible if waiting
> > + *
> > + * Acquires all dma-resv locks of all &drm_gem_objects the given
> > + * &drm_gpuva_manager contains mappings of.
> > + *
> > + * Additionally, when calling this function the driver receives the given @fn
> > + * callback to lock additional dma-resv in the context of the
> > + * &drm_gpuva_managers &drm_exec instance. Typically, drivers would call
> > + * drm_exec_prepare_obj() from within this callback.
> > + *
> > + * Returns: 0 on success, negative error code on failure.
> > + */
> > +int
> > +drm_gpuva_manager_lock_extra(struct drm_gpuva_manager *mgr,
> > +			     int (*fn)(struct drm_gpuva_manager *mgr,
> > +				       void *priv, unsigned int num_fences),
> > +			     void *priv,
> > +			     unsigned int num_fences,
> > +			     bool interruptible)
> > +{
> > +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
> > +	uint32_t flags;
> > +	int ret;
> > +
> > +	flags = interruptible ? DRM_EXEC_INTERRUPTIBLE_WAIT : 0 |
> > +		DRM_EXEC_IGNORE_DUPLICATES;
> > +
> > +	drm_exec_init(exec, flags);
> > +
> > +	drm_exec_until_all_locked(exec) {
> > +		ret = drm_gpuva_manager_prepare_objects(mgr, num_fences);
> > +		drm_exec_retry_on_contention(exec);
> > +		if (ret)
> > +			goto err;
> > +
> > +		if (fn) {
> > +			ret = fn(mgr, priv, num_fences);
> > +			drm_exec_retry_on_contention(exec);
> > +			if (ret)
> > +				goto err;
> > +		}
> > +	}
> > +
> > +	return 0;
> > +
> > +err:
> > +	drm_exec_fini(exec);
> > +	return ret;
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_manager_lock_extra);
> > +
> > +static int
> > +fn_lock_array(struct drm_gpuva_manager *mgr, void *priv,
> > +				unsigned int num_fences)
> > +{
> > +	struct {
> > +		struct drm_gem_object **objs;
> > +		unsigned int num_objs;
> > +	} *args = priv;
> > +
> > +	return drm_exec_prepare_array(DRM_GPUVA_EXEC(mgr), args->objs,
> > +				      args->num_objs, num_fences);
> > +}
> > +
> > +/**
> > + * drm_gpuva_manager_lock_array() - lock all dma-resv of all associated BOs
> > + * @mgr: the &drm_gpuva_manager
> > + * @objs: additional &drm_gem_objects to lock
> > + * @num_objs: the number of additional &drm_gem_objects to lock
> > + * @num_fences: the amount of &dma_fences to reserve
> > + * @interruptible: sleep interruptible if waiting
> > + *
> > + * Acquires all dma-resv locks of all &drm_gem_objects the given
> > + * &drm_gpuva_manager contains mappings of, plus the ones given through @objs.
> > + *
> > + * Returns: 0 on success, negative error code on failure.
> > + */
> > +int
> > +drm_gpuva_manager_lock_array(struct drm_gpuva_manager *mgr,
> > +			     struct drm_gem_object **objs,
> > +			     unsigned int num_objs,
> > +			     unsigned int num_fences,
> > +			     bool interruptible)
> > +{
> > +	struct {
> > +		struct drm_gem_object **objs;
> > +		unsigned int num_objs;
> > +	} args;
> > +
> > +	args.objs = objs;
> > +	args.num_objs = num_objs;
> > +
> > +	return drm_gpuva_manager_lock_extra(mgr, fn_lock_array, &args,
> > +					    num_fences, interruptible);
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_manager_lock_array);
> > +
> > +/**
> > + * drm_gpuva_manager_validate() - validate all BOs marked as evicted
> > + * @mgr: the &drm_gpuva_manager to validate evicted BOs
> > + *
> > + * Calls the &drm_gpuva_fn_ops.bo_validate callback for all evicted buffer
> > + * objects being mapped in the given &drm_gpuva_manager.
> > + *
> > + * Returns: 0 on success, negative error code on failure.
> > + */
> > +int
> > +drm_gpuva_manager_validate(struct drm_gpuva_manager *mgr)
> > +{
> > +	const struct drm_gpuva_fn_ops *ops = mgr->ops;
> > +	struct drm_gpuva_gem *vm_bo;
> > +	int ret;
> > +
> > +	if (unlikely(!ops || !ops->bo_validate))
> > +		return -ENOTSUPP;
> > +
> > +	/* At this point we should hold all dma-resv locks of all GEM objects
> > +	 * associated with this GPU-VM, hence it is safe to walk the list.
> > +	 */
> > +	list_for_each_entry(vm_bo, &mgr->evict.list, list.entry.evict) {
> > +		dma_resv_assert_held(vm_bo->obj->resv);
> > +
> > +		ret = ops->bo_validate(vm_bo->obj);
> > +		if (ret)
> > +			return ret;
> > +	}
> > +
> > +	return 0;
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_manager_validate);
> > +
> > +/**
> > + * drm_gpuva_manager_resv_add_fence - add fence to private and all extobj
> > + * dma-resv
> > + * @mgr: the &drm_gpuva_manager to add a fence to
> > + * @fence: fence to add
> > + * @private_usage: private dma-resv usage
> > + * @extobj_usage: extobj dma-resv usage
> > + */
> > +void
> > +drm_gpuva_manager_resv_add_fence(struct drm_gpuva_manager *mgr,
> > +				 struct dma_fence *fence,
> > +				 enum dma_resv_usage private_usage,
> > +				 enum dma_resv_usage extobj_usage)
> > +{
> > +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
> > +	struct drm_gem_object *obj;
> > +	unsigned long index;
> > +
> > +	drm_exec_for_each_locked_object(exec, index, obj) {
> > +			dma_resv_assert_held(obj->resv);
> > +			dma_resv_add_fence(obj->resv, fence,
> > +					   drm_gpuva_is_extobj(mgr, obj) ?
> > +					   private_usage : extobj_usage);
> > +	}
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_manager_resv_add_fence);
> > +
> > +static struct drm_gpuva_gem *
> > +__drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
> > +		     struct drm_gem_object *obj)
> > +{
> > +	struct drm_gpuva_gem *vm_bo;
> > +
> > +	drm_gem_gpuva_assert_lock_held(obj);
> > +
> > +	drm_gem_for_each_gpuva_gem(vm_bo, obj)
> > +		if (vm_bo->mgr == mgr)
> > +			return vm_bo;
> > +
> > +	return NULL;
> > +}
> > +
> > +/**
> > + * drm_gpuva_gem_create() - create a new instance of struct drm_gpuva_gem
> > + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> > + * @obj: The &drm_gem_object being mapped in the @mgr.
> > + *
> > + * If provided by the driver, this function uses the &drm_gpuva_fn_ops
> > + * vm_bo_alloc() callback to allocate.
> > + *
> > + * Returns: a pointer to the &drm_gpuva_gem on success, NULL on failure
> > + */
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_create(struct drm_gpuva_manager *mgr,
> > +		     struct drm_gem_object *obj)
> > +{
> > +	const struct drm_gpuva_fn_ops *ops = mgr->ops;
> > +	struct drm_gpuva_gem *vm_bo;
> > +
> > +	if (ops && ops->vm_bo_alloc)
> > +		vm_bo = ops->vm_bo_alloc();
> > +	else
> > +		vm_bo = kzalloc(sizeof(*vm_bo), GFP_KERNEL);
> > +
> > +	if (unlikely(!vm_bo))
> > +		return NULL;
> > +
> > +	vm_bo->mgr = mgr;
> > +	vm_bo->obj = obj;
> > +
> > +	kref_init(&vm_bo->kref);
> > +	INIT_LIST_HEAD(&vm_bo->list.gpuva);
> > +	INIT_LIST_HEAD(&vm_bo->list.entry.gem);
> > +	INIT_LIST_HEAD(&vm_bo->list.entry.evict);
> > +
> > +	drm_gem_object_get(obj);
> > +
> > +	return vm_bo;
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_create);
> > +
> > +void
> > +drm_gpuva_gem_destroy(struct kref *kref)
> > +{
> > +	struct drm_gpuva_gem *vm_bo = container_of(kref, struct drm_gpuva_gem,
> > +						   kref);
> > +	const struct drm_gpuva_fn_ops *ops = vm_bo->mgr->ops;
> > +
> > +	drm_gem_object_put(vm_bo->obj);
> > +
> > +	if (ops && ops->vm_bo_free)
> > +		ops->vm_bo_free(vm_bo);
> > +	else
> > +		kfree(vm_bo);
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_destroy);
> > +
> > +/**
> > + * drm_gpuva_gem_find() - find the &drm_gpuva_gem for the given
> > + * &drm_gpuva_manager and &drm_gem_object
> > + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> > + * @obj: The &drm_gem_object being mapped in the @mgr.
> > + *
> > + * Find the &drm_gpuva_gem representing the combination of the given
> > + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
> > + * count of the &drm_gpuva_gem accordingly.
> > + *
> > + * Returns: a pointer to the &drm_gpuva_gem on success, NULL on failure
> > + */
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
> > +		   struct drm_gem_object *obj)
> > +{
> > +	struct drm_gpuva_gem *vm_bo = __drm_gpuva_gem_find(mgr, obj);
> > +
> > +	return vm_bo ? drm_gpuva_gem_get(vm_bo) : NULL;
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_find);
> > +
> > +/**
> > + * drm_gpuva_gem_obtain() - obtains an instance of the &drm_gpuva_gem for the
> > + * given &drm_gpuva_manager and &drm_gem_object
> > + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> > + * @obj: The &drm_gem_object being mapped in the @mgr.
> > + *
> > + * Find the &drm_gpuva_gem representing the combination of the given
> > + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
> > + * count of the &drm_gpuva_gem accordingly. If not found, allocates a new
> > + * &drm_gpuva_gem.
> > + *
> > + * Returns: a pointer to the &drm_gpuva_gem on success, an ERR_PTR on failure
> > + */
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_obtain(struct drm_gpuva_manager *mgr,
> > +		     struct drm_gem_object *obj)
> > +{
> > +	struct drm_gpuva_gem *vm_bo;
> > +
> > +	vm_bo = drm_gpuva_gem_find(mgr, obj);
> > +	if (vm_bo)
> > +		return vm_bo;
> > +
> > +	vm_bo = drm_gpuva_gem_create(mgr, obj);
> > +	if (!vm_bo)
> > +		return ERR_PTR(-ENOMEM);
> > +
> > +	return vm_bo;
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_obtain);
> > +
> > +/**
> > + * drm_gpuva_gem_obtain_prealloc() - obtains an instance of the &drm_gpuva_gem
> > + * for the given &drm_gpuva_manager and &drm_gem_object
> > + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> > + * @obj: The &drm_gem_object being mapped in the @mgr.
> > + * @__vm_bo: A pre-allocated &drm_gpuva_gem to use if no existing one is found.
> > + *
> > + * Find the &drm_gpuva_gem representing the combination of the given
> > + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
> > + * count of the found &drm_gpuva_gem accordingly, while the @__vm_bo reference
> > + * count is decreased. If not found @__vm_bo is returned.
> > + *
> > + * Returns: a pointer to the found &drm_gpuva_gem or @__vm_bo if no existing
> > + * &drm_gpuva_gem was found
> > + */
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_obtain_prealloc(struct drm_gpuva_manager *mgr,
> > +			      struct drm_gem_object *obj,
> > +			      struct drm_gpuva_gem *__vm_bo)
> > +{
> > +	struct drm_gpuva_gem *vm_bo;
> > +
> > +	vm_bo = drm_gpuva_gem_find(mgr, obj);
> > +	if (vm_bo) {
> > +		drm_gpuva_gem_put(__vm_bo);
> > +		return vm_bo;
> > +	}
> > +
> > +	return __vm_bo;
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_obtain_prealloc);
> > +
> > +static int
> > +__drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
> > +			  struct drm_gem_object *obj,
> > +			  gfp_t gfp)
> > +{
> > +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
> > +	union {
> > +		struct drm_gem_object *obj;
> > +		uintptr_t index;
> > +	} gem;
> > +	union {
> > +		void *ptr;
> > +		uintptr_t cnt;
> > +	} ref;
> > +	int ret = 0;
> > +
> > +	gem.obj = obj;
> > +	mas_set(&mas, gem.index);
> > +
> > +	mas_lock(&mas);
> > +	ref.ptr = mas_walk(&mas);
> > +	if (ref.ptr) {
> > +		++ref.cnt;
> > +		mas_store(&mas, ref.ptr);
> > +	} else {
> > +		if (unlikely(!gfp)) {
> > +			ret = -EINVAL;
> > +			goto out;
> > +		}
> > +
> > +		mas_set(&mas, gem.index);
> > +		ref.cnt = 1;
> > +		ret = mas_store_gfp(&mas, ref.ptr, gfp);
> > +		if (likely(!ret))
> > +			drm_gem_object_get(obj);
> > +	}
> > +out:
> > +	mas_unlock(&mas);
> > +	return ret;
> > +}
> > +
> > +static void
> > +__drm_gpuva_extobj_remove(struct drm_gpuva_manager *mgr,
> > +			  struct drm_gem_object *obj)
> > +{
> > +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
> > +	union {
> > +		struct drm_gem_object *obj;
> > +		uintptr_t index;
> > +	} gem;
> > +	union {
> > +		void *ptr;
> > +		uintptr_t cnt;
> > +	} ref;
> > +
> > +	gem.obj = obj;
> > +	mas_set(&mas, gem.index);
> > +
> > +	mas_lock(&mas);
> > +	if (unlikely(!(ref.ptr = mas_walk(&mas))))
> > +		goto out;
> > +
> > +	if (!--ref.cnt) {
> > +		mas_erase(&mas);
> > +		drm_gem_object_put(obj);
> > +	} else {
> > +		mas_store(&mas, ref.ptr);
> > +	}
> > +out:
> > +	mas_unlock(&mas);
> > +}
> > +
> > +/**
> > + * drm_gpuva_extobj_insert - insert an external &drm_gem_object
> > + * @mgr: the &drm_gpuva_manager to insert into
> > + * @obj: the &drm_gem_object to insert as extobj
> > + *
> > + * Insert a &drm_gem_object into the &drm_gpuva_managers external object tree.
> > + * If the &drm_gem_object already exists in the tree, the reference counter
> > + * of this external object is increased by one.
> > + *
> > + * Drivers should insert the external &drm_gem_object before the dma-fence
> > + * signalling critical section, e.g. when submitting the job, and before
> > + * locking all &drm_gem_objects of a GPU-VM, e.g. with drm_gpuva_manager_lock()
> > + * or its variants.
> > + *
> > + * Returns: 0 on success, negative error code on failure.
> > + */
> > +int
> > +drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
> > +			struct drm_gem_object *obj)
> > +{
> > +	return drm_gpuva_is_extobj(mgr, obj) ?
> > +		__drm_gpuva_extobj_insert(mgr, obj, GFP_KERNEL) : 0;
> > +
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_insert);
> > +
> > +/**
> > + * drm_gpuva_extobj_get - increase the reference count of an external
> > + * &drm_gem_object
> > + * @mgr: the &drm_gpuva_manager storing the extobj
> > + * @obj: the &drm_gem_object representing the extobj
> > + *
> > + * Increases the reference count of the extobj represented by @obj.
> > + *
> > + * Drivers should call this for every &drm_gpuva backed by a &drm_gem_object
> > + * being inserted.
> > + *
> > + * For &drm_gpuva_op_remap operations drivers should make sure to only take an
> > + * additional reference if the re-map operation splits an existing &drm_gpuva
> > + * into two separate ones.
> > + *
> > + * See also drm_gpuva_map_get() and drm_gpuva_remap_get().
> > + */
> > +void
> > +drm_gpuva_extobj_get(struct drm_gpuva_manager *mgr,
> > +		     struct drm_gem_object *obj)
> > +{
> > +	if (drm_gpuva_is_extobj(mgr, obj))
> > +		WARN(__drm_gpuva_extobj_insert(mgr, obj, 0),
> > +		     "Can't increase ref-count of non-existent extobj.");
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_get);
> > +
> > +/**
> > + * drm_gpuva_extobj_put - decrease the reference count of an external
> > + * &drm_gem_object
> > + * @mgr: the &drm_gpuva_manager storing the extobj
> > + * @obj: the &drm_gem_object representing the extobj
> > + *
> > + * Decreases the reference count of the extobj represented by @obj.
> > + *
> > + * Drivers should call this for every &drm_gpuva backed by a &drm_gem_object
> > + * being removed from the GPU VA space.
> > + *
> > + * See also drm_gpuva_unmap_put().
> > + */
> > +void
> > +drm_gpuva_extobj_put(struct drm_gpuva_manager *mgr,
> > +		     struct drm_gem_object *obj)
> > +{
> > +	if (drm_gpuva_is_extobj(mgr, obj))
> > +		__drm_gpuva_extobj_remove(mgr, obj);
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_put);
> > +
> > +/**
> > + * drm_gpuva_gem_evict() - add / remove a &drm_gem_object to / from a
> > + * &drm_gpuva_managers evicted list
> > + * @obj: the &drm_gem_object to add or remove
> > + * @evict: indicates whether the object is evicted
> > + *
> > + * Adds a &drm_gem_object to or removes it from the evict list of all
> > + * &drm_gpuva_managers containing a mapping of this &drm_gem_object.
> > + */
> > +void
> > +drm_gpuva_gem_evict(struct drm_gem_object *obj, bool evict)
> > +{
> > +	struct drm_gpuva_gem *vm_bo;
> > +
> > +	/* Required for iterating the GEMs GPUVA GEM list. If no driver specific
> > +	 * lock has been set, the list is protected with the GEMs dma-resv lock.
> > +	 */
> > +	drm_gem_gpuva_assert_lock_held(obj);
> > +
> > +	/* Required to protect the GPUVA managers evict list against concurrent
> > +	 * access through drm_gpuva_manager_validate(). Concurrent insertions to
> > +	 * the evict list through different GEM object evictions are protected
> > +	 * by the GPUVA managers evict lock.
> > +	 */
> > +	dma_resv_assert_held(obj->resv);
> > +
> > +	drm_gem_for_each_gpuva_gem(vm_bo, obj) {
> > +		struct drm_gpuva_manager *mgr = vm_bo->mgr;
> > +
> > +		spin_lock(&mgr->evict.lock);
> > +		if (evict)
> > +			list_add_tail(&vm_bo->list.entry.evict,
> > +				      &mgr->evict.list);
> > +		else
> > +			list_del_init(&vm_bo->list.entry.evict);
> > +		spin_unlock(&mgr->evict.lock);
> > +	}
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_evict);
> > +
> >   static int
> >   __drm_gpuva_insert(struct drm_gpuva_manager *mgr,
> >   		   struct drm_gpuva *va)
> > @@ -806,15 +1381,20 @@ EXPORT_SYMBOL_GPL(drm_gpuva_remove);
> >   /**
> >    * drm_gpuva_link() - link a &drm_gpuva
> >    * @va: the &drm_gpuva to link
> > + * @vm_bo: the &drm_gpuva_gem to add the &drm_gpuva to
> >    *
> > - * This adds the given &va to the GPU VA list of the &drm_gem_object it is
> > - * associated with.
> > + * This adds the given &va to the GPU VA list of the &drm_gpuva_gem and the
> > + * &drm_gpuva_gem to the &drm_gem_object it is associated with.
> > + *
> > + * For every &drm_gpuva entry added to the &drm_gpuva_gem an additional
> > + * reference of the latter is taken.
> >    *
> >    * This function expects the caller to protect the GEM's GPUVA list against
> > - * concurrent access using the GEMs dma_resv lock.
> > + * concurrent access using either the GEMs dma_resv lock or a driver specific
> > + * lock set through drm_gem_gpuva_set_lock().
> >    */
> >   void
> > -drm_gpuva_link(struct drm_gpuva *va)
> > +drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo)
> >   {
> >   	struct drm_gem_object *obj = va->gem.obj;
> > @@ -823,7 +1403,10 @@ drm_gpuva_link(struct drm_gpuva *va)
> >   	drm_gem_gpuva_assert_lock_held(obj);
> > -	list_add_tail(&va->gem.entry, &obj->gpuva.list);
> > +	drm_gpuva_gem_get(vm_bo);
> > +	list_add_tail(&va->gem.entry, &vm_bo->list.gpuva);
> > +	if (list_empty(&vm_bo->list.entry.gem))
> > +		list_add_tail(&vm_bo->list.entry.gem, &obj->gpuva.list);
> >   }
> >   EXPORT_SYMBOL_GPL(drm_gpuva_link);
> > @@ -834,20 +1417,39 @@ EXPORT_SYMBOL_GPL(drm_gpuva_link);
> >    * This removes the given &va from the GPU VA list of the &drm_gem_object it is
> >    * associated with.
> >    *
> > + * This removes the given &va from the GPU VA list of the &drm_gpuva_gem and
> > + * the &drm_gpuva_gem from the &drm_gem_object it is associated with in case
> > + * this call unlinks the last &drm_gpuva from the &drm_gpuva_gem.
> > + *
> > + * For every &drm_gpuva entry removed from the &drm_gpuva_gem a reference of
> > + * the latter is dropped.
> > + *
> >    * This function expects the caller to protect the GEM's GPUVA list against
> > - * concurrent access using the GEMs dma_resv lock.
> > + * concurrent access using either the GEMs dma_resv lock or a driver specific
> > + * lock set through drm_gem_gpuva_set_lock().
> >    */
> >   void
> >   drm_gpuva_unlink(struct drm_gpuva *va)
> >   {
> >   	struct drm_gem_object *obj = va->gem.obj;
> > +	struct drm_gpuva_gem *vm_bo;
> >   	if (unlikely(!obj))
> >   		return;
> >   	drm_gem_gpuva_assert_lock_held(obj);
> > +	vm_bo = __drm_gpuva_gem_find(va->mgr, obj);
> > +	if (WARN(!vm_bo, "GPUVA doesn't seem to be linked.\n"))
> > +		return;
> > +
> >   	list_del_init(&va->gem.entry);
> > +
> > +	if (list_empty(&vm_bo->list.gpuva)) {
> > +		list_del_init(&vm_bo->list.entry.gem);
> > +		list_del_init(&vm_bo->list.entry.evict);
> > +	}
> > +	drm_gpuva_gem_put(vm_bo);
> >   }
> >   EXPORT_SYMBOL_GPL(drm_gpuva_unlink);
> > @@ -977,6 +1579,26 @@ drm_gpuva_map(struct drm_gpuva_manager *mgr,
> >   }
> >   EXPORT_SYMBOL_GPL(drm_gpuva_map);
> > +/**
> > + * drm_gpuva_map_get() - helper to insert a &drm_gpuva according to a
> > + * &drm_gpuva_op_map
> > + * @mgr: the &drm_gpuva_manager
> > + * @va: the &drm_gpuva to insert
> > + * @op: the &drm_gpuva_op_map to initialize @va with
> > + *
> > + * Initializes the @va from the @op and inserts it into the given @mgr and
> > + * increases the reference count of the corresponding extobj.
> > + */
> > +void
> > +drm_gpuva_map_get(struct drm_gpuva_manager *mgr,
> > +		  struct drm_gpuva *va,
> > +		  struct drm_gpuva_op_map *op)
> > +{
> > +	drm_gpuva_map(mgr, va, op);
> > +	drm_gpuva_extobj_get(mgr, va->gem.obj);
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_map_get);
> > +
> >   /**
> >    * drm_gpuva_remap() - helper to remap a &drm_gpuva according to a
> >    * &drm_gpuva_op_remap
> > @@ -992,10 +1614,10 @@ drm_gpuva_remap(struct drm_gpuva *prev,
> >   		struct drm_gpuva *next,
> >   		struct drm_gpuva_op_remap *op)
> >   {
> > -	struct drm_gpuva *curr = op->unmap->va;
> > -	struct drm_gpuva_manager *mgr = curr->mgr;
> > +	struct drm_gpuva *va = op->unmap->va;
> > +	struct drm_gpuva_manager *mgr = va->mgr;
> > -	drm_gpuva_remove(curr);
> > +	drm_gpuva_remove(va);
> >   	if (op->prev) {
> >   		drm_gpuva_init_from_op(prev, op->prev);
> > @@ -1009,6 +1631,31 @@ drm_gpuva_remap(struct drm_gpuva *prev,
> >   }
> >   EXPORT_SYMBOL_GPL(drm_gpuva_remap);
> > +/**
> > + * drm_gpuva_remap_get() - helper to remap a &drm_gpuva according to a
> > + * &drm_gpuva_op_remap
> > + * @prev: the &drm_gpuva to remap when keeping the start of a mapping
> > + * @next: the &drm_gpuva to remap when keeping the end of a mapping
> > + * @op: the &drm_gpuva_op_remap to initialize @prev and @next with
> > + *
> > + * Removes the currently mapped &drm_gpuva and remaps it using @prev and/or
> > + * @next. Additionally, if the re-map splits the existing &drm_gpuva into two
> > + * separate mappings, increases the reference count of the corresponding extobj.
> > + */
> > +void
> > +drm_gpuva_remap_get(struct drm_gpuva *prev,
> > +		    struct drm_gpuva *next,
> > +		    struct drm_gpuva_op_remap *op)
> > +{
> > +	struct drm_gpuva *va = op->unmap->va;
> > +	struct drm_gpuva_manager *mgr = va->mgr;
> > +
> > +	drm_gpuva_remap(prev, next, op);
> > +	if (op->prev && op->next)
> > +		drm_gpuva_extobj_get(mgr, va->gem.obj);
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_remap_get);
> > +
> >   /**
> >    * drm_gpuva_unmap() - helper to remove a &drm_gpuva according to a
> >    * &drm_gpuva_op_unmap
> > @@ -1023,6 +1670,24 @@ drm_gpuva_unmap(struct drm_gpuva_op_unmap *op)
> >   }
> >   EXPORT_SYMBOL_GPL(drm_gpuva_unmap);
> > +/**
> > + * drm_gpuva_unmap_put() - helper to remove a &drm_gpuva according to a
> > + * &drm_gpuva_op_unmap
> > + * @op: the &drm_gpuva_op_unmap specifying the &drm_gpuva to remove
> > + *
> > + * Removes the &drm_gpuva associated with the &drm_gpuva_op_unmap and decreases
> > + * the reference count of the corresponding extobj.
> > + */
> > +void
> > +drm_gpuva_unmap_put(struct drm_gpuva_op_unmap *op)
> > +{
> > +	struct drm_gpuva *va = op->va;
> > +
> > +	drm_gpuva_unmap(op);
> > +	drm_gpuva_extobj_put(va->mgr, va->gem.obj);
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_unmap_put);
> > +
> >   static int
> >   op_map_cb(const struct drm_gpuva_fn_ops *fn, void *priv,
> >   	  u64 addr, u64 range,
> > @@ -1663,6 +2328,7 @@ drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
> >   {
> >   	struct drm_gpuva_ops *ops;
> >   	struct drm_gpuva_op *op;
> > +	struct drm_gpuva_gem *vm_bo;
> >   	struct drm_gpuva *va;
> >   	int ret;
> > @@ -1674,7 +2340,7 @@ drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
> >   	INIT_LIST_HEAD(&ops->list);
> > -	drm_gem_for_each_gpuva(va, obj) {
> > +	drm_gem_for_each_gpuva(va, vm_bo, mgr, obj) {
> >   		op = gpuva_op_alloc(mgr);
> >   		if (!op) {
> >   			ret = -ENOMEM;
> > diff --git a/include/drm/drm_gem.h b/include/drm/drm_gem.h
> > index bc9f6aa2f3fe..783ed3ab440d 100644
> > --- a/include/drm/drm_gem.h
> > +++ b/include/drm/drm_gem.h
> > @@ -571,7 +571,7 @@ int drm_gem_evict(struct drm_gem_object *obj);
> >    * drm_gem_gpuva_init() - initialize the gpuva list of a GEM object
> >    * @obj: the &drm_gem_object
> >    *
> > - * This initializes the &drm_gem_object's &drm_gpuva list.
> > + * This initializes the &drm_gem_object's &drm_gpuva_gem list.
> >    *
> >    * Calling this function is only necessary for drivers intending to support the
> >    * &drm_driver_feature DRIVER_GEM_GPUVA.
> > @@ -584,28 +584,44 @@ static inline void drm_gem_gpuva_init(struct drm_gem_object *obj)
> >   }
> >   /**
> > - * drm_gem_for_each_gpuva() - iternator to walk over a list of gpuvas
> > - * @entry__: &drm_gpuva structure to assign to in each iteration step
> > - * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
> > + * drm_gem_for_each_gpuva_gem() - iterator to walk over a list of &drm_gpuva_gem
> > + * @entry__: &drm_gpuva_gem structure to assign to in each iteration step
> > + * @obj__: the &drm_gem_object the &drm_gpuva_gem to walk are associated with
> >    *
> > - * This iterator walks over all &drm_gpuva structures associated with the
> > - * &drm_gpuva_manager.
> > + * This iterator walks over all &drm_gpuva_gem structures associated with the
> > + * &drm_gem_object.
> >    */
> > -#define drm_gem_for_each_gpuva(entry__, obj__) \
> > -	list_for_each_entry(entry__, &(obj__)->gpuva.list, gem.entry)
> > +#define drm_gem_for_each_gpuva_gem(entry__, obj__) \
> > +	list_for_each_entry(entry__, &(obj__)->gpuva.list, list.entry.gem)
> >   /**
> > - * drm_gem_for_each_gpuva_safe() - iternator to safely walk over a list of
> > - * gpuvas
> > - * @entry__: &drm_gpuva structure to assign to in each iteration step
> > - * @next__: &next &drm_gpuva to store the next step
> > - * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
> > + * drm_gem_for_each_gpuva_gem_safe() - iterator to safely walk over a list of
> > + * &drm_gpuva_gem
> > + * @entry__: &drm_gpuva_gem structure to assign to in each iteration step
> > + * @next__: &next &drm_gpuva_gem to store the next step
> > + * @obj__: the &drm_gem_object the &drm_gpuva_gem to walk are associated with
> >    *
> > - * This iterator walks over all &drm_gpuva structures associated with the
> > + * This iterator walks over all &drm_gpuva_gem structures associated with the
> >    * &drm_gem_object. It is implemented with list_for_each_entry_safe(), hence
> >    * it is save against removal of elements.
> >    */
> > -#define drm_gem_for_each_gpuva_safe(entry__, next__, obj__) \
> > -	list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list, gem.entry)
> > +#define drm_gem_for_each_gpuva_gem_safe(entry__, next__, obj__) \
> > +	list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list, list.entry.gem)
> > +
> > +/**
> > + * drm_gem_for_each_gpuva() - iterator to walk over a list of &drm_gpuva
> > + * @va__: &drm_gpuva structure to assign to in each iteration step
> > + * @vm_bo__: the &drm_gpuva_gem representing the @mgr__ and @obj__ combination
> > + * @mgr__: the &drm_gpuva_manager the &drm_gpuvas to walk are associated with
> > + * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
> > + *
> > + * This iterator walks over all &drm_gpuva structures associated with the
> > + * &drm_gpuva_manager and &drm_gem_object.
> > + */
> > +#define drm_gem_for_each_gpuva(va__, vm_bo__, mgr__, obj__) \
> > +	for (vm_bo__ = drm_gpuva_gem_find(mgr__, obj__), \
> > +	     va__ = vm_bo__ ? list_first_entry(&vm_bo__->list.gpuva, typeof(*va__), gem.entry) : NULL; \
> > +	     va__ && !list_entry_is_head(va__, &vm_bo__->list.gpuva, gem.entry); \
> > +	     va__ = list_next_entry(va__, gem.entry))
> >   #endif /* __DRM_GEM_H__ */
> > diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h
> > index ed8d50200cc3..693e2da3f425 100644
> > --- a/include/drm/drm_gpuva_mgr.h
> > +++ b/include/drm/drm_gpuva_mgr.h
> > @@ -26,12 +26,16 @@
> >    */
> >   #include <linux/list.h>
> > +#include <linux/dma-resv.h>
> > +#include <linux/maple_tree.h>
> >   #include <linux/rbtree.h>
> >   #include <linux/types.h>
> >   #include <drm/drm_gem.h>
> > +#include <drm/drm_exec.h>
> >   struct drm_gpuva_manager;
> > +struct drm_gpuva_gem;
> >   struct drm_gpuva_fn_ops;
> >   /**
> > @@ -140,7 +144,7 @@ struct drm_gpuva {
> >   int drm_gpuva_insert(struct drm_gpuva_manager *mgr, struct drm_gpuva *va);
> >   void drm_gpuva_remove(struct drm_gpuva *va);
> > -void drm_gpuva_link(struct drm_gpuva *va);
> > +void drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo);
> >   void drm_gpuva_unlink(struct drm_gpuva *va);
> >   struct drm_gpuva *drm_gpuva_find(struct drm_gpuva_manager *mgr,
> > @@ -240,15 +244,137 @@ struct drm_gpuva_manager {
> >   	 * @ops: &drm_gpuva_fn_ops providing the split/merge steps to drivers
> >   	 */
> >   	const struct drm_gpuva_fn_ops *ops;
> > +
> > +	/**
> > +	 * @d_obj: Dummy GEM object; used internally to pass the GPU VMs
> > +	 * dma-resv to &drm_exec.
> > +	 */
> > +	struct drm_gem_object d_obj;
> > +
> > +	/**
> > +	 * @resv: the &dma_resv for &drm_gem_objects mapped in this GPU VA
> > +	 * space
> > +	 */
> > +	struct dma_resv *resv;
> > +
> > +	/**
> > +	 * @exec: the &drm_exec helper to lock external &drm_gem_objects
> > +	 */
> > +	struct drm_exec exec;
> > +
> > +	/**
> > +	 * @mt_ext: &maple_tree storing external &drm_gem_objects
> > +	 */
> > +	struct maple_tree mt_ext;
> 
> Why are you using a maple tree here? Insertion and removal is O(log(n))
> instead of O(1) for a list?
>

Having a list of drm_gem_objects directly wouldn't work, as multiple GPU-VMs
could have mappings of the same extobj.

I considered using the VM_BO abstraction (struct drm_gpuva_gem) as list entry
instead, which also seems to be the obvious choice. However, there is a locking
conflict.

A drm_gem_object keeps a list of drm_gpuva_gems, while each drm_gpuva_gem keeps
a list of drm_gpuvas. Both lists are either protected with the dma-resv lock of
the corresponding drm_gem_object, or with an external lock provided by the
driver (see drm_gem_gpuva_set_lock()). The latter is used by drivers performing
changes on the GPUVA space directly from the fence signalling path.

Now, similar to what drm_gpuva_link() and drm_gpuva_unlink() are doing already,
we'd want to add a drm_gpuva_gem to the extobj list for the first mapping being
linked and we'd want to remove it for the last one being unlinked.

(Actually we'd want to add the drm_gpuva_gem object to the extobj list even
before, because otherwise we'd not acquire the dma-resv lock of this GEM object
through drm_gpuva_manager_lock(). But that's trivial, we could do that when we
create the drm_gpuva_gem, which we need to do anyway.)

Anyway, we'd probably want to keep removing the drm_gpuva_gem from the extobj
list from drm_gpuva_unlink() when the last mapping of this BO is unlinked. In
order to do so, we'd (as discussed above) either need to hold the outer GPU-VM
lock or the GPU-VMs dma-resv lock. Both would be illegal in the case
drm_gpuva_unlink() is called from within the fence signalling path. For drivers
like XE or Nouveau, we'd at least need to make sure to not mess up the locking
hierarchy of GPU-VM lock and dma-resv lock of the corresponding BO.

Considering all that, I thought it's probably better to track extobjs separate
from the drm_gpuva_gem, hence the maple tree choice.
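
To make the constraint a bit more concrete, the cleanup a driver may run from
within the fence signalling critical section would look roughly like this
(rough sketch only, assuming the GEMs gpuva list is protected by a driver
specific lock set through drm_gem_gpuva_set_lock()):

	/* e.g. from the drivers job cleanup; no sleeping locks allowed here */
	drm_gpuva_unlink(va);                       /* driver specific GEM gpuva lock */
	drm_gpuva_extobj_put(va->mgr, va->gem.obj); /* maple tree internal spinlock only */

If the extobj tracking lived on a list protected by the GPU-VM lock (or the
GPU-VMs dma-resv), the second step would need to take that lock here, which is
exactly what we can't do in this context.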

> > +
> > +	/**
> > +	 * @evict: structure holding the evict list and evict list lock
> > +	 */
> > +	struct {
> > +		/**
> > +		 * @list: &list_head storing &drm_gem_objects currently being
> > +		 * evicted
> > +		 */
> > +		struct list_head list;
> > +
> > +		/**
> > +		 * @lock: spinlock to protect the evict list against concurrent
> > +		 * insertion / removal of different &drm_gpuva_gems
> > +		 */
> > +		spinlock_t lock;
> > +	} evict;
> >   };
> >   void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> > +			    struct drm_device *drm,
> >   			    const char *name,
> >   			    u64 start_offset, u64 range,
> >   			    u64 reserve_offset, u64 reserve_range,
> >   			    const struct drm_gpuva_fn_ops *ops);
> >   void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr);
> > +/**
> > + * DRM_GPUVA_EXEC - returns the &drm_gpuva_managers &drm_exec instance
> > + * @mgr: the &drm_gpuva_managers to return the &drm_exec instance for
> > + */
> > +#define DRM_GPUVA_EXEC(mgr)	&(mgr)->exec
> 
> A struct ww_acquire_ctx and thus a drm_exec is fundamentally per task and
> should typically be allocated on the stack. Otherwise you'd need to protect
> the mgr->exec member with an exclusive lock throughout the locking process,
> and that's not what we want.

Oh, good point. I think it works in Nouveau, because there it's implicitly
protected with the job submission lock.

> 
> Did you consider subclassing a drm_exec for drm_gpuva purposes and add
> needed ops to it: Like so:

That's a good idea, will take this into V2.

> 
> struct drm_gpuva_exec_ops {
>     int (*fn) (struct drm_gpuva_exec *exec, int num_fences);

Is this the fn argument from drm_gpuva_manager_lock_extra()?

>     int (*bo_validate) (struct drm_gpuva_exec *exec, struct drm_gem_object
> *obj);

I guess we could also keep that within the drm_gpuva_fn_ops? This should always
be the same callback, right?

> };
> 
> struct drm_gpuva_exec {
>     const struct drm_gpuva_exec_ops *ops;
>     struct drm_exec exec;
>     struct drm_gpuva_manager *mgr;
> };
> 
> Although I'd actually expect bo_validate to be part of fn in the typical
> case. The drm_gpuva_exec would then be allocated by the caller on the stack.

This doesn't sound like my assumption about fn() above is correct.

> 
> 
> > +
> > +int drm_gpuva_manager_lock_extra(struct drm_gpuva_manager *mgr,
> > +				 int (*fn)(struct drm_gpuva_manager *mgr,
> > +					   void *priv, unsigned int num_fences),
> > +				 void *priv,
> > +				 unsigned int num_fences,
> > +				 bool interruptible);
> > +
> > +int drm_gpuva_manager_lock_array(struct drm_gpuva_manager *mgr,
> > +				 struct drm_gem_object **objs,
> > +				 unsigned int num_objs,
> > +				 unsigned int num_fences,
> > +				 bool interruptible);
> > +
> > +/**
> > + * drm_gpuva_manager_lock() - lock all dma-resv of all associated BOs
> > + * @mgr: the &drm_gpuva_manager
> > + * @num_fences: the amount of &dma_fences to reserve
> > + * @interruptible: sleep interruptible if waiting
> > + *
> > + * Acquires all dma-resv locks of all &drm_gem_objects the given
> > + * &drm_gpuva_manager contains mappings of.
> > + *
> > + * Returns: 0 on success, negative error code on failure.
> > + */
> > +static inline int
> > +drm_gpuva_manager_lock(struct drm_gpuva_manager *mgr,
> > +		       unsigned int num_fences,
> > +		       bool interruptible)
> > +{
> > +	return drm_gpuva_manager_lock_extra(mgr, NULL, NULL, num_fences,
> > +					    interruptible);
> > +}
> > +
> > +/**
> > + * drm_gpuva_manager_unlock() - unlock all dma-resv of all associated BOs
> > + * @mgr: the &drm_gpuva_manager
> > + *
> > + * Releases all dma-resv locks of all &drm_gem_objects previously acquired
> > + * through drm_gpuva_manager_lock() or its variants.
> > + */
> > +static inline void
> > +drm_gpuva_manager_unlock(struct drm_gpuva_manager *mgr)
> > +{
> > +	drm_exec_fini(&mgr->exec);
> > +}
> > +
> > +int drm_gpuva_manager_validate(struct drm_gpuva_manager *mgr);
> > +void drm_gpuva_manager_resv_add_fence(struct drm_gpuva_manager *mgr,
> > +				      struct dma_fence *fence,
> > +				      enum dma_resv_usage private_usage,
> > +				      enum dma_resv_usage extobj_usage);
> > +
> > +int drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
> > +			    struct drm_gem_object *obj);
> > +void drm_gpuva_extobj_get(struct drm_gpuva_manager *mgr,
> > +			  struct drm_gem_object *obj);
> > +void drm_gpuva_extobj_put(struct drm_gpuva_manager *mgr,
> > +			  struct drm_gem_object *obj);
> > +
> > +/**
> > + * drm_gpuva_is_extobj() - indicates whether the given &drm_gem_object is an
> > + * external object
> > + * @mgr: the &drm_gpuva_manager to check
> > + * @obj: the &drm_gem_object to check
> > + *
> > + * Returns: true if the &drm_gem_object &dma_resv differs from the
> > + * &drm_gpuva_managers &dma_resv, false otherwise
> > + */
> > +static inline bool drm_gpuva_is_extobj(struct drm_gpuva_manager *mgr,
> > +				       struct drm_gem_object *obj)
> > +{
> > +	return obj && obj->resv != mgr->resv;
> > +}
> > +
> >   static inline struct drm_gpuva *
> >   __drm_gpuva_next(struct drm_gpuva *va)
> >   {
> > @@ -327,6 +453,138 @@ __drm_gpuva_next(struct drm_gpuva *va)
> >   #define drm_gpuva_for_each_va_safe(va__, next__, mgr__) \
> >   	list_for_each_entry_safe(va__, next__, &(mgr__)->rb.list, rb.entry)
> > +/**
> > + * struct drm_gpuva_gem - structure representing a &drm_gpuva_manager and
> > + * &drm_gem_object combination
> > + *
> > + * This structure is an abstraction representing a &drm_gpuva_manager and
> > + * &drm_gem_object combination. It serves as an indirection to accelerate
> > + * iterating all &drm_gpuvas within a &drm_gpuva_manager backed by the same
> > + * &drm_gem_object.
> > + *
> > + * Furthermore it is used to cache evicted GEM objects for a certain GPU-VM to
> > + * accelerate validation.
> > + *
> > + * Typically, drivers want to create an instance of a struct drm_gpuva_gem once
> > + * a GEM object is mapped first in a GPU-VM and release the instance once the
> > + * last mapping of the GEM object in this GPU-VM is unmapped.
> > + */
> > +struct drm_gpuva_gem {
> > +
> > +	/**
> > +	 * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> > +	 */
> > +	struct drm_gpuva_manager *mgr;
> > +
> > +	/**
> > +	 * @obj: The &drm_gem_object being mapped in the @mgr.
> > +	 */
> > +	struct drm_gem_object *obj;
> > +
> > +	/**
> > +	 * @kref: The reference count for this &drm_gpuva_gem.
> > +	 */
> > +	struct kref kref;
> > +
> > +	/**
> > +	 * @list: Structure containing all &list_heads.
> > +	 */
> > +	struct {
> > +		/**
> > +		 * @gpuva: The list of linked &drm_gpuvas.
> > +		 */
> > +		struct list_head gpuva;
> > +
> > +		/**
> > +		 * @entry: Structure containing all &list_heads serving as
> > +		 * entry.
> > +		 */
> > +		struct {
> > +			/**
> > +			 * @gem: List entry to attach to the &drm_gem_objects
> > +			 * gpuva list.
> > +			 */
> > +			struct list_head gem;
> > +
> > +			/**
> > +			 * @evict: List entry to attach to the
> > +			 * &drm_gpuva_managers evict list.
> > +			 */
> > +			struct list_head evict;
> > +		} entry;
> > +	} list;
> > +};
> > +
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_obtain(struct drm_gpuva_manager *mgr,
> > +		     struct drm_gem_object *obj);
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_obtain_prealloc(struct drm_gpuva_manager *mgr,
> > +			      struct drm_gem_object *obj,
> > +			      struct drm_gpuva_gem *__vm_bo);
> > +
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
> > +		   struct drm_gem_object *obj);
> > +
> > +void drm_gpuva_gem_evict(struct drm_gem_object *obj, bool evict);
> > +
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_create(struct drm_gpuva_manager *mgr,
> > +		     struct drm_gem_object *obj);
> > +void drm_gpuva_gem_destroy(struct kref *kref);
> > +
> > +/**
> > + * drm_gpuva_gem_get() - acquire a struct drm_gpuva_gem reference
> > + * @vm_bo: the &drm_gpuva_gem to acquire the reference of
> > + *
> > + * This function acquires an additional reference to @vm_bo. It is illegal to
> > + * call this without already holding a reference. No locks required.
> > + */
> > +static inline struct drm_gpuva_gem *
> > +drm_gpuva_gem_get(struct drm_gpuva_gem *vm_bo)
> > +{
> > +	kref_get(&vm_bo->kref);
> > +	return vm_bo;
> > +}
> > +
> > +/**
> > + * drm_gpuva_gem_put() - drop a struct drm_gpuva_gem reference
> > + * @vm_bo: the &drm_gpuva_gem to release the reference of
> > + *
> > + * This releases a reference to @vm_bo.
> > + */
> > +static inline void
> > +drm_gpuva_gem_put(struct drm_gpuva_gem *vm_bo)
> > +{
> > +	kref_put(&vm_bo->kref, drm_gpuva_gem_destroy);
> > +}
> > +
> > +/**
> > + * drm_gpuva_gem_for_each_va() - iterator to walk over a list of &drm_gpuva
> > + * @va__: &drm_gpuva structure to assign to in each iteration step
> > + * @vm_bo__: the &drm_gpuva_gem the &drm_gpuva to walk are associated with
> > + *
> > + * This iterator walks over all &drm_gpuva structures associated with the
> > + * &drm_gpuva_gem.
> > + */
> > +#define drm_gpuva_gem_for_each_va(va__, vm_bo__) \
> > +	list_for_each_entry(va__, &(vm_bo__)->list.gpuva, gem.entry)
> > +
> > +/**
> > + * drm_gpuva_gem_for_each_va_safe() - iterator to safely walk over a list of
> > + * &drm_gpuva
> > + * @va__: &drm_gpuva structure to assign to in each iteration step
> > + * @next__: &next &drm_gpuva to store the next step
> > + * @vm_bo__: the &drm_gpuva_gem the &drm_gpuva to walk are associated with
> > + *
> > + * This iterator walks over all &drm_gpuva structures associated with the
> > + * &drm_gpuva_gem. It is implemented with list_for_each_entry_safe(), hence
> > + * it is safe against removal of elements.
> > + */
> > +#define drm_gpuva_gem_for_each_va_safe(va__, next__, vm_bo__) \
> > +	list_for_each_entry_safe(va__, next__, &(vm_bo__)->list.gpuva, gem.entry)
> > +
> >   /**
> >    * enum drm_gpuva_op_type - GPU VA operation type
> >    *
> > @@ -641,6 +899,30 @@ struct drm_gpuva_fn_ops {
> >   	 */
> >   	void (*op_free)(struct drm_gpuva_op *op);
> > +	/**
> > +	 * @vm_bo_alloc: called when the &drm_gpuva_manager allocates
> > +	 * a struct drm_gpuva_gem
> > +	 *
> > +	 * Some drivers may want to embed struct drm_gpuva_gem into driver
> > +	 * specific structures. By implementing this callback drivers can
> > +	 * allocate memory accordingly.
> > +	 *
> > +	 * This callback is optional.
> > +	 */
> > +	struct drm_gpuva_gem *(*vm_bo_alloc)(void);
> > +
> > +	/**
> > +	 * @vm_bo_free: called when the &drm_gpuva_manager frees a
> > +	 * struct drm_gpuva_gem
> > +	 *
> > +	 * Some drivers may want to embed struct drm_gpuva_gem into driver
> > +	 * specific structures. By implementing this callback drivers can
> > +	 * free the previously allocated memory accordingly.
> > +	 *
> > +	 * This callback is optional.
> > +	 */
> > +	void (*vm_bo_free)(struct drm_gpuva_gem *vm_bo);
> > +
> >   	/**
> >   	 * @sm_step_map: called from &drm_gpuva_sm_map to finally insert the
> >   	 * mapping once all previous steps were completed
> > @@ -684,6 +966,17 @@ struct drm_gpuva_fn_ops {
> >   	 * used.
> >   	 */
> >   	int (*sm_step_unmap)(struct drm_gpuva_op *op, void *priv);
> > +
> > +	/**
> > +	 * @bo_validate: called from drm_gpuva_manager_validate()
> > +	 *
> > +	 * Drivers receive this callback for every evicted &drm_gem_object being
> > +	 * mapped in the corresponding &drm_gpuva_manager.
> > +	 *
> > +	 * Typically, drivers would call their driver specific variant of
> > +	 * ttm_bo_validate() from within this callback.
> > +	 */
> > +	int (*bo_validate)(struct drm_gem_object *obj);
> >   };
> >   int drm_gpuva_sm_map(struct drm_gpuva_manager *mgr, void *priv,
> > @@ -696,11 +989,18 @@ int drm_gpuva_sm_unmap(struct drm_gpuva_manager *mgr, void *priv,
> >   void drm_gpuva_map(struct drm_gpuva_manager *mgr,
> >   		   struct drm_gpuva *va,
> >   		   struct drm_gpuva_op_map *op);
> > +void drm_gpuva_map_get(struct drm_gpuva_manager *mgr,
> > +		       struct drm_gpuva *va,
> > +		       struct drm_gpuva_op_map *op);
> >   void drm_gpuva_remap(struct drm_gpuva *prev,
> >   		     struct drm_gpuva *next,
> >   		     struct drm_gpuva_op_remap *op);
> > +void drm_gpuva_remap_get(struct drm_gpuva *prev,
> > +			 struct drm_gpuva *next,
> > +			 struct drm_gpuva_op_remap *op);
> >   void drm_gpuva_unmap(struct drm_gpuva_op_unmap *op);
> > +void drm_gpuva_unmap_put(struct drm_gpuva_op_unmap *op);
> >   #endif /* __DRM_GPUVA_MGR_H__ */
> 


^ permalink raw reply	[flat|nested] 88+ messages in thread

* Re: [Nouveau] [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
  2023-08-30  7:48     ` Christian König
  (?)
@ 2023-08-30 13:05       ` Danilo Krummrich
  -1 siblings, 0 replies; 88+ messages in thread
From: Danilo Krummrich @ 2023-08-30 13:05 UTC (permalink / raw)
  To: Christian König
  Cc: matthew.brost, thomas.hellstrom, sarah.walker, nouveau,
	dri-devel, linux-kernel, Liam.Howlett, boris.brezillon,
	donald.robson, daniel, faith.ekstrand, bskeggs

On Wed, Aug 30, 2023 at 09:48:02AM +0200, Christian König wrote:
> 
> 
> Am 20.08.23 um 23:53 schrieb Danilo Krummrich:
> > So far the DRM GPUVA manager offers common infrastructure to track GPU VA
> > allocations and mappings, generically connect GPU VA mappings to their
> > backing buffers and perform more complex mapping operations on the GPU VA
> > space.
> > 
> > However, there are more design patterns commonly used by drivers, which
> > can potentially be generalized in order to make the DRM GPUVA manager
> > represent a basic GPU-VM implementation. In this context, this patch aims
> > at generalizing the following elements.
> > 
> > 1) Provide a common dma-resv for GEM objects not being used outside of
> >     this GPU-VM.
> > 
> > 2) Provide tracking of external GEM objects (GEM objects which are
> >     shared with other GPU-VMs).
> > 
> > 3) Provide functions to efficiently lock all GEM objects dma-resv the
> >     GPU-VM contains mappings of.
> > 
> > 4) Provide tracking of evicted GEM objects the GPU-VM contains mappings
> >     of, such that validation of evicted GEM objects is accelerated.
> > 
> > 5) Provide some convenience functions for common patterns.
> 
> Interesting work.
> 
> You basically implement a bunch of the ideas I came up with to improve the amdgpu
> performance in the common manager now. That was one of the remaining blockers
> I had for using this in amdgpu.
> 
> Question is for example how do you track evictions? E.g. we don't have a
> common concept of eviction in GEM as far as I know. Or is the driver
> responsible for giving those notifications to the GPUVA manager?

Right, it is the driver's responsibility to add a drm_gpuva_gem (or VM_BO)
to the manager's evict list.

The idea was that drivers have control over the state of a drm_gpuva_gem, such
that a driver can move it to driver specific lists as well, like all the ones
you have in amdgpu.
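
Just to illustrate the driver side (sketch only, the hook name is made up; for
a TTM based driver this would typically be driven from its move / eviction
path):

	/* Called whenever the backing storage of @obj is evicted or becomes
	 * resident again. The BOs dma-resv (and, if used, the driver specific
	 * GEM gpuva lock) must be held, as required by drm_gpuva_gem_evict().
	 */
	static void driver_bo_evict_notify(struct drm_gem_object *obj, bool evicted)
	{
		drm_gpuva_gem_evict(obj, evicted);
	}

drm_gpuva_manager_validate() then picks those BOs up again through the managers
evict list and hands them to the bo_validate() callback.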

> 
> And would it be possible to lock only a specific area of the VM, e.g. every
> BO mapped in the interval X..Y?

Currently, the drm_gpuva_manager_lock() functions always lock the GPU-VMs
dma-resv lock, plus all the dma-resv locks of the external objects the manager
keeps track of.

But surely, we could also add something like drm_gpuva_manager_lock_range()
where we just iterate all drm_gpuvas between X and Y and lock the dma-resv
locks of each drm_gpuva's backing BO.
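
Something like the following would probably do it (completely untested sketch,
assuming the already existing drm_gpuva_for_each_va_range() iterator; sparse
mappings without a backing BO are simply skipped and duplicates are covered by
DRM_EXEC_IGNORE_DUPLICATES):

int
drm_gpuva_manager_lock_range(struct drm_gpuva_manager *mgr,
			     u64 addr, u64 range,
			     unsigned int num_fences,
			     bool interruptible)
{
	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
	struct drm_gpuva *va;
	int ret;

	drm_exec_init(exec, (interruptible ? DRM_EXEC_INTERRUPTIBLE_WAIT : 0) |
			    DRM_EXEC_IGNORE_DUPLICATES);

	drm_exec_until_all_locked(exec) {
		drm_gpuva_for_each_va_range(va, mgr, addr, addr + range) {
			if (!va->gem.obj)
				continue;

			ret = drm_exec_prepare_obj(exec, va->gem.obj, num_fences);
			drm_exec_retry_on_contention(exec);
			if (ret)
				goto err;
		}
	}

	return 0;

err:
	drm_exec_fini(exec);
	return ret;
}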

> 
> Regards,
> Christian.
> 
> > 
> > Rather than being designed as a "framework", the target is to make all
> > features appear as a collection of optional helper functions, such that
> > drivers are free to make use of the DRM GPUVA managers basic
> > functionality and opt-in for other features without setting any feature
> > flags, just by making use of the corresponding functions.
> > 
> > Signed-off-by: Danilo Krummrich <dakr@redhat.com>
> > ---
> >   drivers/gpu/drm/drm_gpuva_mgr.c | 688 +++++++++++++++++++++++++++++++-
> >   include/drm/drm_gem.h           |  48 ++-
> >   include/drm/drm_gpuva_mgr.h     | 302 +++++++++++++-
> >   3 files changed, 1010 insertions(+), 28 deletions(-)
> > 
> > diff --git a/drivers/gpu/drm/drm_gpuva_mgr.c b/drivers/gpu/drm/drm_gpuva_mgr.c
> > index f86bfad74ff8..69872b205961 100644
> > --- a/drivers/gpu/drm/drm_gpuva_mgr.c
> > +++ b/drivers/gpu/drm/drm_gpuva_mgr.c
> > @@ -655,6 +655,7 @@ drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
> >   /**
> >    * drm_gpuva_manager_init() - initialize a &drm_gpuva_manager
> >    * @mgr: pointer to the &drm_gpuva_manager to initialize
> > + * @drm: the drivers &drm_device
> >    * @name: the name of the GPU VA space
> >    * @start_offset: the start offset of the GPU VA space
> >    * @range: the size of the GPU VA space
> > @@ -669,6 +670,7 @@ drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
> >    */
> >   void
> >   drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> > +		       struct drm_device *drm,
> >   		       const char *name,
> >   		       u64 start_offset, u64 range,
> >   		       u64 reserve_offset, u64 reserve_range,
> > @@ -677,6 +679,11 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> >   	mgr->rb.tree = RB_ROOT_CACHED;
> >   	INIT_LIST_HEAD(&mgr->rb.list);
> > +	mt_init(&mgr->mt_ext);
> > +
> > +	INIT_LIST_HEAD(&mgr->evict.list);
> > +	spin_lock_init(&mgr->evict.lock);
> > +
> >   	drm_gpuva_check_overflow(start_offset, range);
> >   	mgr->mm_start = start_offset;
> >   	mgr->mm_range = range;
> > @@ -694,6 +701,9 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> >   						     reserve_range)))
> >   			__drm_gpuva_insert(mgr, &mgr->kernel_alloc_node);
> >   	}
> > +
> > +	drm_gem_private_object_init(drm, &mgr->d_obj, 0);
> > +	mgr->resv = mgr->d_obj.resv;
> >   }
> >   EXPORT_SYMBOL_GPL(drm_gpuva_manager_init);
> > @@ -713,10 +723,575 @@ drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr)
> >   		__drm_gpuva_remove(&mgr->kernel_alloc_node);
> >   	WARN(!RB_EMPTY_ROOT(&mgr->rb.tree.rb_root),
> > -	     "GPUVA tree is not empty, potentially leaking memory.");
> > +	     "GPUVA tree is not empty, potentially leaking memory.\n");
> > +
> > +	mtree_destroy(&mgr->mt_ext);
> > +	WARN(!list_empty(&mgr->evict.list), "Evict list should be empty.\n");
> > +
> > +	drm_gem_private_object_fini(&mgr->d_obj);
> >   }
> >   EXPORT_SYMBOL_GPL(drm_gpuva_manager_destroy);
> > +/**
> > + * drm_gpuva_manager_prepare_objects() - prepare all associated BOs
> > + * @mgr: the &drm_gpuva_manager
> > + * @num_fences: the amount of &dma_fences to reserve
> > + *
> > + * Calls drm_exec_prepare_obj() for all &drm_gem_objects the given
> > + * &drm_gpuva_manager contains mappings of.
> > + *
> > + * Drivers can obtain the corresponding &drm_exec instance through
> > + * DRM_GPUVA_EXEC(). It is the driver's responsibility to call drm_exec_init()
> > + * and drm_exec_fini() accordingly.
> > + *
> > + * Returns: 0 on success, negative error code on failure.
> > + */
> > +int
> > +drm_gpuva_manager_prepare_objects(struct drm_gpuva_manager *mgr,
> > +				  unsigned int num_fences)
> > +{
> > +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
> > +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
> > +	union {
> > +		void *ptr;
> > +		uintptr_t cnt;
> > +	} ref;
> > +	int ret;
> > +
> > +	ret = drm_exec_prepare_obj(exec, &mgr->d_obj, num_fences);
> > +	if (ret)
> > +		goto out;
> > +
> > +	rcu_read_lock();
> > +	mas_for_each(&mas, ref.ptr, ULONG_MAX) {
> > +		struct drm_gem_object *obj;
> > +
> > +		mas_pause(&mas);
> > +		rcu_read_unlock();
> > +
> > +		obj = (struct drm_gem_object *)(uintptr_t)mas.index;
> > +		ret = drm_exec_prepare_obj(exec, obj, num_fences);
> > +		if (ret)
> > +			goto out;
> > +
> > +		rcu_read_lock();
> > +	}
> > +	rcu_read_unlock();
> > +
> > +out:
> > +	return ret;
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_manager_prepare_objects);
> > +
> > +/**
> > + * drm_gpuva_manager_lock_extra() - lock all dma-resv of all associated BOs
> > + * @mgr: the &drm_gpuva_manager
> > + * @fn: callback received by the driver to lock additional dma-resv
> > + * @priv: private driver data passed to @fn
> > + * @num_fences: the amount of &dma_fences to reserve
> > + * @interruptible: sleep interruptible if waiting
> > + *
> > + * Acquires all dma-resv locks of all &drm_gem_objects the given
> > + * &drm_gpuva_manager contains mappings of.
> > + *
> > + * Additionally, when calling this function the driver receives the given @fn
> > + * callback to lock additional dma-resv in the context of the
> > + * &drm_gpuva_managers &drm_exec instance. Typically, drivers would call
> > + * drm_exec_prepare_obj() from within this callback.
> > + *
> > + * Returns: 0 on success, negative error code on failure.
> > + */
> > +int
> > +drm_gpuva_manager_lock_extra(struct drm_gpuva_manager *mgr,
> > +			     int (*fn)(struct drm_gpuva_manager *mgr,
> > +				       void *priv, unsigned int num_fences),
> > +			     void *priv,
> > +			     unsigned int num_fences,
> > +			     bool interruptible)
> > +{
> > +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
> > +	uint32_t flags;
> > +	int ret;
> > +
> > +	flags = (interruptible ? DRM_EXEC_INTERRUPTIBLE_WAIT : 0) |
> > +		DRM_EXEC_IGNORE_DUPLICATES;
> > +
> > +	drm_exec_init(exec, flags);
> > +
> > +	drm_exec_until_all_locked(exec) {
> > +		ret = drm_gpuva_manager_prepare_objects(mgr, num_fences);
> > +		drm_exec_retry_on_contention(exec);
> > +		if (ret)
> > +			goto err;
> > +
> > +		if (fn) {
> > +			ret = fn(mgr, priv, num_fences);
> > +			drm_exec_retry_on_contention(exec);
> > +			if (ret)
> > +				goto err;
> > +		}
> > +	}
> > +
> > +	return 0;
> > +
> > +err:
> > +	drm_exec_fini(exec);
> > +	return ret;
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_manager_lock_extra);
> > +
> > +static int
> > +fn_lock_array(struct drm_gpuva_manager *mgr, void *priv,
> > +				unsigned int num_fences)
> > +{
> > +	struct {
> > +		struct drm_gem_object **objs;
> > +		unsigned int num_objs;
> > +	} *args = priv;
> > +
> > +	return drm_exec_prepare_array(DRM_GPUVA_EXEC(mgr), args->objs,
> > +				      args->num_objs, num_fences);
> > +}
> > +
> > +/**
> > + * drm_gpuva_manager_lock_array() - lock all dma-resv of all associated BOs
> > + * @mgr: the &drm_gpuva_manager
> > + * @objs: additional &drm_gem_objects to lock
> > + * @num_objs: the number of additional &drm_gem_objects to lock
> > + * @num_fences: the amount of &dma_fences to reserve
> > + * @interruptible: sleep interruptible if waiting
> > + *
> > + * Acquires all dma-resv locks of all &drm_gem_objects the given
> > + * &drm_gpuva_manager contains mappings of, plus the ones given through @objs.
> > + *
> > + * Returns: 0 on success, negative error code on failure.
> > + */
> > +int
> > +drm_gpuva_manager_lock_array(struct drm_gpuva_manager *mgr,
> > +			     struct drm_gem_object **objs,
> > +			     unsigned int num_objs,
> > +			     unsigned int num_fences,
> > +			     bool interruptible)
> > +{
> > +	struct {
> > +		struct drm_gem_object **objs;
> > +		unsigned int num_objs;
> > +	} args;
> > +
> > +	args.objs = objs;
> > +	args.num_objs = num_objs;
> > +
> > +	return drm_gpuva_manager_lock_extra(mgr, fn_lock_array, &args,
> > +					    num_fences, interruptible);
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_manager_lock_array);
> > +
> > +/**
> > + * drm_gpuva_manager_validate() - validate all BOs marked as evicted
> > + * @mgr: the &drm_gpuva_manager to validate evicted BOs
> > + *
> > + * Calls the &drm_gpuva_fn_ops.bo_validate callback for all evicted buffer
> > + * objects being mapped in the given &drm_gpuva_manager.
> > + *
> > + * Returns: 0 on success, negative error code on failure.
> > + */
> > +int
> > +drm_gpuva_manager_validate(struct drm_gpuva_manager *mgr)
> > +{
> > +	const struct drm_gpuva_fn_ops *ops = mgr->ops;
> > +	struct drm_gpuva_gem *vm_bo;
> > +	int ret;
> > +
> > +	if (unlikely(!ops || !ops->bo_validate))
> > +		return -ENOTSUPP;
> > +
> > +	/* At this point we should hold all dma-resv locks of all GEM objects
> > +	 * associated with this GPU-VM, hence it is safe to walk the list.
> > +	 */
> > +	list_for_each_entry(vm_bo, &mgr->evict.list, list.entry.evict) {
> > +		dma_resv_assert_held(vm_bo->obj->resv);
> > +
> > +		ret = ops->bo_validate(vm_bo->obj);
> > +		if (ret)
> > +			return ret;
> > +	}
> > +
> > +	return 0;
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_manager_validate);
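
For illustration, a TTM based driver's hook for this would typically just be a
thin wrapper around its existing validation helper (sketch only, the my_* names
are made up):

	static int my_gpuva_bo_validate(struct drm_gem_object *obj)
	{
		struct my_bo *bo = to_my_bo(obj);

		/* E.g. a wrapper around ttm_bo_validate() with the
		 * driver's preferred placement.
		 */
		return my_bo_validate(bo);
	}

	static const struct drm_gpuva_fn_ops my_gpuva_ops = {
		/* ... sm_step_*, vm_bo_alloc/free ... */
		.bo_validate = my_gpuva_bo_validate,
	};
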
> > +
> > +/**
> > + * drm_gpuva_manager_resv_add_fence - add fence to private and all extobj
> > + * dma-resv
> > + * @mgr: the &drm_gpuva_manager to add a fence to
> > + * @fence: fence to add
> > + * @private_usage: private dma-resv usage
> > + * @extobj_usage: extobj dma-resv usage
> > + */
> > +void
> > +drm_gpuva_manager_resv_add_fence(struct drm_gpuva_manager *mgr,
> > +				 struct dma_fence *fence,
> > +				 enum dma_resv_usage private_usage,
> > +				 enum dma_resv_usage extobj_usage)
> > +{
> > +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
> > +	struct drm_gem_object *obj;
> > +	unsigned long index;
> > +
> > +	drm_exec_for_each_locked_object(exec, index, obj) {
> > +			dma_resv_assert_held(obj->resv);
> > +			dma_resv_add_fence(obj->resv, fence,
> > +					   drm_gpuva_is_extobj(mgr, obj) ?
> > +					   extobj_usage : private_usage);
> > +	}
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_manager_resv_add_fence);
> > +
> > +static struct drm_gpuva_gem *
> > +__drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
> > +		     struct drm_gem_object *obj)
> > +{
> > +	struct drm_gpuva_gem *vm_bo;
> > +
> > +	drm_gem_gpuva_assert_lock_held(obj);
> > +
> > +	drm_gem_for_each_gpuva_gem(vm_bo, obj)
> > +		if (vm_bo->mgr == mgr)
> > +			return vm_bo;
> > +
> > +	return NULL;
> > +}
> > +
> > +/**
> > + * drm_gpuva_gem_create() - create a new instance of struct drm_gpuva_gem
> > + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> > + * @obj: The &drm_gem_object being mapped in the @mgr.
> > + *
> > + * If provided by the driver, this function uses the &drm_gpuva_fn_ops
> > + * vm_bo_alloc() callback to allocate.
> > + *
> > + * Returns: a pointer to the &drm_gpuva_gem on success, NULL on failure
> > + */
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_create(struct drm_gpuva_manager *mgr,
> > +		     struct drm_gem_object *obj)
> > +{
> > +	const struct drm_gpuva_fn_ops *ops = mgr->ops;
> > +	struct drm_gpuva_gem *vm_bo;
> > +
> > +	if (ops && ops->vm_bo_alloc)
> > +		vm_bo = ops->vm_bo_alloc();
> > +	else
> > +		vm_bo = kzalloc(sizeof(*vm_bo), GFP_KERNEL);
> > +
> > +	if (unlikely(!vm_bo))
> > +		return NULL;
> > +
> > +	vm_bo->mgr = mgr;
> > +	vm_bo->obj = obj;
> > +
> > +	kref_init(&vm_bo->kref);
> > +	INIT_LIST_HEAD(&vm_bo->list.gpuva);
> > +	INIT_LIST_HEAD(&vm_bo->list.entry.gem);
> > +	INIT_LIST_HEAD(&vm_bo->list.entry.evict);
> > +
> > +	drm_gem_object_get(obj);
> > +
> > +	return vm_bo;
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_create);
> > +
> > +void
> > +drm_gpuva_gem_destroy(struct kref *kref)
> > +{
> > +	struct drm_gpuva_gem *vm_bo = container_of(kref, struct drm_gpuva_gem,
> > +						   kref);
> > +	const struct drm_gpuva_fn_ops *ops = vm_bo->mgr->ops;
> > +
> > +	drm_gem_object_put(vm_bo->obj);
> > +
> > +	if (ops && ops->vm_bo_free)
> > +		ops->vm_bo_free(vm_bo);
> > +	else
> > +		kfree(vm_bo);
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_destroy);
> > +
> > +/**
> > + * drm_gpuva_gem_find() - find the &drm_gpuva_gem for the given
> > + * &drm_gpuva_manager and &drm_gem_object
> > + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> > + * @obj: The &drm_gem_object being mapped in the @mgr.
> > + *
> > + * Find the &drm_gpuva_gem representing the combination of the given
> > + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
> > + * count of the &drm_gpuva_gem accordingly.
> > + *
> > + * Returns: a pointer to the &drm_gpuva_gem on success, NULL on failure
> > + */
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
> > +		   struct drm_gem_object *obj)
> > +{
> > +	struct drm_gpuva_gem *vm_bo = __drm_gpuva_gem_find(mgr, obj);
> > +
> > +	return vm_bo ? drm_gpuva_gem_get(vm_bo) : NULL;
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_find);
> > +
> > +/**
> > + * drm_gpuva_gem_obtain() - obtains an instance of the &drm_gpuva_gem for the
> > + * given &drm_gpuva_manager and &drm_gem_object
> > + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> > + * @obj: The &drm_gem_object being mapped in the @mgr.
> > + *
> > + * Find the &drm_gpuva_gem representing the combination of the given
> > + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
> > + * count of the &drm_gpuva_gem accordingly. If not found, allocates a new
> > + * &drm_gpuva_gem.
> > + *
> > + * Returns: a pointer to the &drm_gpuva_gem on success, an ERR_PTR on failure
> > + */
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_obtain(struct drm_gpuva_manager *mgr,
> > +		     struct drm_gem_object *obj)
> > +{
> > +	struct drm_gpuva_gem *vm_bo;
> > +
> > +	vm_bo = drm_gpuva_gem_find(mgr, obj);
> > +	if (vm_bo)
> > +		return vm_bo;
> > +
> > +	vm_bo = drm_gpuva_gem_create(mgr, obj);
> > +	if (!vm_bo)
> > +		return ERR_PTR(-ENOMEM);
> > +
> > +	return vm_bo;
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_obtain);
> > +
> > +/**
> > + * drm_gpuva_gem_obtain_prealloc() - obtains an instance of the &drm_gpuva_gem
> > + * for the given &drm_gpuva_manager and &drm_gem_object
> > + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> > + * @obj: The &drm_gem_object being mapped in the @mgr.
> > + * @__vm_bo: A pre-allocated &drm_gpuva_gem to use if no existing one is found.
> > + *
> > + * Find the &drm_gpuva_gem representing the combination of the given
> > + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
> > + * count of the found &drm_gpuva_gem accordingly, while the @__vm_bo reference
> > + * count is decreased. If not found @__vm_bo is returned.
> > + *
> > + * Returns: a pointer to the found &drm_gpuva_gem or @__vm_bo if no existing
> > + * &drm_gpuva_gem was found
> > + */
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_obtain_prealloc(struct drm_gpuva_manager *mgr,
> > +			      struct drm_gem_object *obj,
> > +			      struct drm_gpuva_gem *__vm_bo)
> > +{
> > +	struct drm_gpuva_gem *vm_bo;
> > +
> > +	vm_bo = drm_gpuva_gem_find(mgr, obj);
> > +	if (vm_bo) {
> > +		drm_gpuva_gem_put(__vm_bo);
> > +		return vm_bo;
> > +	}
> > +
> > +	return __vm_bo;
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_obtain_prealloc);
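
The intended calling pattern for the pre-allocated variant would be roughly the
following (sketch only; my_* is hypothetical driver code, locking and page
table work are only hinted at in comments): the &drm_gpuva_gem is allocated up
front, outside of the dma-fence signalling critical path, and either taken over
or dropped under the GEM's GPUVA lock.

	static int my_map_bo(struct drm_gpuva_manager *mgr,
			     struct drm_gem_object *obj,
			     struct drm_gpuva *va)
	{
		struct drm_gpuva_gem *vm_bo;
		int ret;

		/* May allocate; do this before taking any locks. */
		vm_bo = drm_gpuva_gem_create(mgr, obj);
		if (!vm_bo)
			return -ENOMEM;

		/* ... lock dma-resvs, program page tables ... */

		ret = drm_gpuva_insert(mgr, va);
		if (ret) {
			drm_gpuva_gem_put(vm_bo);
			return ret;
		}

		/* Requires the GEM's GPUVA lock; either adopts an already
		 * existing vm_bo (dropping the pre-allocated one) or uses
		 * the pre-allocated one.
		 */
		vm_bo = drm_gpuva_gem_obtain_prealloc(mgr, obj, vm_bo);

		/* drm_gpuva_link() takes its own vm_bo reference. */
		drm_gpuva_link(va, vm_bo);
		drm_gpuva_gem_put(vm_bo);

		return 0;
	}
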
> > +
> > +static int
> > +__drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
> > +			  struct drm_gem_object *obj,
> > +			  gfp_t gfp)
> > +{
> > +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
> > +	union {
> > +		struct drm_gem_object *obj;
> > +		uintptr_t index;
> > +	} gem;
> > +	union {
> > +		void *ptr;
> > +		uintptr_t cnt;
> > +	} ref;
> > +	int ret = 0;
> > +
> > +	gem.obj = obj;
> > +	mas_set(&mas, gem.index);
> > +
> > +	mas_lock(&mas);
> > +	ref.ptr = mas_walk(&mas);
> > +	if (ref.ptr) {
> > +		++ref.cnt;
> > +		mas_store(&mas, ref.ptr);
> > +	} else {
> > +		if (unlikely(!gfp)) {
> > +			ret = -EINVAL;
> > +			goto out;
> > +		}
> > +
> > +		mas_set(&mas, gem.index);
> > +		ref.cnt = 1;
> > +		ret = mas_store_gfp(&mas, ref.ptr, gfp);
> > +		if (likely(!ret))
> > +			drm_gem_object_get(obj);
> > +	}
> > +out:
> > +	mas_unlock(&mas);
> > +	return ret;
> > +}
> > +
> > +static void
> > +__drm_gpuva_extobj_remove(struct drm_gpuva_manager *mgr,
> > +			  struct drm_gem_object *obj)
> > +{
> > +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
> > +	union {
> > +		struct drm_gem_object *obj;
> > +		uintptr_t index;
> > +	} gem;
> > +	union {
> > +		void *ptr;
> > +		uintptr_t cnt;
> > +	} ref;
> > +
> > +	gem.obj = obj;
> > +	mas_set(&mas, gem.index);
> > +
> > +	mas_lock(&mas);
> > +	if (unlikely(!(ref.ptr = mas_walk(&mas))))
> > +		goto out;
> > +
> > +	if (!--ref.cnt) {
> > +		mas_erase(&mas);
> > +		drm_gem_object_put(obj);
> > +	} else {
> > +		mas_store(&mas, ref.ptr);
> > +	}
> > +out:
> > +	mas_unlock(&mas);
> > +}
> > +
> > +/**
> > + * drm_gpuva_extobj_insert - insert an external &drm_gem_object
> > + * @mgr: the &drm_gpuva_manager to insert into
> > + * @obj: the &drm_gem_object to insert as extobj
> > + *
> > + * Insert a &drm_gem_object into the &drm_gpuva_managers external object tree.
> > + * If the &drm_gem_object already exists in the tree, the reference counter
> > + * of this external object is increased by one.
> > + *
> > + * Drivers should insert the external &drm_gem_object before the dma-fence
> > + * signalling critical section, e.g. when submitting the job, and before
> > + * locking all &drm_gem_objects of a GPU-VM, e.g. with drm_gpuva_manager_lock()
> > + * or its derivatives.
> > + *
> > + * Returns: 0 on success, negative error code on failure.
> > + */
> > +int
> > +drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
> > +			struct drm_gem_object *obj)
> > +{
> > +	return drm_gpuva_is_extobj(mgr, obj) ?
> > +		__drm_gpuva_extobj_insert(mgr, obj, GFP_KERNEL) : 0;
> > +
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_insert);
> > +
> > +/**
> > + * drm_gpuva_extobj_get - increase the reference count of an external
> > + * &drm_gem_object
> > + * @mgr: the &drm_gpuva_manager storing the extobj
> > + * @obj: the &drm_gem_object representing the extobj
> > + *
> > + * Increases the reference count of the extobj represented by @obj.
> > + *
> > + * Drivers should call this for every &drm_gpuva backed by a &drm_gem_object
> > + * being inserted.
> > + *
> > + * For &drm_gpuva_op_remap operations drivers should make sure to only take an
> > + * additional reference if the re-map operation splits an existing &drm_gpuva
> > + * into two separate ones.
> > + *
> > + * See also drm_gpuva_map_get() and drm_gpuva_remap_get().
> > + */
> > +void
> > +drm_gpuva_extobj_get(struct drm_gpuva_manager *mgr,
> > +		     struct drm_gem_object *obj)
> > +{
> > +	if (drm_gpuva_is_extobj(mgr, obj))
> > +		WARN(__drm_gpuva_extobj_insert(mgr, obj, 0),
> > +		     "Can't increase ref-count of non-existent extobj.");
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_get);
> > +
> > +/**
> > + * drm_gpuva_extobj_put - decrease the reference count of an external
> > + * &drm_gem_object
> > + * @mgr: the &drm_gpuva_manager storing the extobj
> > + * @obj: the &drm_gem_object representing the extobj
> > + *
> > + * Decreases the reference count of the extobj represented by @obj.
> > + *
> > + * Drivers should call this for every &drm_gpuva backed by a &drm_gem_object
> > + * being removed from the GPU VA space.
> > + *
> > + * See also drm_gpuva_unmap_put().
> > + */
> > +void
> > +drm_gpuva_extobj_put(struct drm_gpuva_manager *mgr,
> > +		     struct drm_gem_object *obj)
> > +{
> > +	if (drm_gpuva_is_extobj(mgr, obj))
> > +		__drm_gpuva_extobj_remove(mgr, obj);
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_put);
> > +
> > +/**
> > + * drm_gpuva_gem_evict() - add / remove a &drm_gem_object to / from a
> > + * &drm_gpuva_managers evicted list
> > + * @obj: the &drm_gem_object to add or remove
> > + * @evict: indicates whether the object is evicted
> > + *
> > + * Adds a &drm_gem_object to or removes it from all &drm_gpuva_managers evicted
> > + * list containing a mapping of this &drm_gem_object.
> > + */
> > +void
> > +drm_gpuva_gem_evict(struct drm_gem_object *obj, bool evict)
> > +{
> > +	struct drm_gpuva_gem *vm_bo;
> > +
> > +	/* Required for iterating the GEMs GPUVA GEM list. If no driver specific
> > +	 * lock has been set, the list is protected with the GEMs dma-resv lock.
> > +	 */
> > +	drm_gem_gpuva_assert_lock_held(obj);
> > +
> > +	/* Required to protect the GPUVA managers evict list against concurrent
> > +	 * access through drm_gpuva_manager_validate(). Concurrent insertions to
> > +	 * the evict list through different GEM object evictions are protected
> > +	 * by the GPUVA managers evict lock.
> > +	 */
> > +	dma_resv_assert_held(obj->resv);
> > +
> > +	drm_gem_for_each_gpuva_gem(vm_bo, obj) {
> > +		struct drm_gpuva_manager *mgr = vm_bo->mgr;
> > +
> > +		spin_lock(&mgr->evict.lock);
> > +		if (evict)
> > +			list_add_tail(&vm_bo->list.entry.evict,
> > +				      &mgr->evict.list);
> > +		else
> > +			list_del_init(&vm_bo->list.entry.evict);
> > +		spin_unlock(&mgr->evict.lock);
> > +	}
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_evict);
> > +
> >   static int
> >   __drm_gpuva_insert(struct drm_gpuva_manager *mgr,
> >   		   struct drm_gpuva *va)
> > @@ -806,15 +1381,20 @@ EXPORT_SYMBOL_GPL(drm_gpuva_remove);
> >   /**
> >    * drm_gpuva_link() - link a &drm_gpuva
> >    * @va: the &drm_gpuva to link
> > + * @vm_bo: the &drm_gpuva_gem to add the &drm_gpuva to
> >    *
> > - * This adds the given &va to the GPU VA list of the &drm_gem_object it is
> > - * associated with.
> > + * This adds the given &va to the GPU VA list of the &drm_gpuva_gem and the
> > + * &drm_gpuva_gem to the &drm_gem_object it is associated with.
> > + *
> > + * For every &drm_gpuva entry added to the &drm_gpuva_gem an additional
> > + * reference of the latter is taken.
> >    *
> >    * This function expects the caller to protect the GEM's GPUVA list against
> > - * concurrent access using the GEMs dma_resv lock.
> > + * concurrent access using either the GEMs dma_resv lock or a driver specific
> > + * lock set through drm_gem_gpuva_set_lock().
> >    */
> >   void
> > -drm_gpuva_link(struct drm_gpuva *va)
> > +drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo)
> >   {
> >   	struct drm_gem_object *obj = va->gem.obj;
> > @@ -823,7 +1403,10 @@ drm_gpuva_link(struct drm_gpuva *va)
> >   	drm_gem_gpuva_assert_lock_held(obj);
> > -	list_add_tail(&va->gem.entry, &obj->gpuva.list);
> > +	drm_gpuva_gem_get(vm_bo);
> > +	list_add_tail(&va->gem.entry, &vm_bo->list.gpuva);
> > +	if (list_empty(&vm_bo->list.entry.gem))
> > +		list_add_tail(&vm_bo->list.entry.gem, &obj->gpuva.list);
> >   }
> >   EXPORT_SYMBOL_GPL(drm_gpuva_link);
> > @@ -834,20 +1417,39 @@ EXPORT_SYMBOL_GPL(drm_gpuva_link);
> >    * This removes the given &va from the GPU VA list of the &drm_gem_object it is
> >    * associated with.
> >    *
> > + * This removes the given &va from the GPU VA list of the &drm_gpuva_gem and
> > + * the &drm_gpuva_gem from the &drm_gem_object it is associated with in case
> > + * this call unlinks the last &drm_gpuva from the &drm_gpuva_gem.
> > + *
> > + * For every &drm_gpuva entry removed from the &drm_gpuva_gem a reference of
> > + * the latter is dropped.
> > + *
> >    * This function expects the caller to protect the GEM's GPUVA list against
> > - * concurrent access using the GEMs dma_resv lock.
> > + * concurrent access using either the GEMs dma_resv lock or a driver specific
> > + * lock set through drm_gem_gpuva_set_lock().
> >    */
> >   void
> >   drm_gpuva_unlink(struct drm_gpuva *va)
> >   {
> >   	struct drm_gem_object *obj = va->gem.obj;
> > +	struct drm_gpuva_gem *vm_bo;
> >   	if (unlikely(!obj))
> >   		return;
> >   	drm_gem_gpuva_assert_lock_held(obj);
> > +	vm_bo = __drm_gpuva_gem_find(va->mgr, obj);
> > +	if (WARN(!vm_bo, "GPUVA doesn't seem to be linked.\n"))
> > +		return;
> > +
> >   	list_del_init(&va->gem.entry);
> > +
> > +	if (list_empty(&vm_bo->list.gpuva)) {
> > +		list_del_init(&vm_bo->list.entry.gem);
> > +		list_del_init(&vm_bo->list.entry.evict);
> > +	}
> > +	drm_gpuva_gem_put(vm_bo);
> >   }
> >   EXPORT_SYMBOL_GPL(drm_gpuva_unlink);
> > @@ -977,6 +1579,26 @@ drm_gpuva_map(struct drm_gpuva_manager *mgr,
> >   }
> >   EXPORT_SYMBOL_GPL(drm_gpuva_map);
> > +/**
> > + * drm_gpuva_map_get() - helper to insert a &drm_gpuva according to a
> > + * &drm_gpuva_op_map
> > + * @mgr: the &drm_gpuva_manager
> > + * @va: the &drm_gpuva to insert
> > + * @op: the &drm_gpuva_op_map to initialize @va with
> > + *
> > + * Initializes the @va from the @op and inserts it into the given @mgr and
> > + * increases the reference count of the corresponding extobj.
> > + */
> > +void
> > +drm_gpuva_map_get(struct drm_gpuva_manager *mgr,
> > +		  struct drm_gpuva *va,
> > +		  struct drm_gpuva_op_map *op)
> > +{
> > +	drm_gpuva_map(mgr, va, op);
> > +	drm_gpuva_extobj_get(mgr, va->gem.obj);
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_map_get);
> > +
> >   /**
> >    * drm_gpuva_remap() - helper to remap a &drm_gpuva according to a
> >    * &drm_gpuva_op_remap
> > @@ -992,10 +1614,10 @@ drm_gpuva_remap(struct drm_gpuva *prev,
> >   		struct drm_gpuva *next,
> >   		struct drm_gpuva_op_remap *op)
> >   {
> > -	struct drm_gpuva *curr = op->unmap->va;
> > -	struct drm_gpuva_manager *mgr = curr->mgr;
> > +	struct drm_gpuva *va = op->unmap->va;
> > +	struct drm_gpuva_manager *mgr = va->mgr;
> > -	drm_gpuva_remove(curr);
> > +	drm_gpuva_remove(va);
> >   	if (op->prev) {
> >   		drm_gpuva_init_from_op(prev, op->prev);
> > @@ -1009,6 +1631,31 @@ drm_gpuva_remap(struct drm_gpuva *prev,
> >   }
> >   EXPORT_SYMBOL_GPL(drm_gpuva_remap);
> > +/**
> > + * drm_gpuva_remap_get() - helper to remap a &drm_gpuva according to a
> > + * &drm_gpuva_op_remap
> > + * @prev: the &drm_gpuva to remap when keeping the start of a mapping
> > + * @next: the &drm_gpuva to remap when keeping the end of a mapping
> > + * @op: the &drm_gpuva_op_remap to initialize @prev and @next with
> > + *
> > + * Removes the currently mapped &drm_gpuva and remaps it using @prev and/or
> > + * @next. Additionally, if the re-map splits the existing &drm_gpuva into two
> > + * separate mappings, increases the reference count of the corresponding extobj.
> > + */
> > +void
> > +drm_gpuva_remap_get(struct drm_gpuva *prev,
> > +		    struct drm_gpuva *next,
> > +		    struct drm_gpuva_op_remap *op)
> > +{
> > +	struct drm_gpuva *va = op->unmap->va;
> > +	struct drm_gpuva_manager *mgr = va->mgr;
> > +
> > +	drm_gpuva_remap(prev, next, op);
> > +	if (op->prev && op->next)
> > +		drm_gpuva_extobj_get(mgr, va->gem.obj);
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_remap_get);
> > +
> >   /**
> >    * drm_gpuva_unmap() - helper to remove a &drm_gpuva according to a
> >    * &drm_gpuva_op_unmap
> > @@ -1023,6 +1670,24 @@ drm_gpuva_unmap(struct drm_gpuva_op_unmap *op)
> >   }
> >   EXPORT_SYMBOL_GPL(drm_gpuva_unmap);
> > +/**
> > + * drm_gpuva_unmap_put() - helper to remove a &drm_gpuva according to a
> > + * &drm_gpuva_op_unmap
> > + * @op: the &drm_gpuva_op_unmap specifying the &drm_gpuva to remove
> > + *
> > + * Removes the &drm_gpuva associated with the &drm_gpuva_op_unmap and decreases
> > + * the reference count of the corresponding extobj.
> > + */
> > +void
> > +drm_gpuva_unmap_put(struct drm_gpuva_op_unmap *op)
> > +{
> > +	struct drm_gpuva *va = op->va;
> > +
> > +	drm_gpuva_unmap(op);
> > +	drm_gpuva_extobj_put(va->mgr, va->gem.obj);
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_unmap_put);
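
Taken together, a driver using the extobj ref-counting would typically switch
its sm_step callbacks over to the _get()/_put() variants, roughly like this
(sketch only; the my_* types are made up and page table handling is omitted):

	static int my_sm_step_map(struct drm_gpuva_op *op, void *priv)
	{
		struct my_map_ctx *ctx = priv;

		/* Inserts the new mapping and takes an extobj reference. */
		drm_gpuva_map_get(ctx->mgr, ctx->new_va, &op->map);
		drm_gpuva_link(ctx->new_va, ctx->vm_bo);

		return 0;
	}

	static int my_sm_step_unmap(struct drm_gpuva_op *op, void *priv)
	{
		struct drm_gpuva *va = op->unmap.va;

		/* Removes the mapping and drops an extobj reference. */
		drm_gpuva_unmap_put(&op->unmap);
		drm_gpuva_unlink(va);

		return 0;
	}
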
> > +
> >   static int
> >   op_map_cb(const struct drm_gpuva_fn_ops *fn, void *priv,
> >   	  u64 addr, u64 range,
> > @@ -1663,6 +2328,7 @@ drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
> >   {
> >   	struct drm_gpuva_ops *ops;
> >   	struct drm_gpuva_op *op;
> > +	struct drm_gpuva_gem *vm_bo;
> >   	struct drm_gpuva *va;
> >   	int ret;
> > @@ -1674,7 +2340,7 @@ drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
> >   	INIT_LIST_HEAD(&ops->list);
> > -	drm_gem_for_each_gpuva(va, obj) {
> > +	drm_gem_for_each_gpuva(va, vm_bo, mgr, obj) {
> >   		op = gpuva_op_alloc(mgr);
> >   		if (!op) {
> >   			ret = -ENOMEM;
> > diff --git a/include/drm/drm_gem.h b/include/drm/drm_gem.h
> > index bc9f6aa2f3fe..783ed3ab440d 100644
> > --- a/include/drm/drm_gem.h
> > +++ b/include/drm/drm_gem.h
> > @@ -571,7 +571,7 @@ int drm_gem_evict(struct drm_gem_object *obj);
> >    * drm_gem_gpuva_init() - initialize the gpuva list of a GEM object
> >    * @obj: the &drm_gem_object
> >    *
> > - * This initializes the &drm_gem_object's &drm_gpuva list.
> > + * This initializes the &drm_gem_object's &drm_gpuva_gem list.
> >    *
> >    * Calling this function is only necessary for drivers intending to support the
> >    * &drm_driver_feature DRIVER_GEM_GPUVA.
> > @@ -584,28 +584,44 @@ static inline void drm_gem_gpuva_init(struct drm_gem_object *obj)
> >   }
> >   /**
> > - * drm_gem_for_each_gpuva() - iternator to walk over a list of gpuvas
> > - * @entry__: &drm_gpuva structure to assign to in each iteration step
> > - * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
> > + * drm_gem_for_each_gpuva_gem() - iterator to walk over a list of &drm_gpuva_gem
> > + * @entry__: &drm_gpuva_gem structure to assign to in each iteration step
> > + * @obj__: the &drm_gem_object the &drm_gpuva_gem to walk are associated with
> >    *
> > - * This iterator walks over all &drm_gpuva structures associated with the
> > - * &drm_gpuva_manager.
> > + * This iterator walks over all &drm_gpuva_gem structures associated with the
> > + * &drm_gem_object.
> >    */
> > -#define drm_gem_for_each_gpuva(entry__, obj__) \
> > -	list_for_each_entry(entry__, &(obj__)->gpuva.list, gem.entry)
> > +#define drm_gem_for_each_gpuva_gem(entry__, obj__) \
> > +	list_for_each_entry(entry__, &(obj__)->gpuva.list, list.entry.gem)
> >   /**
> > - * drm_gem_for_each_gpuva_safe() - iternator to safely walk over a list of
> > - * gpuvas
> > - * @entry__: &drm_gpuva structure to assign to in each iteration step
> > - * @next__: &next &drm_gpuva to store the next step
> > - * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
> > + * drm_gem_for_each_gpuva_gem_safe() - iterator to safely walk over a list of
> > + * &drm_gpuva_gem
> > + * @entry__: &drm_gpuva_gem structure to assign to in each iteration step
> > + * @next__: &next &drm_gpuva_gem to store the next step
> > + * @obj__: the &drm_gem_object the &drm_gpuva_gem to walk are associated with
> >    *
> > - * This iterator walks over all &drm_gpuva structures associated with the
> > + * This iterator walks over all &drm_gpuva_gem structures associated with the
> >    * &drm_gem_object. It is implemented with list_for_each_entry_safe(), hence
> >    * it is save against removal of elements.
> >    */
> > -#define drm_gem_for_each_gpuva_safe(entry__, next__, obj__) \
> > -	list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list, gem.entry)
> > +#define drm_gem_for_each_gpuva_gem_safe(entry__, next__, obj__) \
> > +	list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list, list.entry.gem)
> > +
> > +/**
> > + * drm_gem_for_each_gpuva() - iterator to walk over a list of &drm_gpuva
> > + * @va__: &drm_gpuva structure to assign to in each iteration step
> > + * @vm_bo__: the &drm_gpuva_gem representing the @mgr__ and @obj__ combination
> > + * @mgr__: the &drm_gpuva_manager the &drm_gpuvas to walk are associated with
> > + * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
> > + *
> > + * This iterator walks over all &drm_gpuva structures associated with the
> > + * &drm_gpuva_manager and &drm_gem_object.
> > + */
> > +#define drm_gem_for_each_gpuva(va__, vm_bo__, mgr__, obj__) \
> > +	for (vm_bo__ = drm_gpuva_gem_find(mgr__, obj__), \
> > +	     va__ = vm_bo__ ? list_first_entry(&vm_bo__->list.gpuva, typeof(*va__), gem.entry) : NULL; \
> > +	     va__ && !list_entry_is_head(va__, &vm_bo__->list.gpuva, gem.entry); \
> > +	     va__ = list_next_entry(va__, gem.entry))
> >   #endif /* __DRM_GEM_H__ */
> > diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h
> > index ed8d50200cc3..693e2da3f425 100644
> > --- a/include/drm/drm_gpuva_mgr.h
> > +++ b/include/drm/drm_gpuva_mgr.h
> > @@ -26,12 +26,16 @@
> >    */
> >   #include <linux/list.h>
> > +#include <linux/dma-resv.h>
> > +#include <linux/maple_tree.h>
> >   #include <linux/rbtree.h>
> >   #include <linux/types.h>
> >   #include <drm/drm_gem.h>
> > +#include <drm/drm_exec.h>
> >   struct drm_gpuva_manager;
> > +struct drm_gpuva_gem;
> >   struct drm_gpuva_fn_ops;
> >   /**
> > @@ -140,7 +144,7 @@ struct drm_gpuva {
> >   int drm_gpuva_insert(struct drm_gpuva_manager *mgr, struct drm_gpuva *va);
> >   void drm_gpuva_remove(struct drm_gpuva *va);
> > -void drm_gpuva_link(struct drm_gpuva *va);
> > +void drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo);
> >   void drm_gpuva_unlink(struct drm_gpuva *va);
> >   struct drm_gpuva *drm_gpuva_find(struct drm_gpuva_manager *mgr,
> > @@ -240,15 +244,137 @@ struct drm_gpuva_manager {
> >   	 * @ops: &drm_gpuva_fn_ops providing the split/merge steps to drivers
> >   	 */
> >   	const struct drm_gpuva_fn_ops *ops;
> > +
> > +	/**
> > +	 * @d_obj: Dummy GEM object; used internally to pass the GPU VMs
> > +	 * dma-resv to &drm_exec.
> > +	 */
> > +	struct drm_gem_object d_obj;
> > +
> > +	/**
> > +	 * @resv: the &dma_resv for &drm_gem_objects mapped in this GPU VA
> > +	 * space
> > +	 */
> > +	struct dma_resv *resv;
> > +
> > +	/**
> > +	 * @exec: the &drm_exec helper to lock external &drm_gem_objects
> > +	 */
> > +	struct drm_exec exec;
> > +
> > +	/**
> > +	 * @mt_ext: &maple_tree storing external &drm_gem_objects
> > +	 */
> > +	struct maple_tree mt_ext;
> > +
> > +	/**
> > +	 * @evict: structure holding the evict list and evict list lock
> > +	 */
> > +	struct {
> > +		/**
> > +		 * @list: &list_head storing &drm_gem_objects currently being
> > +		 * evicted
> > +		 */
> > +		struct list_head list;
> > +
> > +		/**
> > +		 * @lock: spinlock to protect the evict list against concurrent
> > +		 * insertion / removal of different &drm_gpuva_gems
> > +		 */
> > +		spinlock_t lock;
> > +	} evict;
> >   };
> >   void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> > +			    struct drm_device *drm,
> >   			    const char *name,
> >   			    u64 start_offset, u64 range,
> >   			    u64 reserve_offset, u64 reserve_range,
> >   			    const struct drm_gpuva_fn_ops *ops);
> >   void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr);
> > +/**
> > + * DRM_GPUVA_EXEC - returns the &drm_gpuva_managers &drm_exec instance
> > + * @mgr: the &drm_gpuva_managers to return the &drm_exec instance for
> > + */
> > +#define DRM_GPUVA_EXEC(mgr)	&(mgr)->exec
> > +
> > +int drm_gpuva_manager_lock_extra(struct drm_gpuva_manager *mgr,
> > +				 int (*fn)(struct drm_gpuva_manager *mgr,
> > +					   void *priv, unsigned int num_fences),
> > +				 void *priv,
> > +				 unsigned int num_fences,
> > +				 bool interruptible);
> > +
> > +int drm_gpuva_manager_lock_array(struct drm_gpuva_manager *mgr,
> > +				 struct drm_gem_object **objs,
> > +				 unsigned int num_objs,
> > +				 unsigned int num_fences,
> > +				 bool interruptible);
> > +
> > +/**
> > + * drm_gpuva_manager_lock() - lock all dma-resv of all associated BOs
> > + * @mgr: the &drm_gpuva_manager
> > + * @num_fences: the amount of &dma_fences to reserve
> > + * @interruptible: sleep interruptible if waiting
> > + *
> > + * Acquires all dma-resv locks of all &drm_gem_objects the given
> > + * &drm_gpuva_manager contains mappings of.
> > + *
> > + * Returns: 0 on success, negative error code on failure.
> > + */
> > +static inline int
> > +drm_gpuva_manager_lock(struct drm_gpuva_manager *mgr,
> > +		       unsigned int num_fences,
> > +		       bool interruptible)
> > +{
> > +	return drm_gpuva_manager_lock_extra(mgr, NULL, NULL, num_fences,
> > +					    interruptible);
> > +}
> > +
> > +/**
> > + * drm_gpuva_manager_unlock() - unlock all dma-resv of all associated BOs
> > + * @mgr: the &drm_gpuva_manager
> > + *
> > + * Releases all dma-resv locks of all &drm_gem_objects previously acquired
> > + * through drm_gpuva_manager_lock() or its variants.
> > + */
> > +static inline void
> > +drm_gpuva_manager_unlock(struct drm_gpuva_manager *mgr)
> > +{
> > +	drm_exec_fini(&mgr->exec);
> > +}
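
Putting the pieces together, a driver's job submission path would use these
helpers roughly as follows (sketch only; my_* is hypothetical driver code and
the dma_resv usages are just an example):

	static int my_job_submit(struct my_job *job)
	{
		struct drm_gpuva_manager *mgr = job->vm_mgr;
		int ret;

		/* Locks the VM's dma-resv plus all extobj dma-resvs. */
		ret = drm_gpuva_manager_lock(mgr, 1, true);
		if (ret)
			return ret;

		/* Re-validate all BOs evicted since the last submission. */
		ret = drm_gpuva_manager_validate(mgr);
		if (ret)
			goto out_unlock;

		ret = my_job_push(job);
		if (ret)
			goto out_unlock;

		/* Attach the job's fence to the VM's and all extobj resvs. */
		drm_gpuva_manager_resv_add_fence(mgr, job->done_fence,
						 DMA_RESV_USAGE_BOOKKEEP,
						 DMA_RESV_USAGE_BOOKKEEP);

	out_unlock:
		drm_gpuva_manager_unlock(mgr);
		return ret;
	}
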
> > +
> > +int drm_gpuva_manager_validate(struct drm_gpuva_manager *mgr);
> > +void drm_gpuva_manager_resv_add_fence(struct drm_gpuva_manager *mgr,
> > +				      struct dma_fence *fence,
> > +				      enum dma_resv_usage private_usage,
> > +				      enum dma_resv_usage extobj_usage);
> > +
> > +int drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
> > +			    struct drm_gem_object *obj);
> > +void drm_gpuva_extobj_get(struct drm_gpuva_manager *mgr,
> > +			  struct drm_gem_object *obj);
> > +void drm_gpuva_extobj_put(struct drm_gpuva_manager *mgr,
> > +			  struct drm_gem_object *obj);
> > +
> > +/**
> > + * drm_gpuva_is_extobj() - indicates whether the given &drm_gem_object is an
> > + * external object
> > + * @mgr: the &drm_gpuva_manager to check
> > + * @obj: the &drm_gem_object to check
> > + *
> > + * Returns: true if the &drm_gem_object &dma_resv differs from the
> > + * &drm_gpuva_managers &dma_resv, false otherwise
> > + */
> > +static inline bool drm_gpuva_is_extobj(struct drm_gpuva_manager *mgr,
> > +				       struct drm_gem_object *obj)
> > +{
> > +	return obj && obj->resv != mgr->resv;
> > +}
> > +
> >   static inline struct drm_gpuva *
> >   __drm_gpuva_next(struct drm_gpuva *va)
> >   {
> > @@ -327,6 +453,138 @@ __drm_gpuva_next(struct drm_gpuva *va)
> >   #define drm_gpuva_for_each_va_safe(va__, next__, mgr__) \
> >   	list_for_each_entry_safe(va__, next__, &(mgr__)->rb.list, rb.entry)
> > +/**
> > + * struct drm_gpuva_gem - structure representing a &drm_gpuva_manager and
> > + * &drm_gem_object combination
> > + *
> > + * This structure is an abstraction representing a &drm_gpuva_manager and
> > + * &drm_gem_object combination. It serves as an indirection to accelerate
> > + * iterating all &drm_gpuvas within a &drm_gpuva_manager backed by the same
> > + * &drm_gem_object.
> > + *
> > + * Furthermore, it is used to cache evicted GEM objects for a certain GPU-VM to
> > + * accelerate validation.
> > + *
> > + * Typically, drivers want to create an instance of a struct drm_gpuva_gem once
> > + * a GEM object is mapped first in a GPU-VM and release the instance once the
> > + * last mapping of the GEM object in this GPU-VM is unmapped.
> > + */
> > +struct drm_gpuva_gem {
> > +
> > +	/**
> > +	 * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> > +	 */
> > +	struct drm_gpuva_manager *mgr;
> > +
> > +	/**
> > +	 * @obj: The &drm_gem_object being mapped in the @mgr.
> > +	 */
> > +	struct drm_gem_object *obj;
> > +
> > +	/**
> > +	 * @kref: The reference count for this &drm_gpuva_gem.
> > +	 */
> > +	struct kref kref;
> > +
> > +	/**
> > +	 * @list: Structure containing all &list_heads.
> > +	 */
> > +	struct {
> > +		/**
> > +		 * @gpuva: The list of linked &drm_gpuvas.
> > +		 */
> > +		struct list_head gpuva;
> > +
> > +		/**
> > +		 * @entry: Structure containing all &list_heads serving as
> > +		 * entry.
> > +		 */
> > +		struct {
> > +			/**
> > +			 * @gem: List entry to attach to the &drm_gem_objects
> > +			 * gpuva list.
> > +			 */
> > +			struct list_head gem;
> > +
> > +			/**
> > +			 * @evict: List entry to attach to the
> > +			 * &drm_gpuva_managers evict list.
> > +			 */
> > +			struct list_head evict;
> > +		} entry;
> > +	} list;
> > +};
> > +
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_obtain(struct drm_gpuva_manager *mgr,
> > +		     struct drm_gem_object *obj);
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_obtain_prealloc(struct drm_gpuva_manager *mgr,
> > +			      struct drm_gem_object *obj,
> > +			      struct drm_gpuva_gem *__vm_bo);
> > +
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
> > +		   struct drm_gem_object *obj);
> > +
> > +void drm_gpuva_gem_evict(struct drm_gem_object *obj, bool evict);
> > +
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_create(struct drm_gpuva_manager *mgr,
> > +		     struct drm_gem_object *obj);
> > +void drm_gpuva_gem_destroy(struct kref *kref);
> > +
> > +/**
> > + * drm_gpuva_gem_get() - acquire a struct drm_gpuva_gem reference
> > + * @vm_bo: the &drm_gpuva_gem to acquire the reference of
> > + *
> > + * This function acquires an additional reference to @vm_bo. It is illegal to
> > + * call this without already holding a reference. No locks required.
> > + */
> > +static inline struct drm_gpuva_gem *
> > +drm_gpuva_gem_get(struct drm_gpuva_gem *vm_bo)
> > +{
> > +	kref_get(&vm_bo->kref);
> > +	return vm_bo;
> > +}
> > +
> > +/**
> > + * drm_gpuva_gem_put() - drop a struct drm_gpuva_gem reference
> > + * @vm_bo: the &drm_gpuva_gem to release the reference of
> > + *
> > + * This releases a reference to @vm_bo.
> > + */
> > +static inline void
> > +drm_gpuva_gem_put(struct drm_gpuva_gem *vm_bo)
> > +{
> > +	kref_put(&vm_bo->kref, drm_gpuva_gem_destroy);
> > +}
> > +
> > +/**
> > + * drm_gpuva_gem_for_each_va() - iterator to walk over a list of &drm_gpuva
> > + * @va__: &drm_gpuva structure to assign to in each iteration step
> > + * @vm_bo__: the &drm_gpuva_gem the &drm_gpuva to walk are associated with
> > + *
> > + * This iterator walks over all &drm_gpuva structures associated with the
> > + * &drm_gpuva_gem.
> > + */
> > +#define drm_gpuva_gem_for_each_va(va__, vm_bo__) \
> > +	list_for_each_entry(va__, &(vm_bo__)->list.gpuva, gem.entry)
> > +
> > +/**
> > + * drm_gpuva_gem_for_each_va_safe() - iterator to safely walk over a list of
> > + * &drm_gpuva
> > + * @va__: &drm_gpuva structure to assign to in each iteration step
> > + * @next__: &next &drm_gpuva to store the next step
> > + * @vm_bo__: the &drm_gpuva_gem the &drm_gpuva to walk are associated with
> > + *
> > + * This iterator walks over all &drm_gpuva structures associated with the
> > + * &drm_gpuva_gem. It is implemented with list_for_each_entry_safe(), hence
> > + * it is safe against removal of elements.
> > + */
> > +#define drm_gpuva_gem_for_each_va_safe(va__, next__, vm_bo__) \
> > +	list_for_each_entry_safe(va__, next__, &(vm_bo__)->list.gpuva, gem.entry)
> > +
> >   /**
> >    * enum drm_gpuva_op_type - GPU VA operation type
> >    *
> > @@ -641,6 +899,30 @@ struct drm_gpuva_fn_ops {
> >   	 */
> >   	void (*op_free)(struct drm_gpuva_op *op);
> > +	/**
> > +	 * @vm_bo_alloc: called when the &drm_gpuva_manager allocates
> > +	 * a struct drm_gpuva_gem
> > +	 *
> > +	 * Some drivers may want to embed struct drm_gpuva_gem into driver
> > +	 * specific structures. By implementing this callback drivers can
> > +	 * allocate memory accordingly.
> > +	 *
> > +	 * This callback is optional.
> > +	 */
> > +	struct drm_gpuva_gem *(*vm_bo_alloc)(void);
> > +
> > +	/**
> > +	 * @vm_bo_free: called when the &drm_gpuva_manager frees a
> > +	 * struct drm_gpuva_gem
> > +	 *
> > +	 * Some drivers may want to embed struct drm_gpuva_gem into driver
> > +	 * specific structures. By implementing this callback drivers can
> > +	 * free the previously allocated memory accordingly.
> > +	 *
> > +	 * This callback is optional.
> > +	 */
> > +	void (*vm_bo_free)(struct drm_gpuva_gem *vm_bo);
> > +
> >   	/**
> >   	 * @sm_step_map: called from &drm_gpuva_sm_map to finally insert the
> >   	 * mapping once all previous steps were completed
> > @@ -684,6 +966,17 @@ struct drm_gpuva_fn_ops {
> >   	 * used.
> >   	 */
> >   	int (*sm_step_unmap)(struct drm_gpuva_op *op, void *priv);
> > +
> > +	/**
> > +	 * @bo_validate: called from drm_gpuva_manager_validate()
> > +	 *
> > +	 * Drivers receive this callback for every evicted &drm_gem_object being
> > +	 * mapped in the corresponding &drm_gpuva_manager.
> > +	 *
> > +	 * Typically, drivers would call their driver specific variant of
> > +	 * ttm_bo_validate() from within this callback.
> > +	 */
> > +	int (*bo_validate)(struct drm_gem_object *obj);
> >   };
> >   int drm_gpuva_sm_map(struct drm_gpuva_manager *mgr, void *priv,
> > @@ -696,11 +989,18 @@ int drm_gpuva_sm_unmap(struct drm_gpuva_manager *mgr, void *priv,
> >   void drm_gpuva_map(struct drm_gpuva_manager *mgr,
> >   		   struct drm_gpuva *va,
> >   		   struct drm_gpuva_op_map *op);
> > +void drm_gpuva_map_get(struct drm_gpuva_manager *mgr,
> > +		       struct drm_gpuva *va,
> > +		       struct drm_gpuva_op_map *op);
> >   void drm_gpuva_remap(struct drm_gpuva *prev,
> >   		     struct drm_gpuva *next,
> >   		     struct drm_gpuva_op_remap *op);
> > +void drm_gpuva_remap_get(struct drm_gpuva *prev,
> > +			 struct drm_gpuva *next,
> > +			 struct drm_gpuva_op_remap *op);
> >   void drm_gpuva_unmap(struct drm_gpuva_op_unmap *op);
> > +void drm_gpuva_unmap_put(struct drm_gpuva_op_unmap *op);
> >   #endif /* __DRM_GPUVA_MGR_H__ */
> 


^ permalink raw reply	[flat|nested] 88+ messages in thread

* Re: [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
@ 2023-08-30 13:05       ` Danilo Krummrich
  0 siblings, 0 replies; 88+ messages in thread
From: Danilo Krummrich @ 2023-08-30 13:05 UTC (permalink / raw)
  To: Christian König
  Cc: matthew.brost, thomas.hellstrom, sarah.walker, nouveau,
	dri-devel, linux-kernel, Liam.Howlett, boris.brezillon,
	donald.robson, faith.ekstrand, bskeggs

On Wed, Aug 30, 2023 at 09:48:02AM +0200, Christian König wrote:
> 
> 
> Am 20.08.23 um 23:53 schrieb Danilo Krummrich:
> > So far the DRM GPUVA manager offers common infrastructure to track GPU VA
> > allocations and mappings, generically connect GPU VA mappings to their
> > backing buffers and perform more complex mapping operations on the GPU VA
> > space.
> > 
> > However, there are more design patterns commonly used by drivers, which
> > can potentially be generalized in order to make the DRM GPUVA manager
> > represent a basic GPU-VM implementation. In this context, this patch aims
> > at generalizing the following elements.
> > 
> > 1) Provide a common dma-resv for GEM objects not being used outside of
> >     this GPU-VM.
> > 
> > 2) Provide tracking of external GEM objects (GEM objects which are
> >     shared with other GPU-VMs).
> > 
> > 3) Provide functions to efficiently lock all GEM objects dma-resv the
> >     GPU-VM contains mappings of.
> > 
> > 4) Provide tracking of evicted GEM objects the GPU-VM contains mappings
> >     of, such that validation of evicted GEM objects is accelerated.
> > 
> > 5) Provide some convenience functions for common patterns.
> 
> Interesting work.
> 
> You basically implement a bunch of the ideas I came up with to improve the
> amdgpu performance in the common manager now. This was one of the remaining
> blockers I had for using this in amdgpu.
> 
> The question is, for example, how do you track evictions? E.g. we don't have a
> common concept of eviction in GEM as far as I know. Or is the driver
> responsible for giving those notifications to the GPUVA manager?

Right, it is the driver's responsibility to add a drm_gpuva_gem (or VM_BO) to
the manager's evict list.

The idea was that drivers have control over the state of a drm_gpuva_gem, such
that a driver can move it to driver-specific lists as well, like all the ones
you have in amdgpu.

> 
> And would it be possible to lock only a specific area of the VM, e.g. every
> BO mapped in the interval X..Y?

Currently, the drm_gpuva_manager_lock() functions always lock the GPU-VM's
dma-resv lock, plus all the dma-resv locks of the external objects the manager
keeps track of.

But surely, we could also add something like drm_gpuva_manager_lock_range()
where we just iterate all drm_gpuvas between X and Y and lock the dma-resv
locks of each drm_gpuva's backing BO.

> 
> Regards,
> Christian.
> 
> > 
> > Rather than being designed as a "framework", the target is to make all
> > features appear as a collection of optional helper functions, such that
> > drivers are free to make use of the DRM GPUVA managers basic
> > functionality and opt-in for other features without setting any feature
> > flags, just by making use of the corresponding functions.
> > 
> > Signed-off-by: Danilo Krummrich <dakr@redhat.com>
> > ---
> >   drivers/gpu/drm/drm_gpuva_mgr.c | 688 +++++++++++++++++++++++++++++++-
> >   include/drm/drm_gem.h           |  48 ++-
> >   include/drm/drm_gpuva_mgr.h     | 302 +++++++++++++-
> >   3 files changed, 1010 insertions(+), 28 deletions(-)
> > 
> > diff --git a/drivers/gpu/drm/drm_gpuva_mgr.c b/drivers/gpu/drm/drm_gpuva_mgr.c
> > index f86bfad74ff8..69872b205961 100644
> > --- a/drivers/gpu/drm/drm_gpuva_mgr.c
> > +++ b/drivers/gpu/drm/drm_gpuva_mgr.c
> > @@ -655,6 +655,7 @@ drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
> >   /**
> >    * drm_gpuva_manager_init() - initialize a &drm_gpuva_manager
> >    * @mgr: pointer to the &drm_gpuva_manager to initialize
> > + * @drm: the drivers &drm_device
> >    * @name: the name of the GPU VA space
> >    * @start_offset: the start offset of the GPU VA space
> >    * @range: the size of the GPU VA space
> > @@ -669,6 +670,7 @@ drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
> >    */
> >   void
> >   drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> > +		       struct drm_device *drm,
> >   		       const char *name,
> >   		       u64 start_offset, u64 range,
> >   		       u64 reserve_offset, u64 reserve_range,
> > @@ -677,6 +679,11 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> >   	mgr->rb.tree = RB_ROOT_CACHED;
> >   	INIT_LIST_HEAD(&mgr->rb.list);
> > +	mt_init(&mgr->mt_ext);
> > +
> > +	INIT_LIST_HEAD(&mgr->evict.list);
> > +	spin_lock_init(&mgr->evict.lock);
> > +
> >   	drm_gpuva_check_overflow(start_offset, range);
> >   	mgr->mm_start = start_offset;
> >   	mgr->mm_range = range;
> > @@ -694,6 +701,9 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> >   						     reserve_range)))
> >   			__drm_gpuva_insert(mgr, &mgr->kernel_alloc_node);
> >   	}
> > +
> > +	drm_gem_private_object_init(drm, &mgr->d_obj, 0);
> > +	mgr->resv = mgr->d_obj.resv;
> >   }
> >   EXPORT_SYMBOL_GPL(drm_gpuva_manager_init);
> > @@ -713,10 +723,575 @@ drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr)
> >   		__drm_gpuva_remove(&mgr->kernel_alloc_node);
> >   	WARN(!RB_EMPTY_ROOT(&mgr->rb.tree.rb_root),
> > -	     "GPUVA tree is not empty, potentially leaking memory.");
> > +	     "GPUVA tree is not empty, potentially leaking memory.\n");
> > +
> > +	mtree_destroy(&mgr->mt_ext);
> > +	WARN(!list_empty(&mgr->evict.list), "Evict list should be empty.\n");
> > +
> > +	drm_gem_private_object_fini(&mgr->d_obj);
> >   }
> >   EXPORT_SYMBOL_GPL(drm_gpuva_manager_destroy);
> > +/**
> > + * drm_gpuva_manager_prepare_objects() - prepare all associated BOs
> > + * @mgr: the &drm_gpuva_manager
> > + * @num_fences: the amount of &dma_fences to reserve
> > + *
> > + * Calls drm_exec_prepare_obj() for all &drm_gem_objects the given
> > + * &drm_gpuva_manager contains mappings of.
> > + *
> > + * Drivers can obtain the corresponding &drm_exec instance through
> > + * DRM_GPUVA_EXEC(). It is the driver's responsibility to call drm_exec_init()
> > + * and drm_exec_fini() accordingly.
> > + *
> > + * Returns: 0 on success, negative error code on failure.
> > + */
> > +int
> > +drm_gpuva_manager_prepare_objects(struct drm_gpuva_manager *mgr,
> > +				  unsigned int num_fences)
> > +{
> > +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
> > +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
> > +	union {
> > +		void *ptr;
> > +		uintptr_t cnt;
> > +	} ref;
> > +	int ret;
> > +
> > +	ret = drm_exec_prepare_obj(exec, &mgr->d_obj, num_fences);
> > +	if (ret)
> > +		goto out;
> > +
> > +	rcu_read_lock();
> > +	mas_for_each(&mas, ref.ptr, ULONG_MAX) {
> > +		struct drm_gem_object *obj;
> > +
> > +		mas_pause(&mas);
> > +		rcu_read_unlock();
> > +
> > +		obj = (struct drm_gem_object *)(uintptr_t)mas.index;
> > +		ret = drm_exec_prepare_obj(exec, obj, num_fences);
> > +		if (ret)
> > +			goto out;
> > +
> > +		rcu_read_lock();
> > +	}
> > +	rcu_read_unlock();
> > +
> > +out:
> > +	return ret;
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_manager_prepare_objects);
> > +
> > +/**
> > + * drm_gpuva_manager_lock_extra() - lock all dma-resv of all associated BOs
> > + * @mgr: the &drm_gpuva_manager
> > + * @fn: callback received by the driver to lock additional dma-resv
> > + * @priv: private driver data passed to @fn
> > + * @num_fences: the amount of &dma_fences to reserve
> > + * @interruptible: sleep interruptible if waiting
> > + *
> > + * Acquires all dma-resv locks of all &drm_gem_objects the given
> > + * &drm_gpuva_manager contains mappings of.
> > + *
> > + * Additionally, when calling this function the driver receives the given @fn
> > + * callback to lock additional dma-resv in the context of the
> > + * &drm_gpuva_managers &drm_exec instance. Typically, drivers would call
> > + * drm_exec_prepare_obj() from within this callback.
> > + *
> > + * Returns: 0 on success, negative error code on failure.
> > + */
> > +int
> > +drm_gpuva_manager_lock_extra(struct drm_gpuva_manager *mgr,
> > +			     int (*fn)(struct drm_gpuva_manager *mgr,
> > +				       void *priv, unsigned int num_fences),
> > +			     void *priv,
> > +			     unsigned int num_fences,
> > +			     bool interruptible)
> > +{
> > +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
> > +	uint32_t flags;
> > +	int ret;
> > +
> > +	flags = (interruptible ? DRM_EXEC_INTERRUPTIBLE_WAIT : 0) |
> > +		DRM_EXEC_IGNORE_DUPLICATES;
> > +
> > +	drm_exec_init(exec, flags);
> > +
> > +	drm_exec_until_all_locked(exec) {
> > +		ret = drm_gpuva_manager_prepare_objects(mgr, num_fences);
> > +		drm_exec_retry_on_contention(exec);
> > +		if (ret)
> > +			goto err;
> > +
> > +		if (fn) {
> > +			ret = fn(mgr, priv, num_fences);
> > +			drm_exec_retry_on_contention(exec);
> > +			if (ret)
> > +				goto err;
> > +		}
> > +	}
> > +
> > +	return 0;
> > +
> > +err:
> > +	drm_exec_fini(exec);
> > +	return ret;
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_manager_lock_extra);
> > +
> > +static int
> > +fn_lock_array(struct drm_gpuva_manager *mgr, void *priv,
> > +				unsigned int num_fences)
> > +{
> > +	struct {
> > +		struct drm_gem_object **objs;
> > +		unsigned int num_objs;
> > +	} *args = priv;
> > +
> > +	return drm_exec_prepare_array(DRM_GPUVA_EXEC(mgr), args->objs,
> > +				      args->num_objs, num_fences);
> > +}
> > +
> > +/**
> > + * drm_gpuva_manager_lock_array() - lock all dma-resv of all associated BOs
> > + * @mgr: the &drm_gpuva_manager
> > + * @objs: additional &drm_gem_objects to lock
> > + * @num_objs: the number of additional &drm_gem_objects to lock
> > + * @num_fences: the amount of &dma_fences to reserve
> > + * @interruptible: sleep interruptible if waiting
> > + *
> > + * Acquires all dma-resv locks of all &drm_gem_objects the given
> > + * &drm_gpuva_manager contains mappings of, plus the ones given through @objs.
> > + *
> > + * Returns: 0 on success, negative error code on failure.
> > + */
> > +int
> > +drm_gpuva_manager_lock_array(struct drm_gpuva_manager *mgr,
> > +			     struct drm_gem_object **objs,
> > +			     unsigned int num_objs,
> > +			     unsigned int num_fences,
> > +			     bool interruptible)
> > +{
> > +	struct {
> > +		struct drm_gem_object **objs;
> > +		unsigned int num_objs;
> > +	} args;
> > +
> > +	args.objs = objs;
> > +	args.num_objs = num_objs;
> > +
> > +	return drm_gpuva_manager_lock_extra(mgr, fn_lock_array, &args,
> > +					    num_fences, interruptible);
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_manager_lock_array);
> > +
> > +/**
> > + * drm_gpuva_manager_validate() - validate all BOs marked as evicted
> > + * @mgr: the &drm_gpuva_manager to validate evicted BOs
> > + *
> > + * Calls the &drm_gpuva_fn_ops.bo_validate callback for all evicted buffer
> > + * objects being mapped in the given &drm_gpuva_manager.
> > + *
> > + * Returns: 0 on success, negative error code on failure.
> > + */
> > +int
> > +drm_gpuva_manager_validate(struct drm_gpuva_manager *mgr)
> > +{
> > +	const struct drm_gpuva_fn_ops *ops = mgr->ops;
> > +	struct drm_gpuva_gem *vm_bo;
> > +	int ret;
> > +
> > +	if (unlikely(!ops || !ops->bo_validate))
> > +		return -ENOTSUPP;
> > +
> > +	/* At this point we should hold all dma-resv locks of all GEM objects
> > +	 * associated with this GPU-VM, hence it is safe to walk the list.
> > +	 */
> > +	list_for_each_entry(vm_bo, &mgr->evict.list, list.entry.evict) {
> > +		dma_resv_assert_held(vm_bo->obj->resv);
> > +
> > +		ret = ops->bo_validate(vm_bo->obj);
> > +		if (ret)
> > +			return ret;
> > +	}
> > +
> > +	return 0;
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_manager_validate);
> > +
> > +/**
> > + * drm_gpuva_manager_resv_add_fence - add fence to private and all extobj
> > + * dma-resv
> > + * @mgr: the &drm_gpuva_manager to add a fence to
> > + * @fence: fence to add
> > + * @private_usage: private dma-resv usage
> > + * @extobj_usage: extobj dma-resv usage
> > + */
> > +void
> > +drm_gpuva_manager_resv_add_fence(struct drm_gpuva_manager *mgr,
> > +				 struct dma_fence *fence,
> > +				 enum dma_resv_usage private_usage,
> > +				 enum dma_resv_usage extobj_usage)
> > +{
> > +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
> > +	struct drm_gem_object *obj;
> > +	unsigned long index;
> > +
> > +	drm_exec_for_each_locked_object(exec, index, obj) {
> > +			dma_resv_assert_held(obj->resv);
> > +			dma_resv_add_fence(obj->resv, fence,
> > +					   drm_gpuva_is_extobj(mgr, obj) ?
> > +					   extobj_usage : private_usage);
> > +	}
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_manager_resv_add_fence);
> > +
> > +static struct drm_gpuva_gem *
> > +__drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
> > +		     struct drm_gem_object *obj)
> > +{
> > +	struct drm_gpuva_gem *vm_bo;
> > +
> > +	drm_gem_gpuva_assert_lock_held(obj);
> > +
> > +	drm_gem_for_each_gpuva_gem(vm_bo, obj)
> > +		if (vm_bo->mgr == mgr)
> > +			return vm_bo;
> > +
> > +	return NULL;
> > +}
> > +
> > +/**
> > + * drm_gpuva_gem_create() - create a new instance of struct drm_gpuva_gem
> > + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> > + * @obj: The &drm_gem_object being mapped in the @mgr.
> > + *
> > + * If provided by the driver, this function uses the &drm_gpuva_fn_ops
> > + * vm_bo_alloc() callback to allocate.
> > + *
> > + * Returns: a pointer to the &drm_gpuva_gem on success, NULL on failure
> > + */
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_create(struct drm_gpuva_manager *mgr,
> > +		     struct drm_gem_object *obj)
> > +{
> > +	const struct drm_gpuva_fn_ops *ops = mgr->ops;
> > +	struct drm_gpuva_gem *vm_bo;
> > +
> > +	if (ops && ops->vm_bo_alloc)
> > +		vm_bo = ops->vm_bo_alloc();
> > +	else
> > +		vm_bo = kzalloc(sizeof(*vm_bo), GFP_KERNEL);
> > +
> > +	if (unlikely(!vm_bo))
> > +		return NULL;
> > +
> > +	vm_bo->mgr = mgr;
> > +	vm_bo->obj = obj;
> > +
> > +	kref_init(&vm_bo->kref);
> > +	INIT_LIST_HEAD(&vm_bo->list.gpuva);
> > +	INIT_LIST_HEAD(&vm_bo->list.entry.gem);
> > +	INIT_LIST_HEAD(&vm_bo->list.entry.evict);
> > +
> > +	drm_gem_object_get(obj);
> > +
> > +	return vm_bo;
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_create);
> > +
> > +void
> > +drm_gpuva_gem_destroy(struct kref *kref)
> > +{
> > +	struct drm_gpuva_gem *vm_bo = container_of(kref, struct drm_gpuva_gem,
> > +						   kref);
> > +	const struct drm_gpuva_fn_ops *ops = vm_bo->mgr->ops;
> > +
> > +	drm_gem_object_put(vm_bo->obj);
> > +
> > +	if (ops && ops->vm_bo_free)
> > +		ops->vm_bo_free(vm_bo);
> > +	else
> > +		kfree(vm_bo);
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_destroy);
> > +
> > +/**
> > + * drm_gpuva_gem_find() - find the &drm_gpuva_gem for the given
> > + * &drm_gpuva_manager and &drm_gem_object
> > + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> > + * @obj: The &drm_gem_object being mapped in the @mgr.
> > + *
> > + * Find the &drm_gpuva_gem representing the combination of the given
> > + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
> > + * count of the &drm_gpuva_gem accordingly.
> > + *
> > + * Returns: a pointer to the &drm_gpuva_gem on success, NULL on failure
> > + */
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
> > +		   struct drm_gem_object *obj)
> > +{
> > +	struct drm_gpuva_gem *vm_bo = __drm_gpuva_gem_find(mgr, obj);
> > +
> > +	return vm_bo ? drm_gpuva_gem_get(vm_bo) : NULL;
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_find);
> > +
> > +/**
> > + * drm_gpuva_gem_obtain() - obtains an instance of the &drm_gpuva_gem for the
> > + * given &drm_gpuva_manager and &drm_gem_object
> > + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> > + * @obj: The &drm_gem_object being mapped in the @mgr.
> > + *
> > + * Find the &drm_gpuva_gem representing the combination of the given
> > + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
> > + * count of the &drm_gpuva_gem accordingly. If not found, allocates a new
> > + * &drm_gpuva_gem.
> > + *
> > + * Returns: a pointer to the &drm_gpuva_gem on success, an ERR_PTR on failure
> > + */
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_obtain(struct drm_gpuva_manager *mgr,
> > +		     struct drm_gem_object *obj)
> > +{
> > +	struct drm_gpuva_gem *vm_bo;
> > +
> > +	vm_bo = drm_gpuva_gem_find(mgr, obj);
> > +	if (vm_bo)
> > +		return vm_bo;
> > +
> > +	vm_bo = drm_gpuva_gem_create(mgr, obj);
> > +	if (!vm_bo)
> > +		return ERR_PTR(-ENOMEM);
> > +
> > +	return vm_bo;
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_obtain);
> > +
> > +/**
> > + * drm_gpuva_gem_obtain_prealloc() - obtains an instance of the &drm_gpuva_gem
> > + * for the given &drm_gpuva_manager and &drm_gem_object
> > + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> > + * @obj: The &drm_gem_object being mapped in the @mgr.
> > + * @__vm_bo: A pre-allocated &drm_gpuva_gem to use if no existing one is found.
> > + *
> > + * Find the &drm_gpuva_gem representing the combination of the given
> > + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
> > + * count of the found &drm_gpuva_gem accordingly, while the @__vm_bo reference
> > + * count is decreased. If not found @__vm_bo is returned.
> > + *
> > + * Returns: a pointer to the found &drm_gpuva_gem or @__vm_bo if no existing
> > + * &drm_gpuva_gem was found
> > + */
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_obtain_prealloc(struct drm_gpuva_manager *mgr,
> > +			      struct drm_gem_object *obj,
> > +			      struct drm_gpuva_gem *__vm_bo)
> > +{
> > +	struct drm_gpuva_gem *vm_bo;
> > +
> > +	vm_bo = drm_gpuva_gem_find(mgr, obj);
> > +	if (vm_bo) {
> > +		drm_gpuva_gem_put(__vm_bo);
> > +		return vm_bo;
> > +	}
> > +
> > +	return __vm_bo;
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_obtain_prealloc);
> > +
> > +static int
> > +__drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
> > +			  struct drm_gem_object *obj,
> > +			  gfp_t gfp)
> > +{
> > +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
> > +	union {
> > +		struct drm_gem_object *obj;
> > +		uintptr_t index;
> > +	} gem;
> > +	union {
> > +		void *ptr;
> > +		uintptr_t cnt;
> > +	} ref;
> > +	int ret = 0;
> > +
> > +	gem.obj = obj;
> > +	mas_set(&mas, gem.index);
> > +
> > +	mas_lock(&mas);
> > +	ref.ptr = mas_walk(&mas);
> > +	if (ref.ptr) {
> > +		++ref.cnt;
> > +		mas_store(&mas, ref.ptr);
> > +	} else {
> > +		if (unlikely(!gfp)) {
> > +			ret = -EINVAL;
> > +			goto out;
> > +		}
> > +
> > +		mas_set(&mas, gem.index);
> > +		ref.cnt = 1;
> > +		ret = mas_store_gfp(&mas, ref.ptr, gfp);
> > +		if (likely(!ret))
> > +			drm_gem_object_get(obj);
> > +	}
> > +out:
> > +	mas_unlock(&mas);
> > +	return ret;
> > +}
> > +
> > +static void
> > +__drm_gpuva_extobj_remove(struct drm_gpuva_manager *mgr,
> > +			  struct drm_gem_object *obj)
> > +{
> > +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
> > +	union {
> > +		struct drm_gem_object *obj;
> > +		uintptr_t index;
> > +	} gem;
> > +	union {
> > +		void *ptr;
> > +		uintptr_t cnt;
> > +	} ref;
> > +
> > +	gem.obj = obj;
> > +	mas_set(&mas, gem.index);
> > +
> > +	mas_lock(&mas);
> > +	if (unlikely(!(ref.ptr = mas_walk(&mas))))
> > +		goto out;
> > +
> > +	if (!--ref.cnt) {
> > +		mas_erase(&mas);
> > +		drm_gem_object_put(obj);
> > +	} else {
> > +		mas_store(&mas, ref.ptr);
> > +	}
> > +out:
> > +	mas_unlock(&mas);
> > +}
> > +
> > +/**
> > + * drm_gpuva_extobj_insert - insert an external &drm_gem_object
> > + * @mgr: the &drm_gpuva_manager to insert into
> > + * @obj: the &drm_gem_object to insert as extobj
> > + *
> > + * Insert a &drm_gem_object into the &drm_gpuva_managers external object tree.
> > + * If the &drm_gem_object already exists in the tree, the reference counter
> > + * of this external object is increased by one.
> > + *
> > + * Drivers should insert the external &drm_gem_object before the dma-fence
> > + * signalling critical section, e.g. when submitting the job, and before
> > + * locking all &drm_gem_objects of a GPU-VM, e.g. with drm_gpuva_manager_lock()
> > + * or its variants.
> > + *
> > + * Returns: 0 on success, negative error code on failure.
> > + */
> > +int
> > +drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
> > +			struct drm_gem_object *obj)
> > +{
> > +	return drm_gpuva_is_extobj(mgr, obj) ?
> > +		__drm_gpuva_extobj_insert(mgr, obj, GFP_KERNEL) : 0;
> > +
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_insert);
> > +
> > +/**
> > + * drm_gpuva_extobj_get - increase the reference count of an external
> > + * &drm_gem_object
> > + * @mgr: the &drm_gpuva_manager storing the extobj
> > + * @obj: the &drm_gem_object representing the extobj
> > + *
> > + * Increases the reference count of the extobj represented by @obj.
> > + *
> > + * Drivers should call this for every &drm_gpuva backed by a &drm_gem_object
> > + * being inserted.
> > + *
> > + * For &drm_gpuva_op_remap operations drivers should make sure to only take an
> > + * additional reference if the re-map operation splits an existing &drm_gpuva
> > + * into two separate ones.
> > + *
> > + * See also drm_gpuva_map_get() and drm_gpuva_remap_get().
> > + */
> > +void
> > +drm_gpuva_extobj_get(struct drm_gpuva_manager *mgr,
> > +		     struct drm_gem_object *obj)
> > +{
> > +	if (drm_gpuva_is_extobj(mgr, obj))
> > +		WARN(__drm_gpuva_extobj_insert(mgr, obj, 0),
> > +		     "Can't increase ref-count of non-existent extobj.");
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_get);
> > +
> > +/**
> > + * drm_gpuva_extobj_put - decrease the reference count of an external
> > + * &drm_gem_object
> > + * @mgr: the &drm_gpuva_manager storing the extobj
> > + * @obj: the &drm_gem_object representing the extobj
> > + *
> > + * Decreases the reference count of the extobj represented by @obj.
> > + *
> > + * Drivers should call this for every &drm_gpuva backed by a &drm_gem_object
> > + * being removed from the GPU VA space.
> > + *
> > + * See also drm_gpuva_unmap_put().
> > + */
> > +void
> > +drm_gpuva_extobj_put(struct drm_gpuva_manager *mgr,
> > +		     struct drm_gem_object *obj)
> > +{
> > +	if (drm_gpuva_is_extobj(mgr, obj))
> > +		__drm_gpuva_extobj_remove(mgr, obj);
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_put);
> > +
> > +/**
> > + * drm_gpuva_gem_evict() - add / remove a &drm_gem_object to / from a
> > + * &drm_gpuva_managers evicted list
> > + * @obj: the &drm_gem_object to add or remove
> > + * @evict: indicates whether the object is evicted
> > + *
> > + * Adds a &drm_gem_object to or removes it from all &drm_gpuva_managers evicted
> > + * list containing a mapping of this &drm_gem_object.
> > + */
> > +void
> > +drm_gpuva_gem_evict(struct drm_gem_object *obj, bool evict)
> > +{
> > +	struct drm_gpuva_gem *vm_bo;
> > +
> > +	/* Required for iterating the GEMs GPUVA GEM list. If no driver specific
> > +	 * lock has been set, the list is protected with the GEMs dma-resv lock.
> > +	 */
> > +	drm_gem_gpuva_assert_lock_held(obj);
> > +
> > +	/* Required to protect the GPUVA managers evict list against concurrent
> > +	 * access through drm_gpuva_manager_validate(). Concurrent insertions to
> > +	 * the evict list through different GEM object evictions are protected
> > +	 * by the GPUVA managers evict lock.
> > +	 */
> > +	dma_resv_assert_held(obj->resv);
> > +
> > +	drm_gem_for_each_gpuva_gem(vm_bo, obj) {
> > +		struct drm_gpuva_manager *mgr = vm_bo->mgr;
> > +
> > +		spin_lock(&mgr->evict.lock);
> > +		if (evict)
> > +			list_add_tail(&vm_bo->list.entry.evict,
> > +				      &mgr->evict.list);
> > +		else
> > +			list_del_init(&vm_bo->list.entry.evict);
> > +		spin_unlock(&mgr->evict.lock);
> > +	}
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_evict);
> > +
> >   static int
> >   __drm_gpuva_insert(struct drm_gpuva_manager *mgr,
> >   		   struct drm_gpuva *va)
> > @@ -806,15 +1381,20 @@ EXPORT_SYMBOL_GPL(drm_gpuva_remove);
> >   /**
> >    * drm_gpuva_link() - link a &drm_gpuva
> >    * @va: the &drm_gpuva to link
> > + * @vm_bo: the &drm_gpuva_gem to add the &drm_gpuva to
> >    *
> > - * This adds the given &va to the GPU VA list of the &drm_gem_object it is
> > - * associated with.
> > + * This adds the given &va to the GPU VA list of the &drm_gpuva_gem and the
> > + * &drm_gpuva_gem to the &drm_gem_object it is associated with.
> > + *
> > + * For every &drm_gpuva entry added to the &drm_gpuva_gem an additional
> > + * reference of the latter is taken.
> >    *
> >    * This function expects the caller to protect the GEM's GPUVA list against
> > - * concurrent access using the GEMs dma_resv lock.
> > + * concurrent access using either the GEMs dma_resv lock or a driver specific
> > + * lock set through drm_gem_gpuva_set_lock().
> >    */
> >   void
> > -drm_gpuva_link(struct drm_gpuva *va)
> > +drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo)
> >   {
> >   	struct drm_gem_object *obj = va->gem.obj;
> > @@ -823,7 +1403,10 @@ drm_gpuva_link(struct drm_gpuva *va)
> >   	drm_gem_gpuva_assert_lock_held(obj);
> > -	list_add_tail(&va->gem.entry, &obj->gpuva.list);
> > +	drm_gpuva_gem_get(vm_bo);
> > +	list_add_tail(&va->gem.entry, &vm_bo->list.gpuva);
> > +	if (list_empty(&vm_bo->list.entry.gem))
> > +		list_add_tail(&vm_bo->list.entry.gem, &obj->gpuva.list);
> >   }
> >   EXPORT_SYMBOL_GPL(drm_gpuva_link);
> > @@ -834,20 +1417,39 @@ EXPORT_SYMBOL_GPL(drm_gpuva_link);
> >    * This removes the given &va from the GPU VA list of the &drm_gem_object it is
> >    * associated with.
> >    *
> > + * This removes the given &va from the GPU VA list of the &drm_gpuva_gem and
> > + * the &drm_gpuva_gem from the &drm_gem_object it is associated with in case
> > + * this call unlinks the last &drm_gpuva from the &drm_gpuva_gem.
> > + *
> > + * For every &drm_gpuva entry removed from the &drm_gpuva_gem a reference of
> > + * the latter is dropped.
> > + *
> >    * This function expects the caller to protect the GEM's GPUVA list against
> > - * concurrent access using the GEMs dma_resv lock.
> > + * concurrent access using either the GEMs dma_resv lock or a driver specific
> > + * lock set through drm_gem_gpuva_set_lock().
> >    */
> >   void
> >   drm_gpuva_unlink(struct drm_gpuva *va)
> >   {
> >   	struct drm_gem_object *obj = va->gem.obj;
> > +	struct drm_gpuva_gem *vm_bo;
> >   	if (unlikely(!obj))
> >   		return;
> >   	drm_gem_gpuva_assert_lock_held(obj);
> > +	vm_bo = __drm_gpuva_gem_find(va->mgr, obj);
> > +	if (WARN(!vm_bo, "GPUVA doesn't seem to be linked.\n"))
> > +		return;
> > +
> >   	list_del_init(&va->gem.entry);
> > +
> > +	if (list_empty(&vm_bo->list.gpuva)) {
> > +		list_del_init(&vm_bo->list.entry.gem);
> > +		list_del_init(&vm_bo->list.entry.evict);
> > +	}
> > +	drm_gpuva_gem_put(vm_bo);
> >   }
> >   EXPORT_SYMBOL_GPL(drm_gpuva_unlink);
> > @@ -977,6 +1579,26 @@ drm_gpuva_map(struct drm_gpuva_manager *mgr,
> >   }
> >   EXPORT_SYMBOL_GPL(drm_gpuva_map);
> > +/**
> > + * drm_gpuva_map_get() - helper to insert a &drm_gpuva according to a
> > + * &drm_gpuva_op_map
> > + * @mgr: the &drm_gpuva_manager
> > + * @va: the &drm_gpuva to insert
> > + * @op: the &drm_gpuva_op_map to initialize @va with
> > + *
> > + * Initializes the @va from the @op and inserts it into the given @mgr and
> > + * increases the reference count of the corresponding extobj.
> > + */
> > +void
> > +drm_gpuva_map_get(struct drm_gpuva_manager *mgr,
> > +		  struct drm_gpuva *va,
> > +		  struct drm_gpuva_op_map *op)
> > +{
> > +	drm_gpuva_map(mgr, va, op);
> > +	drm_gpuva_extobj_get(mgr, va->gem.obj);
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_map_get);
> > +
> >   /**
> >    * drm_gpuva_remap() - helper to remap a &drm_gpuva according to a
> >    * &drm_gpuva_op_remap
> > @@ -992,10 +1614,10 @@ drm_gpuva_remap(struct drm_gpuva *prev,
> >   		struct drm_gpuva *next,
> >   		struct drm_gpuva_op_remap *op)
> >   {
> > -	struct drm_gpuva *curr = op->unmap->va;
> > -	struct drm_gpuva_manager *mgr = curr->mgr;
> > +	struct drm_gpuva *va = op->unmap->va;
> > +	struct drm_gpuva_manager *mgr = va->mgr;
> > -	drm_gpuva_remove(curr);
> > +	drm_gpuva_remove(va);
> >   	if (op->prev) {
> >   		drm_gpuva_init_from_op(prev, op->prev);
> > @@ -1009,6 +1631,31 @@ drm_gpuva_remap(struct drm_gpuva *prev,
> >   }
> >   EXPORT_SYMBOL_GPL(drm_gpuva_remap);
> > +/**
> > + * drm_gpuva_remap_get() - helper to remap a &drm_gpuva according to a
> > + * &drm_gpuva_op_remap
> > + * @prev: the &drm_gpuva to remap when keeping the start of a mapping
> > + * @next: the &drm_gpuva to remap when keeping the end of a mapping
> > + * @op: the &drm_gpuva_op_remap to initialize @prev and @next with
> > + *
> > + * Removes the currently mapped &drm_gpuva and remaps it using @prev and/or
> > + * @next. Additionally, if the re-map splits the existing &drm_gpuva into two
> > + * separate mappings, increases the reference count of the corresponding extobj.
> > + */
> > +void
> > +drm_gpuva_remap_get(struct drm_gpuva *prev,
> > +		    struct drm_gpuva *next,
> > +		    struct drm_gpuva_op_remap *op)
> > +{
> > +	struct drm_gpuva *va = op->unmap->va;
> > +	struct drm_gpuva_manager *mgr = va->mgr;
> > +
> > +	drm_gpuva_remap(prev, next, op);
> > +	if (op->prev && op->next)
> > +		drm_gpuva_extobj_get(mgr, va->gem.obj);
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_remap_get);
> > +
> >   /**
> >    * drm_gpuva_unmap() - helper to remove a &drm_gpuva according to a
> >    * &drm_gpuva_op_unmap
> > @@ -1023,6 +1670,24 @@ drm_gpuva_unmap(struct drm_gpuva_op_unmap *op)
> >   }
> >   EXPORT_SYMBOL_GPL(drm_gpuva_unmap);
> > +/**
> > + * drm_gpuva_unmap_put() - helper to remove a &drm_gpuva according to a
> > + * &drm_gpuva_op_unmap
> > + * @op: the &drm_gpuva_op_unmap specifying the &drm_gpuva to remove
> > + *
> > + * Removes the &drm_gpuva associated with the &drm_gpuva_op_unmap and decreases
> > + * the reference count of the corresponding extobj.
> > + */
> > +void
> > +drm_gpuva_unmap_put(struct drm_gpuva_op_unmap *op)
> > +{
> > +	struct drm_gpuva *va = op->va;
> > +
> > +	drm_gpuva_unmap(op);
> > +	drm_gpuva_extobj_put(va->mgr, va->gem.obj);
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_unmap_put);
> > +
> >   static int
> >   op_map_cb(const struct drm_gpuva_fn_ops *fn, void *priv,
> >   	  u64 addr, u64 range,
> > @@ -1663,6 +2328,7 @@ drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
> >   {
> >   	struct drm_gpuva_ops *ops;
> >   	struct drm_gpuva_op *op;
> > +	struct drm_gpuva_gem *vm_bo;
> >   	struct drm_gpuva *va;
> >   	int ret;
> > @@ -1674,7 +2340,7 @@ drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
> >   	INIT_LIST_HEAD(&ops->list);
> > -	drm_gem_for_each_gpuva(va, obj) {
> > +	drm_gem_for_each_gpuva(va, vm_bo, mgr, obj) {
> >   		op = gpuva_op_alloc(mgr);
> >   		if (!op) {
> >   			ret = -ENOMEM;
> > diff --git a/include/drm/drm_gem.h b/include/drm/drm_gem.h
> > index bc9f6aa2f3fe..783ed3ab440d 100644
> > --- a/include/drm/drm_gem.h
> > +++ b/include/drm/drm_gem.h
> > @@ -571,7 +571,7 @@ int drm_gem_evict(struct drm_gem_object *obj);
> >    * drm_gem_gpuva_init() - initialize the gpuva list of a GEM object
> >    * @obj: the &drm_gem_object
> >    *
> > - * This initializes the &drm_gem_object's &drm_gpuva list.
> > + * This initializes the &drm_gem_object's &drm_gpuva_gem list.
> >    *
> >    * Calling this function is only necessary for drivers intending to support the
> >    * &drm_driver_feature DRIVER_GEM_GPUVA.
> > @@ -584,28 +584,44 @@ static inline void drm_gem_gpuva_init(struct drm_gem_object *obj)
> >   }
> >   /**
> > - * drm_gem_for_each_gpuva() - iternator to walk over a list of gpuvas
> > - * @entry__: &drm_gpuva structure to assign to in each iteration step
> > - * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
> > + * drm_gem_for_each_gpuva_gem() - iterator to walk over a list of &drm_gpuva_gem
> > + * @entry__: &drm_gpuva_gem structure to assign to in each iteration step
> > + * @obj__: the &drm_gem_object the &drm_gpuva_gem to walk are associated with
> >    *
> > - * This iterator walks over all &drm_gpuva structures associated with the
> > - * &drm_gpuva_manager.
> > + * This iterator walks over all &drm_gpuva_gem structures associated with the
> > + * &drm_gem_object.
> >    */
> > -#define drm_gem_for_each_gpuva(entry__, obj__) \
> > -	list_for_each_entry(entry__, &(obj__)->gpuva.list, gem.entry)
> > +#define drm_gem_for_each_gpuva_gem(entry__, obj__) \
> > +	list_for_each_entry(entry__, &(obj__)->gpuva.list, list.entry.gem)
> >   /**
> > - * drm_gem_for_each_gpuva_safe() - iternator to safely walk over a list of
> > - * gpuvas
> > - * @entry__: &drm_gpuva structure to assign to in each iteration step
> > - * @next__: &next &drm_gpuva to store the next step
> > - * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
> > + * drm_gem_for_each_gpuva_gem_safe() - iterator to safely walk over a list of
> > + * &drm_gpuva_gem
> > + * @entry__: &drm_gpuva_gem structure to assign to in each iteration step
> > + * @next__: &next &drm_gpuva_gem to store the next step
> > + * @obj__: the &drm_gem_object the &drm_gpuva_gem to walk are associated with
> >    *
> > - * This iterator walks over all &drm_gpuva structures associated with the
> > + * This iterator walks over all &drm_gpuva_gem structures associated with the
> >    * &drm_gem_object. It is implemented with list_for_each_entry_safe(), hence
> >    * it is save against removal of elements.
> >    */
> > -#define drm_gem_for_each_gpuva_safe(entry__, next__, obj__) \
> > -	list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list, gem.entry)
> > +#define drm_gem_for_each_gpuva_gem_safe(entry__, next__, obj__) \
> > +	list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list, list.entry.gem)
> > +
> > +/**
> > + * drm_gem_for_each_gpuva() - iterator to walk over a list of &drm_gpuva
> > + * @va__: &drm_gpuva structure to assign to in each iteration step
> > + * @vm_bo__: the &drm_gpuva_gem representing the @mgr__ and @obj__ combination
> > + * @mgr__: the &drm_gpuva_manager the &drm_gpuvas to walk are associated with
> > + * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
> > + *
> > + * This iterator walks over all &drm_gpuva structures associated with the
> > + * &drm_gpuva_manager and &drm_gem_object.
> > + */
> > +#define drm_gem_for_each_gpuva(va__, vm_bo__, mgr__, obj__) \
> > +	for (vm_bo__ = drm_gpuva_gem_find(mgr__, obj__), \
> > +	     va__ = vm_bo__ ? list_first_entry(&vm_bo__->list.gpuva, typeof(*va__), gem.entry) : NULL; \
> > +	     va__ && !list_entry_is_head(va__, &vm_bo__->list.gpuva, gem.entry); \
> > +	     va__ = list_next_entry(va__, gem.entry))
> >   #endif /* __DRM_GEM_H__ */
> > diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h
> > index ed8d50200cc3..693e2da3f425 100644
> > --- a/include/drm/drm_gpuva_mgr.h
> > +++ b/include/drm/drm_gpuva_mgr.h
> > @@ -26,12 +26,16 @@
> >    */
> >   #include <linux/list.h>
> > +#include <linux/dma-resv.h>
> > +#include <linux/maple_tree.h>
> >   #include <linux/rbtree.h>
> >   #include <linux/types.h>
> >   #include <drm/drm_gem.h>
> > +#include <drm/drm_exec.h>
> >   struct drm_gpuva_manager;
> > +struct drm_gpuva_gem;
> >   struct drm_gpuva_fn_ops;
> >   /**
> > @@ -140,7 +144,7 @@ struct drm_gpuva {
> >   int drm_gpuva_insert(struct drm_gpuva_manager *mgr, struct drm_gpuva *va);
> >   void drm_gpuva_remove(struct drm_gpuva *va);
> > -void drm_gpuva_link(struct drm_gpuva *va);
> > +void drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo);
> >   void drm_gpuva_unlink(struct drm_gpuva *va);
> >   struct drm_gpuva *drm_gpuva_find(struct drm_gpuva_manager *mgr,
> > @@ -240,15 +244,137 @@ struct drm_gpuva_manager {
> >   	 * @ops: &drm_gpuva_fn_ops providing the split/merge steps to drivers
> >   	 */
> >   	const struct drm_gpuva_fn_ops *ops;
> > +
> > +	/**
> > +	 * @d_obj: Dummy GEM object; used internally to pass the GPU VMs
> > +	 * dma-resv to &drm_exec.
> > +	 */
> > +	struct drm_gem_object d_obj;
> > +
> > +	/**
> > +	 * @resv: the &dma_resv for &drm_gem_objects mapped in this GPU VA
> > +	 * space
> > +	 */
> > +	struct dma_resv *resv;
> > +
> > +	/**
> > +	 * @exec: the &drm_exec helper to lock external &drm_gem_objects
> > +	 */
> > +	struct drm_exec exec;
> > +
> > +	/**
> > +	 * @mt_ext: &maple_tree storing external &drm_gem_objects
> > +	 */
> > +	struct maple_tree mt_ext;
> > +
> > +	/**
> > +	 * @evict: structure holding the evict list and evict list lock
> > +	 */
> > +	struct {
> > +		/**
> > +		 * @list: &list_head storing &drm_gem_objects currently being
> > +		 * evicted
> > +		 */
> > +		struct list_head list;
> > +
> > +		/**
> > +		 * @lock: spinlock to protect the evict list against concurrent
> > +		 * insertion / removal of different &drm_gpuva_gems
> > +		 */
> > +		spinlock_t lock;
> > +	} evict;
> >   };
> >   void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> > +			    struct drm_device *drm,
> >   			    const char *name,
> >   			    u64 start_offset, u64 range,
> >   			    u64 reserve_offset, u64 reserve_range,
> >   			    const struct drm_gpuva_fn_ops *ops);
> >   void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr);
> > +/**
> > + * DRM_GPUVA_EXEC - returns the &drm_gpuva_managers &drm_exec instance
> > + * @mgr: the &drm_gpuva_manager to return the &drm_exec instance for
> > + */
> > +#define DRM_GPUVA_EXEC(mgr)	(&(mgr)->exec)
> > +
> > +int drm_gpuva_manager_lock_extra(struct drm_gpuva_manager *mgr,
> > +				 int (*fn)(struct drm_gpuva_manager *mgr,
> > +					   void *priv, unsigned int num_fences),
> > +				 void *priv,
> > +				 unsigned int num_fences,
> > +				 bool interruptible);
> > +
> > +int drm_gpuva_manager_lock_array(struct drm_gpuva_manager *mgr,
> > +				 struct drm_gem_object **objs,
> > +				 unsigned int num_objs,
> > +				 unsigned int num_fences,
> > +				 bool interruptible);
> > +
> > +/**
> > + * drm_gpuva_manager_lock() - lock all dma-resv of all associated BOs
> > + * @mgr: the &drm_gpuva_manager
> > + * @num_fences: the amount of &dma_fences to reserve
> > + * @interruptible: sleep interruptible if waiting
> > + *
> > + * Acquires all dma-resv locks of all &drm_gem_objects the given
> > + * &drm_gpuva_manager contains mappings of.
> > + *
> > + * Returns: 0 on success, negative error code on failure.
> > + */
> > +static inline int
> > +drm_gpuva_manager_lock(struct drm_gpuva_manager *mgr,
> > +		       unsigned int num_fences,
> > +		       bool interruptible)
> > +{
> > +	return drm_gpuva_manager_lock_extra(mgr, NULL, NULL, num_fences,
> > +					    interruptible);
> > +}
> > +
> > +/**
> > + * drm_gpuva_manager_unlock() - unlock all dma-resv of all associated BOs
> > + * @mgr: the &drm_gpuva_manager
> > + *
> > + * Releases all dma-resv locks of all &drm_gem_objects previously acquired
> > + * through drm_gpuva_manager_lock() or its variants.
> > + */
> > +static inline void
> > +drm_gpuva_manager_unlock(struct drm_gpuva_manager *mgr)
> > +{
> > +	drm_exec_fini(&mgr->exec);
> > +}
> > +
> > +int drm_gpuva_manager_validate(struct drm_gpuva_manager *mgr);
> > +void drm_gpuva_manager_resv_add_fence(struct drm_gpuva_manager *mgr,
> > +				      struct dma_fence *fence,
> > +				      enum dma_resv_usage private_usage,
> > +				      enum dma_resv_usage extobj_usage);
> > +
> > +int drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
> > +			    struct drm_gem_object *obj);
> > +void drm_gpuva_extobj_get(struct drm_gpuva_manager *mgr,
> > +			  struct drm_gem_object *obj);
> > +void drm_gpuva_extobj_put(struct drm_gpuva_manager *mgr,
> > +			  struct drm_gem_object *obj);
> > +
> > +/**
> > + * drm_gpuva_is_extobj() - indicates whether the given &drm_gem_object is an
> > + * external object
> > + * @mgr: the &drm_gpuva_manager to check
> > + * @obj: the &drm_gem_object to check
> > + *
> > + * Returns: true if the &drm_gem_object &dma_resv differs from the
> > + * &drm_gpuva_managers &dma_resv, false otherwise
> > + */
> > +static inline bool drm_gpuva_is_extobj(struct drm_gpuva_manager *mgr,
> > +				       struct drm_gem_object *obj)
> > +{
> > +	return obj && obj->resv != mgr->resv;
> > +}
> > +
> >   static inline struct drm_gpuva *
> >   __drm_gpuva_next(struct drm_gpuva *va)
> >   {
> > @@ -327,6 +453,138 @@ __drm_gpuva_next(struct drm_gpuva *va)
> >   #define drm_gpuva_for_each_va_safe(va__, next__, mgr__) \
> >   	list_for_each_entry_safe(va__, next__, &(mgr__)->rb.list, rb.entry)
> > +/**
> > + * struct drm_gpuva_gem - structure representing a &drm_gpuva_manager and
> > + * &drm_gem_object combination
> > + *
> > + * This structure is an abstraction representing a &drm_gpuva_manager and
> > + * &drm_gem_object combination. It serves as an indirection to accelerate
> > + * iterating all &drm_gpuvas within a &drm_gpuva_manager backed by the same
> > + * &drm_gem_object.
> > + *
> > + * Furthermore, it is used to cache evicted GEM objects for a certain GPU-VM to
> > + * accelerate validation.
> > + *
> > + * Typically, drivers want to create an instance of a struct drm_gpuva_gem once
> > + * a GEM object is mapped first in a GPU-VM and release the instance once the
> > + * last mapping of the GEM object in this GPU-VM is unmapped.
> > + */
> > +struct drm_gpuva_gem {
> > +
> > +	/**
> > +	 * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> > +	 */
> > +	struct drm_gpuva_manager *mgr;
> > +
> > +	/**
> > +	 * @obj: The &drm_gem_object being mapped in the @mgr.
> > +	 */
> > +	struct drm_gem_object *obj;
> > +
> > +	/**
> > +	 * @kref: The reference count for this &drm_gpuva_gem.
> > +	 */
> > +	struct kref kref;
> > +
> > +	/**
> > +	 * @list: Structure containing all &list_heads.
> > +	 */
> > +	struct {
> > +		/**
> > +		 * @gpuva: The list of linked &drm_gpuvas.
> > +		 */
> > +		struct list_head gpuva;
> > +
> > +		/**
> > +		 * @entry: Structure containing all &list_heads serving as
> > +		 * entry.
> > +		 */
> > +		struct {
> > +			/**
> > +			 * @gem: List entry to attach to the &drm_gem_objects
> > +			 * gpuva list.
> > +			 */
> > +			struct list_head gem;
> > +
> > +			/**
> > +			 * @evict: List entry to attach to the
> > +			 * &drm_gpuva_managers evict list.
> > +			 */
> > +			struct list_head evict;
> > +		} entry;
> > +	} list;
> > +};
> > +
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_obtain(struct drm_gpuva_manager *mgr,
> > +		     struct drm_gem_object *obj);
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_obtain_prealloc(struct drm_gpuva_manager *mgr,
> > +			      struct drm_gem_object *obj,
> > +			      struct drm_gpuva_gem *__vm_bo);
> > +
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
> > +		   struct drm_gem_object *obj);
> > +
> > +void drm_gpuva_gem_evict(struct drm_gem_object *obj, bool evict);
> > +
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_create(struct drm_gpuva_manager *mgr,
> > +		     struct drm_gem_object *obj);
> > +void drm_gpuva_gem_destroy(struct kref *kref);
> > +
> > +/**
> > + * drm_gpuva_gem_get() - acquire a struct drm_gpuva_gem reference
> > + * @vm_bo: the &drm_gpuva_gem to acquire the reference of
> > + *
> > + * This function acquires an additional reference to @vm_bo. It is illegal to
> > + * call this without already holding a reference. No locks required.
> > + */
> > +static inline struct drm_gpuva_gem *
> > +drm_gpuva_gem_get(struct drm_gpuva_gem *vm_bo)
> > +{
> > +	kref_get(&vm_bo->kref);
> > +	return vm_bo;
> > +}
> > +
> > +/**
> > + * drm_gpuva_gem_put() - drop a struct drm_gpuva_gem reference
> > + * @vm_bo: the &drm_gpuva_gem to release the reference of
> > + *
> > + * This releases a reference to @vm_bo.
> > + */
> > +static inline void
> > +drm_gpuva_gem_put(struct drm_gpuva_gem *vm_bo)
> > +{
> > +	kref_put(&vm_bo->kref, drm_gpuva_gem_destroy);
> > +}
> > +
> > +/**
> > + * drm_gpuva_gem_for_each_va() - iterator to walk over a list of &drm_gpuva
> > + * @va__: &drm_gpuva structure to assign to in each iteration step
> > + * @vm_bo__: the &drm_gpuva_gem the &drm_gpuva to walk are associated with
> > + *
> > + * This iterator walks over all &drm_gpuva structures associated with the
> > + * &drm_gpuva_gem.
> > + */
> > +#define drm_gpuva_gem_for_each_va(va__, vm_bo__) \
> > +	list_for_each_entry(va__, &(vm_bo__)->list.gpuva, gem.entry)
> > +
> > +/**
> > + * drm_gpuva_gem_for_each_va_safe() - iterator to safely walk over a list of
> > + * &drm_gpuva
> > + * @va__: &drm_gpuva structure to assign to in each iteration step
> > + * @next__: &next &drm_gpuva to store the next step
> > + * @vm_bo__: the &drm_gpuva_gem the &drm_gpuva to walk are associated with
> > + *
> > + * This iterator walks over all &drm_gpuva structures associated with the
> > + * &drm_gpuva_gem. It is implemented with list_for_each_entry_safe(), hence
> > + * it is save against removal of elements.
> > + */
> > +#define drm_gpuva_gem_for_each_va_safe(va__, next__, vm_bo__) \
> > +	list_for_each_entry_safe(va__, next__, &(vm_bo__)->list.gpuva, gem.entry)
> > +
> >   /**
> >    * enum drm_gpuva_op_type - GPU VA operation type
> >    *
> > @@ -641,6 +899,30 @@ struct drm_gpuva_fn_ops {
> >   	 */
> >   	void (*op_free)(struct drm_gpuva_op *op);
> > +	/**
> > +	 * @vm_bo_alloc: called when the &drm_gpuva_manager allocates
> > +	 * a struct drm_gpuva_gem
> > +	 *
> > +	 * Some drivers may want to embed struct drm_gpuva_gem into driver
> > +	 * specific structures. By implementing this callback drivers can
> > +	 * allocate memory accordingly.
> > +	 *
> > +	 * This callback is optional.
> > +	 */
> > +	struct drm_gpuva_gem *(*vm_bo_alloc)(void);
> > +
> > +	/**
> > +	 * @vm_bo_free: called when the &drm_gpuva_manager frees a
> > +	 * struct drm_gpuva_gem
> > +	 *
> > +	 * Some drivers may want to embed struct drm_gpuva_gem into driver
> > +	 * specific structures. By implementing this callback drivers can
> > +	 * free the previously allocated memory accordingly.
> > +	 *
> > +	 * This callback is optional.
> > +	 */
> > +	void (*vm_bo_free)(struct drm_gpuva_gem *vm_bo);
> > +
> >   	/**
> >   	 * @sm_step_map: called from &drm_gpuva_sm_map to finally insert the
> >   	 * mapping once all previous steps were completed
> > @@ -684,6 +966,17 @@ struct drm_gpuva_fn_ops {
> >   	 * used.
> >   	 */
> >   	int (*sm_step_unmap)(struct drm_gpuva_op *op, void *priv);
> > +
> > +	/**
> > +	 * @bo_validate: called from drm_gpuva_manager_validate()
> > +	 *
> > +	 * Drivers receive this callback for every evicted &drm_gem_object being
> > +	 * mapped in the corresponding &drm_gpuva_manager.
> > +	 *
> > +	 * Typically, drivers would call their driver specific variant of
> > +	 * ttm_bo_validate() from within this callback.
> > +	 */
> > +	int (*bo_validate)(struct drm_gem_object *obj);
> >   };
> >   int drm_gpuva_sm_map(struct drm_gpuva_manager *mgr, void *priv,
> > @@ -696,11 +989,18 @@ int drm_gpuva_sm_unmap(struct drm_gpuva_manager *mgr, void *priv,
> >   void drm_gpuva_map(struct drm_gpuva_manager *mgr,
> >   		   struct drm_gpuva *va,
> >   		   struct drm_gpuva_op_map *op);
> > +void drm_gpuva_map_get(struct drm_gpuva_manager *mgr,
> > +		       struct drm_gpuva *va,
> > +		       struct drm_gpuva_op_map *op);
> >   void drm_gpuva_remap(struct drm_gpuva *prev,
> >   		     struct drm_gpuva *next,
> >   		     struct drm_gpuva_op_remap *op);
> > +void drm_gpuva_remap_get(struct drm_gpuva *prev,
> > +			 struct drm_gpuva *next,
> > +			 struct drm_gpuva_op_remap *op);
> >   void drm_gpuva_unmap(struct drm_gpuva_op_unmap *op);
> > +void drm_gpuva_unmap_put(struct drm_gpuva_op_unmap *op);
> >   #endif /* __DRM_GPUVA_MGR_H__ */
> 


^ permalink raw reply	[flat|nested] 88+ messages in thread

* Re: [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
@ 2023-08-30 13:05       ` Danilo Krummrich
  0 siblings, 0 replies; 88+ messages in thread
From: Danilo Krummrich @ 2023-08-30 13:05 UTC (permalink / raw)
  To: Christian König
  Cc: airlied, daniel, matthew.brost, thomas.hellstrom, sarah.walker,
	donald.robson, boris.brezillon, faith.ekstrand, bskeggs,
	Liam.Howlett, dri-devel, nouveau, linux-kernel

On Wed, Aug 30, 2023 at 09:48:02AM +0200, Christian König wrote:
> 
> 
> Am 20.08.23 um 23:53 schrieb Danilo Krummrich:
> > So far the DRM GPUVA manager offers common infrastructure to track GPU VA
> > allocations and mappings, generically connect GPU VA mappings to their
> > backing buffers and perform more complex mapping operations on the GPU VA
> > space.
> > 
> > However, there are more design patterns commonly used by drivers, which
> > can potentially be generalized in order to make the DRM GPUVA manager
> > represent a basic GPU-VM implementation. In this context, this patch aims
> > at generalizing the following elements.
> > 
> > 1) Provide a common dma-resv for GEM objects not being used outside of
> >     this GPU-VM.
> > 
> > 2) Provide tracking of external GEM objects (GEM objects which are
> >     shared with other GPU-VMs).
> > 
> > 3) Provide functions to efficiently lock all GEM objects dma-resv the
> >     GPU-VM contains mappings of.
> > 
> > 4) Provide tracking of evicted GEM objects the GPU-VM contains mappings
> >     of, such that validation of evicted GEM objects is accelerated.
> > 
> > 5) Provide some convenience functions for common patterns.
> 
> Interesting work.
> 
> You basically implement a bunch of the ideas I came up with to improve the
> amdgpu performance in the common manager now. That was one of the remaining
> blockers I had for using this in amdgpu.
> 
> Question is for example how do you track evictions? E.g. we don't have a
> common concept of eviction in GEM as far as I know. Or is the driver
> responsible for giving those notifications to the GPUVA manager?

Right, it is the driver's responsibility to add a drm_gpuva_gem (or VM_BO)
to the manager's evict list.

The idea was that drivers have control over the state of a drm_gpuva_gem, such
that a driver can move it to driver specific lists as well, like all the ones
you have in amdgpu.
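
Just to illustrate the driver side, a rough and untested sketch - struct
driver_bo and the move_notify() wrapper are made up here, only
drm_gpuva_gem_evict() is from this series:

	static void driver_bo_move_notify(struct driver_bo *bo, bool evicted)
	{
		struct drm_gem_object *obj = &bo->base;

		/* drm_gpuva_gem_evict() requires the GEM's dma-resv lock (and,
		 * if set, the driver specific GPUVA lock) to be held.
		 */
		dma_resv_assert_held(obj->resv);

		/* Add to / remove from the evict list of every GPU-VM this BO
		 * has mappings in.
		 */
		drm_gpuva_gem_evict(obj, evicted);
	}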

> 
> And would it be possible to lock only a specific area of the VM, e.g. every
> BO mapped in the interval X..Y?

Currently, the drm_gpuva_manager_lock() functions always lock the GPU-VM's
dma-resv lock, plus all the dma-resv locks of the external objects the manager
keeps track of.
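
For reference, the flow I'd expect in a driver's job submission path, roughly
(error handling trimmed, struct driver_job and the chosen dma_resv_usage values
are just placeholders):

	int driver_submit(struct driver_job *job)
	{
		struct drm_gpuva_manager *mgr = job->mgr;
		int ret;

		/* Locks the GPU-VM's dma-resv plus the dma-resv of all extobjs. */
		ret = drm_gpuva_manager_lock(mgr, 1, true);
		if (ret)
			return ret;

		/* Calls the bo_validate() callback for all evicted BOs. */
		ret = drm_gpuva_manager_validate(mgr);
		if (ret) {
			drm_gpuva_manager_unlock(mgr);
			return ret;
		}

		/* ... push the job to the HW ring ... */

		drm_gpuva_manager_resv_add_fence(mgr, job->done_fence,
						 DMA_RESV_USAGE_BOOKKEEP,
						 DMA_RESV_USAGE_BOOKKEEP);
		drm_gpuva_manager_unlock(mgr);

		return 0;
	}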

But surely, we could also add something like drm_gpuva_manager_lock_range()
where we just iterate all drm_gpuvas between X and Y and lock the dma-resv
locks of each drm_gpuva's backing BO.
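
Completely untested, but assuming the existing drm_gpuva_for_each_va_range()
iterator, such a helper could look roughly like the following - it's not part
of this series, just to illustrate the idea:

	int drm_gpuva_manager_lock_range(struct drm_gpuva_manager *mgr,
					 u64 addr, u64 range,
					 unsigned int num_fences,
					 bool interruptible)
	{
		struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
		struct drm_gpuva *va;
		int ret;

		drm_exec_init(exec, DRM_EXEC_IGNORE_DUPLICATES |
			      (interruptible ? DRM_EXEC_INTERRUPTIBLE_WAIT : 0));

		drm_exec_until_all_locked(exec) {
			drm_gpuva_for_each_va_range(va, mgr, addr, addr + range) {
				/* Sparse mappings have no backing BO to lock. */
				if (!va->gem.obj)
					continue;

				ret = drm_exec_prepare_obj(exec, va->gem.obj,
							   num_fences);
				drm_exec_retry_on_contention(exec);
				if (ret) {
					drm_exec_fini(exec);
					return ret;
				}
			}
		}

		return 0;
	}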

> 
> Regards,
> Christian.
> 
> > 
> > Rather than being designed as a "framework", the target is to make all
> > features appear as a collection of optional helper functions, such that
> > drivers are free to make use of the DRM GPUVA managers basic
> > functionality and opt-in for other features without setting any feature
> > flags, just by making use of the corresponding functions.
> > 
> > Signed-off-by: Danilo Krummrich <dakr@redhat.com>
> > ---
> >   drivers/gpu/drm/drm_gpuva_mgr.c | 688 +++++++++++++++++++++++++++++++-
> >   include/drm/drm_gem.h           |  48 ++-
> >   include/drm/drm_gpuva_mgr.h     | 302 +++++++++++++-
> >   3 files changed, 1010 insertions(+), 28 deletions(-)
> > 
> > diff --git a/drivers/gpu/drm/drm_gpuva_mgr.c b/drivers/gpu/drm/drm_gpuva_mgr.c
> > index f86bfad74ff8..69872b205961 100644
> > --- a/drivers/gpu/drm/drm_gpuva_mgr.c
> > +++ b/drivers/gpu/drm/drm_gpuva_mgr.c
> > @@ -655,6 +655,7 @@ drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
> >   /**
> >    * drm_gpuva_manager_init() - initialize a &drm_gpuva_manager
> >    * @mgr: pointer to the &drm_gpuva_manager to initialize
> > + * @drm: the drivers &drm_device
> >    * @name: the name of the GPU VA space
> >    * @start_offset: the start offset of the GPU VA space
> >    * @range: the size of the GPU VA space
> > @@ -669,6 +670,7 @@ drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
> >    */
> >   void
> >   drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> > +		       struct drm_device *drm,
> >   		       const char *name,
> >   		       u64 start_offset, u64 range,
> >   		       u64 reserve_offset, u64 reserve_range,
> > @@ -677,6 +679,11 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> >   	mgr->rb.tree = RB_ROOT_CACHED;
> >   	INIT_LIST_HEAD(&mgr->rb.list);
> > +	mt_init(&mgr->mt_ext);
> > +
> > +	INIT_LIST_HEAD(&mgr->evict.list);
> > +	spin_lock_init(&mgr->evict.lock);
> > +
> >   	drm_gpuva_check_overflow(start_offset, range);
> >   	mgr->mm_start = start_offset;
> >   	mgr->mm_range = range;
> > @@ -694,6 +701,9 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> >   						     reserve_range)))
> >   			__drm_gpuva_insert(mgr, &mgr->kernel_alloc_node);
> >   	}
> > +
> > +	drm_gem_private_object_init(drm, &mgr->d_obj, 0);
> > +	mgr->resv = mgr->d_obj.resv;
> >   }
> >   EXPORT_SYMBOL_GPL(drm_gpuva_manager_init);
> > @@ -713,10 +723,575 @@ drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr)
> >   		__drm_gpuva_remove(&mgr->kernel_alloc_node);
> >   	WARN(!RB_EMPTY_ROOT(&mgr->rb.tree.rb_root),
> > -	     "GPUVA tree is not empty, potentially leaking memory.");
> > +	     "GPUVA tree is not empty, potentially leaking memory.\n");
> > +
> > +	mtree_destroy(&mgr->mt_ext);
> > +	WARN(!list_empty(&mgr->evict.list), "Evict list should be empty.\n");
> > +
> > +	drm_gem_private_object_fini(&mgr->d_obj);
> >   }
> >   EXPORT_SYMBOL_GPL(drm_gpuva_manager_destroy);
> > +/**
> > + * drm_gpuva_manager_prepare_objects() - prepare all associated BOs
> > + * @mgr: the &drm_gpuva_manager
> > + * @num_fences: the amount of &dma_fences to reserve
> > + *
> > + * Calls drm_exec_prepare_obj() for all &drm_gem_objects the given
> > + * &drm_gpuva_manager contains mappings of.
> > + *
> > + * Drivers can obtain the corresponding &drm_exec instance through
> > + * DRM_GPUVA_EXEC(). It is the drivers responsibility to call drm_exec_init()
> > + * and drm_exec_fini() accordingly.
> > + *
> > + * Returns: 0 on success, negative error code on failure.
> > + */
> > +int
> > +drm_gpuva_manager_prepare_objects(struct drm_gpuva_manager *mgr,
> > +				  unsigned int num_fences)
> > +{
> > +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
> > +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
> > +	union {
> > +		void *ptr;
> > +		uintptr_t cnt;
> > +	} ref;
> > +	int ret;
> > +
> > +	ret = drm_exec_prepare_obj(exec, &mgr->d_obj, num_fences);
> > +	if (ret)
> > +		goto out;
> > +
> > +	rcu_read_lock();
> > +	mas_for_each(&mas, ref.ptr, ULONG_MAX) {
> > +		struct drm_gem_object *obj;
> > +
> > +		mas_pause(&mas);
> > +		rcu_read_unlock();
> > +
> > +		obj = (struct drm_gem_object *)(uintptr_t)mas.index;
> > +		ret = drm_exec_prepare_obj(exec, obj, num_fences);
> > +		if (ret)
> > +			goto out;
> > +
> > +		rcu_read_lock();
> > +	}
> > +	rcu_read_unlock();
> > +
> > +out:
> > +	return ret;
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_manager_prepare_objects);
> > +
> > +/**
> > + * drm_gpuva_manager_lock_extra() - lock all dma-resv of all associated BOs
> > + * @mgr: the &drm_gpuva_manager
> > + * @fn: callback received by the driver to lock additional dma-resv
> > + * @priv: private driver data passed to @fn
> > + * @num_fences: the amount of &dma_fences to reserve
> > + * @interruptible: sleep interruptible if waiting
> > + *
> > + * Acquires all dma-resv locks of all &drm_gem_objects the given
> > + * &drm_gpuva_manager contains mappings of.
> > + *
> > + * Additionally, when calling this function the driver receives the given @fn
> > + * callback to lock additional dma-resv in the context of the
> > + * &drm_gpuva_managers &drm_exec instance. Typically, drivers would call
> > + * drm_exec_prepare_obj() from within this callback.
> > + *
> > + * Returns: 0 on success, negative error code on failure.
> > + */
> > +int
> > +drm_gpuva_manager_lock_extra(struct drm_gpuva_manager *mgr,
> > +			     int (*fn)(struct drm_gpuva_manager *mgr,
> > +				       void *priv, unsigned int num_fences),
> > +			     void *priv,
> > +			     unsigned int num_fences,
> > +			     bool interruptible)
> > +{
> > +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
> > +	uint32_t flags;
> > +	int ret;
> > +
> > +	flags = (interruptible ? DRM_EXEC_INTERRUPTIBLE_WAIT : 0) |
> > +		DRM_EXEC_IGNORE_DUPLICATES;
> > +
> > +	drm_exec_init(exec, flags);
> > +
> > +	drm_exec_until_all_locked(exec) {
> > +		ret = drm_gpuva_manager_prepare_objects(mgr, num_fences);
> > +		drm_exec_retry_on_contention(exec);
> > +		if (ret)
> > +			goto err;
> > +
> > +		if (fn) {
> > +			ret = fn(mgr, priv, num_fences);
> > +			drm_exec_retry_on_contention(exec);
> > +			if (ret)
> > +				goto err;
> > +		}
> > +	}
> > +
> > +	return 0;
> > +
> > +err:
> > +	drm_exec_fini(exec);
> > +	return ret;
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_manager_lock_extra);
> > +
> > +static int
> > +fn_lock_array(struct drm_gpuva_manager *mgr, void *priv,
> > +				unsigned int num_fences)
> > +{
> > +	struct {
> > +		struct drm_gem_object **objs;
> > +		unsigned int num_objs;
> > +	} *args = priv;
> > +
> > +	return drm_exec_prepare_array(DRM_GPUVA_EXEC(mgr), args->objs,
> > +				      args->num_objs, num_fences);
> > +}
> > +
> > +/**
> > + * drm_gpuva_manager_lock_array() - lock all dma-resv of all associated BOs
> > + * @mgr: the &drm_gpuva_manager
> > + * @objs: additional &drm_gem_objects to lock
> > + * @num_objs: the number of additional &drm_gem_objects to lock
> > + * @num_fences: the amount of &dma_fences to reserve
> > + * @interruptible: sleep interruptible if waiting
> > + *
> > + * Acquires all dma-resv locks of all &drm_gem_objects the given
> > + * &drm_gpuva_manager contains mappings of, plus the ones given through @objs.
> > + *
> > + * Returns: 0 on success, negative error code on failure.
> > + */
> > +int
> > +drm_gpuva_manager_lock_array(struct drm_gpuva_manager *mgr,
> > +			     struct drm_gem_object **objs,
> > +			     unsigned int num_objs,
> > +			     unsigned int num_fences,
> > +			     bool interruptible)
> > +{
> > +	struct {
> > +		struct drm_gem_object **objs;
> > +		unsigned int num_objs;
> > +	} args;
> > +
> > +	args.objs = objs;
> > +	args.num_objs = num_objs;
> > +
> > +	return drm_gpuva_manager_lock_extra(mgr, fn_lock_array, &args,
> > +					    num_fences, interruptible);
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_manager_lock_array);
> > +
> > +/**
> > + * drm_gpuva_manager_validate() - validate all BOs marked as evicted
> > + * @mgr: the &drm_gpuva_manager to validate evicted BOs
> > + *
> > + * Calls the &drm_gpuva_fn_ops.bo_validate callback for all evicted buffer
> > + * objects being mapped in the given &drm_gpuva_manager.
> > + *
> > + * Returns: 0 on success, negative error code on failure.
> > + */
> > +int
> > +drm_gpuva_manager_validate(struct drm_gpuva_manager *mgr)
> > +{
> > +	const struct drm_gpuva_fn_ops *ops = mgr->ops;
> > +	struct drm_gpuva_gem *vm_bo;
> > +	int ret;
> > +
> > +	if (unlikely(!ops || !ops->bo_validate))
> > +		return -ENOTSUPP;
> > +
> > +	/* At this point we should hold all dma-resv locks of all GEM objects
> > +	 * associated with this GPU-VM, hence it is safe to walk the list.
> > +	 */
> > +	list_for_each_entry(vm_bo, &mgr->evict.list, list.entry.evict) {
> > +		dma_resv_assert_held(vm_bo->obj->resv);
> > +
> > +		ret = ops->bo_validate(vm_bo->obj);
> > +		if (ret)
> > +			return ret;
> > +	}
> > +
> > +	return 0;
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_manager_validate);
> > +
> > +/**
> > + * drm_gpuva_manager_resv_add_fence - add fence to private and all extobj
> > + * dma-resv
> > + * @mgr: the &drm_gpuva_manager to add a fence to
> > + * @fence: fence to add
> > + * @private_usage: private dma-resv usage
> > + * @extobj_usage: extobj dma-resv usage
> > + */
> > +void
> > +drm_gpuva_manager_resv_add_fence(struct drm_gpuva_manager *mgr,
> > +				 struct dma_fence *fence,
> > +				 enum dma_resv_usage private_usage,
> > +				 enum dma_resv_usage extobj_usage)
> > +{
> > +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
> > +	struct drm_gem_object *obj;
> > +	unsigned long index;
> > +
> > +	drm_exec_for_each_locked_object(exec, index, obj) {
> > +		dma_resv_assert_held(obj->resv);
> > +		dma_resv_add_fence(obj->resv, fence,
> > +				   drm_gpuva_is_extobj(mgr, obj) ?
> > +				   extobj_usage : private_usage);
> > +	}
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_manager_resv_add_fence);
> > +
> > +static struct drm_gpuva_gem *
> > +__drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
> > +		     struct drm_gem_object *obj)
> > +{
> > +	struct drm_gpuva_gem *vm_bo;
> > +
> > +	drm_gem_gpuva_assert_lock_held(obj);
> > +
> > +	drm_gem_for_each_gpuva_gem(vm_bo, obj)
> > +		if (vm_bo->mgr == mgr)
> > +			return vm_bo;
> > +
> > +	return NULL;
> > +}
> > +
> > +/**
> > + * drm_gpuva_gem_create() - create a new instance of struct drm_gpuva_gem
> > + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> > + * @obj: The &drm_gem_object being mapped in the @mgr.
> > + *
> > + * If provided by the driver, this function uses the &drm_gpuva_fn_ops
> > + * vm_bo_alloc() callback to allocate.
> > + *
> > + * Returns: a pointer to the &drm_gpuva_gem on success, NULL on failure
> > + */
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_create(struct drm_gpuva_manager *mgr,
> > +		     struct drm_gem_object *obj)
> > +{
> > +	const struct drm_gpuva_fn_ops *ops = mgr->ops;
> > +	struct drm_gpuva_gem *vm_bo;
> > +
> > +	if (ops && ops->vm_bo_alloc)
> > +		vm_bo = ops->vm_bo_alloc();
> > +	else
> > +		vm_bo = kzalloc(sizeof(*vm_bo), GFP_KERNEL);
> > +
> > +	if (unlikely(!vm_bo))
> > +		return NULL;
> > +
> > +	vm_bo->mgr = mgr;
> > +	vm_bo->obj = obj;
> > +
> > +	kref_init(&vm_bo->kref);
> > +	INIT_LIST_HEAD(&vm_bo->list.gpuva);
> > +	INIT_LIST_HEAD(&vm_bo->list.entry.gem);
> > +	INIT_LIST_HEAD(&vm_bo->list.entry.evict);
> > +
> > +	drm_gem_object_get(obj);
> > +
> > +	return vm_bo;
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_create);
> > +
> > +void
> > +drm_gpuva_gem_destroy(struct kref *kref)
> > +{
> > +	struct drm_gpuva_gem *vm_bo = container_of(kref, struct drm_gpuva_gem,
> > +						   kref);
> > +	const struct drm_gpuva_fn_ops *ops = vm_bo->mgr->ops;
> > +
> > +	drm_gem_object_put(vm_bo->obj);
> > +
> > +	if (ops && ops->vm_bo_free)
> > +		ops->vm_bo_free(vm_bo);
> > +	else
> > +		kfree(vm_bo);
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_destroy);
> > +
> > +/**
> > + * drm_gpuva_gem_find() - find the &drm_gpuva_gem for the given
> > + * &drm_gpuva_manager and &drm_gem_object
> > + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> > + * @obj: The &drm_gem_object being mapped in the @mgr.
> > + *
> > + * Find the &drm_gpuva_gem representing the combination of the given
> > + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
> > + * count of the &drm_gpuva_gem accordingly.
> > + *
> > + * Returns: a pointer to the &drm_gpuva_gem on success, NULL on failure
> > + */
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
> > +		   struct drm_gem_object *obj)
> > +{
> > +	struct drm_gpuva_gem *vm_bo = __drm_gpuva_gem_find(mgr, obj);
> > +
> > +	return vm_bo ? drm_gpuva_gem_get(vm_bo) : NULL;
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_find);
> > +
> > +/**
> > + * drm_gpuva_gem_obtain() - obtains an instance of the &drm_gpuva_gem for the
> > + * given &drm_gpuva_manager and &drm_gem_object
> > + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> > + * @obj: The &drm_gem_object being mapped in the @mgr.
> > + *
> > + * Find the &drm_gpuva_gem representing the combination of the given
> > + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
> > + * count of the &drm_gpuva_gem accordingly. If not found, allocates a new
> > + * &drm_gpuva_gem.
> > + *
> > + * Returns: a pointer to the &drm_gpuva_gem on success, an ERR_PTR on failure
> > + */
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_obtain(struct drm_gpuva_manager *mgr,
> > +		     struct drm_gem_object *obj)
> > +{
> > +	struct drm_gpuva_gem *vm_bo;
> > +
> > +	vm_bo = drm_gpuva_gem_find(mgr, obj);
> > +	if (vm_bo)
> > +		return vm_bo;
> > +
> > +	vm_bo = drm_gpuva_gem_create(mgr, obj);
> > +	if (!vm_bo)
> > +		return ERR_PTR(-ENOMEM);
> > +
> > +	return vm_bo;
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_obtain);
> > +
> > +/**
> > + * drm_gpuva_gem_obtain_prealloc() - obtains an instance of the &drm_gpuva_gem
> > + * for the given &drm_gpuva_manager and &drm_gem_object
> > + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> > + * @obj: The &drm_gem_object being mapped in the @mgr.
> > + * @__vm_bo: A pre-allocated &drm_gpuva_gem to use if no existing one is found.
> > + *
> > + * Find the &drm_gpuva_gem representing the combination of the given
> > + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
> > + * count of the found &drm_gpuva_gem accordingly, while the @__vm_bo reference
> > + * count is decreased. If not found @__vm_bo is returned.
> > + *
> > + * Returns: a pointer to the found &drm_gpuva_gem or @__vm_bo if no existing
> > + * &drm_gpuva_gem was found
> > + */
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_obtain_prealloc(struct drm_gpuva_manager *mgr,
> > +			      struct drm_gem_object *obj,
> > +			      struct drm_gpuva_gem *__vm_bo)
> > +{
> > +	struct drm_gpuva_gem *vm_bo;
> > +
> > +	vm_bo = drm_gpuva_gem_find(mgr, obj);
> > +	if (vm_bo) {
> > +		drm_gpuva_gem_put(__vm_bo);
> > +		return vm_bo;
> > +	}
> > +
> > +	return __vm_bo;
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_obtain_prealloc);
> > +
> > +static int
> > +__drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
> > +			  struct drm_gem_object *obj,
> > +			  gfp_t gfp)
> > +{
> > +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
> > +	union {
> > +		struct drm_gem_object *obj;
> > +		uintptr_t index;
> > +	} gem;
> > +	union {
> > +		void *ptr;
> > +		uintptr_t cnt;
> > +	} ref;
> > +	int ret = 0;
> > +
> > +	gem.obj = obj;
> > +	mas_set(&mas, gem.index);
> > +
> > +	mas_lock(&mas);
> > +	ref.ptr = mas_walk(&mas);
> > +	if (ref.ptr) {
> > +		++ref.cnt;
> > +		mas_store(&mas, ref.ptr);
> > +	} else {
> > +		if (unlikely(!gfp)) {
> > +			ret = -EINVAL;
> > +			goto out;
> > +		}
> > +
> > +		mas_set(&mas, gem.index);
> > +		ref.cnt = 1;
> > +		ret = mas_store_gfp(&mas, ref.ptr, gfp);
> > +		if (likely(!ret))
> > +			drm_gem_object_get(obj);
> > +	}
> > +out:
> > +	mas_unlock(&mas);
> > +	return ret;
> > +}
> > +
> > +static void
> > +__drm_gpuva_extobj_remove(struct drm_gpuva_manager *mgr,
> > +			  struct drm_gem_object *obj)
> > +{
> > +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
> > +	union {
> > +		struct drm_gem_object *obj;
> > +		uintptr_t index;
> > +	} gem;
> > +	union {
> > +		void *ptr;
> > +		uintptr_t cnt;
> > +	} ref;
> > +
> > +	gem.obj = obj;
> > +	mas_set(&mas, gem.index);
> > +
> > +	mas_lock(&mas);
> > +	if (unlikely(!(ref.ptr = mas_walk(&mas))))
> > +		goto out;
> > +
> > +	if (!--ref.cnt) {
> > +		mas_erase(&mas);
> > +		drm_gem_object_put(obj);
> > +	} else {
> > +		mas_store(&mas, ref.ptr);
> > +	}
> > +out:
> > +	mas_unlock(&mas);
> > +}
> > +
> > +/**
> > + * drm_gpuva_extobj_insert - insert an external &drm_gem_object
> > + * @mgr: the &drm_gpuva_manager to insert into
> > + * @obj: the &drm_gem_object to insert as extobj
> > + *
> > + * Insert a &drm_gem_object into the &drm_gpuva_managers external object tree.
> > + * If the &drm_gem_object already exists in the tree, the reference counter
> > + * of this external object is increased by one.
> > + *
> > + * Drivers should insert the external &drm_gem_object before the dma-fence
> > + * signalling critical section, e.g. when submitting the job, and before
> > + * locking all &drm_gem_objects of a GPU-VM, e.g. with drm_gpuva_manager_lock()
> > + * or its variants.
> > + *
> > + * Returns: 0 on success, negative error code on failure.
> > + */
> > +int
> > +drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
> > +			struct drm_gem_object *obj)
> > +{
> > +	return drm_gpuva_is_extobj(mgr, obj) ?
> > +		__drm_gpuva_extobj_insert(mgr, obj, GFP_KERNEL) : 0;
> > +
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_insert);
> > +
> > +/**
> > + * drm_gpuva_extobj_get - increase the reference count of an external
> > + * &drm_gem_object
> > + * @mgr: the &drm_gpuva_manager storing the extobj
> > + * @obj: the &drm_gem_object representing the extobj
> > + *
> > + * Increases the reference count of the extobj represented by @obj.
> > + *
> > + * Drivers should call this for every &drm_gpuva backed by a &drm_gem_object
> > + * being inserted.
> > + *
> > + * For &drm_gpuva_op_remap operations drivers should make sure to only take an
> > + * additional reference if the re-map operation splits an existing &drm_gpuva
> > + * into two separate ones.
> > + *
> > + * See also drm_gpuva_map_get() and drm_gpuva_remap_get().
> > + */
> > +void
> > +drm_gpuva_extobj_get(struct drm_gpuva_manager *mgr,
> > +		     struct drm_gem_object *obj)
> > +{
> > +	if (drm_gpuva_is_extobj(mgr, obj))
> > +		WARN(__drm_gpuva_extobj_insert(mgr, obj, 0),
> > +		     "Can't increase ref-count of non-existent extobj.");
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_get);
> > +
> > +/**
> > + * drm_gpuva_extobj_put - decrease the reference count of an external
> > + * &drm_gem_object
> > + * @mgr: the &drm_gpuva_manager storing the extobj
> > + * @obj: the &drm_gem_object representing the extobj
> > + *
> > + * Decreases the reference count of the extobj represented by @obj.
> > + *
> > + * Drivers should call this for every &drm_gpuva backed by a &drm_gem_object
> > + * being removed from the GPU VA space.
> > + *
> > + * See also drm_gpuva_unmap_put().
> > + */
> > +void
> > +drm_gpuva_extobj_put(struct drm_gpuva_manager *mgr,
> > +		     struct drm_gem_object *obj)
> > +{
> > +	if (drm_gpuva_is_extobj(mgr, obj))
> > +		__drm_gpuva_extobj_remove(mgr, obj);
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_put);
> > +
> > +/**
> > + * drm_gpuva_gem_evict() - add / remove a &drm_gem_object to / from a
> > + * &drm_gpuva_managers evicted list
> > + * @obj: the &drm_gem_object to add or remove
> > + * @evict: indicates whether the object is evicted
> > + *
> > + * Adds a &drm_gem_object to or removes it from all &drm_gpuva_managers evicted
> > + * list containing a mapping of this &drm_gem_object.
> > + */
> > +void
> > +drm_gpuva_gem_evict(struct drm_gem_object *obj, bool evict)
> > +{
> > +	struct drm_gpuva_gem *vm_bo;
> > +
> > +	/* Required for iterating the GEMs GPUVA GEM list. If no driver specific
> > +	 * lock has been set, the list is protected with the GEMs dma-resv lock.
> > +	 */
> > +	drm_gem_gpuva_assert_lock_held(obj);
> > +
> > +	/* Required to protect the GPUVA managers evict list against concurrent
> > +	 * access through drm_gpuva_manager_validate(). Concurrent insertions to
> > +	 * the evict list through different GEM object evictions are protected
> > +	 * by the GPUVA managers evict lock.
> > +	 */
> > +	dma_resv_assert_held(obj->resv);
> > +
> > +	drm_gem_for_each_gpuva_gem(vm_bo, obj) {
> > +		struct drm_gpuva_manager *mgr = vm_bo->mgr;
> > +
> > +		spin_lock(&mgr->evict.lock);
> > +		if (evict)
> > +			list_add_tail(&vm_bo->list.entry.evict,
> > +				      &mgr->evict.list);
> > +		else
> > +			list_del_init(&vm_bo->list.entry.evict);
> > +		spin_unlock(&mgr->evict.lock);
> > +	}
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_evict);
> > +
> >   static int
> >   __drm_gpuva_insert(struct drm_gpuva_manager *mgr,
> >   		   struct drm_gpuva *va)
> > @@ -806,15 +1381,20 @@ EXPORT_SYMBOL_GPL(drm_gpuva_remove);
> >   /**
> >    * drm_gpuva_link() - link a &drm_gpuva
> >    * @va: the &drm_gpuva to link
> > + * @vm_bo: the &drm_gpuva_gem to add the &drm_gpuva to
> >    *
> > - * This adds the given &va to the GPU VA list of the &drm_gem_object it is
> > - * associated with.
> > + * This adds the given &va to the GPU VA list of the &drm_gpuva_gem and the
> > + * &drm_gpuva_gem to the &drm_gem_object it is associated with.
> > + *
> > + * For every &drm_gpuva entry added to the &drm_gpuva_gem an additional
> > + * reference of the latter is taken.
> >    *
> >    * This function expects the caller to protect the GEM's GPUVA list against
> > - * concurrent access using the GEMs dma_resv lock.
> > + * concurrent access using either the GEMs dma_resv lock or a driver specific
> > + * lock set through drm_gem_gpuva_set_lock().
> >    */
> >   void
> > -drm_gpuva_link(struct drm_gpuva *va)
> > +drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo)
> >   {
> >   	struct drm_gem_object *obj = va->gem.obj;
> > @@ -823,7 +1403,10 @@ drm_gpuva_link(struct drm_gpuva *va)
> >   	drm_gem_gpuva_assert_lock_held(obj);
> > -	list_add_tail(&va->gem.entry, &obj->gpuva.list);
> > +	drm_gpuva_gem_get(vm_bo);
> > +	list_add_tail(&va->gem.entry, &vm_bo->list.gpuva);
> > +	if (list_empty(&vm_bo->list.entry.gem))
> > +		list_add_tail(&vm_bo->list.entry.gem, &obj->gpuva.list);
> >   }
> >   EXPORT_SYMBOL_GPL(drm_gpuva_link);
> > @@ -834,20 +1417,39 @@ EXPORT_SYMBOL_GPL(drm_gpuva_link);
> >    * This removes the given &va from the GPU VA list of the &drm_gem_object it is
> >    * associated with.
> >    *
> > + * This removes the given &va from the GPU VA list of the &drm_gpuva_gem and
> > + * the &drm_gpuva_gem from the &drm_gem_object it is associated with in case
> > + * this call unlinks the last &drm_gpuva from the &drm_gpuva_gem.
> > + *
> > + * For every &drm_gpuva entry removed from the &drm_gpuva_gem a reference of
> > + * the latter is dropped.
> > + *
> >    * This function expects the caller to protect the GEM's GPUVA list against
> > - * concurrent access using the GEMs dma_resv lock.
> > + * concurrent access using either the GEMs dma_resv lock or a driver specific
> > + * lock set through drm_gem_gpuva_set_lock().
> >    */
> >   void
> >   drm_gpuva_unlink(struct drm_gpuva *va)
> >   {
> >   	struct drm_gem_object *obj = va->gem.obj;
> > +	struct drm_gpuva_gem *vm_bo;
> >   	if (unlikely(!obj))
> >   		return;
> >   	drm_gem_gpuva_assert_lock_held(obj);
> > +	vm_bo = __drm_gpuva_gem_find(va->mgr, obj);
> > +	if (WARN(!vm_bo, "GPUVA doesn't seem to be linked.\n"))
> > +		return;
> > +
> >   	list_del_init(&va->gem.entry);
> > +
> > +	if (list_empty(&vm_bo->list.gpuva)) {
> > +		list_del_init(&vm_bo->list.entry.gem);
> > +		list_del_init(&vm_bo->list.entry.evict);
> > +	}
> > +	drm_gpuva_gem_put(vm_bo);
> >   }
> >   EXPORT_SYMBOL_GPL(drm_gpuva_unlink);
> > @@ -977,6 +1579,26 @@ drm_gpuva_map(struct drm_gpuva_manager *mgr,
> >   }
> >   EXPORT_SYMBOL_GPL(drm_gpuva_map);
> > +/**
> > + * drm_gpuva_map_get() - helper to insert a &drm_gpuva according to a
> > + * &drm_gpuva_op_map
> > + * @mgr: the &drm_gpuva_manager
> > + * @va: the &drm_gpuva to insert
> > + * @op: the &drm_gpuva_op_map to initialize @va with
> > + *
> > + * Initializes the @va from the @op and inserts it into the given @mgr and
> > + * increases the reference count of the corresponding extobj.
> > + */
> > +void
> > +drm_gpuva_map_get(struct drm_gpuva_manager *mgr,
> > +		  struct drm_gpuva *va,
> > +		  struct drm_gpuva_op_map *op)
> > +{
> > +	drm_gpuva_map(mgr, va, op);
> > +	drm_gpuva_extobj_get(mgr, va->gem.obj);
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_map_get);
> > +
> >   /**
> >    * drm_gpuva_remap() - helper to remap a &drm_gpuva according to a
> >    * &drm_gpuva_op_remap
> > @@ -992,10 +1614,10 @@ drm_gpuva_remap(struct drm_gpuva *prev,
> >   		struct drm_gpuva *next,
> >   		struct drm_gpuva_op_remap *op)
> >   {
> > -	struct drm_gpuva *curr = op->unmap->va;
> > -	struct drm_gpuva_manager *mgr = curr->mgr;
> > +	struct drm_gpuva *va = op->unmap->va;
> > +	struct drm_gpuva_manager *mgr = va->mgr;
> > -	drm_gpuva_remove(curr);
> > +	drm_gpuva_remove(va);
> >   	if (op->prev) {
> >   		drm_gpuva_init_from_op(prev, op->prev);
> > @@ -1009,6 +1631,31 @@ drm_gpuva_remap(struct drm_gpuva *prev,
> >   }
> >   EXPORT_SYMBOL_GPL(drm_gpuva_remap);
> > +/**
> > + * drm_gpuva_remap_get() - helper to remap a &drm_gpuva according to a
> > + * &drm_gpuva_op_remap
> > + * @prev: the &drm_gpuva to remap when keeping the start of a mapping
> > + * @next: the &drm_gpuva to remap when keeping the end of a mapping
> > + * @op: the &drm_gpuva_op_remap to initialize @prev and @next with
> > + *
> > + * Removes the currently mapped &drm_gpuva and remaps it using @prev and/or
> > + * @next. Additionally, if the re-map splits the existing &drm_gpuva into two
> > + * separate mappings, increases the reference count of the corresponding extobj.
> > + */
> > +void
> > +drm_gpuva_remap_get(struct drm_gpuva *prev,
> > +		    struct drm_gpuva *next,
> > +		    struct drm_gpuva_op_remap *op)
> > +{
> > +	struct drm_gpuva *va = op->unmap->va;
> > +	struct drm_gpuva_manager *mgr = va->mgr;
> > +
> > +	drm_gpuva_remap(prev, next, op);
> > +	if (op->prev && op->next)
> > +		drm_gpuva_extobj_get(mgr, va->gem.obj);
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_remap_get);
> > +
> >   /**
> >    * drm_gpuva_unmap() - helper to remove a &drm_gpuva according to a
> >    * &drm_gpuva_op_unmap
> > @@ -1023,6 +1670,24 @@ drm_gpuva_unmap(struct drm_gpuva_op_unmap *op)
> >   }
> >   EXPORT_SYMBOL_GPL(drm_gpuva_unmap);
> > +/**
> > + * drm_gpuva_unmap_put() - helper to remove a &drm_gpuva according to a
> > + * &drm_gpuva_op_unmap
> > + * @op: the &drm_gpuva_op_unmap specifying the &drm_gpuva to remove
> > + *
> > + * Removes the &drm_gpuva associated with the &drm_gpuva_op_unmap and decreases
> > + * the reference count of the corresponding extobj.
> > + */
> > +void
> > +drm_gpuva_unmap_put(struct drm_gpuva_op_unmap *op)
> > +{
> > +	struct drm_gpuva *va = op->va;
> > +
> > +	drm_gpuva_unmap(op);
> > +	drm_gpuva_extobj_put(va->mgr, va->gem.obj);
> > +}
> > +EXPORT_SYMBOL_GPL(drm_gpuva_unmap_put);
> > +
> >   static int
> >   op_map_cb(const struct drm_gpuva_fn_ops *fn, void *priv,
> >   	  u64 addr, u64 range,
> > @@ -1663,6 +2328,7 @@ drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
> >   {
> >   	struct drm_gpuva_ops *ops;
> >   	struct drm_gpuva_op *op;
> > +	struct drm_gpuva_gem *vm_bo;
> >   	struct drm_gpuva *va;
> >   	int ret;
> > @@ -1674,7 +2340,7 @@ drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
> >   	INIT_LIST_HEAD(&ops->list);
> > -	drm_gem_for_each_gpuva(va, obj) {
> > +	drm_gem_for_each_gpuva(va, vm_bo, mgr, obj) {
> >   		op = gpuva_op_alloc(mgr);
> >   		if (!op) {
> >   			ret = -ENOMEM;
> > diff --git a/include/drm/drm_gem.h b/include/drm/drm_gem.h
> > index bc9f6aa2f3fe..783ed3ab440d 100644
> > --- a/include/drm/drm_gem.h
> > +++ b/include/drm/drm_gem.h
> > @@ -571,7 +571,7 @@ int drm_gem_evict(struct drm_gem_object *obj);
> >    * drm_gem_gpuva_init() - initialize the gpuva list of a GEM object
> >    * @obj: the &drm_gem_object
> >    *
> > - * This initializes the &drm_gem_object's &drm_gpuva list.
> > + * This initializes the &drm_gem_object's &drm_gpuva_gem list.
> >    *
> >    * Calling this function is only necessary for drivers intending to support the
> >    * &drm_driver_feature DRIVER_GEM_GPUVA.
> > @@ -584,28 +584,44 @@ static inline void drm_gem_gpuva_init(struct drm_gem_object *obj)
> >   }
> >   /**
> > - * drm_gem_for_each_gpuva() - iternator to walk over a list of gpuvas
> > - * @entry__: &drm_gpuva structure to assign to in each iteration step
> > - * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
> > + * drm_gem_for_each_gpuva_gem() - iterator to walk over a list of &drm_gpuva_gem
> > + * @entry__: &drm_gpuva_gem structure to assign to in each iteration step
> > + * @obj__: the &drm_gem_object the &drm_gpuva_gem to walk are associated with
> >    *
> > - * This iterator walks over all &drm_gpuva structures associated with the
> > - * &drm_gpuva_manager.
> > + * This iterator walks over all &drm_gpuva_gem structures associated with the
> > + * &drm_gem_object.
> >    */
> > -#define drm_gem_for_each_gpuva(entry__, obj__) \
> > -	list_for_each_entry(entry__, &(obj__)->gpuva.list, gem.entry)
> > +#define drm_gem_for_each_gpuva_gem(entry__, obj__) \
> > +	list_for_each_entry(entry__, &(obj__)->gpuva.list, list.entry.gem)
> >   /**
> > - * drm_gem_for_each_gpuva_safe() - iternator to safely walk over a list of
> > - * gpuvas
> > - * @entry__: &drm_gpuva structure to assign to in each iteration step
> > - * @next__: &next &drm_gpuva to store the next step
> > - * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
> > + * drm_gem_for_each_gpuva_gem_safe() - iterator to safely walk over a list of
> > + * &drm_gpuva_gem
> > + * @entry__: &drm_gpuva_gem structure to assign to in each iteration step
> > + * @next__: &next &drm_gpuva_gem to store the next step
> > + * @obj__: the &drm_gem_object the &drm_gpuva_gem to walk are associated with
> >    *
> > - * This iterator walks over all &drm_gpuva structures associated with the
> > + * This iterator walks over all &drm_gpuva_gem structures associated with the
> >    * &drm_gem_object. It is implemented with list_for_each_entry_safe(), hence
> >    * it is safe against removal of elements.
> >    */
> > -#define drm_gem_for_each_gpuva_safe(entry__, next__, obj__) \
> > -	list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list, gem.entry)
> > +#define drm_gem_for_each_gpuva_gem_safe(entry__, next__, obj__) \
> > +	list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list, list.entry.gem)
> > +
> > +/**
> > + * drm_gem_for_each_gpuva() - iterator to walk over a list of &drm_gpuva
> > + * @va__: &drm_gpuva structure to assign to in each iteration step
> > + * @vm_bo__: the &drm_gpuva_gem representing the @mgr__ and @obj__ combination
> > + * @mgr__: the &drm_gpuva_manager the &drm_gpuvas to walk are associated with
> > + * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
> > + *
> > + * This iterator walks over all &drm_gpuva structures associated with the
> > + * &drm_gpuva_manager and &drm_gem_object.
> > + */
> > +#define drm_gem_for_each_gpuva(va__, vm_bo__, mgr__, obj__) \
> > +	for (vm_bo__ = drm_gpuva_gem_find(mgr__, obj__), \
> > +	     va__ = vm_bo__ ? list_first_entry(&vm_bo__->list.gpuva, typeof(*va__), gem.entry) : NULL; \
> > +	     va__ && !list_entry_is_head(va__, &vm_bo__->list.gpuva, gem.entry); \
> > +	     va__ = list_next_entry(va__, gem.entry))
> >   #endif /* __DRM_GEM_H__ */
> > diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h
> > index ed8d50200cc3..693e2da3f425 100644
> > --- a/include/drm/drm_gpuva_mgr.h
> > +++ b/include/drm/drm_gpuva_mgr.h
> > @@ -26,12 +26,16 @@
> >    */
> >   #include <linux/list.h>
> > +#include <linux/dma-resv.h>
> > +#include <linux/maple_tree.h>
> >   #include <linux/rbtree.h>
> >   #include <linux/types.h>
> >   #include <drm/drm_gem.h>
> > +#include <drm/drm_exec.h>
> >   struct drm_gpuva_manager;
> > +struct drm_gpuva_gem;
> >   struct drm_gpuva_fn_ops;
> >   /**
> > @@ -140,7 +144,7 @@ struct drm_gpuva {
> >   int drm_gpuva_insert(struct drm_gpuva_manager *mgr, struct drm_gpuva *va);
> >   void drm_gpuva_remove(struct drm_gpuva *va);
> > -void drm_gpuva_link(struct drm_gpuva *va);
> > +void drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo);
> >   void drm_gpuva_unlink(struct drm_gpuva *va);
> >   struct drm_gpuva *drm_gpuva_find(struct drm_gpuva_manager *mgr,
> > @@ -240,15 +244,137 @@ struct drm_gpuva_manager {
> >   	 * @ops: &drm_gpuva_fn_ops providing the split/merge steps to drivers
> >   	 */
> >   	const struct drm_gpuva_fn_ops *ops;
> > +
> > +	/**
> > +	 * @d_obj: Dummy GEM object; used internally to pass the GPU VMs
> > +	 * dma-resv to &drm_exec.
> > +	 */
> > +	struct drm_gem_object d_obj;
> > +
> > +	/**
> > +	 * @resv: the &dma_resv for &drm_gem_objects mapped in this GPU VA
> > +	 * space
> > +	 */
> > +	struct dma_resv *resv;
> > +
> > +	/**
> > +	 * @exec: the &drm_exec helper to lock external &drm_gem_objects
> > +	 */
> > +	struct drm_exec exec;
> > +
> > +	/**
> > +	 * @mt_ext: &maple_tree storing external &drm_gem_objects
> > +	 */
> > +	struct maple_tree mt_ext;
> > +
> > +	/**
> > +	 * @evict: structure holding the evict list and evict list lock
> > +	 */
> > +	struct {
> > +		/**
> > +		 * @list: &list_head storing &drm_gem_objects currently being
> > +		 * evicted
> > +		 */
> > +		struct list_head list;
> > +
> > +		/**
> > +		 * @lock: spinlock to protect the evict list against concurrent
> > +		 * insertion / removal of different &drm_gpuva_gems
> > +		 */
> > +		spinlock_t lock;
> > +	} evict;
> >   };
> >   void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> > +			    struct drm_device *drm,
> >   			    const char *name,
> >   			    u64 start_offset, u64 range,
> >   			    u64 reserve_offset, u64 reserve_range,
> >   			    const struct drm_gpuva_fn_ops *ops);
> >   void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr);
> > +/**
> > + * DRM_GPUVA_EXEC - returns the &drm_gpuva_managers &drm_exec instance
> > + * @mgr: the &drm_gpuva_manager to return the &drm_exec instance for
> > + */
> > +#define DRM_GPUVA_EXEC(mgr)	&(mgr)->exec
> > +
> > +int drm_gpuva_manager_lock_extra(struct drm_gpuva_manager *mgr,
> > +				 int (*fn)(struct drm_gpuva_manager *mgr,
> > +					   void *priv, unsigned int num_fences),
> > +				 void *priv,
> > +				 unsigned int num_fences,
> > +				 bool interruptible);
> > +
> > +int drm_gpuva_manager_lock_array(struct drm_gpuva_manager *mgr,
> > +				 struct drm_gem_object **objs,
> > +				 unsigned int num_objs,
> > +				 unsigned int num_fences,
> > +				 bool interruptible);
> > +
> > +/**
> > + * drm_gpuva_manager_lock() - lock all dma-resv of all associated BOs
> > + * @mgr: the &drm_gpuva_manager
> > + * @num_fences: the amount of &dma_fences to reserve
> > + * @interruptible: sleep interruptible if waiting
> > + *
> > + * Acquires all dma-resv locks of all &drm_gem_objects the given
> > + * &drm_gpuva_manager contains mappings of.
> > + *
> > + * Returns: 0 on success, negative error code on failure.
> > + */
> > +static inline int
> > +drm_gpuva_manager_lock(struct drm_gpuva_manager *mgr,
> > +		       unsigned int num_fences,
> > +		       bool interruptible)
> > +{
> > +	return drm_gpuva_manager_lock_extra(mgr, NULL, NULL, num_fences,
> > +					    interruptible);
> > +}
> > +
> > +/**
> > + * drm_gpuva_manager_unlock() - unlock dma-resv of all associated BOs
> > + * @mgr: the &drm_gpuva_manager
> > + *
> > + * Releases all dma-resv locks of all &drm_gem_objects previously acquired
> > + * through drm_gpuva_manager_lock() or its variants.
> > + */
> > +static inline void
> > +drm_gpuva_manager_unlock(struct drm_gpuva_manager *mgr)
> > +{
> > +	drm_exec_fini(&mgr->exec);
> > +}
> > +
> > +int drm_gpuva_manager_validate(struct drm_gpuva_manager *mgr);
> > +void drm_gpuva_manager_resv_add_fence(struct drm_gpuva_manager *mgr,
> > +				      struct dma_fence *fence,
> > +				      enum dma_resv_usage private_usage,
> > +				      enum dma_resv_usage extobj_usage);
> > +
> > +int drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
> > +			    struct drm_gem_object *obj);
> > +void drm_gpuva_extobj_get(struct drm_gpuva_manager *mgr,
> > +			  struct drm_gem_object *obj);
> > +void drm_gpuva_extobj_put(struct drm_gpuva_manager *mgr,
> > +			  struct drm_gem_object *obj);
> > +
> > +/**
> > + * drm_gpuva_is_extobj() - indicates whether the given &drm_gem_object is an
> > + * external object
> > + * @mgr: the &drm_gpuva_manager to check
> > + * @obj: the &drm_gem_object to check
> > + *
> > + * Returns: true if the &drm_gem_object &dma_resv differs from the
> > + * &drm_gpuva_managers &dma_resv, false otherwise
> > + */
> > +static inline bool drm_gpuva_is_extobj(struct drm_gpuva_manager *mgr,
> > +				       struct drm_gem_object *obj)
> > +{
> > +	return obj && obj->resv != mgr->resv;
> > +}
> > +
> >   static inline struct drm_gpuva *
> >   __drm_gpuva_next(struct drm_gpuva *va)
> >   {
> > @@ -327,6 +453,138 @@ __drm_gpuva_next(struct drm_gpuva *va)
> >   #define drm_gpuva_for_each_va_safe(va__, next__, mgr__) \
> >   	list_for_each_entry_safe(va__, next__, &(mgr__)->rb.list, rb.entry)
> > +/**
> > + * struct drm_gpuva_gem - structure representing a &drm_gpuva_manager and
> > + * &drm_gem_object combination
> > + *
> > + * This structure is an abstraction representing a &drm_gpuva_manager and
> > + * &drm_gem_object combination. It serves as an indirection to accelerate
> > + * iterating all &drm_gpuvas within a &drm_gpuva_manager backed by the same
> > + * &drm_gem_object.
> > + *
> > + * Furthermore, it is used to cache evicted GEM objects for a certain GPU-VM to
> > + * accelerate validation.
> > + *
> > + * Typically, drivers want to create an instance of a struct drm_gpuva_gem once
> > + * a GEM object is mapped first in a GPU-VM and release the instance once the
> > + * last mapping of the GEM object in this GPU-VM is unmapped.
> > + */
> > +struct drm_gpuva_gem {
> > +
> > +	/**
> > +	 * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> > +	 */
> > +	struct drm_gpuva_manager *mgr;
> > +
> > +	/**
> > +	 * @obj: The &drm_gem_object being mapped in the @mgr.
> > +	 */
> > +	struct drm_gem_object *obj;
> > +
> > +	/**
> > +	 * @kref: The reference count for this &drm_gpuva_gem.
> > +	 */
> > +	struct kref kref;
> > +
> > +	/**
> > +	 * @list: Structure containing all &list_heads.
> > +	 */
> > +	struct {
> > +		/**
> > +		 * @gpuva: The list of linked &drm_gpuvas.
> > +		 */
> > +		struct list_head gpuva;
> > +
> > +		/**
> > +		 * @entry: Structure containing all &list_heads serving as
> > +		 * entry.
> > +		 */
> > +		struct {
> > +			/**
> > +			 * @gem: List entry to attach to the &drm_gem_objects
> > +			 * gpuva list.
> > +			 */
> > +			struct list_head gem;
> > +
> > +			/**
> > +			 * @evict: List entry to attach to the
> > +			 * &drm_gpuva_managers evict list.
> > +			 */
> > +			struct list_head evict;
> > +		} entry;
> > +	} list;
> > +};
> > +
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_obtain(struct drm_gpuva_manager *mgr,
> > +		     struct drm_gem_object *obj);
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_obtain_prealloc(struct drm_gpuva_manager *mgr,
> > +			      struct drm_gem_object *obj,
> > +			      struct drm_gpuva_gem *__vm_bo);
> > +
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
> > +		   struct drm_gem_object *obj);
> > +
> > +void drm_gpuva_gem_evict(struct drm_gem_object *obj, bool evict);
> > +
> > +struct drm_gpuva_gem *
> > +drm_gpuva_gem_create(struct drm_gpuva_manager *mgr,
> > +		     struct drm_gem_object *obj);
> > +void drm_gpuva_gem_destroy(struct kref *kref);
> > +
> > +/**
> > + * drm_gpuva_gem_get() - acquire a struct drm_gpuva_gem reference
> > + * @vm_bo: the &drm_gpuva_gem to acquire the reference of
> > + *
> > + * This function acquires an additional reference to @vm_bo. It is illegal to
> > + * call this without already holding a reference. No locks required.
> > + */
> > +static inline struct drm_gpuva_gem *
> > +drm_gpuva_gem_get(struct drm_gpuva_gem *vm_bo)
> > +{
> > +	kref_get(&vm_bo->kref);
> > +	return vm_bo;
> > +}
> > +
> > +/**
> > + * drm_gpuva_gem_put() - drop a struct drm_gpuva_gem reference
> > + * @vm_bo: the &drm_gpuva_gem to release the reference of
> > + *
> > + * This releases a reference to @vm_bo.
> > + */
> > +static inline void
> > +drm_gpuva_gem_put(struct drm_gpuva_gem *vm_bo)
> > +{
> > +	kref_put(&vm_bo->kref, drm_gpuva_gem_destroy);
> > +}
> > +
> > +/**
> > + * drm_gpuva_gem_for_each_va() - iterator to walk over a list of &drm_gpuva
> > + * @va__: &drm_gpuva structure to assign to in each iteration step
> > + * @vm_bo__: the &drm_gpuva_gem the &drm_gpuva to walk are associated with
> > + *
> > + * This iterator walks over all &drm_gpuva structures associated with the
> > + * &drm_gpuva_gem.
> > + */
> > +#define drm_gpuva_gem_for_each_va(va__, vm_bo__) \
> > +	list_for_each_entry(va__, &(vm_bo__)->list.gpuva, gem.entry)
> > +
> > +/**
> > + * drm_gpuva_gem_for_each_va_safe() - iterator to safely walk over a list of
> > + * &drm_gpuva
> > + * @va__: &drm_gpuva structure to assign to in each iteration step
> > + * @next__: &next &drm_gpuva to store the next step
> > + * @vm_bo__: the &drm_gpuva_gem the &drm_gpuva to walk are associated with
> > + *
> > + * This iterator walks over all &drm_gpuva structures associated with the
> > + * &drm_gpuva_gem. It is implemented with list_for_each_entry_safe(), hence
> > + * it is safe against removal of elements.
> > + */
> > +#define drm_gpuva_gem_for_each_va_safe(va__, next__, vm_bo__) \
> > +	list_for_each_entry_safe(va__, next__, &(vm_bo__)->list.gpuva, gem.entry)
> > +
> >   /**
> >    * enum drm_gpuva_op_type - GPU VA operation type
> >    *
> > @@ -641,6 +899,30 @@ struct drm_gpuva_fn_ops {
> >   	 */
> >   	void (*op_free)(struct drm_gpuva_op *op);
> > +	/**
> > +	 * @vm_bo_alloc: called when the &drm_gpuva_manager allocates
> > +	 * a struct drm_gpuva_gem
> > +	 *
> > +	 * Some drivers may want to embed struct drm_gpuva_gem into driver
> > +	 * specific structures. By implementing this callback drivers can
> > +	 * allocate memory accordingly.
> > +	 *
> > +	 * This callback is optional.
> > +	 */
> > +	struct drm_gpuva_gem *(*vm_bo_alloc)(void);
> > +
> > +	/**
> > +	 * @vm_bo_free: called when the &drm_gpuva_manager frees a
> > +	 * struct drm_gpuva_gem
> > +	 *
> > +	 * Some drivers may want to embed struct drm_gpuva_gem into driver
> > +	 * specific structures. By implementing this callback drivers can
> > +	 * free the previously allocated memory accordingly.
> > +	 *
> > +	 * This callback is optional.
> > +	 */
> > +	void (*vm_bo_free)(struct drm_gpuva_gem *vm_bo);
> > +
> >   	/**
> >   	 * @sm_step_map: called from &drm_gpuva_sm_map to finally insert the
> >   	 * mapping once all previous steps were completed
> > @@ -684,6 +966,17 @@ struct drm_gpuva_fn_ops {
> >   	 * used.
> >   	 */
> >   	int (*sm_step_unmap)(struct drm_gpuva_op *op, void *priv);
> > +
> > +	/**
> > +	 * @bo_validate: called from drm_gpuva_manager_validate()
> > +	 *
> > +	 * Drivers receive this callback for every evicted &drm_gem_object being
> > +	 * mapped in the corresponding &drm_gpuva_manager.
> > +	 *
> > +	 * Typically, drivers would call their driver specific variant of
> > +	 * ttm_bo_validate() from within this callback.
> > +	 */
> > +	int (*bo_validate)(struct drm_gem_object *obj);
> >   };
> >   int drm_gpuva_sm_map(struct drm_gpuva_manager *mgr, void *priv,
> > @@ -696,11 +989,18 @@ int drm_gpuva_sm_unmap(struct drm_gpuva_manager *mgr, void *priv,
> >   void drm_gpuva_map(struct drm_gpuva_manager *mgr,
> >   		   struct drm_gpuva *va,
> >   		   struct drm_gpuva_op_map *op);
> > +void drm_gpuva_map_get(struct drm_gpuva_manager *mgr,
> > +		       struct drm_gpuva *va,
> > +		       struct drm_gpuva_op_map *op);
> >   void drm_gpuva_remap(struct drm_gpuva *prev,
> >   		     struct drm_gpuva *next,
> >   		     struct drm_gpuva_op_remap *op);
> > +void drm_gpuva_remap_get(struct drm_gpuva *prev,
> > +			 struct drm_gpuva *next,
> > +			 struct drm_gpuva_op_remap *op);
> >   void drm_gpuva_unmap(struct drm_gpuva_op_unmap *op);
> > +void drm_gpuva_unmap_put(struct drm_gpuva_op_unmap *op);
> >   #endif /* __DRM_GPUVA_MGR_H__ */
> 


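To make the intended driver-side usage of the new helpers a bit more
tangible, here is a rough, untested sketch of a submit path. struct
driver_job, its members and the scheduler / fence handling are made up
for illustration only and not part of this series:

static int driver_job_submit(struct driver_job *job)
{
	struct drm_gpuva_manager *mgr = job->mgr;
	int ret;

	/* Track new external GEM objects before entering the dma-fence
	 * signalling critical section and before taking any dma-resv locks.
	 */
	ret = drm_gpuva_extobj_insert(mgr, job->bo);
	if (ret)
		return ret;

	/* Lock the GPU-VMs common dma-resv plus the dma-resv of all
	 * external GEM objects mapped in this GPU-VM.
	 */
	ret = drm_gpuva_manager_lock(mgr, 1, true);
	if (ret)
		return ret;

	/* Re-validate all evicted GEM objects mapped in this GPU-VM. */
	ret = drm_gpuva_manager_validate(mgr);
	if (ret)
		goto out_unlock;

	/* ... push the job to the HW / scheduler, obtain job->fence ... */

	drm_gpuva_manager_resv_add_fence(mgr, job->fence,
					 DMA_RESV_USAGE_BOOKKEEP,
					 DMA_RESV_USAGE_BOOKKEEP);

out_unlock:
	drm_gpuva_manager_unlock(mgr);
	return ret;
}

Which dma_resv_usage the driver actually wants to pass for the private
and external BOs is of course up to the driver; BOOKKEEP is just a
placeholder to show where drm_gpuva_manager_resv_add_fence() slots in.
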
^ permalink raw reply	[flat|nested] 88+ messages in thread

* Re: [Nouveau] [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
  2023-08-30 12:49       ` Danilo Krummrich
  (?)
@ 2023-08-30 13:42         ` Thomas Hellström (Intel)
  -1 siblings, 0 replies; 88+ messages in thread
From: Thomas Hellström (Intel) @ 2023-08-30 13:42 UTC (permalink / raw)
  To: Danilo Krummrich
  Cc: matthew.brost, thomas.hellstrom, sarah.walker, nouveau,
	dri-devel, linux-kernel, Liam.Howlett, boris.brezillon,
	donald.robson, daniel, christian.koenig, faith.ekstrand, bskeggs


On 8/30/23 14:49, Danilo Krummrich wrote:
> Hi Thomas,
>
> thanks for having a look!
>
> On Wed, Aug 30, 2023 at 09:27:45AM +0200, Thomas Hellström (Intel) wrote:
>> Hi, Danilo.
>>
>> Some quick comments since I'm doing some Xe work in this area. Will probably
>> get back with more.
>>
>> On 8/20/23 23:53, Danilo Krummrich wrote:
>>> So far the DRM GPUVA manager offers common infrastructure to track GPU VA
>>> allocations and mappings, generically connect GPU VA mappings to their
>>> backing buffers and perform more complex mapping operations on the GPU VA
>>> space.
>>>
>>> However, there are more design patterns commonly used by drivers, which
>>> can potentially be generalized in order to make the DRM GPUVA manager
>>> represent a basic GPU-VM implementation. In this context, this patch aims
>>> at generalizing the following elements.
>>>
>>> 1) Provide a common dma-resv for GEM objects not being used outside of
>>>      this GPU-VM.
>>>
>>> 2) Provide tracking of external GEM objects (GEM objects which are
>>>      shared with other GPU-VMs).
>>>
>>> 3) Provide functions to efficiently lock all GEM objects dma-resv the
>>>      GPU-VM contains mappings of.
>>>
>>> 4) Provide tracking of evicted GEM objects the GPU-VM contains mappings
>>>      of, such that validation of evicted GEM objects is accelerated.
>>>
>>> 5) Provide some convenience functions for common patterns.
>>>
>>> Rather than being designed as a "framework", the target is to make all
>>> features appear as a collection of optional helper functions, such that
>>> drivers are free to make use of the DRM GPUVA managers basic
>>> functionality and opt-in for other features without setting any feature
>>> flags, just by making use of the corresponding functions.
>>>
>>> Signed-off-by: Danilo Krummrich <dakr@redhat.com>
>>> ---
>>>    drivers/gpu/drm/drm_gpuva_mgr.c | 688 +++++++++++++++++++++++++++++++-
>>>    include/drm/drm_gem.h           |  48 ++-
>>>    include/drm/drm_gpuva_mgr.h     | 302 +++++++++++++-
>>>    3 files changed, 1010 insertions(+), 28 deletions(-)
>>>
>>> diff --git a/drivers/gpu/drm/drm_gpuva_mgr.c b/drivers/gpu/drm/drm_gpuva_mgr.c
>>> index f86bfad74ff8..69872b205961 100644
>>> --- a/drivers/gpu/drm/drm_gpuva_mgr.c
>>> +++ b/drivers/gpu/drm/drm_gpuva_mgr.c
>>> @@ -655,6 +655,7 @@ drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
>>>    /**
>>>     * drm_gpuva_manager_init() - initialize a &drm_gpuva_manager
>>>     * @mgr: pointer to the &drm_gpuva_manager to initialize
>>> + * @drm: the drivers &drm_device
>>>     * @name: the name of the GPU VA space
>>>     * @start_offset: the start offset of the GPU VA space
>>>     * @range: the size of the GPU VA space
>>> @@ -669,6 +670,7 @@ drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
>>>     */
>>>    void
>>>    drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
>>> +		       struct drm_device *drm,
>>>    		       const char *name,
>>>    		       u64 start_offset, u64 range,
>>>    		       u64 reserve_offset, u64 reserve_range,
>>> @@ -677,6 +679,11 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
>>>    	mgr->rb.tree = RB_ROOT_CACHED;
>>>    	INIT_LIST_HEAD(&mgr->rb.list);
>>> +	mt_init(&mgr->mt_ext);
>>> +
>>> +	INIT_LIST_HEAD(&mgr->evict.list);
>>> +	spin_lock_init(&mgr->evict.lock);
>>> +
>>>    	drm_gpuva_check_overflow(start_offset, range);
>>>    	mgr->mm_start = start_offset;
>>>    	mgr->mm_range = range;
>>> @@ -694,6 +701,9 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
>>>    						     reserve_range)))
>>>    			__drm_gpuva_insert(mgr, &mgr->kernel_alloc_node);
>>>    	}
>>> +
>>> +	drm_gem_private_object_init(drm, &mgr->d_obj, 0);
>>> +	mgr->resv = mgr->d_obj.resv;
>>>    }
>>>    EXPORT_SYMBOL_GPL(drm_gpuva_manager_init);
>>> @@ -713,10 +723,575 @@ drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr)
>>>    		__drm_gpuva_remove(&mgr->kernel_alloc_node);
>>>    	WARN(!RB_EMPTY_ROOT(&mgr->rb.tree.rb_root),
>>> -	     "GPUVA tree is not empty, potentially leaking memory.");
>>> +	     "GPUVA tree is not empty, potentially leaking memory.\n");
>>> +
>>> +	mtree_destroy(&mgr->mt_ext);
>>> +	WARN(!list_empty(&mgr->evict.list), "Evict list should be empty.\n");
>>> +
>>> +	drm_gem_private_object_fini(&mgr->d_obj);
>>>    }
>>>    EXPORT_SYMBOL_GPL(drm_gpuva_manager_destroy);
>>> +/**
>>> + * drm_gpuva_manager_prepare_objects() - prepare all associated BOs
>>> + * @mgr: the &drm_gpuva_manager
>>> + * @num_fences: the amount of &dma_fences to reserve
>>> + *
>>> + * Calls drm_exec_prepare_obj() for all &drm_gem_objects the given
>>> + * &drm_gpuva_manager contains mappings of.
>>> + *
>>> + * Drivers can obtain the corresponding &drm_exec instance through
>>> + * DRM_GPUVA_EXEC(). It is the drivers responsibility to call drm_exec_init()
>>> + * and drm_exec_fini() accordingly.
>>> + *
>>> + * Returns: 0 on success, negative error code on failure.
>>> + */
>>> +int
>>> +drm_gpuva_manager_prepare_objects(struct drm_gpuva_manager *mgr,
>>> +				  unsigned int num_fences)
>>> +{
>>> +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
>>> +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
>>> +	union {
>>> +		void *ptr;
>>> +		uintptr_t cnt;
>>> +	} ref;
>>> +	int ret;
>>> +
>>> +	ret = drm_exec_prepare_obj(exec, &mgr->d_obj, num_fences);
>>> +	if (ret)
>>> +		goto out;
>>> +
>>> +	rcu_read_lock();
>> In xe we're protecting the external object list with an outer lock (same as
>> protecting the mgr itself). Do we need a separate lock for this? In theory,
>> as outlined in the VM_BIND locking document draft, one could probably even
>> use the mgr resv for this, but with more complicated code I guess. Also see
>> the comment below about the data structure chosen.
> The idea is to protect this list with the GPU-VM lock. The locking here is more
> of an implication of the maple tree. Either you use the internal lock of the
> maple tree or RCU respectively, or you give the maple tree an external lock to
> perform lockdep checks on (mt_set_external_lock()). Basically same as here:
>
> https://elixir.bootlin.com/linux/latest/source/drivers/base/regmap/regcache-maple.c#L124

Ah, I suspected it was something along those lines.
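
For reference, the wiring I had in mind looks roughly like the below
(i.e. instead of the plain mt_init() in drm_gpuva_manager_init()).
Purely illustrative and untested; vm->lock is a made-up spinlock here,
and in practice the outer GPU-VM lock is often a mutex, which the maple
tree's external lock (a spinlock_t) wouldn't cover directly:

	/* Let lockdep check the extobj tree against an external
	 * (driver / GPU-VM) lock instead of the tree-internal spinlock
	 * or RCU.
	 */
	mt_init_flags(&mgr->mt_ext, MT_FLAGS_LOCK_EXTERN);
	mt_set_external_lock(&mgr->mt_ext, &vm->lock);

With something like that, the rcu_read_lock() / mas_pause() dance in
drm_gpuva_manager_prepare_objects() could presumably become a plain
assert on the VM lock. Anyway, not a blocker, just comparing notes.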


>
>>> +	mas_for_each(&mas, ref.ptr, ULONG_MAX) {
>>> +		struct drm_gem_object *obj;
>>> +
>>> +		mas_pause(&mas);
>>> +		rcu_read_unlock();
>>> +
>>> +		obj = (struct drm_gem_object *)(uintptr_t)mas.index;
>>> +		ret = drm_exec_prepare_obj(exec, obj, num_fences);
>>> +		if (ret)
>>> +			goto out;
>>> +
>>> +		rcu_read_lock();
>>> +	}
>>> +	rcu_read_unlock();
>>> +
>>> +out:
>>> +	return ret;
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_prepare_objects);
>>> +
>>> +/**
>>> + * drm_gpuva_manager_lock_extra() - lock all dma-resv of all associated BOs
>>> + * @mgr: the &drm_gpuva_manager
>>> + * @fn: callback received by the driver to lock additional dma-resv
>>> + * @priv: private driver data passed to @fn
>>> + * @num_fences: the amount of &dma_fences to reserve
>>> + * @interruptible: sleep interruptible if waiting
>>> + *
>>> + * Acquires all dma-resv locks of all &drm_gem_objects the given
>>> + * &drm_gpuva_manager contains mappings of.
>>> + *
>>> + * Additionally, when calling this function the driver receives the given @fn
>>> + * callback to lock additional dma-resv in the context of the
>>> + * &drm_gpuva_managers &drm_exec instance. Typically, drivers would call
>>> + * drm_exec_prepare_obj() from within this callback.
>>> + *
>>> + * Returns: 0 on success, negative error code on failure.
>>> + */
>>> +int
>>> +drm_gpuva_manager_lock_extra(struct drm_gpuva_manager *mgr,
>>> +			     int (*fn)(struct drm_gpuva_manager *mgr,
>>> +				       void *priv, unsigned int num_fences),
>>> +			     void *priv,
>>> +			     unsigned int num_fences,
>>> +			     bool interruptible)
>>> +{
>>> +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
>>> +	uint32_t flags;
>>> +	int ret;
>>> +
>>> +	flags = (interruptible ? DRM_EXEC_INTERRUPTIBLE_WAIT : 0) |
>>> +		DRM_EXEC_IGNORE_DUPLICATES;
>>> +
>>> +	drm_exec_init(exec, flags);
>>> +
>>> +	drm_exec_until_all_locked(exec) {
>>> +		ret = drm_gpuva_manager_prepare_objects(mgr, num_fences);
>>> +		drm_exec_retry_on_contention(exec);
>>> +		if (ret)
>>> +			goto err;
>>> +
>>> +		if (fn) {
>>> +			ret = fn(mgr, priv, num_fences);
>>> +			drm_exec_retry_on_contention(exec);
>>> +			if (ret)
>>> +				goto err;
>>> +		}
>>> +	}
>>> +
>>> +	return 0;
>>> +
>>> +err:
>>> +	drm_exec_fini(exec);
>>> +	return ret;
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_lock_extra);
>>> +
>>> +static int
>>> +fn_lock_array(struct drm_gpuva_manager *mgr, void *priv,
>>> +				unsigned int num_fences)
>>> +{
>>> +	struct {
>>> +		struct drm_gem_object **objs;
>>> +		unsigned int num_objs;
>>> +	} *args = priv;
>>> +
>>> +	return drm_exec_prepare_array(DRM_GPUVA_EXEC(mgr), args->objs,
>>> +				      args->num_objs, num_fences);
>>> +}
>>> +
>>> +/**
>>> + * drm_gpuva_manager_lock_array() - lock all dma-resv of all associated BOs
>>> + * @mgr: the &drm_gpuva_manager
>>> + * @objs: additional &drm_gem_objects to lock
>>> + * @num_objs: the number of additional &drm_gem_objects to lock
>>> + * @num_fences: the amount of &dma_fences to reserve
>>> + * @interruptible: sleep interruptible if waiting
>>> + *
>>> + * Acquires all dma-resv locks of all &drm_gem_objects the given
>>> + * &drm_gpuva_manager contains mappings of, plus the ones given through @objs.
>>> + *
>>> + * Returns: 0 on success, negative error code on failure.
>>> + */
>>> +int
>>> +drm_gpuva_manager_lock_array(struct drm_gpuva_manager *mgr,
>>> +			     struct drm_gem_object **objs,
>>> +			     unsigned int num_objs,
>>> +			     unsigned int num_fences,
>>> +			     bool interruptible)
>>> +{
>>> +	struct {
>>> +		struct drm_gem_object **objs;
>>> +		unsigned int num_objs;
>>> +	} args;
>>> +
>>> +	args.objs = objs;
>>> +	args.num_objs = num_objs;
>>> +
>>> +	return drm_gpuva_manager_lock_extra(mgr, fn_lock_array, &args,
>>> +					    num_fences, interruptible);
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_lock_array);
>>> +
>>> +/**
>>> + * drm_gpuva_manager_validate() - validate all BOs marked as evicted
>>> + * @mgr: the &drm_gpuva_manager to validate evicted BOs
>>> + *
>>> + * Calls the &drm_gpuva_fn_ops.bo_validate callback for all evicted buffer
>>> + * objects being mapped in the given &drm_gpuva_manager.
>>> + *
>>> + * Returns: 0 on success, negative error code on failure.
>>> + */
>>> +int
>>> +drm_gpuva_manager_validate(struct drm_gpuva_manager *mgr)
>>> +{
>>> +	const struct drm_gpuva_fn_ops *ops = mgr->ops;
>>> +	struct drm_gpuva_gem *vm_bo;
>>> +	int ret;
>>> +
>>> +	if (unlikely(!ops || !ops->bo_validate))
>>> +		return -ENOTSUPP;
>>> +
>>> +	/* At this point we should hold all dma-resv locks of all GEM objects
>>> +	 * associated with this GPU-VM, hence it is safe to walk the list.
>>> +	 */
>>> +	list_for_each_entry(vm_bo, &mgr->evict.list, list.entry.evict) {
>>> +		dma_resv_assert_held(vm_bo->obj->resv);
>>> +
>>> +		ret = ops->bo_validate(vm_bo->obj);
>>> +		if (ret)
>>> +			return ret;
>>> +	}
>>> +
>>> +	return 0;
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_validate);
>>> +
>>> +/**
>>> + * drm_gpuva_manager_resv_add_fence - add fence to private and all extobj
>>> + * dma-resv
>>> + * @mgr: the &drm_gpuva_manager to add a fence to
>>> + * @fence: fence to add
>>> + * @private_usage: private dma-resv usage
>>> + * @extobj_usage: extobj dma-resv usage
>>> + */
>>> +void
>>> +drm_gpuva_manager_resv_add_fence(struct drm_gpuva_manager *mgr,
>>> +				 struct dma_fence *fence,
>>> +				 enum dma_resv_usage private_usage,
>>> +				 enum dma_resv_usage extobj_usage)
>>> +{
>>> +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
>>> +	struct drm_gem_object *obj;
>>> +	unsigned long index;
>>> +
>>> +	drm_exec_for_each_locked_object(exec, index, obj) {
>>> +			dma_resv_assert_held(obj->resv);
>>> +			dma_resv_add_fence(obj->resv, fence,
>>> +					   drm_gpuva_is_extobj(mgr, obj) ?
>>> +					   extobj_usage : private_usage);
>>> +	}
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_resv_add_fence);
>>> +
>>> +static struct drm_gpuva_gem *
>>> +__drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
>>> +		     struct drm_gem_object *obj)
>>> +{
>>> +	struct drm_gpuva_gem *vm_bo;
>>> +
>>> +	drm_gem_gpuva_assert_lock_held(obj);
>>> +
>>> +	drm_gem_for_each_gpuva_gem(vm_bo, obj)
>>> +		if (vm_bo->mgr == mgr)
>>> +			return vm_bo;
>>> +
>>> +	return NULL;
>>> +}
>>> +
>>> +/**
>>> + * drm_gpuva_gem_create() - create a new instance of struct drm_gpuva_gem
>>> + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
>>> + * @obj: The &drm_gem_object being mapped in the @mgr.
>>> + *
>>> + * If provided by the driver, this function uses the &drm_gpuva_fn_ops
>>> + * vm_bo_alloc() callback to allocate.
>>> + *
>>> + * Returns: a pointer to the &drm_gpuva_gem on success, NULL on failure
>>> + */
>>> +struct drm_gpuva_gem *
>>> +drm_gpuva_gem_create(struct drm_gpuva_manager *mgr,
>>> +		     struct drm_gem_object *obj)
>>> +{
>>> +	const struct drm_gpuva_fn_ops *ops = mgr->ops;
>>> +	struct drm_gpuva_gem *vm_bo;
>>> +
>>> +	if (ops && ops->vm_bo_alloc)
>>> +		vm_bo = ops->vm_bo_alloc();
>>> +	else
>>> +		vm_bo = kzalloc(sizeof(*vm_bo), GFP_KERNEL);
>>> +
>>> +	if (unlikely(!vm_bo))
>>> +		return NULL;
>>> +
>>> +	vm_bo->mgr = mgr;
>>> +	vm_bo->obj = obj;
>>> +
>>> +	kref_init(&vm_bo->kref);
>>> +	INIT_LIST_HEAD(&vm_bo->list.gpuva);
>>> +	INIT_LIST_HEAD(&vm_bo->list.entry.gem);
>>> +	INIT_LIST_HEAD(&vm_bo->list.entry.evict);
>>> +
>>> +	drm_gem_object_get(obj);
>>> +
>>> +	return vm_bo;
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_create);
>>> +
>>> +void
>>> +drm_gpuva_gem_destroy(struct kref *kref)
>>> +{
>>> +	struct drm_gpuva_gem *vm_bo = container_of(kref, struct drm_gpuva_gem,
>>> +						   kref);
>>> +	const struct drm_gpuva_fn_ops *ops = vm_bo->mgr->ops;
>>> +
>>> +	drm_gem_object_put(vm_bo->obj);
>>> +
>>> +	if (ops && ops->vm_bo_free)
>>> +		ops->vm_bo_free(vm_bo);
>>> +	else
>>> +		kfree(vm_bo);
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_destroy);
>>> +
>>> +/**
>>> + * drm_gpuva_gem_find() - find the &drm_gpuva_gem for the given
>>> + * &drm_gpuva_manager and &drm_gem_object
>>> + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
>>> + * @obj: The &drm_gem_object being mapped in the @mgr.
>>> + *
>>> + * Find the &drm_gpuva_gem representing the combination of the given
>>> + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
>>> + * count of the &drm_gpuva_gem accordingly.
>>> + *
>>> + * Returns: a pointer to the &drm_gpuva_gem on success, NULL on failure
>>> + */
>>> +struct drm_gpuva_gem *
>>> +drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
>>> +		   struct drm_gem_object *obj)
>>> +{
>>> +	struct drm_gpuva_gem *vm_bo = __drm_gpuva_gem_find(mgr, obj);
>>> +
>>> +	return vm_bo ? drm_gpuva_gem_get(vm_bo) : NULL;
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_find);
>>> +
>>> +/**
>>> + * drm_gpuva_gem_obtain() - obtains an instance of the &drm_gpuva_gem for the
>>> + * given &drm_gpuva_manager and &drm_gem_object
>>> + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
>>> + * @obj: The &drm_gem_object being mapped in the @mgr.
>>> + *
>>> + * Find the &drm_gpuva_gem representing the combination of the given
>>> + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
>>> + * count of the &drm_gpuva_gem accordingly. If not found, allocates a new
>>> + * &drm_gpuva_gem.
>>> + *
>>> + * Returns: a pointer to the &drm_gpuva_gem on success, an ERR_PTR on failure
>>> + */
>>> +struct drm_gpuva_gem *
>>> +drm_gpuva_gem_obtain(struct drm_gpuva_manager *mgr,
>>> +		     struct drm_gem_object *obj)
>>> +{
>>> +	struct drm_gpuva_gem *vm_bo;
>>> +
>>> +	vm_bo = drm_gpuva_gem_find(mgr, obj);
>>> +	if (vm_bo)
>>> +		return vm_bo;
>>> +
>>> +	vm_bo = drm_gpuva_gem_create(mgr, obj);
>>> +	if (!vm_bo)
>>> +		return ERR_PTR(-ENOMEM);
>>> +
>>> +	return vm_bo;
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_obtain);
>>> +
>>> +/**
>>> + * drm_gpuva_gem_obtain_prealloc() - obtains an instance of the &drm_gpuva_gem
>>> + * for the given &drm_gpuva_manager and &drm_gem_object
>>> + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
>>> + * @obj: The &drm_gem_object being mapped in the @mgr.
>>> + *
>>> + * Find the &drm_gpuva_gem representing the combination of the given
>>> + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
>>> + * count of the found &drm_gpuva_gem accordingly, while the @__vm_bo reference
>>> + * count is decreased. If not found @__vm_bo is returned.
>>> + *
>>> + * Returns: a pointer to the found &drm_gpuva_gem or @__vm_bo if no existing
>>> + * &drm_gpuva_gem was found
>>> + */
>>> +struct drm_gpuva_gem *
>>> +drm_gpuva_gem_obtain_prealloc(struct drm_gpuva_manager *mgr,
>>> +			      struct drm_gem_object *obj,
>>> +			      struct drm_gpuva_gem *__vm_bo)
>>> +{
>>> +	struct drm_gpuva_gem *vm_bo;
>>> +
>>> +	vm_bo = drm_gpuva_gem_find(mgr, obj);
>>> +	if (vm_bo) {
>>> +		drm_gpuva_gem_put(__vm_bo);
>>> +		return vm_bo;
>>> +	}
>>> +
>>> +	return __vm_bo;
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_obtain_prealloc);
>>> +
>>> +static int
>>> +__drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
>>> +			  struct drm_gem_object *obj,
>>> +			  gfp_t gfp)
>>> +{
>>> +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
>>> +	union {
>>> +		struct drm_gem_object *obj;
>>> +		uintptr_t index;
>>> +	} gem;
>>> +	union {
>>> +		void *ptr;
>>> +		uintptr_t cnt;
>>> +	} ref;
>>> +	int ret = 0;
>>> +
>>> +	gem.obj = obj;
>>> +	mas_set(&mas, gem.index);
>>> +
>>> +	mas_lock(&mas);
>>> +	ref.ptr = mas_walk(&mas);
>>> +	if (ref.ptr) {
>>> +		++ref.cnt;
>>> +		mas_store(&mas, ref.ptr);
>>> +	} else {
>>> +		if (unlikely(!gfp)) {
>>> +			ret = -EINVAL;
>>> +			goto out;
>>> +		}
>>> +
>>> +		mas_set(&mas, gem.index);
>>> +		ref.cnt = 1;
>>> +		ret = mas_store_gfp(&mas, ref.ptr, gfp);
>>> +		if (likely(!ret))
>>> +			drm_gem_object_get(obj);
>>> +	}
>>> +out:
>>> +	mas_unlock(&mas);
>>> +	return ret;
>>> +}
>>> +
>>> +static void
>>> +__drm_gpuva_extobj_remove(struct drm_gpuva_manager *mgr,
>>> +			  struct drm_gem_object *obj)
>>> +{
>>> +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
>>> +	union {
>>> +		struct drm_gem_object *obj;
>>> +		uintptr_t index;
>>> +	} gem;
>>> +	union {
>>> +		void *ptr;
>>> +		uintptr_t cnt;
>>> +	} ref;
>>> +
>>> +	gem.obj = obj;
>>> +	mas_set(&mas, gem.index);
>>> +
>>> +	mas_lock(&mas);
>>> +	if (unlikely(!(ref.ptr = mas_walk(&mas))))
>>> +		goto out;
>>> +
>>> +	if (!--ref.cnt) {
>>> +		mas_erase(&mas);
>>> +		drm_gem_object_put(obj);
>>> +	} else {
>>> +		mas_store(&mas, ref.ptr);
>>> +	}
>>> +out:
>>> +	mas_unlock(&mas);
>>> +}
>>> +
>>> +/**
>>> + * drm_gpuva_extobj_insert - insert an external &drm_gem_object
>>> + * @mgr: the &drm_gpuva_manager to insert into
>>> + * @obj: the &drm_gem_object to insert as extobj
>>> + *
>>> + * Insert a &drm_gem_object into the &drm_gpuva_managers external object tree.
>>> + * If the &drm_gem_object already exists in the tree, the reference counter
>>> + * of this external object is increased by one.
>>> + *
>>> + * Drivers should insert the external &drm_gem_object before the dma-fence
>>> + * signalling critical section, e.g. when submitting the job, and before
>>> + * locking all &drm_gem_objects of a GPU-VM, e.g. with drm_gpuva_manager_lock()
>>> + * or its variants.
>>> + *
>>> + * Returns: 0 on success, negative error code on failure.
>>> + */
>>> +int
>>> +drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
>>> +			struct drm_gem_object *obj)
>>> +{
>>> +	return drm_gpuva_is_extobj(mgr, obj) ?
>>> +		__drm_gpuva_extobj_insert(mgr, obj, GFP_KERNEL) : 0;
>>> +
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_insert);
>>> +
>>> +/**
>>> + * drm_gpuva_extobj_get - increase the reference count of an external
>>> + * &drm_gem_object
>>> + * @mgr: the &drm_gpuva_manager storing the extobj
>>> + * @obj: the &drm_gem_object representing the extobj
>>> + *
>>> + * Increases the reference count of the extobj represented by @obj.
>>> + *
>>> + * Drivers should call this for every &drm_gpuva backed by a &drm_gem_object
>>> + * being inserted.
>>> + *
>>> + * For &drm_gpuva_op_remap operations drivers should make sure to only take an
>>> + * additional reference if the re-map operation splits an existing &drm_gpuva
>>> + * into two separate ones.
>>> + *
>>> + * See also drm_gpuva_map_get() and drm_gpuva_remap_get().
>>> + */
>>> +void
>>> +drm_gpuva_extobj_get(struct drm_gpuva_manager *mgr,
>>> +		     struct drm_gem_object *obj)
>>> +{
>>> +	if (drm_gpuva_is_extobj(mgr, obj))
>>> +		WARN(__drm_gpuva_extobj_insert(mgr, obj, 0),
>>> +		     "Can't increase ref-count of non-existent extobj.");
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_get);
>>> +
>>> +/**
>>> + * drm_gpuva_extobj_put - decrease the reference count of an external
>>> + * &drm_gem_object
>>> + * @mgr: the &drm_gpuva_manager storing the extobj
>>> + * @obj: the &drm_gem_object representing the extobj
>>> + *
>>> + * Decreases the reference count of the extobj represented by @obj.
>>> + *
>>> + * Drivers should call this for every &drm_gpuva backed by a &drm_gem_object
>>> + * being removed from the GPU VA space.
>>> + *
>>> + * See also drm_gpuva_unmap_put().
>>> + */
>>> +void
>>> +drm_gpuva_extobj_put(struct drm_gpuva_manager *mgr,
>>> +		     struct drm_gem_object *obj)
>>> +{
>>> +	if (drm_gpuva_is_extobj(mgr, obj))
>>> +		__drm_gpuva_extobj_remove(mgr, obj);
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_put);
>>> +
>>> +/**
>>> + * drm_gpuva_gem_evict() - add / remove a &drm_gem_object to / from a
>>> + * &drm_gpuva_managers evicted list
>>> + * @obj: the &drm_gem_object to add or remove
>>> + * @evict: indicates whether the object is evicted
>>> + *
>>> + * Adds the &drm_gem_object to, or removes it from, the evict list of every
>>> + * &drm_gpuva_manager containing a mapping of this &drm_gem_object.
>>> + */
>>> +void
>>> +drm_gpuva_gem_evict(struct drm_gem_object *obj, bool evict)
>>> +{
>>> +	struct drm_gpuva_gem *vm_bo;
>>> +
>>> +	/* Required for iterating the GEMs GPUVA GEM list. If no driver specific
>>> +	 * lock has been set, the list is protected with the GEMs dma-resv lock.
>>> +	 */
>>> +	drm_gem_gpuva_assert_lock_held(obj);
>>> +
>>> +	/* Required to protect the GPUVA managers evict list against concurrent
>>> +	 * access through drm_gpuva_manager_validate(). Concurrent insertions to
>>> +	 * the evict list through different GEM object evictions are protected
>>> +	 * by the GPUVA managers evict lock.
>>> +	 */
>>> +	dma_resv_assert_held(obj->resv);
>>> +
>>> +	drm_gem_for_each_gpuva_gem(vm_bo, obj) {
>>> +		struct drm_gpuva_manager *mgr = vm_bo->mgr;
>>> +
>>> +		spin_lock(&mgr->evict.lock);
>>> +		if (evict)
>>> +			list_add_tail(&vm_bo->list.entry.evict,
>>> +				      &mgr->evict.list);
>>> +		else
>>> +			list_del_init(&vm_bo->list.entry.evict);
>>> +		spin_unlock(&mgr->evict.lock);
>>> +	}
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_evict);
>>> +
>>>    static int
>>>    __drm_gpuva_insert(struct drm_gpuva_manager *mgr,
>>>    		   struct drm_gpuva *va)
>>> @@ -806,15 +1381,20 @@ EXPORT_SYMBOL_GPL(drm_gpuva_remove);
>>>    /**
>>>     * drm_gpuva_link() - link a &drm_gpuva
>>>     * @va: the &drm_gpuva to link
>>> + * @vm_bo: the &drm_gpuva_gem to add the &drm_gpuva to
>>>     *
>>> - * This adds the given &va to the GPU VA list of the &drm_gem_object it is
>>> - * associated with.
>>> + * This adds the given &va to the GPU VA list of the &drm_gpuva_gem and the
>>> + * &drm_gpuva_gem to the &drm_gem_object it is associated with.
>>> + *
>>> + * For every &drm_gpuva entry added to the &drm_gpuva_gem an additional
>>> + * reference of the latter is taken.
>>>     *
>>>     * This function expects the caller to protect the GEM's GPUVA list against
>>> - * concurrent access using the GEMs dma_resv lock.
>>> + * concurrent access using either the GEMs dma_resv lock or a driver specific
>>> + * lock set through drm_gem_gpuva_set_lock().
>>>     */
>>>    void
>>> -drm_gpuva_link(struct drm_gpuva *va)
>>> +drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo)
>>>    {
>>>    	struct drm_gem_object *obj = va->gem.obj;
>>> @@ -823,7 +1403,10 @@ drm_gpuva_link(struct drm_gpuva *va)
>>>    	drm_gem_gpuva_assert_lock_held(obj);
>>> -	list_add_tail(&va->gem.entry, &obj->gpuva.list);
>>> +	drm_gpuva_gem_get(vm_bo);
>>> +	list_add_tail(&va->gem.entry, &vm_bo->list.gpuva);
>>> +	if (list_empty(&vm_bo->list.entry.gem))
>>> +		list_add_tail(&vm_bo->list.entry.gem, &obj->gpuva.list);
>>>    }
>>>    EXPORT_SYMBOL_GPL(drm_gpuva_link);
>>> @@ -834,20 +1417,39 @@ EXPORT_SYMBOL_GPL(drm_gpuva_link);
>>>     * This removes the given &va from the GPU VA list of the &drm_gem_object it is
>>>     * associated with.
>>>     *
>>> + * This removes the given &va from the GPU VA list of the &drm_gpuva_gem and
>>> + * the &drm_gpuva_gem from the &drm_gem_object it is associated with in case
>>> + * this call unlinks the last &drm_gpuva from the &drm_gpuva_gem.
>>> + *
>>> + * For every &drm_gpuva entry removed from the &drm_gpuva_gem a reference of
>>> + * the latter is dropped.
>>> + *
>>>     * This function expects the caller to protect the GEM's GPUVA list against
>>> - * concurrent access using the GEMs dma_resv lock.
>>> + * concurrent access using either the GEMs dma_resv lock or a driver specific
>>> + * lock set through drm_gem_gpuva_set_lock().
>>>     */
>>>    void
>>>    drm_gpuva_unlink(struct drm_gpuva *va)
>>>    {
>>>    	struct drm_gem_object *obj = va->gem.obj;
>>> +	struct drm_gpuva_gem *vm_bo;
>>>    	if (unlikely(!obj))
>>>    		return;
>>>    	drm_gem_gpuva_assert_lock_held(obj);
>>> +	vm_bo = __drm_gpuva_gem_find(va->mgr, obj);
>>> +	if (WARN(!vm_bo, "GPUVA doesn't seem to be linked.\n"))
>>> +		return;
>>> +
>>>    	list_del_init(&va->gem.entry);
>>> +
>>> +	if (list_empty(&vm_bo->list.gpuva)) {
>>> +		list_del_init(&vm_bo->list.entry.gem);
>>> +		list_del_init(&vm_bo->list.entry.evict);
>>> +	}
>>> +	drm_gpuva_gem_put(vm_bo);
>>>    }
>>>    EXPORT_SYMBOL_GPL(drm_gpuva_unlink);
>>> @@ -977,6 +1579,26 @@ drm_gpuva_map(struct drm_gpuva_manager *mgr,
>>>    }
>>>    EXPORT_SYMBOL_GPL(drm_gpuva_map);
>>> +/**
>>> + * drm_gpuva_map_get() - helper to insert a &drm_gpuva according to a
>>> + * &drm_gpuva_op_map
>>> + * @mgr: the &drm_gpuva_manager
>>> + * @va: the &drm_gpuva to insert
>>> + * @op: the &drm_gpuva_op_map to initialize @va with
>>> + *
>>> + * Initializes the @va from the @op and inserts it into the given @mgr and
>>> + * increases the reference count of the corresponding extobj.
>>> + */
>>> +void
>>> +drm_gpuva_map_get(struct drm_gpuva_manager *mgr,
>>> +		  struct drm_gpuva *va,
>>> +		  struct drm_gpuva_op_map *op)
>>> +{
>>> +	drm_gpuva_map(mgr, va, op);
>>> +	drm_gpuva_extobj_get(mgr, va->gem.obj);
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_map_get);
>>> +
>>>    /**
>>>     * drm_gpuva_remap() - helper to remap a &drm_gpuva according to a
>>>     * &drm_gpuva_op_remap
>>> @@ -992,10 +1614,10 @@ drm_gpuva_remap(struct drm_gpuva *prev,
>>>    		struct drm_gpuva *next,
>>>    		struct drm_gpuva_op_remap *op)
>>>    {
>>> -	struct drm_gpuva *curr = op->unmap->va;
>>> -	struct drm_gpuva_manager *mgr = curr->mgr;
>>> +	struct drm_gpuva *va = op->unmap->va;
>>> +	struct drm_gpuva_manager *mgr = va->mgr;
>>> -	drm_gpuva_remove(curr);
>>> +	drm_gpuva_remove(va);
>>>    	if (op->prev) {
>>>    		drm_gpuva_init_from_op(prev, op->prev);
>>> @@ -1009,6 +1631,31 @@ drm_gpuva_remap(struct drm_gpuva *prev,
>>>    }
>>>    EXPORT_SYMBOL_GPL(drm_gpuva_remap);
>>> +/**
>>> + * drm_gpuva_remap_get() - helper to remap a &drm_gpuva according to a
>>> + * &drm_gpuva_op_remap
>>> + * @prev: the &drm_gpuva to remap when keeping the start of a mapping
>>> + * @next: the &drm_gpuva to remap when keeping the end of a mapping
>>> + * @op: the &drm_gpuva_op_remap to initialize @prev and @next with
>>> + *
>>> + * Removes the currently mapped &drm_gpuva and remaps it using @prev and/or
>>> + * @next. Additionally, if the re-map splits the existing &drm_gpuva into two
>>> + * separate mappings, increases the reference count of the corresponding extobj.
>>> + */
>>> +void
>>> +drm_gpuva_remap_get(struct drm_gpuva *prev,
>>> +		    struct drm_gpuva *next,
>>> +		    struct drm_gpuva_op_remap *op)
>>> +{
>>> +	struct drm_gpuva *va = op->unmap->va;
>>> +	struct drm_gpuva_manager *mgr = va->mgr;
>>> +
>>> +	drm_gpuva_remap(prev, next, op);
>>> +	if (op->prev && op->next)
>>> +		drm_gpuva_extobj_get(mgr, va->gem.obj);
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_remap_get);
>>> +
>>>    /**
>>>     * drm_gpuva_unmap() - helper to remove a &drm_gpuva according to a
>>>     * &drm_gpuva_op_unmap
>>> @@ -1023,6 +1670,24 @@ drm_gpuva_unmap(struct drm_gpuva_op_unmap *op)
>>>    }
>>>    EXPORT_SYMBOL_GPL(drm_gpuva_unmap);
>>> +/**
>>> + * drm_gpuva_unmap_put() - helper to remove a &drm_gpuva according to a
>>> + * &drm_gpuva_op_unmap
>>> + * @op: the &drm_gpuva_op_unmap specifying the &drm_gpuva to remove
>>> + *
>>> + * Removes the &drm_gpuva associated with the &drm_gpuva_op_unmap and decreases
>>> + * the reference count of the corresponding extobj.
>>> + */
>>> +void
>>> +drm_gpuva_unmap_put(struct drm_gpuva_op_unmap *op)
>>> +{
>>> +	struct drm_gpuva *va = op->va;
>>> +
>>> +	drm_gpuva_unmap(op);
>>> +	drm_gpuva_extobj_put(va->mgr, va->gem.obj);
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_unmap_put);
>>> +
>>>    static int
>>>    op_map_cb(const struct drm_gpuva_fn_ops *fn, void *priv,
>>>    	  u64 addr, u64 range,
>>> @@ -1663,6 +2328,7 @@ drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
>>>    {
>>>    	struct drm_gpuva_ops *ops;
>>>    	struct drm_gpuva_op *op;
>>> +	struct drm_gpuva_gem *vm_bo;
>>>    	struct drm_gpuva *va;
>>>    	int ret;
>>> @@ -1674,7 +2340,7 @@ drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
>>>    	INIT_LIST_HEAD(&ops->list);
>>> -	drm_gem_for_each_gpuva(va, obj) {
>>> +	drm_gem_for_each_gpuva(va, vm_bo, mgr, obj) {
>>>    		op = gpuva_op_alloc(mgr);
>>>    		if (!op) {
>>>    			ret = -ENOMEM;
>>> diff --git a/include/drm/drm_gem.h b/include/drm/drm_gem.h
>>> index bc9f6aa2f3fe..783ed3ab440d 100644
>>> --- a/include/drm/drm_gem.h
>>> +++ b/include/drm/drm_gem.h
>>> @@ -571,7 +571,7 @@ int drm_gem_evict(struct drm_gem_object *obj);
>>>     * drm_gem_gpuva_init() - initialize the gpuva list of a GEM object
>>>     * @obj: the &drm_gem_object
>>>     *
>>> - * This initializes the &drm_gem_object's &drm_gpuva list.
>>> + * This initializes the &drm_gem_object's &drm_gpuva_gem list.
>>>     *
>>>     * Calling this function is only necessary for drivers intending to support the
>>>     * &drm_driver_feature DRIVER_GEM_GPUVA.
>>> @@ -584,28 +584,44 @@ static inline void drm_gem_gpuva_init(struct drm_gem_object *obj)
>>>    }
>>>    /**
>>> - * drm_gem_for_each_gpuva() - iternator to walk over a list of gpuvas
>>> - * @entry__: &drm_gpuva structure to assign to in each iteration step
>>> - * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
>>> + * drm_gem_for_each_gpuva_gem() - iterator to walk over a list of &drm_gpuva_gem
>>> + * @entry__: &drm_gpuva_gem structure to assign to in each iteration step
>>> + * @obj__: the &drm_gem_object the &drm_gpuva_gem to walk are associated with
>>>     *
>>> - * This iterator walks over all &drm_gpuva structures associated with the
>>> - * &drm_gpuva_manager.
>>> + * This iterator walks over all &drm_gpuva_gem structures associated with the
>>> + * &drm_gem_object.
>>>     */
>>> -#define drm_gem_for_each_gpuva(entry__, obj__) \
>>> -	list_for_each_entry(entry__, &(obj__)->gpuva.list, gem.entry)
>>> +#define drm_gem_for_each_gpuva_gem(entry__, obj__) \
>>> +	list_for_each_entry(entry__, &(obj__)->gpuva.list, list.entry.gem)
>>>    /**
>>> - * drm_gem_for_each_gpuva_safe() - iternator to safely walk over a list of
>>> - * gpuvas
>>> - * @entry__: &drm_gpuva structure to assign to in each iteration step
>>> - * @next__: &next &drm_gpuva to store the next step
>>> - * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
>>> + * drm_gem_for_each_gpuva_gem_safe() - iterator to safely walk over a list of
>>> + * &drm_gpuva_gem
>>> + * @entry__: &drm_gpuva_gem structure to assign to in each iteration step
>>> + * @next__: &next &drm_gpuva_gem to store the next step
>>> + * @obj__: the &drm_gem_object the &drm_gpuva_gem to walk are associated with
>>>     *
>>> - * This iterator walks over all &drm_gpuva structures associated with the
>>> + * This iterator walks over all &drm_gpuva_gem structures associated with the
>>>     * &drm_gem_object. It is implemented with list_for_each_entry_safe(), hence
>>>     * it is save against removal of elements.
>>>     */
>>> -#define drm_gem_for_each_gpuva_safe(entry__, next__, obj__) \
>>> -	list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list, gem.entry)
>>> +#define drm_gem_for_each_gpuva_gem_safe(entry__, next__, obj__) \
>>> +	list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list, list.entry.gem)
>>> +
>>> +/**
>>> + * drm_gem_for_each_gpuva() - iterator to walk over a list of &drm_gpuva
>>> + * @va__: &drm_gpuva structure to assign to in each iteration step
>>> + * @vm_bo__: the &drm_gpuva_gem representing the @mgr__ and @obj__ combination
>>> + * @mgr__: the &drm_gpuva_manager the &drm_gpuvas to walk are associated with
>>> + * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
>>> + *
>>> + * This iterator walks over all &drm_gpuva structures associated with the
>>> + * &drm_gpuva_manager and &drm_gem_object.
>>> + */
>>> +#define drm_gem_for_each_gpuva(va__, vm_bo__, mgr__, obj__) \
>>> +	for (vm_bo__ = drm_gpuva_gem_find(mgr__, obj__), \
>>> +	     va__ = vm_bo__ ? list_first_entry(&vm_bo__->list.gpuva, typeof(*va__), gem.entry) : NULL; \
>>> +	     va__ && !list_entry_is_head(va__, &vm_bo__->list.gpuva, gem.entry); \
>>> +	     va__ = list_next_entry(va__, gem.entry))
>>>    #endif /* __DRM_GEM_H__ */
>>> diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h
>>> index ed8d50200cc3..693e2da3f425 100644
>>> --- a/include/drm/drm_gpuva_mgr.h
>>> +++ b/include/drm/drm_gpuva_mgr.h
>>> @@ -26,12 +26,16 @@
>>>     */
>>>    #include <linux/list.h>
>>> +#include <linux/dma-resv.h>
>>> +#include <linux/maple_tree.h>
>>>    #include <linux/rbtree.h>
>>>    #include <linux/types.h>
>>>    #include <drm/drm_gem.h>
>>> +#include <drm/drm_exec.h>
>>>    struct drm_gpuva_manager;
>>> +struct drm_gpuva_gem;
>>>    struct drm_gpuva_fn_ops;
>>>    /**
>>> @@ -140,7 +144,7 @@ struct drm_gpuva {
>>>    int drm_gpuva_insert(struct drm_gpuva_manager *mgr, struct drm_gpuva *va);
>>>    void drm_gpuva_remove(struct drm_gpuva *va);
>>> -void drm_gpuva_link(struct drm_gpuva *va);
>>> +void drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo);
>>>    void drm_gpuva_unlink(struct drm_gpuva *va);
>>>    struct drm_gpuva *drm_gpuva_find(struct drm_gpuva_manager *mgr,
>>> @@ -240,15 +244,137 @@ struct drm_gpuva_manager {
>>>    	 * @ops: &drm_gpuva_fn_ops providing the split/merge steps to drivers
>>>    	 */
>>>    	const struct drm_gpuva_fn_ops *ops;
>>> +
>>> +	/**
>>> +	 * @d_obj: Dummy GEM object; used internally to pass the GPU VMs
>>> +	 * dma-resv to &drm_exec.
>>> +	 */
>>> +	struct drm_gem_object d_obj;
>>> +
>>> +	/**
>>> +	 * @resv: the &dma_resv for &drm_gem_objects mapped in this GPU VA
>>> +	 * space
>>> +	 */
>>> +	struct dma_resv *resv;
>>> +
>>> +	/**
>>> +	 * @exec: the &drm_exec helper to lock external &drm_gem_objects
>>> +	 */
>>> +	struct drm_exec exec;
>>> +
>>> +	/**
>>> +	 * @mt_ext: &maple_tree storing external &drm_gem_objects
>>> +	 */
>>> +	struct maple_tree mt_ext;
>> Why are you using a maple tree here? Insertion and removal is O(log(n))
>> instead of O(1) for a list?
>>
> Having a list of drm_gem_objects directly wouldn't work, as multiple GPU-VMs
> could have mappings of the same extobj.
>
> I considered using the VM_BO abstraction (struct drm_gpuva_gem) as list entry
> instead, which also seems to be the obvious choice. However, there is a locking
> conflict.
>
> A drm_gem_object keeps a list of drm_gpuva_gems, while each drm_gpuva_gem keeps
> a list of drm_gpuvas. Both lists are either protected with the dma-resv lock of
> the corresponding drm_gem_object, or with an external lock provided by the
> driver (see drm_gem_gpuva_set_lock()). The latter is used by drivers performing
> changes on the GPUVA space directly from the fence signalling path.
>
> Now, similar to what drm_gpuva_link() and drm_gpuva_unlink() are doing already,
> we'd want to add a drm_gpuva_gem to the extobj list for the first mapping being
> linked and we'd want to remove it for the last one being unlinked.
>
> (Actually we'd want to add the drm_gpuva_gem object to the extobj list even
> before, because otherwise we'd not acquire this GEM object's dma-resv lock
> through drm_gpuva_manager_lock(). But that's trivial, we could do that when we
> create the drm_gpuva_gem, which we need to do anyways.)
>
> Anyway, we'd probably want to keep removing the drm_gpuva_gem from the extobj
> list from drm_gpuva_unlink() when the last mapping of this BO is unlinked. In
> order to do so, we'd (as discussed above) either need to hold the outer GPU-VM
> lock or the GPU-VMs dma-resv lock. Both would be illegal in the case
> drm_gpuva_unlink() is called from within the fence signalling path. For drivers
> like XE or Nouveau, we'd at least need to make sure to not mess up the locking
> hierarchy of GPU-VM lock and dma-resv lock of the corresponding BO.
>
> Considering all that, I thought it's probably better to track extobjs separate
> from the drm_gpuva_gem, hence the maple tree choice.

Hm. OK, in Xe we have a list of the xe_vmas (drm_gpuvas) that point
to external objects; in the case of multiple mappings to the same gem
object, only one of the drm_gpuvas is in the list. These are protected
by the GPU-VM lock. I don't see a problem with removing those from the
fence signalling path, though?

Although, assuming that's a no-go for GPUVA, wouldn't an XArray be a
better choice, keeping O(1)?
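
For illustration only, a minimal sketch of what such an XArray-based variant
could look like -- keyed by the GEM object pointer, with the per-VM reference
count stored as an XArray value entry. The extobj_ref_get()/extobj_ref_put()
names are made up, and the caller is assumed to hold the outer GPU-VM lock so
the load/store pairs don't race:

/* relies on <linux/xarray.h> and <drm/drm_gem.h> */
static int extobj_ref_get(struct xarray *xa, struct drm_gem_object *obj)
{
	unsigned long idx = (unsigned long)obj;
	void *entry = xa_load(xa, idx);
	unsigned long cnt = entry ? xa_to_value(entry) : 0;

	/* xa_store() returns the old entry or an XA_ERROR() */
	return xa_err(xa_store(xa, idx, xa_mk_value(cnt + 1), GFP_KERNEL));
}

static void extobj_ref_put(struct xarray *xa, struct drm_gem_object *obj)
{
	unsigned long idx = (unsigned long)obj;
	void *entry = xa_load(xa, idx);
	unsigned long cnt = entry ? xa_to_value(entry) : 0;

	if (WARN_ON(!cnt))
		return;

	if (--cnt)
		xa_store(xa, idx, xa_mk_value(cnt), GFP_KERNEL);
	else
		xa_erase(xa, idx);
}

Insertion, lookup and removal would then be effectively O(1), at the cost of
keeping the reference count outside of the drm_gpuva_gem, just like the maple
tree version does.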

>
>>> +
>>> +	/**
>>> +	 * @evict: structure holding the evict list and evict list lock
>>> +	 */
>>> +	struct {
>>> +		/**
>>> +		 * @list: &list_head storing &drm_gem_objects currently being
>>> +		 * evicted
>>> +		 */
>>> +		struct list_head list;
>>> +
>>> +		/**
>>> +		 * @lock: spinlock to protect the evict list against concurrent
>>> +		 * insertion / removal of different &drm_gpuva_gems
>>> +		 */
>>> +		spinlock_t lock;
>>> +	} evict;
>>>    };
>>>    void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
>>> +			    struct drm_device *drm,
>>>    			    const char *name,
>>>    			    u64 start_offset, u64 range,
>>>    			    u64 reserve_offset, u64 reserve_range,
>>>    			    const struct drm_gpuva_fn_ops *ops);
>>>    void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr);
>>> +/**
>>> + * DRM_GPUVA_EXEC - returns the &drm_gpuva_managers &drm_exec instance
>>> + * @mgr: the &drm_gpuva_managers to return the &drm_exec instance for
>>> + */
>>> +#define DRM_GPUVA_EXEC(mgr)	&(mgr)->exec
>> A struct ww_acquire_ctx and thus a drm_exec is fundamentally per task and
>> should typically be allocated on the stack. Otherwise you'd need to protect
>> the mgr->exec member with an exclusive lock throughout the locking process,
>> and that's not what we want.
> Oh, good point. I think it works in Nouveau, because there it's implicitly
> protected with the job submission lock.
>
>> Did you consider subclassing a drm_exec for drm_gpuva purposes and add
>> needed ops to it: Like so:
> That's a good idea, will take this into V2.

Actually, I'm not fully sure that was a good idea: I now have a
working version of Xe ported over to drm_exec, having these helpers in
mind and with the intention to start using them as they mature. What I
found, though, is that open-coding the drm_exec loop is not all that bad,
but that building blocks that can be called from within the loop are useful:

Like drm_gpuva_prepare_objects() and an imaginary
drm_gpuva_prepare_gpuva() that locks the vm resv and the resv of the
object (if different and the gpuva points to the object). And
drm_gpuva_prepare_array(), although we don't use it within Xe. That means
you can use these building blocks like helpers and avoid the fn()
callback by instead open-coding.

But I guess YMMV.

>
>> struct drm_gpuva_exec_ops {
>>      int (*fn) (struct drm_gpuva_exec *exec, int num_fences);
> Is this the fn argument from drm_gpuva_manager_lock_extra()?
>
>>      int (*bo_validate) (struct drm_gpuva_exec *exec, struct drm_gem_object
>> *obj);
> I guess we could also keep that within the drm_gpuva_fn_ops? This should always
> be the same callback, right?
>
>> };
>>
>> struct drm_gpuva_exec {
>>      const struct drm_gpuva_exec_ops *ops;
>>      struct drm_exec exec;
>>      struct drm_gpuva_manager *mgr;
>> };
>>
>> Although I'd actually expect bo_validate to be part of fn in the typical
>> case. The drm_gpuva_exec would then be allocated by the caller on the stack.
> This doesn't sound like my assumption about fn() above is correct.

Well, one important thing in our conversion is that ttm_bo_validate()
needs to be in the until_all_locked() loop. We soon want to be able to
use sleeping locks for eviction, so an xe_bo_validate() would, at least
temporarily, add locked objects to the drm_exec list of locked objects.
That means everything that may end up calling validate deep within the
call chain needs to be part of the until_all_locked() loop, so our
drm_gpuva_manager_lock_extra() fn callback would include those validates
and look different all the time. Hence that's why open-coding isn't all
that bad...
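
As a rough sketch (not part of the patch), the open-coded pattern could look
like this, reusing drm_gpuva_manager_prepare_objects() as a building block;
drv_validate_evicted() is a made-up placeholder for a driver-side validation
step that may lock, and hence retry on, additional objects:

static int drv_lock_and_validate(struct drm_gpuva_manager *mgr,
				 unsigned int num_fences)
{
	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
	int ret;

	drm_exec_init(exec, DRM_EXEC_INTERRUPTIBLE_WAIT);

	drm_exec_until_all_locked(exec) {
		ret = drm_gpuva_manager_prepare_objects(mgr, num_fences);
		drm_exec_retry_on_contention(exec);
		if (ret)
			goto err;

		/* validation sits inside the loop, since it may add
		 * further locked objects to the drm_exec instance
		 */
		ret = drv_validate_evicted(mgr, exec);
		drm_exec_retry_on_contention(exec);
		if (ret)
			goto err;
	}

	return 0;

err:
	drm_exec_fini(exec);
	return ret;
}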

/Thomas


>
>>
>>> +
>>> +int drm_gpuva_manager_lock_extra(struct drm_gpuva_manager *mgr,
>>> +				 int (*fn)(struct drm_gpuva_manager *mgr,
>>> +					   void *priv, unsigned int num_fences),
>>> +				 void *priv,
>>> +				 unsigned int num_fences,
>>> +				 bool interruptible);
>>> +
>>> +int drm_gpuva_manager_lock_array(struct drm_gpuva_manager *mgr,
>>> +				 struct drm_gem_object **objs,
>>> +				 unsigned int num_objs,
>>> +				 unsigned int num_fences,
>>> +				 bool interruptible);
>>> +
>>> +/**
>>> + * drm_gpuva_manager_lock() - lock all dma-resv of all associated BOs
>>> + * @mgr: the &drm_gpuva_manager
>>> + * @num_fences: the amount of &dma_fences to reserve
>>> + * @interruptible: sleep interruptible if waiting
>>> + *
>>> + * Acquires all dma-resv locks of all &drm_gem_objects the given
>>> + * &drm_gpuva_manager contains mappings of.
>>> + *
>>> + * Returns: 0 on success, negative error code on failure.
>>> + */
>>> +static inline int
>>> +drm_gpuva_manager_lock(struct drm_gpuva_manager *mgr,
>>> +		       unsigned int num_fences,
>>> +		       bool interruptible)
>>> +{
>>> +	return drm_gpuva_manager_lock_extra(mgr, NULL, NULL, num_fences,
>>> +					    interruptible);
>>> +}
>>> +
>>> +/**
>>> + * drm_gpuva_manager_unlock() - unlock all dma-resv of all associated BOs
>>> + * @mgr: the &drm_gpuva_manager
>>> + *
>>> + * Releases all dma-resv locks of all &drm_gem_objects previously acquired
>>> + * through drm_gpuva_manager_lock() or its variants.
>>> + */
>>> +static inline void
>>> +drm_gpuva_manager_unlock(struct drm_gpuva_manager *mgr)
>>> +{
>>> +	drm_exec_fini(&mgr->exec);
>>> +}
>>> +
>>> +int drm_gpuva_manager_validate(struct drm_gpuva_manager *mgr);
>>> +void drm_gpuva_manager_resv_add_fence(struct drm_gpuva_manager *mgr,
>>> +				      struct dma_fence *fence,
>>> +				      enum dma_resv_usage private_usage,
>>> +				      enum dma_resv_usage extobj_usage);
>>> +
>>> +int drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
>>> +			    struct drm_gem_object *obj);
>>> +void drm_gpuva_extobj_get(struct drm_gpuva_manager *mgr,
>>> +			  struct drm_gem_object *obj);
>>> +void drm_gpuva_extobj_put(struct drm_gpuva_manager *mgr,
>>> +			  struct drm_gem_object *obj);
>>> +
>>> +/**
>>> + * drm_gpuva_is_extobj() - indicates whether the given &drm_gem_object is an
>>> + * external object
>>> + * @mgr: the &drm_gpuva_manager to check
>>> + * @obj: the &drm_gem_object to check
>>> + *
>>> + * Returns: true if the &drm_gem_object &dma_resv differs from the
>>> + * &drm_gpuva_managers &dma_resv, false otherwise
>>> + */
>>> +static inline bool drm_gpuva_is_extobj(struct drm_gpuva_manager *mgr,
>>> +				       struct drm_gem_object *obj)
>>> +{
>>> +	return obj && obj->resv != mgr->resv;
>>> +}
>>> +
>>>    static inline struct drm_gpuva *
>>>    __drm_gpuva_next(struct drm_gpuva *va)
>>>    {
>>> @@ -327,6 +453,138 @@ __drm_gpuva_next(struct drm_gpuva *va)
>>>    #define drm_gpuva_for_each_va_safe(va__, next__, mgr__) \
>>>    	list_for_each_entry_safe(va__, next__, &(mgr__)->rb.list, rb.entry)
>>> +/**
>>> + * struct drm_gpuva_gem - structure representing a &drm_gpuva_manager and
>>> + * &drm_gem_object combination
>>> + *
>>> + * This structure is an abstraction representing a &drm_gpuva_manager and
>>> + * &drm_gem_object combination. It serves as an indirection to accelerate
>>> + * iterating all &drm_gpuvas within a &drm_gpuva_manager backed by the same
>>> + * &drm_gem_object.
>>> + *
>>> + * Furthermore it is used to cache evicted GEM objects for a certain GPU-VM to
>>> + * accelerate validation.
>>> + *
>>> + * Typically, drivers want to create an instance of a struct drm_gpuva_gem once
>>> + * a GEM object is mapped first in a GPU-VM and release the instance once the
>>> + * last mapping of the GEM object in this GPU-VM is unmapped.
>>> + */
>>> +struct drm_gpuva_gem {
>>> +
>>> +	/**
>>> +	 * @mgr: The &drm_gpuva_manager the @obj is mapped in.
>>> +	 */
>>> +	struct drm_gpuva_manager *mgr;
>>> +
>>> +	/**
>>> +	 * @obj: The &drm_gem_object being mapped in the @mgr.
>>> +	 */
>>> +	struct drm_gem_object *obj;
>>> +
>>> +	/**
>>> +	 * @kref: The reference count for this &drm_gpuva_gem.
>>> +	 */
>>> +	struct kref kref;
>>> +
>>> +	/**
>>> +	 * @list: Structure containing all &list_heads.
>>> +	 */
>>> +	struct {
>>> +		/**
>>> +		 * @gpuva: The list of linked &drm_gpuvas.
>>> +		 */
>>> +		struct list_head gpuva;
>>> +
>>> +		/**
>>> +		 * @entry: Structure containing all &list_heads serving as
>>> +		 * entry.
>>> +		 */
>>> +		struct {
>>> +			/**
>>> +			 * @gem: List entry to attach to the &drm_gem_objects
>>> +			 * gpuva list.
>>> +			 */
>>> +			struct list_head gem;
>>> +
>>> +			/**
>>> +			 * @evict: List entry to attach to the
>>> +			 * &drm_gpuva_managers evict list.
>>> +			 */
>>> +			struct list_head evict;
>>> +		} entry;
>>> +	} list;
>>> +};
>>> +
>>> +struct drm_gpuva_gem *
>>> +drm_gpuva_gem_obtain(struct drm_gpuva_manager *mgr,
>>> +		     struct drm_gem_object *obj);
>>> +struct drm_gpuva_gem *
>>> +drm_gpuva_gem_obtain_prealloc(struct drm_gpuva_manager *mgr,
>>> +			      struct drm_gem_object *obj,
>>> +			      struct drm_gpuva_gem *__vm_bo);
>>> +
>>> +struct drm_gpuva_gem *
>>> +drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
>>> +		   struct drm_gem_object *obj);
>>> +
>>> +void drm_gpuva_gem_evict(struct drm_gem_object *obj, bool evict);
>>> +
>>> +struct drm_gpuva_gem *
>>> +drm_gpuva_gem_create(struct drm_gpuva_manager *mgr,
>>> +		     struct drm_gem_object *obj);
>>> +void drm_gpuva_gem_destroy(struct kref *kref);
>>> +
>>> +/**
>>> + * drm_gpuva_gem_get() - acquire a struct drm_gpuva_gem reference
>>> + * @vm_bo: the &drm_gpuva_gem to acquire the reference of
>>> + *
>>> + * This function acquires an additional reference to @vm_bo. It is illegal to
>>> + * call this without already holding a reference. No locks required.
>>> + */
>>> +static inline struct drm_gpuva_gem *
>>> +drm_gpuva_gem_get(struct drm_gpuva_gem *vm_bo)
>>> +{
>>> +	kref_get(&vm_bo->kref);
>>> +	return vm_bo;
>>> +}
>>> +
>>> +/**
>>> + * drm_gpuva_gem_put() - drop a struct drm_gpuva_gem reference
>>> + * @vm_bo: the &drm_gpuva_gem to release the reference of
>>> + *
>>> + * This releases a reference to @vm_bo.
>>> + */
>>> +static inline void
>>> +drm_gpuva_gem_put(struct drm_gpuva_gem *vm_bo)
>>> +{
>>> +	kref_put(&vm_bo->kref, drm_gpuva_gem_destroy);
>>> +}
>>> +
>>> +/**
>>> + * drm_gpuva_gem_for_each_va() - iterator to walk over a list of &drm_gpuva
>>> + * @va__: &drm_gpuva structure to assign to in each iteration step
>>> + * @vm_bo__: the &drm_gpuva_gem the &drm_gpuva to walk are associated with
>>> + *
>>> + * This iterator walks over all &drm_gpuva structures associated with the
>>> + * &drm_gpuva_gem.
>>> + */
>>> +#define drm_gpuva_gem_for_each_va(va__, vm_bo__) \
>>> +	list_for_each_entry(va__, &(vm_bo__)->list.gpuva, gem.entry)
>>> +
>>> +/**
>>> + * drm_gpuva_gem_for_each_va_safe() - iterator to safely walk over a list of
>>> + * &drm_gpuva
>>> + * @va__: &drm_gpuva structure to assign to in each iteration step
>>> + * @next__: &next &drm_gpuva to store the next step
>>> + * @vm_bo__: the &drm_gpuva_gem the &drm_gpuva to walk are associated with
>>> + *
>>> + * This iterator walks over all &drm_gpuva structures associated with the
>>> + * &drm_gpuva_gem. It is implemented with list_for_each_entry_safe(), hence
>>> + * it is safe against removal of elements.
>>> + */
>>> +#define drm_gpuva_gem_for_each_va_safe(va__, next__, vm_bo__) \
>>> +	list_for_each_entry_safe(va__, next__, &(vm_bo__)->list.gpuva, gem.entry)
>>> +
>>>    /**
>>>     * enum drm_gpuva_op_type - GPU VA operation type
>>>     *
>>> @@ -641,6 +899,30 @@ struct drm_gpuva_fn_ops {
>>>    	 */
>>>    	void (*op_free)(struct drm_gpuva_op *op);
>>> +	/**
>>> +	 * @vm_bo_alloc: called when the &drm_gpuva_manager allocates
>>> +	 * a struct drm_gpuva_gem
>>> +	 *
>>> +	 * Some drivers may want to embed struct drm_gpuva_gem into driver
>>> +	 * specific structures. By implementing this callback drivers can
>>> +	 * allocate memory accordingly.
>>> +	 *
>>> +	 * This callback is optional.
>>> +	 */
>>> +	struct drm_gpuva_gem *(*vm_bo_alloc)(void);
>>> +
>>> +	/**
>>> +	 * @vm_bo_free: called when the &drm_gpuva_manager frees a
>>> +	 * struct drm_gpuva_gem
>>> +	 *
>>> +	 * Some drivers may want to embed struct drm_gpuva_gem into driver
>>> +	 * specific structures. By implementing this callback drivers can
>>> +	 * free the previously allocated memory accordingly.
>>> +	 *
>>> +	 * This callback is optional.
>>> +	 */
>>> +	void (*vm_bo_free)(struct drm_gpuva_gem *vm_bo);
>>> +
>>>    	/**
>>>    	 * @sm_step_map: called from &drm_gpuva_sm_map to finally insert the
>>>    	 * mapping once all previous steps were completed
>>> @@ -684,6 +966,17 @@ struct drm_gpuva_fn_ops {
>>>    	 * used.
>>>    	 */
>>>    	int (*sm_step_unmap)(struct drm_gpuva_op *op, void *priv);
>>> +
>>> +	/**
>>> +	 * @bo_validate: called from drm_gpuva_manager_validate()
>>> +	 *
>>> +	 * Drivers receive this callback for every evicted &drm_gem_object being
>>> +	 * mapped in the corresponding &drm_gpuva_manager.
>>> +	 *
>>> +	 * Typically, drivers would call their driver specific variant of
>>> +	 * ttm_bo_validate() from within this callback.
>>> +	 */
>>> +	int (*bo_validate)(struct drm_gem_object *obj);
>>>    };
>>>    int drm_gpuva_sm_map(struct drm_gpuva_manager *mgr, void *priv,
>>> @@ -696,11 +989,18 @@ int drm_gpuva_sm_unmap(struct drm_gpuva_manager *mgr, void *priv,
>>>    void drm_gpuva_map(struct drm_gpuva_manager *mgr,
>>>    		   struct drm_gpuva *va,
>>>    		   struct drm_gpuva_op_map *op);
>>> +void drm_gpuva_map_get(struct drm_gpuva_manager *mgr,
>>> +		       struct drm_gpuva *va,
>>> +		       struct drm_gpuva_op_map *op);
>>>    void drm_gpuva_remap(struct drm_gpuva *prev,
>>>    		     struct drm_gpuva *next,
>>>    		     struct drm_gpuva_op_remap *op);
>>> +void drm_gpuva_remap_get(struct drm_gpuva *prev,
>>> +			 struct drm_gpuva *next,
>>> +			 struct drm_gpuva_op_remap *op);
>>>    void drm_gpuva_unmap(struct drm_gpuva_op_unmap *op);
>>> +void drm_gpuva_unmap_put(struct drm_gpuva_op_unmap *op);
>>>    #endif /* __DRM_GPUVA_MGR_H__ */
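
Taken together, a driver-side submission path wiring these helpers up could
look roughly like the sketch below (error handling trimmed; the job/fence
handling and the chosen dma_resv_usage values are assumptions for
illustration, not part of the patch):

static int drv_submit(struct drm_gpuva_manager *mgr,
		      struct drm_gem_object *obj,
		      struct dma_fence *fence)
{
	int ret;

	/* track the extobj before the dma-fence signalling critical path */
	ret = drm_gpuva_extobj_insert(mgr, obj);
	if (ret)
		return ret;

	ret = drm_gpuva_manager_lock(mgr, 1, true);
	if (ret)
		return ret;

	ret = drm_gpuva_manager_validate(mgr);
	if (ret)
		goto out_unlock;

	/* ... create mappings, push the job producing @fence ... */

	drm_gpuva_manager_resv_add_fence(mgr, fence,
					 DMA_RESV_USAGE_BOOKKEEP,
					 DMA_RESV_USAGE_BOOKKEEP);
out_unlock:
	drm_gpuva_manager_unlock(mgr);
	return ret;
}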

^ permalink raw reply	[flat|nested] 88+ messages in thread

* Re: [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
@ 2023-08-30 13:42         ` Thomas Hellström (Intel)
  0 siblings, 0 replies; 88+ messages in thread
From: Thomas Hellström (Intel) @ 2023-08-30 13:42 UTC (permalink / raw)
  To: Danilo Krummrich
  Cc: matthew.brost, thomas.hellstrom, sarah.walker, nouveau,
	dri-devel, linux-kernel, Liam.Howlett, boris.brezillon,
	donald.robson, christian.koenig, faith.ekstrand, bskeggs


On 8/30/23 14:49, Danilo Krummrich wrote:
> Hi Thomas,
>
> thanks for having a look!
>
> On Wed, Aug 30, 2023 at 09:27:45AM +0200, Thomas Hellström (Intel) wrote:
>> Hi, Danilo.
>>
>> Some quick comments since I'm doing some Xe work in this area. Will probably
>> get back with more.
>>
>> On 8/20/23 23:53, Danilo Krummrich wrote:
>>> So far the DRM GPUVA manager offers common infrastructure to track GPU VA
>>> allocations and mappings, generically connect GPU VA mappings to their
>>> backing buffers and perform more complex mapping operations on the GPU VA
>>> space.
>>>
>>> However, there are more design patterns commonly used by drivers, which
>>> can potentially be generalized in order to make the DRM GPUVA manager
>>> represent a basic GPU-VM implementation. In this context, this patch aims
>>> at generalizing the following elements.
>>>
>>> 1) Provide a common dma-resv for GEM objects not being used outside of
>>>      this GPU-VM.
>>>
>>> 2) Provide tracking of external GEM objects (GEM objects which are
>>>      shared with other GPU-VMs).
>>>
>>> 3) Provide functions to efficiently lock all GEM objects dma-resv the
>>>      GPU-VM contains mappings of.
>>>
>>> 4) Provide tracking of evicted GEM objects the GPU-VM contains mappings
>>>      of, such that validation of evicted GEM objects is accelerated.
>>>
>>> 5) Provide some convenience functions for common patterns.
>>>
>>> Rather than being designed as a "framework", the target is to make all
>>> features appear as a collection of optional helper functions, such that
>>> drivers are free to make use of the DRM GPUVA managers basic
>>> functionality and opt-in for other features without setting any feature
>>> flags, just by making use of the corresponding functions.
>>>
>>> Signed-off-by: Danilo Krummrich <dakr@redhat.com>
>>> ---
>>>    drivers/gpu/drm/drm_gpuva_mgr.c | 688 +++++++++++++++++++++++++++++++-
>>>    include/drm/drm_gem.h           |  48 ++-
>>>    include/drm/drm_gpuva_mgr.h     | 302 +++++++++++++-
>>>    3 files changed, 1010 insertions(+), 28 deletions(-)
>>>
>>> diff --git a/drivers/gpu/drm/drm_gpuva_mgr.c b/drivers/gpu/drm/drm_gpuva_mgr.c
>>> index f86bfad74ff8..69872b205961 100644
>>> --- a/drivers/gpu/drm/drm_gpuva_mgr.c
>>> +++ b/drivers/gpu/drm/drm_gpuva_mgr.c
>>> @@ -655,6 +655,7 @@ drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
>>>    /**
>>>     * drm_gpuva_manager_init() - initialize a &drm_gpuva_manager
>>>     * @mgr: pointer to the &drm_gpuva_manager to initialize
>>> + * @drm: the drivers &drm_device
>>>     * @name: the name of the GPU VA space
>>>     * @start_offset: the start offset of the GPU VA space
>>>     * @range: the size of the GPU VA space
>>> @@ -669,6 +670,7 @@ drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
>>>     */
>>>    void
>>>    drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
>>> +		       struct drm_device *drm,
>>>    		       const char *name,
>>>    		       u64 start_offset, u64 range,
>>>    		       u64 reserve_offset, u64 reserve_range,
>>> @@ -677,6 +679,11 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
>>>    	mgr->rb.tree = RB_ROOT_CACHED;
>>>    	INIT_LIST_HEAD(&mgr->rb.list);
>>> +	mt_init(&mgr->mt_ext);
>>> +
>>> +	INIT_LIST_HEAD(&mgr->evict.list);
>>> +	spin_lock_init(&mgr->evict.lock);
>>> +
>>>    	drm_gpuva_check_overflow(start_offset, range);
>>>    	mgr->mm_start = start_offset;
>>>    	mgr->mm_range = range;
>>> @@ -694,6 +701,9 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
>>>    						     reserve_range)))
>>>    			__drm_gpuva_insert(mgr, &mgr->kernel_alloc_node);
>>>    	}
>>> +
>>> +	drm_gem_private_object_init(drm, &mgr->d_obj, 0);
>>> +	mgr->resv = mgr->d_obj.resv;
>>>    }
>>>    EXPORT_SYMBOL_GPL(drm_gpuva_manager_init);
>>> @@ -713,10 +723,575 @@ drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr)
>>>    		__drm_gpuva_remove(&mgr->kernel_alloc_node);
>>>    	WARN(!RB_EMPTY_ROOT(&mgr->rb.tree.rb_root),
>>> -	     "GPUVA tree is not empty, potentially leaking memory.");
>>> +	     "GPUVA tree is not empty, potentially leaking memory.\n");
>>> +
>>> +	mtree_destroy(&mgr->mt_ext);
>>> +	WARN(!list_empty(&mgr->evict.list), "Evict list should be empty.\n");
>>> +
>>> +	drm_gem_private_object_fini(&mgr->d_obj);
>>>    }
>>>    EXPORT_SYMBOL_GPL(drm_gpuva_manager_destroy);
>>> +/**
>>> + * drm_gpuva_manager_prepare_objects() - prepare all associated BOs
>>> + * @mgr: the &drm_gpuva_manager
>>> + * @num_fences: the amount of &dma_fences to reserve
>>> + *
>>> + * Calls drm_exec_prepare_obj() for all &drm_gem_objects the given
>>> + * &drm_gpuva_manager contains mappings of.
>>> + *
>>> + * Drivers can obtain the corresponding &drm_exec instance through
>>> + * DRM_GPUVA_EXEC(). It is the driver's responsibility to call drm_exec_init()
>>> + * and drm_exec_fini() accordingly.
>>> + *
>>> + * Returns: 0 on success, negative error code on failure.
>>> + */
>>> +int
>>> +drm_gpuva_manager_prepare_objects(struct drm_gpuva_manager *mgr,
>>> +				  unsigned int num_fences)
>>> +{
>>> +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
>>> +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
>>> +	union {
>>> +		void *ptr;
>>> +		uintptr_t cnt;
>>> +	} ref;
>>> +	int ret;
>>> +
>>> +	ret = drm_exec_prepare_obj(exec, &mgr->d_obj, num_fences);
>>> +	if (ret)
>>> +		goto out;
>>> +
>>> +	rcu_read_lock();
>> In xe we're protecting the external object list with an outer lock (same as
>> protecting the mgr itself). Do we need a separate lock for this? In theory,
>> as outlined in the VM_BIND locking document draft, one could probably even
>> use the mgr resv for this, but with more complicated code I guess. Also see
>> the comment below about the data structure chosen.
> The idea is to protect this list with the GPU-VM lock. The locking here is more
> of a consequence of using the maple tree. Either you use the internal lock of the
> maple tree or RCU, or you give the maple tree an external lock to
> perform lockdep checks on (mt_set_external_lock()). Basically same as here:
>
> https://elixir.bootlin.com/linux/latest/source/drivers/base/regmap/regcache-maple.c#L124

Ah, I suspected it was something along those lines.


>
>>> +	mas_for_each(&mas, ref.ptr, ULONG_MAX) {
>>> +		struct drm_gem_object *obj;
>>> +
>>> +		mas_pause(&mas);
>>> +		rcu_read_unlock();
>>> +
>>> +		obj = (struct drm_gem_object *)(uintptr_t)mas.index;
>>> +		ret = drm_exec_prepare_obj(exec, obj, num_fences);
>>> +		if (ret)
>>> +			goto out;
>>> +
>>> +		rcu_read_lock();
>>> +	}
>>> +	rcu_read_unlock();
>>> +
>>> +out:
>>> +	return ret;
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_prepare_objects);
>>> +
>>> +/**
>>> + * drm_gpuva_manager_lock_extra() - lock all dma-resv of all associated BOs
>>> + * @mgr: the &drm_gpuva_manager
>>> + * @fn: callback received by the driver to lock additional dma-resv
>>> + * @priv: private driver data passed to @fn
>>> + * @num_fences: the amount of &dma_fences to reserve
>>> + * @interruptible: sleep interruptible if waiting
>>> + *
>>> + * Acquires all dma-resv locks of all &drm_gem_objects the given
>>> + * &drm_gpuva_manager contains mappings of.
>>> + *
>>> + * Additionally, when calling this function the driver receives the given @fn
>>> + * callback to lock additional dma-resv in the context of the
>>> + * &drm_gpuva_managers &drm_exec instance. Typically, drivers would call
>>> + * drm_exec_prepare_obj() from within this callback.
>>> + *
>>> + * Returns: 0 on success, negative error code on failure.
>>> + */
>>> +int
>>> +drm_gpuva_manager_lock_extra(struct drm_gpuva_manager *mgr,
>>> +			     int (*fn)(struct drm_gpuva_manager *mgr,
>>> +				       void *priv, unsigned int num_fences),
>>> +			     void *priv,
>>> +			     unsigned int num_fences,
>>> +			     bool interruptible)
>>> +{
>>> +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
>>> +	uint32_t flags;
>>> +	int ret;
>>> +
>>> +	flags = interruptible ? DRM_EXEC_INTERRUPTIBLE_WAIT : 0 |
>>> +		DRM_EXEC_IGNORE_DUPLICATES;
>>> +
>>> +	drm_exec_init(exec, flags);
>>> +
>>> +	drm_exec_until_all_locked(exec) {
>>> +		ret = drm_gpuva_manager_prepare_objects(mgr, num_fences);
>>> +		drm_exec_retry_on_contention(exec);
>>> +		if (ret)
>>> +			goto err;
>>> +
>>> +		if (fn) {
>>> +			ret = fn(mgr, priv, num_fences);
>>> +			drm_exec_retry_on_contention(exec);
>>> +			if (ret)
>>> +				goto err;
>>> +		}
>>> +	}
>>> +
>>> +	return 0;
>>> +
>>> +err:
>>> +	drm_exec_fini(exec);
>>> +	return ret;
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_lock_extra);
>>> +
>>> +static int
>>> +fn_lock_array(struct drm_gpuva_manager *mgr, void *priv,
>>> +				unsigned int num_fences)
>>> +{
>>> +	struct {
>>> +		struct drm_gem_object **objs;
>>> +		unsigned int num_objs;
>>> +	} *args = priv;
>>> +
>>> +	return drm_exec_prepare_array(DRM_GPUVA_EXEC(mgr), args->objs,
>>> +				      args->num_objs, num_fences);
>>> +}
>>> +
>>> +/**
>>> + * drm_gpuva_manager_lock_array() - lock all dma-resv of all associated BOs
>>> + * @mgr: the &drm_gpuva_manager
>>> + * @objs: additional &drm_gem_objects to lock
>>> + * @num_objs: the number of additional &drm_gem_objects to lock
>>> + * @num_fences: the amount of &dma_fences to reserve
>>> + * @interruptible: sleep interruptible if waiting
>>> + *
>>> + * Acquires all dma-resv locks of all &drm_gem_objects the given
>>> + * &drm_gpuva_manager contains mappings of, plus the ones given through @objs.
>>> + *
>>> + * Returns: 0 on success, negative error code on failure.
>>> + */
>>> +int
>>> +drm_gpuva_manager_lock_array(struct drm_gpuva_manager *mgr,
>>> +			     struct drm_gem_object **objs,
>>> +			     unsigned int num_objs,
>>> +			     unsigned int num_fences,
>>> +			     bool interruptible)
>>> +{
>>> +	struct {
>>> +		struct drm_gem_object **objs;
>>> +		unsigned int num_objs;
>>> +	} args;
>>> +
>>> +	args.objs = objs;
>>> +	args.num_objs = num_objs;
>>> +
>>> +	return drm_gpuva_manager_lock_extra(mgr, fn_lock_array, &args,
>>> +					    num_fences, interruptible);
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_lock_array);
>>> +
>>> +/**
>>> + * drm_gpuva_manager_validate() - validate all BOs marked as evicted
>>> + * @mgr: the &drm_gpuva_manager to validate evicted BOs
>>> + *
>>> + * Calls the &drm_gpuva_fn_ops.bo_validate callback for all evicted buffer
>>> + * objects being mapped in the given &drm_gpuva_manager.
>>> + *
>>> + * Returns: 0 on success, negative error code on failure.
>>> + */
>>> +int
>>> +drm_gpuva_manager_validate(struct drm_gpuva_manager *mgr)
>>> +{
>>> +	const struct drm_gpuva_fn_ops *ops = mgr->ops;
>>> +	struct drm_gpuva_gem *vm_bo;
>>> +	int ret;
>>> +
>>> +	if (unlikely(!ops || !ops->bo_validate))
>>> +		return -ENOTSUPP;
>>> +
>>> +	/* At this point we should hold all dma-resv locks of all GEM objects
>>> +	 * associated with this GPU-VM, hence it is safe to walk the list.
>>> +	 */
>>> +	list_for_each_entry(vm_bo, &mgr->evict.list, list.entry.evict) {
>>> +		dma_resv_assert_held(vm_bo->obj->resv);
>>> +
>>> +		ret = ops->bo_validate(vm_bo->obj);
>>> +		if (ret)
>>> +			return ret;
>>> +	}
>>> +
>>> +	return 0;
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_validate);
>>> +
>>> +/**
>>> + * drm_gpuva_manager_resv_add_fence - add fence to private and all extobj
>>> + * dma-resv
>>> + * @mgr: the &drm_gpuva_manager to add a fence to
>>> + * @fence: fence to add
>>> + * @private_usage: private dma-resv usage
>>> + * @extobj_usage: extobj dma-resv usage
>>> + */
>>> +void
>>> +drm_gpuva_manager_resv_add_fence(struct drm_gpuva_manager *mgr,
>>> +				 struct dma_fence *fence,
>>> +				 enum dma_resv_usage private_usage,
>>> +				 enum dma_resv_usage extobj_usage)
>>> +{
>>> +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
>>> +	struct drm_gem_object *obj;
>>> +	unsigned long index;
>>> +
>>> +	drm_exec_for_each_locked_object(exec, index, obj) {
>>> +			dma_resv_assert_held(obj->resv);
>>> +			dma_resv_add_fence(obj->resv, fence,
>>> +					   drm_gpuva_is_extobj(mgr, obj) ?
>>> +					   private_usage : extobj_usage);
>>> +	}
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_resv_add_fence);
>>> +
>>> +static struct drm_gpuva_gem *
>>> +__drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
>>> +		     struct drm_gem_object *obj)
>>> +{
>>> +	struct drm_gpuva_gem *vm_bo;
>>> +
>>> +	drm_gem_gpuva_assert_lock_held(obj);
>>> +
>>> +	drm_gem_for_each_gpuva_gem(vm_bo, obj)
>>> +		if (vm_bo->mgr == mgr)
>>> +			return vm_bo;
>>> +
>>> +	return NULL;
>>> +}
>>> +
>>> +/**
>>> + * drm_gpuva_gem_create() - create a new instance of struct drm_gpuva_gem
>>> + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
>>> + * @obj: The &drm_gem_object being mapped in the @mgr.
>>> + *
>>> + * If provided by the driver, this function uses the &drm_gpuva_fn_ops
>>> + * vm_bo_alloc() callback to allocate.
>>> + *
>>> + * Returns: a pointer to the &drm_gpuva_gem on success, NULL on failure
>>> + */
>>> +struct drm_gpuva_gem *
>>> +drm_gpuva_gem_create(struct drm_gpuva_manager *mgr,
>>> +		     struct drm_gem_object *obj)
>>> +{
>>> +	const struct drm_gpuva_fn_ops *ops = mgr->ops;
>>> +	struct drm_gpuva_gem *vm_bo;
>>> +
>>> +	if (ops && ops->vm_bo_alloc)
>>> +		vm_bo = ops->vm_bo_alloc();
>>> +	else
>>> +		vm_bo = kzalloc(sizeof(*vm_bo), GFP_KERNEL);
>>> +
>>> +	if (unlikely(!vm_bo))
>>> +		return NULL;
>>> +
>>> +	vm_bo->mgr = mgr;
>>> +	vm_bo->obj = obj;
>>> +
>>> +	kref_init(&vm_bo->kref);
>>> +	INIT_LIST_HEAD(&vm_bo->list.gpuva);
>>> +	INIT_LIST_HEAD(&vm_bo->list.entry.gem);
>>> +	INIT_LIST_HEAD(&vm_bo->list.entry.evict);
>>> +
>>> +	drm_gem_object_get(obj);
>>> +
>>> +	return vm_bo;
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_create);
>>> +
>>> +void
>>> +drm_gpuva_gem_destroy(struct kref *kref)
>>> +{
>>> +	struct drm_gpuva_gem *vm_bo = container_of(kref, struct drm_gpuva_gem,
>>> +						   kref);
>>> +	const struct drm_gpuva_fn_ops *ops = vm_bo->mgr->ops;
>>> +
>>> +	drm_gem_object_put(vm_bo->obj);
>>> +
>>> +	if (ops && ops->vm_bo_free)
>>> +		ops->vm_bo_free(vm_bo);
>>> +	else
>>> +		kfree(vm_bo);
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_destroy);
>>> +
>>> +/**
>>> + * drm_gpuva_gem_find() - find the &drm_gpuva_gem for the given
>>> + * &drm_gpuva_manager and &drm_gem_object
>>> + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
>>> + * @obj: The &drm_gem_object being mapped in the @mgr.
>>> + *
>>> + * Find the &drm_gpuva_gem representing the combination of the given
>>> + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
>>> + * count of the &drm_gpuva_gem accordingly.
>>> + *
>>> + * Returns: a pointer to the &drm_gpuva_gem on success, NULL on failure
>>> + */
>>> +struct drm_gpuva_gem *
>>> +drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
>>> +		   struct drm_gem_object *obj)
>>> +{
>>> +	struct drm_gpuva_gem *vm_bo = __drm_gpuva_gem_find(mgr, obj);
>>> +
>>> +	return vm_bo ? drm_gpuva_gem_get(vm_bo) : NULL;
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_find);
>>> +
>>> +/**
>>> + * drm_gpuva_gem_obtain() - obtains an instance of the &drm_gpuva_gem for the
>>> + * given &drm_gpuva_manager and &drm_gem_object
>>> + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
>>> + * @obj: The &drm_gem_object being mapped in the @mgr.
>>> + *
>>> + * Find the &drm_gpuva_gem representing the combination of the given
>>> + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
>>> + * count of the &drm_gpuva_gem accordingly. If not found, allocates a new
>>> + * &drm_gpuva_gem.
>>> + *
>>> + * Returns: a pointer to the &drm_gpuva_gem on success, an ERR_PTR on failure
>>> + */
>>> +struct drm_gpuva_gem *
>>> +drm_gpuva_gem_obtain(struct drm_gpuva_manager *mgr,
>>> +		     struct drm_gem_object *obj)
>>> +{
>>> +	struct drm_gpuva_gem *vm_bo;
>>> +
>>> +	vm_bo = drm_gpuva_gem_find(mgr, obj);
>>> +	if (vm_bo)
>>> +		return vm_bo;
>>> +
>>> +	vm_bo = drm_gpuva_gem_create(mgr, obj);
>>> +	if (!vm_bo)
>>> +		return ERR_PTR(-ENOMEM);
>>> +
>>> +	return vm_bo;
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_obtain);
>>> +
>>> +/**
>>> + * drm_gpuva_gem_obtain_prealloc() - obtains an instance of the &drm_gpuva_gem
>>> + * for the given &drm_gpuva_manager and &drm_gem_object
>>> + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
>>> + * @obj: The &drm_gem_object being mapped in the @mgr.
>>> + *
>>> + * Find the &drm_gpuva_gem representing the combination of the given
>>> + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
>>> + * count of the found &drm_gpuva_gem accordingly, while the @__vm_bo reference
>>> + * count is decreased. If not found @__vm_bo is returned.
>>> + *
>>> + * Returns: a pointer to the found &drm_gpuva_gem or @__vm_bo if no existing
>>> + * &drm_gpuva_gem was found
>>> + */
>>> +struct drm_gpuva_gem *
>>> +drm_gpuva_gem_obtain_prealloc(struct drm_gpuva_manager *mgr,
>>> +			      struct drm_gem_object *obj,
>>> +			      struct drm_gpuva_gem *__vm_bo)
>>> +{
>>> +	struct drm_gpuva_gem *vm_bo;
>>> +
>>> +	vm_bo = drm_gpuva_gem_find(mgr, obj);
>>> +	if (vm_bo) {
>>> +		drm_gpuva_gem_put(__vm_bo);
>>> +		return vm_bo;
>>> +	}
>>> +
>>> +	return __vm_bo;
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_obtain_prealloc);
>>> +
>>> +static int
>>> +__drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
>>> +			  struct drm_gem_object *obj,
>>> +			  gfp_t gfp)
>>> +{
>>> +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
>>> +	union {
>>> +		struct drm_gem_object *obj;
>>> +		uintptr_t index;
>>> +	} gem;
>>> +	union {
>>> +		void *ptr;
>>> +		uintptr_t cnt;
>>> +	} ref;
>>> +	int ret = 0;
>>> +
>>> +	gem.obj = obj;
>>> +	mas_set(&mas, gem.index);
>>> +
>>> +	mas_lock(&mas);
>>> +	ref.ptr = mas_walk(&mas);
>>> +	if (ref.ptr) {
>>> +		++ref.cnt;
>>> +		mas_store(&mas, ref.ptr);
>>> +	} else {
>>> +		if (unlikely(!gfp)) {
>>> +			ret = -EINVAL;
>>> +			goto out;
>>> +		}
>>> +
>>> +		mas_set(&mas, gem.index);
>>> +		ref.cnt = 1;
>>> +		ret = mas_store_gfp(&mas, ref.ptr, gfp);
>>> +		if (likely(!ret))
>>> +			drm_gem_object_get(obj);
>>> +	}
>>> +out:
>>> +	mas_unlock(&mas);
>>> +	return ret;
>>> +}
>>> +
>>> +static void
>>> +__drm_gpuva_extobj_remove(struct drm_gpuva_manager *mgr,
>>> +			  struct drm_gem_object *obj)
>>> +{
>>> +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
>>> +	union {
>>> +		struct drm_gem_object *obj;
>>> +		uintptr_t index;
>>> +	} gem;
>>> +	union {
>>> +		void *ptr;
>>> +		uintptr_t cnt;
>>> +	} ref;
>>> +
>>> +	gem.obj = obj;
>>> +	mas_set(&mas, gem.index);
>>> +
>>> +	mas_lock(&mas);
>>> +	if (unlikely(!(ref.ptr = mas_walk(&mas))))
>>> +		goto out;
>>> +
>>> +	if (!--ref.cnt) {
>>> +		mas_erase(&mas);
>>> +		drm_gem_object_put(obj);
>>> +	} else {
>>> +		mas_store(&mas, ref.ptr);
>>> +	}
>>> +out:
>>> +	mas_unlock(&mas);
>>> +}
>>> +
>>> +/**
>>> + * drm_gpuva_extobj_insert - insert an external &drm_gem_object
>>> + * @mgr: the &drm_gpuva_manager to insert into
>>> + * @obj: the &drm_gem_object to insert as extobj
>>> + *
>>> + * Insert a &drm_gem_object into the &drm_gpuva_managers external object tree.
>>> + * If the &drm_gem_object already exists in the tree, the reference counter
>>> + * of this external object is increased by one.
>>> + *
>>> + * Drivers should insert the external &drm_gem_object before the dma-fence
>>> + * signalling critical section, e.g. when submitting the job, and before
>>> + * locking all &drm_gem_objects of a GPU-VM, e.g. with drm_gpuva_manager_lock()
>>> + * or its derivatives.
>>> + *
>>> + * Returns: 0 on success, negative error code on failure.
>>> + */
>>> +int
>>> +drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
>>> +			struct drm_gem_object *obj)
>>> +{
>>> +	return drm_gpuva_is_extobj(mgr, obj) ?
>>> +		__drm_gpuva_extobj_insert(mgr, obj, GFP_KERNEL) : 0;
>>> +
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_insert);
>>> +
>>> +/**
>>> + * drm_gpuva_extobj_get - increase the reference count of an external
>>> + * &drm_gem_object
>>> + * @mgr: the &drm_gpuva_manager storing the extobj
>>> + * @obj: the &drm_gem_object representing the extobj
>>> + *
>>> + * Increases the reference count of the extobj represented by @obj.
>>> + *
>>> + * Drivers should call this for every &drm_gpuva backed by a &drm_gem_object
>>> + * being inserted.
>>> + *
>>> + * For &drm_gpuva_op_remap operations drivers should make sure to only take an
>>> + * additional reference if the re-map operation splits an existing &drm_gpuva
>>> + * into two separate ones.
>>> + *
>>> + * See also drm_gpuva_map_get() and drm_gpuva_remap_get().
>>> + */
>>> +void
>>> +drm_gpuva_extobj_get(struct drm_gpuva_manager *mgr,
>>> +		     struct drm_gem_object *obj)
>>> +{
>>> +	if (drm_gpuva_is_extobj(mgr, obj))
>>> +		WARN(__drm_gpuva_extobj_insert(mgr, obj, 0),
>>> +		     "Can't increase ref-count of non-existent extobj.");
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_get);
>>> +
>>> +/**
>>> + * drm_gpuva_extobj_put - decrease the reference count of an external
>>> + * &drm_gem_object
>>> + * @mgr: the &drm_gpuva_manager storing the extobj
>>> + * @obj: the &drm_gem_object representing the extobj
>>> + *
>>> + * Decreases the reference count of the extobj represented by @obj.
>>> + *
>>> + * Drivers should call this for every &drm_gpuva backed by a &drm_gem_object
>>> + * being removed from the GPU VA space.
>>> + *
>>> + * See also drm_gpuva_unmap_put().
>>> + */
>>> +void
>>> +drm_gpuva_extobj_put(struct drm_gpuva_manager *mgr,
>>> +		     struct drm_gem_object *obj)
>>> +{
>>> +	if (drm_gpuva_is_extobj(mgr, obj))
>>> +		__drm_gpuva_extobj_remove(mgr, obj);
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_put);
>>> +
>>> +/**
>>> + * drm_gpuva_gem_evict() - add / remove a &drm_gem_object to / from a
>>> + * &drm_gpuva_managers evicted list
>>> + * @obj: the &drm_gem_object to add or remove
>>> + * @evict: indicates whether the object is evicted
>>> + *
>>> + * Adds a &drm_gem_object to or removes it from all &drm_gpuva_managers evicted
>>> + * list containing a mapping of this &drm_gem_object.
>>> + */
>>> +void
>>> +drm_gpuva_gem_evict(struct drm_gem_object *obj, bool evict)
>>> +{
>>> +	struct drm_gpuva_gem *vm_bo;
>>> +
>>> +	/* Required for iterating the GEMs GPUVA GEM list. If no driver specific
>>> +	 * lock has been set, the list is protected with the GEMs dma-resv lock.
>>> +	 */
>>> +	drm_gem_gpuva_assert_lock_held(obj);
>>> +
>>> +	/* Required to protect the GPUVA managers evict list against concurrent
>>> +	 * access through drm_gpuva_manager_validate(). Concurrent insertions to
>>> +	 * the evict list through different GEM object evictions are protected
>>> +	 * by the GPUVA managers evict lock.
>>> +	 */
>>> +	dma_resv_assert_held(obj->resv);
>>> +
>>> +	drm_gem_for_each_gpuva_gem(vm_bo, obj) {
>>> +		struct drm_gpuva_manager *mgr = vm_bo->mgr;
>>> +
>>> +		spin_lock(&mgr->evict.lock);
>>> +		if (evict)
>>> +			list_add_tail(&vm_bo->list.entry.evict,
>>> +				      &mgr->evict.list);
>>> +		else
>>> +			list_del_init(&vm_bo->list.entry.evict);
>>> +		spin_unlock(&mgr->evict.lock);
>>> +	}
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_evict);
>>> +
>>>    static int
>>>    __drm_gpuva_insert(struct drm_gpuva_manager *mgr,
>>>    		   struct drm_gpuva *va)
>>> @@ -806,15 +1381,20 @@ EXPORT_SYMBOL_GPL(drm_gpuva_remove);
>>>    /**
>>>     * drm_gpuva_link() - link a &drm_gpuva
>>>     * @va: the &drm_gpuva to link
>>> + * @vm_bo: the &drm_gpuva_gem to add the &drm_gpuva to
>>>     *
>>> - * This adds the given &va to the GPU VA list of the &drm_gem_object it is
>>> - * associated with.
>>> + * This adds the given &va to the GPU VA list of the &drm_gpuva_gem and the
>>> + * &drm_gpuva_gem to the &drm_gem_object it is associated with.
>>> + *
>>> + * For every &drm_gpuva entry added to the &drm_gpuva_gem an additional
>>> + * reference of the latter is taken.
>>>     *
>>>     * This function expects the caller to protect the GEM's GPUVA list against
>>> - * concurrent access using the GEMs dma_resv lock.
>>> + * concurrent access using either the GEMs dma_resv lock or a driver specific
>>> + * lock set through drm_gem_gpuva_set_lock().
>>>     */
>>>    void
>>> -drm_gpuva_link(struct drm_gpuva *va)
>>> +drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo)
>>>    {
>>>    	struct drm_gem_object *obj = va->gem.obj;
>>> @@ -823,7 +1403,10 @@ drm_gpuva_link(struct drm_gpuva *va)
>>>    	drm_gem_gpuva_assert_lock_held(obj);
>>> -	list_add_tail(&va->gem.entry, &obj->gpuva.list);
>>> +	drm_gpuva_gem_get(vm_bo);
>>> +	list_add_tail(&va->gem.entry, &vm_bo->list.gpuva);
>>> +	if (list_empty(&vm_bo->list.entry.gem))
>>> +		list_add_tail(&vm_bo->list.entry.gem, &obj->gpuva.list);
>>>    }
>>>    EXPORT_SYMBOL_GPL(drm_gpuva_link);
>>> @@ -834,20 +1417,39 @@ EXPORT_SYMBOL_GPL(drm_gpuva_link);
>>>     * This removes the given &va from the GPU VA list of the &drm_gem_object it is
>>>     * associated with.
>>>     *
>>> + * This removes the given &va from the GPU VA list of the &drm_gpuva_gem and
>>> + * the &drm_gpuva_gem from the &drm_gem_object it is associated with in case
>>> + * this call unlinks the last &drm_gpuva from the &drm_gpuva_gem.
>>> + *
>>> + * For every &drm_gpuva entry removed from the &drm_gpuva_gem a reference of
>>> + * the latter is dropped.
>>> + *
>>>     * This function expects the caller to protect the GEM's GPUVA list against
>>> - * concurrent access using the GEMs dma_resv lock.
>>> + * concurrent access using either the GEMs dma_resv lock or a driver specific
>>> + * lock set through drm_gem_gpuva_set_lock().
>>>     */
>>>    void
>>>    drm_gpuva_unlink(struct drm_gpuva *va)
>>>    {
>>>    	struct drm_gem_object *obj = va->gem.obj;
>>> +	struct drm_gpuva_gem *vm_bo;
>>>    	if (unlikely(!obj))
>>>    		return;
>>>    	drm_gem_gpuva_assert_lock_held(obj);
>>> +	vm_bo = __drm_gpuva_gem_find(va->mgr, obj);
>>> +	if (WARN(!vm_bo, "GPUVA doesn't seem to be linked.\n"))
>>> +		return;
>>> +
>>>    	list_del_init(&va->gem.entry);
>>> +
>>> +	if (list_empty(&vm_bo->list.gpuva)) {
>>> +		list_del_init(&vm_bo->list.entry.gem);
>>> +		list_del_init(&vm_bo->list.entry.evict);
>>> +	}
>>> +	drm_gpuva_gem_put(vm_bo);
>>>    }
>>>    EXPORT_SYMBOL_GPL(drm_gpuva_unlink);
>>> @@ -977,6 +1579,26 @@ drm_gpuva_map(struct drm_gpuva_manager *mgr,
>>>    }
>>>    EXPORT_SYMBOL_GPL(drm_gpuva_map);
>>> +/**
>>> + * drm_gpuva_map_get() - helper to insert a &drm_gpuva according to a
>>> + * &drm_gpuva_op_map
>>> + * @mgr: the &drm_gpuva_manager
>>> + * @va: the &drm_gpuva to insert
>>> + * @op: the &drm_gpuva_op_map to initialize @va with
>>> + *
>>> + * Initializes the @va from the @op and inserts it into the given @mgr and
>>> + * increases the reference count of the corresponding extobj.
>>> + */
>>> +void
>>> +drm_gpuva_map_get(struct drm_gpuva_manager *mgr,
>>> +		  struct drm_gpuva *va,
>>> +		  struct drm_gpuva_op_map *op)
>>> +{
>>> +	drm_gpuva_map(mgr, va, op);
>>> +	drm_gpuva_extobj_get(mgr, va->gem.obj);
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_map_get);
>>> +
>>>    /**
>>>     * drm_gpuva_remap() - helper to remap a &drm_gpuva according to a
>>>     * &drm_gpuva_op_remap
>>> @@ -992,10 +1614,10 @@ drm_gpuva_remap(struct drm_gpuva *prev,
>>>    		struct drm_gpuva *next,
>>>    		struct drm_gpuva_op_remap *op)
>>>    {
>>> -	struct drm_gpuva *curr = op->unmap->va;
>>> -	struct drm_gpuva_manager *mgr = curr->mgr;
>>> +	struct drm_gpuva *va = op->unmap->va;
>>> +	struct drm_gpuva_manager *mgr = va->mgr;
>>> -	drm_gpuva_remove(curr);
>>> +	drm_gpuva_remove(va);
>>>    	if (op->prev) {
>>>    		drm_gpuva_init_from_op(prev, op->prev);
>>> @@ -1009,6 +1631,31 @@ drm_gpuva_remap(struct drm_gpuva *prev,
>>>    }
>>>    EXPORT_SYMBOL_GPL(drm_gpuva_remap);
>>> +/**
>>> + * drm_gpuva_remap_get() - helper to remap a &drm_gpuva according to a
>>> + * &drm_gpuva_op_remap
>>> + * @prev: the &drm_gpuva to remap when keeping the start of a mapping
>>> + * @next: the &drm_gpuva to remap when keeping the end of a mapping
>>> + * @op: the &drm_gpuva_op_remap to initialize @prev and @next with
>>> + *
>>> + * Removes the currently mapped &drm_gpuva and remaps it using @prev and/or
>>> + * @next. Additionally, if the re-map splits the existing &drm_gpuva into two
>>> + * separate mappings, increases the reference count of the corresponding extobj.
>>> + */
>>> +void
>>> +drm_gpuva_remap_get(struct drm_gpuva *prev,
>>> +		    struct drm_gpuva *next,
>>> +		    struct drm_gpuva_op_remap *op)
>>> +{
>>> +	struct drm_gpuva *va = op->unmap->va;
>>> +	struct drm_gpuva_manager *mgr = va->mgr;
>>> +
>>> +	drm_gpuva_remap(prev, next, op);
>>> +	if (op->prev && op->next)
>>> +		drm_gpuva_extobj_get(mgr, va->gem.obj);
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_remap_get);
>>> +
>>>    /**
>>>     * drm_gpuva_unmap() - helper to remove a &drm_gpuva according to a
>>>     * &drm_gpuva_op_unmap
>>> @@ -1023,6 +1670,24 @@ drm_gpuva_unmap(struct drm_gpuva_op_unmap *op)
>>>    }
>>>    EXPORT_SYMBOL_GPL(drm_gpuva_unmap);
>>> +/**
>>> + * drm_gpuva_unmap_put() - helper to remove a &drm_gpuva according to a
>>> + * &drm_gpuva_op_unmap
>>> + * @op: the &drm_gpuva_op_unmap specifying the &drm_gpuva to remove
>>> + *
>>> + * Removes the &drm_gpuva associated with the &drm_gpuva_op_unmap and decreases
>>> + * the reference count of the corresponding extobj.
>>> + */
>>> +void
>>> +drm_gpuva_unmap_put(struct drm_gpuva_op_unmap *op)
>>> +{
>>> +	struct drm_gpuva *va = op->va;
>>> +
>>> +	drm_gpuva_unmap(op);
>>> +	drm_gpuva_extobj_put(va->mgr, va->gem.obj);
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_unmap_put);
>>> +
>>>    static int
>>>    op_map_cb(const struct drm_gpuva_fn_ops *fn, void *priv,
>>>    	  u64 addr, u64 range,
>>> @@ -1663,6 +2328,7 @@ drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
>>>    {
>>>    	struct drm_gpuva_ops *ops;
>>>    	struct drm_gpuva_op *op;
>>> +	struct drm_gpuva_gem *vm_bo;
>>>    	struct drm_gpuva *va;
>>>    	int ret;
>>> @@ -1674,7 +2340,7 @@ drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
>>>    	INIT_LIST_HEAD(&ops->list);
>>> -	drm_gem_for_each_gpuva(va, obj) {
>>> +	drm_gem_for_each_gpuva(va, vm_bo, mgr, obj) {
>>>    		op = gpuva_op_alloc(mgr);
>>>    		if (!op) {
>>>    			ret = -ENOMEM;
>>> diff --git a/include/drm/drm_gem.h b/include/drm/drm_gem.h
>>> index bc9f6aa2f3fe..783ed3ab440d 100644
>>> --- a/include/drm/drm_gem.h
>>> +++ b/include/drm/drm_gem.h
>>> @@ -571,7 +571,7 @@ int drm_gem_evict(struct drm_gem_object *obj);
>>>     * drm_gem_gpuva_init() - initialize the gpuva list of a GEM object
>>>     * @obj: the &drm_gem_object
>>>     *
>>> - * This initializes the &drm_gem_object's &drm_gpuva list.
>>> + * This initializes the &drm_gem_object's &drm_gpuva_gem list.
>>>     *
>>>     * Calling this function is only necessary for drivers intending to support the
>>>     * &drm_driver_feature DRIVER_GEM_GPUVA.
>>> @@ -584,28 +584,44 @@ static inline void drm_gem_gpuva_init(struct drm_gem_object *obj)
>>>    }
>>>    /**
>>> - * drm_gem_for_each_gpuva() - iternator to walk over a list of gpuvas
>>> - * @entry__: &drm_gpuva structure to assign to in each iteration step
>>> - * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
>>> + * drm_gem_for_each_gpuva_gem() - iterator to walk over a list of &drm_gpuva_gem
>>> + * @entry__: &drm_gpuva_gem structure to assign to in each iteration step
>>> + * @obj__: the &drm_gem_object the &drm_gpuva_gem to walk are associated with
>>>     *
>>> - * This iterator walks over all &drm_gpuva structures associated with the
>>> - * &drm_gpuva_manager.
>>> + * This iterator walks over all &drm_gpuva_gem structures associated with the
>>> + * &drm_gem_object.
>>>     */
>>> -#define drm_gem_for_each_gpuva(entry__, obj__) \
>>> -	list_for_each_entry(entry__, &(obj__)->gpuva.list, gem.entry)
>>> +#define drm_gem_for_each_gpuva_gem(entry__, obj__) \
>>> +	list_for_each_entry(entry__, &(obj__)->gpuva.list, list.entry.gem)
>>>    /**
>>> - * drm_gem_for_each_gpuva_safe() - iternator to safely walk over a list of
>>> - * gpuvas
>>> - * @entry__: &drm_gpuva structure to assign to in each iteration step
>>> - * @next__: &next &drm_gpuva to store the next step
>>> - * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
>>> + * drm_gem_for_each_gpuva_gem_safe() - iterator to safely walk over a list of
>>> + * &drm_gpuva_gem
>>> + * @entry__: &drm_gpuva_gem structure to assign to in each iteration step
>>> + * @next__: &next &drm_gpuva_gem to store the next step
>>> + * @obj__: the &drm_gem_object the &drm_gpuva_gem to walk are associated with
>>>     *
>>> - * This iterator walks over all &drm_gpuva structures associated with the
>>> + * This iterator walks over all &drm_gpuva_gem structures associated with the
>>>     * &drm_gem_object. It is implemented with list_for_each_entry_safe(), hence
>>>     * it is save against removal of elements.
>>>     */
>>> -#define drm_gem_for_each_gpuva_safe(entry__, next__, obj__) \
>>> -	list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list, gem.entry)
>>> +#define drm_gem_for_each_gpuva_gem_safe(entry__, next__, obj__) \
>>> +	list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list, list.entry.gem)
>>> +
>>> +/**
>>> + * drm_gem_for_each_gpuva() - iterator to walk over a list of &drm_gpuva
>>> + * @va__: &drm_gpuva structure to assign to in each iteration step
>>> + * @vm_bo__: the &drm_gpuva_gem representing the @mgr__ and @obj__ combination
>>> + * @mgr__: the &drm_gpuva_manager the &drm_gpuvas to walk are associated with
>>> + * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
>>> + *
>>> + * This iterator walks over all &drm_gpuva structures associated with the
>>> + * &drm_gpuva_manager and &drm_gem_object.
>>> + */
>>> +#define drm_gem_for_each_gpuva(va__, vm_bo__, mgr__, obj__) \
>>> +	for (vm_bo__ = drm_gpuva_gem_find(mgr__, obj__), \
>>> +	     va__ = vm_bo__ ? list_first_entry(&vm_bo__->list.gpuva, typeof(*va__), gem.entry) : NULL; \
>>> +	     va__ && !list_entry_is_head(va__, &vm_bo__->list.gpuva, gem.entry); \
>>> +	     va__ = list_next_entry(va__, gem.entry))
>>>    #endif /* __DRM_GEM_H__ */
>>> diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h
>>> index ed8d50200cc3..693e2da3f425 100644
>>> --- a/include/drm/drm_gpuva_mgr.h
>>> +++ b/include/drm/drm_gpuva_mgr.h
>>> @@ -26,12 +26,16 @@
>>>     */
>>>    #include <linux/list.h>
>>> +#include <linux/dma-resv.h>
>>> +#include <linux/maple_tree.h>
>>>    #include <linux/rbtree.h>
>>>    #include <linux/types.h>
>>>    #include <drm/drm_gem.h>
>>> +#include <drm/drm_exec.h>
>>>    struct drm_gpuva_manager;
>>> +struct drm_gpuva_gem;
>>>    struct drm_gpuva_fn_ops;
>>>    /**
>>> @@ -140,7 +144,7 @@ struct drm_gpuva {
>>>    int drm_gpuva_insert(struct drm_gpuva_manager *mgr, struct drm_gpuva *va);
>>>    void drm_gpuva_remove(struct drm_gpuva *va);
>>> -void drm_gpuva_link(struct drm_gpuva *va);
>>> +void drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo);
>>>    void drm_gpuva_unlink(struct drm_gpuva *va);
>>>    struct drm_gpuva *drm_gpuva_find(struct drm_gpuva_manager *mgr,
>>> @@ -240,15 +244,137 @@ struct drm_gpuva_manager {
>>>    	 * @ops: &drm_gpuva_fn_ops providing the split/merge steps to drivers
>>>    	 */
>>>    	const struct drm_gpuva_fn_ops *ops;
>>> +
>>> +	/**
>>> +	 * @d_obj: Dummy GEM object; used internally to pass the GPU VMs
>>> +	 * dma-resv to &drm_exec.
>>> +	 */
>>> +	struct drm_gem_object d_obj;
>>> +
>>> +	/**
>>> +	 * @resv: the &dma_resv for &drm_gem_objects mapped in this GPU VA
>>> +	 * space
>>> +	 */
>>> +	struct dma_resv *resv;
>>> +
>>> +	/**
>>> +	 * @exec: the &drm_exec helper to lock external &drm_gem_objects
>>> +	 */
>>> +	struct drm_exec exec;
>>> +
>>> +	/**
>>> +	 * @mt_ext: &maple_tree storing external &drm_gem_objects
>>> +	 */
>>> +	struct maple_tree mt_ext;
>> Why are you using a maple tree here? Insertion and removal is O(log(n))
>> instead of O(1) for a list?
>>
> Having a list of drm_gem_objects directly wouldn't work, as multiple GPU-VMs
> could have mappings of the same extobj.
>
> I considered using the VM_BO abstraction (struct drm_gpuva_gem) as list entry
> instead, which also seems to be the obvious choice. However, there is a locking
> conflict.
>
> A drm_gem_object keeps a list of drm_gpuva_gems, while each drm_gpuva_gem keeps
> a list of drm_gpuvas. Both lists are either protected with the dma-resv lock of
> the corresponding drm_gem_object, or with an external lock provided by the
> driver (see drm_gem_gpuva_set_lock()). The latter is used by drivers performing
> changes on the GPUVA space directly from the fence signalling path.
>
> Now, similar to what drm_gpuva_link() and drm_gpuva_unlink() are doing already,
> we'd want to add a drm_gpuva_gem to the extobj list for the first mapping being
> linked and we'd want to remove it for the last one being unlinked.
>
> (Actually we'd want to add the drm_gpuva_gem object to the extobj list even
> before, because otherwise we'd not acquire this GEM object's dma-resv lock
> through drm_gpuva_manager_lock(). But that's trivial, we could do that when we
> create the drm_gpuva_gem, which we need to do anyways.)
>
> Anyway, we'd probably want to keep removing the drm_gpuva_gem from the extobj
> list from drm_gpuva_unlink() when the last mapping of this BO is unlinked. In
> order to do so, we'd (as discussed above) either need to hold the outer GPU-VM
> lock or the GPU-VMs dma-resv lock. Both would be illegal in the case
> drm_gpuva_unlink() is called from within the fence signalling path. For drivers
> like XE or Nouveau, we'd at least need to make sure to not mess up the locking
> hierarchy of GPU-VM lock and dma-resv lock of the corresponding BO.
>
> Considering all that, I thought it's probably better to track extobjs separate
> from the drm_gpuva_gem, hence the maple tree choice.

Hm. OK, in Xe we have a list of the xe_vmas (drm_gpuvas) that point 
to external objects, or in the case of multiple mappings to the same gem 
object, only one of the drm_gpuvas is in the list. These are protected 
by the GPU-VM lock. I don't see a problem with removing those from the 
fence signalling path, though?

Although, assuming that's a no-go for GPUVA, wouldn't an XArray be a 
better choice, keeping O(1)?
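
Roughly something like the below is what I have in mind (sketch only, 
untested; "xa_ext" and the helper names are made up here), with the GEM 
object pointer as index and an xa value entry as the reference count, 
assuming insertions / removals stay serialized by the outer GPU-VM lock:

static int extobj_xa_insert(struct drm_gpuva_manager *mgr,
			    struct drm_gem_object *obj)
{
	unsigned long index = (uintptr_t)obj;
	void *entry = xa_load(&mgr->xa_ext, index);
	unsigned long cnt = entry ? xa_to_value(entry) : 0;
	int ret;

	/* O(1) lookup and store; the first insertion grabs a GEM reference */
	ret = xa_err(xa_store(&mgr->xa_ext, index,
			      xa_mk_value(cnt + 1), GFP_KERNEL));
	if (!ret && !cnt)
		drm_gem_object_get(obj);

	return ret;
}

static void extobj_xa_remove(struct drm_gpuva_manager *mgr,
			     struct drm_gem_object *obj)
{
	unsigned long index = (uintptr_t)obj;
	void *entry = xa_load(&mgr->xa_ext, index);
	unsigned long cnt;

	if (WARN_ON(!entry))
		return;

	cnt = xa_to_value(entry);
	if (cnt > 1) {
		xa_store(&mgr->xa_ext, index, xa_mk_value(cnt - 1),
			 GFP_KERNEL);
	} else {
		/* last mapping gone, drop the tracking entry and GEM reference */
		xa_erase(&mgr->xa_ext, index);
		drm_gem_object_put(obj);
	}
}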

>
>>> +
>>> +	/**
>>> +	 * @evict: structure holding the evict list and evict list lock
>>> +	 */
>>> +	struct {
>>> +		/**
>>> +		 * @list: &list_head storing &drm_gem_objects currently being
>>> +		 * evicted
>>> +		 */
>>> +		struct list_head list;
>>> +
>>> +		/**
>>> +		 * @lock: spinlock to protect the evict list against concurrent
>>> +		 * insertion / removal of different &drm_gpuva_gems
>>> +		 */
>>> +		spinlock_t lock;
>>> +	} evict;
>>>    };
>>>    void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
>>> +			    struct drm_device *drm,
>>>    			    const char *name,
>>>    			    u64 start_offset, u64 range,
>>>    			    u64 reserve_offset, u64 reserve_range,
>>>    			    const struct drm_gpuva_fn_ops *ops);
>>>    void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr);
>>> +/**
>>> + * DRM_GPUVA_EXEC - returns the &drm_gpuva_managers &drm_exec instance
>>> + * @mgr: the &drm_gpuva_managers to return the &drm_exec instance for
>>> + */
>>> +#define DRM_GPUVA_EXEC(mgr)	&(mgr)->exec
>> A struct ww_acquire_ctx and thus a drm_exec is fundamentally per task and
>> should typically be allocated on the stack. Otherwise you'd need to protect
>> the mgr->exec member with an exclusive lock throughout the locking process,
>> and that's not what we want.
> Oh, good point. I think it works in Nouveau, because there it's implicitly
> protected with the job submission lock.
>
>> Did you consider subclassing a drm_exec for drm_gpuva purposes and add
>> needed ops to it: Like so:
> That's a good idea, will take this into V2.

Actually, I'm not fully sure that was a good idea: I now have a 
working version of Xe ported over to drm_exec, having these helpers in 
mind and with the intention to start using them as they mature. What I 
found, though, is that open-coding the drm_exec loop is not all that bad, 
but that building blocks that can be called from within the loop are useful:

Like the drm_gpuva_prepare_objects() and an imaginary 
drm_gpuva_prepare_gpuva() that locks the vm resv and the resv of the 
object the gpuva points to (if different). And 
drm_gpuva_prepare_array(), although we don't use it within Xe. That means 
you can use these building blocks like helpers and avoid the fn() 
callback by instead open-coding.

But I guess YMMV.
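
For illustration, the open-coded variant then boils down to something 
like this in the driver (sketch only; it assumes 
drm_gpuva_manager_prepare_objects() is reworked to operate on a 
caller-provided drm_exec living on the stack, and 
driver_prepare_job_bos() is a made-up placeholder for whatever else the 
submission needs to lock):

static int driver_lock_vm(struct drm_gpuva_manager *mgr,
			  struct drm_exec *exec,
			  unsigned int num_fences)
{
	int ret;

	drm_exec_init(exec, DRM_EXEC_INTERRUPTIBLE_WAIT);

	drm_exec_until_all_locked(exec) {
		/* building block: lock the VM resv and all extobj resvs */
		ret = drm_gpuva_manager_prepare_objects(mgr, exec, num_fences);
		drm_exec_retry_on_contention(exec);
		if (ret)
			goto err;

		/* driver specific additions within the same retry loop */
		ret = driver_prepare_job_bos(exec, num_fences);
		drm_exec_retry_on_contention(exec);
		if (ret)
			goto err;
	}

	return 0;

err:
	drm_exec_fini(exec);
	return ret;
}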

>
>> struct drm_gpuva_exec_ops {
>>      int (*fn) (struct drm_gpuva_exec *exec, int num_fences);
> Is this the fn argument from drm_gpuva_manager_lock_extra()?
>
>>      int (*bo_validate) (struct drm_gpuva_exec *exec, struct drm_gem_object
>> *obj);
> I guess we could also keep that within the drm_gpuva_fn_ops? This should always
> be the same callback, right?
>
>> };
>>
>> struct drm_gpuva_exec {
>>      const struct drm_gpuva_exec_ops *ops;
>>      struct drm_exec exec;
>>      struct drm_gpuva_manager *mgr;
>> };
>>
>> Although I'd actually expect bo_validate to be part of fn in the typical
>> case. The drm_gpuva_exec would then be allocated by the caller on the stack.
> This doesn't sound like my assumption about fn() above is correct.

Well, one important thing in our conversion is that ttm_bo_validate() 
needs to be in the until_all_locked() loop. We want to be able to use 
sleeping locks for eviction soon, so an xe_bo_validate() would, at least 
temporarily, add locked objects to the drm_exec list of locked objects. 
That means everything that may end up calling validate deep within the 
call chain needs to be part of the until_all_locked() loop, so our 
drm_gpuva_manager_lock_extra() fn callback would include those validates 
and look different all the time. Hence that's why open-coding isn't all 
that bad...

/Thomas


>
>>
>>> +
>>> +int drm_gpuva_manager_lock_extra(struct drm_gpuva_manager *mgr,
>>> +				 int (*fn)(struct drm_gpuva_manager *mgr,
>>> +					   void *priv, unsigned int num_fences),
>>> +				 void *priv,
>>> +				 unsigned int num_fences,
>>> +				 bool interruptible);
>>> +
>>> +int drm_gpuva_manager_lock_array(struct drm_gpuva_manager *mgr,
>>> +				 struct drm_gem_object **objs,
>>> +				 unsigned int num_objs,
>>> +				 unsigned int num_fences,
>>> +				 bool interruptible);
>>> +
>>> +/**
>>> + * drm_gpuva_manager_lock() - lock all dma-resv of all associated BOs
>>> + * @mgr: the &drm_gpuva_manager
>>> + * @num_fences: the amount of &dma_fences to reserve
>>> + * @interruptible: sleep interruptible if waiting
>>> + *
>>> + * Acquires all dma-resv locks of all &drm_gem_objects the given
>>> + * &drm_gpuva_manager contains mappings of.
>>> + *
>>> + * Returns: 0 on success, negative error code on failure.
>>> + */
>>> +static inline int
>>> +drm_gpuva_manager_lock(struct drm_gpuva_manager *mgr,
>>> +		       unsigned int num_fences,
>>> +		       bool interruptible)
>>> +{
>>> +	return drm_gpuva_manager_lock_extra(mgr, NULL, NULL, num_fences,
>>> +					    interruptible);
>>> +}
>>> +
>>> +/**
>>> + * drm_gpuva_manager_unlock() - unlock dma-resv of all associated BOs
>>> + * @mgr: the &drm_gpuva_manager
>>> + *
>>> + * Releases all dma-resv locks of all &drm_gem_objects previously acquired
>>> + * through drm_gpuva_manager_lock() or its variants.
>>> + */
>>> +static inline void
>>> +drm_gpuva_manager_unlock(struct drm_gpuva_manager *mgr)
>>> +{
>>> +	drm_exec_fini(&mgr->exec);
>>> +}
>>> +
>>> +int drm_gpuva_manager_validate(struct drm_gpuva_manager *mgr);
>>> +void drm_gpuva_manager_resv_add_fence(struct drm_gpuva_manager *mgr,
>>> +				      struct dma_fence *fence,
>>> +				      enum dma_resv_usage private_usage,
>>> +				      enum dma_resv_usage extobj_usage);
>>> +
>>> +int drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
>>> +			    struct drm_gem_object *obj);
>>> +void drm_gpuva_extobj_get(struct drm_gpuva_manager *mgr,
>>> +			  struct drm_gem_object *obj);
>>> +void drm_gpuva_extobj_put(struct drm_gpuva_manager *mgr,
>>> +			  struct drm_gem_object *obj);
>>> +
>>> +/**
>>> + * drm_gpuva_is_extobj() - indicates whether the given &drm_gem_object is an
>>> + * external object
>>> + * @mgr: the &drm_gpuva_manager to check
>>> + * @obj: the &drm_gem_object to check
>>> + *
>>> + * Returns: true if the &drm_gem_object &dma_resv differs from the
>>> + * &drm_gpuva_managers &dma_resv, false otherwise
>>> + */
>>> +static inline bool drm_gpuva_is_extobj(struct drm_gpuva_manager *mgr,
>>> +				       struct drm_gem_object *obj)
>>> +{
>>> +	return obj && obj->resv != mgr->resv;
>>> +}
>>> +
>>>    static inline struct drm_gpuva *
>>>    __drm_gpuva_next(struct drm_gpuva *va)
>>>    {
>>> @@ -327,6 +453,138 @@ __drm_gpuva_next(struct drm_gpuva *va)
>>>    #define drm_gpuva_for_each_va_safe(va__, next__, mgr__) \
>>>    	list_for_each_entry_safe(va__, next__, &(mgr__)->rb.list, rb.entry)
>>> +/**
>>> + * struct drm_gpuva_gem - structure representing a &drm_gpuva_manager and
>>> + * &drm_gem_object combination
>>> + *
>>> + * This structure is an abstraction representing a &drm_gpuva_manager and
>>> + * &drm_gem_object combination. It serves as an indirection to accelerate
>>> + * iterating all &drm_gpuvas within a &drm_gpuva_manager backed by the same
>>> + * &drm_gem_object.
>>> + *
>>> + * Furthermore it is used to cache evicted GEM objects for a certain GPU-VM to
>>> + * accelerate validation.
>>> + *
>>> + * Typically, drivers want to create an instance of a struct drm_gpuva_gem once
>>> + * a GEM object is mapped first in a GPU-VM and release the instance once the
>>> + * last mapping of the GEM object in this GPU-VM is unmapped.
>>> + */
>>> +struct drm_gpuva_gem {
>>> +
>>> +	/**
>>> +	 * @mgr: The &drm_gpuva_manager the @obj is mapped in.
>>> +	 */
>>> +	struct drm_gpuva_manager *mgr;
>>> +
>>> +	/**
>>> +	 * @obj: The &drm_gem_object being mapped in the @mgr.
>>> +	 */
>>> +	struct drm_gem_object *obj;
>>> +
>>> +	/**
>>> +	 * @kref: The reference count for this &drm_gpuva_gem.
>>> +	 */
>>> +	struct kref kref;
>>> +
>>> +	/**
>>> +	 * @list: Structure containing all &list_heads.
>>> +	 */
>>> +	struct {
>>> +		/**
>>> +		 * @gpuva: The list of linked &drm_gpuvas.
>>> +		 */
>>> +		struct list_head gpuva;
>>> +
>>> +		/**
>>> +		 * @entry: Structure containing all &list_heads serving as
>>> +		 * entry.
>>> +		 */
>>> +		struct {
>>> +			/**
>>> +			 * @gem: List entry to attach to the &drm_gem_objects
>>> +			 * gpuva list.
>>> +			 */
>>> +			struct list_head gem;
>>> +
>>> +			/**
>>> +			 * @evict: List entry to attach to the
>>> +			 * &drm_gpuva_managers evict list.
>>> +			 */
>>> +			struct list_head evict;
>>> +		} entry;
>>> +	} list;
>>> +};
>>> +
>>> +struct drm_gpuva_gem *
>>> +drm_gpuva_gem_obtain(struct drm_gpuva_manager *mgr,
>>> +		     struct drm_gem_object *obj);
>>> +struct drm_gpuva_gem *
>>> +drm_gpuva_gem_obtain_prealloc(struct drm_gpuva_manager *mgr,
>>> +			      struct drm_gem_object *obj,
>>> +			      struct drm_gpuva_gem *__vm_bo);
>>> +
>>> +struct drm_gpuva_gem *
>>> +drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
>>> +		   struct drm_gem_object *obj);
>>> +
>>> +void drm_gpuva_gem_evict(struct drm_gem_object *obj, bool evict);
>>> +
>>> +struct drm_gpuva_gem *
>>> +drm_gpuva_gem_create(struct drm_gpuva_manager *mgr,
>>> +		     struct drm_gem_object *obj);
>>> +void drm_gpuva_gem_destroy(struct kref *kref);
>>> +
>>> +/**
>>> + * drm_gpuva_gem_get() - acquire a struct drm_gpuva_gem reference
>>> + * @vm_bo: the &drm_gpuva_gem to acquire the reference of
>>> + *
>>> + * This function acquires an additional reference to @vm_bo. It is illegal to
>>> + * call this without already holding a reference. No locks required.
>>> + */
>>> +static inline struct drm_gpuva_gem *
>>> +drm_gpuva_gem_get(struct drm_gpuva_gem *vm_bo)
>>> +{
>>> +	kref_get(&vm_bo->kref);
>>> +	return vm_bo;
>>> +}
>>> +
>>> +/**
>>> + * drm_gpuva_gem_put() - drop a struct drm_gpuva_gem reference
>>> + * @vm_bo: the &drm_gpuva_gem to release the reference of
>>> + *
>>> + * This releases a reference to @vm_bo.
>>> + */
>>> +static inline void
>>> +drm_gpuva_gem_put(struct drm_gpuva_gem *vm_bo)
>>> +{
>>> +	kref_put(&vm_bo->kref, drm_gpuva_gem_destroy);
>>> +}
>>> +
>>> +/**
>>> + * drm_gpuva_gem_for_each_va() - iterator to walk over a list of &drm_gpuva
>>> + * @va__: &drm_gpuva structure to assign to in each iteration step
>>> + * @vm_bo__: the &drm_gpuva_gem the &drm_gpuva to walk are associated with
>>> + *
>>> + * This iterator walks over all &drm_gpuva structures associated with the
>>> + * &drm_gpuva_gem.
>>> + */
>>> +#define drm_gpuva_gem_for_each_va(va__, vm_bo__) \
>>> +	list_for_each_entry(va__, &(vm_bo__)->list.gpuva, gem.entry)
>>> +
>>> +/**
>>> + * drm_gpuva_gem_for_each_va_safe() - iterator to safely walk over a list of
>>> + * &drm_gpuva
>>> + * @va__: &drm_gpuva structure to assign to in each iteration step
>>> + * @next__: &next &drm_gpuva to store the next step
>>> + * @vm_bo__: the &drm_gpuva_gem the &drm_gpuva to walk are associated with
>>> + *
>>> + * This iterator walks over all &drm_gpuva structures associated with the
>>> + * &drm_gpuva_gem. It is implemented with list_for_each_entry_safe(), hence
>>> + * it is safe against removal of elements.
>>> + */
>>> +#define drm_gpuva_gem_for_each_va_safe(va__, next__, vm_bo__) \
>>> +	list_for_each_entry_safe(va__, next__, &(vm_bo__)->list.gpuva, gem.entry)
>>> +
>>>    /**
>>>     * enum drm_gpuva_op_type - GPU VA operation type
>>>     *
>>> @@ -641,6 +899,30 @@ struct drm_gpuva_fn_ops {
>>>    	 */
>>>    	void (*op_free)(struct drm_gpuva_op *op);
>>> +	/**
>>> +	 * @vm_bo_alloc: called when the &drm_gpuva_manager allocates
>>> +	 * a struct drm_gpuva_gem
>>> +	 *
>>> +	 * Some drivers may want to embed struct drm_gpuva_gem into driver
>>> +	 * specific structures. By implementing this callback drivers can
>>> +	 * allocate memory accordingly.
>>> +	 *
>>> +	 * This callback is optional.
>>> +	 */
>>> +	struct drm_gpuva_gem *(*vm_bo_alloc)(void);
>>> +
>>> +	/**
>>> +	 * @vm_bo_free: called when the &drm_gpuva_manager frees a
>>> +	 * struct drm_gpuva_gem
>>> +	 *
>>> +	 * Some drivers may want to embed struct drm_gpuva_gem into driver
>>> +	 * specific structures. By implementing this callback drivers can
>>> +	 * free the previously allocated memory accordingly.
>>> +	 *
>>> +	 * This callback is optional.
>>> +	 */
>>> +	void (*vm_bo_free)(struct drm_gpuva_gem *vm_bo);
>>> +
>>>    	/**
>>>    	 * @sm_step_map: called from &drm_gpuva_sm_map to finally insert the
>>>    	 * mapping once all previous steps were completed
>>> @@ -684,6 +966,17 @@ struct drm_gpuva_fn_ops {
>>>    	 * used.
>>>    	 */
>>>    	int (*sm_step_unmap)(struct drm_gpuva_op *op, void *priv);
>>> +
>>> +	/**
>>> +	 * @bo_validate: called from drm_gpuva_manager_validate()
>>> +	 *
>>> +	 * Drivers receive this callback for every evicted &drm_gem_object being
>>> +	 * mapped in the corresponding &drm_gpuva_manager.
>>> +	 *
>>> +	 * Typically, drivers would call their driver specific variant of
>>> +	 * ttm_bo_validate() from within this callback.
>>> +	 */
>>> +	int (*bo_validate)(struct drm_gem_object *obj);
>>>    };
>>>    int drm_gpuva_sm_map(struct drm_gpuva_manager *mgr, void *priv,
>>> @@ -696,11 +989,18 @@ int drm_gpuva_sm_unmap(struct drm_gpuva_manager *mgr, void *priv,
>>>    void drm_gpuva_map(struct drm_gpuva_manager *mgr,
>>>    		   struct drm_gpuva *va,
>>>    		   struct drm_gpuva_op_map *op);
>>> +void drm_gpuva_map_get(struct drm_gpuva_manager *mgr,
>>> +		       struct drm_gpuva *va,
>>> +		       struct drm_gpuva_op_map *op);
>>>    void drm_gpuva_remap(struct drm_gpuva *prev,
>>>    		     struct drm_gpuva *next,
>>>    		     struct drm_gpuva_op_remap *op);
>>> +void drm_gpuva_remap_get(struct drm_gpuva *prev,
>>> +			 struct drm_gpuva *next,
>>> +			 struct drm_gpuva_op_remap *op);
>>>    void drm_gpuva_unmap(struct drm_gpuva_op_unmap *op);
>>> +void drm_gpuva_unmap_put(struct drm_gpuva_op_unmap *op);
>>>    #endif /* __DRM_GPUVA_MGR_H__ */

^ permalink raw reply	[flat|nested] 88+ messages in thread

* Re: [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
@ 2023-08-30 13:42         ` Thomas Hellström (Intel)
  0 siblings, 0 replies; 88+ messages in thread
From: Thomas Hellström (Intel) @ 2023-08-30 13:42 UTC (permalink / raw)
  To: Danilo Krummrich
  Cc: airlied, daniel, matthew.brost, thomas.hellstrom, sarah.walker,
	donald.robson, boris.brezillon, christian.koenig, faith.ekstrand,
	bskeggs, Liam.Howlett, nouveau, linux-kernel, dri-devel


On 8/30/23 14:49, Danilo Krummrich wrote:
> Hi Thomas,
>
> thanks for having a look!
>
> On Wed, Aug 30, 2023 at 09:27:45AM +0200, Thomas Hellström (Intel) wrote:
>> Hi, Danilo.
>>
>> Some quick comments since I'm doing some Xe work in this area. Will probably
>> get back with more.
>>
>> On 8/20/23 23:53, Danilo Krummrich wrote:
>>> So far the DRM GPUVA manager offers common infrastructure to track GPU VA
>>> allocations and mappings, generically connect GPU VA mappings to their
>>> backing buffers and perform more complex mapping operations on the GPU VA
>>> space.
>>>
>>> However, there are more design patterns commonly used by drivers, which
>>> can potentially be generalized in order to make the DRM GPUVA manager
>>> represent a basic GPU-VM implementation. In this context, this patch aims
>>> at generalizing the following elements.
>>>
>>> 1) Provide a common dma-resv for GEM objects not being used outside of
>>>      this GPU-VM.
>>>
>>> 2) Provide tracking of external GEM objects (GEM objects which are
>>>      shared with other GPU-VMs).
>>>
>>> 3) Provide functions to efficiently lock all GEM objects dma-resv the
>>>      GPU-VM contains mappings of.
>>>
>>> 4) Provide tracking of evicted GEM objects the GPU-VM contains mappings
>>>      of, such that validation of evicted GEM objects is accelerated.
>>>
>>> 5) Provide some convenience functions for common patterns.
>>>
>>> Rather than being designed as a "framework", the target is to make all
>>> features appear as a collection of optional helper functions, such that
>>> drivers are free to make use of the DRM GPUVA managers basic
>>> functionality and opt-in for other features without setting any feature
>>> flags, just by making use of the corresponding functions.
>>>
>>> Signed-off-by: Danilo Krummrich <dakr@redhat.com>
>>> ---
>>>    drivers/gpu/drm/drm_gpuva_mgr.c | 688 +++++++++++++++++++++++++++++++-
>>>    include/drm/drm_gem.h           |  48 ++-
>>>    include/drm/drm_gpuva_mgr.h     | 302 +++++++++++++-
>>>    3 files changed, 1010 insertions(+), 28 deletions(-)
>>>
>>> diff --git a/drivers/gpu/drm/drm_gpuva_mgr.c b/drivers/gpu/drm/drm_gpuva_mgr.c
>>> index f86bfad74ff8..69872b205961 100644
>>> --- a/drivers/gpu/drm/drm_gpuva_mgr.c
>>> +++ b/drivers/gpu/drm/drm_gpuva_mgr.c
>>> @@ -655,6 +655,7 @@ drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
>>>    /**
>>>     * drm_gpuva_manager_init() - initialize a &drm_gpuva_manager
>>>     * @mgr: pointer to the &drm_gpuva_manager to initialize
>>> + * @drm: the drivers &drm_device
>>>     * @name: the name of the GPU VA space
>>>     * @start_offset: the start offset of the GPU VA space
>>>     * @range: the size of the GPU VA space
>>> @@ -669,6 +670,7 @@ drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
>>>     */
>>>    void
>>>    drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
>>> +		       struct drm_device *drm,
>>>    		       const char *name,
>>>    		       u64 start_offset, u64 range,
>>>    		       u64 reserve_offset, u64 reserve_range,
>>> @@ -677,6 +679,11 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
>>>    	mgr->rb.tree = RB_ROOT_CACHED;
>>>    	INIT_LIST_HEAD(&mgr->rb.list);
>>> +	mt_init(&mgr->mt_ext);
>>> +
>>> +	INIT_LIST_HEAD(&mgr->evict.list);
>>> +	spin_lock_init(&mgr->evict.lock);
>>> +
>>>    	drm_gpuva_check_overflow(start_offset, range);
>>>    	mgr->mm_start = start_offset;
>>>    	mgr->mm_range = range;
>>> @@ -694,6 +701,9 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
>>>    						     reserve_range)))
>>>    			__drm_gpuva_insert(mgr, &mgr->kernel_alloc_node);
>>>    	}
>>> +
>>> +	drm_gem_private_object_init(drm, &mgr->d_obj, 0);
>>> +	mgr->resv = mgr->d_obj.resv;
>>>    }
>>>    EXPORT_SYMBOL_GPL(drm_gpuva_manager_init);
>>> @@ -713,10 +723,575 @@ drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr)
>>>    		__drm_gpuva_remove(&mgr->kernel_alloc_node);
>>>    	WARN(!RB_EMPTY_ROOT(&mgr->rb.tree.rb_root),
>>> -	     "GPUVA tree is not empty, potentially leaking memory.");
>>> +	     "GPUVA tree is not empty, potentially leaking memory.\n");
>>> +
>>> +	mtree_destroy(&mgr->mt_ext);
>>> +	WARN(!list_empty(&mgr->evict.list), "Evict list should be empty.\n");
>>> +
>>> +	drm_gem_private_object_fini(&mgr->d_obj);
>>>    }
>>>    EXPORT_SYMBOL_GPL(drm_gpuva_manager_destroy);
>>> +/**
>>> + * drm_gpuva_manager_prepare_objects() - prepare all associated BOs
>>> + * @mgr: the &drm_gpuva_manager
>>> + * @num_fences: the amount of &dma_fences to reserve
>>> + *
>>> + * Calls drm_exec_prepare_obj() for all &drm_gem_objects the given
>>> + * &drm_gpuva_manager contains mappings of.
>>> + *
>>> + * Drivers can obtain the corresponding &drm_exec instance through
>>> + * DRM_GPUVA_EXEC(). It is the driver's responsibility to call drm_exec_init()
>>> + * and drm_exec_fini() accordingly.
>>> + *
>>> + * Returns: 0 on success, negative error code on failure.
>>> + */
>>> +int
>>> +drm_gpuva_manager_prepare_objects(struct drm_gpuva_manager *mgr,
>>> +				  unsigned int num_fences)
>>> +{
>>> +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
>>> +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
>>> +	union {
>>> +		void *ptr;
>>> +		uintptr_t cnt;
>>> +	} ref;
>>> +	int ret;
>>> +
>>> +	ret = drm_exec_prepare_obj(exec, &mgr->d_obj, num_fences);
>>> +	if (ret)
>>> +		goto out;
>>> +
>>> +	rcu_read_lock();
>> In xe we're protecting the external object list with an outer lock (same as
>> protecting the mgr itself). Do we need a separate lock for this? In theory,
>> as outlined in the VM_BIND locking document draft, one could probably even
>> use the mgr resv for this, but with more complicated code I guess. Also see
>> the comment below about the data structure chosen.
> The idea is to protect this list with the GPU-VM lock. The locking here is more
> of an implication of the maple tree. Either you use the internal lock of the
> maple tree or RCU respectively, or you give the maple tree an external lock to
> perform lockdep checks on (mt_set_external_lock()). Basically same as here:
>
> https://elixir.bootlin.com/linux/latest/source/drivers/base/regmap/regcache-maple.c#L124

Ah, I suspected it was something along those lines.


>
>>> +	mas_for_each(&mas, ref.ptr, ULONG_MAX) {
>>> +		struct drm_gem_object *obj;
>>> +
>>> +		mas_pause(&mas);
>>> +		rcu_read_unlock();
>>> +
>>> +		obj = (struct drm_gem_object *)(uintptr_t)mas.index;
>>> +		ret = drm_exec_prepare_obj(exec, obj, num_fences);
>>> +		if (ret)
>>> +			goto out;
>>> +
>>> +		rcu_read_lock();
>>> +	}
>>> +	rcu_read_unlock();
>>> +
>>> +out:
>>> +	return ret;
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_prepare_objects);
>>> +
>>> +/**
>>> + * drm_gpuva_manager_lock_extra() - lock all dma-resv of all associated BOs
>>> + * @mgr: the &drm_gpuva_manager
>>> + * @fn: callback received by the driver to lock additional dma-resv
>>> + * @priv: private driver data passed to @fn
>>> + * @num_fences: the amount of &dma_fences to reserve
>>> + * @interruptible: sleep interruptible if waiting
>>> + *
>>> + * Acquires all dma-resv locks of all &drm_gem_objects the given
>>> + * &drm_gpuva_manager contains mappings of.
>>> + *
>>> + * Additionally, when calling this function the driver receives the given @fn
>>> + * callback to lock additional dma-resv in the context of the
>>> + * &drm_gpuva_managers &drm_exec instance. Typically, drivers would call
>>> + * drm_exec_prepare_obj() from within this callback.
>>> + *
>>> + * Returns: 0 on success, negative error code on failure.
>>> + */
>>> +int
>>> +drm_gpuva_manager_lock_extra(struct drm_gpuva_manager *mgr,
>>> +			     int (*fn)(struct drm_gpuva_manager *mgr,
>>> +				       void *priv, unsigned int num_fences),
>>> +			     void *priv,
>>> +			     unsigned int num_fences,
>>> +			     bool interruptible)
>>> +{
>>> +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
>>> +	uint32_t flags;
>>> +	int ret;
>>> +
>>> +	flags = (interruptible ? DRM_EXEC_INTERRUPTIBLE_WAIT : 0) |
>>> +		DRM_EXEC_IGNORE_DUPLICATES;
>>> +
>>> +	drm_exec_init(exec, flags);
>>> +
>>> +	drm_exec_until_all_locked(exec) {
>>> +		ret = drm_gpuva_manager_prepare_objects(mgr, num_fences);
>>> +		drm_exec_retry_on_contention(exec);
>>> +		if (ret)
>>> +			goto err;
>>> +
>>> +		if (fn) {
>>> +			ret = fn(mgr, priv, num_fences);
>>> +			drm_exec_retry_on_contention(exec);
>>> +			if (ret)
>>> +				goto err;
>>> +		}
>>> +	}
>>> +
>>> +	return 0;
>>> +
>>> +err:
>>> +	drm_exec_fini(exec);
>>> +	return ret;
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_lock_extra);
>>> +
>>> +static int
>>> +fn_lock_array(struct drm_gpuva_manager *mgr, void *priv,
>>> +				unsigned int num_fences)
>>> +{
>>> +	struct {
>>> +		struct drm_gem_object **objs;
>>> +		unsigned int num_objs;
>>> +	} *args = priv;
>>> +
>>> +	return drm_exec_prepare_array(DRM_GPUVA_EXEC(mgr), args->objs,
>>> +				      args->num_objs, num_fences);
>>> +}
>>> +
>>> +/**
>>> + * drm_gpuva_manager_lock_array() - lock all dma-resv of all associated BOs
>>> + * @mgr: the &drm_gpuva_manager
>>> + * @objs: additional &drm_gem_objects to lock
>>> + * @num_objs: the number of additional &drm_gem_objects to lock
>>> + * @num_fences: the amount of &dma_fences to reserve
>>> + * @interruptible: sleep interruptible if waiting
>>> + *
>>> + * Acquires all dma-resv locks of all &drm_gem_objects the given
>>> + * &drm_gpuva_manager contains mappings of, plus the ones given through @objs.
>>> + *
>>> + * Returns: 0 on success, negative error code on failure.
>>> + */
>>> +int
>>> +drm_gpuva_manager_lock_array(struct drm_gpuva_manager *mgr,
>>> +			     struct drm_gem_object **objs,
>>> +			     unsigned int num_objs,
>>> +			     unsigned int num_fences,
>>> +			     bool interruptible)
>>> +{
>>> +	struct {
>>> +		struct drm_gem_object **objs;
>>> +		unsigned int num_objs;
>>> +	} args;
>>> +
>>> +	args.objs = objs;
>>> +	args.num_objs = num_objs;
>>> +
>>> +	return drm_gpuva_manager_lock_extra(mgr, fn_lock_array, &args,
>>> +					    num_fences, interruptible);
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_lock_array);
>>> +
>>> +/**
>>> + * drm_gpuva_manager_validate() - validate all BOs marked as evicted
>>> + * @mgr: the &drm_gpuva_manager to validate evicted BOs
>>> + *
>>> + * Calls the &drm_gpuva_fn_ops.bo_validate callback for all evicted buffer
>>> + * objects being mapped in the given &drm_gpuva_manager.
>>> + *
>>> + * Returns: 0 on success, negative error code on failure.
>>> + */
>>> +int
>>> +drm_gpuva_manager_validate(struct drm_gpuva_manager *mgr)
>>> +{
>>> +	const struct drm_gpuva_fn_ops *ops = mgr->ops;
>>> +	struct drm_gpuva_gem *vm_bo;
>>> +	int ret;
>>> +
>>> +	if (unlikely(!ops || !ops->bo_validate))
>>> +		return -ENOTSUPP;
>>> +
>>> +	/* At this point we should hold all dma-resv locks of all GEM objects
>>> +	 * associated with this GPU-VM, hence it is safe to walk the list.
>>> +	 */
>>> +	list_for_each_entry(vm_bo, &mgr->evict.list, list.entry.evict) {
>>> +		dma_resv_assert_held(vm_bo->obj->resv);
>>> +
>>> +		ret = ops->bo_validate(vm_bo->obj);
>>> +		if (ret)
>>> +			return ret;
>>> +	}
>>> +
>>> +	return 0;
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_validate);
>>> +
>>> +/**
>>> + * drm_gpuva_manager_resv_add_fence - add fence to private and all extobj
>>> + * dma-resv
>>> + * @mgr: the &drm_gpuva_manager to add a fence to
>>> + * @fence: fence to add
>>> + * @private_usage: private dma-resv usage
>>> + * @extobj_usage: extobj dma-resv usage
>>> + */
>>> +void
>>> +drm_gpuva_manager_resv_add_fence(struct drm_gpuva_manager *mgr,
>>> +				 struct dma_fence *fence,
>>> +				 enum dma_resv_usage private_usage,
>>> +				 enum dma_resv_usage extobj_usage)
>>> +{
>>> +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
>>> +	struct drm_gem_object *obj;
>>> +	unsigned long index;
>>> +
>>> +	drm_exec_for_each_locked_object(exec, index, obj) {
>>> +		dma_resv_assert_held(obj->resv);
>>> +		dma_resv_add_fence(obj->resv, fence,
>>> +				   drm_gpuva_is_extobj(mgr, obj) ?
>>> +				   extobj_usage : private_usage);
>>> +	}
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_resv_add_fence);
>>> +
>>> +static struct drm_gpuva_gem *
>>> +__drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
>>> +		     struct drm_gem_object *obj)
>>> +{
>>> +	struct drm_gpuva_gem *vm_bo;
>>> +
>>> +	drm_gem_gpuva_assert_lock_held(obj);
>>> +
>>> +	drm_gem_for_each_gpuva_gem(vm_bo, obj)
>>> +		if (vm_bo->mgr == mgr)
>>> +			return vm_bo;
>>> +
>>> +	return NULL;
>>> +}
>>> +
>>> +/**
>>> + * drm_gpuva_gem_create() - create a new instance of struct drm_gpuva_gem
>>> + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
>>> + * @obj: The &drm_gem_object being mapped in the @mgr.
>>> + *
>>> + * If provided by the driver, this function uses the &drm_gpuva_fn_ops
>>> + * vm_bo_alloc() callback to allocate.
>>> + *
>>> + * Returns: a pointer to the &drm_gpuva_gem on success, NULL on failure
>>> + */
>>> +struct drm_gpuva_gem *
>>> +drm_gpuva_gem_create(struct drm_gpuva_manager *mgr,
>>> +		     struct drm_gem_object *obj)
>>> +{
>>> +	const struct drm_gpuva_fn_ops *ops = mgr->ops;
>>> +	struct drm_gpuva_gem *vm_bo;
>>> +
>>> +	if (ops && ops->vm_bo_alloc)
>>> +		vm_bo = ops->vm_bo_alloc();
>>> +	else
>>> +		vm_bo = kzalloc(sizeof(*vm_bo), GFP_KERNEL);
>>> +
>>> +	if (unlikely(!vm_bo))
>>> +		return NULL;
>>> +
>>> +	vm_bo->mgr = mgr;
>>> +	vm_bo->obj = obj;
>>> +
>>> +	kref_init(&vm_bo->kref);
>>> +	INIT_LIST_HEAD(&vm_bo->list.gpuva);
>>> +	INIT_LIST_HEAD(&vm_bo->list.entry.gem);
>>> +	INIT_LIST_HEAD(&vm_bo->list.entry.evict);
>>> +
>>> +	drm_gem_object_get(obj);
>>> +
>>> +	return vm_bo;
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_create);
>>> +
>>> +void
>>> +drm_gpuva_gem_destroy(struct kref *kref)
>>> +{
>>> +	struct drm_gpuva_gem *vm_bo = container_of(kref, struct drm_gpuva_gem,
>>> +						   kref);
>>> +	const struct drm_gpuva_fn_ops *ops = vm_bo->mgr->ops;
>>> +
>>> +	drm_gem_object_put(vm_bo->obj);
>>> +
>>> +	if (ops && ops->vm_bo_free)
>>> +		ops->vm_bo_free(vm_bo);
>>> +	else
>>> +		kfree(vm_bo);
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_destroy);
>>> +
>>> +/**
>>> + * drm_gpuva_gem_find() - find the &drm_gpuva_gem for the given
>>> + * &drm_gpuva_manager and &drm_gem_object
>>> + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
>>> + * @obj: The &drm_gem_object being mapped in the @mgr.
>>> + *
>>> + * Find the &drm_gpuva_gem representing the combination of the given
>>> + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
>>> + * count of the &drm_gpuva_gem accordingly.
>>> + *
>>> + * Returns: a pointer to the &drm_gpuva_gem on success, NULL on failure
>>> + */
>>> +struct drm_gpuva_gem *
>>> +drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
>>> +		   struct drm_gem_object *obj)
>>> +{
>>> +	struct drm_gpuva_gem *vm_bo = __drm_gpuva_gem_find(mgr, obj);
>>> +
>>> +	return vm_bo ? drm_gpuva_gem_get(vm_bo) : NULL;
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_find);
>>> +
>>> +/**
>>> + * drm_gpuva_gem_obtain() - obtains an instance of the &drm_gpuva_gem for the
>>> + * given &drm_gpuva_manager and &drm_gem_object
>>> + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
>>> + * @obj: The &drm_gem_object being mapped in the @mgr.
>>> + *
>>> + * Find the &drm_gpuva_gem representing the combination of the given
>>> + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
>>> + * count of the &drm_gpuva_gem accordingly. If not found, allocates a new
>>> + * &drm_gpuva_gem.
>>> + *
>>> + * Returns: a pointer to the &drm_gpuva_gem on success, an ERR_PTR on failure
>>> + */
>>> +struct drm_gpuva_gem *
>>> +drm_gpuva_gem_obtain(struct drm_gpuva_manager *mgr,
>>> +		     struct drm_gem_object *obj)
>>> +{
>>> +	struct drm_gpuva_gem *vm_bo;
>>> +
>>> +	vm_bo = drm_gpuva_gem_find(mgr, obj);
>>> +	if (vm_bo)
>>> +		return vm_bo;
>>> +
>>> +	vm_bo = drm_gpuva_gem_create(mgr, obj);
>>> +	if (!vm_bo)
>>> +		return ERR_PTR(-ENOMEM);
>>> +
>>> +	return vm_bo;
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_obtain);
>>> +
>>> +/**
>>> + * drm_gpuva_gem_obtain_prealloc() - obtains an instance of the &drm_gpuva_gem
>>> + * for the given &drm_gpuva_manager and &drm_gem_object
>>> + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
>>> + * @obj: The &drm_gem_object being mapped in the @mgr.
>>> + *
>>> + * Find the &drm_gpuva_gem representing the combination of the given
>>> + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
>>> + * count of the found &drm_gpuva_gem accordingly, while the @__vm_bo reference
>>> + * count is decreased. If not found @__vm_bo is returned.
>>> + *
>>> + * Returns: a pointer to the found &drm_gpuva_gem or @__vm_bo if no existing
>>> + * &drm_gpuva_gem was found
>>> + */
>>> +struct drm_gpuva_gem *
>>> +drm_gpuva_gem_obtain_prealloc(struct drm_gpuva_manager *mgr,
>>> +			      struct drm_gem_object *obj,
>>> +			      struct drm_gpuva_gem *__vm_bo)
>>> +{
>>> +	struct drm_gpuva_gem *vm_bo;
>>> +
>>> +	vm_bo = drm_gpuva_gem_find(mgr, obj);
>>> +	if (vm_bo) {
>>> +		drm_gpuva_gem_put(__vm_bo);
>>> +		return vm_bo;
>>> +	}
>>> +
>>> +	return __vm_bo;
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_obtain_prealloc);
>>> +
>>> +static int
>>> +__drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
>>> +			  struct drm_gem_object *obj,
>>> +			  gfp_t gfp)
>>> +{
>>> +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
>>> +	union {
>>> +		struct drm_gem_object *obj;
>>> +		uintptr_t index;
>>> +	} gem;
>>> +	union {
>>> +		void *ptr;
>>> +		uintptr_t cnt;
>>> +	} ref;
>>> +	int ret = 0;
>>> +
>>> +	gem.obj = obj;
>>> +	mas_set(&mas, gem.index);
>>> +
>>> +	mas_lock(&mas);
>>> +	ref.ptr = mas_walk(&mas);
>>> +	if (ref.ptr) {
>>> +		++ref.cnt;
>>> +		mas_store(&mas, ref.ptr);
>>> +	} else {
>>> +		if (unlikely(!gfp)) {
>>> +			ret = -EINVAL;
>>> +			goto out;
>>> +		}
>>> +
>>> +		mas_set(&mas, gem.index);
>>> +		ref.cnt = 1;
>>> +		ret = mas_store_gfp(&mas, ref.ptr, gfp);
>>> +		if (likely(!ret))
>>> +			drm_gem_object_get(obj);
>>> +	}
>>> +out:
>>> +	mas_unlock(&mas);
>>> +	return ret;
>>> +}
>>> +
>>> +static void
>>> +__drm_gpuva_extobj_remove(struct drm_gpuva_manager *mgr,
>>> +			  struct drm_gem_object *obj)
>>> +{
>>> +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
>>> +	union {
>>> +		struct drm_gem_object *obj;
>>> +		uintptr_t index;
>>> +	} gem;
>>> +	union {
>>> +		void *ptr;
>>> +		uintptr_t cnt;
>>> +	} ref;
>>> +
>>> +	gem.obj = obj;
>>> +	mas_set(&mas, gem.index);
>>> +
>>> +	mas_lock(&mas);
>>> +	if (unlikely(!(ref.ptr = mas_walk(&mas))))
>>> +		goto out;
>>> +
>>> +	if (!--ref.cnt) {
>>> +		mas_erase(&mas);
>>> +		drm_gem_object_put(obj);
>>> +	} else {
>>> +		mas_store(&mas, ref.ptr);
>>> +	}
>>> +out:
>>> +	mas_unlock(&mas);
>>> +}
>>> +
>>> +/**
>>> + * drm_gpuva_extobj_insert - insert an external &drm_gem_object
>>> + * @mgr: the &drm_gpuva_manager to insert into
>>> + * @obj: the &drm_gem_object to insert as extobj
>>> + *
>>> + * Insert a &drm_gem_object into the &drm_gpuva_managers external object tree.
>>> + * If the &drm_gem_object already exists in the tree, the reference counter
>>> + * of this external object is increased by one.
>>> + *
>>> + * Drivers should insert the external &drm_gem_object before the dma-fence
>>> + * signalling critical section, e.g. when submitting the job, and before
>>> + * locking all &drm_gem_objects of a GPU-VM, e.g. with drm_gpuva_manager_lock()
>>> + * or its derivatives.
>>> + *
>>> + * Returns: 0 on success, negative error code on failure.
>>> + */
>>> +int
>>> +drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
>>> +			struct drm_gem_object *obj)
>>> +{
>>> +	return drm_gpuva_is_extobj(mgr, obj) ?
>>> +		__drm_gpuva_extobj_insert(mgr, obj, GFP_KERNEL) : 0;
>>> +
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_insert);
>>> +
>>> +/**
>>> + * drm_gpuva_extobj_get - increase the reference count of an external
>>> + * &drm_gem_object
>>> + * @mgr: the &drm_gpuva_manager storing the extobj
>>> + * @obj: the &drm_gem_object representing the extobj
>>> + *
>>> + * Increases the reference count of the extobj represented by @obj.
>>> + *
>>> + * Drivers should call this for every &drm_gpuva backed by a &drm_gem_object
>>> + * being inserted.
>>> + *
>>> + * For &drm_gpuva_op_remap operations drivers should make sure to only take an
>>> + * additional reference if the re-map operation splits an existing &drm_gpuva
>>> + * into two separate ones.
>>> + *
>>> + * See also drm_gpuva_map_get() and drm_gpuva_remap_get().
>>> + */
>>> +void
>>> +drm_gpuva_extobj_get(struct drm_gpuva_manager *mgr,
>>> +		     struct drm_gem_object *obj)
>>> +{
>>> +	if (drm_gpuva_is_extobj(mgr, obj))
>>> +		WARN(__drm_gpuva_extobj_insert(mgr, obj, 0),
>>> +		     "Can't increase ref-count of non-existent extobj.");
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_get);
>>> +
>>> +/**
>>> + * drm_gpuva_extobj_put - decrease the reference count of an external
>>> + * &drm_gem_object
>>> + * @mgr: the &drm_gpuva_manager storing the extobj
>>> + * @obj: the &drm_gem_object representing the extobj
>>> + *
>>> + * Decreases the reference count of the extobj represented by @obj.
>>> + *
>>> + * Drivers should call this for every &drm_gpuva backed by a &drm_gem_object
>>> + * being removed from the GPU VA space.
>>> + *
>>> + * See also drm_gpuva_unmap_put().
>>> + */
>>> +void
>>> +drm_gpuva_extobj_put(struct drm_gpuva_manager *mgr,
>>> +		     struct drm_gem_object *obj)
>>> +{
>>> +	if (drm_gpuva_is_extobj(mgr, obj))
>>> +		__drm_gpuva_extobj_remove(mgr, obj);
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_put);
>>> +
>>> +/**
>>> + * drm_gpuva_gem_evict() - add / remove a &drm_gem_object to / from a
>>> + * &drm_gpuva_managers evicted list
>>> + * @obj: the &drm_gem_object to add or remove
>>> + * @evict: indicates whether the object is evicted
>>> + *
>>> + * Adds a &drm_gem_object to or removes it from all &drm_gpuva_managers evicted
>>> + * list containing a mapping of this &drm_gem_object.
>>> + */
>>> +void
>>> +drm_gpuva_gem_evict(struct drm_gem_object *obj, bool evict)
>>> +{
>>> +	struct drm_gpuva_gem *vm_bo;
>>> +
>>> +	/* Required for iterating the GEMs GPUVA GEM list. If no driver specific
>>> +	 * lock has been set, the list is protected with the GEMs dma-resv lock.
>>> +	 */
>>> +	drm_gem_gpuva_assert_lock_held(obj);
>>> +
>>> +	/* Required to protect the GPUVA managers evict list against concurrent
>>> +	 * access through drm_gpuva_manager_validate(). Concurrent insertions to
>>> +	 * the evict list through different GEM object evictions are protected
>>> +	 * by the GPUVA managers evict lock.
>>> +	 */
>>> +	dma_resv_assert_held(obj->resv);
>>> +
>>> +	drm_gem_for_each_gpuva_gem(vm_bo, obj) {
>>> +		struct drm_gpuva_manager *mgr = vm_bo->mgr;
>>> +
>>> +		spin_lock(&mgr->evict.lock);
>>> +		if (evict)
>>> +			list_add_tail(&vm_bo->list.entry.evict,
>>> +				      &mgr->evict.list);
>>> +		else
>>> +			list_del_init(&vm_bo->list.entry.evict);
>>> +		spin_unlock(&mgr->evict.lock);
>>> +	}
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_evict);
>>> +
>>>    static int
>>>    __drm_gpuva_insert(struct drm_gpuva_manager *mgr,
>>>    		   struct drm_gpuva *va)
>>> @@ -806,15 +1381,20 @@ EXPORT_SYMBOL_GPL(drm_gpuva_remove);
>>>    /**
>>>     * drm_gpuva_link() - link a &drm_gpuva
>>>     * @va: the &drm_gpuva to link
>>> + * @vm_bo: the &drm_gpuva_gem to add the &drm_gpuva to
>>>     *
>>> - * This adds the given &va to the GPU VA list of the &drm_gem_object it is
>>> - * associated with.
>>> + * This adds the given &va to the GPU VA list of the &drm_gpuva_gem and the
>>> + * &drm_gpuva_gem to the &drm_gem_object it is associated with.
>>> + *
>>> + * For every &drm_gpuva entry added to the &drm_gpuva_gem an additional
>>> + * reference of the latter is taken.
>>>     *
>>>     * This function expects the caller to protect the GEM's GPUVA list against
>>> - * concurrent access using the GEMs dma_resv lock.
>>> + * concurrent access using either the GEMs dma_resv lock or a driver specific
>>> + * lock set through drm_gem_gpuva_set_lock().
>>>     */
>>>    void
>>> -drm_gpuva_link(struct drm_gpuva *va)
>>> +drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo)
>>>    {
>>>    	struct drm_gem_object *obj = va->gem.obj;
>>> @@ -823,7 +1403,10 @@ drm_gpuva_link(struct drm_gpuva *va)
>>>    	drm_gem_gpuva_assert_lock_held(obj);
>>> -	list_add_tail(&va->gem.entry, &obj->gpuva.list);
>>> +	drm_gpuva_gem_get(vm_bo);
>>> +	list_add_tail(&va->gem.entry, &vm_bo->list.gpuva);
>>> +	if (list_empty(&vm_bo->list.entry.gem))
>>> +		list_add_tail(&vm_bo->list.entry.gem, &obj->gpuva.list);
>>>    }
>>>    EXPORT_SYMBOL_GPL(drm_gpuva_link);
>>> @@ -834,20 +1417,39 @@ EXPORT_SYMBOL_GPL(drm_gpuva_link);
>>>     * This removes the given &va from the GPU VA list of the &drm_gem_object it is
>>>     * associated with.
>>>     *
>>> + * This removes the given &va from the GPU VA list of the &drm_gpuva_gem and
>>> + * the &drm_gpuva_gem from the &drm_gem_object it is associated with in case
>>> + * this call unlinks the last &drm_gpuva from the &drm_gpuva_gem.
>>> + *
>>> + * For every &drm_gpuva entry removed from the &drm_gpuva_gem a reference of
>>> + * the latter is dropped.
>>> + *
>>>     * This function expects the caller to protect the GEM's GPUVA list against
>>> - * concurrent access using the GEMs dma_resv lock.
>>> + * concurrent access using either the GEMs dma_resv lock or a driver specific
>>> + * lock set through drm_gem_gpuva_set_lock().
>>>     */
>>>    void
>>>    drm_gpuva_unlink(struct drm_gpuva *va)
>>>    {
>>>    	struct drm_gem_object *obj = va->gem.obj;
>>> +	struct drm_gpuva_gem *vm_bo;
>>>    	if (unlikely(!obj))
>>>    		return;
>>>    	drm_gem_gpuva_assert_lock_held(obj);
>>> +	vm_bo = __drm_gpuva_gem_find(va->mgr, obj);
>>> +	if (WARN(!vm_bo, "GPUVA doesn't seem to be linked.\n"))
>>> +		return;
>>> +
>>>    	list_del_init(&va->gem.entry);
>>> +
>>> +	if (list_empty(&vm_bo->list.gpuva)) {
>>> +		list_del_init(&vm_bo->list.entry.gem);
>>> +		list_del_init(&vm_bo->list.entry.evict);
>>> +	}
>>> +	drm_gpuva_gem_put(vm_bo);
>>>    }
>>>    EXPORT_SYMBOL_GPL(drm_gpuva_unlink);
>>> @@ -977,6 +1579,26 @@ drm_gpuva_map(struct drm_gpuva_manager *mgr,
>>>    }
>>>    EXPORT_SYMBOL_GPL(drm_gpuva_map);
>>> +/**
>>> + * drm_gpuva_map_get() - helper to insert a &drm_gpuva according to a
>>> + * &drm_gpuva_op_map
>>> + * @mgr: the &drm_gpuva_manager
>>> + * @va: the &drm_gpuva to insert
>>> + * @op: the &drm_gpuva_op_map to initialize @va with
>>> + *
>>> + * Initializes the @va from the @op and inserts it into the given @mgr and
>>> + * increases the reference count of the corresponding extobj.
>>> + */
>>> +void
>>> +drm_gpuva_map_get(struct drm_gpuva_manager *mgr,
>>> +		  struct drm_gpuva *va,
>>> +		  struct drm_gpuva_op_map *op)
>>> +{
>>> +	drm_gpuva_map(mgr, va, op);
>>> +	drm_gpuva_extobj_get(mgr, va->gem.obj);
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_map_get);
>>> +
>>>    /**
>>>     * drm_gpuva_remap() - helper to remap a &drm_gpuva according to a
>>>     * &drm_gpuva_op_remap
>>> @@ -992,10 +1614,10 @@ drm_gpuva_remap(struct drm_gpuva *prev,
>>>    		struct drm_gpuva *next,
>>>    		struct drm_gpuva_op_remap *op)
>>>    {
>>> -	struct drm_gpuva *curr = op->unmap->va;
>>> -	struct drm_gpuva_manager *mgr = curr->mgr;
>>> +	struct drm_gpuva *va = op->unmap->va;
>>> +	struct drm_gpuva_manager *mgr = va->mgr;
>>> -	drm_gpuva_remove(curr);
>>> +	drm_gpuva_remove(va);
>>>    	if (op->prev) {
>>>    		drm_gpuva_init_from_op(prev, op->prev);
>>> @@ -1009,6 +1631,31 @@ drm_gpuva_remap(struct drm_gpuva *prev,
>>>    }
>>>    EXPORT_SYMBOL_GPL(drm_gpuva_remap);
>>> +/**
>>> + * drm_gpuva_remap_get() - helper to remap a &drm_gpuva according to a
>>> + * &drm_gpuva_op_remap
>>> + * @prev: the &drm_gpuva to remap when keeping the start of a mapping
>>> + * @next: the &drm_gpuva to remap when keeping the end of a mapping
>>> + * @op: the &drm_gpuva_op_remap to initialize @prev and @next with
>>> + *
>>> + * Removes the currently mapped &drm_gpuva and remaps it using @prev and/or
>>> + * @next. Additionally, if the re-map splits the existing &drm_gpuva into two
>>> + * separate mappings, increases the reference count of the corresponding extobj.
>>> + */
>>> +void
>>> +drm_gpuva_remap_get(struct drm_gpuva *prev,
>>> +		    struct drm_gpuva *next,
>>> +		    struct drm_gpuva_op_remap *op)
>>> +{
>>> +	struct drm_gpuva *va = op->unmap->va;
>>> +	struct drm_gpuva_manager *mgr = va->mgr;
>>> +
>>> +	drm_gpuva_remap(prev, next, op);
>>> +	if (op->prev && op->next)
>>> +		drm_gpuva_extobj_get(mgr, va->gem.obj);
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_remap_get);
>>> +
>>>    /**
>>>     * drm_gpuva_unmap() - helper to remove a &drm_gpuva according to a
>>>     * &drm_gpuva_op_unmap
>>> @@ -1023,6 +1670,24 @@ drm_gpuva_unmap(struct drm_gpuva_op_unmap *op)
>>>    }
>>>    EXPORT_SYMBOL_GPL(drm_gpuva_unmap);
>>> +/**
>>> + * drm_gpuva_unmap_put() - helper to remove a &drm_gpuva according to a
>>> + * &drm_gpuva_op_unmap
>>> + * @op: the &drm_gpuva_op_unmap specifying the &drm_gpuva to remove
>>> + *
>>> + * Removes the &drm_gpuva associated with the &drm_gpuva_op_unmap and decreases
>>> + * the reference count of the corresponding extobj.
>>> + */
>>> +void
>>> +drm_gpuva_unmap_put(struct drm_gpuva_op_unmap *op)
>>> +{
>>> +	struct drm_gpuva *va = op->va;
>>> +
>>> +	drm_gpuva_unmap(op);
>>> +	drm_gpuva_extobj_put(va->mgr, va->gem.obj);
>>> +}
>>> +EXPORT_SYMBOL_GPL(drm_gpuva_unmap_put);
>>> +
>>>    static int
>>>    op_map_cb(const struct drm_gpuva_fn_ops *fn, void *priv,
>>>    	  u64 addr, u64 range,
>>> @@ -1663,6 +2328,7 @@ drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
>>>    {
>>>    	struct drm_gpuva_ops *ops;
>>>    	struct drm_gpuva_op *op;
>>> +	struct drm_gpuva_gem *vm_bo;
>>>    	struct drm_gpuva *va;
>>>    	int ret;
>>> @@ -1674,7 +2340,7 @@ drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
>>>    	INIT_LIST_HEAD(&ops->list);
>>> -	drm_gem_for_each_gpuva(va, obj) {
>>> +	drm_gem_for_each_gpuva(va, vm_bo, mgr, obj) {
>>>    		op = gpuva_op_alloc(mgr);
>>>    		if (!op) {
>>>    			ret = -ENOMEM;
>>> diff --git a/include/drm/drm_gem.h b/include/drm/drm_gem.h
>>> index bc9f6aa2f3fe..783ed3ab440d 100644
>>> --- a/include/drm/drm_gem.h
>>> +++ b/include/drm/drm_gem.h
>>> @@ -571,7 +571,7 @@ int drm_gem_evict(struct drm_gem_object *obj);
>>>     * drm_gem_gpuva_init() - initialize the gpuva list of a GEM object
>>>     * @obj: the &drm_gem_object
>>>     *
>>> - * This initializes the &drm_gem_object's &drm_gpuva list.
>>> + * This initializes the &drm_gem_object's &drm_gpuva_gem list.
>>>     *
>>>     * Calling this function is only necessary for drivers intending to support the
>>>     * &drm_driver_feature DRIVER_GEM_GPUVA.
>>> @@ -584,28 +584,44 @@ static inline void drm_gem_gpuva_init(struct drm_gem_object *obj)
>>>    }
>>>    /**
>>> - * drm_gem_for_each_gpuva() - iternator to walk over a list of gpuvas
>>> - * @entry__: &drm_gpuva structure to assign to in each iteration step
>>> - * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
>>> + * drm_gem_for_each_gpuva_gem() - iterator to walk over a list of &drm_gpuva_gem
>>> + * @entry__: &drm_gpuva_gem structure to assign to in each iteration step
>>> + * @obj__: the &drm_gem_object the &drm_gpuva_gem to walk are associated with
>>>     *
>>> - * This iterator walks over all &drm_gpuva structures associated with the
>>> - * &drm_gpuva_manager.
>>> + * This iterator walks over all &drm_gpuva_gem structures associated with the
>>> + * &drm_gem_object.
>>>     */
>>> -#define drm_gem_for_each_gpuva(entry__, obj__) \
>>> -	list_for_each_entry(entry__, &(obj__)->gpuva.list, gem.entry)
>>> +#define drm_gem_for_each_gpuva_gem(entry__, obj__) \
>>> +	list_for_each_entry(entry__, &(obj__)->gpuva.list, list.entry.gem)
>>>    /**
>>> - * drm_gem_for_each_gpuva_safe() - iternator to safely walk over a list of
>>> - * gpuvas
>>> - * @entry__: &drm_gpuva structure to assign to in each iteration step
>>> - * @next__: &next &drm_gpuva to store the next step
>>> - * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
>>> + * drm_gem_for_each_gpuva_gem_safe() - iterator to safely walk over a list of
>>> + * &drm_gpuva_gem
>>> + * @entry__: &drm_gpuva_gem structure to assign to in each iteration step
>>> + * @next__: &next &drm_gpuva_gem to store the next step
>>> + * @obj__: the &drm_gem_object the &drm_gpuva_gem to walk are associated with
>>>     *
>>> - * This iterator walks over all &drm_gpuva structures associated with the
>>> + * This iterator walks over all &drm_gpuva_gem structures associated with the
>>>     * &drm_gem_object. It is implemented with list_for_each_entry_safe(), hence
>>>     * it is save against removal of elements.
>>>     */
>>> -#define drm_gem_for_each_gpuva_safe(entry__, next__, obj__) \
>>> -	list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list, gem.entry)
>>> +#define drm_gem_for_each_gpuva_gem_safe(entry__, next__, obj__) \
>>> +	list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list, list.entry.gem)
>>> +
>>> +/**
>>> + * drm_gem_for_each_gpuva() - iterator to walk over a list of &drm_gpuva
>>> + * @va__: &drm_gpuva structure to assign to in each iteration step
>>> + * @vm_bo__: the &drm_gpuva_gem representing the @mgr__ and @obj__ combination
>>> + * @mgr__: the &drm_gpuva_manager the &drm_gpuvas to walk are associated with
>>> + * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
>>> + *
>>> + * This iterator walks over all &drm_gpuva structures associated with the
>>> + * &drm_gpuva_manager and &drm_gem_object.
>>> + */
>>> +#define drm_gem_for_each_gpuva(va__, vm_bo__, mgr__, obj__) \
>>> +	for (vm_bo__ = drm_gpuva_gem_find(mgr__, obj__), \
>>> +	     va__ = vm_bo__ ? list_first_entry(&vm_bo__->list.gpuva, typeof(*va__), gem.entry) : NULL; \
>>> +	     va__ && !list_entry_is_head(va__, &vm_bo__->list.gpuva, gem.entry); \
>>> +	     va__ = list_next_entry(va__, gem.entry))
>>>    #endif /* __DRM_GEM_H__ */
>>> diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h
>>> index ed8d50200cc3..693e2da3f425 100644
>>> --- a/include/drm/drm_gpuva_mgr.h
>>> +++ b/include/drm/drm_gpuva_mgr.h
>>> @@ -26,12 +26,16 @@
>>>     */
>>>    #include <linux/list.h>
>>> +#include <linux/dma-resv.h>
>>> +#include <linux/maple_tree.h>
>>>    #include <linux/rbtree.h>
>>>    #include <linux/types.h>
>>>    #include <drm/drm_gem.h>
>>> +#include <drm/drm_exec.h>
>>>    struct drm_gpuva_manager;
>>> +struct drm_gpuva_gem;
>>>    struct drm_gpuva_fn_ops;
>>>    /**
>>> @@ -140,7 +144,7 @@ struct drm_gpuva {
>>>    int drm_gpuva_insert(struct drm_gpuva_manager *mgr, struct drm_gpuva *va);
>>>    void drm_gpuva_remove(struct drm_gpuva *va);
>>> -void drm_gpuva_link(struct drm_gpuva *va);
>>> +void drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo);
>>>    void drm_gpuva_unlink(struct drm_gpuva *va);
>>>    struct drm_gpuva *drm_gpuva_find(struct drm_gpuva_manager *mgr,
>>> @@ -240,15 +244,137 @@ struct drm_gpuva_manager {
>>>    	 * @ops: &drm_gpuva_fn_ops providing the split/merge steps to drivers
>>>    	 */
>>>    	const struct drm_gpuva_fn_ops *ops;
>>> +
>>> +	/**
>>> +	 * @d_obj: Dummy GEM object; used internally to pass the GPU VMs
>>> +	 * dma-resv to &drm_exec.
>>> +	 */
>>> +	struct drm_gem_object d_obj;
>>> +
>>> +	/**
>>> +	 * @resv: the &dma_resv for &drm_gem_objects mapped in this GPU VA
>>> +	 * space
>>> +	 */
>>> +	struct dma_resv *resv;
>>> +
>>> +	/**
>>> +	 * @exec: the &drm_exec helper to lock external &drm_gem_objects
>>> +	 */
>>> +	struct drm_exec exec;
>>> +
>>> +	/**
>>> +	 * @mt_ext: &maple_tree storing external &drm_gem_objects
>>> +	 */
>>> +	struct maple_tree mt_ext;
>> Why are you using a maple tree here? Insertion and removal is O(log(n))
>> instead of O(1) for a list?
>>
> Having a list of drm_gem_objects directly wouldn't work, as multiple GPU-VMs
> could have mappings of the same extobj.
>
> I considered using the VM_BO abstraction (struct drm_gpuva_gem) as list entry
> instead, which also seems to be the obvious choice. However, there is a locking
> conflict.
>
> A drm_gem_object keeps a list of drm_gpuva_gems, while each drm_gpuva_gem keeps
> a list of drm_gpuvas. Both lists are either protected with the dma-resv lock of
> the corresponding drm_gem_object, or with an external lock provided by the
> driver (see drm_gem_gpuva_set_lock()). The latter is used by drivers performing
> changes on the GPUVA space directly from the fence signalling path.
>
> Now, similar to what drm_gpuva_link() and drm_gpuva_unlink() are doing already,
> we'd want to add a drm_gpuva_gem to the extobj list for the first mapping being
> linked and we'd want to remove it for the last one being unlinked.
>
> (Actually we'd want to add the drm_gpuva_gem object to the extobj list even
> before, because otherwise we'd not acquire this GEM object's dma-resv lock
> through drm_gpuva_manager_lock(). But that's trivial, we could do that when we
> create the drm_gpuva_gem, which we need to do anyways.)
>
> Anyway, we'd probably want to keep removing the drm_gpuva_gem from the extobj
> list from drm_gpuva_unlink() when the last mapping of this BO is unlinked. In
> order to do so, we'd (as discussed above) either need to hold the outer GPU-VM
> lock or the GPU-VMs dma-resv lock. Both would be illegal in the case
> drm_gpuva_unlink() is called from within the fence signalling path. For drivers
> like XE or Nouveau, we'd at least need to make sure to not mess up the locking
> hierarchy of GPU-VM lock and dma-resv lock of the corresponding BO.
>
> Considering all that, I thought it's probably better to track extobjs separate
> from the drm_gpuva_gem, hence the maple tree choice.

Hm. OK, in Xe we have a list of the xe_vmas (drm_gpuvas) that point 
to external objects, or in the case of multiple mappings to the same gem 
object, only one of the drm_gpuvas is in the list. These are protected 
by the GPU-VM lock. I don't see a problem with removing those from the 
fence signalling path, though?

Although, assuming that's a no-go for GPUVA, wouldn't an XArray be a 
better choice, keeping O(1)?

>
>>> +
>>> +	/**
>>> +	 * @evict: structure holding the evict list and evict list lock
>>> +	 */
>>> +	struct {
>>> +		/**
>>> +		 * @list: &list_head storing &drm_gem_objects currently being
>>> +		 * evicted
>>> +		 */
>>> +		struct list_head list;
>>> +
>>> +		/**
>>> +		 * @lock: spinlock to protect the evict list against concurrent
>>> +		 * insertion / removal of different &drm_gpuva_gems
>>> +		 */
>>> +		spinlock_t lock;
>>> +	} evict;
>>>    };
>>>    void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
>>> +			    struct drm_device *drm,
>>>    			    const char *name,
>>>    			    u64 start_offset, u64 range,
>>>    			    u64 reserve_offset, u64 reserve_range,
>>>    			    const struct drm_gpuva_fn_ops *ops);
>>>    void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr);
>>> +/**
>>> + * DRM_GPUVA_EXEC - returns the &drm_gpuva_managers &drm_exec instance
>>> + * @mgr: the &drm_gpuva_managers to return the &drm_exec instance for
>>> + */
>>> +#define DRM_GPUVA_EXEC(mgr)	&(mgr)->exec
>> A struct ww_acquire_ctx and thus a drm_exec is fundamentally per task and
>> should typically be allocated on the stack. Otherwise you'd need to protect
>> the mgr->exec member with an exclusive lock throughout the locking process,
>> and that's not what we want.
> Oh, good point. I think it works in Nouveau, because there it's implicitly
> protected with the job submission lock.
>
>> Did you consider subclassing a drm_exec for drm_gpuva purposes and add
>> needed ops to it: Like so:
> That's a good idea, will take this into V2.

Actually, I'm not fully sure that was a good idea: I now have a 
working version of Xe ported over to drm_exec, having these helpers in 
mind and with the intention to start using them as they mature. What I 
found, though, is that open-coding the drm_exec loop is not all that bad, 
but that building blocks that can be called from within the loop are useful:

Like drm_gpuva_prepare_objects() and an imaginary 
drm_gpuva_prepare_gpuva() that locks the vm resv and the resv of the 
object (if different) that the gpuva points to. And 
drm_gpuva_prepare_array(), although we don't use it within Xe. That means 
you can use these building blocks like helpers and avoid the fn() 
callback by open-coding instead.

But I guess YMMV.
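
Roughly, what I have in mind is something like the below. This is only a
sketch against this revision (so the drm_exec instance is still the one
embedded in the manager), and driver_lock_vm() is a made-up placeholder,
not Xe code:

#include <drm/drm_exec.h>
#include <drm/drm_gpuva_mgr.h>

static int driver_lock_vm(struct drm_gpuva_manager *mgr,
			  unsigned int num_fences)
{
	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
	int ret;

	drm_exec_init(exec, DRM_EXEC_INTERRUPTIBLE_WAIT);

	drm_exec_until_all_locked(exec) {
		/* Building block: VM resv plus all known extobj resvs. */
		ret = drm_gpuva_manager_prepare_objects(mgr, num_fences);
		drm_exec_retry_on_contention(exec);
		if (ret)
			goto err;

		/* Driver specific part goes here: locking per-job BOs,
		 * validating, etc. Anything that may acquire further
		 * resvs must stay inside this loop.
		 */
	}

	return 0;

err:
	drm_exec_fini(exec);
	return ret;
}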

>
>> struct drm_gpuva_exec_ops {
>>      int (*fn) (struct drm_gpuva_exec *exec, int num_fences);
> Is this the fn argument from drm_gpuva_manager_lock_extra()?
>
>>      int (*bo_validate) (struct drm_gpuva_exec *exec, struct drm_gem_object
>> *obj);
> I guess we could also keep that within the drm_gpuva_fn_ops? This should always
> be the same callback, right?
>
>> };
>>
>> struct drm_gpuva_exec {
>>      const struct drm_gpuva_exec_ops *ops;
>>      struct drm_exec exec;
>>      struct drm_gpuva_manager *mgr;
>> };
>>
>> Although I'd actually expect bo_validate to be part of fn in the typical
>> case. The drm_gpuva_exec would then be allocated by the caller on the stack.
> This doesn't sound like my assumption about fn() above is correct.

Well, one important thing in our conversion is that ttm_bo_validate() 
needs to be in the until_all_locked() loop. We soon want to be able to 
use sleeping locks for eviction, so an xe_bo_validate() would, at least 
temporarily, add locked objects to the drm_exec list of locked objects. 
That means everything that may end up calling validate deep within the 
call chain needs to be part of the until_all_locked() loop, so our 
drm_gpuva_manager_lock_extra() fn callback would include those validates 
and look different all the time. Hence that's why open-coding isn't all 
that bad...
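
To illustrate, a fn callback for one of our submission paths would end up
looking roughly like this (sketch only; struct xe_submit and its fields are
invented for the example, the rest are the helpers from this series):

struct xe_submit {
	struct drm_gem_object **objs;
	unsigned int num_objs;
};

static int xe_exec_lock_fn(struct drm_gpuva_manager *mgr, void *priv,
			   unsigned int num_fences)
{
	struct xe_submit *submit = priv;
	int ret;

	/* Per-job BOs of this particular ioctl. */
	ret = drm_exec_prepare_array(DRM_GPUVA_EXEC(mgr), submit->objs,
				     submit->num_objs, num_fences);
	if (ret)
		return ret;

	/* May validate and thereby lock / track additional objects, so it
	 * has to run inside the until_all_locked() loop, i.e. inside fn().
	 */
	return drm_gpuva_manager_validate(mgr);
}

A different ioctl (say a VM_BIND) would need a different callback body,
which is what makes the callback indirection awkward compared to
open-coding.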

/Thomas


>
>>
>>> +
>>> +int drm_gpuva_manager_lock_extra(struct drm_gpuva_manager *mgr,
>>> +				 int (*fn)(struct drm_gpuva_manager *mgr,
>>> +					   void *priv, unsigned int num_fences),
>>> +				 void *priv,
>>> +				 unsigned int num_fences,
>>> +				 bool interruptible);
>>> +
>>> +int drm_gpuva_manager_lock_array(struct drm_gpuva_manager *mgr,
>>> +				 struct drm_gem_object **objs,
>>> +				 unsigned int num_objs,
>>> +				 unsigned int num_fences,
>>> +				 bool interruptible);
>>> +
>>> +/**
>>> + * drm_gpuva_manager_lock() - lock all dma-resv of all assoiciated BOs
>>> + * @mgr: the &drm_gpuva_manager
>>> + * @num_fences: the amount of &dma_fences to reserve
>>> + * @interruptible: sleep interruptible if waiting
>>> + *
>>> + * Acquires all dma-resv locks of all &drm_gem_objects the given
>>> + * &drm_gpuva_manager contains mappings of.
>>> + *
>>> + * Returns: 0 on success, negative error code on failure.
>>> + */
>>> +static inline int
>>> +drm_gpuva_manager_lock(struct drm_gpuva_manager *mgr,
>>> +		       unsigned int num_fences,
>>> +		       bool interruptible)
>>> +{
>>> +	return drm_gpuva_manager_lock_extra(mgr, NULL, NULL, num_fences,
>>> +					    interruptible);
>>> +}
>>> +
>>> +/**
>>> + * drm_gpuva_manager_lock() - lock all dma-resv of all assoiciated BOs
>>> + * @mgr: the &drm_gpuva_manager
>>> + *
>>> + * Releases all dma-resv locks of all &drm_gem_objects previously acquired
>>> + * through drm_gpuva_manager_lock() or its variants.
>>> + *
>>> + * Returns: 0 on success, negative error code on failure.
>>> + */
>>> +static inline void
>>> +drm_gpuva_manager_unlock(struct drm_gpuva_manager *mgr)
>>> +{
>>> +	drm_exec_fini(&mgr->exec);
>>> +}
>>> +
>>> +int drm_gpuva_manager_validate(struct drm_gpuva_manager *mgr);
>>> +void drm_gpuva_manager_resv_add_fence(struct drm_gpuva_manager *mgr,
>>> +				      struct dma_fence *fence,
>>> +				      enum dma_resv_usage private_usage,
>>> +				      enum dma_resv_usage extobj_usage);
>>> +
>>> +int drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
>>> +			    struct drm_gem_object *obj);
>>> +void drm_gpuva_extobj_get(struct drm_gpuva_manager *mgr,
>>> +			  struct drm_gem_object *obj);
>>> +void drm_gpuva_extobj_put(struct drm_gpuva_manager *mgr,
>>> +			  struct drm_gem_object *obj);
>>> +
>>> +/**
>>> + * drm_gpuva_is_extobj() - indicates whether the given &drm_gem_object is an
>>> + * external object
>>> + * @mgr: the &drm_gpuva_manager to check
>>> + * @obj: the &drm_gem_object to check
>>> + *
>>> + * Returns: true if the &drm_gem_object &dma_resv differs from the
>>> + * &drm_gpuva_managers &dma_resv, false otherwise
>>> + */
>>> +static inline bool drm_gpuva_is_extobj(struct drm_gpuva_manager *mgr,
>>> +				       struct drm_gem_object *obj)
>>> +{
>>> +	return obj && obj->resv != mgr->resv;
>>> +}
>>> +
>>>    static inline struct drm_gpuva *
>>>    __drm_gpuva_next(struct drm_gpuva *va)
>>>    {
>>> @@ -327,6 +453,138 @@ __drm_gpuva_next(struct drm_gpuva *va)
>>>    #define drm_gpuva_for_each_va_safe(va__, next__, mgr__) \
>>>    	list_for_each_entry_safe(va__, next__, &(mgr__)->rb.list, rb.entry)
>>> +/**
>>> + * struct drm_gpuva_gem - structure representing a &drm_gpuva_manager and
>>> + * &drm_gem_object combination
>>> + *
>>> + * This structure is an abstraction representing a &drm_gpuva_manager and
>>> + * &drm_gem_object combination. It serves as an indirection to accelerate
>>> + * iterating all &drm_gpuvas within a &drm_gpuva_manager backed by the same
>>> + * &drm_gem_object.
>>> + *
>>> + * Furthermore it is used cache evicted GEM objects for a certain GPU-VM to
>>> + * accelerate validation.
>>> + *
>>> + * Typically, drivers want to create an instance of a struct drm_gpuva_gem once
>>> + * a GEM object is mapped first in a GPU-VM and release the instance once the
>>> + * last mapping of the GEM object in this GPU-VM is unmapped.
>>> + */
>>> +struct drm_gpuva_gem {
>>> +
>>> +	/**
>>> +	 * @mgr: The &drm_gpuva_manager the @obj is mapped in.
>>> +	 */
>>> +	struct drm_gpuva_manager *mgr;
>>> +
>>> +	/**
>>> +	 * @obj: The &drm_gem_object being mapped in the @mgr.
>>> +	 */
>>> +	struct drm_gem_object *obj;
>>> +
>>> +	/**
>>> +	 * @kref: The reference count for this &drm_gpuva_gem.
>>> +	 */
>>> +	struct kref kref;
>>> +
>>> +	/**
>>> +	 * @list: Structure containing all &list_heads.
>>> +	 */
>>> +	struct {
>>> +		/**
>>> +		 * @gpuva: The list of linked &drm_gpuvas.
>>> +		 */
>>> +		struct list_head gpuva;
>>> +
>>> +		/**
>>> +		 * @entry: Structure containing all &list_heads serving as
>>> +		 * entry.
>>> +		 */
>>> +		struct {
>>> +			/**
>>> +			 * @gem: List entry to attach to the &drm_gem_objects
>>> +			 * gpuva list.
>>> +			 */
>>> +			struct list_head gem;
>>> +
>>> +			/**
>>> +			 * @evict: List entry to attach to the
>>> +			 * &drm_gpuva_managers evict list.
>>> +			 */
>>> +			struct list_head evict;
>>> +		} entry;
>>> +	} list;
>>> +};
>>> +
>>> +struct drm_gpuva_gem *
>>> +drm_gpuva_gem_obtain(struct drm_gpuva_manager *mgr,
>>> +		     struct drm_gem_object *obj);
>>> +struct drm_gpuva_gem *
>>> +drm_gpuva_gem_obtain_prealloc(struct drm_gpuva_manager *mgr,
>>> +			      struct drm_gem_object *obj,
>>> +			      struct drm_gpuva_gem *__vm_bo);
>>> +
>>> +struct drm_gpuva_gem *
>>> +drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
>>> +		   struct drm_gem_object *obj);
>>> +
>>> +void drm_gpuva_gem_evict(struct drm_gem_object *obj, bool evict);
>>> +
>>> +struct drm_gpuva_gem *
>>> +drm_gpuva_gem_create(struct drm_gpuva_manager *mgr,
>>> +		     struct drm_gem_object *obj);
>>> +void drm_gpuva_gem_destroy(struct kref *kref);
>>> +
>>> +/**
>>> + * drm_gpuva_gem_get() - acquire a struct drm_gpuva_gem reference
>>> + * @vm_bo: the &drm_gpuva_gem to acquire the reference of
>>> + *
>>> + * This function acquires an additional reference to @vm_bo. It is illegal to
>>> + * call this without already holding a reference. No locks required.
>>> + */
>>> +static inline struct drm_gpuva_gem *
>>> +drm_gpuva_gem_get(struct drm_gpuva_gem *vm_bo)
>>> +{
>>> +	kref_get(&vm_bo->kref);
>>> +	return vm_bo;
>>> +}
>>> +
>>> +/**
>>> + * drm_gpuva_gem_put() - drop a struct drm_gpuva_gem reference
>>> + * @vm_bo: the &drm_gpuva_gem to release the reference of
>>> + *
>>> + * This releases a reference to @vm_bo.
>>> + */
>>> +static inline void
>>> +drm_gpuva_gem_put(struct drm_gpuva_gem *vm_bo)
>>> +{
>>> +	kref_put(&vm_bo->kref, drm_gpuva_gem_destroy);
>>> +}
>>> +
>>> +/**
>>> + * drm_gpuva_gem_for_each_va() - iterator to walk over a list of &drm_gpuva
>>> + * @va__: &drm_gpuva structure to assign to in each iteration step
>>> + * @vm_bo__: the &drm_gpuva_gem the &drm_gpuva to walk are associated with
>>> + *
>>> + * This iterator walks over all &drm_gpuva structures associated with the
>>> + * &drm_gpuva_gem.
>>> + */
>>> +#define drm_gpuva_gem_for_each_va(va__, vm_bo__) \
>>> +	list_for_each_entry(va__, &(vm_bo)->list.gpuva, gem.entry)
>>> +
>>> +/**
>>> + * drm_gpuva_gem_for_each_va_safe() - iterator to safely walk over a list of
>>> + * &drm_gpuva
>>> + * @va__: &drm_gpuva structure to assign to in each iteration step
>>> + * @next__: &next &drm_gpuva to store the next step
>>> + * @vm_bo__: the &drm_gpuva_gem the &drm_gpuva to walk are associated with
>>> + *
>>> + * This iterator walks over all &drm_gpuva structures associated with the
>>> + * &drm_gpuva_gem. It is implemented with list_for_each_entry_safe(), hence
>>> + * it is save against removal of elements.
>>> + */
>>> +#define drm_gpuva_gem_for_each_va_safe(va__, next__, vm_bo__) \
>>> +	list_for_each_entry_safe(va__, next__, &(vm_bo)->list.gpuva, gem.entry)
>>> +
>>>    /**
>>>     * enum drm_gpuva_op_type - GPU VA operation type
>>>     *
>>> @@ -641,6 +899,30 @@ struct drm_gpuva_fn_ops {
>>>    	 */
>>>    	void (*op_free)(struct drm_gpuva_op *op);
>>> +	/**
>>> +	 * @vm_bo_alloc: called when the &drm_gpuva_manager allocates
>>> +	 * a struct drm_gpuva_gem
>>> +	 *
>>> +	 * Some drivers may want to embed struct drm_gpuva_gem into driver
>>> +	 * specific structures. By implementing this callback drivers can
>>> +	 * allocate memory accordingly.
>>> +	 *
>>> +	 * This callback is optional.
>>> +	 */
>>> +	struct drm_gpuva_gem *(*vm_bo_alloc)(void);
>>> +
>>> +	/**
>>> +	 * @vm_bo_free: called when the &drm_gpuva_manager frees a
>>> +	 * struct drm_gpuva_gem
>>> +	 *
>>> +	 * Some drivers may want to embed struct drm_gpuva_gem into driver
>>> +	 * specific structures. By implementing this callback drivers can
>>> +	 * free the previously allocated memory accordingly.
>>> +	 *
>>> +	 * This callback is optional.
>>> +	 */
>>> +	void (*vm_bo_free)(struct drm_gpuva_gem *vm_bo);
>>> +
>>>    	/**
>>>    	 * @sm_step_map: called from &drm_gpuva_sm_map to finally insert the
>>>    	 * mapping once all previous steps were completed
>>> @@ -684,6 +966,17 @@ struct drm_gpuva_fn_ops {
>>>    	 * used.
>>>    	 */
>>>    	int (*sm_step_unmap)(struct drm_gpuva_op *op, void *priv);
>>> +
>>> +	/**
>>> +	 * @bo_validate: called from drm_gpuva_manager_validate()
>>> +	 *
>>> +	 * Drivers receive this callback for every evicted &drm_gem_object being
>>> +	 * mapped in the corresponding &drm_gpuva_manager.
>>> +	 *
>>> +	 * Typically, drivers would call their driver specific variant of
>>> +	 * ttm_bo_validate() from within this callback.
>>> +	 */
>>> +	int (*bo_validate)(struct drm_gem_object *obj);
>>>    };
>>>    int drm_gpuva_sm_map(struct drm_gpuva_manager *mgr, void *priv,
>>> @@ -696,11 +989,18 @@ int drm_gpuva_sm_unmap(struct drm_gpuva_manager *mgr, void *priv,
>>>    void drm_gpuva_map(struct drm_gpuva_manager *mgr,
>>>    		   struct drm_gpuva *va,
>>>    		   struct drm_gpuva_op_map *op);
>>> +void drm_gpuva_map_get(struct drm_gpuva_manager *mgr,
>>> +		       struct drm_gpuva *va,
>>> +		       struct drm_gpuva_op_map *op);
>>>    void drm_gpuva_remap(struct drm_gpuva *prev,
>>>    		     struct drm_gpuva *next,
>>>    		     struct drm_gpuva_op_remap *op);
>>> +void drm_gpuva_remap_get(struct drm_gpuva *prev,
>>> +			 struct drm_gpuva *next,
>>> +			 struct drm_gpuva_op_remap *op);
>>>    void drm_gpuva_unmap(struct drm_gpuva_op_unmap *op);
>>> +void drm_gpuva_unmap_put(struct drm_gpuva_op_unmap *op);
>>>    #endif /* __DRM_GPUVA_MGR_H__ */

^ permalink raw reply	[flat|nested] 88+ messages in thread

* Re: [Nouveau] [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
  2023-08-30 13:42         ` Thomas Hellström (Intel)
  (?)
@ 2023-08-30 15:00           ` Danilo Krummrich
  -1 siblings, 0 replies; 88+ messages in thread
From: Danilo Krummrich @ 2023-08-30 15:00 UTC (permalink / raw)
  To: Thomas Hellström (Intel)
  Cc: matthew.brost, thomas.hellstrom, sarah.walker, nouveau,
	dri-devel, linux-kernel, Liam.Howlett, boris.brezillon,
	donald.robson, daniel, christian.koenig, faith.ekstrand, bskeggs

On Wed, Aug 30, 2023 at 03:42:08PM +0200, Thomas Hellström (Intel) wrote:
> 
> On 8/30/23 14:49, Danilo Krummrich wrote:
> > Hi Thomas,
> > 
> > thanks for having a look!
> > 
> > On Wed, Aug 30, 2023 at 09:27:45AM +0200, Thomas Hellström (Intel) wrote:
> > > Hi, Danilo.
> > > 
> > > Some quick comments since I'm doing some Xe work in this area. Will probably
> > > get back with more.
> > > 
> > > On 8/20/23 23:53, Danilo Krummrich wrote:
> > > > So far the DRM GPUVA manager offers common infrastructure to track GPU VA
> > > > allocations and mappings, generically connect GPU VA mappings to their
> > > > backing buffers and perform more complex mapping operations on the GPU VA
> > > > space.
> > > > 
> > > > However, there are more design patterns commonly used by drivers, which
> > > > can potentially be generalized in order to make the DRM GPUVA manager
> > > > represent a basic GPU-VM implementation. In this context, this patch aims
> > > > at generalizing the following elements.
> > > > 
> > > > 1) Provide a common dma-resv for GEM objects not being used outside of
> > > >      this GPU-VM.
> > > > 
> > > > 2) Provide tracking of external GEM objects (GEM objects which are
> > > >      shared with other GPU-VMs).
> > > > 
> > > > 3) Provide functions to efficiently lock all GEM objects dma-resv the
> > > >      GPU-VM contains mappings of.
> > > > 
> > > > 4) Provide tracking of evicted GEM objects the GPU-VM contains mappings
> > > >      of, such that validation of evicted GEM objects is accelerated.
> > > > 
> > > > 5) Provide some convinience functions for common patterns.
> > > > 
> > > > Rather than being designed as a "framework", the target is to make all
> > > > features appear as a collection of optional helper functions, such that
> > > > drivers are free to make use of the DRM GPUVA managers basic
> > > > functionality and opt-in for other features without setting any feature
> > > > flags, just by making use of the corresponding functions.
> > > > 
> > > > Signed-off-by: Danilo Krummrich <dakr@redhat.com>
> > > > ---
> > > >    drivers/gpu/drm/drm_gpuva_mgr.c | 688 +++++++++++++++++++++++++++++++-
> > > >    include/drm/drm_gem.h           |  48 ++-
> > > >    include/drm/drm_gpuva_mgr.h     | 302 +++++++++++++-
> > > >    3 files changed, 1010 insertions(+), 28 deletions(-)
> > > > 
> > > > diff --git a/drivers/gpu/drm/drm_gpuva_mgr.c b/drivers/gpu/drm/drm_gpuva_mgr.c
> > > > index f86bfad74ff8..69872b205961 100644
> > > > --- a/drivers/gpu/drm/drm_gpuva_mgr.c
> > > > +++ b/drivers/gpu/drm/drm_gpuva_mgr.c
> > > > @@ -655,6 +655,7 @@ drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
> > > >    /**
> > > >     * drm_gpuva_manager_init() - initialize a &drm_gpuva_manager
> > > >     * @mgr: pointer to the &drm_gpuva_manager to initialize
> > > > + * @drm: the drivers &drm_device
> > > >     * @name: the name of the GPU VA space
> > > >     * @start_offset: the start offset of the GPU VA space
> > > >     * @range: the size of the GPU VA space
> > > > @@ -669,6 +670,7 @@ drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
> > > >     */
> > > >    void
> > > >    drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> > > > +		       struct drm_device *drm,
> > > >    		       const char *name,
> > > >    		       u64 start_offset, u64 range,
> > > >    		       u64 reserve_offset, u64 reserve_range,
> > > > @@ -677,6 +679,11 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> > > >    	mgr->rb.tree = RB_ROOT_CACHED;
> > > >    	INIT_LIST_HEAD(&mgr->rb.list);
> > > > +	mt_init(&mgr->mt_ext);
> > > > +
> > > > +	INIT_LIST_HEAD(&mgr->evict.list);
> > > > +	spin_lock_init(&mgr->evict.lock);
> > > > +
> > > >    	drm_gpuva_check_overflow(start_offset, range);
> > > >    	mgr->mm_start = start_offset;
> > > >    	mgr->mm_range = range;
> > > > @@ -694,6 +701,9 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> > > >    						     reserve_range)))
> > > >    			__drm_gpuva_insert(mgr, &mgr->kernel_alloc_node);
> > > >    	}
> > > > +
> > > > +	drm_gem_private_object_init(drm, &mgr->d_obj, 0);
> > > > +	mgr->resv = mgr->d_obj.resv;
> > > >    }
> > > >    EXPORT_SYMBOL_GPL(drm_gpuva_manager_init);
> > > > @@ -713,10 +723,575 @@ drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr)
> > > >    		__drm_gpuva_remove(&mgr->kernel_alloc_node);
> > > >    	WARN(!RB_EMPTY_ROOT(&mgr->rb.tree.rb_root),
> > > > -	     "GPUVA tree is not empty, potentially leaking memory.");
> > > > +	     "GPUVA tree is not empty, potentially leaking memory.\n");
> > > > +
> > > > +	mtree_destroy(&mgr->mt_ext);
> > > > +	WARN(!list_empty(&mgr->evict.list), "Evict list should be empty.\n");
> > > > +
> > > > +	drm_gem_private_object_fini(&mgr->d_obj);
> > > >    }
> > > >    EXPORT_SYMBOL_GPL(drm_gpuva_manager_destroy);
> > > > +/**
> > > > + * drm_gpuva_manager_prepare_objects() - prepare all assoiciated BOs
> > > > + * @mgr: the &drm_gpuva_manager
> > > > + * @num_fences: the amount of &dma_fences to reserve
> > > > + *
> > > > + * Calls drm_exec_prepare_obj() for all &drm_gem_objects the given
> > > > + * &drm_gpuva_manager contains mappings of.
> > > > + *
> > > > + * Drivers can obtain the corresponding &drm_exec instance through
> > > > + * DRM_GPUVA_EXEC(). It is the drivers responsibility to call drm_exec_init()
> > > > + * and drm_exec_fini() accordingly.
> > > > + *
> > > > + * Returns: 0 on success, negative error code on failure.
> > > > + */
> > > > +int
> > > > +drm_gpuva_manager_prepare_objects(struct drm_gpuva_manager *mgr,
> > > > +				  unsigned int num_fences)
> > > > +{
> > > > +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
> > > > +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
> > > > +	union {
> > > > +		void *ptr;
> > > > +		uintptr_t cnt;
> > > > +	} ref;
> > > > +	int ret;
> > > > +
> > > > +	ret = drm_exec_prepare_obj(exec, &mgr->d_obj, num_fences);
> > > > +	if (ret)
> > > > +		goto out;
> > > > +
> > > > +	rcu_read_lock();
> > > In xe we're protecting the external object list with an outer lock, (same as
> > > protecting the mgr itself). Do we need a separate lock for this? In theory
> > > as  outlined in the VM_BIND locking document draft, one could probably even
> > > use the mgr resv for this, but with more complicated code I guess. Also see
> > > the comment below about the data structure chosen.
> > The idea is to protect this list with the GPU-VM lock. The locking here is more
> > of an implication of the maple tree. Either you use the internal lock of the
> > maple tree or RCU respectively, or you give the maple tree an external lock to
> > perform lockdep checks on (mt_set_external_lock()). Basically same as here:
> > 
> > https://elixir.bootlin.com/linux/latest/source/drivers/base/regmap/regcache-maple.c#L124
> 
> Ah, I suspected it was something along those lines.
> 
> 
> > 
> > > > +	mas_for_each(&mas, ref.ptr, ULONG_MAX) {
> > > > +		struct drm_gem_object *obj;
> > > > +
> > > > +		mas_pause(&mas);
> > > > +		rcu_read_unlock();
> > > > +
> > > > +		obj = (struct drm_gem_object *)(uintptr_t)mas.index;
> > > > +		ret = drm_exec_prepare_obj(exec, obj, num_fences);
> > > > +		if (ret)
> > > > +			goto out;
> > > > +
> > > > +		rcu_read_lock();
> > > > +	}
> > > > +	rcu_read_unlock();
> > > > +
> > > > +out:
> > > > +	return ret;
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_manager_prepare_objects);
> > > > +
> > > > +/**
> > > > + * drm_gpuva_manager_lock_extra() - lock all dma-resv of all assoiciated BOs
> > > > + * @mgr: the &drm_gpuva_manager
> > > > + * @fn: callback received by the driver to lock additional dma-resv
> > > > + * @priv: private driver data passed to @fn
> > > > + * @num_fences: the amount of &dma_fences to reserve
> > > > + * @interruptible: sleep interruptible if waiting
> > > > + *
> > > > + * Acquires all dma-resv locks of all &drm_gem_objects the given
> > > > + * &drm_gpuva_manager contains mappings of.
> > > > + *
> > > > + * Addionally, when calling this function the driver receives the given @fn
> > > > + * callback to lock additional dma-resv in the context of the
> > > > + * &drm_gpuva_managers &drm_exec instance. Typically, drivers would call
> > > > + * drm_exec_prepare_obj() from within this callback.
> > > > + *
> > > > + * Returns: 0 on success, negative error code on failure.
> > > > + */
> > > > +int
> > > > +drm_gpuva_manager_lock_extra(struct drm_gpuva_manager *mgr,
> > > > +			     int (*fn)(struct drm_gpuva_manager *mgr,
> > > > +				       void *priv, unsigned int num_fences),
> > > > +			     void *priv,
> > > > +			     unsigned int num_fences,
> > > > +			     bool interruptible)
> > > > +{
> > > > +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
> > > > +	uint32_t flags;
> > > > +	int ret;
> > > > +
> > > > +	flags = interruptible ? DRM_EXEC_INTERRUPTIBLE_WAIT : 0 |
> > > > +		DRM_EXEC_IGNORE_DUPLICATES;
> > > > +
> > > > +	drm_exec_init(exec, flags);
> > > > +
> > > > +	drm_exec_until_all_locked(exec) {
> > > > +		ret = drm_gpuva_manager_prepare_objects(mgr, num_fences);
> > > > +		drm_exec_retry_on_contention(exec);
> > > > +		if (ret)
> > > > +			goto err;
> > > > +
> > > > +		if (fn) {
> > > > +			ret = fn(mgr, priv, num_fences);
> > > > +			drm_exec_retry_on_contention(exec);
> > > > +			if (ret)
> > > > +				goto err;
> > > > +		}
> > > > +	}
> > > > +
> > > > +	return 0;
> > > > +
> > > > +err:
> > > > +	drm_exec_fini(exec);
> > > > +	return ret;
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_manager_lock_extra);
> > > > +
> > > > +static int
> > > > +fn_lock_array(struct drm_gpuva_manager *mgr, void *priv,
> > > > +				unsigned int num_fences)
> > > > +{
> > > > +	struct {
> > > > +		struct drm_gem_object **objs;
> > > > +		unsigned int num_objs;
> > > > +	} *args = priv;
> > > > +
> > > > +	return drm_exec_prepare_array(DRM_GPUVA_EXEC(mgr), args->objs,
> > > > +				      args->num_objs, num_fences);
> > > > +}
> > > > +
> > > > +/**
> > > > + * drm_gpuva_manager_lock_array() - lock all dma-resv of all assoiciated BOs
> > > > + * @mgr: the &drm_gpuva_manager
> > > > + * @objs: additional &drm_gem_objects to lock
> > > > + * @num_objs: the number of additional &drm_gem_objects to lock
> > > > + * @num_fences: the amount of &dma_fences to reserve
> > > > + * @interruptible: sleep interruptible if waiting
> > > > + *
> > > > + * Acquires all dma-resv locks of all &drm_gem_objects the given
> > > > + * &drm_gpuva_manager contains mappings of, plus the ones given through @objs.
> > > > + *
> > > > + * Returns: 0 on success, negative error code on failure.
> > > > + */
> > > > +int
> > > > +drm_gpuva_manager_lock_array(struct drm_gpuva_manager *mgr,
> > > > +			     struct drm_gem_object **objs,
> > > > +			     unsigned int num_objs,
> > > > +			     unsigned int num_fences,
> > > > +			     bool interruptible)
> > > > +{
> > > > +	struct {
> > > > +		struct drm_gem_object **objs;
> > > > +		unsigned int num_objs;
> > > > +	} args;
> > > > +
> > > > +	args.objs = objs;
> > > > +	args.num_objs = num_objs;
> > > > +
> > > > +	return drm_gpuva_manager_lock_extra(mgr, fn_lock_array, &args,
> > > > +					    num_fences, interruptible);
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_manager_lock_array);
> > > > +
> > > > +/**
> > > > + * drm_gpuva_manager_validate() - validate all BOs marked as evicted
> > > > + * @mgr: the &drm_gpuva_manager to validate evicted BOs
> > > > + *
> > > > + * Calls the &drm_gpuva_fn_ops.bo_validate callback for all evicted buffer
> > > > + * objects being mapped in the given &drm_gpuva_manager.
> > > > + *
> > > > + * Returns: 0 on success, negative error code on failure.
> > > > + */
> > > > +int
> > > > +drm_gpuva_manager_validate(struct drm_gpuva_manager *mgr)
> > > > +{
> > > > +	const struct drm_gpuva_fn_ops *ops = mgr->ops;
> > > > +	struct drm_gpuva_gem *vm_bo;
> > > > +	int ret;
> > > > +
> > > > +	if (unlikely(!ops || !ops->bo_validate))
> > > > +		return -ENOTSUPP;
> > > > +
> > > > +	/* At this point we should hold all dma-resv locks of all GEM objects
> > > > +	 * associated with this GPU-VM, hence it is safe to walk the list.
> > > > +	 */
> > > > +	list_for_each_entry(vm_bo, &mgr->evict.list, list.entry.evict) {
> > > > +		dma_resv_assert_held(vm_bo->obj->resv);
> > > > +
> > > > +		ret = ops->bo_validate(vm_bo->obj);
> > > > +		if (ret)
> > > > +			return ret;
> > > > +	}
> > > > +
> > > > +	return 0;
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_manager_validate);
> > > > +
> > > > +/**
> > > > + * drm_gpuva_manager_resv_add_fence - add fence to private and all extobj
> > > > + * dma-resv
> > > > + * @mgr: the &drm_gpuva_manager to add a fence to
> > > > + * @fence: fence to add
> > > > + * @private_usage: private dma-resv usage
> > > > + * @extobj_usage: extobj dma-resv usage
> > > > + */
> > > > +void
> > > > +drm_gpuva_manager_resv_add_fence(struct drm_gpuva_manager *mgr,
> > > > +				 struct dma_fence *fence,
> > > > +				 enum dma_resv_usage private_usage,
> > > > +				 enum dma_resv_usage extobj_usage)
> > > > +{
> > > > +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
> > > > +	struct drm_gem_object *obj;
> > > > +	unsigned long index;
> > > > +
> > > > +	drm_exec_for_each_locked_object(exec, index, obj) {
> > > > +			dma_resv_assert_held(obj->resv);
> > > > +			dma_resv_add_fence(obj->resv, fence,
> > > > +					   drm_gpuva_is_extobj(mgr, obj) ?
> > > > +					   private_usage : extobj_usage);
> > > > +	}
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_manager_resv_add_fence);
> > > > +
> > > > +static struct drm_gpuva_gem *
> > > > +__drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
> > > > +		     struct drm_gem_object *obj)
> > > > +{
> > > > +	struct drm_gpuva_gem *vm_bo;
> > > > +
> > > > +	drm_gem_gpuva_assert_lock_held(obj);
> > > > +
> > > > +	drm_gem_for_each_gpuva_gem(vm_bo, obj)
> > > > +		if (vm_bo->mgr == mgr)
> > > > +			return vm_bo;
> > > > +
> > > > +	return NULL;
> > > > +}
> > > > +
> > > > +/**
> > > > + * drm_gpuva_gem_create() - create a new instance of struct drm_gpuva_gem
> > > > + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> > > > + * @obj: The &drm_gem_object being mapped in the @mgr.
> > > > + *
> > > > + * If provided by the driver, this function uses the &drm_gpuva_fn_ops
> > > > + * vm_bo_alloc() callback to allocate.
> > > > + *
> > > > + * Returns: a pointer to the &drm_gpuva_gem on success, NULL on failure
> > > > + */
> > > > +struct drm_gpuva_gem *
> > > > +drm_gpuva_gem_create(struct drm_gpuva_manager *mgr,
> > > > +		     struct drm_gem_object *obj)
> > > > +{
> > > > +	const struct drm_gpuva_fn_ops *ops = mgr->ops;
> > > > +	struct drm_gpuva_gem *vm_bo;
> > > > +
> > > > +	if (ops && ops->vm_bo_alloc)
> > > > +		vm_bo = ops->vm_bo_alloc();
> > > > +	else
> > > > +		vm_bo = kzalloc(sizeof(*vm_bo), GFP_KERNEL);
> > > > +
> > > > +	if (unlikely(!vm_bo))
> > > > +		return NULL;
> > > > +
> > > > +	vm_bo->mgr = mgr;
> > > > +	vm_bo->obj = obj;
> > > > +
> > > > +	kref_init(&vm_bo->kref);
> > > > +	INIT_LIST_HEAD(&vm_bo->list.gpuva);
> > > > +	INIT_LIST_HEAD(&vm_bo->list.entry.gem);
> > > > +	INIT_LIST_HEAD(&vm_bo->list.entry.evict);
> > > > +
> > > > +	drm_gem_object_get(obj);
> > > > +
> > > > +	return vm_bo;
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_create);
> > > > +
> > > > +void
> > > > +drm_gpuva_gem_destroy(struct kref *kref)
> > > > +{
> > > > +	struct drm_gpuva_gem *vm_bo = container_of(kref, struct drm_gpuva_gem,
> > > > +						   kref);
> > > > +	const struct drm_gpuva_fn_ops *ops = vm_bo->mgr->ops;
> > > > +
> > > > +	drm_gem_object_put(vm_bo->obj);
> > > > +
> > > > +	if (ops && ops->vm_bo_free)
> > > > +		ops->vm_bo_free(vm_bo);
> > > > +	else
> > > > +		kfree(vm_bo);
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_destroy);
> > > > +
> > > > +/**
> > > > + * drm_gpuva_gem_find() - find the &drm_gpuva_gem for the given
> > > > + * &drm_gpuva_manager and &drm_gem_object
> > > > + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> > > > + * @obj: The &drm_gem_object being mapped in the @mgr.
> > > > + *
> > > > + * Find the &drm_gpuva_gem representing the combination of the given
> > > > + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
> > > > + * count of the &drm_gpuva_gem accordingly.
> > > > + *
> > > > + * Returns: a pointer to the &drm_gpuva_gem on success, NULL on failure
> > > > + */
> > > > +struct drm_gpuva_gem *
> > > > +drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
> > > > +		   struct drm_gem_object *obj)
> > > > +{
> > > > +	struct drm_gpuva_gem *vm_bo = __drm_gpuva_gem_find(mgr, obj);
> > > > +
> > > > +	return vm_bo ? drm_gpuva_gem_get(vm_bo) : NULL;
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_find);
> > > > +
> > > > +/**
> > > > + * drm_gpuva_gem_obtain() - obtains and instance of the &drm_gpuva_gem for the
> > > > + * given &drm_gpuva_manager and &drm_gem_object
> > > > + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> > > > + * @obj: The &drm_gem_object being mapped in the @mgr.
> > > > + *
> > > > + * Find the &drm_gpuva_gem representing the combination of the given
> > > > + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
> > > > + * count of the &drm_gpuva_gem accordingly. If not found, allsocates a new
> > > > + * &drm_gpuva_gem.
> > > > + *
> > > > + * Returns: a pointer to the &drm_gpuva_gem on success, an ERR_PTR on failure
> > > > + */
> > > > +struct drm_gpuva_gem *
> > > > +drm_gpuva_gem_obtain(struct drm_gpuva_manager *mgr,
> > > > +		     struct drm_gem_object *obj)
> > > > +{
> > > > +	struct drm_gpuva_gem *vm_bo;
> > > > +
> > > > +	vm_bo = drm_gpuva_gem_find(mgr, obj);
> > > > +	if (vm_bo)
> > > > +		return vm_bo;
> > > > +
> > > > +	vm_bo = drm_gpuva_gem_create(mgr, obj);
> > > > +	if (!vm_bo)
> > > > +		return ERR_PTR(-ENOMEM);
> > > > +
> > > > +	return vm_bo;
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_obtain);
> > > > +
> > > > +/**
> > > > + * drm_gpuva_gem_obtain_prealloc() - obtains and instance of the &drm_gpuva_gem
> > > > + * for the given &drm_gpuva_manager and &drm_gem_object
> > > > + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> > > > + * @obj: The &drm_gem_object being mapped in the @mgr.
> > > > + *
> > > > + * Find the &drm_gpuva_gem representing the combination of the given
> > > > + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
> > > > + * count of the found &drm_gpuva_gem accordingly, while the @__vm_bo reference
> > > > + * count is decreased. If not found @__vm_bo is returned.
> > > > + *
> > > > + * Returns: a pointer to the found &drm_gpuva_gem or @__vm_bo if no existing
> > > > + * &drm_gpuva_gem was found
> > > > + */
> > > > +struct drm_gpuva_gem *
> > > > +drm_gpuva_gem_obtain_prealloc(struct drm_gpuva_manager *mgr,
> > > > +			      struct drm_gem_object *obj,
> > > > +			      struct drm_gpuva_gem *__vm_bo)
> > > > +{
> > > > +	struct drm_gpuva_gem *vm_bo;
> > > > +
> > > > +	vm_bo = drm_gpuva_gem_find(mgr, obj);
> > > > +	if (vm_bo) {
> > > > +		drm_gpuva_gem_put(__vm_bo);
> > > > +		return vm_bo;
> > > > +	}
> > > > +
> > > > +	return __vm_bo;
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_obtain_prealloc);
> > > > +
> > > > +static int
> > > > +__drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
> > > > +			  struct drm_gem_object *obj,
> > > > +			  gfp_t gfp)
> > > > +{
> > > > +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
> > > > +	union {
> > > > +		struct drm_gem_object *obj;
> > > > +		uintptr_t index;
> > > > +	} gem;
> > > > +	union {
> > > > +		void *ptr;
> > > > +		uintptr_t cnt;
> > > > +	} ref;
> > > > +	int ret = 0;
> > > > +
> > > > +	gem.obj = obj;
> > > > +	mas_set(&mas, gem.index);
> > > > +
> > > > +	mas_lock(&mas);
> > > > +	ref.ptr = mas_walk(&mas);
> > > > +	if (ref.ptr) {
> > > > +		++ref.cnt;
> > > > +		mas_store(&mas, ref.ptr);
> > > > +	} else {
> > > > +		if (unlikely(!gfp)) {
> > > > +			ret = -EINVAL;
> > > > +			goto out;
> > > > +		}
> > > > +
> > > > +		mas_set(&mas, gem.index);
> > > > +		ref.cnt = 1;
> > > > +		ret = mas_store_gfp(&mas, ref.ptr, gfp);
> > > > +		if (likely(!ret))
> > > > +			drm_gem_object_get(obj);
> > > > +	}
> > > > +out:
> > > > +	mas_unlock(&mas);
> > > > +	return ret;
> > > > +}
> > > > +
> > > > +static void
> > > > +__drm_gpuva_extobj_remove(struct drm_gpuva_manager *mgr,
> > > > +			  struct drm_gem_object *obj)
> > > > +{
> > > > +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
> > > > +	union {
> > > > +		struct drm_gem_object *obj;
> > > > +		uintptr_t index;
> > > > +	} gem;
> > > > +	union {
> > > > +		void *ptr;
> > > > +		uintptr_t cnt;
> > > > +	} ref;
> > > > +
> > > > +	gem.obj = obj;
> > > > +	mas_set(&mas, gem.index);
> > > > +
> > > > +	mas_lock(&mas);
> > > > +	if (unlikely(!(ref.ptr = mas_walk(&mas))))
> > > > +		goto out;
> > > > +
> > > > +	if (!--ref.cnt) {
> > > > +		mas_erase(&mas);
> > > > +		drm_gem_object_put(obj);
> > > > +	} else {
> > > > +		mas_store(&mas, ref.ptr);
> > > > +	}
> > > > +out:
> > > > +	mas_unlock(&mas);
> > > > +}
> > > > +
> > > > +/**
> > > > + * drm_gpuva_extobj_insert - insert an external &drm_gem_object
> > > > + * @mgr: the &drm_gpuva_manager to insert into
> > > > + * @obj: the &drm_gem_object to insert as extobj
> > > > + *
> > > > + * Insert a &drm_gem_object into the &drm_gpuva_managers external object tree.
> > > > + * If the &drm_gem_object already exists in the tree, the reference counter
> > > > + * of this external object is increased by one.
> > > > + *
> > > > + * Drivers should insert the external &drm_gem_object before the dma-fence
> > > > + * signalling critical section, e.g. when submitting the job, and before
> > > > + * locking all &drm_gem_objects of a GPU-VM, e.g. with drm_gpuva_manager_lock()
> > > > + * or its dervates.
> > > > + *
> > > > + * Returns: 0 on success, negative error code on failure.
> > > > + */
> > > > +int
> > > > +drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
> > > > +			struct drm_gem_object *obj)
> > > > +{
> > > > +	return drm_gpuva_is_extobj(mgr, obj) ?
> > > > +		__drm_gpuva_extobj_insert(mgr, obj, GFP_KERNEL) : 0;
> > > > +
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_insert);
> > > > +
> > > > +/**
> > > > + * drm_gpuva_extobj_get - increase the referecne count of an external
> > > > + * &drm_gem_object
> > > > + * @mgr: the &drm_gpuva_manager storing the extobj
> > > > + * @obj: the &drm_gem_object to representing the extobj
> > > > + *
> > > > + * Increases the reference count of the extobj represented by @obj.
> > > > + *
> > > > + * Drivers should call this for every &drm_gpuva backed by a &drm_gem_object
> > > > + * being inserted.
> > > > + *
> > > > + * For &drm_gpuva_op_remap operations drivers should make sure to only take an
> > > > + * additional reference if the re-map operation splits an existing &drm_gpuva
> > > > + * into two separate ones.
> > > > + *
> > > > + * See also drm_gpuva_map_get() and drm_gpuva_remap_get().
> > > > + *
> > > > + * Returns: 0 on success, negative error code on failure.
> > > > + */
> > > > +void
> > > > +drm_gpuva_extobj_get(struct drm_gpuva_manager *mgr,
> > > > +		     struct drm_gem_object *obj)
> > > > +{
> > > > +	if (drm_gpuva_is_extobj(mgr, obj))
> > > > +		WARN(__drm_gpuva_extobj_insert(mgr, obj, 0),
> > > > +		     "Can't increase ref-count of non-existent extobj.");
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_get);
> > > > +
> > > > +/**
> > > > + * drm_gpuva_extobj_put - decrease the referecne count of an external
> > > > + * &drm_gem_object
> > > > + * @mgr: the &drm_gpuva_manager storing the extobj
> > > > + * @obj: the &drm_gem_object to representing the extobj
> > > > + *
> > > > + * Decreases the reference count of the extobj represented by @obj.
> > > > + *
> > > > + * Drivers should call this for every &drm_gpuva backed by a &drm_gem_object
> > > > + * being removed from the GPU VA space.
> > > > + *
> > > > + * See also drm_gpuva_unmap_put().
> > > > + *
> > > > + * Returns: 0 on success, negative error code on failure.
> > > > + */
> > > > +void
> > > > +drm_gpuva_extobj_put(struct drm_gpuva_manager *mgr,
> > > > +		     struct drm_gem_object *obj)
> > > > +{
> > > > +	if (drm_gpuva_is_extobj(mgr, obj))
> > > > +		__drm_gpuva_extobj_remove(mgr, obj);
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_put);
> > > > +
> > > > +/**
> > > > + * drm_gpuva_gem_evict() - add / remove a &drm_gem_object to / from a
> > > > + * &drm_gpuva_managers evicted list
> > > > + * @obj: the &drm_gem_object to add or remove
> > > > + * @evict: indicates whether the object is evicted
> > > > + *
> > > > + * Adds a &drm_gem_object to or removes it from all &drm_gpuva_managers evicted
> > > > + * list containing a mapping of this &drm_gem_object.
> > > > + */
> > > > +void
> > > > +drm_gpuva_gem_evict(struct drm_gem_object *obj, bool evict)
> > > > +{
> > > > +	struct drm_gpuva_gem *vm_bo;
> > > > +
> > > > +	/* Required for iterating the GEMs GPUVA GEM list. If no driver specific
> > > > +	 * lock has been set, the list is protected with the GEMs dma-resv lock.
> > > > +	 */
> > > > +	drm_gem_gpuva_assert_lock_held(obj);
> > > > +
> > > > +	/* Required to protect the GPUVA managers evict list against concurrent
> > > > +	 * access through drm_gpuva_manager_validate(). Concurrent insertions to
> > > > +	 * the evict list through different GEM object evictions are protected
> > > > +	 * by the GPUVA managers evict lock.
> > > > +	 */
> > > > +	dma_resv_assert_held(obj->resv);
> > > > +
> > > > +	drm_gem_for_each_gpuva_gem(vm_bo, obj) {
> > > > +		struct drm_gpuva_manager *mgr = vm_bo->mgr;
> > > > +
> > > > +		spin_lock(&mgr->evict.lock);
> > > > +		if (evict)
> > > > +			list_add_tail(&vm_bo->list.entry.evict,
> > > > +				      &mgr->evict.list);
> > > > +		else
> > > > +			list_del_init(&vm_bo->list.entry.evict);
> > > > +		spin_unlock(&mgr->evict.lock);
> > > > +	}
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_evict);
> > > > +
> > > >    static int
> > > >    __drm_gpuva_insert(struct drm_gpuva_manager *mgr,
> > > >    		   struct drm_gpuva *va)
> > > > @@ -806,15 +1381,20 @@ EXPORT_SYMBOL_GPL(drm_gpuva_remove);
> > > >    /**
> > > >     * drm_gpuva_link() - link a &drm_gpuva
> > > >     * @va: the &drm_gpuva to link
> > > > + * @vm_bo: the &drm_gpuva_gem to add the &drm_gpuva to
> > > >     *
> > > > - * This adds the given &va to the GPU VA list of the &drm_gem_object it is
> > > > - * associated with.
> > > > + * This adds the given &va to the GPU VA list of the &drm_gpuva_gem and the
> > > > + * &drm_gpuva_gem to the &drm_gem_object it is associated with.
> > > > + *
> > > > + * For every &drm_gpuva entry added to the &drm_gpuva_gem an additional
> > > > + * reference of the latter is taken.
> > > >     *
> > > >     * This function expects the caller to protect the GEM's GPUVA list against
> > > > - * concurrent access using the GEMs dma_resv lock.
> > > > + * concurrent access using either the GEMs dma_resv lock or a driver specific
> > > > + * lock set through drm_gem_gpuva_set_lock().
> > > >     */
> > > >    void
> > > > -drm_gpuva_link(struct drm_gpuva *va)
> > > > +drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo)
> > > >    {
> > > >    	struct drm_gem_object *obj = va->gem.obj;
> > > > @@ -823,7 +1403,10 @@ drm_gpuva_link(struct drm_gpuva *va)
> > > >    	drm_gem_gpuva_assert_lock_held(obj);
> > > > -	list_add_tail(&va->gem.entry, &obj->gpuva.list);
> > > > +	drm_gpuva_gem_get(vm_bo);
> > > > +	list_add_tail(&va->gem.entry, &vm_bo->list.gpuva);
> > > > +	if (list_empty(&vm_bo->list.entry.gem))
> > > > +		list_add_tail(&vm_bo->list.entry.gem, &obj->gpuva.list);
> > > >    }
> > > >    EXPORT_SYMBOL_GPL(drm_gpuva_link);
> > > > @@ -834,20 +1417,39 @@ EXPORT_SYMBOL_GPL(drm_gpuva_link);
> > > >     * This removes the given &va from the GPU VA list of the &drm_gem_object it is
> > > >     * associated with.
> > > >     *
> > > > + * This removes the given &va from the GPU VA list of the &drm_gpuva_gem and
> > > > + * the &drm_gpuva_gem from the &drm_gem_object it is associated with in case
> > > > + * this call unlinks the last &drm_gpuva from the &drm_gpuva_gem.
> > > > + *
> > > > + * For every &drm_gpuva entry removed from the &drm_gpuva_gem a reference of
> > > > + * the latter is dropped.
> > > > + *
> > > >     * This function expects the caller to protect the GEM's GPUVA list against
> > > > - * concurrent access using the GEMs dma_resv lock.
> > > > + * concurrent access using either the GEMs dma_resv lock or a driver specific
> > > > + * lock set through drm_gem_gpuva_set_lock().
> > > >     */
> > > >    void
> > > >    drm_gpuva_unlink(struct drm_gpuva *va)
> > > >    {
> > > >    	struct drm_gem_object *obj = va->gem.obj;
> > > > +	struct drm_gpuva_gem *vm_bo;
> > > >    	if (unlikely(!obj))
> > > >    		return;
> > > >    	drm_gem_gpuva_assert_lock_held(obj);
> > > > +	vm_bo = __drm_gpuva_gem_find(va->mgr, obj);
> > > > +	if (WARN(!vm_bo, "GPUVA doesn't seem to be linked.\n"))
> > > > +		return;
> > > > +
> > > >    	list_del_init(&va->gem.entry);
> > > > +
> > > > +	if (list_empty(&vm_bo->list.gpuva)) {
> > > > +		list_del_init(&vm_bo->list.entry.gem);
> > > > +		list_del_init(&vm_bo->list.entry.evict);
> > > > +	}
> > > > +	drm_gpuva_gem_put(vm_bo);
> > > >    }
> > > >    EXPORT_SYMBOL_GPL(drm_gpuva_unlink);
> > > > @@ -977,6 +1579,26 @@ drm_gpuva_map(struct drm_gpuva_manager *mgr,
> > > >    }
> > > >    EXPORT_SYMBOL_GPL(drm_gpuva_map);
> > > > +/**
> > > > + * drm_gpuva_map_get() - helper to insert a &drm_gpuva according to a
> > > > + * &drm_gpuva_op_map
> > > > + * @mgr: the &drm_gpuva_manager
> > > > + * @va: the &drm_gpuva to insert
> > > > + * @op: the &drm_gpuva_op_map to initialize @va with
> > > > + *
> > > > + * Initializes the @va from the @op and inserts it into the given @mgr and
> > > > + * increases the reference count of the corresponding extobj.
> > > > + */
> > > > +void
> > > > +drm_gpuva_map_get(struct drm_gpuva_manager *mgr,
> > > > +		  struct drm_gpuva *va,
> > > > +		  struct drm_gpuva_op_map *op)
> > > > +{
> > > > +	drm_gpuva_map(mgr, va, op);
> > > > +	drm_gpuva_extobj_get(mgr, va->gem.obj);
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_map_get);
> > > > +
> > > >    /**
> > > >     * drm_gpuva_remap() - helper to remap a &drm_gpuva according to a
> > > >     * &drm_gpuva_op_remap
> > > > @@ -992,10 +1614,10 @@ drm_gpuva_remap(struct drm_gpuva *prev,
> > > >    		struct drm_gpuva *next,
> > > >    		struct drm_gpuva_op_remap *op)
> > > >    {
> > > > -	struct drm_gpuva *curr = op->unmap->va;
> > > > -	struct drm_gpuva_manager *mgr = curr->mgr;
> > > > +	struct drm_gpuva *va = op->unmap->va;
> > > > +	struct drm_gpuva_manager *mgr = va->mgr;
> > > > -	drm_gpuva_remove(curr);
> > > > +	drm_gpuva_remove(va);
> > > >    	if (op->prev) {
> > > >    		drm_gpuva_init_from_op(prev, op->prev);
> > > > @@ -1009,6 +1631,31 @@ drm_gpuva_remap(struct drm_gpuva *prev,
> > > >    }
> > > >    EXPORT_SYMBOL_GPL(drm_gpuva_remap);
> > > > +/**
> > > > + * drm_gpuva_remap_get() - helper to remap a &drm_gpuva according to a
> > > > + * &drm_gpuva_op_remap
> > > > + * @prev: the &drm_gpuva to remap when keeping the start of a mapping
> > > > + * @next: the &drm_gpuva to remap when keeping the end of a mapping
> > > > + * @op: the &drm_gpuva_op_remap to initialize @prev and @next with
> > > > + *
> > > > + * Removes the currently mapped &drm_gpuva and remaps it using @prev and/or
> > > > + * @next. Additionally, if the re-map splits the existing &drm_gpuva into two
> > > > + * separate mappings, increases the reference count of the corresponding extobj.
> > > > + */
> > > > +void
> > > > +drm_gpuva_remap_get(struct drm_gpuva *prev,
> > > > +		    struct drm_gpuva *next,
> > > > +		    struct drm_gpuva_op_remap *op)
> > > > +{
> > > > +	struct drm_gpuva *va = op->unmap->va;
> > > > +	struct drm_gpuva_manager *mgr = va->mgr;
> > > > +
> > > > +	drm_gpuva_remap(prev, next, op);
> > > > +	if (op->prev && op->next)
> > > > +		drm_gpuva_extobj_get(mgr, va->gem.obj);
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_remap_get);
> > > > +
> > > >    /**
> > > >     * drm_gpuva_unmap() - helper to remove a &drm_gpuva according to a
> > > >     * &drm_gpuva_op_unmap
> > > > @@ -1023,6 +1670,24 @@ drm_gpuva_unmap(struct drm_gpuva_op_unmap *op)
> > > >    }
> > > >    EXPORT_SYMBOL_GPL(drm_gpuva_unmap);
> > > > +/**
> > > > + * drm_gpuva_unmap_put() - helper to remove a &drm_gpuva according to a
> > > > + * &drm_gpuva_op_unmap
> > > > + * @op: the &drm_gpuva_op_unmap specifying the &drm_gpuva to remove
> > > > + *
> > > > + * Removes the &drm_gpuva associated with the &drm_gpuva_op_unmap and decreases
> > > > + * the reference count of the corresponding extobj.
> > > > + */
> > > > +void
> > > > +drm_gpuva_unmap_put(struct drm_gpuva_op_unmap *op)
> > > > +{
> > > > +	struct drm_gpuva *va = op->va;
> > > > +
> > > > +	drm_gpuva_unmap(op);
> > > > +	drm_gpuva_extobj_put(va->mgr, va->gem.obj);
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_unmap_put);
> > > > +
> > > >    static int
> > > >    op_map_cb(const struct drm_gpuva_fn_ops *fn, void *priv,
> > > >    	  u64 addr, u64 range,
> > > > @@ -1663,6 +2328,7 @@ drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
> > > >    {
> > > >    	struct drm_gpuva_ops *ops;
> > > >    	struct drm_gpuva_op *op;
> > > > +	struct drm_gpuva_gem *vm_bo;
> > > >    	struct drm_gpuva *va;
> > > >    	int ret;
> > > > @@ -1674,7 +2340,7 @@ drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
> > > >    	INIT_LIST_HEAD(&ops->list);
> > > > -	drm_gem_for_each_gpuva(va, obj) {
> > > > +	drm_gem_for_each_gpuva(va, vm_bo, mgr, obj) {
> > > >    		op = gpuva_op_alloc(mgr);
> > > >    		if (!op) {
> > > >    			ret = -ENOMEM;
> > > > diff --git a/include/drm/drm_gem.h b/include/drm/drm_gem.h
> > > > index bc9f6aa2f3fe..783ed3ab440d 100644
> > > > --- a/include/drm/drm_gem.h
> > > > +++ b/include/drm/drm_gem.h
> > > > @@ -571,7 +571,7 @@ int drm_gem_evict(struct drm_gem_object *obj);
> > > >     * drm_gem_gpuva_init() - initialize the gpuva list of a GEM object
> > > >     * @obj: the &drm_gem_object
> > > >     *
> > > > - * This initializes the &drm_gem_object's &drm_gpuva list.
> > > > + * This initializes the &drm_gem_object's &drm_gpuva_gem list.
> > > >     *
> > > >     * Calling this function is only necessary for drivers intending to support the
> > > >     * &drm_driver_feature DRIVER_GEM_GPUVA.
> > > > @@ -584,28 +584,44 @@ static inline void drm_gem_gpuva_init(struct drm_gem_object *obj)
> > > >    }
> > > >    /**
> > > > - * drm_gem_for_each_gpuva() - iternator to walk over a list of gpuvas
> > > > - * @entry__: &drm_gpuva structure to assign to in each iteration step
> > > > - * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
> > > > + * drm_gem_for_each_gpuva_gem() - iterator to walk over a list of &drm_gpuva_gem
> > > > + * @entry__: &drm_gpuva_gem structure to assign to in each iteration step
> > > > + * @obj__: the &drm_gem_object the &drm_gpuva_gem to walk are associated with
> > > >     *
> > > > - * This iterator walks over all &drm_gpuva structures associated with the
> > > > - * &drm_gpuva_manager.
> > > > + * This iterator walks over all &drm_gpuva_gem structures associated with the
> > > > + * &drm_gem_object.
> > > >     */
> > > > -#define drm_gem_for_each_gpuva(entry__, obj__) \
> > > > -	list_for_each_entry(entry__, &(obj__)->gpuva.list, gem.entry)
> > > > +#define drm_gem_for_each_gpuva_gem(entry__, obj__) \
> > > > +	list_for_each_entry(entry__, &(obj__)->gpuva.list, list.entry.gem)
> > > >    /**
> > > > - * drm_gem_for_each_gpuva_safe() - iternator to safely walk over a list of
> > > > - * gpuvas
> > > > - * @entry__: &drm_gpuva structure to assign to in each iteration step
> > > > - * @next__: &next &drm_gpuva to store the next step
> > > > - * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
> > > > + * drm_gem_for_each_gpuva_gem_safe() - iterator to safely walk over a list of
> > > > + * &drm_gpuva_gem
> > > > + * @entry__: &drm_gpuva_gemstructure to assign to in each iteration step
> > > > + * @next__: &next &drm_gpuva_gem to store the next step
> > > > + * @obj__: the &drm_gem_object the &drm_gpuva_gem to walk are associated with
> > > >     *
> > > > - * This iterator walks over all &drm_gpuva structures associated with the
> > > > + * This iterator walks over all &drm_gpuva_gem structures associated with the
> > > >     * &drm_gem_object. It is implemented with list_for_each_entry_safe(), hence
> > > >     * it is save against removal of elements.
> > > >     */
> > > > -#define drm_gem_for_each_gpuva_safe(entry__, next__, obj__) \
> > > > -	list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list, gem.entry)
> > > > +#define drm_gem_for_each_gpuva_gem_safe(entry__, next__, obj__) \
> > > > +	list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list, list.entry.gem)
> > > > +
> > > > +/**
> > > > + * drm_gem_for_each_gpuva() - iterator to walk over a list of &drm_gpuva
> > > > + * @va__: &drm_gpuva structure to assign to in each iteration step
> > > > + * @vm_bo__: the &drm_gpuva_gem representing the @mgr__ and @obj__ combination
> > > > + * @mgr__: the &drm_gpuva_manager the &drm_gpuvas to walk are associated with
> > > > + * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
> > > > + *
> > > > + * This iterator walks over all &drm_gpuva structures associated with the
> > > > + * &drm_gpuva_manager and &drm_gem_object.
> > > > + */
> > > > +#define drm_gem_for_each_gpuva(va__, vm_bo__, mgr__, obj__) \
> > > > +	for (vm_bo__ = drm_gpuva_gem_find(mgr__, obj__), \
> > > > +	     va__ = vm_bo__ ? list_first_entry(&vm_bo__->list.gpuva, typeof(*va__), gem.entry) : NULL; \
> > > > +	     va__ && !list_entry_is_head(va__, &vm_bo__->list.gpuva, gem.entry); \
> > > > +	     va__ = list_next_entry(va__, gem.entry))
> > > >    #endif /* __DRM_GEM_H__ */
> > > > diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h
> > > > index ed8d50200cc3..693e2da3f425 100644
> > > > --- a/include/drm/drm_gpuva_mgr.h
> > > > +++ b/include/drm/drm_gpuva_mgr.h
> > > > @@ -26,12 +26,16 @@
> > > >     */
> > > >    #include <linux/list.h>
> > > > +#include <linux/dma-resv.h>
> > > > +#include <linux/maple_tree.h>
> > > >    #include <linux/rbtree.h>
> > > >    #include <linux/types.h>
> > > >    #include <drm/drm_gem.h>
> > > > +#include <drm/drm_exec.h>
> > > >    struct drm_gpuva_manager;
> > > > +struct drm_gpuva_gem;
> > > >    struct drm_gpuva_fn_ops;
> > > >    /**
> > > > @@ -140,7 +144,7 @@ struct drm_gpuva {
> > > >    int drm_gpuva_insert(struct drm_gpuva_manager *mgr, struct drm_gpuva *va);
> > > >    void drm_gpuva_remove(struct drm_gpuva *va);
> > > > -void drm_gpuva_link(struct drm_gpuva *va);
> > > > +void drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo);
> > > >    void drm_gpuva_unlink(struct drm_gpuva *va);
> > > >    struct drm_gpuva *drm_gpuva_find(struct drm_gpuva_manager *mgr,
> > > > @@ -240,15 +244,137 @@ struct drm_gpuva_manager {
> > > >    	 * @ops: &drm_gpuva_fn_ops providing the split/merge steps to drivers
> > > >    	 */
> > > >    	const struct drm_gpuva_fn_ops *ops;
> > > > +
> > > > +	/**
> > > > +	 * @d_obj: Dummy GEM object; used internally to pass the GPU VMs
> > > > +	 * dma-resv to &drm_exec.
> > > > +	 */
> > > > +	struct drm_gem_object d_obj;
> > > > +
> > > > +	/**
> > > > +	 * @resv: the &dma_resv for &drm_gem_objects mapped in this GPU VA
> > > > +	 * space
> > > > +	 */
> > > > +	struct dma_resv *resv;
> > > > +
> > > > +	/**
> > > > +	 * @exec: the &drm_exec helper to lock external &drm_gem_objects
> > > > +	 */
> > > > +	struct drm_exec exec;
> > > > +
> > > > +	/**
> > > > +	 * @mt_ext: &maple_tree storing external &drm_gem_objects
> > > > +	 */
> > > > +	struct maple_tree mt_ext;
> > > Why are you using a maple tree here? Insertion and removal is O(log(n))
> > > instead of O(1) for a list?
> > > 
> > Having a list of drm_gem_objects directly wouldn't work, as multiple GPU-VMs
> > could have mappings of the same extobj.
> > 
> > I considered using the VM_BO abstraction (struct drm_gpuva_gem) as list entry
> > instead, which also seems to be the obvious choice. However, there is a locking
> > conflict.
> > 
> > A drm_gem_object keeps a list of drm_gpuva_gems, while each drm_gpuva_gem keeps
> > a list of drm_gpuvas. Both lists are either protected with the dma-resv lock of
> > the corresponding drm_gem_object, or with an external lock provided by the
> > driver (see drm_gem_gpuva_set_lock()). The latter is used by drivers performing
> > changes on the GPUVA space directly from the fence signalling path.
> > 
> > Now, similar to what drm_gpuva_link() and drm_gpuva_unlink() are doing already,
> > we'd want to add a drm_gpuva_gem to the extobj list for the first mapping being
> > linked and we'd want to remove it for the last one being unlinked.
> > 
> > (Actually we'd want to add the drm_gpuva_gem object to the extobj list even
> > before, because otherwise we'd not acquire it's dma-resv lock of this GEM object
> > through drm_gpuva_manager_lock(). But that's trival, we could do that when we
> > create the drm_gpuva_gem, which we need to do anyways.)
> > 
> > Anyway, we'd probably want to keep removing the drm_gpuva_gem from the extobj
> > list from drm_gpuva_unlink() when the last mapping of this BO is unlinked. In
> > order to do so, we'd (as discussed above) either need to hold the outer GPU-VM
> > lock or the GPU-VMs dma-resv lock. Both would be illegal in the case
> > drm_gpuva_unlink() is called from within the fence signalling path. For drivers
> > like XE or Nouveau, we'd at least need to make sure to not mess up the locking
> > hierarchy of GPU-VM lock and dma-resv lock of the corresponding BO.
> > 
> > Considering all that, I thought it's probably better to track extobjs separate
> > from the drm_gpuva_gem, hence the maple tree choice.
> 
> Hm. OK, in Xe we're having a list of the xe_vmas (drm_gpuvas) that point to
> external objects, or in the case of multiple mappings to the same gem
> object, only one of the drm_gpuvas is in the list. These are protected by
> the GPU-VM lock. I don't see a problem with removing those from the fence
> signalling path, though?

I intentionally tried to avoid keeping a list of drm_gpuvas to track extobjs:
since this is generic code, I don't know how many mappings of an external object
the corresponding driver potentially creates. This could become a pretty large
list to iterate. Another reason was that I want to keep the drm_gpuva structure
as small as possible, hence avoiding another list_head.

Now, it sounds like in XE you're doing some kind of optimization just keeping a
single mapping of an extobj in the list? How do you know when to remove it? What
if the mapping from the extobj list gets unmapped, but there is still another
one left in the GPU-VM being backed by the same BO?

> 
> Although assuming that's a no-go for GPUVA wouldn't an XArray be a better
> choice, keeping O(1)?

When tracking extobjs, the address of the drm_gem_object is the key while the
reference count is the value. I was thinking of an XArray as well, but I was
worried that the corresponding indices could be too sparsely distributed for an
XArray to still be efficient. Now that I think about it, it's probably not that
bad.
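
Just to make sure we mean the same thing, a minimal sketch of what I have in
mind (extobj_ref_inc() is a made-up name; this assumes insertions are
serialized by the outer GPU-VM lock, as discussed above):

#include <linux/xarray.h>
#include <drm/drm_gem.h>

static int extobj_ref_inc(struct xarray *xa, struct drm_gem_object *obj)
{
	/* GEM object address as index, refcount as XArray value entry. */
	unsigned long index = (uintptr_t)obj;
	void *entry = xa_load(xa, index);
	unsigned long cnt = entry ? xa_to_value(entry) + 1 : 1;

	return xa_err(xa_store(xa, index, xa_mk_value(cnt), GFP_KERNEL));
}

The put side would decrement and xa_erase() the entry (dropping the GEM
reference) once the count hits zero, same as the maple tree variant does now.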

Btw., while I agree we should try to make things as efficient as possible, what
is the expected number of extobjs to be tracked? Do we need to worry about the
O(log(n))?

> 
> > 
> > > > +
> > > > +	/**
> > > > +	 * @evict: structure holding the evict list and evict list lock
> > > > +	 */
> > > > +	struct {
> > > > +		/**
> > > > +		 * @list: &list_head storing &drm_gem_objects currently being
> > > > +		 * evicted
> > > > +		 */
> > > > +		struct list_head list;
> > > > +
> > > > +		/**
> > > > +		 * @lock: spinlock to protect the evict list against concurrent
> > > > +		 * insertion / removal of different &drm_gpuva_gems
> > > > +		 */
> > > > +		spinlock_t lock;
> > > > +	} evict;
> > > >    };
> > > >    void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> > > > +			    struct drm_device *drm,
> > > >    			    const char *name,
> > > >    			    u64 start_offset, u64 range,
> > > >    			    u64 reserve_offset, u64 reserve_range,
> > > >    			    const struct drm_gpuva_fn_ops *ops);
> > > >    void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr);
> > > > +/**
> > > > + * DRM_GPUVA_EXEC - returns the &drm_gpuva_managers &drm_exec instance
> > > > + * @mgr: the &drm_gpuva_manager to return the &drm_exec instance for
> > > > + */
> > > > +#define DRM_GPUVA_EXEC(mgr)	&(mgr)->exec
> > > A struct ww_acquire_ctx and thus a drm_exec is fundamentally per task and
> > > should typically be allocated on the stack. Otherwise you'd need to protect
> > > the mgr->exec member with an exclusive lock throughout the locking process,
> > > and that's not what we want.
> > Oh, good point. I think it works in Nouveau, because there it's implicitly
> > protected with the job submission lock.
> > 
> > > Did you consider subclassing a drm_exec for drm_gpuva purposes and add
> > > needed ops to it: Like so:
> > That's a good idea, will take this into V2.
> 
> Actually, I'm not fully sure that was a good idea: I now have a working
> version of Xe ported over to drm_exec, having these helpers in mind and with
> the intention to start using them as they mature. What I found, though is
> that open-coding the drm_exec loop is not all that bad, but that building
> blocks that can be called from within the loop are useful:
> 
> Like the drm_gpuva_prepare_objects() and an imaginary
> drm_gpuva_prepare_gpuva() that locks the vm resv and the resv of the object
> the gpuva points to (if different). And
> drm_gpuva_prepare_array(), although we don't use it within Xe. That means you
> can use these building blocks like helpers and avoid the fn() callback by
> open-coding instead.
> 
> But I guess YMMV.

That's exactly why those building blocks are exported; I already had in mind
that there might be drivers which still want to open-code the drm_exec loop,
while others might just want a simple interface to lock everything.

I still think it is a good idea, but I'd keep that as simple as possible. And
for everything else just let the driver open-code it and use the "building
blocks" - I will also expand the building blocks to what you mentioned above.
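
To illustrate what I mean by the simple interface, the driver side of a job
submission could then boil down to roughly the following (just a sketch; struct
driver_job and its fields are made up and the dma-resv usage flags are only
exemplary):

static int driver_job_submit(struct driver_job *job)
{
	struct drm_gpuva_manager *mgr = job->mgr;
	int ret;

	ret = drm_gpuva_manager_lock(mgr, 1, true);
	if (ret)
		return ret;

	ret = drm_gpuva_manager_validate(mgr);
	if (ret)
		goto out_unlock;

	/* ... actually push the job to the HW ring ... */

	drm_gpuva_manager_resv_add_fence(mgr, job->done_fence,
					 DMA_RESV_USAGE_BOOKKEEP,
					 DMA_RESV_USAGE_BOOKKEEP);

out_unlock:
	drm_gpuva_manager_unlock(mgr);
	return ret;
}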

> 
> > 
> > > struct drm_gpuva_exec_ops {
> > >      int (*fn) (struct drm_gpuva_exec *exec, int num_fences);
> > Is this the fn argument from drm_gpuva_manager_lock_extra()?
> > 
> > >      int (*bo_validate) (struct drm_gpuva_exec *exec, struct drm_gem_object
> > > *obj);
> > I guess we could also keep that within the drm_gpuva_fn_ops? This should always
> > be the same callback, right?
> > 
> > > };
> > > 
> > > struct drm_gpuva_exec {
> > >      const struct drm_gpuva_exec_ops *ops;
> > >      struct drm_exec exec;
> > >      struct drm_gpuva_manager *mgr;
> > > };
> > > 
> > > Although I'd actually expect bo_validate to be part of fn in the typical
> > > case. The drm_gpuva_exec would then be allocated by the caller on the stack.
> > This doesn't sound like my assumption about fn() above is correct.
> 
> Well one important thing in our conversion is that ttm_bo_validate() needs
> to be in the until_all_locked() loop. We want to be able to use sleeping
> locks for eviction soon, so a xe_bo_validate() would, at least
> temporarily, add locked objects to the drm_exec list of locked objects. That
> means everything that may end up calling validate deep within the call chain
> needs to be part of the until_all_locked() loop, so our
> drm_gpuva_manager_lock_extra() fn callback would include those validates and
> look different all the time. Hence open-coding isn't all that
> bad...

Oh, I see. You indeed want to call validate() from within until_all_locked().
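
I.e. something along these lines for the open-coded case then (again just a
sketch based on the current interface; driver_bo_validate() is a stand-in for
the driver specific validate(), which may itself lock and hence add further
objects to the drm_exec instance):

static int driver_lock_and_validate(struct drm_gpuva_manager *mgr,
				    unsigned int num_fences)
{
	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
	int ret;

	drm_exec_init(exec, DRM_EXEC_INTERRUPTIBLE_WAIT);

	drm_exec_until_all_locked(exec) {
		ret = drm_gpuva_manager_prepare_objects(mgr, num_fences);
		drm_exec_retry_on_contention(exec);
		if (ret)
			goto err;

		/* Must stay within the loop, since it may lock and hence
		 * add further objects to the drm_exec instance.
		 */
		ret = driver_bo_validate(mgr, num_fences);
		drm_exec_retry_on_contention(exec);
		if (ret)
			goto err;
	}

	return 0;

err:
	drm_exec_fini(exec);
	return ret;
}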

> 
> /Thomas
> 
> 
> > 
> > > 
> > > > +
> > > > +int drm_gpuva_manager_lock_extra(struct drm_gpuva_manager *mgr,
> > > > +				 int (*fn)(struct drm_gpuva_manager *mgr,
> > > > +					   void *priv, unsigned int num_fences),
> > > > +				 void *priv,
> > > > +				 unsigned int num_fences,
> > > > +				 bool interruptible);
> > > > +
> > > > +int drm_gpuva_manager_lock_array(struct drm_gpuva_manager *mgr,
> > > > +				 struct drm_gem_object **objs,
> > > > +				 unsigned int num_objs,
> > > > +				 unsigned int num_fences,
> > > > +				 bool interruptible);
> > > > +
> > > > +/**
> > > > + * drm_gpuva_manager_lock() - lock all dma-resv of all associated BOs
> > > > + * @mgr: the &drm_gpuva_manager
> > > > + * @num_fences: the amount of &dma_fences to reserve
> > > > + * @interruptible: sleep interruptible if waiting
> > > > + *
> > > > + * Acquires all dma-resv locks of all &drm_gem_objects the given
> > > > + * &drm_gpuva_manager contains mappings of.
> > > > + *
> > > > + * Returns: 0 on success, negative error code on failure.
> > > > + */
> > > > +static inline int
> > > > +drm_gpuva_manager_lock(struct drm_gpuva_manager *mgr,
> > > > +		       unsigned int num_fences,
> > > > +		       bool interruptible)
> > > > +{
> > > > +	return drm_gpuva_manager_lock_extra(mgr, NULL, NULL, num_fences,
> > > > +					    interruptible);
> > > > +}
> > > > +
> > > > +/**
> > > > + * drm_gpuva_manager_unlock() - unlock all dma-resv of all associated BOs
> > > > + * @mgr: the &drm_gpuva_manager
> > > > + *
> > > > + * Releases all dma-resv locks of all &drm_gem_objects previously acquired
> > > > + * through drm_gpuva_manager_lock() or its variants.
> > > > + *
> > > > + * Returns: 0 on success, negative error code on failure.
> > > > + */
> > > > +static inline void
> > > > +drm_gpuva_manager_unlock(struct drm_gpuva_manager *mgr)
> > > > +{
> > > > +	drm_exec_fini(&mgr->exec);
> > > > +}
> > > > +
> > > > +int drm_gpuva_manager_validate(struct drm_gpuva_manager *mgr);
> > > > +void drm_gpuva_manager_resv_add_fence(struct drm_gpuva_manager *mgr,
> > > > +				      struct dma_fence *fence,
> > > > +				      enum dma_resv_usage private_usage,
> > > > +				      enum dma_resv_usage extobj_usage);
> > > > +
> > > > +int drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
> > > > +			    struct drm_gem_object *obj);
> > > > +void drm_gpuva_extobj_get(struct drm_gpuva_manager *mgr,
> > > > +			  struct drm_gem_object *obj);
> > > > +void drm_gpuva_extobj_put(struct drm_gpuva_manager *mgr,
> > > > +			  struct drm_gem_object *obj);
> > > > +
> > > > +/**
> > > > + * drm_gpuva_is_extobj() - indicates whether the given &drm_gem_object is an
> > > > + * external object
> > > > + * @mgr: the &drm_gpuva_manager to check
> > > > + * @obj: the &drm_gem_object to check
> > > > + *
> > > > + * Returns: true if the &drm_gem_object &dma_resv differs from the
> > > > + * &drm_gpuva_managers &dma_resv, false otherwise
> > > > + */
> > > > +static inline bool drm_gpuva_is_extobj(struct drm_gpuva_manager *mgr,
> > > > +				       struct drm_gem_object *obj)
> > > > +{
> > > > +	return obj && obj->resv != mgr->resv;
> > > > +}
> > > > +
> > > >    static inline struct drm_gpuva *
> > > >    __drm_gpuva_next(struct drm_gpuva *va)
> > > >    {
> > > > @@ -327,6 +453,138 @@ __drm_gpuva_next(struct drm_gpuva *va)
> > > >    #define drm_gpuva_for_each_va_safe(va__, next__, mgr__) \
> > > >    	list_for_each_entry_safe(va__, next__, &(mgr__)->rb.list, rb.entry)
> > > > +/**
> > > > + * struct drm_gpuva_gem - structure representing a &drm_gpuva_manager and
> > > > + * &drm_gem_object combination
> > > > + *
> > > > + * This structure is an abstraction representing a &drm_gpuva_manager and
> > > > + * &drm_gem_object combination. It serves as an indirection to accelerate
> > > > + * iterating all &drm_gpuvas within a &drm_gpuva_manager backed by the same
> > > > + * &drm_gem_object.
> > > > + *
> > > > + * Furthermore it is used to cache evicted GEM objects for a certain GPU-VM to
> > > > + * accelerate validation.
> > > > + *
> > > > + * Typically, drivers want to create an instance of a struct drm_gpuva_gem once
> > > > + * a GEM object is mapped first in a GPU-VM and release the instance once the
> > > > + * last mapping of the GEM object in this GPU-VM is unmapped.
> > > > + */
> > > > +struct drm_gpuva_gem {
> > > > +
> > > > +	/**
> > > > +	 * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> > > > +	 */
> > > > +	struct drm_gpuva_manager *mgr;
> > > > +
> > > > +	/**
> > > > +	 * @obj: The &drm_gem_object being mapped in the @mgr.
> > > > +	 */
> > > > +	struct drm_gem_object *obj;
> > > > +
> > > > +	/**
> > > > +	 * @kref: The reference count for this &drm_gpuva_gem.
> > > > +	 */
> > > > +	struct kref kref;
> > > > +
> > > > +	/**
> > > > +	 * @list: Structure containing all &list_heads.
> > > > +	 */
> > > > +	struct {
> > > > +		/**
> > > > +		 * @gpuva: The list of linked &drm_gpuvas.
> > > > +		 */
> > > > +		struct list_head gpuva;
> > > > +
> > > > +		/**
> > > > +		 * @entry: Structure containing all &list_heads serving as
> > > > +		 * entry.
> > > > +		 */
> > > > +		struct {
> > > > +			/**
> > > > +			 * @gem: List entry to attach to the &drm_gem_objects
> > > > +			 * gpuva list.
> > > > +			 */
> > > > +			struct list_head gem;
> > > > +
> > > > +			/**
> > > > +			 * @evict: List entry to attach to the
> > > > +			 * &drm_gpuva_managers evict list.
> > > > +			 */
> > > > +			struct list_head evict;
> > > > +		} entry;
> > > > +	} list;
> > > > +};
> > > > +
> > > > +struct drm_gpuva_gem *
> > > > +drm_gpuva_gem_obtain(struct drm_gpuva_manager *mgr,
> > > > +		     struct drm_gem_object *obj);
> > > > +struct drm_gpuva_gem *
> > > > +drm_gpuva_gem_obtain_prealloc(struct drm_gpuva_manager *mgr,
> > > > +			      struct drm_gem_object *obj,
> > > > +			      struct drm_gpuva_gem *__vm_bo);
> > > > +
> > > > +struct drm_gpuva_gem *
> > > > +drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
> > > > +		   struct drm_gem_object *obj);
> > > > +
> > > > +void drm_gpuva_gem_evict(struct drm_gem_object *obj, bool evict);
> > > > +
> > > > +struct drm_gpuva_gem *
> > > > +drm_gpuva_gem_create(struct drm_gpuva_manager *mgr,
> > > > +		     struct drm_gem_object *obj);
> > > > +void drm_gpuva_gem_destroy(struct kref *kref);
> > > > +
> > > > +/**
> > > > + * drm_gpuva_gem_get() - acquire a struct drm_gpuva_gem reference
> > > > + * @vm_bo: the &drm_gpuva_gem to acquire the reference of
> > > > + *
> > > > + * This function acquires an additional reference to @vm_bo. It is illegal to
> > > > + * call this without already holding a reference. No locks required.
> > > > + */
> > > > +static inline struct drm_gpuva_gem *
> > > > +drm_gpuva_gem_get(struct drm_gpuva_gem *vm_bo)
> > > > +{
> > > > +	kref_get(&vm_bo->kref);
> > > > +	return vm_bo;
> > > > +}
> > > > +
> > > > +/**
> > > > + * drm_gpuva_gem_put() - drop a struct drm_gpuva_gem reference
> > > > + * @vm_bo: the &drm_gpuva_gem to release the reference of
> > > > + *
> > > > + * This releases a reference to @vm_bo.
> > > > + */
> > > > +static inline void
> > > > +drm_gpuva_gem_put(struct drm_gpuva_gem *vm_bo)
> > > > +{
> > > > +	kref_put(&vm_bo->kref, drm_gpuva_gem_destroy);
> > > > +}
> > > > +
> > > > +/**
> > > > + * drm_gpuva_gem_for_each_va() - iterator to walk over a list of &drm_gpuva
> > > > + * @va__: &drm_gpuva structure to assign to in each iteration step
> > > > + * @vm_bo__: the &drm_gpuva_gem the &drm_gpuva to walk are associated with
> > > > + *
> > > > + * This iterator walks over all &drm_gpuva structures associated with the
> > > > + * &drm_gpuva_gem.
> > > > + */
> > > > +#define drm_gpuva_gem_for_each_va(va__, vm_bo__) \
> > > > +	list_for_each_entry(va__, &(vm_bo__)->list.gpuva, gem.entry)
> > > > +
> > > > +/**
> > > > + * drm_gpuva_gem_for_each_va_safe() - iterator to safely walk over a list of
> > > > + * &drm_gpuva
> > > > + * @va__: &drm_gpuva structure to assign to in each iteration step
> > > > + * @next__: &next &drm_gpuva to store the next step
> > > > + * @vm_bo__: the &drm_gpuva_gem the &drm_gpuva to walk are associated with
> > > > + *
> > > > + * This iterator walks over all &drm_gpuva structures associated with the
> > > > + * &drm_gpuva_gem. It is implemented with list_for_each_entry_safe(), hence
> > > > + * it is safe against removal of elements.
> > > > + */
> > > > +#define drm_gpuva_gem_for_each_va_safe(va__, next__, vm_bo__) \
> > > > +	list_for_each_entry_safe(va__, next__, &(vm_bo__)->list.gpuva, gem.entry)
> > > > +
> > > >    /**
> > > >     * enum drm_gpuva_op_type - GPU VA operation type
> > > >     *
> > > > @@ -641,6 +899,30 @@ struct drm_gpuva_fn_ops {
> > > >    	 */
> > > >    	void (*op_free)(struct drm_gpuva_op *op);
> > > > +	/**
> > > > +	 * @vm_bo_alloc: called when the &drm_gpuva_manager allocates
> > > > +	 * a struct drm_gpuva_gem
> > > > +	 *
> > > > +	 * Some drivers may want to embed struct drm_gpuva_gem into driver
> > > > +	 * specific structures. By implementing this callback drivers can
> > > > +	 * allocate memory accordingly.
> > > > +	 *
> > > > +	 * This callback is optional.
> > > > +	 */
> > > > +	struct drm_gpuva_gem *(*vm_bo_alloc)(void);
> > > > +
> > > > +	/**
> > > > +	 * @vm_bo_free: called when the &drm_gpuva_manager frees a
> > > > +	 * struct drm_gpuva_gem
> > > > +	 *
> > > > +	 * Some drivers may want to embed struct drm_gpuva_gem into driver
> > > > +	 * specific structures. By implementing this callback drivers can
> > > > +	 * free the previously allocated memory accordingly.
> > > > +	 *
> > > > +	 * This callback is optional.
> > > > +	 */
> > > > +	void (*vm_bo_free)(struct drm_gpuva_gem *vm_bo);
> > > > +
> > > >    	/**
> > > >    	 * @sm_step_map: called from &drm_gpuva_sm_map to finally insert the
> > > >    	 * mapping once all previous steps were completed
> > > > @@ -684,6 +966,17 @@ struct drm_gpuva_fn_ops {
> > > >    	 * used.
> > > >    	 */
> > > >    	int (*sm_step_unmap)(struct drm_gpuva_op *op, void *priv);
> > > > +
> > > > +	/**
> > > > +	 * @bo_validate: called from drm_gpuva_manager_validate()
> > > > +	 *
> > > > +	 * Drivers receive this callback for every evicted &drm_gem_object being
> > > > +	 * mapped in the corresponding &drm_gpuva_manager.
> > > > +	 *
> > > > +	 * Typically, drivers would call their driver specific variant of
> > > > +	 * ttm_bo_validate() from within this callback.
> > > > +	 */
> > > > +	int (*bo_validate)(struct drm_gem_object *obj);
> > > >    };
> > > >    int drm_gpuva_sm_map(struct drm_gpuva_manager *mgr, void *priv,
> > > > @@ -696,11 +989,18 @@ int drm_gpuva_sm_unmap(struct drm_gpuva_manager *mgr, void *priv,
> > > >    void drm_gpuva_map(struct drm_gpuva_manager *mgr,
> > > >    		   struct drm_gpuva *va,
> > > >    		   struct drm_gpuva_op_map *op);
> > > > +void drm_gpuva_map_get(struct drm_gpuva_manager *mgr,
> > > > +		       struct drm_gpuva *va,
> > > > +		       struct drm_gpuva_op_map *op);
> > > >    void drm_gpuva_remap(struct drm_gpuva *prev,
> > > >    		     struct drm_gpuva *next,
> > > >    		     struct drm_gpuva_op_remap *op);
> > > > +void drm_gpuva_remap_get(struct drm_gpuva *prev,
> > > > +			 struct drm_gpuva *next,
> > > > +			 struct drm_gpuva_op_remap *op);
> > > >    void drm_gpuva_unmap(struct drm_gpuva_op_unmap *op);
> > > > +void drm_gpuva_unmap_put(struct drm_gpuva_op_unmap *op);
> > > >    #endif /* __DRM_GPUVA_MGR_H__ */
> 


^ permalink raw reply	[flat|nested] 88+ messages in thread

* Re: [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
@ 2023-08-30 15:00           ` Danilo Krummrich
  0 siblings, 0 replies; 88+ messages in thread
From: Danilo Krummrich @ 2023-08-30 15:00 UTC (permalink / raw)
  To: Thomas Hellström (Intel)
  Cc: matthew.brost, thomas.hellstrom, sarah.walker, nouveau,
	dri-devel, linux-kernel, Liam.Howlett, boris.brezillon,
	donald.robson, christian.koenig, faith.ekstrand, bskeggs

On Wed, Aug 30, 2023 at 03:42:08PM +0200, Thomas Hellström (Intel) wrote:
> 
> On 8/30/23 14:49, Danilo Krummrich wrote:
> > Hi Thomas,
> > 
> > thanks for having a look!
> > 
> > On Wed, Aug 30, 2023 at 09:27:45AM +0200, Thomas Hellström (Intel) wrote:
> > > Hi, Danilo.
> > > 
> > > Some quick comments since I'm doing some Xe work in this area. Will probably
> > > get back with more.
> > > 
> > > On 8/20/23 23:53, Danilo Krummrich wrote:
> > > > So far the DRM GPUVA manager offers common infrastructure to track GPU VA
> > > > allocations and mappings, generically connect GPU VA mappings to their
> > > > backing buffers and perform more complex mapping operations on the GPU VA
> > > > space.
> > > > 
> > > > However, there are more design patterns commonly used by drivers, which
> > > > can potentially be generalized in order to make the DRM GPUVA manager
> > > > represent a basic GPU-VM implementation. In this context, this patch aims
> > > > at generalizing the following elements.
> > > > 
> > > > 1) Provide a common dma-resv for GEM objects not being used outside of
> > > >      this GPU-VM.
> > > > 
> > > > 2) Provide tracking of external GEM objects (GEM objects which are
> > > >      shared with other GPU-VMs).
> > > > 
> > > > 3) Provide functions to efficiently lock all GEM objects dma-resv the
> > > >      GPU-VM contains mappings of.
> > > > 
> > > > 4) Provide tracking of evicted GEM objects the GPU-VM contains mappings
> > > >      of, such that validation of evicted GEM objects is accelerated.
> > > > 
> > > > 5) Provide some convenience functions for common patterns.
> > > > 
> > > > Rather than being designed as a "framework", the target is to make all
> > > > features appear as a collection of optional helper functions, such that
> > > > drivers are free to make use of the DRM GPUVA managers basic
> > > > functionality and opt-in for other features without setting any feature
> > > > flags, just by making use of the corresponding functions.
> > > > 
> > > > Signed-off-by: Danilo Krummrich <dakr@redhat.com>
> > > > ---
> > > >    drivers/gpu/drm/drm_gpuva_mgr.c | 688 +++++++++++++++++++++++++++++++-
> > > >    include/drm/drm_gem.h           |  48 ++-
> > > >    include/drm/drm_gpuva_mgr.h     | 302 +++++++++++++-
> > > >    3 files changed, 1010 insertions(+), 28 deletions(-)
> > > > 
> > > > diff --git a/drivers/gpu/drm/drm_gpuva_mgr.c b/drivers/gpu/drm/drm_gpuva_mgr.c
> > > > index f86bfad74ff8..69872b205961 100644
> > > > --- a/drivers/gpu/drm/drm_gpuva_mgr.c
> > > > +++ b/drivers/gpu/drm/drm_gpuva_mgr.c
> > > > @@ -655,6 +655,7 @@ drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
> > > >    /**
> > > >     * drm_gpuva_manager_init() - initialize a &drm_gpuva_manager
> > > >     * @mgr: pointer to the &drm_gpuva_manager to initialize
> > > > + * @drm: the drivers &drm_device
> > > >     * @name: the name of the GPU VA space
> > > >     * @start_offset: the start offset of the GPU VA space
> > > >     * @range: the size of the GPU VA space
> > > > @@ -669,6 +670,7 @@ drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
> > > >     */
> > > >    void
> > > >    drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> > > > +		       struct drm_device *drm,
> > > >    		       const char *name,
> > > >    		       u64 start_offset, u64 range,
> > > >    		       u64 reserve_offset, u64 reserve_range,
> > > > @@ -677,6 +679,11 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> > > >    	mgr->rb.tree = RB_ROOT_CACHED;
> > > >    	INIT_LIST_HEAD(&mgr->rb.list);
> > > > +	mt_init(&mgr->mt_ext);
> > > > +
> > > > +	INIT_LIST_HEAD(&mgr->evict.list);
> > > > +	spin_lock_init(&mgr->evict.lock);
> > > > +
> > > >    	drm_gpuva_check_overflow(start_offset, range);
> > > >    	mgr->mm_start = start_offset;
> > > >    	mgr->mm_range = range;
> > > > @@ -694,6 +701,9 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> > > >    						     reserve_range)))
> > > >    			__drm_gpuva_insert(mgr, &mgr->kernel_alloc_node);
> > > >    	}
> > > > +
> > > > +	drm_gem_private_object_init(drm, &mgr->d_obj, 0);
> > > > +	mgr->resv = mgr->d_obj.resv;
> > > >    }
> > > >    EXPORT_SYMBOL_GPL(drm_gpuva_manager_init);
> > > > @@ -713,10 +723,575 @@ drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr)
> > > >    		__drm_gpuva_remove(&mgr->kernel_alloc_node);
> > > >    	WARN(!RB_EMPTY_ROOT(&mgr->rb.tree.rb_root),
> > > > -	     "GPUVA tree is not empty, potentially leaking memory.");
> > > > +	     "GPUVA tree is not empty, potentially leaking memory.\n");
> > > > +
> > > > +	mtree_destroy(&mgr->mt_ext);
> > > > +	WARN(!list_empty(&mgr->evict.list), "Evict list should be empty.\n");
> > > > +
> > > > +	drm_gem_private_object_fini(&mgr->d_obj);
> > > >    }
> > > >    EXPORT_SYMBOL_GPL(drm_gpuva_manager_destroy);
> > > > +/**
> > > > + * drm_gpuva_manager_prepare_objects() - prepare all associated BOs
> > > > + * @mgr: the &drm_gpuva_manager
> > > > + * @num_fences: the amount of &dma_fences to reserve
> > > > + *
> > > > + * Calls drm_exec_prepare_obj() for all &drm_gem_objects the given
> > > > + * &drm_gpuva_manager contains mappings of.
> > > > + *
> > > > + * Drivers can obtain the corresponding &drm_exec instance through
> > > > + * DRM_GPUVA_EXEC(). It is the driver's responsibility to call drm_exec_init()
> > > > + * and drm_exec_fini() accordingly.
> > > > + *
> > > > + * Returns: 0 on success, negative error code on failure.
> > > > + */
> > > > +int
> > > > +drm_gpuva_manager_prepare_objects(struct drm_gpuva_manager *mgr,
> > > > +				  unsigned int num_fences)
> > > > +{
> > > > +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
> > > > +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
> > > > +	union {
> > > > +		void *ptr;
> > > > +		uintptr_t cnt;
> > > > +	} ref;
> > > > +	int ret;
> > > > +
> > > > +	ret = drm_exec_prepare_obj(exec, &mgr->d_obj, num_fences);
> > > > +	if (ret)
> > > > +		goto out;
> > > > +
> > > > +	rcu_read_lock();
> > > In xe we're protecting the external object list with an outer lock (same as
> > > protecting the mgr itself). Do we need a separate lock for this? In theory,
> > > as outlined in the VM_BIND locking document draft, one could probably even
> > > use the mgr resv for this, but with more complicated code I guess. Also see
> > > the comment below about the data structure chosen.
> > The idea is to protect this list with the GPU-VM lock. The locking here is more
> > a consequence of using the maple tree. Either you use the internal lock of the
> > maple tree or RCU respectively, or you give the maple tree an external lock to
> > perform lockdep checks on (mt_set_external_lock()). Basically same as here:
> > 
> > https://elixir.bootlin.com/linux/latest/source/drivers/base/regmap/regcache-maple.c#L124
> 
> Ah, I suspected it was something along those lines.
> 
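
FWIW, the external lock variant I have in mind would roughly look like the
below (untested; gpuvm->lock just stands in for whatever the driver's outer
GPU-VM lock ends up being):

	/* At init time: let lockdep check operations on the tree against
	 * the outer GPU-VM lock instead of the maple tree's internal lock.
	 */
	mt_init_flags(&mgr->mt_ext, MT_FLAGS_LOCK_EXTERN);
	mt_set_external_lock(&mgr->mt_ext, &gpuvm->lock);

	/* Insertions, removals and walks then simply run with the GPU-VM
	 * lock held, without mas_lock() / rcu_read_lock().
	 */
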
> 
> > 
> > > > +	mas_for_each(&mas, ref.ptr, ULONG_MAX) {
> > > > +		struct drm_gem_object *obj;
> > > > +
> > > > +		mas_pause(&mas);
> > > > +		rcu_read_unlock();
> > > > +
> > > > +		obj = (struct drm_gem_object *)(uintptr_t)mas.index;
> > > > +		ret = drm_exec_prepare_obj(exec, obj, num_fences);
> > > > +		if (ret)
> > > > +			goto out;
> > > > +
> > > > +		rcu_read_lock();
> > > > +	}
> > > > +	rcu_read_unlock();
> > > > +
> > > > +out:
> > > > +	return ret;
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_manager_prepare_objects);
> > > > +
> > > > +/**
> > > > + * drm_gpuva_manager_lock_extra() - lock all dma-resv of all associated BOs
> > > > + * @mgr: the &drm_gpuva_manager
> > > > + * @fn: callback received by the driver to lock additional dma-resv
> > > > + * @priv: private driver data passed to @fn
> > > > + * @num_fences: the amount of &dma_fences to reserve
> > > > + * @interruptible: sleep interruptible if waiting
> > > > + *
> > > > + * Acquires all dma-resv locks of all &drm_gem_objects the given
> > > > + * &drm_gpuva_manager contains mappings of.
> > > > + *
> > > > + * Additionally, when calling this function the driver receives the given @fn
> > > > + * callback to lock additional dma-resv in the context of the
> > > > + * &drm_gpuva_managers &drm_exec instance. Typically, drivers would call
> > > > + * drm_exec_prepare_obj() from within this callback.
> > > > + *
> > > > + * Returns: 0 on success, negative error code on failure.
> > > > + */
> > > > +int
> > > > +drm_gpuva_manager_lock_extra(struct drm_gpuva_manager *mgr,
> > > > +			     int (*fn)(struct drm_gpuva_manager *mgr,
> > > > +				       void *priv, unsigned int num_fences),
> > > > +			     void *priv,
> > > > +			     unsigned int num_fences,
> > > > +			     bool interruptible)
> > > > +{
> > > > +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
> > > > +	uint32_t flags;
> > > > +	int ret;
> > > > +
> > > > +	flags = (interruptible ? DRM_EXEC_INTERRUPTIBLE_WAIT : 0) |
> > > > +		DRM_EXEC_IGNORE_DUPLICATES;
> > > > +
> > > > +	drm_exec_init(exec, flags);
> > > > +
> > > > +	drm_exec_until_all_locked(exec) {
> > > > +		ret = drm_gpuva_manager_prepare_objects(mgr, num_fences);
> > > > +		drm_exec_retry_on_contention(exec);
> > > > +		if (ret)
> > > > +			goto err;
> > > > +
> > > > +		if (fn) {
> > > > +			ret = fn(mgr, priv, num_fences);
> > > > +			drm_exec_retry_on_contention(exec);
> > > > +			if (ret)
> > > > +				goto err;
> > > > +		}
> > > > +	}
> > > > +
> > > > +	return 0;
> > > > +
> > > > +err:
> > > > +	drm_exec_fini(exec);
> > > > +	return ret;
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_manager_lock_extra);
> > > > +
> > > > +static int
> > > > +fn_lock_array(struct drm_gpuva_manager *mgr, void *priv,
> > > > +				unsigned int num_fences)
> > > > +{
> > > > +	struct {
> > > > +		struct drm_gem_object **objs;
> > > > +		unsigned int num_objs;
> > > > +	} *args = priv;
> > > > +
> > > > +	return drm_exec_prepare_array(DRM_GPUVA_EXEC(mgr), args->objs,
> > > > +				      args->num_objs, num_fences);
> > > > +}
> > > > +
> > > > +/**
> > > > + * drm_gpuva_manager_lock_array() - lock all dma-resv of all associated BOs
> > > > + * @mgr: the &drm_gpuva_manager
> > > > + * @objs: additional &drm_gem_objects to lock
> > > > + * @num_objs: the number of additional &drm_gem_objects to lock
> > > > + * @num_fences: the amount of &dma_fences to reserve
> > > > + * @interruptible: sleep interruptible if waiting
> > > > + *
> > > > + * Acquires all dma-resv locks of all &drm_gem_objects the given
> > > > + * &drm_gpuva_manager contains mappings of, plus the ones given through @objs.
> > > > + *
> > > > + * Returns: 0 on success, negative error code on failure.
> > > > + */
> > > > +int
> > > > +drm_gpuva_manager_lock_array(struct drm_gpuva_manager *mgr,
> > > > +			     struct drm_gem_object **objs,
> > > > +			     unsigned int num_objs,
> > > > +			     unsigned int num_fences,
> > > > +			     bool interruptible)
> > > > +{
> > > > +	struct {
> > > > +		struct drm_gem_object **objs;
> > > > +		unsigned int num_objs;
> > > > +	} args;
> > > > +
> > > > +	args.objs = objs;
> > > > +	args.num_objs = num_objs;
> > > > +
> > > > +	return drm_gpuva_manager_lock_extra(mgr, fn_lock_array, &args,
> > > > +					    num_fences, interruptible);
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_manager_lock_array);
> > > > +
> > > > +/**
> > > > + * drm_gpuva_manager_validate() - validate all BOs marked as evicted
> > > > + * @mgr: the &drm_gpuva_manager to validate evicted BOs
> > > > + *
> > > > + * Calls the &drm_gpuva_fn_ops.bo_validate callback for all evicted buffer
> > > > + * objects being mapped in the given &drm_gpuva_manager.
> > > > + *
> > > > + * Returns: 0 on success, negative error code on failure.
> > > > + */
> > > > +int
> > > > +drm_gpuva_manager_validate(struct drm_gpuva_manager *mgr)
> > > > +{
> > > > +	const struct drm_gpuva_fn_ops *ops = mgr->ops;
> > > > +	struct drm_gpuva_gem *vm_bo;
> > > > +	int ret;
> > > > +
> > > > +	if (unlikely(!ops || !ops->bo_validate))
> > > > +		return -ENOTSUPP;
> > > > +
> > > > +	/* At this point we should hold all dma-resv locks of all GEM objects
> > > > +	 * associated with this GPU-VM, hence it is safe to walk the list.
> > > > +	 */
> > > > +	list_for_each_entry(vm_bo, &mgr->evict.list, list.entry.evict) {
> > > > +		dma_resv_assert_held(vm_bo->obj->resv);
> > > > +
> > > > +		ret = ops->bo_validate(vm_bo->obj);
> > > > +		if (ret)
> > > > +			return ret;
> > > > +	}
> > > > +
> > > > +	return 0;
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_manager_validate);
> > > > +
> > > > +/**
> > > > + * drm_gpuva_manager_resv_add_fence - add fence to private and all extobj
> > > > + * dma-resv
> > > > + * @mgr: the &drm_gpuva_manager to add a fence to
> > > > + * @fence: fence to add
> > > > + * @private_usage: private dma-resv usage
> > > > + * @extobj_usage: extobj dma-resv usage
> > > > + */
> > > > +void
> > > > +drm_gpuva_manager_resv_add_fence(struct drm_gpuva_manager *mgr,
> > > > +				 struct dma_fence *fence,
> > > > +				 enum dma_resv_usage private_usage,
> > > > +				 enum dma_resv_usage extobj_usage)
> > > > +{
> > > > +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
> > > > +	struct drm_gem_object *obj;
> > > > +	unsigned long index;
> > > > +
> > > > +	drm_exec_for_each_locked_object(exec, index, obj) {
> > > > +			dma_resv_assert_held(obj->resv);
> > > > +			dma_resv_add_fence(obj->resv, fence,
> > > > +					   drm_gpuva_is_extobj(mgr, obj) ?
> > > > +					   extobj_usage : private_usage);
> > > > +	}
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_manager_resv_add_fence);
> > > > +
> > > > +static struct drm_gpuva_gem *
> > > > +__drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
> > > > +		     struct drm_gem_object *obj)
> > > > +{
> > > > +	struct drm_gpuva_gem *vm_bo;
> > > > +
> > > > +	drm_gem_gpuva_assert_lock_held(obj);
> > > > +
> > > > +	drm_gem_for_each_gpuva_gem(vm_bo, obj)
> > > > +		if (vm_bo->mgr == mgr)
> > > > +			return vm_bo;
> > > > +
> > > > +	return NULL;
> > > > +}
> > > > +
> > > > +/**
> > > > + * drm_gpuva_gem_create() - create a new instance of struct drm_gpuva_gem
> > > > + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> > > > + * @obj: The &drm_gem_object being mapped in the @mgr.
> > > > + *
> > > > + * If provided by the driver, this function uses the &drm_gpuva_fn_ops
> > > > + * vm_bo_alloc() callback to allocate.
> > > > + *
> > > > + * Returns: a pointer to the &drm_gpuva_gem on success, NULL on failure
> > > > + */
> > > > +struct drm_gpuva_gem *
> > > > +drm_gpuva_gem_create(struct drm_gpuva_manager *mgr,
> > > > +		     struct drm_gem_object *obj)
> > > > +{
> > > > +	const struct drm_gpuva_fn_ops *ops = mgr->ops;
> > > > +	struct drm_gpuva_gem *vm_bo;
> > > > +
> > > > +	if (ops && ops->vm_bo_alloc)
> > > > +		vm_bo = ops->vm_bo_alloc();
> > > > +	else
> > > > +		vm_bo = kzalloc(sizeof(*vm_bo), GFP_KERNEL);
> > > > +
> > > > +	if (unlikely(!vm_bo))
> > > > +		return NULL;
> > > > +
> > > > +	vm_bo->mgr = mgr;
> > > > +	vm_bo->obj = obj;
> > > > +
> > > > +	kref_init(&vm_bo->kref);
> > > > +	INIT_LIST_HEAD(&vm_bo->list.gpuva);
> > > > +	INIT_LIST_HEAD(&vm_bo->list.entry.gem);
> > > > +	INIT_LIST_HEAD(&vm_bo->list.entry.evict);
> > > > +
> > > > +	drm_gem_object_get(obj);
> > > > +
> > > > +	return vm_bo;
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_create);
> > > > +
> > > > +void
> > > > +drm_gpuva_gem_destroy(struct kref *kref)
> > > > +{
> > > > +	struct drm_gpuva_gem *vm_bo = container_of(kref, struct drm_gpuva_gem,
> > > > +						   kref);
> > > > +	const struct drm_gpuva_fn_ops *ops = vm_bo->mgr->ops;
> > > > +
> > > > +	drm_gem_object_put(vm_bo->obj);
> > > > +
> > > > +	if (ops && ops->vm_bo_free)
> > > > +		ops->vm_bo_free(vm_bo);
> > > > +	else
> > > > +		kfree(vm_bo);
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_destroy);
> > > > +
> > > > +/**
> > > > + * drm_gpuva_gem_find() - find the &drm_gpuva_gem for the given
> > > > + * &drm_gpuva_manager and &drm_gem_object
> > > > + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> > > > + * @obj: The &drm_gem_object being mapped in the @mgr.
> > > > + *
> > > > + * Find the &drm_gpuva_gem representing the combination of the given
> > > > + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
> > > > + * count of the &drm_gpuva_gem accordingly.
> > > > + *
> > > > + * Returns: a pointer to the &drm_gpuva_gem on success, NULL on failure
> > > > + */
> > > > +struct drm_gpuva_gem *
> > > > +drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
> > > > +		   struct drm_gem_object *obj)
> > > > +{
> > > > +	struct drm_gpuva_gem *vm_bo = __drm_gpuva_gem_find(mgr, obj);
> > > > +
> > > > +	return vm_bo ? drm_gpuva_gem_get(vm_bo) : NULL;
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_find);
> > > > +
> > > > +/**
> > > > + * drm_gpuva_gem_obtain() - obtains an instance of the &drm_gpuva_gem for the
> > > > + * given &drm_gpuva_manager and &drm_gem_object
> > > > + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> > > > + * @obj: The &drm_gem_object being mapped in the @mgr.
> > > > + *
> > > > + * Find the &drm_gpuva_gem representing the combination of the given
> > > > + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
> > > > + * count of the &drm_gpuva_gem accordingly. If not found, allocates a new
> > > > + * &drm_gpuva_gem.
> > > > + *
> > > > + * Returns: a pointer to the &drm_gpuva_gem on success, an ERR_PTR on failure
> > > > + */
> > > > +struct drm_gpuva_gem *
> > > > +drm_gpuva_gem_obtain(struct drm_gpuva_manager *mgr,
> > > > +		     struct drm_gem_object *obj)
> > > > +{
> > > > +	struct drm_gpuva_gem *vm_bo;
> > > > +
> > > > +	vm_bo = drm_gpuva_gem_find(mgr, obj);
> > > > +	if (vm_bo)
> > > > +		return vm_bo;
> > > > +
> > > > +	vm_bo = drm_gpuva_gem_create(mgr, obj);
> > > > +	if (!vm_bo)
> > > > +		return ERR_PTR(-ENOMEM);
> > > > +
> > > > +	return vm_bo;
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_obtain);
> > > > +
> > > > +/**
> > > > + * drm_gpuva_gem_obtain_prealloc() - obtains an instance of the &drm_gpuva_gem
> > > > + * for the given &drm_gpuva_manager and &drm_gem_object
> > > > + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> > > > + * @obj: The &drm_gem_object being mapped in the @mgr.
> > > > + *
> > > > + * Find the &drm_gpuva_gem representing the combination of the given
> > > > + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
> > > > + * count of the found &drm_gpuva_gem accordingly, while the @__vm_bo reference
> > > > + * count is decreased. If not found @__vm_bo is returned.
> > > > + *
> > > > + * Returns: a pointer to the found &drm_gpuva_gem or @__vm_bo if no existing
> > > > + * &drm_gpuva_gem was found
> > > > + */
> > > > +struct drm_gpuva_gem *
> > > > +drm_gpuva_gem_obtain_prealloc(struct drm_gpuva_manager *mgr,
> > > > +			      struct drm_gem_object *obj,
> > > > +			      struct drm_gpuva_gem *__vm_bo)
> > > > +{
> > > > +	struct drm_gpuva_gem *vm_bo;
> > > > +
> > > > +	vm_bo = drm_gpuva_gem_find(mgr, obj);
> > > > +	if (vm_bo) {
> > > > +		drm_gpuva_gem_put(__vm_bo);
> > > > +		return vm_bo;
> > > > +	}
> > > > +
> > > > +	return __vm_bo;
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_obtain_prealloc);
> > > > +
> > > > +static int
> > > > +__drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
> > > > +			  struct drm_gem_object *obj,
> > > > +			  gfp_t gfp)
> > > > +{
> > > > +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
> > > > +	union {
> > > > +		struct drm_gem_object *obj;
> > > > +		uintptr_t index;
> > > > +	} gem;
> > > > +	union {
> > > > +		void *ptr;
> > > > +		uintptr_t cnt;
> > > > +	} ref;
> > > > +	int ret = 0;
> > > > +
> > > > +	gem.obj = obj;
> > > > +	mas_set(&mas, gem.index);
> > > > +
> > > > +	mas_lock(&mas);
> > > > +	ref.ptr = mas_walk(&mas);
> > > > +	if (ref.ptr) {
> > > > +		++ref.cnt;
> > > > +		mas_store(&mas, ref.ptr);
> > > > +	} else {
> > > > +		if (unlikely(!gfp)) {
> > > > +			ret = -EINVAL;
> > > > +			goto out;
> > > > +		}
> > > > +
> > > > +		mas_set(&mas, gem.index);
> > > > +		ref.cnt = 1;
> > > > +		ret = mas_store_gfp(&mas, ref.ptr, gfp);
> > > > +		if (likely(!ret))
> > > > +			drm_gem_object_get(obj);
> > > > +	}
> > > > +out:
> > > > +	mas_unlock(&mas);
> > > > +	return ret;
> > > > +}
> > > > +
> > > > +static void
> > > > +__drm_gpuva_extobj_remove(struct drm_gpuva_manager *mgr,
> > > > +			  struct drm_gem_object *obj)
> > > > +{
> > > > +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
> > > > +	union {
> > > > +		struct drm_gem_object *obj;
> > > > +		uintptr_t index;
> > > > +	} gem;
> > > > +	union {
> > > > +		void *ptr;
> > > > +		uintptr_t cnt;
> > > > +	} ref;
> > > > +
> > > > +	gem.obj = obj;
> > > > +	mas_set(&mas, gem.index);
> > > > +
> > > > +	mas_lock(&mas);
> > > > +	if (unlikely(!(ref.ptr = mas_walk(&mas))))
> > > > +		goto out;
> > > > +
> > > > +	if (!--ref.cnt) {
> > > > +		mas_erase(&mas);
> > > > +		drm_gem_object_put(obj);
> > > > +	} else {
> > > > +		mas_store(&mas, ref.ptr);
> > > > +	}
> > > > +out:
> > > > +	mas_unlock(&mas);
> > > > +}
> > > > +
> > > > +/**
> > > > + * drm_gpuva_extobj_insert - insert an external &drm_gem_object
> > > > + * @mgr: the &drm_gpuva_manager to insert into
> > > > + * @obj: the &drm_gem_object to insert as extobj
> > > > + *
> > > > + * Insert a &drm_gem_object into the &drm_gpuva_managers external object tree.
> > > > + * If the &drm_gem_object already exists in the tree, the reference counter
> > > > + * of this external object is increased by one.
> > > > + *
> > > > + * Drivers should insert the external &drm_gem_object before the dma-fence
> > > > + * signalling critical section, e.g. when submitting the job, and before
> > > > + * locking all &drm_gem_objects of a GPU-VM, e.g. with drm_gpuva_manager_lock()
> > > > + * or its derivatives.
> > > > + *
> > > > + * Returns: 0 on success, negative error code on failure.
> > > > + */
> > > > +int
> > > > +drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
> > > > +			struct drm_gem_object *obj)
> > > > +{
> > > > +	return drm_gpuva_is_extobj(mgr, obj) ?
> > > > +		__drm_gpuva_extobj_insert(mgr, obj, GFP_KERNEL) : 0;
> > > > +
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_insert);
> > > > +
> > > > +/**
> > > > + * drm_gpuva_extobj_get - increase the reference count of an external
> > > > + * &drm_gem_object
> > > > + * @mgr: the &drm_gpuva_manager storing the extobj
> > > > + * @obj: the &drm_gem_object representing the extobj
> > > > + *
> > > > + * Increases the reference count of the extobj represented by @obj.
> > > > + *
> > > > + * Drivers should call this for every &drm_gpuva backed by a &drm_gem_object
> > > > + * being inserted.
> > > > + *
> > > > + * For &drm_gpuva_op_remap operations drivers should make sure to only take an
> > > > + * additional reference if the re-map operation splits an existing &drm_gpuva
> > > > + * into two separate ones.
> > > > + *
> > > > + * See also drm_gpuva_map_get() and drm_gpuva_remap_get().
> > > > + *
> > > > + * Returns: 0 on success, negative error code on failure.
> > > > + */
> > > > +void
> > > > +drm_gpuva_extobj_get(struct drm_gpuva_manager *mgr,
> > > > +		     struct drm_gem_object *obj)
> > > > +{
> > > > +	if (drm_gpuva_is_extobj(mgr, obj))
> > > > +		WARN(__drm_gpuva_extobj_insert(mgr, obj, 0),
> > > > +		     "Can't increase ref-count of non-existent extobj.");
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_get);
> > > > +
> > > > +/**
> > > > + * drm_gpuva_extobj_put - decrease the reference count of an external
> > > > + * &drm_gem_object
> > > > + * @mgr: the &drm_gpuva_manager storing the extobj
> > > > + * @obj: the &drm_gem_object representing the extobj
> > > > + *
> > > > + * Decreases the reference count of the extobj represented by @obj.
> > > > + *
> > > > + * Drivers should call this for every &drm_gpuva backed by a &drm_gem_object
> > > > + * being removed from the GPU VA space.
> > > > + *
> > > > + * See also drm_gpuva_unmap_put().
> > > > + *
> > > > + * Returns: 0 on success, negative error code on failure.
> > > > + */
> > > > +void
> > > > +drm_gpuva_extobj_put(struct drm_gpuva_manager *mgr,
> > > > +		     struct drm_gem_object *obj)
> > > > +{
> > > > +	if (drm_gpuva_is_extobj(mgr, obj))
> > > > +		__drm_gpuva_extobj_remove(mgr, obj);
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_put);
> > > > +
> > > > +/**
> > > > + * drm_gpuva_gem_evict() - add / remove a &drm_gem_object to / from a
> > > > + * &drm_gpuva_managers evicted list
> > > > + * @obj: the &drm_gem_object to add or remove
> > > > + * @evict: indicates whether the object is evicted
> > > > + *
> > > > + * Adds a &drm_gem_object to or removes it from all &drm_gpuva_managers evicted
> > > > + * list containing a mapping of this &drm_gem_object.
> > > > + */
> > > > +void
> > > > +drm_gpuva_gem_evict(struct drm_gem_object *obj, bool evict)
> > > > +{
> > > > +	struct drm_gpuva_gem *vm_bo;
> > > > +
> > > > +	/* Required for iterating the GEMs GPUVA GEM list. If no driver specific
> > > > +	 * lock has been set, the list is protected with the GEMs dma-resv lock.
> > > > +	 */
> > > > +	drm_gem_gpuva_assert_lock_held(obj);
> > > > +
> > > > +	/* Required to protect the GPUVA managers evict list against concurrent
> > > > +	 * access through drm_gpuva_manager_validate(). Concurrent insertions to
> > > > +	 * the evict list through different GEM object evictions are protected
> > > > +	 * by the GPUVA managers evict lock.
> > > > +	 */
> > > > +	dma_resv_assert_held(obj->resv);
> > > > +
> > > > +	drm_gem_for_each_gpuva_gem(vm_bo, obj) {
> > > > +		struct drm_gpuva_manager *mgr = vm_bo->mgr;
> > > > +
> > > > +		spin_lock(&mgr->evict.lock);
> > > > +		if (evict)
> > > > +			list_add_tail(&vm_bo->list.entry.evict,
> > > > +				      &mgr->evict.list);
> > > > +		else
> > > > +			list_del_init(&vm_bo->list.entry.evict);
> > > > +		spin_unlock(&mgr->evict.lock);
> > > > +	}
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_evict);
> > > > +
> > > >    static int
> > > >    __drm_gpuva_insert(struct drm_gpuva_manager *mgr,
> > > >    		   struct drm_gpuva *va)
> > > > @@ -806,15 +1381,20 @@ EXPORT_SYMBOL_GPL(drm_gpuva_remove);
> > > >    /**
> > > >     * drm_gpuva_link() - link a &drm_gpuva
> > > >     * @va: the &drm_gpuva to link
> > > > + * @vm_bo: the &drm_gpuva_gem to add the &drm_gpuva to
> > > >     *
> > > > - * This adds the given &va to the GPU VA list of the &drm_gem_object it is
> > > > - * associated with.
> > > > + * This adds the given &va to the GPU VA list of the &drm_gpuva_gem and the
> > > > + * &drm_gpuva_gem to the &drm_gem_object it is associated with.
> > > > + *
> > > > + * For every &drm_gpuva entry added to the &drm_gpuva_gem an additional
> > > > + * reference of the latter is taken.
> > > >     *
> > > >     * This function expects the caller to protect the GEM's GPUVA list against
> > > > - * concurrent access using the GEMs dma_resv lock.
> > > > + * concurrent access using either the GEMs dma_resv lock or a driver specific
> > > > + * lock set through drm_gem_gpuva_set_lock().
> > > >     */
> > > >    void
> > > > -drm_gpuva_link(struct drm_gpuva *va)
> > > > +drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo)
> > > >    {
> > > >    	struct drm_gem_object *obj = va->gem.obj;
> > > > @@ -823,7 +1403,10 @@ drm_gpuva_link(struct drm_gpuva *va)
> > > >    	drm_gem_gpuva_assert_lock_held(obj);
> > > > -	list_add_tail(&va->gem.entry, &obj->gpuva.list);
> > > > +	drm_gpuva_gem_get(vm_bo);
> > > > +	list_add_tail(&va->gem.entry, &vm_bo->list.gpuva);
> > > > +	if (list_empty(&vm_bo->list.entry.gem))
> > > > +		list_add_tail(&vm_bo->list.entry.gem, &obj->gpuva.list);
> > > >    }
> > > >    EXPORT_SYMBOL_GPL(drm_gpuva_link);
> > > > @@ -834,20 +1417,39 @@ EXPORT_SYMBOL_GPL(drm_gpuva_link);
> > > >     * This removes the given &va from the GPU VA list of the &drm_gem_object it is
> > > >     * associated with.
> > > >     *
> > > > + * This removes the given &va from the GPU VA list of the &drm_gpuva_gem and
> > > > + * the &drm_gpuva_gem from the &drm_gem_object it is associated with in case
> > > > + * this call unlinks the last &drm_gpuva from the &drm_gpuva_gem.
> > > > + *
> > > > + * For every &drm_gpuva entry removed from the &drm_gpuva_gem a reference of
> > > > + * the latter is dropped.
> > > > + *
> > > >     * This function expects the caller to protect the GEM's GPUVA list against
> > > > - * concurrent access using the GEMs dma_resv lock.
> > > > + * concurrent access using either the GEMs dma_resv lock or a driver specific
> > > > + * lock set through drm_gem_gpuva_set_lock().
> > > >     */
> > > >    void
> > > >    drm_gpuva_unlink(struct drm_gpuva *va)
> > > >    {
> > > >    	struct drm_gem_object *obj = va->gem.obj;
> > > > +	struct drm_gpuva_gem *vm_bo;
> > > >    	if (unlikely(!obj))
> > > >    		return;
> > > >    	drm_gem_gpuva_assert_lock_held(obj);
> > > > +	vm_bo = __drm_gpuva_gem_find(va->mgr, obj);
> > > > +	if (WARN(!vm_bo, "GPUVA doesn't seem to be linked.\n"))
> > > > +		return;
> > > > +
> > > >    	list_del_init(&va->gem.entry);
> > > > +
> > > > +	if (list_empty(&vm_bo->list.gpuva)) {
> > > > +		list_del_init(&vm_bo->list.entry.gem);
> > > > +		list_del_init(&vm_bo->list.entry.evict);
> > > > +	}
> > > > +	drm_gpuva_gem_put(vm_bo);
> > > >    }
> > > >    EXPORT_SYMBOL_GPL(drm_gpuva_unlink);
> > > > @@ -977,6 +1579,26 @@ drm_gpuva_map(struct drm_gpuva_manager *mgr,
> > > >    }
> > > >    EXPORT_SYMBOL_GPL(drm_gpuva_map);
> > > > +/**
> > > > + * drm_gpuva_map_get() - helper to insert a &drm_gpuva according to a
> > > > + * &drm_gpuva_op_map
> > > > + * @mgr: the &drm_gpuva_manager
> > > > + * @va: the &drm_gpuva to insert
> > > > + * @op: the &drm_gpuva_op_map to initialize @va with
> > > > + *
> > > > + * Initializes the @va from the @op and inserts it into the given @mgr and
> > > > + * increases the reference count of the corresponding extobj.
> > > > + */
> > > > +void
> > > > +drm_gpuva_map_get(struct drm_gpuva_manager *mgr,
> > > > +		  struct drm_gpuva *va,
> > > > +		  struct drm_gpuva_op_map *op)
> > > > +{
> > > > +	drm_gpuva_map(mgr, va, op);
> > > > +	drm_gpuva_extobj_get(mgr, va->gem.obj);
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_map_get);
> > > > +
> > > >    /**
> > > >     * drm_gpuva_remap() - helper to remap a &drm_gpuva according to a
> > > >     * &drm_gpuva_op_remap
> > > > @@ -992,10 +1614,10 @@ drm_gpuva_remap(struct drm_gpuva *prev,
> > > >    		struct drm_gpuva *next,
> > > >    		struct drm_gpuva_op_remap *op)
> > > >    {
> > > > -	struct drm_gpuva *curr = op->unmap->va;
> > > > -	struct drm_gpuva_manager *mgr = curr->mgr;
> > > > +	struct drm_gpuva *va = op->unmap->va;
> > > > +	struct drm_gpuva_manager *mgr = va->mgr;
> > > > -	drm_gpuva_remove(curr);
> > > > +	drm_gpuva_remove(va);
> > > >    	if (op->prev) {
> > > >    		drm_gpuva_init_from_op(prev, op->prev);
> > > > @@ -1009,6 +1631,31 @@ drm_gpuva_remap(struct drm_gpuva *prev,
> > > >    }
> > > >    EXPORT_SYMBOL_GPL(drm_gpuva_remap);
> > > > +/**
> > > > + * drm_gpuva_remap_get() - helper to remap a &drm_gpuva according to a
> > > > + * &drm_gpuva_op_remap
> > > > + * @prev: the &drm_gpuva to remap when keeping the start of a mapping
> > > > + * @next: the &drm_gpuva to remap when keeping the end of a mapping
> > > > + * @op: the &drm_gpuva_op_remap to initialize @prev and @next with
> > > > + *
> > > > + * Removes the currently mapped &drm_gpuva and remaps it using @prev and/or
> > > > + * @next. Additionally, if the re-map splits the existing &drm_gpuva into two
> > > > + * separate mappings, increases the reference count of the corresponding extobj.
> > > > + */
> > > > +void
> > > > +drm_gpuva_remap_get(struct drm_gpuva *prev,
> > > > +		    struct drm_gpuva *next,
> > > > +		    struct drm_gpuva_op_remap *op)
> > > > +{
> > > > +	struct drm_gpuva *va = op->unmap->va;
> > > > +	struct drm_gpuva_manager *mgr = va->mgr;
> > > > +
> > > > +	drm_gpuva_remap(prev, next, op);
> > > > +	if (op->prev && op->next)
> > > > +		drm_gpuva_extobj_get(mgr, va->gem.obj);
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_remap_get);
> > > > +
> > > >    /**
> > > >     * drm_gpuva_unmap() - helper to remove a &drm_gpuva according to a
> > > >     * &drm_gpuva_op_unmap
> > > > @@ -1023,6 +1670,24 @@ drm_gpuva_unmap(struct drm_gpuva_op_unmap *op)
> > > >    }
> > > >    EXPORT_SYMBOL_GPL(drm_gpuva_unmap);
> > > > +/**
> > > > + * drm_gpuva_unmap_put() - helper to remove a &drm_gpuva according to a
> > > > + * &drm_gpuva_op_unmap
> > > > + * @op: the &drm_gpuva_op_unmap specifying the &drm_gpuva to remove
> > > > + *
> > > > + * Removes the &drm_gpuva associated with the &drm_gpuva_op_unmap and decreases
> > > > + * the reference count of the corresponding extobj.
> > > > + */
> > > > +void
> > > > +drm_gpuva_unmap_put(struct drm_gpuva_op_unmap *op)
> > > > +{
> > > > +	struct drm_gpuva *va = op->va;
> > > > +
> > > > +	drm_gpuva_unmap(op);
> > > > +	drm_gpuva_extobj_put(va->mgr, va->gem.obj);
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_unmap_put);
> > > > +
> > > >    static int
> > > >    op_map_cb(const struct drm_gpuva_fn_ops *fn, void *priv,
> > > >    	  u64 addr, u64 range,
> > > > @@ -1663,6 +2328,7 @@ drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
> > > >    {
> > > >    	struct drm_gpuva_ops *ops;
> > > >    	struct drm_gpuva_op *op;
> > > > +	struct drm_gpuva_gem *vm_bo;
> > > >    	struct drm_gpuva *va;
> > > >    	int ret;
> > > > @@ -1674,7 +2340,7 @@ drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
> > > >    	INIT_LIST_HEAD(&ops->list);
> > > > -	drm_gem_for_each_gpuva(va, obj) {
> > > > +	drm_gem_for_each_gpuva(va, vm_bo, mgr, obj) {
> > > >    		op = gpuva_op_alloc(mgr);
> > > >    		if (!op) {
> > > >    			ret = -ENOMEM;
> > > > diff --git a/include/drm/drm_gem.h b/include/drm/drm_gem.h
> > > > index bc9f6aa2f3fe..783ed3ab440d 100644
> > > > --- a/include/drm/drm_gem.h
> > > > +++ b/include/drm/drm_gem.h
> > > > @@ -571,7 +571,7 @@ int drm_gem_evict(struct drm_gem_object *obj);
> > > >     * drm_gem_gpuva_init() - initialize the gpuva list of a GEM object
> > > >     * @obj: the &drm_gem_object
> > > >     *
> > > > - * This initializes the &drm_gem_object's &drm_gpuva list.
> > > > + * This initializes the &drm_gem_object's &drm_gpuva_gem list.
> > > >     *
> > > >     * Calling this function is only necessary for drivers intending to support the
> > > >     * &drm_driver_feature DRIVER_GEM_GPUVA.
> > > > @@ -584,28 +584,44 @@ static inline void drm_gem_gpuva_init(struct drm_gem_object *obj)
> > > >    }
> > > >    /**
> > > > - * drm_gem_for_each_gpuva() - iternator to walk over a list of gpuvas
> > > > - * @entry__: &drm_gpuva structure to assign to in each iteration step
> > > > - * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
> > > > + * drm_gem_for_each_gpuva_gem() - iterator to walk over a list of &drm_gpuva_gem
> > > > + * @entry__: &drm_gpuva_gem structure to assign to in each iteration step
> > > > + * @obj__: the &drm_gem_object the &drm_gpuva_gem to walk are associated with
> > > >     *
> > > > - * This iterator walks over all &drm_gpuva structures associated with the
> > > > - * &drm_gpuva_manager.
> > > > + * This iterator walks over all &drm_gpuva_gem structures associated with the
> > > > + * &drm_gem_object.
> > > >     */
> > > > -#define drm_gem_for_each_gpuva(entry__, obj__) \
> > > > -	list_for_each_entry(entry__, &(obj__)->gpuva.list, gem.entry)
> > > > +#define drm_gem_for_each_gpuva_gem(entry__, obj__) \
> > > > +	list_for_each_entry(entry__, &(obj__)->gpuva.list, list.entry.gem)
> > > >    /**
> > > > - * drm_gem_for_each_gpuva_safe() - iternator to safely walk over a list of
> > > > - * gpuvas
> > > > - * @entry__: &drm_gpuva structure to assign to in each iteration step
> > > > - * @next__: &next &drm_gpuva to store the next step
> > > > - * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
> > > > + * drm_gem_for_each_gpuva_gem_safe() - iterator to safely walk over a list of
> > > > + * &drm_gpuva_gem
> > > > + * @entry__: &drm_gpuva_gem structure to assign to in each iteration step
> > > > + * @next__: &next &drm_gpuva_gem to store the next step
> > > > + * @obj__: the &drm_gem_object the &drm_gpuva_gem to walk are associated with
> > > >     *
> > > > - * This iterator walks over all &drm_gpuva structures associated with the
> > > > + * This iterator walks over all &drm_gpuva_gem structures associated with the
> > > >     * &drm_gem_object. It is implemented with list_for_each_entry_safe(), hence
> > > >     * it is save against removal of elements.
> > > >     */
> > > > -#define drm_gem_for_each_gpuva_safe(entry__, next__, obj__) \
> > > > -	list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list, gem.entry)
> > > > +#define drm_gem_for_each_gpuva_gem_safe(entry__, next__, obj__) \
> > > > +	list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list, list.entry.gem)
> > > > +
> > > > +/**
> > > > + * drm_gem_for_each_gpuva() - iterator to walk over a list of &drm_gpuva
> > > > + * @va__: &drm_gpuva structure to assign to in each iteration step
> > > > + * @vm_bo__: the &drm_gpuva_gem representing the @mgr__ and @obj__ combination
> > > > + * @mgr__: the &drm_gpuva_manager the &drm_gpuvas to walk are associated with
> > > > + * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
> > > > + *
> > > > + * This iterator walks over all &drm_gpuva structures associated with the
> > > > + * &drm_gpuva_manager and &drm_gem_object.
> > > > + */
> > > > +#define drm_gem_for_each_gpuva(va__, vm_bo__, mgr__, obj__) \
> > > > +	for (vm_bo__ = drm_gpuva_gem_find(mgr__, obj__), \
> > > > +	     va__ = vm_bo__ ? list_first_entry(&vm_bo__->list.gpuva, typeof(*va__), gem.entry) : NULL; \
> > > > +	     va__ && !list_entry_is_head(va__, &vm_bo__->list.gpuva, gem.entry); \
> > > > +	     va__ = list_next_entry(va__, gem.entry))
> > > >    #endif /* __DRM_GEM_H__ */
> > > > diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h
> > > > index ed8d50200cc3..693e2da3f425 100644
> > > > --- a/include/drm/drm_gpuva_mgr.h
> > > > +++ b/include/drm/drm_gpuva_mgr.h
> > > > @@ -26,12 +26,16 @@
> > > >     */
> > > >    #include <linux/list.h>
> > > > +#include <linux/dma-resv.h>
> > > > +#include <linux/maple_tree.h>
> > > >    #include <linux/rbtree.h>
> > > >    #include <linux/types.h>
> > > >    #include <drm/drm_gem.h>
> > > > +#include <drm/drm_exec.h>
> > > >    struct drm_gpuva_manager;
> > > > +struct drm_gpuva_gem;
> > > >    struct drm_gpuva_fn_ops;
> > > >    /**
> > > > @@ -140,7 +144,7 @@ struct drm_gpuva {
> > > >    int drm_gpuva_insert(struct drm_gpuva_manager *mgr, struct drm_gpuva *va);
> > > >    void drm_gpuva_remove(struct drm_gpuva *va);
> > > > -void drm_gpuva_link(struct drm_gpuva *va);
> > > > +void drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo);
> > > >    void drm_gpuva_unlink(struct drm_gpuva *va);
> > > >    struct drm_gpuva *drm_gpuva_find(struct drm_gpuva_manager *mgr,
> > > > @@ -240,15 +244,137 @@ struct drm_gpuva_manager {
> > > >    	 * @ops: &drm_gpuva_fn_ops providing the split/merge steps to drivers
> > > >    	 */
> > > >    	const struct drm_gpuva_fn_ops *ops;
> > > > +
> > > > +	/**
> > > > +	 * @d_obj: Dummy GEM object; used internally to pass the GPU VMs
> > > > +	 * dma-resv to &drm_exec.
> > > > +	 */
> > > > +	struct drm_gem_object d_obj;
> > > > +
> > > > +	/**
> > > > +	 * @resv: the &dma_resv for &drm_gem_objects mapped in this GPU VA
> > > > +	 * space
> > > > +	 */
> > > > +	struct dma_resv *resv;
> > > > +
> > > > +	/**
> > > > +	 * @exec: the &drm_exec helper to lock external &drm_gem_objects
> > > > +	 */
> > > > +	struct drm_exec exec;
> > > > +
> > > > +	/**
> > > > +	 * @mt_ext: &maple_tree storing external &drm_gem_objects
> > > > +	 */
> > > > +	struct maple_tree mt_ext;
> > > Why are you using a maple tree here? Insertion and removal is O(log(n))
> > > instead of O(1) for a list?
> > > 
> > Having a list of drm_gem_objects directly wouldn't work, as multiple GPU-VMs
> > could have mappings of the same extobj.
> > 
> > I considered using the VM_BO abstraction (struct drm_gpuva_gem) as list entry
> > instead, which also seems to be the obvious choice. However, there is a locking
> > conflict.
> > 
> > A drm_gem_object keeps a list of drm_gpuva_gems, while each drm_gpuva_gem keeps
> > a list of drm_gpuvas. Both lists are either protected with the dma-resv lock of
> > the corresponding drm_gem_object, or with an external lock provided by the
> > driver (see drm_gem_gpuva_set_lock()). The latter is used by drivers performing
> > changes on the GPUVA space directly from the fence signalling path.
> > 
> > Now, similar to what drm_gpuva_link() and drm_gpuva_unlink() are doing already,
> > we'd want to add a drm_gpuva_gem to the extobj list for the first mapping being
> > linked and we'd want to remove it for the last one being unlinked.
> > 
> > (Actually we'd want to add the drm_gpuva_gem object to the extobj list even
> > before, because otherwise we'd not acquire this GEM object's dma-resv lock
> > through drm_gpuva_manager_lock(). But that's trivial, we could do that when we
> > create the drm_gpuva_gem, which we need to do anyway.)
> > 
> > Anyway, we'd probably want to keep removing the drm_gpuva_gem from the extobj
> > list from drm_gpuva_unlink() when the last mapping of this BO is unlinked. In
> > order to do so, we'd (as discussed above) either need to hold the outer GPU-VM
> > lock or the GPU-VMs dma-resv lock. Both would be illegal in the case
> > drm_gpuva_unlink() is called from within the fence signalling path. For drivers
> > like XE or Nouveau, we'd at least need to make sure to not mess up the locking
> > hierarchy of GPU-VM lock and dma-resv lock of the corresponding BO.
> > 
> > Considering all that, I thought it's probably better to track extobjs separately
> > from the drm_gpuva_gem, hence the maple tree choice.
> 
> Hm. OK, in Xe we have a list of the xe_vmas (drm_gpuvas) that point to
> external objects, or in the case of multiple mappings to the same gem
> object, only one of the drm_gpuvas is in the list. These are protected by
> the GPU-VM lock. I don't see a problem with removing those from the fence
> signalling path, though?

I intentionally tried to avoid keeping a list of drm_gpuvas to track extobjs,
since this is generic code and I don't know how many mappings of an external
object the corresponding driver potentially creates. This could become a pretty
large list to iterate. Another reason was that I want to keep the drm_gpuva
structure as small as possible, hence avoiding another list_head.

Now, it sounds like in XE you're doing some kind of optimization, keeping just a
single mapping of an extobj in the list? How do you know when to remove it? What
if the mapping from the extobj list gets unmapped, but there is still another
one left in the GPU-VM being backed by the same BO?

> 
> Although, assuming that's a no-go for GPUVA, wouldn't an XArray be a better
> choice, keeping O(1)?

When tracking extobjs, the address of the drm_gem_object is the key, while the
reference count is the value. I was thinking of an XArray as well, but I was
worried that the corresponding indices could be too sparsely distributed for an
XArray to still be efficient. Now that I think about it, it's probably not that
bad.

Btw., while I agree we should try to make things as efficient as possible, what
is the expected magnitude of extobjs to be tracked? Do we need to worry about
the O(log(n))?
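
Just to make the comparison concrete, this is roughly what I had in mind with an
XArray instead of the maple tree - purely an untested sketch with made-up
function names (and <linux/xarray.h> assumed), relying on the outer GPU-VM lock
to serialize insertions and removals as discussed above, so the load/store pair
doesn't race:

	/* Hypothetical sketch, not part of this series. The drm_gem_object
	 * pointer serves as XArray index, the reference count is stored as
	 * an XArray value entry.
	 */
	static int extobj_ref_inc(struct xarray *xa, struct drm_gem_object *obj)
	{
		unsigned long index = (unsigned long)obj;
		void *entry = xa_load(xa, index);
		unsigned long cnt = entry ? xa_to_value(entry) : 0;

		entry = xa_store(xa, index, xa_mk_value(cnt + 1), GFP_KERNEL);
		return xa_err(entry);
	}

	/* Caller guarantees the entry exists, i.e. the BO was inserted before. */
	static void extobj_ref_dec(struct xarray *xa, struct drm_gem_object *obj)
	{
		unsigned long index = (unsigned long)obj;
		unsigned long cnt = xa_to_value(xa_load(xa, index));

		if (--cnt)
			xa_store(xa, index, xa_mk_value(cnt), GFP_KERNEL);
		else
			xa_erase(xa, index);
	}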

> 
> > 
> > > > +
> > > > +	/**
> > > > +	 * @evict: structure holding the evict list and evict list lock
> > > > +	 */
> > > > +	struct {
> > > > +		/**
> > > > +		 * @list: &list_head storing &drm_gem_objects currently being
> > > > +		 * evicted
> > > > +		 */
> > > > +		struct list_head list;
> > > > +
> > > > +		/**
> > > > +		 * @lock: spinlock to protect the evict list against concurrent
> > > > +		 * insertion / removal of different &drm_gpuva_gems
> > > > +		 */
> > > > +		spinlock_t lock;
> > > > +	} evict;
> > > >    };
> > > >    void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> > > > +			    struct drm_device *drm,
> > > >    			    const char *name,
> > > >    			    u64 start_offset, u64 range,
> > > >    			    u64 reserve_offset, u64 reserve_range,
> > > >    			    const struct drm_gpuva_fn_ops *ops);
> > > >    void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr);
> > > > +/**
> > > > + * DRM_GPUVA_EXEC - returns the &drm_gpuva_managers &drm_exec instance
> > > > + * @mgr: the &drm_gpuva_manager to return the &drm_exec instance for
> > > > + */
> > > > +#define DRM_GPUVA_EXEC(mgr)	&(mgr)->exec
> > > A struct ww_acquire_ctx and thus a drm_exec is fundamentally per task and
> > > should typically be allocated on the stack. Otherwise you'd need to protect
> > > the mgr->exec member with an exclusive lock throughout the locking process,
> > > and that's not what we want.
> > Oh, good point. I think it works in Nouveau, because there it's implicitly
> > protected with the job submission lock.
> > 
> > > Did you consider subclassing a drm_exec for drm_gpuva purposes and adding
> > > the needed ops to it? Like so:
> > That's a good idea, will take this into V2.
> 
> Actually, I'm not fully sure that was a good idea: I now have a working
> version of Xe ported over to drm_exec, having these helpers in mind and with
> the intention to start using them as they mature. What I found, though, is
> that open-coding the drm_exec loop is not all that bad, but that building
> blocks that can be called from within the loop are useful:
> 
> Like the drm_gpuva_prepare_objects() and an imaginary
> drm_gpuva_prepare_gpuva() that locks the vm resv and the resv of the object
> (if different) that the gpuva points to. And drm_gpuva_prepare_array(),
> although we don't use it within Xe. That means you can use these building
> blocks like helpers and avoid the fn() callback by open-coding instead.
> 
> But I guess YMMV.

That's exactly why those building blocks are exported; I already had in mind
that there might be drivers which still want to open-code the drm_exec loop,
while others might just want a simple interface to lock everything.

I still think it is a good idea, but I'd keep it as simple as possible. For
everything else, let the driver open-code it and use the "building blocks" -
I will also expand the building blocks to cover what you mentioned above.

> 
> > 
> > > struct drm_gpuva_exec_ops {
> > >      int (*fn) (struct drm_gpuva_exec *exec, int num_fences);
> > Is this the fn argument from drm_gpuva_manager_lock_extra()?
> > 
> > >      int (*bo_validate) (struct drm_gpuva_exec *exec, struct drm_gem_object
> > > *obj);
> > I guess we could also keep that within the drm_gpuva_fn_ops? This should always
> > be the same callback, right?
> > 
> > > };
> > > 
> > > struct drm_gpuva_exec {
> > >      const struct drm_gpuva_exec_ops *ops;
> > >      struct drm_exec exec;
> > >      struct drm_gpuva_manager *mgr;
> > > };
> > > 
> > > Although I'd actually expect bo_validate to be part of fn in the typical
> > > case. The drm_gpuva_exec would then be allocated by the caller on the stack.
> > Then it sounds like my assumption about fn() above isn't correct.
> 
> Well, one important thing in our conversion is that ttm_bo_validate() needs
> to be in the until_all_locked() loop. We want to be able to use sleeping
> locks for eviction soon, so a xe_bo_validate() would, at least temporarily,
> add locked objects to the drm_exec list of locked objects. That means
> everything that may end up calling validate deep within the call chain needs
> to be part of the until_all_locked() loop, so our
> drm_gpuva_manager_lock_extra() fn callback would include those validates and
> look different all the time. Hence that's why open-coding isn't all that
> bad...

Oh, I see. You indeed want to call validate() from within until_all_locked().
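
In that case the open-coded loop from the sketch above would simply grow a
validate step inside of it - again just a sketch, with driver_validate_vm()
being a placeholder for a driver-side helper (e.g. something iterating the
evicted BOs and calling xe_bo_validate() / ttm_bo_validate()), which may add
newly locked objects to the exec and hence trigger a retry:

	drm_exec_until_all_locked(exec) {
		ret = drm_gpuva_manager_prepare_objects(mgr, num_fences);
		drm_exec_retry_on_contention(exec);
		if (ret)
			goto err;

		/* hypothetical driver helper, may lock and add more objects */
		ret = driver_validate_vm(mgr);
		drm_exec_retry_on_contention(exec);
		if (ret)
			goto err;
	}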

> 
> /Thomas
> 
> 
> > 
> > > 
> > > > +
> > > > +int drm_gpuva_manager_lock_extra(struct drm_gpuva_manager *mgr,
> > > > +				 int (*fn)(struct drm_gpuva_manager *mgr,
> > > > +					   void *priv, unsigned int num_fences),
> > > > +				 void *priv,
> > > > +				 unsigned int num_fences,
> > > > +				 bool interruptible);
> > > > +
> > > > +int drm_gpuva_manager_lock_array(struct drm_gpuva_manager *mgr,
> > > > +				 struct drm_gem_object **objs,
> > > > +				 unsigned int num_objs,
> > > > +				 unsigned int num_fences,
> > > > +				 bool interruptible);
> > > > +
> > > > +/**
> > > > + * drm_gpuva_manager_lock() - lock all dma-resv of all associated BOs
> > > > + * @mgr: the &drm_gpuva_manager
> > > > + * @num_fences: the amount of &dma_fences to reserve
> > > > + * @interruptible: sleep interruptible if waiting
> > > > + *
> > > > + * Acquires all dma-resv locks of all &drm_gem_objects the given
> > > > + * &drm_gpuva_manager contains mappings of.
> > > > + *
> > > > + * Returns: 0 on success, negative error code on failure.
> > > > + */
> > > > +static inline int
> > > > +drm_gpuva_manager_lock(struct drm_gpuva_manager *mgr,
> > > > +		       unsigned int num_fences,
> > > > +		       bool interruptible)
> > > > +{
> > > > +	return drm_gpuva_manager_lock_extra(mgr, NULL, NULL, num_fences,
> > > > +					    interruptible);
> > > > +}
> > > > +
> > > > +/**
> > > > + * drm_gpuva_manager_unlock() - unlock all dma-resv of all associated BOs
> > > > + * @mgr: the &drm_gpuva_manager
> > > > + *
> > > > + * Releases all dma-resv locks of all &drm_gem_objects previously acquired
> > > > + * through drm_gpuva_manager_lock() or its variants.
> > > > + */
> > > > +static inline void
> > > > +drm_gpuva_manager_unlock(struct drm_gpuva_manager *mgr)
> > > > +{
> > > > +	drm_exec_fini(&mgr->exec);
> > > > +}
> > > > +
> > > > +int drm_gpuva_manager_validate(struct drm_gpuva_manager *mgr);
> > > > +void drm_gpuva_manager_resv_add_fence(struct drm_gpuva_manager *mgr,
> > > > +				      struct dma_fence *fence,
> > > > +				      enum dma_resv_usage private_usage,
> > > > +				      enum dma_resv_usage extobj_usage);
> > > > +
> > > > +int drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
> > > > +			    struct drm_gem_object *obj);
> > > > +void drm_gpuva_extobj_get(struct drm_gpuva_manager *mgr,
> > > > +			  struct drm_gem_object *obj);
> > > > +void drm_gpuva_extobj_put(struct drm_gpuva_manager *mgr,
> > > > +			  struct drm_gem_object *obj);
> > > > +
> > > > +/**
> > > > + * drm_gpuva_is_extobj() - indicates whether the given &drm_gem_object is an
> > > > + * external object
> > > > + * @mgr: the &drm_gpuva_manager to check
> > > > + * @obj: the &drm_gem_object to check
> > > > + *
> > > > + * Returns: true if the &drm_gem_object &dma_resv differs from the
> > > > + * &drm_gpuva_managers &dma_resv, false otherwise
> > > > + */
> > > > +static inline bool drm_gpuva_is_extobj(struct drm_gpuva_manager *mgr,
> > > > +				       struct drm_gem_object *obj)
> > > > +{
> > > > +	return obj && obj->resv != mgr->resv;
> > > > +}
> > > > +
> > > >    static inline struct drm_gpuva *
> > > >    __drm_gpuva_next(struct drm_gpuva *va)
> > > >    {
> > > > @@ -327,6 +453,138 @@ __drm_gpuva_next(struct drm_gpuva *va)
> > > >    #define drm_gpuva_for_each_va_safe(va__, next__, mgr__) \
> > > >    	list_for_each_entry_safe(va__, next__, &(mgr__)->rb.list, rb.entry)
> > > > +/**
> > > > + * struct drm_gpuva_gem - structure representing a &drm_gpuva_manager and
> > > > + * &drm_gem_object combination
> > > > + *
> > > > + * This structure is an abstraction representing a &drm_gpuva_manager and
> > > > + * &drm_gem_object combination. It serves as an indirection to accelerate
> > > > + * iterating all &drm_gpuvas within a &drm_gpuva_manager backed by the same
> > > > + * &drm_gem_object.
> > > > + *
> > > > + * Furthermore, it is used to cache evicted GEM objects for a certain GPU-VM to
> > > > + * accelerate validation.
> > > > + *
> > > > + * Typically, drivers want to create an instance of a struct drm_gpuva_gem once
> > > > + * a GEM object is mapped first in a GPU-VM and release the instance once the
> > > > + * last mapping of the GEM object in this GPU-VM is unmapped.
> > > > + */
> > > > +struct drm_gpuva_gem {
> > > > +
> > > > +	/**
> > > > +	 * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> > > > +	 */
> > > > +	struct drm_gpuva_manager *mgr;
> > > > +
> > > > +	/**
> > > > +	 * @obj: The &drm_gem_object being mapped in the @mgr.
> > > > +	 */
> > > > +	struct drm_gem_object *obj;
> > > > +
> > > > +	/**
> > > > +	 * @kref: The reference count for this &drm_gpuva_gem.
> > > > +	 */
> > > > +	struct kref kref;
> > > > +
> > > > +	/**
> > > > +	 * @list: Structure containing all &list_heads.
> > > > +	 */
> > > > +	struct {
> > > > +		/**
> > > > +		 * @gpuva: The list of linked &drm_gpuvas.
> > > > +		 */
> > > > +		struct list_head gpuva;
> > > > +
> > > > +		/**
> > > > +		 * @entry: Structure containing all &list_heads serving as
> > > > +		 * entry.
> > > > +		 */
> > > > +		struct {
> > > > +			/**
> > > > +			 * @gem: List entry to attach to the &drm_gem_objects
> > > > +			 * gpuva list.
> > > > +			 */
> > > > +			struct list_head gem;
> > > > +
> > > > +			/**
> > > > +			 * @evict: List entry to attach to the
> > > > +			 * &drm_gpuva_managers evict list.
> > > > +			 */
> > > > +			struct list_head evict;
> > > > +		} entry;
> > > > +	} list;
> > > > +};
> > > > +
> > > > +struct drm_gpuva_gem *
> > > > +drm_gpuva_gem_obtain(struct drm_gpuva_manager *mgr,
> > > > +		     struct drm_gem_object *obj);
> > > > +struct drm_gpuva_gem *
> > > > +drm_gpuva_gem_obtain_prealloc(struct drm_gpuva_manager *mgr,
> > > > +			      struct drm_gem_object *obj,
> > > > +			      struct drm_gpuva_gem *__vm_bo);
> > > > +
> > > > +struct drm_gpuva_gem *
> > > > +drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
> > > > +		   struct drm_gem_object *obj);
> > > > +
> > > > +void drm_gpuva_gem_evict(struct drm_gem_object *obj, bool evict);
> > > > +
> > > > +struct drm_gpuva_gem *
> > > > +drm_gpuva_gem_create(struct drm_gpuva_manager *mgr,
> > > > +		     struct drm_gem_object *obj);
> > > > +void drm_gpuva_gem_destroy(struct kref *kref);
> > > > +
> > > > +/**
> > > > + * drm_gpuva_gem_get() - acquire a struct drm_gpuva_gem reference
> > > > + * @vm_bo: the &drm_gpuva_gem to acquire the reference of
> > > > + *
> > > > + * This function acquires an additional reference to @vm_bo. It is illegal to
> > > > + * call this without already holding a reference. No locks required.
> > > > + */
> > > > +static inline struct drm_gpuva_gem *
> > > > +drm_gpuva_gem_get(struct drm_gpuva_gem *vm_bo)
> > > > +{
> > > > +	kref_get(&vm_bo->kref);
> > > > +	return vm_bo;
> > > > +}
> > > > +
> > > > +/**
> > > > + * drm_gpuva_gem_put() - drop a struct drm_gpuva_gem reference
> > > > + * @vm_bo: the &drm_gpuva_gem to release the reference of
> > > > + *
> > > > + * This releases a reference to @vm_bo.
> > > > + */
> > > > +static inline void
> > > > +drm_gpuva_gem_put(struct drm_gpuva_gem *vm_bo)
> > > > +{
> > > > +	kref_put(&vm_bo->kref, drm_gpuva_gem_destroy);
> > > > +}
> > > > +
> > > > +/**
> > > > + * drm_gpuva_gem_for_each_va() - iterator to walk over a list of &drm_gpuva
> > > > + * @va__: &drm_gpuva structure to assign to in each iteration step
> > > > + * @vm_bo__: the &drm_gpuva_gem the &drm_gpuva to walk are associated with
> > > > + *
> > > > + * This iterator walks over all &drm_gpuva structures associated with the
> > > > + * &drm_gpuva_gem.
> > > > + */
> > > > +#define drm_gpuva_gem_for_each_va(va__, vm_bo__) \
> > > > +	list_for_each_entry(va__, &(vm_bo__)->list.gpuva, gem.entry)
> > > > +
> > > > +/**
> > > > + * drm_gpuva_gem_for_each_va_safe() - iterator to safely walk over a list of
> > > > + * &drm_gpuva
> > > > + * @va__: &drm_gpuva structure to assign to in each iteration step
> > > > + * @next__: &next &drm_gpuva to store the next step
> > > > + * @vm_bo__: the &drm_gpuva_gem the &drm_gpuva to walk are associated with
> > > > + *
> > > > + * This iterator walks over all &drm_gpuva structures associated with the
> > > > + * &drm_gpuva_gem. It is implemented with list_for_each_entry_safe(), hence
> > > > + * it is safe against removal of elements.
> > > > + */
> > > > +#define drm_gpuva_gem_for_each_va_safe(va__, next__, vm_bo__) \
> > > > +	list_for_each_entry_safe(va__, next__, &(vm_bo__)->list.gpuva, gem.entry)
> > > > +
> > > >    /**
> > > >     * enum drm_gpuva_op_type - GPU VA operation type
> > > >     *
> > > > @@ -641,6 +899,30 @@ struct drm_gpuva_fn_ops {
> > > >    	 */
> > > >    	void (*op_free)(struct drm_gpuva_op *op);
> > > > +	/**
> > > > +	 * @vm_bo_alloc: called when the &drm_gpuva_manager allocates
> > > > +	 * a struct drm_gpuva_gem
> > > > +	 *
> > > > +	 * Some drivers may want to embed struct drm_gpuva_gem into driver
> > > > +	 * specific structures. By implementing this callback drivers can
> > > > +	 * allocate memory accordingly.
> > > > +	 *
> > > > +	 * This callback is optional.
> > > > +	 */
> > > > +	struct drm_gpuva_gem *(*vm_bo_alloc)(void);
> > > > +
> > > > +	/**
> > > > +	 * @vm_bo_free: called when the &drm_gpuva_manager frees a
> > > > +	 * struct drm_gpuva_gem
> > > > +	 *
> > > > +	 * Some drivers may want to embed struct drm_gpuva_gem into driver
> > > > +	 * specific structures. By implementing this callback drivers can
> > > > +	 * free the previously allocated memory accordingly.
> > > > +	 *
> > > > +	 * This callback is optional.
> > > > +	 */
> > > > +	void (*vm_bo_free)(struct drm_gpuva_gem *vm_bo);
> > > > +
> > > >    	/**
> > > >    	 * @sm_step_map: called from &drm_gpuva_sm_map to finally insert the
> > > >    	 * mapping once all previous steps were completed
> > > > @@ -684,6 +966,17 @@ struct drm_gpuva_fn_ops {
> > > >    	 * used.
> > > >    	 */
> > > >    	int (*sm_step_unmap)(struct drm_gpuva_op *op, void *priv);
> > > > +
> > > > +	/**
> > > > +	 * @bo_validate: called from drm_gpuva_manager_validate()
> > > > +	 *
> > > > +	 * Drivers receive this callback for every evicted &drm_gem_object being
> > > > +	 * mapped in the corresponding &drm_gpuva_manager.
> > > > +	 *
> > > > +	 * Typically, drivers would call their driver specific variant of
> > > > +	 * ttm_bo_validate() from within this callback.
> > > > +	 */
> > > > +	int (*bo_validate)(struct drm_gem_object *obj);
> > > >    };
> > > >    int drm_gpuva_sm_map(struct drm_gpuva_manager *mgr, void *priv,
> > > > @@ -696,11 +989,18 @@ int drm_gpuva_sm_unmap(struct drm_gpuva_manager *mgr, void *priv,
> > > >    void drm_gpuva_map(struct drm_gpuva_manager *mgr,
> > > >    		   struct drm_gpuva *va,
> > > >    		   struct drm_gpuva_op_map *op);
> > > > +void drm_gpuva_map_get(struct drm_gpuva_manager *mgr,
> > > > +		       struct drm_gpuva *va,
> > > > +		       struct drm_gpuva_op_map *op);
> > > >    void drm_gpuva_remap(struct drm_gpuva *prev,
> > > >    		     struct drm_gpuva *next,
> > > >    		     struct drm_gpuva_op_remap *op);
> > > > +void drm_gpuva_remap_get(struct drm_gpuva *prev,
> > > > +			 struct drm_gpuva *next,
> > > > +			 struct drm_gpuva_op_remap *op);
> > > >    void drm_gpuva_unmap(struct drm_gpuva_op_unmap *op);
> > > > +void drm_gpuva_unmap_put(struct drm_gpuva_op_unmap *op);
> > > >    #endif /* __DRM_GPUVA_MGR_H__ */
> 


^ permalink raw reply	[flat|nested] 88+ messages in thread

* Re: [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
@ 2023-08-30 15:00           ` Danilo Krummrich
  0 siblings, 0 replies; 88+ messages in thread
From: Danilo Krummrich @ 2023-08-30 15:00 UTC (permalink / raw)
  To: Thomas Hellström (Intel)
  Cc: airlied, daniel, matthew.brost, thomas.hellstrom, sarah.walker,
	donald.robson, boris.brezillon, christian.koenig, faith.ekstrand,
	bskeggs, Liam.Howlett, nouveau, linux-kernel, dri-devel

On Wed, Aug 30, 2023 at 03:42:08PM +0200, Thomas Hellström (Intel) wrote:
> 
> On 8/30/23 14:49, Danilo Krummrich wrote:
> > Hi Thomas,
> > 
> > thanks for having a look!
> > 
> > On Wed, Aug 30, 2023 at 09:27:45AM +0200, Thomas Hellström (Intel) wrote:
> > > Hi, Danilo.
> > > 
> > > Some quick comments since I'm doing some Xe work in this area. Will probably
> > > get back with more.
> > > 
> > > On 8/20/23 23:53, Danilo Krummrich wrote:
> > > > So far the DRM GPUVA manager offers common infrastructure to track GPU VA
> > > > allocations and mappings, generically connect GPU VA mappings to their
> > > > backing buffers and perform more complex mapping operations on the GPU VA
> > > > space.
> > > > 
> > > > However, there are more design patterns commonly used by drivers, which
> > > > can potentially be generalized in order to make the DRM GPUVA manager
> > > > represent a basic GPU-VM implementation. In this context, this patch aims
> > > > at generalizing the following elements.
> > > > 
> > > > 1) Provide a common dma-resv for GEM objects not being used outside of
> > > >      this GPU-VM.
> > > > 
> > > > 2) Provide tracking of external GEM objects (GEM objects which are
> > > >      shared with other GPU-VMs).
> > > > 
> > > > 3) Provide functions to efficiently lock all GEM objects dma-resv the
> > > >      GPU-VM contains mappings of.
> > > > 
> > > > 4) Provide tracking of evicted GEM objects the GPU-VM contains mappings
> > > >      of, such that validation of evicted GEM objects is accelerated.
> > > > 
> > > > 5) Provide some convenience functions for common patterns.
> > > > 
> > > > Rather than being designed as a "framework", the target is to make all
> > > > features appear as a collection of optional helper functions, such that
> > > > drivers are free to make use of the DRM GPUVA managers basic
> > > > functionality and opt-in for other features without setting any feature
> > > > flags, just by making use of the corresponding functions.
> > > > 
> > > > Signed-off-by: Danilo Krummrich <dakr@redhat.com>
> > > > ---
> > > >    drivers/gpu/drm/drm_gpuva_mgr.c | 688 +++++++++++++++++++++++++++++++-
> > > >    include/drm/drm_gem.h           |  48 ++-
> > > >    include/drm/drm_gpuva_mgr.h     | 302 +++++++++++++-
> > > >    3 files changed, 1010 insertions(+), 28 deletions(-)
> > > > 
> > > > diff --git a/drivers/gpu/drm/drm_gpuva_mgr.c b/drivers/gpu/drm/drm_gpuva_mgr.c
> > > > index f86bfad74ff8..69872b205961 100644
> > > > --- a/drivers/gpu/drm/drm_gpuva_mgr.c
> > > > +++ b/drivers/gpu/drm/drm_gpuva_mgr.c
> > > > @@ -655,6 +655,7 @@ drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
> > > >    /**
> > > >     * drm_gpuva_manager_init() - initialize a &drm_gpuva_manager
> > > >     * @mgr: pointer to the &drm_gpuva_manager to initialize
> > > > + * @drm: the drivers &drm_device
> > > >     * @name: the name of the GPU VA space
> > > >     * @start_offset: the start offset of the GPU VA space
> > > >     * @range: the size of the GPU VA space
> > > > @@ -669,6 +670,7 @@ drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
> > > >     */
> > > >    void
> > > >    drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> > > > +		       struct drm_device *drm,
> > > >    		       const char *name,
> > > >    		       u64 start_offset, u64 range,
> > > >    		       u64 reserve_offset, u64 reserve_range,
> > > > @@ -677,6 +679,11 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> > > >    	mgr->rb.tree = RB_ROOT_CACHED;
> > > >    	INIT_LIST_HEAD(&mgr->rb.list);
> > > > +	mt_init(&mgr->mt_ext);
> > > > +
> > > > +	INIT_LIST_HEAD(&mgr->evict.list);
> > > > +	spin_lock_init(&mgr->evict.lock);
> > > > +
> > > >    	drm_gpuva_check_overflow(start_offset, range);
> > > >    	mgr->mm_start = start_offset;
> > > >    	mgr->mm_range = range;
> > > > @@ -694,6 +701,9 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> > > >    						     reserve_range)))
> > > >    			__drm_gpuva_insert(mgr, &mgr->kernel_alloc_node);
> > > >    	}
> > > > +
> > > > +	drm_gem_private_object_init(drm, &mgr->d_obj, 0);
> > > > +	mgr->resv = mgr->d_obj.resv;
> > > >    }
> > > >    EXPORT_SYMBOL_GPL(drm_gpuva_manager_init);
> > > > @@ -713,10 +723,575 @@ drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr)
> > > >    		__drm_gpuva_remove(&mgr->kernel_alloc_node);
> > > >    	WARN(!RB_EMPTY_ROOT(&mgr->rb.tree.rb_root),
> > > > -	     "GPUVA tree is not empty, potentially leaking memory.");
> > > > +	     "GPUVA tree is not empty, potentially leaking memory.\n");
> > > > +
> > > > +	mtree_destroy(&mgr->mt_ext);
> > > > +	WARN(!list_empty(&mgr->evict.list), "Evict list should be empty.\n");
> > > > +
> > > > +	drm_gem_private_object_fini(&mgr->d_obj);
> > > >    }
> > > >    EXPORT_SYMBOL_GPL(drm_gpuva_manager_destroy);
> > > > +/**
> > > > + * drm_gpuva_manager_prepare_objects() - prepare all associated BOs
> > > > + * @mgr: the &drm_gpuva_manager
> > > > + * @num_fences: the amount of &dma_fences to reserve
> > > > + *
> > > > + * Calls drm_exec_prepare_obj() for all &drm_gem_objects the given
> > > > + * &drm_gpuva_manager contains mappings of.
> > > > + *
> > > > + * Drivers can obtain the corresponding &drm_exec instance through
> > > > + * DRM_GPUVA_EXEC(). It is the driver's responsibility to call drm_exec_init()
> > > > + * and drm_exec_fini() accordingly.
> > > > + *
> > > > + * Returns: 0 on success, negative error code on failure.
> > > > + */
> > > > +int
> > > > +drm_gpuva_manager_prepare_objects(struct drm_gpuva_manager *mgr,
> > > > +				  unsigned int num_fences)
> > > > +{
> > > > +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
> > > > +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
> > > > +	union {
> > > > +		void *ptr;
> > > > +		uintptr_t cnt;
> > > > +	} ref;
> > > > +	int ret;
> > > > +
> > > > +	ret = drm_exec_prepare_obj(exec, &mgr->d_obj, num_fences);
> > > > +	if (ret)
> > > > +		goto out;
> > > > +
> > > > +	rcu_read_lock();
> > > In xe we're protecting the external object list with an outer lock (same as
> > > protecting the mgr itself). Do we need a separate lock for this? In theory,
> > > as outlined in the VM_BIND locking document draft, one could probably even
> > > use the mgr resv for this, but with more complicated code I guess. Also see
> > > the comment below about the data structure chosen.
> > The idea is to protect this list with the GPU-VM lock. The locking here is more
> > of an implication of the maple tree. Either you use the internal lock of the
> > maple tree or RCU respectively, or you give the maple tree an external lock to
> > perform lockdep checks on (mt_set_external_lock()). Basically same as here:
> > 
> > https://elixir.bootlin.com/linux/latest/source/drivers/base/regmap/regcache-maple.c#L124
> 
> Ah, I suspected it was something along those lines.
> 
> 
> > 
> > > > +	mas_for_each(&mas, ref.ptr, ULONG_MAX) {
> > > > +		struct drm_gem_object *obj;
> > > > +
> > > > +		mas_pause(&mas);
> > > > +		rcu_read_unlock();
> > > > +
> > > > +		obj = (struct drm_gem_object *)(uintptr_t)mas.index;
> > > > +		ret = drm_exec_prepare_obj(exec, obj, num_fences);
> > > > +		if (ret)
> > > > +			goto out;
> > > > +
> > > > +		rcu_read_lock();
> > > > +	}
> > > > +	rcu_read_unlock();
> > > > +
> > > > +out:
> > > > +	return ret;
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_manager_prepare_objects);
> > > > +
> > > > +/**
> > > > + * drm_gpuva_manager_lock_extra() - lock all dma-resv of all associated BOs
> > > > + * @mgr: the &drm_gpuva_manager
> > > > + * @fn: callback received by the driver to lock additional dma-resv
> > > > + * @priv: private driver data passed to @fn
> > > > + * @num_fences: the amount of &dma_fences to reserve
> > > > + * @interruptible: sleep interruptible if waiting
> > > > + *
> > > > + * Acquires all dma-resv locks of all &drm_gem_objects the given
> > > > + * &drm_gpuva_manager contains mappings of.
> > > > + *
> > > > + * Additionally, when calling this function the driver receives the given @fn
> > > > + * callback to lock additional dma-resv in the context of the
> > > > + * &drm_gpuva_managers &drm_exec instance. Typically, drivers would call
> > > > + * drm_exec_prepare_obj() from within this callback.
> > > > + *
> > > > + * Returns: 0 on success, negative error code on failure.
> > > > + */
> > > > +int
> > > > +drm_gpuva_manager_lock_extra(struct drm_gpuva_manager *mgr,
> > > > +			     int (*fn)(struct drm_gpuva_manager *mgr,
> > > > +				       void *priv, unsigned int num_fences),
> > > > +			     void *priv,
> > > > +			     unsigned int num_fences,
> > > > +			     bool interruptible)
> > > > +{
> > > > +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
> > > > +	uint32_t flags;
> > > > +	int ret;
> > > > +
> > > > +	flags = (interruptible ? DRM_EXEC_INTERRUPTIBLE_WAIT : 0) |
> > > > +		DRM_EXEC_IGNORE_DUPLICATES;
> > > > +
> > > > +	drm_exec_init(exec, flags);
> > > > +
> > > > +	drm_exec_until_all_locked(exec) {
> > > > +		ret = drm_gpuva_manager_prepare_objects(mgr, num_fences);
> > > > +		drm_exec_retry_on_contention(exec);
> > > > +		if (ret)
> > > > +			goto err;
> > > > +
> > > > +		if (fn) {
> > > > +			ret = fn(mgr, priv, num_fences);
> > > > +			drm_exec_retry_on_contention(exec);
> > > > +			if (ret)
> > > > +				goto err;
> > > > +		}
> > > > +	}
> > > > +
> > > > +	return 0;
> > > > +
> > > > +err:
> > > > +	drm_exec_fini(exec);
> > > > +	return ret;
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_manager_lock_extra);
> > > > +
> > > > +static int
> > > > +fn_lock_array(struct drm_gpuva_manager *mgr, void *priv,
> > > > +				unsigned int num_fences)
> > > > +{
> > > > +	struct {
> > > > +		struct drm_gem_object **objs;
> > > > +		unsigned int num_objs;
> > > > +	} *args = priv;
> > > > +
> > > > +	return drm_exec_prepare_array(DRM_GPUVA_EXEC(mgr), args->objs,
> > > > +				      args->num_objs, num_fences);
> > > > +}
> > > > +
> > > > +/**
> > > > + * drm_gpuva_manager_lock_array() - lock all dma-resv of all associated BOs
> > > > + * @mgr: the &drm_gpuva_manager
> > > > + * @objs: additional &drm_gem_objects to lock
> > > > + * @num_objs: the number of additional &drm_gem_objects to lock
> > > > + * @num_fences: the amount of &dma_fences to reserve
> > > > + * @interruptible: sleep interruptible if waiting
> > > > + *
> > > > + * Acquires all dma-resv locks of all &drm_gem_objects the given
> > > > + * &drm_gpuva_manager contains mappings of, plus the ones given through @objs.
> > > > + *
> > > > + * Returns: 0 on success, negative error code on failure.
> > > > + */
> > > > +int
> > > > +drm_gpuva_manager_lock_array(struct drm_gpuva_manager *mgr,
> > > > +			     struct drm_gem_object **objs,
> > > > +			     unsigned int num_objs,
> > > > +			     unsigned int num_fences,
> > > > +			     bool interruptible)
> > > > +{
> > > > +	struct {
> > > > +		struct drm_gem_object **objs;
> > > > +		unsigned int num_objs;
> > > > +	} args;
> > > > +
> > > > +	args.objs = objs;
> > > > +	args.num_objs = num_objs;
> > > > +
> > > > +	return drm_gpuva_manager_lock_extra(mgr, fn_lock_array, &args,
> > > > +					    num_fences, interruptible);
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_manager_lock_array);
> > > > +
> > > > +/**
> > > > + * drm_gpuva_manager_validate() - validate all BOs marked as evicted
> > > > + * @mgr: the &drm_gpuva_manager to validate evicted BOs
> > > > + *
> > > > + * Calls the &drm_gpuva_fn_ops.bo_validate callback for all evicted buffer
> > > > + * objects being mapped in the given &drm_gpuva_manager.
> > > > + *
> > > > + * Returns: 0 on success, negative error code on failure.
> > > > + */
> > > > +int
> > > > +drm_gpuva_manager_validate(struct drm_gpuva_manager *mgr)
> > > > +{
> > > > +	const struct drm_gpuva_fn_ops *ops = mgr->ops;
> > > > +	struct drm_gpuva_gem *vm_bo;
> > > > +	int ret;
> > > > +
> > > > +	if (unlikely(!ops || !ops->bo_validate))
> > > > +		return -ENOTSUPP;
> > > > +
> > > > +	/* At this point we should hold all dma-resv locks of all GEM objects
> > > > +	 * associated with this GPU-VM, hence it is safe to walk the list.
> > > > +	 */
> > > > +	list_for_each_entry(vm_bo, &mgr->evict.list, list.entry.evict) {
> > > > +		dma_resv_assert_held(vm_bo->obj->resv);
> > > > +
> > > > +		ret = ops->bo_validate(vm_bo->obj);
> > > > +		if (ret)
> > > > +			return ret;
> > > > +	}
> > > > +
> > > > +	return 0;
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_manager_validate);
> > > > +
> > > > +/**
> > > > + * drm_gpuva_manager_resv_add_fence - add fence to private and all extobj
> > > > + * dma-resv
> > > > + * @mgr: the &drm_gpuva_manager to add a fence to
> > > > + * @fence: fence to add
> > > > + * @private_usage: private dma-resv usage
> > > > + * @extobj_usage: extobj dma-resv usage
> > > > + */
> > > > +void
> > > > +drm_gpuva_manager_resv_add_fence(struct drm_gpuva_manager *mgr,
> > > > +				 struct dma_fence *fence,
> > > > +				 enum dma_resv_usage private_usage,
> > > > +				 enum dma_resv_usage extobj_usage)
> > > > +{
> > > > +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
> > > > +	struct drm_gem_object *obj;
> > > > +	unsigned long index;
> > > > +
> > > > +	drm_exec_for_each_locked_object(exec, index, obj) {
> > > > +			dma_resv_assert_held(obj->resv);
> > > > +			dma_resv_add_fence(obj->resv, fence,
> > > > +					   drm_gpuva_is_extobj(mgr, obj) ?
> > > > +					   extobj_usage : private_usage);
> > > > +	}
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_manager_resv_add_fence);
> > > > +
> > > > +static struct drm_gpuva_gem *
> > > > +__drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
> > > > +		     struct drm_gem_object *obj)
> > > > +{
> > > > +	struct drm_gpuva_gem *vm_bo;
> > > > +
> > > > +	drm_gem_gpuva_assert_lock_held(obj);
> > > > +
> > > > +	drm_gem_for_each_gpuva_gem(vm_bo, obj)
> > > > +		if (vm_bo->mgr == mgr)
> > > > +			return vm_bo;
> > > > +
> > > > +	return NULL;
> > > > +}
> > > > +
> > > > +/**
> > > > + * drm_gpuva_gem_create() - create a new instance of struct drm_gpuva_gem
> > > > + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> > > > + * @obj: The &drm_gem_object being mapped in the @mgr.
> > > > + *
> > > > + * If provided by the driver, this function uses the &drm_gpuva_fn_ops
> > > > + * vm_bo_alloc() callback to allocate.
> > > > + *
> > > > + * Returns: a pointer to the &drm_gpuva_gem on success, NULL on failure
> > > > + */
> > > > +struct drm_gpuva_gem *
> > > > +drm_gpuva_gem_create(struct drm_gpuva_manager *mgr,
> > > > +		     struct drm_gem_object *obj)
> > > > +{
> > > > +	const struct drm_gpuva_fn_ops *ops = mgr->ops;
> > > > +	struct drm_gpuva_gem *vm_bo;
> > > > +
> > > > +	if (ops && ops->vm_bo_alloc)
> > > > +		vm_bo = ops->vm_bo_alloc();
> > > > +	else
> > > > +		vm_bo = kzalloc(sizeof(*vm_bo), GFP_KERNEL);
> > > > +
> > > > +	if (unlikely(!vm_bo))
> > > > +		return NULL;
> > > > +
> > > > +	vm_bo->mgr = mgr;
> > > > +	vm_bo->obj = obj;
> > > > +
> > > > +	kref_init(&vm_bo->kref);
> > > > +	INIT_LIST_HEAD(&vm_bo->list.gpuva);
> > > > +	INIT_LIST_HEAD(&vm_bo->list.entry.gem);
> > > > +	INIT_LIST_HEAD(&vm_bo->list.entry.evict);
> > > > +
> > > > +	drm_gem_object_get(obj);
> > > > +
> > > > +	return vm_bo;
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_create);
> > > > +
> > > > +void
> > > > +drm_gpuva_gem_destroy(struct kref *kref)
> > > > +{
> > > > +	struct drm_gpuva_gem *vm_bo = container_of(kref, struct drm_gpuva_gem,
> > > > +						   kref);
> > > > +	const struct drm_gpuva_fn_ops *ops = vm_bo->mgr->ops;
> > > > +
> > > > +	drm_gem_object_put(vm_bo->obj);
> > > > +
> > > > +	if (ops && ops->vm_bo_free)
> > > > +		ops->vm_bo_free(vm_bo);
> > > > +	else
> > > > +		kfree(vm_bo);
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_destroy);
> > > > +
> > > > +/**
> > > > + * drm_gpuva_gem_find() - find the &drm_gpuva_gem for the given
> > > > + * &drm_gpuva_manager and &drm_gem_object
> > > > + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> > > > + * @obj: The &drm_gem_object being mapped in the @mgr.
> > > > + *
> > > > + * Find the &drm_gpuva_gem representing the combination of the given
> > > > + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
> > > > + * count of the &drm_gpuva_gem accordingly.
> > > > + *
> > > > + * Returns: a pointer to the &drm_gpuva_gem on success, NULL on failure
> > > > + */
> > > > +struct drm_gpuva_gem *
> > > > +drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
> > > > +		   struct drm_gem_object *obj)
> > > > +{
> > > > +	struct drm_gpuva_gem *vm_bo = __drm_gpuva_gem_find(mgr, obj);
> > > > +
> > > > +	return vm_bo ? drm_gpuva_gem_get(vm_bo) : NULL;
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_find);
> > > > +
> > > > +/**
> > > > + * drm_gpuva_gem_obtain() - obtains an instance of the &drm_gpuva_gem for the
> > > > + * given &drm_gpuva_manager and &drm_gem_object
> > > > + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> > > > + * @obj: The &drm_gem_object being mapped in the @mgr.
> > > > + *
> > > > + * Find the &drm_gpuva_gem representing the combination of the given
> > > > + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
> > > > + * count of the &drm_gpuva_gem accordingly. If not found, allocates a new
> > > > + * &drm_gpuva_gem.
> > > > + *
> > > > + * Returns: a pointer to the &drm_gpuva_gem on success, an ERR_PTR on failure
> > > > + */
> > > > +struct drm_gpuva_gem *
> > > > +drm_gpuva_gem_obtain(struct drm_gpuva_manager *mgr,
> > > > +		     struct drm_gem_object *obj)
> > > > +{
> > > > +	struct drm_gpuva_gem *vm_bo;
> > > > +
> > > > +	vm_bo = drm_gpuva_gem_find(mgr, obj);
> > > > +	if (vm_bo)
> > > > +		return vm_bo;
> > > > +
> > > > +	vm_bo = drm_gpuva_gem_create(mgr, obj);
> > > > +	if (!vm_bo)
> > > > +		return ERR_PTR(-ENOMEM);
> > > > +
> > > > +	return vm_bo;
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_obtain);
> > > > +
> > > > +/**
> > > > + * drm_gpuva_gem_obtain_prealloc() - obtains an instance of the &drm_gpuva_gem
> > > > + * for the given &drm_gpuva_manager and &drm_gem_object
> > > > + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> > > > + * @obj: The &drm_gem_object being mapped in the @mgr.
> > > > + *
> > > > + * Find the &drm_gpuva_gem representing the combination of the given
> > > > + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
> > > > + * count of the found &drm_gpuva_gem accordingly, while the @__vm_bo reference
> > > > + * count is decreased. If not found @__vm_bo is returned.
> > > > + *
> > > > + * Returns: a pointer to the found &drm_gpuva_gem or @__vm_bo if no existing
> > > > + * &drm_gpuva_gem was found
> > > > + */
> > > > +struct drm_gpuva_gem *
> > > > +drm_gpuva_gem_obtain_prealloc(struct drm_gpuva_manager *mgr,
> > > > +			      struct drm_gem_object *obj,
> > > > +			      struct drm_gpuva_gem *__vm_bo)
> > > > +{
> > > > +	struct drm_gpuva_gem *vm_bo;
> > > > +
> > > > +	vm_bo = drm_gpuva_gem_find(mgr, obj);
> > > > +	if (vm_bo) {
> > > > +		drm_gpuva_gem_put(__vm_bo);
> > > > +		return vm_bo;
> > > > +	}
> > > > +
> > > > +	return __vm_bo;
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_obtain_prealloc);
> > > > +
> > > > +static int
> > > > +__drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
> > > > +			  struct drm_gem_object *obj,
> > > > +			  gfp_t gfp)
> > > > +{
> > > > +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
> > > > +	union {
> > > > +		struct drm_gem_object *obj;
> > > > +		uintptr_t index;
> > > > +	} gem;
> > > > +	union {
> > > > +		void *ptr;
> > > > +		uintptr_t cnt;
> > > > +	} ref;
> > > > +	int ret = 0;
> > > > +
> > > > +	gem.obj = obj;
> > > > +	mas_set(&mas, gem.index);
> > > > +
> > > > +	mas_lock(&mas);
> > > > +	ref.ptr = mas_walk(&mas);
> > > > +	if (ref.ptr) {
> > > > +		++ref.cnt;
> > > > +		mas_store(&mas, ref.ptr);
> > > > +	} else {
> > > > +		if (unlikely(!gfp)) {
> > > > +			ret = -EINVAL;
> > > > +			goto out;
> > > > +		}
> > > > +
> > > > +		mas_set(&mas, gem.index);
> > > > +		ref.cnt = 1;
> > > > +		ret = mas_store_gfp(&mas, ref.ptr, gfp);
> > > > +		if (likely(!ret))
> > > > +			drm_gem_object_get(obj);
> > > > +	}
> > > > +out:
> > > > +	mas_unlock(&mas);
> > > > +	return ret;
> > > > +}
> > > > +
> > > > +static void
> > > > +__drm_gpuva_extobj_remove(struct drm_gpuva_manager *mgr,
> > > > +			  struct drm_gem_object *obj)
> > > > +{
> > > > +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
> > > > +	union {
> > > > +		struct drm_gem_object *obj;
> > > > +		uintptr_t index;
> > > > +	} gem;
> > > > +	union {
> > > > +		void *ptr;
> > > > +		uintptr_t cnt;
> > > > +	} ref;
> > > > +
> > > > +	gem.obj = obj;
> > > > +	mas_set(&mas, gem.index);
> > > > +
> > > > +	mas_lock(&mas);
> > > > +	if (unlikely(!(ref.ptr = mas_walk(&mas))))
> > > > +		goto out;
> > > > +
> > > > +	if (!--ref.cnt) {
> > > > +		mas_erase(&mas);
> > > > +		drm_gem_object_put(obj);
> > > > +	} else {
> > > > +		mas_store(&mas, ref.ptr);
> > > > +	}
> > > > +out:
> > > > +	mas_unlock(&mas);
> > > > +}
> > > > +
> > > > +/**
> > > > + * drm_gpuva_extobj_insert - insert an external &drm_gem_object
> > > > + * @mgr: the &drm_gpuva_manager to insert into
> > > > + * @obj: the &drm_gem_object to insert as extobj
> > > > + *
> > > > + * Insert a &drm_gem_object into the &drm_gpuva_managers external object tree.
> > > > + * If the &drm_gem_object already exists in the tree, the reference counter
> > > > + * of this external object is increased by one.
> > > > + *
> > > > + * Drivers should insert the external &drm_gem_object before the dma-fence
> > > > + * signalling critical section, e.g. when submitting the job, and before
> > > > + * locking all &drm_gem_objects of a GPU-VM, e.g. with drm_gpuva_manager_lock()
> > > > + * or its derivatives.
> > > > + *
> > > > + * Returns: 0 on success, negative error code on failure.
> > > > + */
> > > > +int
> > > > +drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
> > > > +			struct drm_gem_object *obj)
> > > > +{
> > > > +	return drm_gpuva_is_extobj(mgr, obj) ?
> > > > +		__drm_gpuva_extobj_insert(mgr, obj, GFP_KERNEL) : 0;
> > > > +
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_insert);
> > > > +
> > > > +/**
> > > > + * drm_gpuva_extobj_get - increase the reference count of an external
> > > > + * &drm_gem_object
> > > > + * @mgr: the &drm_gpuva_manager storing the extobj
> > > > + * @obj: the &drm_gem_object representing the extobj
> > > > + *
> > > > + * Increases the reference count of the extobj represented by @obj.
> > > > + *
> > > > + * Drivers should call this for every &drm_gpuva backed by a &drm_gem_object
> > > > + * being inserted.
> > > > + *
> > > > + * For &drm_gpuva_op_remap operations drivers should make sure to only take an
> > > > + * additional reference if the re-map operation splits an existing &drm_gpuva
> > > > + * into two separate ones.
> > > > + *
> > > > + * See also drm_gpuva_map_get() and drm_gpuva_remap_get().
> > > > + */
> > > > +void
> > > > +drm_gpuva_extobj_get(struct drm_gpuva_manager *mgr,
> > > > +		     struct drm_gem_object *obj)
> > > > +{
> > > > +	if (drm_gpuva_is_extobj(mgr, obj))
> > > > +		WARN(__drm_gpuva_extobj_insert(mgr, obj, 0),
> > > > +		     "Can't increase ref-count of non-existent extobj.");
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_get);
> > > > +
> > > > +/**
> > > > + * drm_gpuva_extobj_put - decrease the reference count of an external
> > > > + * &drm_gem_object
> > > > + * @mgr: the &drm_gpuva_manager storing the extobj
> > > > + * @obj: the &drm_gem_object representing the extobj
> > > > + *
> > > > + * Decreases the reference count of the extobj represented by @obj.
> > > > + *
> > > > + * Drivers should call this for every &drm_gpuva backed by a &drm_gem_object
> > > > + * being removed from the GPU VA space.
> > > > + *
> > > > + * See also drm_gpuva_unmap_put().
> > > > + */
> > > > +void
> > > > +drm_gpuva_extobj_put(struct drm_gpuva_manager *mgr,
> > > > +		     struct drm_gem_object *obj)
> > > > +{
> > > > +	if (drm_gpuva_is_extobj(mgr, obj))
> > > > +		__drm_gpuva_extobj_remove(mgr, obj);
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_put);
> > > > +
> > > > +/**
> > > > + * drm_gpuva_gem_evict() - add / remove a &drm_gem_object to / from a
> > > > + * &drm_gpuva_managers evicted list
> > > > + * @obj: the &drm_gem_object to add or remove
> > > > + * @evict: indicates whether the object is evicted
> > > > + *
> > > > + * Adds a &drm_gem_object to or removes it from the evict list of all
> > > > + * &drm_gpuva_managers containing a mapping of this &drm_gem_object.
> > > > + */
> > > > +void
> > > > +drm_gpuva_gem_evict(struct drm_gem_object *obj, bool evict)
> > > > +{
> > > > +	struct drm_gpuva_gem *vm_bo;
> > > > +
> > > > +	/* Required for iterating the GEMs GPUVA GEM list. If no driver specific
> > > > +	 * lock has been set, the list is protected with the GEMs dma-resv lock.
> > > > +	 */
> > > > +	drm_gem_gpuva_assert_lock_held(obj);
> > > > +
> > > > +	/* Required to protect the GPUVA managers evict list against concurrent
> > > > +	 * access through drm_gpuva_manager_validate(). Concurrent insertions to
> > > > +	 * the evict list through different GEM object evictions are protected
> > > > +	 * by the GPUVA managers evict lock.
> > > > +	 */
> > > > +	dma_resv_assert_held(obj->resv);
> > > > +
> > > > +	drm_gem_for_each_gpuva_gem(vm_bo, obj) {
> > > > +		struct drm_gpuva_manager *mgr = vm_bo->mgr;
> > > > +
> > > > +		spin_lock(&mgr->evict.lock);
> > > > +		if (evict)
> > > > +			list_add_tail(&vm_bo->list.entry.evict,
> > > > +				      &mgr->evict.list);
> > > > +		else
> > > > +			list_del_init(&vm_bo->list.entry.evict);
> > > > +		spin_unlock(&mgr->evict.lock);
> > > > +	}
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_gem_evict);
> > > > +
> > > >    static int
> > > >    __drm_gpuva_insert(struct drm_gpuva_manager *mgr,
> > > >    		   struct drm_gpuva *va)
> > > > @@ -806,15 +1381,20 @@ EXPORT_SYMBOL_GPL(drm_gpuva_remove);
> > > >    /**
> > > >     * drm_gpuva_link() - link a &drm_gpuva
> > > >     * @va: the &drm_gpuva to link
> > > > + * @vm_bo: the &drm_gpuva_gem to add the &drm_gpuva to
> > > >     *
> > > > - * This adds the given &va to the GPU VA list of the &drm_gem_object it is
> > > > - * associated with.
> > > > + * This adds the given &va to the GPU VA list of the &drm_gpuva_gem and the
> > > > + * &drm_gpuva_gem to the &drm_gem_object it is associated with.
> > > > + *
> > > > + * For every &drm_gpuva entry added to the &drm_gpuva_gem an additional
> > > > + * reference of the latter is taken.
> > > >     *
> > > >     * This function expects the caller to protect the GEM's GPUVA list against
> > > > - * concurrent access using the GEMs dma_resv lock.
> > > > + * concurrent access using either the GEMs dma_resv lock or a driver specific
> > > > + * lock set through drm_gem_gpuva_set_lock().
> > > >     */
> > > >    void
> > > > -drm_gpuva_link(struct drm_gpuva *va)
> > > > +drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo)
> > > >    {
> > > >    	struct drm_gem_object *obj = va->gem.obj;
> > > > @@ -823,7 +1403,10 @@ drm_gpuva_link(struct drm_gpuva *va)
> > > >    	drm_gem_gpuva_assert_lock_held(obj);
> > > > -	list_add_tail(&va->gem.entry, &obj->gpuva.list);
> > > > +	drm_gpuva_gem_get(vm_bo);
> > > > +	list_add_tail(&va->gem.entry, &vm_bo->list.gpuva);
> > > > +	if (list_empty(&vm_bo->list.entry.gem))
> > > > +		list_add_tail(&vm_bo->list.entry.gem, &obj->gpuva.list);
> > > >    }
> > > >    EXPORT_SYMBOL_GPL(drm_gpuva_link);
> > > > @@ -834,20 +1417,39 @@ EXPORT_SYMBOL_GPL(drm_gpuva_link);
> > > >     * This removes the given &va from the GPU VA list of the &drm_gem_object it is
> > > >     * associated with.
> > > >     *
> > > > + * This removes the given &va from the GPU VA list of the &drm_gpuva_gem and
> > > > + * the &drm_gpuva_gem from the &drm_gem_object it is associated with in case
> > > > + * this call unlinks the last &drm_gpuva from the &drm_gpuva_gem.
> > > > + *
> > > > + * For every &drm_gpuva entry removed from the &drm_gpuva_gem a reference of
> > > > + * the latter is dropped.
> > > > + *
> > > >     * This function expects the caller to protect the GEM's GPUVA list against
> > > > - * concurrent access using the GEMs dma_resv lock.
> > > > + * concurrent access using either the GEMs dma_resv lock or a driver specific
> > > > + * lock set through drm_gem_gpuva_set_lock().
> > > >     */
> > > >    void
> > > >    drm_gpuva_unlink(struct drm_gpuva *va)
> > > >    {
> > > >    	struct drm_gem_object *obj = va->gem.obj;
> > > > +	struct drm_gpuva_gem *vm_bo;
> > > >    	if (unlikely(!obj))
> > > >    		return;
> > > >    	drm_gem_gpuva_assert_lock_held(obj);
> > > > +	vm_bo = __drm_gpuva_gem_find(va->mgr, obj);
> > > > +	if (WARN(!vm_bo, "GPUVA doesn't seem to be linked.\n"))
> > > > +		return;
> > > > +
> > > >    	list_del_init(&va->gem.entry);
> > > > +
> > > > +	if (list_empty(&vm_bo->list.gpuva)) {
> > > > +		list_del_init(&vm_bo->list.entry.gem);
> > > > +		list_del_init(&vm_bo->list.entry.evict);
> > > > +	}
> > > > +	drm_gpuva_gem_put(vm_bo);
> > > >    }
> > > >    EXPORT_SYMBOL_GPL(drm_gpuva_unlink);
> > > > @@ -977,6 +1579,26 @@ drm_gpuva_map(struct drm_gpuva_manager *mgr,
> > > >    }
> > > >    EXPORT_SYMBOL_GPL(drm_gpuva_map);
> > > > +/**
> > > > + * drm_gpuva_map_get() - helper to insert a &drm_gpuva according to a
> > > > + * &drm_gpuva_op_map
> > > > + * @mgr: the &drm_gpuva_manager
> > > > + * @va: the &drm_gpuva to insert
> > > > + * @op: the &drm_gpuva_op_map to initialize @va with
> > > > + *
> > > > + * Initializes the @va from the @op and inserts it into the given @mgr and
> > > > + * increases the reference count of the corresponding extobj.
> > > > + */
> > > > +void
> > > > +drm_gpuva_map_get(struct drm_gpuva_manager *mgr,
> > > > +		  struct drm_gpuva *va,
> > > > +		  struct drm_gpuva_op_map *op)
> > > > +{
> > > > +	drm_gpuva_map(mgr, va, op);
> > > > +	drm_gpuva_extobj_get(mgr, va->gem.obj);
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_map_get);
> > > > +
> > > >    /**
> > > >     * drm_gpuva_remap() - helper to remap a &drm_gpuva according to a
> > > >     * &drm_gpuva_op_remap
> > > > @@ -992,10 +1614,10 @@ drm_gpuva_remap(struct drm_gpuva *prev,
> > > >    		struct drm_gpuva *next,
> > > >    		struct drm_gpuva_op_remap *op)
> > > >    {
> > > > -	struct drm_gpuva *curr = op->unmap->va;
> > > > -	struct drm_gpuva_manager *mgr = curr->mgr;
> > > > +	struct drm_gpuva *va = op->unmap->va;
> > > > +	struct drm_gpuva_manager *mgr = va->mgr;
> > > > -	drm_gpuva_remove(curr);
> > > > +	drm_gpuva_remove(va);
> > > >    	if (op->prev) {
> > > >    		drm_gpuva_init_from_op(prev, op->prev);
> > > > @@ -1009,6 +1631,31 @@ drm_gpuva_remap(struct drm_gpuva *prev,
> > > >    }
> > > >    EXPORT_SYMBOL_GPL(drm_gpuva_remap);
> > > > +/**
> > > > + * drm_gpuva_remap_get() - helper to remap a &drm_gpuva according to a
> > > > + * &drm_gpuva_op_remap
> > > > + * @prev: the &drm_gpuva to remap when keeping the start of a mapping
> > > > + * @next: the &drm_gpuva to remap when keeping the end of a mapping
> > > > + * @op: the &drm_gpuva_op_remap to initialize @prev and @next with
> > > > + *
> > > > + * Removes the currently mapped &drm_gpuva and remaps it using @prev and/or
> > > > + * @next. Additionally, if the re-map splits the existing &drm_gpuva into two
> > > > + * separate mappings, increases the reference count of the corresponding extobj.
> > > > + */
> > > > +void
> > > > +drm_gpuva_remap_get(struct drm_gpuva *prev,
> > > > +		    struct drm_gpuva *next,
> > > > +		    struct drm_gpuva_op_remap *op)
> > > > +{
> > > > +	struct drm_gpuva *va = op->unmap->va;
> > > > +	struct drm_gpuva_manager *mgr = va->mgr;
> > > > +
> > > > +	drm_gpuva_remap(prev, next, op);
> > > > +	if (op->prev && op->next)
> > > > +		drm_gpuva_extobj_get(mgr, va->gem.obj);
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_remap_get);
> > > > +
> > > >    /**
> > > >     * drm_gpuva_unmap() - helper to remove a &drm_gpuva according to a
> > > >     * &drm_gpuva_op_unmap
> > > > @@ -1023,6 +1670,24 @@ drm_gpuva_unmap(struct drm_gpuva_op_unmap *op)
> > > >    }
> > > >    EXPORT_SYMBOL_GPL(drm_gpuva_unmap);
> > > > +/**
> > > > + * drm_gpuva_unmap_put() - helper to remove a &drm_gpuva according to a
> > > > + * &drm_gpuva_op_unmap
> > > > + * @op: the &drm_gpuva_op_unmap specifying the &drm_gpuva to remove
> > > > + *
> > > > + * Removes the &drm_gpuva associated with the &drm_gpuva_op_unmap and decreases
> > > > + * the reference count of the corresponding extobj.
> > > > + */
> > > > +void
> > > > +drm_gpuva_unmap_put(struct drm_gpuva_op_unmap *op)
> > > > +{
> > > > +	struct drm_gpuva *va = op->va;
> > > > +
> > > > +	drm_gpuva_unmap(op);
> > > > +	drm_gpuva_extobj_put(va->mgr, va->gem.obj);
> > > > +}
> > > > +EXPORT_SYMBOL_GPL(drm_gpuva_unmap_put);
> > > > +
> > > >    static int
> > > >    op_map_cb(const struct drm_gpuva_fn_ops *fn, void *priv,
> > > >    	  u64 addr, u64 range,
> > > > @@ -1663,6 +2328,7 @@ drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
> > > >    {
> > > >    	struct drm_gpuva_ops *ops;
> > > >    	struct drm_gpuva_op *op;
> > > > +	struct drm_gpuva_gem *vm_bo;
> > > >    	struct drm_gpuva *va;
> > > >    	int ret;
> > > > @@ -1674,7 +2340,7 @@ drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
> > > >    	INIT_LIST_HEAD(&ops->list);
> > > > -	drm_gem_for_each_gpuva(va, obj) {
> > > > +	drm_gem_for_each_gpuva(va, vm_bo, mgr, obj) {
> > > >    		op = gpuva_op_alloc(mgr);
> > > >    		if (!op) {
> > > >    			ret = -ENOMEM;
> > > > diff --git a/include/drm/drm_gem.h b/include/drm/drm_gem.h
> > > > index bc9f6aa2f3fe..783ed3ab440d 100644
> > > > --- a/include/drm/drm_gem.h
> > > > +++ b/include/drm/drm_gem.h
> > > > @@ -571,7 +571,7 @@ int drm_gem_evict(struct drm_gem_object *obj);
> > > >     * drm_gem_gpuva_init() - initialize the gpuva list of a GEM object
> > > >     * @obj: the &drm_gem_object
> > > >     *
> > > > - * This initializes the &drm_gem_object's &drm_gpuva list.
> > > > + * This initializes the &drm_gem_object's &drm_gpuva_gem list.
> > > >     *
> > > >     * Calling this function is only necessary for drivers intending to support the
> > > >     * &drm_driver_feature DRIVER_GEM_GPUVA.
> > > > @@ -584,28 +584,44 @@ static inline void drm_gem_gpuva_init(struct drm_gem_object *obj)
> > > >    }
> > > >    /**
> > > > - * drm_gem_for_each_gpuva() - iternator to walk over a list of gpuvas
> > > > - * @entry__: &drm_gpuva structure to assign to in each iteration step
> > > > - * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
> > > > + * drm_gem_for_each_gpuva_gem() - iterator to walk over a list of &drm_gpuva_gem
> > > > + * @entry__: &drm_gpuva_gem structure to assign to in each iteration step
> > > > + * @obj__: the &drm_gem_object the &drm_gpuva_gem to walk are associated with
> > > >     *
> > > > - * This iterator walks over all &drm_gpuva structures associated with the
> > > > - * &drm_gpuva_manager.
> > > > + * This iterator walks over all &drm_gpuva_gem structures associated with the
> > > > + * &drm_gem_object.
> > > >     */
> > > > -#define drm_gem_for_each_gpuva(entry__, obj__) \
> > > > -	list_for_each_entry(entry__, &(obj__)->gpuva.list, gem.entry)
> > > > +#define drm_gem_for_each_gpuva_gem(entry__, obj__) \
> > > > +	list_for_each_entry(entry__, &(obj__)->gpuva.list, list.entry.gem)
> > > >    /**
> > > > - * drm_gem_for_each_gpuva_safe() - iternator to safely walk over a list of
> > > > - * gpuvas
> > > > - * @entry__: &drm_gpuva structure to assign to in each iteration step
> > > > - * @next__: &next &drm_gpuva to store the next step
> > > > - * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
> > > > + * drm_gem_for_each_gpuva_gem_safe() - iterator to safely walk over a list of
> > > > + * &drm_gpuva_gem
> > > > + * @entry__: &drm_gpuva_gem structure to assign to in each iteration step
> > > > + * @next__: &next &drm_gpuva_gem to store the next step
> > > > + * @obj__: the &drm_gem_object the &drm_gpuva_gem to walk are associated with
> > > >     *
> > > > - * This iterator walks over all &drm_gpuva structures associated with the
> > > > + * This iterator walks over all &drm_gpuva_gem structures associated with the
> > > >     * &drm_gem_object. It is implemented with list_for_each_entry_safe(), hence
> > > >     * it is save against removal of elements.
> > > >     */
> > > > -#define drm_gem_for_each_gpuva_safe(entry__, next__, obj__) \
> > > > -	list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list, gem.entry)
> > > > +#define drm_gem_for_each_gpuva_gem_safe(entry__, next__, obj__) \
> > > > +	list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list, list.entry.gem)
> > > > +
> > > > +/**
> > > > + * drm_gem_for_each_gpuva() - iterator to walk over a list of &drm_gpuva
> > > > + * @va__: &drm_gpuva structure to assign to in each iteration step
> > > > + * @vm_bo__: the &drm_gpuva_gem representing the @mgr__ and @obj__ combination
> > > > + * @mgr__: the &drm_gpuva_manager the &drm_gpuvas to walk are associated with
> > > > + * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
> > > > + *
> > > > + * This iterator walks over all &drm_gpuva structures associated with the
> > > > + * &drm_gpuva_manager and &drm_gem_object.
> > > > + */
> > > > +#define drm_gem_for_each_gpuva(va__, vm_bo__, mgr__, obj__) \
> > > > +	for (vm_bo__ = drm_gpuva_gem_find(mgr__, obj__), \
> > > > +	     va__ = vm_bo__ ? list_first_entry(&vm_bo__->list.gpuva, typeof(*va__), gem.entry) : NULL; \
> > > > +	     va__ && !list_entry_is_head(va__, &vm_bo__->list.gpuva, gem.entry); \
> > > > +	     va__ = list_next_entry(va__, gem.entry))
> > > >    #endif /* __DRM_GEM_H__ */
> > > > diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h
> > > > index ed8d50200cc3..693e2da3f425 100644
> > > > --- a/include/drm/drm_gpuva_mgr.h
> > > > +++ b/include/drm/drm_gpuva_mgr.h
> > > > @@ -26,12 +26,16 @@
> > > >     */
> > > >    #include <linux/list.h>
> > > > +#include <linux/dma-resv.h>
> > > > +#include <linux/maple_tree.h>
> > > >    #include <linux/rbtree.h>
> > > >    #include <linux/types.h>
> > > >    #include <drm/drm_gem.h>
> > > > +#include <drm/drm_exec.h>
> > > >    struct drm_gpuva_manager;
> > > > +struct drm_gpuva_gem;
> > > >    struct drm_gpuva_fn_ops;
> > > >    /**
> > > > @@ -140,7 +144,7 @@ struct drm_gpuva {
> > > >    int drm_gpuva_insert(struct drm_gpuva_manager *mgr, struct drm_gpuva *va);
> > > >    void drm_gpuva_remove(struct drm_gpuva *va);
> > > > -void drm_gpuva_link(struct drm_gpuva *va);
> > > > +void drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo);
> > > >    void drm_gpuva_unlink(struct drm_gpuva *va);
> > > >    struct drm_gpuva *drm_gpuva_find(struct drm_gpuva_manager *mgr,
> > > > @@ -240,15 +244,137 @@ struct drm_gpuva_manager {
> > > >    	 * @ops: &drm_gpuva_fn_ops providing the split/merge steps to drivers
> > > >    	 */
> > > >    	const struct drm_gpuva_fn_ops *ops;
> > > > +
> > > > +	/**
> > > > +	 * @d_obj: Dummy GEM object; used internally to pass the GPU VMs
> > > > +	 * dma-resv to &drm_exec.
> > > > +	 */
> > > > +	struct drm_gem_object d_obj;
> > > > +
> > > > +	/**
> > > > +	 * @resv: the &dma_resv for &drm_gem_objects mapped in this GPU VA
> > > > +	 * space
> > > > +	 */
> > > > +	struct dma_resv *resv;
> > > > +
> > > > +	/**
> > > > +	 * @exec: the &drm_exec helper to lock external &drm_gem_objects
> > > > +	 */
> > > > +	struct drm_exec exec;
> > > > +
> > > > +	/**
> > > > +	 * @mt_ext: &maple_tree storing external &drm_gem_objects
> > > > +	 */
> > > > +	struct maple_tree mt_ext;
> > > Why are you using a maple tree here? Insertion and removal is O(log(n))
> > > instead of O(1) for a list?
> > > 
> > Having a list of drm_gem_objects directly wouldn't work, as multiple GPU-VMs
> > could have mappings of the same extobj.
> > 
> > I considered using the VM_BO abstraction (struct drm_gpuva_gem) as list entry
> > instead, which also seems to be the obvious choice. However, there is a locking
> > conflict.
> > 
> > A drm_gem_object keeps a list of drm_gpuva_gems, while each drm_gpuva_gem keeps
> > a list of drm_gpuvas. Both lists are either protected with the dma-resv lock of
> > the corresponding drm_gem_object, or with an external lock provided by the
> > driver (see drm_gem_gpuva_set_lock()). The latter is used by drivers performing
> > changes on the GPUVA space directly from the fence signalling path.
> > 
> > Now, similar to what drm_gpuva_link() and drm_gpuva_unlink() are doing already,
> > we'd want to add a drm_gpuva_gem to the extobj list for the first mapping being
> > linked and we'd want to remove it for the last one being unlinked.
> > 
> > (Actually we'd want to add the drm_gpuva_gem object to the extobj list even
> > before, because otherwise we wouldn't acquire this GEM object's dma-resv lock
> > through drm_gpuva_manager_lock(). But that's trivial, we could do that when we
> > create the drm_gpuva_gem, which we need to do anyway.)
> > 
> > Anyway, we'd probably want to keep removing the drm_gpuva_gem from the extobj
> > list from drm_gpuva_unlink() when the last mapping of this BO is unlinked. In
> > order to do so, we'd (as discussed above) either need to hold the outer GPU-VM
> > lock or the GPU-VMs dma-resv lock. Both would be illegal in the case
> > drm_gpuva_unlink() is called from within the fence signalling path. For drivers
> > like XE or Nouveau, we'd at least need to make sure to not mess up the locking
> > hierarchy of GPU-VM lock and dma-resv lock of the corresponding BO.
> > 
> > Considering all that, I thought it's probably better to track extobjs separate
> > from the drm_gpuva_gem, hence the maple tree choice.
> 
> Hm. OK, in Xe we have a list of the xe_vmas (drm_gpuvas) that point to
> external objects, or in the case of multiple mappings to the same gem
> object, only one of the drm_gpuvas is in the list. These are protected by
> the GPU-VM lock. I don't see a problem with removing those from the fence
> signalling path, though?

I intentionally tried to avoid keeping a list of drm_gpuvas to track extobjs,
since this is generic code, I don't know how many mappings of an external object
the corresponding driver potentially creates. This could become a pretty large
list to iterate. Another reason was that I want to keep the drm_gpuva structure
as small as possible, hence avoiding another list_head.

Now, it sounds like in XE you're doing some kind of optimization just keeping a
single mapping of an extobj in the list? How do you know when to remove it? What
if the mapping from the extobj list gets unmapped, but there is still another
one left in the GPU-VM being backed by the same BO?

> 
> Although, assuming that's a no-go for GPUVA, wouldn't an XArray be a better
> choice, keeping O(1)?

When tracking extobjs, the address of the drm_gem_object is the key while the
reference count is the value. I was thinking of an XArray as well, but I was
worried that the corresponding indices could be too sparsely distributed for an
XArray to still be efficient. Now that I think about it, it's probably not that
bad.
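
Just to sketch the idea (purely illustrative, xa_ext standing in for mt_ext and
the function name made up), the XArray variant would boil down to something
like:

static int extobj_xa_track(struct drm_gpuva_manager *mgr,
			   struct drm_gem_object *obj)
{
	unsigned long index = (unsigned long)obj;
	void *entry;
	int ret = 0;

	xa_lock(&mgr->xa_ext);
	entry = xa_load(&mgr->xa_ext, index);
	if (entry) {
		/* Already tracked, just bump the stored reference count. */
		__xa_store(&mgr->xa_ext, index,
			   xa_mk_value(xa_to_value(entry) + 1), GFP_ATOMIC);
	} else {
		/* First mapping of this extobj in the GPU-VM. */
		entry = __xa_store(&mgr->xa_ext, index, xa_mk_value(1),
				   GFP_ATOMIC);
		if (xa_is_err(entry))
			ret = xa_err(entry);
		else
			drm_gem_object_get(obj);
	}
	xa_unlock(&mgr->xa_ext);

	return ret;
}

The indices would still be GEM object addresses though, so it depends on how
well the XArray copes with such sparse indices.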

Btw., while I agree with trying to make things as efficient as possible, what is
the magnitude of extobjs to be tracked? Do we need to worry about the O(log(n))?

> 
> > 
> > > > +
> > > > +	/**
> > > > +	 * @evict: structure holding the evict list and evict list lock
> > > > +	 */
> > > > +	struct {
> > > > +		/**
> > > > +		 * @list: &list_head storing &drm_gem_objects currently being
> > > > +		 * evicted
> > > > +		 */
> > > > +		struct list_head list;
> > > > +
> > > > +		/**
> > > > +		 * @lock: spinlock to protect the evict list against concurrent
> > > > +		 * insertion / removal of different &drm_gpuva_gems
> > > > +		 */
> > > > +		spinlock_t lock;
> > > > +	} evict;
> > > >    };
> > > >    void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> > > > +			    struct drm_device *drm,
> > > >    			    const char *name,
> > > >    			    u64 start_offset, u64 range,
> > > >    			    u64 reserve_offset, u64 reserve_range,
> > > >    			    const struct drm_gpuva_fn_ops *ops);
> > > >    void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr);
> > > > +/**
> > > > + * DRM_GPUVA_EXEC - returns the &drm_gpuva_managers &drm_exec instance
> > > > + * @mgr: the &drm_gpuva_managers to return the &drm_exec instance for
> > > > + */
> > > > +#define DRM_GPUVA_EXEC(mgr)	&(mgr)->exec
> > > A struct ww_acquire_ctx and thus a drm_exec is fundamentally per task and
> > > should typically be allocated on the stack. Otherwise you'd need to protect
> > > the mgr->exec member with an exclusive lock throughout the locking process,
> > > and that's not what we want.
> > Oh, good point. I think it works in Nouveau, because there it's implicitly
> > protected with the job submission lock.
> > 
> > > Did you consider subclassing a drm_exec for drm_gpuva purposes and add
> > > needed ops to it: Like so:
> > That's a good idea, will take this into V2.
> 
> Actually, I'm not fully sure that was a good idea: I now have a working
> version of Xe ported over to drm_exec, having these helpers in mind and with
> the intention to start using them as they mature. What I found, though, is
> that open-coding the drm_exec loop is not all that bad, but that building
> blocks that can be called from within the loop are useful:
> 
> Like the drm_gpuva_prepare_objects() and an imaginary
> drm_gpuva_prepare_gpuva() that locks the vm resv and the resv of the object
> (if different) that the gpuva points to. And drm_gpuva_prepare_array(),
> although we don't use it within Xe. That means you can use these building
> blocks like helpers and avoid the fn() callback by open-coding instead.
> 
> But I guess YMMV.

That's exactly why those building blocks are exported; I already had in mind
that there might be drivers which still want to open-code the drm_exec loop,
while others might just want a simple interface to lock everything.

I still think it is a good idea, but I'd keep that as simple as possible. For
everything else just let the driver open-code it and use the "building
blocks" - I will also expand the building blocks to what you mentioned above.

> 
> > 
> > > struct drm_gpuva_exec_ops {
> > >      int (*fn) (struct drm_gpuva_exec *exec, int num_fences);
> > Is this the fn argument from drm_gpuva_manager_lock_extra()?
> > 
> > >      int (*bo_validate) (struct drm_gpuva_exec *exec, struct drm_gem_object
> > > *obj);
> > I guess we could also keep that within the drm_gpuva_fn_ops? This should always
> > be the same callback, right?
> > 
> > > };
> > > 
> > > struct drm_gpuva_exec {
> > >      const struct drm_gpuva_exec_ops *ops;
> > >      struct drm_exec exec;
> > >      struct drm_gpuva_manager *mgr;
> > > };
> > > 
> > > Although I'd actually expect bo_validate to be part of fn in the typical
> > > case. The drm_gpuva_exec would then be allocated by the caller on the stack.
> > This doesn't sound like my assumption about fn() above is correct.
> 
> Well, one important thing in our conversion is that ttm_bo_validate() needs
> to be in the until_all_locked() loop. We want to soon be able to use
> sleeping locks for eviction, so a xe_bo_validate() would, at least
> temporarily, add locked objects to the drm_exec list of locked objects. That
> means everything that may end up calling validate deep within the call chain
> needs to be part of the until_all_locked() loop, so our
> drm_gpuva_manager_lock_extra() fn callback would include those validates and
> look different all the time. Hence that's why open-coding isn't all that
> bad...

Oh, I see. You indeed want to call validate() from within until_all_locked().
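
So for that case I'd picture the open-coded body rather like this (sketch;
assuming the driver's bo_validate() implementation adds the objects it locks
while evicting to the same drm_exec, as you describe for xe_bo_validate()):

	drm_exec_until_all_locked(exec) {
		ret = drm_gpuva_manager_prepare_objects(mgr, num_fences);
		drm_exec_retry_on_contention(exec);
		if (ret)
			goto err;

		/* May lock (and add) further objects while evicting. */
		ret = drm_gpuva_manager_validate(mgr);
		drm_exec_retry_on_contention(exec);
		if (ret)
			goto err;
	}

That way a contention hit from a validate-triggered eviction just rolls back
and retries the whole locking loop.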

> 
> /Thomas
> 
> 
> > 
> > > 
> > > > +
> > > > +int drm_gpuva_manager_lock_extra(struct drm_gpuva_manager *mgr,
> > > > +				 int (*fn)(struct drm_gpuva_manager *mgr,
> > > > +					   void *priv, unsigned int num_fences),
> > > > +				 void *priv,
> > > > +				 unsigned int num_fences,
> > > > +				 bool interruptible);
> > > > +
> > > > +int drm_gpuva_manager_lock_array(struct drm_gpuva_manager *mgr,
> > > > +				 struct drm_gem_object **objs,
> > > > +				 unsigned int num_objs,
> > > > +				 unsigned int num_fences,
> > > > +				 bool interruptible);
> > > > +
> > > > +/**
> > > > + * drm_gpuva_manager_lock() - lock all dma-resv of all associated BOs
> > > > + * @mgr: the &drm_gpuva_manager
> > > > + * @num_fences: the amount of &dma_fences to reserve
> > > > + * @interruptible: sleep interruptible if waiting
> > > > + *
> > > > + * Acquires all dma-resv locks of all &drm_gem_objects the given
> > > > + * &drm_gpuva_manager contains mappings of.
> > > > + *
> > > > + * Returns: 0 on success, negative error code on failure.
> > > > + */
> > > > +static inline int
> > > > +drm_gpuva_manager_lock(struct drm_gpuva_manager *mgr,
> > > > +		       unsigned int num_fences,
> > > > +		       bool interruptible)
> > > > +{
> > > > +	return drm_gpuva_manager_lock_extra(mgr, NULL, NULL, num_fences,
> > > > +					    interruptible);
> > > > +}
> > > > +
> > > > +/**
> > > > + * drm_gpuva_manager_unlock() - unlock all dma-resv of all associated BOs
> > > > + * @mgr: the &drm_gpuva_manager
> > > > + *
> > > > + * Releases all dma-resv locks of all &drm_gem_objects previously acquired
> > > > + * through drm_gpuva_manager_lock() or its variants.
> > > > + */
> > > > +static inline void
> > > > +drm_gpuva_manager_unlock(struct drm_gpuva_manager *mgr)
> > > > +{
> > > > +	drm_exec_fini(&mgr->exec);
> > > > +}
> > > > +
> > > > +int drm_gpuva_manager_validate(struct drm_gpuva_manager *mgr);
> > > > +void drm_gpuva_manager_resv_add_fence(struct drm_gpuva_manager *mgr,
> > > > +				      struct dma_fence *fence,
> > > > +				      enum dma_resv_usage private_usage,
> > > > +				      enum dma_resv_usage extobj_usage);
> > > > +
> > > > +int drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
> > > > +			    struct drm_gem_object *obj);
> > > > +void drm_gpuva_extobj_get(struct drm_gpuva_manager *mgr,
> > > > +			  struct drm_gem_object *obj);
> > > > +void drm_gpuva_extobj_put(struct drm_gpuva_manager *mgr,
> > > > +			  struct drm_gem_object *obj);
> > > > +
> > > > +/**
> > > > + * drm_gpuva_is_extobj() - indicates whether the given &drm_gem_object is an
> > > > + * external object
> > > > + * @mgr: the &drm_gpuva_manager to check
> > > > + * @obj: the &drm_gem_object to check
> > > > + *
> > > > + * Returns: true if the &drm_gem_object &dma_resv differs from the
> > > > + * &drm_gpuva_managers &dma_resv, false otherwise
> > > > + */
> > > > +static inline bool drm_gpuva_is_extobj(struct drm_gpuva_manager *mgr,
> > > > +				       struct drm_gem_object *obj)
> > > > +{
> > > > +	return obj && obj->resv != mgr->resv;
> > > > +}
> > > > +
> > > >    static inline struct drm_gpuva *
> > > >    __drm_gpuva_next(struct drm_gpuva *va)
> > > >    {
> > > > @@ -327,6 +453,138 @@ __drm_gpuva_next(struct drm_gpuva *va)
> > > >    #define drm_gpuva_for_each_va_safe(va__, next__, mgr__) \
> > > >    	list_for_each_entry_safe(va__, next__, &(mgr__)->rb.list, rb.entry)
> > > > +/**
> > > > + * struct drm_gpuva_gem - structure representing a &drm_gpuva_manager and
> > > > + * &drm_gem_object combination
> > > > + *
> > > > + * This structure is an abstraction representing a &drm_gpuva_manager and
> > > > + * &drm_gem_object combination. It serves as an indirection to accelerate
> > > > + * iterating all &drm_gpuvas within a &drm_gpuva_manager backed by the same
> > > > + * &drm_gem_object.
> > > > + *
> > > > + * Furthermore it is used to cache evicted GEM objects for a certain GPU-VM to
> > > > + * accelerate validation.
> > > > + *
> > > > + * Typically, drivers want to create an instance of a struct drm_gpuva_gem once
> > > > + * a GEM object is mapped first in a GPU-VM and release the instance once the
> > > > + * last mapping of the GEM object in this GPU-VM is unmapped.
> > > > + */
> > > > +struct drm_gpuva_gem {
> > > > +
> > > > +	/**
> > > > +	 * @mgr: The &drm_gpuva_manager the @obj is mapped in.
> > > > +	 */
> > > > +	struct drm_gpuva_manager *mgr;
> > > > +
> > > > +	/**
> > > > +	 * @obj: The &drm_gem_object being mapped in the @mgr.
> > > > +	 */
> > > > +	struct drm_gem_object *obj;
> > > > +
> > > > +	/**
> > > > +	 * @kref: The reference count for this &drm_gpuva_gem.
> > > > +	 */
> > > > +	struct kref kref;
> > > > +
> > > > +	/**
> > > > +	 * @list: Structure containing all &list_heads.
> > > > +	 */
> > > > +	struct {
> > > > +		/**
> > > > +		 * @gpuva: The list of linked &drm_gpuvas.
> > > > +		 */
> > > > +		struct list_head gpuva;
> > > > +
> > > > +		/**
> > > > +		 * @entry: Structure containing all &list_heads serving as
> > > > +		 * entry.
> > > > +		 */
> > > > +		struct {
> > > > +			/**
> > > > +			 * @gem: List entry to attach to the &drm_gem_objects
> > > > +			 * gpuva list.
> > > > +			 */
> > > > +			struct list_head gem;
> > > > +
> > > > +			/**
> > > > +			 * @evict: List entry to attach to the
> > > > +			 * &drm_gpuva_managers evict list.
> > > > +			 */
> > > > +			struct list_head evict;
> > > > +		} entry;
> > > > +	} list;
> > > > +};
> > > > +
> > > > +struct drm_gpuva_gem *
> > > > +drm_gpuva_gem_obtain(struct drm_gpuva_manager *mgr,
> > > > +		     struct drm_gem_object *obj);
> > > > +struct drm_gpuva_gem *
> > > > +drm_gpuva_gem_obtain_prealloc(struct drm_gpuva_manager *mgr,
> > > > +			      struct drm_gem_object *obj,
> > > > +			      struct drm_gpuva_gem *__vm_bo);
> > > > +
> > > > +struct drm_gpuva_gem *
> > > > +drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
> > > > +		   struct drm_gem_object *obj);
> > > > +
> > > > +void drm_gpuva_gem_evict(struct drm_gem_object *obj, bool evict);
> > > > +
> > > > +struct drm_gpuva_gem *
> > > > +drm_gpuva_gem_create(struct drm_gpuva_manager *mgr,
> > > > +		     struct drm_gem_object *obj);
> > > > +void drm_gpuva_gem_destroy(struct kref *kref);
> > > > +
> > > > +/**
> > > > + * drm_gpuva_gem_get() - acquire a struct drm_gpuva_gem reference
> > > > + * @vm_bo: the &drm_gpuva_gem to acquire the reference of
> > > > + *
> > > > + * This function acquires an additional reference to @vm_bo. It is illegal to
> > > > + * call this without already holding a reference. No locks required.
> > > > + */
> > > > +static inline struct drm_gpuva_gem *
> > > > +drm_gpuva_gem_get(struct drm_gpuva_gem *vm_bo)
> > > > +{
> > > > +	kref_get(&vm_bo->kref);
> > > > +	return vm_bo;
> > > > +}
> > > > +
> > > > +/**
> > > > + * drm_gpuva_gem_put() - drop a struct drm_gpuva_gem reference
> > > > + * @vm_bo: the &drm_gpuva_gem to release the reference of
> > > > + *
> > > > + * This releases a reference to @vm_bo.
> > > > + */
> > > > +static inline void
> > > > +drm_gpuva_gem_put(struct drm_gpuva_gem *vm_bo)
> > > > +{
> > > > +	kref_put(&vm_bo->kref, drm_gpuva_gem_destroy);
> > > > +}
> > > > +
> > > > +/**
> > > > + * drm_gpuva_gem_for_each_va() - iterator to walk over a list of &drm_gpuva
> > > > + * @va__: &drm_gpuva structure to assign to in each iteration step
> > > > + * @vm_bo__: the &drm_gpuva_gem the &drm_gpuva to walk are associated with
> > > > + *
> > > > + * This iterator walks over all &drm_gpuva structures associated with the
> > > > + * &drm_gpuva_gem.
> > > > + */
> > > > +#define drm_gpuva_gem_for_each_va(va__, vm_bo__) \
> > > > +	list_for_each_entry(va__, &(vm_bo__)->list.gpuva, gem.entry)
> > > > +
> > > > +/**
> > > > + * drm_gpuva_gem_for_each_va_safe() - iterator to safely walk over a list of
> > > > + * &drm_gpuva
> > > > + * @va__: &drm_gpuva structure to assign to in each iteration step
> > > > + * @next__: &next &drm_gpuva to store the next step
> > > > + * @vm_bo__: the &drm_gpuva_gem the &drm_gpuva to walk are associated with
> > > > + *
> > > > + * This iterator walks over all &drm_gpuva structures associated with the
> > > > + * &drm_gpuva_gem. It is implemented with list_for_each_entry_safe(), hence
> > > > + * it is safe against removal of elements.
> > > > + */
> > > > +#define drm_gpuva_gem_for_each_va_safe(va__, next__, vm_bo__) \
> > > > +	list_for_each_entry_safe(va__, next__, &(vm_bo__)->list.gpuva, gem.entry)
> > > > +
> > > >    /**
> > > >     * enum drm_gpuva_op_type - GPU VA operation type
> > > >     *
> > > > @@ -641,6 +899,30 @@ struct drm_gpuva_fn_ops {
> > > >    	 */
> > > >    	void (*op_free)(struct drm_gpuva_op *op);
> > > > +	/**
> > > > +	 * @vm_bo_alloc: called when the &drm_gpuva_manager allocates
> > > > +	 * a struct drm_gpuva_gem
> > > > +	 *
> > > > +	 * Some drivers may want to embed struct drm_gpuva_gem into driver
> > > > +	 * specific structures. By implementing this callback drivers can
> > > > +	 * allocate memory accordingly.
> > > > +	 *
> > > > +	 * This callback is optional.
> > > > +	 */
> > > > +	struct drm_gpuva_gem *(*vm_bo_alloc)(void);
> > > > +
> > > > +	/**
> > > > +	 * @vm_bo_free: called when the &drm_gpuva_manager frees a
> > > > +	 * struct drm_gpuva_gem
> > > > +	 *
> > > > +	 * Some drivers may want to embed struct drm_gpuva_gem into driver
> > > > +	 * specific structures. By implementing this callback drivers can
> > > > +	 * free the previously allocated memory accordingly.
> > > > +	 *
> > > > +	 * This callback is optional.
> > > > +	 */
> > > > +	void (*vm_bo_free)(struct drm_gpuva_gem *vm_bo);
> > > > +
> > > >    	/**
> > > >    	 * @sm_step_map: called from &drm_gpuva_sm_map to finally insert the
> > > >    	 * mapping once all previous steps were completed
> > > > @@ -684,6 +966,17 @@ struct drm_gpuva_fn_ops {
> > > >    	 * used.
> > > >    	 */
> > > >    	int (*sm_step_unmap)(struct drm_gpuva_op *op, void *priv);
> > > > +
> > > > +	/**
> > > > +	 * @bo_validate: called from drm_gpuva_manager_validate()
> > > > +	 *
> > > > +	 * Drivers receive this callback for every evicted &drm_gem_object being
> > > > +	 * mapped in the corresponding &drm_gpuva_manager.
> > > > +	 *
> > > > +	 * Typically, drivers would call their driver specific variant of
> > > > +	 * ttm_bo_validate() from within this callback.
> > > > +	 */
> > > > +	int (*bo_validate)(struct drm_gem_object *obj);
> > > >    };
> > > >    int drm_gpuva_sm_map(struct drm_gpuva_manager *mgr, void *priv,
> > > > @@ -696,11 +989,18 @@ int drm_gpuva_sm_unmap(struct drm_gpuva_manager *mgr, void *priv,
> > > >    void drm_gpuva_map(struct drm_gpuva_manager *mgr,
> > > >    		   struct drm_gpuva *va,
> > > >    		   struct drm_gpuva_op_map *op);
> > > > +void drm_gpuva_map_get(struct drm_gpuva_manager *mgr,
> > > > +		       struct drm_gpuva *va,
> > > > +		       struct drm_gpuva_op_map *op);
> > > >    void drm_gpuva_remap(struct drm_gpuva *prev,
> > > >    		     struct drm_gpuva *next,
> > > >    		     struct drm_gpuva_op_remap *op);
> > > > +void drm_gpuva_remap_get(struct drm_gpuva *prev,
> > > > +			 struct drm_gpuva *next,
> > > > +			 struct drm_gpuva_op_remap *op);
> > > >    void drm_gpuva_unmap(struct drm_gpuva_op_unmap *op);
> > > > +void drm_gpuva_unmap_put(struct drm_gpuva_op_unmap *op);
> > > >    #endif /* __DRM_GPUVA_MGR_H__ */
> 


^ permalink raw reply	[flat|nested] 88+ messages in thread

* Re: [Nouveau] [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
  2023-08-30 15:00           ` Danilo Krummrich
  (?)
@ 2023-08-31  9:04             ` Thomas Hellström (Intel)
  -1 siblings, 0 replies; 88+ messages in thread
From: Thomas Hellström (Intel) @ 2023-08-31  9:04 UTC (permalink / raw)
  To: Danilo Krummrich
  Cc: matthew.brost, thomas.hellstrom, sarah.walker, nouveau,
	dri-devel, linux-kernel, Liam.Howlett, boris.brezillon,
	donald.robson, daniel, christian.koenig, faith.ekstrand, bskeggs

Hi!

On 8/30/23 17:00, Danilo Krummrich wrote:
> On Wed, Aug 30, 2023 at 03:42:08PM +0200, Thomas Hellström (Intel) wrote:
>> On 8/30/23 14:49, Danilo Krummrich wrote:
>>> Hi Thomas,
>>>
>>> thanks for having a look!
>>>
>>> On Wed, Aug 30, 2023 at 09:27:45AM +0200, Thomas Hellström (Intel) wrote:
>>>> Hi, Danilo.
>>>>
>>>> Some quick comments since I'm doing some Xe work in this area. Will probably
>>>> get back with more.
>>>>
>>>> On 8/20/23 23:53, Danilo Krummrich wrote:
>>>>> So far the DRM GPUVA manager offers common infrastructure to track GPU VA
>>>>> allocations and mappings, generically connect GPU VA mappings to their
>>>>> backing buffers and perform more complex mapping operations on the GPU VA
>>>>> space.
>>>>>
>>>>> However, there are more design patterns commonly used by drivers, which
>>>>> can potentially be generalized in order to make the DRM GPUVA manager
>>>>> represent a basic GPU-VM implementation. In this context, this patch aims
>>>>> at generalizing the following elements.
>>>>>
>>>>> 1) Provide a common dma-resv for GEM objects not being used outside of
>>>>>       this GPU-VM.
>>>>>
>>>>> 2) Provide tracking of external GEM objects (GEM objects which are
>>>>>       shared with other GPU-VMs).
>>>>>
>>>>> 3) Provide functions to efficiently lock all GEM objects dma-resv the
>>>>>       GPU-VM contains mappings of.
>>>>>
>>>>> 4) Provide tracking of evicted GEM objects the GPU-VM contains mappings
>>>>>       of, such that validation of evicted GEM objects is accelerated.
>>>>>
>>>>> 5) Provide some convinience functions for common patterns.
>>>>>
>>>>> Rather than being designed as a "framework", the target is to make all
>>>>> features appear as a collection of optional helper functions, such that
>>>>> drivers are free to make use of the DRM GPUVA managers basic
>>>>> functionality and opt-in for other features without setting any feature
>>>>> flags, just by making use of the corresponding functions.
>>>>>
>>>>> Signed-off-by: Danilo Krummrich <dakr@redhat.com>
>>>>> ---
>>>>>     drivers/gpu/drm/drm_gpuva_mgr.c | 688 +++++++++++++++++++++++++++++++-
>>>>>     include/drm/drm_gem.h           |  48 ++-
>>>>>     include/drm/drm_gpuva_mgr.h     | 302 +++++++++++++-
>>>>>     3 files changed, 1010 insertions(+), 28 deletions(-)
>>>>>
>>>>> diff --git a/drivers/gpu/drm/drm_gpuva_mgr.c b/drivers/gpu/drm/drm_gpuva_mgr.c
>>>>> index f86bfad74ff8..69872b205961 100644
>>>>> --- a/drivers/gpu/drm/drm_gpuva_mgr.c
>>>>> +++ b/drivers/gpu/drm/drm_gpuva_mgr.c
>>>>> @@ -655,6 +655,7 @@ drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
>>>>>     /**
>>>>>      * drm_gpuva_manager_init() - initialize a &drm_gpuva_manager
>>>>>      * @mgr: pointer to the &drm_gpuva_manager to initialize
>>>>> + * @drm: the drivers &drm_device
>>>>>      * @name: the name of the GPU VA space
>>>>>      * @start_offset: the start offset of the GPU VA space
>>>>>      * @range: the size of the GPU VA space
>>>>> @@ -669,6 +670,7 @@ drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
>>>>>      */
>>>>>     void
>>>>>     drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
>>>>> +		       struct drm_device *drm,
>>>>>     		       const char *name,
>>>>>     		       u64 start_offset, u64 range,
>>>>>     		       u64 reserve_offset, u64 reserve_range,
>>>>> @@ -677,6 +679,11 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
>>>>>     	mgr->rb.tree = RB_ROOT_CACHED;
>>>>>     	INIT_LIST_HEAD(&mgr->rb.list);
>>>>> +	mt_init(&mgr->mt_ext);
>>>>> +
>>>>> +	INIT_LIST_HEAD(&mgr->evict.list);
>>>>> +	spin_lock_init(&mgr->evict.lock);
>>>>> +
>>>>>     	drm_gpuva_check_overflow(start_offset, range);
>>>>>     	mgr->mm_start = start_offset;
>>>>>     	mgr->mm_range = range;
>>>>> @@ -694,6 +701,9 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
>>>>>     						     reserve_range)))
>>>>>     			__drm_gpuva_insert(mgr, &mgr->kernel_alloc_node);
>>>>>     	}
>>>>> +
>>>>> +	drm_gem_private_object_init(drm, &mgr->d_obj, 0);
>>>>> +	mgr->resv = mgr->d_obj.resv;
>>>>>     }
>>>>>     EXPORT_SYMBOL_GPL(drm_gpuva_manager_init);
>>>>> @@ -713,10 +723,575 @@ drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr)
>>>>>     		__drm_gpuva_remove(&mgr->kernel_alloc_node);
>>>>>     	WARN(!RB_EMPTY_ROOT(&mgr->rb.tree.rb_root),
>>>>> -	     "GPUVA tree is not empty, potentially leaking memory.");
>>>>> +	     "GPUVA tree is not empty, potentially leaking memory.\n");
>>>>> +
>>>>> +	mtree_destroy(&mgr->mt_ext);
>>>>> +	WARN(!list_empty(&mgr->evict.list), "Evict list should be empty.\n");
>>>>> +
>>>>> +	drm_gem_private_object_fini(&mgr->d_obj);
>>>>>     }
>>>>>     EXPORT_SYMBOL_GPL(drm_gpuva_manager_destroy);
>>>>> +/**
>>>>> + * drm_gpuva_manager_prepare_objects() - prepare all associated BOs
>>>>> + * @mgr: the &drm_gpuva_manager
>>>>> + * @num_fences: the amount of &dma_fences to reserve
>>>>> + *
>>>>> + * Calls drm_exec_prepare_obj() for all &drm_gem_objects the given
>>>>> + * &drm_gpuva_manager contains mappings of.
>>>>> + *
>>>>> + * Drivers can obtain the corresponding &drm_exec instance through
>>>>> + * DRM_GPUVA_EXEC(). It is the drivers responsibility to call drm_exec_init()
>>>>> + * and drm_exec_fini() accordingly.
>>>>> + *
>>>>> + * Returns: 0 on success, negative error code on failure.
>>>>> + */
>>>>> +int
>>>>> +drm_gpuva_manager_prepare_objects(struct drm_gpuva_manager *mgr,
>>>>> +				  unsigned int num_fences)
>>>>> +{
>>>>> +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
>>>>> +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
>>>>> +	union {
>>>>> +		void *ptr;
>>>>> +		uintptr_t cnt;
>>>>> +	} ref;
>>>>> +	int ret;
>>>>> +
>>>>> +	ret = drm_exec_prepare_obj(exec, &mgr->d_obj, num_fences);
>>>>> +	if (ret)
>>>>> +		goto out;
>>>>> +
>>>>> +	rcu_read_lock();
>>>> In xe we're protecting the external object list with an outer lock (same as
>>>> protecting the mgr itself). Do we need a separate lock for this? In theory
>>>> as outlined in the VM_BIND locking document draft, one could probably even
>>>> use the mgr resv for this, but with more complicated code I guess. Also see
>>>> the comment below about the data structure chosen.
>>> The idea is to protect this list with the GPU-VM lock. The locking here is more
>>> of an implication of the maple tree. Either you use the internal lock of the
>>> maple tree or RCU respectively, or you give the maple tree an external lock to
>>> perform lockdep checks on (mt_set_external_lock()). Basically same as here:
>>>
>>> https://elixir.bootlin.com/linux/latest/source/drivers/base/regmap/regcache-maple.c#L124
>> Ah, I suspected it was something along those lines.
>>
>>
>>>>> +	mas_for_each(&mas, ref.ptr, ULONG_MAX) {
>>>>> +		struct drm_gem_object *obj;
>>>>> +
>>>>> +		mas_pause(&mas);
>>>>> +		rcu_read_unlock();
>>>>> +
>>>>> +		obj = (struct drm_gem_object *)(uintptr_t)mas.index;
>>>>> +		ret = drm_exec_prepare_obj(exec, obj, num_fences);
>>>>> +		if (ret)
>>>>> +			goto out;
>>>>> +
>>>>> +		rcu_read_lock();
>>>>> +	}
>>>>> +	rcu_read_unlock();
>>>>> +
>>>>> +out:
>>>>> +	return ret;
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_prepare_objects);
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_manager_lock_extra() - lock all dma-resv of all associated BOs
>>>>> + * @mgr: the &drm_gpuva_manager
>>>>> + * @fn: callback received by the driver to lock additional dma-resv
>>>>> + * @priv: private driver data passed to @fn
>>>>> + * @num_fences: the amount of &dma_fences to reserve
>>>>> + * @interruptible: sleep interruptible if waiting
>>>>> + *
>>>>> + * Acquires all dma-resv locks of all &drm_gem_objects the given
>>>>> + * &drm_gpuva_manager contains mappings of.
>>>>> + *
>>>>> + * Additionally, when calling this function the driver receives the given @fn
>>>>> + * callback to lock additional dma-resv in the context of the
>>>>> + * &drm_gpuva_managers &drm_exec instance. Typically, drivers would call
>>>>> + * drm_exec_prepare_obj() from within this callback.
>>>>> + *
>>>>> + * Returns: 0 on success, negative error code on failure.
>>>>> + */
>>>>> +int
>>>>> +drm_gpuva_manager_lock_extra(struct drm_gpuva_manager *mgr,
>>>>> +			     int (*fn)(struct drm_gpuva_manager *mgr,
>>>>> +				       void *priv, unsigned int num_fences),
>>>>> +			     void *priv,
>>>>> +			     unsigned int num_fences,
>>>>> +			     bool interruptible)
>>>>> +{
>>>>> +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
>>>>> +	uint32_t flags;
>>>>> +	int ret;
>>>>> +
>>>>> +	flags = (interruptible ? DRM_EXEC_INTERRUPTIBLE_WAIT : 0) |
>>>>> +		DRM_EXEC_IGNORE_DUPLICATES;
>>>>> +
>>>>> +	drm_exec_init(exec, flags);
>>>>> +
>>>>> +	drm_exec_until_all_locked(exec) {
>>>>> +		ret = drm_gpuva_manager_prepare_objects(mgr, num_fences);
>>>>> +		drm_exec_retry_on_contention(exec);
>>>>> +		if (ret)
>>>>> +			goto err;
>>>>> +
>>>>> +		if (fn) {
>>>>> +			ret = fn(mgr, priv, num_fences);
>>>>> +			drm_exec_retry_on_contention(exec);
>>>>> +			if (ret)
>>>>> +				goto err;
>>>>> +		}
>>>>> +	}
>>>>> +
>>>>> +	return 0;
>>>>> +
>>>>> +err:
>>>>> +	drm_exec_fini(exec);
>>>>> +	return ret;
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_lock_extra);
>>>>> +
>>>>> +static int
>>>>> +fn_lock_array(struct drm_gpuva_manager *mgr, void *priv,
>>>>> +				unsigned int num_fences)
>>>>> +{
>>>>> +	struct {
>>>>> +		struct drm_gem_object **objs;
>>>>> +		unsigned int num_objs;
>>>>> +	} *args = priv;
>>>>> +
>>>>> +	return drm_exec_prepare_array(DRM_GPUVA_EXEC(mgr), args->objs,
>>>>> +				      args->num_objs, num_fences);
>>>>> +}
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_manager_lock_array() - lock all dma-resv of all associated BOs
>>>>> + * @mgr: the &drm_gpuva_manager
>>>>> + * @objs: additional &drm_gem_objects to lock
>>>>> + * @num_objs: the number of additional &drm_gem_objects to lock
>>>>> + * @num_fences: the amount of &dma_fences to reserve
>>>>> + * @interruptible: sleep interruptible if waiting
>>>>> + *
>>>>> + * Acquires all dma-resv locks of all &drm_gem_objects the given
>>>>> + * &drm_gpuva_manager contains mappings of, plus the ones given through @objs.
>>>>> + *
>>>>> + * Returns: 0 on success, negative error code on failure.
>>>>> + */
>>>>> +int
>>>>> +drm_gpuva_manager_lock_array(struct drm_gpuva_manager *mgr,
>>>>> +			     struct drm_gem_object **objs,
>>>>> +			     unsigned int num_objs,
>>>>> +			     unsigned int num_fences,
>>>>> +			     bool interruptible)
>>>>> +{
>>>>> +	struct {
>>>>> +		struct drm_gem_object **objs;
>>>>> +		unsigned int num_objs;
>>>>> +	} args;
>>>>> +
>>>>> +	args.objs = objs;
>>>>> +	args.num_objs = num_objs;
>>>>> +
>>>>> +	return drm_gpuva_manager_lock_extra(mgr, fn_lock_array, &args,
>>>>> +					    num_fences, interruptible);
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_lock_array);
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_manager_validate() - validate all BOs marked as evicted
>>>>> + * @mgr: the &drm_gpuva_manager to validate evicted BOs
>>>>> + *
>>>>> + * Calls the &drm_gpuva_fn_ops.bo_validate callback for all evicted buffer
>>>>> + * objects being mapped in the given &drm_gpuva_manager.
>>>>> + *
>>>>> + * Returns: 0 on success, negative error code on failure.
>>>>> + */
>>>>> +int
>>>>> +drm_gpuva_manager_validate(struct drm_gpuva_manager *mgr)
>>>>> +{
>>>>> +	const struct drm_gpuva_fn_ops *ops = mgr->ops;
>>>>> +	struct drm_gpuva_gem *vm_bo;
>>>>> +	int ret;
>>>>> +
>>>>> +	if (unlikely(!ops || !ops->bo_validate))
>>>>> +		return -ENOTSUPP;
>>>>> +
>>>>> +	/* At this point we should hold all dma-resv locks of all GEM objects
>>>>> +	 * associated with this GPU-VM, hence it is safe to walk the list.
>>>>> +	 */
>>>>> +	list_for_each_entry(vm_bo, &mgr->evict.list, list.entry.evict) {
>>>>> +		dma_resv_assert_held(vm_bo->obj->resv);
>>>>> +
>>>>> +		ret = ops->bo_validate(vm_bo->obj);
>>>>> +		if (ret)
>>>>> +			return ret;
>>>>> +	}
>>>>> +
>>>>> +	return 0;
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_validate);
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_manager_resv_add_fence - add fence to private and all extobj
>>>>> + * dma-resv
>>>>> + * @mgr: the &drm_gpuva_manager to add a fence to
>>>>> + * @fence: fence to add
>>>>> + * @private_usage: private dma-resv usage
>>>>> + * @extobj_usage: extobj dma-resv usage
>>>>> + */
>>>>> +void
>>>>> +drm_gpuva_manager_resv_add_fence(struct drm_gpuva_manager *mgr,
>>>>> +				 struct dma_fence *fence,
>>>>> +				 enum dma_resv_usage private_usage,
>>>>> +				 enum dma_resv_usage extobj_usage)
>>>>> +{
>>>>> +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
>>>>> +	struct drm_gem_object *obj;
>>>>> +	unsigned long index;
>>>>> +
>>>>> +	drm_exec_for_each_locked_object(exec, index, obj) {
>>>>> +			dma_resv_assert_held(obj->resv);
>>>>> +			dma_resv_add_fence(obj->resv, fence,
>>>>> +					   drm_gpuva_is_extobj(mgr, obj) ?
>>>>> +					   extobj_usage : private_usage);
>>>>> +	}
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_resv_add_fence);
>>>>> +
>>>>> +static struct drm_gpuva_gem *
>>>>> +__drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
>>>>> +		     struct drm_gem_object *obj)
>>>>> +{
>>>>> +	struct drm_gpuva_gem *vm_bo;
>>>>> +
>>>>> +	drm_gem_gpuva_assert_lock_held(obj);
>>>>> +
>>>>> +	drm_gem_for_each_gpuva_gem(vm_bo, obj)
>>>>> +		if (vm_bo->mgr == mgr)
>>>>> +			return vm_bo;
>>>>> +
>>>>> +	return NULL;
>>>>> +}
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_gem_create() - create a new instance of struct drm_gpuva_gem
>>>>> + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
>>>>> + * @obj: The &drm_gem_object being mapped in the @mgr.
>>>>> + *
>>>>> + * If provided by the driver, this function uses the &drm_gpuva_fn_ops
>>>>> + * vm_bo_alloc() callback to allocate.
>>>>> + *
>>>>> + * Returns: a pointer to the &drm_gpuva_gem on success, NULL on failure
>>>>> + */
>>>>> +struct drm_gpuva_gem *
>>>>> +drm_gpuva_gem_create(struct drm_gpuva_manager *mgr,
>>>>> +		     struct drm_gem_object *obj)
>>>>> +{
>>>>> +	const struct drm_gpuva_fn_ops *ops = mgr->ops;
>>>>> +	struct drm_gpuva_gem *vm_bo;
>>>>> +
>>>>> +	if (ops && ops->vm_bo_alloc)
>>>>> +		vm_bo = ops->vm_bo_alloc();
>>>>> +	else
>>>>> +		vm_bo = kzalloc(sizeof(*vm_bo), GFP_KERNEL);
>>>>> +
>>>>> +	if (unlikely(!vm_bo))
>>>>> +		return NULL;
>>>>> +
>>>>> +	vm_bo->mgr = mgr;
>>>>> +	vm_bo->obj = obj;
>>>>> +
>>>>> +	kref_init(&vm_bo->kref);
>>>>> +	INIT_LIST_HEAD(&vm_bo->list.gpuva);
>>>>> +	INIT_LIST_HEAD(&vm_bo->list.entry.gem);
>>>>> +	INIT_LIST_HEAD(&vm_bo->list.entry.evict);
>>>>> +
>>>>> +	drm_gem_object_get(obj);
>>>>> +
>>>>> +	return vm_bo;
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_create);
>>>>> +
>>>>> +void
>>>>> +drm_gpuva_gem_destroy(struct kref *kref)
>>>>> +{
>>>>> +	struct drm_gpuva_gem *vm_bo = container_of(kref, struct drm_gpuva_gem,
>>>>> +						   kref);
>>>>> +	const struct drm_gpuva_fn_ops *ops = vm_bo->mgr->ops;
>>>>> +
>>>>> +	drm_gem_object_put(vm_bo->obj);
>>>>> +
>>>>> +	if (ops && ops->vm_bo_free)
>>>>> +		ops->vm_bo_free(vm_bo);
>>>>> +	else
>>>>> +		kfree(vm_bo);
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_destroy);
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_gem_find() - find the &drm_gpuva_gem for the given
>>>>> + * &drm_gpuva_manager and &drm_gem_object
>>>>> + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
>>>>> + * @obj: The &drm_gem_object being mapped in the @mgr.
>>>>> + *
>>>>> + * Find the &drm_gpuva_gem representing the combination of the given
>>>>> + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
>>>>> + * count of the &drm_gpuva_gem accordingly.
>>>>> + *
>>>>> + * Returns: a pointer to the &drm_gpuva_gem on success, NULL on failure
>>>>> + */
>>>>> +struct drm_gpuva_gem *
>>>>> +drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
>>>>> +		   struct drm_gem_object *obj)
>>>>> +{
>>>>> +	struct drm_gpuva_gem *vm_bo = __drm_gpuva_gem_find(mgr, obj);
>>>>> +
>>>>> +	return vm_bo ? drm_gpuva_gem_get(vm_bo) : NULL;
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_find);
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_gem_obtain() - obtains an instance of the &drm_gpuva_gem for the
>>>>> + * given &drm_gpuva_manager and &drm_gem_object
>>>>> + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
>>>>> + * @obj: The &drm_gem_object being mapped in the @mgr.
>>>>> + *
>>>>> + * Find the &drm_gpuva_gem representing the combination of the given
>>>>> + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
>>>>> + * count of the &drm_gpuva_gem accordingly. If not found, allocates a new
>>>>> + * &drm_gpuva_gem.
>>>>> + *
>>>>> + * Returns: a pointer to the &drm_gpuva_gem on success, an ERR_PTR on failure
>>>>> + */
>>>>> +struct drm_gpuva_gem *
>>>>> +drm_gpuva_gem_obtain(struct drm_gpuva_manager *mgr,
>>>>> +		     struct drm_gem_object *obj)
>>>>> +{
>>>>> +	struct drm_gpuva_gem *vm_bo;
>>>>> +
>>>>> +	vm_bo = drm_gpuva_gem_find(mgr, obj);
>>>>> +	if (vm_bo)
>>>>> +		return vm_bo;
>>>>> +
>>>>> +	vm_bo = drm_gpuva_gem_create(mgr, obj);
>>>>> +	if (!vm_bo)
>>>>> +		return ERR_PTR(-ENOMEM);
>>>>> +
>>>>> +	return vm_bo;
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_obtain);
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_gem_obtain_prealloc() - obtains an instance of the &drm_gpuva_gem
>>>>> + * for the given &drm_gpuva_manager and &drm_gem_object
>>>>> + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
>>>>> + * @obj: The &drm_gem_object being mapped in the @mgr.
>>>>> + *
>>>>> + * Find the &drm_gpuva_gem representing the combination of the given
>>>>> + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
>>>>> + * count of the found &drm_gpuva_gem accordingly, while the @__vm_bo reference
>>>>> + * count is decreased. If not found @__vm_bo is returned.
>>>>> + *
>>>>> + * Returns: a pointer to the found &drm_gpuva_gem or @__vm_bo if no existing
>>>>> + * &drm_gpuva_gem was found
>>>>> + */
>>>>> +struct drm_gpuva_gem *
>>>>> +drm_gpuva_gem_obtain_prealloc(struct drm_gpuva_manager *mgr,
>>>>> +			      struct drm_gem_object *obj,
>>>>> +			      struct drm_gpuva_gem *__vm_bo)
>>>>> +{
>>>>> +	struct drm_gpuva_gem *vm_bo;
>>>>> +
>>>>> +	vm_bo = drm_gpuva_gem_find(mgr, obj);
>>>>> +	if (vm_bo) {
>>>>> +		drm_gpuva_gem_put(__vm_bo);
>>>>> +		return vm_bo;
>>>>> +	}
>>>>> +
>>>>> +	return __vm_bo;
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_obtain_prealloc);
>>>>> +
>>>>> +static int
>>>>> +__drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
>>>>> +			  struct drm_gem_object *obj,
>>>>> +			  gfp_t gfp)
>>>>> +{
>>>>> +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
>>>>> +	union {
>>>>> +		struct drm_gem_object *obj;
>>>>> +		uintptr_t index;
>>>>> +	} gem;
>>>>> +	union {
>>>>> +		void *ptr;
>>>>> +		uintptr_t cnt;
>>>>> +	} ref;
>>>>> +	int ret = 0;
>>>>> +
>>>>> +	gem.obj = obj;
>>>>> +	mas_set(&mas, gem.index);
>>>>> +
>>>>> +	mas_lock(&mas);
>>>>> +	ref.ptr = mas_walk(&mas);
>>>>> +	if (ref.ptr) {
>>>>> +		++ref.cnt;
>>>>> +		mas_store(&mas, ref.ptr);
>>>>> +	} else {
>>>>> +		if (unlikely(!gfp)) {
>>>>> +			ret = -EINVAL;
>>>>> +			goto out;
>>>>> +		}
>>>>> +
>>>>> +		mas_set(&mas, gem.index);
>>>>> +		ref.cnt = 1;
>>>>> +		ret = mas_store_gfp(&mas, ref.ptr, gfp);
>>>>> +		if (likely(!ret))
>>>>> +			drm_gem_object_get(obj);
>>>>> +	}
>>>>> +out:
>>>>> +	mas_unlock(&mas);
>>>>> +	return ret;
>>>>> +}
>>>>> +
>>>>> +static void
>>>>> +__drm_gpuva_extobj_remove(struct drm_gpuva_manager *mgr,
>>>>> +			  struct drm_gem_object *obj)
>>>>> +{
>>>>> +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
>>>>> +	union {
>>>>> +		struct drm_gem_object *obj;
>>>>> +		uintptr_t index;
>>>>> +	} gem;
>>>>> +	union {
>>>>> +		void *ptr;
>>>>> +		uintptr_t cnt;
>>>>> +	} ref;
>>>>> +
>>>>> +	gem.obj = obj;
>>>>> +	mas_set(&mas, gem.index);
>>>>> +
>>>>> +	mas_lock(&mas);
>>>>> +	if (unlikely(!(ref.ptr = mas_walk(&mas))))
>>>>> +		goto out;
>>>>> +
>>>>> +	if (!--ref.cnt) {
>>>>> +		mas_erase(&mas);
>>>>> +		drm_gem_object_put(obj);
>>>>> +	} else {
>>>>> +		mas_store(&mas, ref.ptr);
>>>>> +	}
>>>>> +out:
>>>>> +	mas_unlock(&mas);
>>>>> +}
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_extobj_insert - insert an external &drm_gem_object
>>>>> + * @mgr: the &drm_gpuva_manager to insert into
>>>>> + * @obj: the &drm_gem_object to insert as extobj
>>>>> + *
>>>>> + * Insert a &drm_gem_object into the &drm_gpuva_managers external object tree.
>>>>> + * If the &drm_gem_object already exists in the tree, the reference counter
>>>>> + * of this external object is increased by one.
>>>>> + *
>>>>> + * Drivers should insert the external &drm_gem_object before the dma-fence
>>>>> + * signalling critical section, e.g. when submitting the job, and before
>>>>> + * locking all &drm_gem_objects of a GPU-VM, e.g. with drm_gpuva_manager_lock()
>>>>> + * or its derivatives.
>>>>> + *
>>>>> + * Returns: 0 on success, negative error code on failure.
>>>>> + */
>>>>> +int
>>>>> +drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
>>>>> +			struct drm_gem_object *obj)
>>>>> +{
>>>>> +	return drm_gpuva_is_extobj(mgr, obj) ?
>>>>> +		__drm_gpuva_extobj_insert(mgr, obj, GFP_KERNEL) : 0;
>>>>> +
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_insert);
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_extobj_get() - increase the reference count of an external
>>>>> + * &drm_gem_object
>>>>> + * @mgr: the &drm_gpuva_manager storing the extobj
>>>>> + * @obj: the &drm_gem_object representing the extobj
>>>>> + *
>>>>> + * Increases the reference count of the extobj represented by @obj.
>>>>> + *
>>>>> + * Drivers should call this for every &drm_gpuva backed by a &drm_gem_object
>>>>> + * being inserted.
>>>>> + *
>>>>> + * For &drm_gpuva_op_remap operations drivers should make sure to only take an
>>>>> + * additional reference if the re-map operation splits an existing &drm_gpuva
>>>>> + * into two separate ones.
>>>>> + *
>>>>> + * See also drm_gpuva_map_get() and drm_gpuva_remap_get().
>>>>> + */
>>>>> +void
>>>>> +drm_gpuva_extobj_get(struct drm_gpuva_manager *mgr,
>>>>> +		     struct drm_gem_object *obj)
>>>>> +{
>>>>> +	if (drm_gpuva_is_extobj(mgr, obj))
>>>>> +		WARN(__drm_gpuva_extobj_insert(mgr, obj, 0),
>>>>> +		     "Can't increase ref-count of non-existent extobj.");
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_get);
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_extobj_put() - decrease the reference count of an external
>>>>> + * &drm_gem_object
>>>>> + * @mgr: the &drm_gpuva_manager storing the extobj
>>>>> + * @obj: the &drm_gem_object representing the extobj
>>>>> + *
>>>>> + * Decreases the reference count of the extobj represented by @obj.
>>>>> + *
>>>>> + * Drivers should call this for every &drm_gpuva backed by a &drm_gem_object
>>>>> + * being removed from the GPU VA space.
>>>>> + *
>>>>> + * See also drm_gpuva_unmap_put().
>>>>> + */
>>>>> +void
>>>>> +drm_gpuva_extobj_put(struct drm_gpuva_manager *mgr,
>>>>> +		     struct drm_gem_object *obj)
>>>>> +{
>>>>> +	if (drm_gpuva_is_extobj(mgr, obj))
>>>>> +		__drm_gpuva_extobj_remove(mgr, obj);
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_put);
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_gem_evict() - add / remove a &drm_gem_object to / from a
>>>>> + * &drm_gpuva_managers evicted list
>>>>> + * @obj: the &drm_gem_object to add or remove
>>>>> + * @evict: indicates whether the object is evicted
>>>>> + *
>>>>> + * Adds a &drm_gem_object to or removes it from all &drm_gpuva_managers evicted
>>>>> + * list containing a mapping of this &drm_gem_object.
>>>>> + */
>>>>> +void
>>>>> +drm_gpuva_gem_evict(struct drm_gem_object *obj, bool evict)
>>>>> +{
>>>>> +	struct drm_gpuva_gem *vm_bo;
>>>>> +
>>>>> +	/* Required for iterating the GEMs GPUVA GEM list. If no driver specific
>>>>> +	 * lock has been set, the list is protected with the GEMs dma-resv lock.
>>>>> +	 */
>>>>> +	drm_gem_gpuva_assert_lock_held(obj);
>>>>> +
>>>>> +	/* Required to protect the GPUVA managers evict list against concurrent
>>>>> +	 * access through drm_gpuva_manager_validate(). Concurrent insertions to
>>>>> +	 * the evict list through different GEM object evictions are protected
>>>>> +	 * by the GPUVA managers evict lock.
>>>>> +	 */
>>>>> +	dma_resv_assert_held(obj->resv);
>>>>> +
>>>>> +	drm_gem_for_each_gpuva_gem(vm_bo, obj) {
>>>>> +		struct drm_gpuva_manager *mgr = vm_bo->mgr;
>>>>> +
>>>>> +		spin_lock(&mgr->evict.lock);
>>>>> +		if (evict)
>>>>> +			list_add_tail(&vm_bo->list.entry.evict,
>>>>> +				      &mgr->evict.list);
>>>>> +		else
>>>>> +			list_del_init(&vm_bo->list.entry.evict);
>>>>> +		spin_unlock(&mgr->evict.lock);
>>>>> +	}
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_evict);
>>>>> +
>>>>>     static int
>>>>>     __drm_gpuva_insert(struct drm_gpuva_manager *mgr,
>>>>>     		   struct drm_gpuva *va)
>>>>> @@ -806,15 +1381,20 @@ EXPORT_SYMBOL_GPL(drm_gpuva_remove);
>>>>>     /**
>>>>>      * drm_gpuva_link() - link a &drm_gpuva
>>>>>      * @va: the &drm_gpuva to link
>>>>> + * @vm_bo: the &drm_gpuva_gem to add the &drm_gpuva to
>>>>>      *
>>>>> - * This adds the given &va to the GPU VA list of the &drm_gem_object it is
>>>>> - * associated with.
>>>>> + * This adds the given &va to the GPU VA list of the &drm_gpuva_gem and the
>>>>> + * &drm_gpuva_gem to the &drm_gem_object it is associated with.
>>>>> + *
>>>>> + * For every &drm_gpuva entry added to the &drm_gpuva_gem an additional
>>>>> + * reference of the latter is taken.
>>>>>      *
>>>>>      * This function expects the caller to protect the GEM's GPUVA list against
>>>>> - * concurrent access using the GEMs dma_resv lock.
>>>>> + * concurrent access using either the GEMs dma_resv lock or a driver specific
>>>>> + * lock set through drm_gem_gpuva_set_lock().
>>>>>      */
>>>>>     void
>>>>> -drm_gpuva_link(struct drm_gpuva *va)
>>>>> +drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo)
>>>>>     {
>>>>>     	struct drm_gem_object *obj = va->gem.obj;
>>>>> @@ -823,7 +1403,10 @@ drm_gpuva_link(struct drm_gpuva *va)
>>>>>     	drm_gem_gpuva_assert_lock_held(obj);
>>>>> -	list_add_tail(&va->gem.entry, &obj->gpuva.list);
>>>>> +	drm_gpuva_gem_get(vm_bo);
>>>>> +	list_add_tail(&va->gem.entry, &vm_bo->list.gpuva);
>>>>> +	if (list_empty(&vm_bo->list.entry.gem))
>>>>> +		list_add_tail(&vm_bo->list.entry.gem, &obj->gpuva.list);
>>>>>     }
>>>>>     EXPORT_SYMBOL_GPL(drm_gpuva_link);
>>>>> @@ -834,20 +1417,39 @@ EXPORT_SYMBOL_GPL(drm_gpuva_link);
>>>>>      * This removes the given &va from the GPU VA list of the &drm_gem_object it is
>>>>>      * associated with.
>>>>>      *
>>>>> + * This removes the given &va from the GPU VA list of the &drm_gpuva_gem and
>>>>> + * the &drm_gpuva_gem from the &drm_gem_object it is associated with in case
>>>>> + * this call unlinks the last &drm_gpuva from the &drm_gpuva_gem.
>>>>> + *
>>>>> + * For every &drm_gpuva entry removed from the &drm_gpuva_gem a reference of
>>>>> + * the latter is dropped.
>>>>> + *
>>>>>      * This function expects the caller to protect the GEM's GPUVA list against
>>>>> - * concurrent access using the GEMs dma_resv lock.
>>>>> + * concurrent access using either the GEMs dma_resv lock or a driver specific
>>>>> + * lock set through drm_gem_gpuva_set_lock().
>>>>>      */
>>>>>     void
>>>>>     drm_gpuva_unlink(struct drm_gpuva *va)
>>>>>     {
>>>>>     	struct drm_gem_object *obj = va->gem.obj;
>>>>> +	struct drm_gpuva_gem *vm_bo;
>>>>>     	if (unlikely(!obj))
>>>>>     		return;
>>>>>     	drm_gem_gpuva_assert_lock_held(obj);
>>>>> +	vm_bo = __drm_gpuva_gem_find(va->mgr, obj);
>>>>> +	if (WARN(!vm_bo, "GPUVA doesn't seem to be linked.\n"))
>>>>> +		return;
>>>>> +
>>>>>     	list_del_init(&va->gem.entry);
>>>>> +
>>>>> +	if (list_empty(&vm_bo->list.gpuva)) {
>>>>> +		list_del_init(&vm_bo->list.entry.gem);
>>>>> +		list_del_init(&vm_bo->list.entry.evict);
>>>>> +	}
>>>>> +	drm_gpuva_gem_put(vm_bo);
>>>>>     }
>>>>>     EXPORT_SYMBOL_GPL(drm_gpuva_unlink);
>>>>> @@ -977,6 +1579,26 @@ drm_gpuva_map(struct drm_gpuva_manager *mgr,
>>>>>     }
>>>>>     EXPORT_SYMBOL_GPL(drm_gpuva_map);
>>>>> +/**
>>>>> + * drm_gpuva_map_get() - helper to insert a &drm_gpuva according to a
>>>>> + * &drm_gpuva_op_map
>>>>> + * @mgr: the &drm_gpuva_manager
>>>>> + * @va: the &drm_gpuva to insert
>>>>> + * @op: the &drm_gpuva_op_map to initialize @va with
>>>>> + *
>>>>> + * Initializes the @va from the @op and inserts it into the given @mgr and
>>>>> + * increases the reference count of the corresponding extobj.
>>>>> + */
>>>>> +void
>>>>> +drm_gpuva_map_get(struct drm_gpuva_manager *mgr,
>>>>> +		  struct drm_gpuva *va,
>>>>> +		  struct drm_gpuva_op_map *op)
>>>>> +{
>>>>> +	drm_gpuva_map(mgr, va, op);
>>>>> +	drm_gpuva_extobj_get(mgr, va->gem.obj);
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_map_get);
>>>>> +
>>>>>     /**
>>>>>      * drm_gpuva_remap() - helper to remap a &drm_gpuva according to a
>>>>>      * &drm_gpuva_op_remap
>>>>> @@ -992,10 +1614,10 @@ drm_gpuva_remap(struct drm_gpuva *prev,
>>>>>     		struct drm_gpuva *next,
>>>>>     		struct drm_gpuva_op_remap *op)
>>>>>     {
>>>>> -	struct drm_gpuva *curr = op->unmap->va;
>>>>> -	struct drm_gpuva_manager *mgr = curr->mgr;
>>>>> +	struct drm_gpuva *va = op->unmap->va;
>>>>> +	struct drm_gpuva_manager *mgr = va->mgr;
>>>>> -	drm_gpuva_remove(curr);
>>>>> +	drm_gpuva_remove(va);
>>>>>     	if (op->prev) {
>>>>>     		drm_gpuva_init_from_op(prev, op->prev);
>>>>> @@ -1009,6 +1631,31 @@ drm_gpuva_remap(struct drm_gpuva *prev,
>>>>>     }
>>>>>     EXPORT_SYMBOL_GPL(drm_gpuva_remap);
>>>>> +/**
>>>>> + * drm_gpuva_remap_get() - helper to remap a &drm_gpuva according to a
>>>>> + * &drm_gpuva_op_remap
>>>>> + * @prev: the &drm_gpuva to remap when keeping the start of a mapping
>>>>> + * @next: the &drm_gpuva to remap when keeping the end of a mapping
>>>>> + * @op: the &drm_gpuva_op_remap to initialize @prev and @next with
>>>>> + *
>>>>> + * Removes the currently mapped &drm_gpuva and remaps it using @prev and/or
>>>>> + * @next. Additionally, if the re-map splits the existing &drm_gpuva into two
>>>>> + * separate mappings, increases the reference count of the corresponding extobj.
>>>>> + */
>>>>> +void
>>>>> +drm_gpuva_remap_get(struct drm_gpuva *prev,
>>>>> +		    struct drm_gpuva *next,
>>>>> +		    struct drm_gpuva_op_remap *op)
>>>>> +{
>>>>> +	struct drm_gpuva *va = op->unmap->va;
>>>>> +	struct drm_gpuva_manager *mgr = va->mgr;
>>>>> +
>>>>> +	drm_gpuva_remap(prev, next, op);
>>>>> +	if (op->prev && op->next)
>>>>> +		drm_gpuva_extobj_get(mgr, va->gem.obj);
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_remap_get);
>>>>> +
>>>>>     /**
>>>>>      * drm_gpuva_unmap() - helper to remove a &drm_gpuva according to a
>>>>>      * &drm_gpuva_op_unmap
>>>>> @@ -1023,6 +1670,24 @@ drm_gpuva_unmap(struct drm_gpuva_op_unmap *op)
>>>>>     }
>>>>>     EXPORT_SYMBOL_GPL(drm_gpuva_unmap);
>>>>> +/**
>>>>> + * drm_gpuva_unmap_put() - helper to remove a &drm_gpuva according to a
>>>>> + * &drm_gpuva_op_unmap
>>>>> + * @op: the &drm_gpuva_op_unmap specifying the &drm_gpuva to remove
>>>>> + *
>>>>> + * Removes the &drm_gpuva associated with the &drm_gpuva_op_unmap and decreases
>>>>> + * the reference count of the corresponding extobj.
>>>>> + */
>>>>> +void
>>>>> +drm_gpuva_unmap_put(struct drm_gpuva_op_unmap *op)
>>>>> +{
>>>>> +	struct drm_gpuva *va = op->va;
>>>>> +
>>>>> +	drm_gpuva_unmap(op);
>>>>> +	drm_gpuva_extobj_put(va->mgr, va->gem.obj);
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_unmap_put);
>>>>> +
>>>>>     static int
>>>>>     op_map_cb(const struct drm_gpuva_fn_ops *fn, void *priv,
>>>>>     	  u64 addr, u64 range,
>>>>> @@ -1663,6 +2328,7 @@ drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
>>>>>     {
>>>>>     	struct drm_gpuva_ops *ops;
>>>>>     	struct drm_gpuva_op *op;
>>>>> +	struct drm_gpuva_gem *vm_bo;
>>>>>     	struct drm_gpuva *va;
>>>>>     	int ret;
>>>>> @@ -1674,7 +2340,7 @@ drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
>>>>>     	INIT_LIST_HEAD(&ops->list);
>>>>> -	drm_gem_for_each_gpuva(va, obj) {
>>>>> +	drm_gem_for_each_gpuva(va, vm_bo, mgr, obj) {
>>>>>     		op = gpuva_op_alloc(mgr);
>>>>>     		if (!op) {
>>>>>     			ret = -ENOMEM;
>>>>> diff --git a/include/drm/drm_gem.h b/include/drm/drm_gem.h
>>>>> index bc9f6aa2f3fe..783ed3ab440d 100644
>>>>> --- a/include/drm/drm_gem.h
>>>>> +++ b/include/drm/drm_gem.h
>>>>> @@ -571,7 +571,7 @@ int drm_gem_evict(struct drm_gem_object *obj);
>>>>>      * drm_gem_gpuva_init() - initialize the gpuva list of a GEM object
>>>>>      * @obj: the &drm_gem_object
>>>>>      *
>>>>> - * This initializes the &drm_gem_object's &drm_gpuva list.
>>>>> + * This initializes the &drm_gem_object's &drm_gpuva_gem list.
>>>>>      *
>>>>>      * Calling this function is only necessary for drivers intending to support the
>>>>>      * &drm_driver_feature DRIVER_GEM_GPUVA.
>>>>> @@ -584,28 +584,44 @@ static inline void drm_gem_gpuva_init(struct drm_gem_object *obj)
>>>>>     }
>>>>>     /**
>>>>> - * drm_gem_for_each_gpuva() - iternator to walk over a list of gpuvas
>>>>> - * @entry__: &drm_gpuva structure to assign to in each iteration step
>>>>> - * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
>>>>> + * drm_gem_for_each_gpuva_gem() - iterator to walk over a list of &drm_gpuva_gem
>>>>> + * @entry__: &drm_gpuva_gem structure to assign to in each iteration step
>>>>> + * @obj__: the &drm_gem_object the &drm_gpuva_gem to walk are associated with
>>>>>      *
>>>>> - * This iterator walks over all &drm_gpuva structures associated with the
>>>>> - * &drm_gpuva_manager.
>>>>> + * This iterator walks over all &drm_gpuva_gem structures associated with the
>>>>> + * &drm_gem_object.
>>>>>      */
>>>>> -#define drm_gem_for_each_gpuva(entry__, obj__) \
>>>>> -	list_for_each_entry(entry__, &(obj__)->gpuva.list, gem.entry)
>>>>> +#define drm_gem_for_each_gpuva_gem(entry__, obj__) \
>>>>> +	list_for_each_entry(entry__, &(obj__)->gpuva.list, list.entry.gem)
>>>>>     /**
>>>>> - * drm_gem_for_each_gpuva_safe() - iternator to safely walk over a list of
>>>>> - * gpuvas
>>>>> - * @entry__: &drm_gpuva structure to assign to in each iteration step
>>>>> - * @next__: &next &drm_gpuva to store the next step
>>>>> - * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
>>>>> + * drm_gem_for_each_gpuva_gem_safe() - iterator to safely walk over a list of
>>>>> + * &drm_gpuva_gem
>>>>> + * @entry__: &drm_gpuva_gem structure to assign to in each iteration step
>>>>> + * @next__: &next &drm_gpuva_gem to store the next step
>>>>> + * @obj__: the &drm_gem_object the &drm_gpuva_gem to walk are associated with
>>>>>      *
>>>>> - * This iterator walks over all &drm_gpuva structures associated with the
>>>>> + * This iterator walks over all &drm_gpuva_gem structures associated with the
>>>>>      * &drm_gem_object. It is implemented with list_for_each_entry_safe(), hence
>>>>>      * it is save against removal of elements.
>>>>>      */
>>>>> -#define drm_gem_for_each_gpuva_safe(entry__, next__, obj__) \
>>>>> -	list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list, gem.entry)
>>>>> +#define drm_gem_for_each_gpuva_gem_safe(entry__, next__, obj__) \
>>>>> +	list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list, list.entry.gem)
>>>>> +
>>>>> +/**
>>>>> + * drm_gem_for_each_gpuva() - iterator to walk over a list of &drm_gpuva
>>>>> + * @va__: &drm_gpuva structure to assign to in each iteration step
>>>>> + * @vm_bo__: the &drm_gpuva_gem representing the @mgr__ and @obj__ combination
>>>>> + * @mgr__: the &drm_gpuva_manager the &drm_gpuvas to walk are associated with
>>>>> + * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
>>>>> + *
>>>>> + * This iterator walks over all &drm_gpuva structures associated with the
>>>>> + * &drm_gpuva_manager and &drm_gem_object.
>>>>> + */
>>>>> +#define drm_gem_for_each_gpuva(va__, vm_bo__, mgr__, obj__) \
>>>>> +	for (vm_bo__ = drm_gpuva_gem_find(mgr__, obj__), \
>>>>> +	     va__ = vm_bo__ ? list_first_entry(&vm_bo__->list.gpuva, typeof(*va__), gem.entry) : NULL; \
>>>>> +	     va__ && !list_entry_is_head(va__, &vm_bo__->list.gpuva, gem.entry); \
>>>>> +	     va__ = list_next_entry(va__, gem.entry))
>>>>>     #endif /* __DRM_GEM_H__ */
>>>>> diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h
>>>>> index ed8d50200cc3..693e2da3f425 100644
>>>>> --- a/include/drm/drm_gpuva_mgr.h
>>>>> +++ b/include/drm/drm_gpuva_mgr.h
>>>>> @@ -26,12 +26,16 @@
>>>>>      */
>>>>>     #include <linux/list.h>
>>>>> +#include <linux/dma-resv.h>
>>>>> +#include <linux/maple_tree.h>
>>>>>     #include <linux/rbtree.h>
>>>>>     #include <linux/types.h>
>>>>>     #include <drm/drm_gem.h>
>>>>> +#include <drm/drm_exec.h>
>>>>>     struct drm_gpuva_manager;
>>>>> +struct drm_gpuva_gem;
>>>>>     struct drm_gpuva_fn_ops;
>>>>>     /**
>>>>> @@ -140,7 +144,7 @@ struct drm_gpuva {
>>>>>     int drm_gpuva_insert(struct drm_gpuva_manager *mgr, struct drm_gpuva *va);
>>>>>     void drm_gpuva_remove(struct drm_gpuva *va);
>>>>> -void drm_gpuva_link(struct drm_gpuva *va);
>>>>> +void drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo);
>>>>>     void drm_gpuva_unlink(struct drm_gpuva *va);
>>>>>     struct drm_gpuva *drm_gpuva_find(struct drm_gpuva_manager *mgr,
>>>>> @@ -240,15 +244,137 @@ struct drm_gpuva_manager {
>>>>>     	 * @ops: &drm_gpuva_fn_ops providing the split/merge steps to drivers
>>>>>     	 */
>>>>>     	const struct drm_gpuva_fn_ops *ops;
>>>>> +
>>>>> +	/**
>>>>> +	 * @d_obj: Dummy GEM object; used internally to pass the GPU VMs
>>>>> +	 * dma-resv to &drm_exec.
>>>>> +	 */
>>>>> +	struct drm_gem_object d_obj;
>>>>> +
>>>>> +	/**
>>>>> +	 * @resv: the &dma_resv for &drm_gem_objects mapped in this GPU VA
>>>>> +	 * space
>>>>> +	 */
>>>>> +	struct dma_resv *resv;
>>>>> +
>>>>> +	/**
>>>>> +	 * @exec: the &drm_exec helper to lock external &drm_gem_objects
>>>>> +	 */
>>>>> +	struct drm_exec exec;
>>>>> +
>>>>> +	/**
>>>>> +	 * @mt_ext: &maple_tree storing external &drm_gem_objects
>>>>> +	 */
>>>>> +	struct maple_tree mt_ext;
>>>> Why are you using a maple tree here? Insertion and removal are O(log(n))
>>>> instead of O(1) for a list?
>>>>
>>> Having a list of drm_gem_objects directly wouldn't work, as multiple GPU-VMs
>>> could have mappings of the same extobj.
>>>
>>> I considered using the VM_BO abstraction (struct drm_gpuva_gem) as list entry
>>> instead, which also seems to be the obvious choice. However, there is a locking
>>> conflict.
>>>
>>> A drm_gem_object keeps a list of drm_gpuva_gems, while each drm_gpuva_gem keeps
>>> a list of drm_gpuvas. Both lists are either protected with the dma-resv lock of
>>> the corresponding drm_gem_object, or with an external lock provided by the
>>> driver (see drm_gem_gpuva_set_lock()). The latter is used by drivers performing
>>> changes on the GPUVA space directly from the fence signalling path.
>>>
>>> Now, similar to what drm_gpuva_link() and drm_gpuva_unlink() are doing already,
>>> we'd want to add a drm_gpuva_gem to the extobj list for the first mapping being
>>> linked and we'd want to remove it for the last one being unlinked.
>>>
>>> (Actually we'd want to add the drm_gpuva_gem object to the extobj list even
>>> before, because otherwise we'd not acquire this GEM object's dma-resv lock
>>> through drm_gpuva_manager_lock(). But that's trivial, we could do that when we
>>> create the drm_gpuva_gem, which we need to do anyway.)
>>>
>>> Anyway, we'd probably want to keep removing the drm_gpuva_gem from the extobj
>>> list from drm_gpuva_unlink() when the last mapping of this BO is unlinked. In
>>> order to do so, we'd (as discussed above) either need to hold the outer GPU-VM
>>> lock or the GPU-VMs dma-resv lock. Both would be illegal in the case
>>> drm_gpuva_unlink() is called from within the fence signalling path. For drivers
>>> like XE or Nouveau, we'd at least need to make sure to not mess up the locking
>>> hierarchy of GPU-VM lock and dma-resv lock of the corresponding BO.
>>>
>>> Considering all that, I thought it's probably better to track extobjs separate
>>> from the drm_gpuva_gem, hence the maple tree choice.
>> Hm. OK, in Xe we have a list of the xe_vmas (drm_gpuvas) that point to
>> external objects; in the case of multiple mappings to the same gem
>> object, only one of the drm_gpuvas is in the list. These are protected by
>> the GPU-VM lock. I don't see a problem with removing those from the fence
>> signalling path, though?
> I intentionally tried to avoid keeping a list of drm_gpuvas to track extobjs,
> since this is generic code and I don't know how many mappings of an external
> object the corresponding driver potentially creates. This could become a pretty
> large list to iterate. Another reason was that I want to keep the drm_gpuva
> structure as small as possible, hence avoiding another list_head.

Yes, the list might be pretty large, but OTOH you never iterate to 
access a single list element. When you need to iterate the whole list 
you need to do that regardless of the data structure used. As for the 
list head, it might perhaps be aliased (union) with an upcoming userptr 
list head?
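
Something like the below, purely as a sketch with made-up names (driver_gpuva,
link) layered on top of the drm_gpuva from this series:

struct driver_gpuva {
	struct drm_gpuva base;
	union {
		/* entry on the VM's extobj list */
		struct list_head extobj;
		/* entry on the VM's userptr list */
		struct list_head userptr;
	} link;
};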

>
> Now, it sounds like in XE you're doing some kind of optimization just keeping a
> single mapping of an extobj in the list? How do you know when to remove it? What
> if the mapping from the extobj list gets unmapped, but there is still another
> one left in the GPU-VM being backed by the same BO?
When removing from the lists, we iterate through the object's list of 
vmas, and if there is one matching the same vm, we replace the old one 
with the new one. A similar iteration is done when adding to avoid 
adding one that is already on the list.
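
Roughly, as a very loose sketch with entirely made-up structures (struct my_vm /
my_vma / my_bo below are illustrative only, not Xe's actual code):

struct my_bo {
	struct list_head vma_list;	/* all my_vmas mapping this bo */
};

struct my_vm {
	struct mutex lock;		/* outer GPU-VM lock */
	struct list_head extobj_list;	/* one my_vma per external bo */
};

struct my_vma {
	struct my_vm *vm;
	struct my_bo *bo;
	struct list_head bo_link;	/* entry in bo->vma_list */
	struct list_head extobj_link;	/* entry in vm->extobj_list */
};

static void my_vm_extobj_del(struct my_vm *vm, struct my_vma *vma)
{
	struct my_vma *other;

	lockdep_assert_held(&vm->lock);

	/* Only the representative vma of a (vm, bo) pair is linked. */
	if (list_empty(&vma->extobj_link))
		return;

	list_del_init(&vma->extobj_link);

	/* Hand the slot over to another mapping of the same bo in this vm. */
	list_for_each_entry(other, &vma->bo->vma_list, bo_link) {
		if (other->vm == vm && other != vma) {
			list_add_tail(&other->extobj_link, &vm->extobj_list);
			break;
		}
	}
}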
> Although, assuming that's a no-go for GPUVA, wouldn't an XArray be a better
> choice, keeping O(1)?
> When tracking extobjs, the address of the drm_gem_object is the key while the
> reference count is the value. I was thinking of an XArray as well, but I was
> worried that the corresponding indices could be too widely distributed for an
> XArray to still be efficient. Now that I think about it, it's probably not that
> bad.
>
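
For illustration, a minimal sketch (not part of this series) of what the XArray
variant could look like, keyed by the drm_gem_object pointer and storing the
reference count as an xa value; the put path would mirror it, decrementing the
count and calling __xa_erase() / drm_gem_object_put() once it drops to zero:

static int extobj_xa_insert(struct xarray *xa, struct drm_gem_object *obj,
			    gfp_t gfp)
{
	unsigned long idx = (uintptr_t)obj;
	void *entry;
	int ret = 0;

	xa_lock(xa);
	entry = xa_load(xa, idx);
	if (entry) {
		/* Already tracked; just bump the stored count. */
		__xa_store(xa, idx, xa_mk_value(xa_to_value(entry) + 1), 0);
	} else if (gfp) {
		/* First mapping of this extobj in the GPU-VM. */
		ret = xa_err(__xa_store(xa, idx, xa_mk_value(1), gfp));
		if (!ret)
			drm_gem_object_get(obj);
	} else {
		ret = -EINVAL;
	}
	xa_unlock(xa);

	return ret;
}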
> Btw., while I agree we should try to make things as efficient as possible, what
> is the magnitude of extobjs to be tracked? Do we need to worry about the O(log(n))?

Not sure yet, TBH, but I think one of our UMDs can only use external
objects, because they don't know at creation time which ones need
exporting. However, if this turns out to be too bad, there are various
flavours of "clever but complicated" optimizations that we could think
of to reduce the list size. Still, in our case we opted for the vma list
head for now.

/Thomas


>
>>>>> +
>>>>> +	/**
>>>>> +	 * @evict: structure holding the evict list and evict list lock
>>>>> +	 */
>>>>> +	struct {
>>>>> +		/**
>>>>> +		 * @list: &list_head storing &drm_gem_objects currently being
>>>>> +		 * evicted
>>>>> +		 */
>>>>> +		struct list_head list;
>>>>> +
>>>>> +		/**
>>>>> +		 * @lock: spinlock to protect the evict list against concurrent
>>>>> +		 * insertion / removal of different &drm_gpuva_gems
>>>>> +		 */
>>>>> +		spinlock_t lock;
>>>>> +	} evict;
>>>>>     };
>>>>>     void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
>>>>> +			    struct drm_device *drm,
>>>>>     			    const char *name,
>>>>>     			    u64 start_offset, u64 range,
>>>>>     			    u64 reserve_offset, u64 reserve_range,
>>>>>     			    const struct drm_gpuva_fn_ops *ops);
>>>>>     void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr);
>>>>> +/**
>>>>> + * DRM_GPUVA_EXEC - returns the &drm_gpuva_managers &drm_exec instance
>>>>> + * @mgr: the &drm_gpuva_managers to return the &drm_exec instance for
>>>>> + */
>>>>> +#define DRM_GPUVA_EXEC(mgr)	&(mgr)->exec
>>>> A struct ww_acquire_ctx and thus a drm_exec is fundamentally per task and
>>>> should typically be allocated on the stack. Otherwise you'd need to protect
>>>> the mgr->exec member with an exclusive lock throughout the locking process,
>>>> and that's not what we want.
>>> Oh, good point. I think it works in Nouveau, because there it's implicitly
>>> protected with the job submission lock.
>>>
>>>> Did you consider subclassing a drm_exec for drm_gpuva purposes and adding
>>>> the needed ops to it? Like so:
>>> That's a good idea, will take this into V2.
>> Actually, I'm not fully sure that was a good idea: I now have a working
>> version of Xe ported over to drm_exec, having these helpers in mind and with
>> the intention to start using them as they mature. What I found, though, is
>> that open-coding the drm_exec loop is not all that bad, but that building
>> blocks that can be called from within the loop are useful:
>>
>> Like the drm_gpuva_prepare_objects() and an imaginary
>> drm_gpuva_prepare_gpuva() that locks the vm resv and the resv of the object
>> (if different and the gpuva points to an object). And
>> drm_gpuva_prepare_array(), although we don't use it within Xe. That means you
>> can use these building blocks like helpers and avoid the fn() callback by
>> open-coding instead.
>>
>> But I guess YMMV.
> That's exactly why those building blocks are exported; I already had in mind
> that there might be drivers which still want to open-code the drm_exec loop,
> while others might just want a simple interface to lock everything.
>
> I still think it is a good idea, but I'd keep that as simple as possible. And
> for everything else just let the driver open-code it and use the "building
> blocks" - I will also expand the building blocks to what you mentioned above.
>
>>>> struct drm_gpuva_exec_ops {
>>>>       int (*fn) (struct drm_gpuva_exec *exec, int num_fences);
>>> Is this the fn argument from drm_gpuva_manager_lock_extra()?
>>>
>>>>       int (*bo_validate) (struct drm_gpuva_exec *exec, struct drm_gem_object
>>>> *obj);
>>> I guess we could also keep that within the drm_gpuva_fn_ops? This should always
>>> be the same callback, right?
>>>
>>>> };
>>>>
>>>> struct drm_gpuva_exec {
>>>>       const struct drm_gpuva_exec_ops *ops;
>>>>       struct drm_exec exec;
>>>>       struct drm_gpuva_manager *mgr;
>>>> };
>>>>
>>>> Although I'd actually expect bo_validate to be part of fn in the typical
>>>> case. The drm_gpuva_exec would then be allocated by the caller on the stack.
>>> This doesn't sound like my assumption about fn() above is correct.
>> Well, one important thing in our conversion is that ttm_bo_validate() needs
>> to be in the until_all_locked() loop. We soon want to be able to use
>> sleeping locks for eviction, so a xe_bo_validate() would, at least
>> temporarily, add locked objects to the drm_exec list of locked objects. That
>> means everything that may end up calling validate deep within the call chain
>> needs to be part of the until_all_locked() loop, so our
>> drm_gpuva_manager_lock_extra() fn callback would include those validates and
>> look different all the time. That's why open-coding isn't all that
>> bad...
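
For reference, an open-coded loop along those lines, as an untested sketch using
the helpers introduced by this patch (drm_gpuva_manager_prepare_objects() and
drm_gpuva_manager_validate()); the key point is that validation sits inside
drm_exec_until_all_locked(), so anything it ends up locking is covered by the
retry logic:

static int my_vm_lock_and_validate(struct drm_gpuva_manager *mgr,
				   unsigned int num_fences)
{
	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
	int ret;

	drm_exec_init(exec, DRM_EXEC_INTERRUPTIBLE_WAIT);
	drm_exec_until_all_locked(exec) {
		ret = drm_gpuva_manager_prepare_objects(mgr, num_fences);
		drm_exec_retry_on_contention(exec);
		if (ret)
			goto err;

		/* May lock further objects; contention restarts the pass. */
		ret = drm_gpuva_manager_validate(mgr);
		drm_exec_retry_on_contention(exec);
		if (ret)
			goto err;
	}

	return 0;

err:
	drm_exec_fini(exec);
	return ret;
}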
> Oh, I see. You indeed want to call validate() from within until_all_locked().
>
>> /Thomas
>>
>>
>>>>> +
>>>>> +int drm_gpuva_manager_lock_extra(struct drm_gpuva_manager *mgr,
>>>>> +				 int (*fn)(struct drm_gpuva_manager *mgr,
>>>>> +					   void *priv, unsigned int num_fences),
>>>>> +				 void *priv,
>>>>> +				 unsigned int num_fences,
>>>>> +				 bool interruptible);
>>>>> +
>>>>> +int drm_gpuva_manager_lock_array(struct drm_gpuva_manager *mgr,
>>>>> +				 struct drm_gem_object **objs,
>>>>> +				 unsigned int num_objs,
>>>>> +				 unsigned int num_fences,
>>>>> +				 bool interruptible);
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_manager_lock() - lock all dma-resv of all associated BOs
>>>>> + * @mgr: the &drm_gpuva_manager
>>>>> + * @num_fences: the amount of &dma_fences to reserve
>>>>> + * @interruptible: sleep interruptible if waiting
>>>>> + *
>>>>> + * Acquires all dma-resv locks of all &drm_gem_objects the given
>>>>> + * &drm_gpuva_manager contains mappings of.
>>>>> + *
>>>>> + * Returns: 0 on success, negative error code on failure.
>>>>> + */
>>>>> +static inline int
>>>>> +drm_gpuva_manager_lock(struct drm_gpuva_manager *mgr,
>>>>> +		       unsigned int num_fences,
>>>>> +		       bool interruptible)
>>>>> +{
>>>>> +	return drm_gpuva_manager_lock_extra(mgr, NULL, NULL, num_fences,
>>>>> +					    interruptible);
>>>>> +}
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_manager_unlock() - unlock all dma-resv of all associated BOs
>>>>> + * @mgr: the &drm_gpuva_manager
>>>>> + *
>>>>> + * Releases all dma-resv locks of all &drm_gem_objects previously acquired
>>>>> + * through drm_gpuva_manager_lock() or its variants.
>>>>> + *
>>>>> + * Returns: 0 on success, negative error code on failure.
>>>>> + */
>>>>> +static inline void
>>>>> +drm_gpuva_manager_unlock(struct drm_gpuva_manager *mgr)
>>>>> +{
>>>>> +	drm_exec_fini(&mgr->exec);
>>>>> +}
>>>>> +
>>>>> +int drm_gpuva_manager_validate(struct drm_gpuva_manager *mgr);
>>>>> +void drm_gpuva_manager_resv_add_fence(struct drm_gpuva_manager *mgr,
>>>>> +				      struct dma_fence *fence,
>>>>> +				      enum dma_resv_usage private_usage,
>>>>> +				      enum dma_resv_usage extobj_usage);
>>>>> +
>>>>> +int drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
>>>>> +			    struct drm_gem_object *obj);
>>>>> +void drm_gpuva_extobj_get(struct drm_gpuva_manager *mgr,
>>>>> +			  struct drm_gem_object *obj);
>>>>> +void drm_gpuva_extobj_put(struct drm_gpuva_manager *mgr,
>>>>> +			  struct drm_gem_object *obj);
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_is_extobj() - indicates whether the given &drm_gem_object is an
>>>>> + * external object
>>>>> + * @mgr: the &drm_gpuva_manager to check
>>>>> + * @obj: the &drm_gem_object to check
>>>>> + *
>>>>> + * Returns: true if the &drm_gem_object &dma_resv differs from the
>>>>> + * &drm_gpuva_managers &dma_resv, false otherwise
>>>>> + */
>>>>> +static inline bool drm_gpuva_is_extobj(struct drm_gpuva_manager *mgr,
>>>>> +				       struct drm_gem_object *obj)
>>>>> +{
>>>>> +	return obj && obj->resv != mgr->resv;
>>>>> +}
>>>>> +
>>>>>     static inline struct drm_gpuva *
>>>>>     __drm_gpuva_next(struct drm_gpuva *va)
>>>>>     {
>>>>> @@ -327,6 +453,138 @@ __drm_gpuva_next(struct drm_gpuva *va)
>>>>>     #define drm_gpuva_for_each_va_safe(va__, next__, mgr__) \
>>>>>     	list_for_each_entry_safe(va__, next__, &(mgr__)->rb.list, rb.entry)
>>>>> +/**
>>>>> + * struct drm_gpuva_gem - structure representing a &drm_gpuva_manager and
>>>>> + * &drm_gem_object combination
>>>>> + *
>>>>> + * This structure is an abstraction representing a &drm_gpuva_manager and
>>>>> + * &drm_gem_object combination. It serves as an indirection to accelerate
>>>>> + * iterating all &drm_gpuvas within a &drm_gpuva_manager backed by the same
>>>>> + * &drm_gem_object.
>>>>> + *
>>>>> + * Furthermore it is used to cache evicted GEM objects for a certain GPU-VM to
>>>>> + * accelerate validation.
>>>>> + *
>>>>> + * Typically, drivers want to create an instance of a struct drm_gpuva_gem once
>>>>> + * a GEM object is mapped first in a GPU-VM and release the instance once the
>>>>> + * last mapping of the GEM object in this GPU-VM is unmapped.
>>>>> + */
>>>>> +struct drm_gpuva_gem {
>>>>> +
>>>>> +	/**
>>>>> +	 * @mgr: The &drm_gpuva_manager the @obj is mapped in.
>>>>> +	 */
>>>>> +	struct drm_gpuva_manager *mgr;
>>>>> +
>>>>> +	/**
>>>>> +	 * @obj: The &drm_gem_object being mapped in the @mgr.
>>>>> +	 */
>>>>> +	struct drm_gem_object *obj;
>>>>> +
>>>>> +	/**
>>>>> +	 * @kref: The reference count for this &drm_gpuva_gem.
>>>>> +	 */
>>>>> +	struct kref kref;
>>>>> +
>>>>> +	/**
>>>>> +	 * @list: Structure containing all &list_heads.
>>>>> +	 */
>>>>> +	struct {
>>>>> +		/**
>>>>> +		 * @gpuva: The list of linked &drm_gpuvas.
>>>>> +		 */
>>>>> +		struct list_head gpuva;
>>>>> +
>>>>> +		/**
>>>>> +		 * @entry: Structure containing all &list_heads serving as
>>>>> +		 * entry.
>>>>> +		 */
>>>>> +		struct {
>>>>> +			/**
>>>>> +			 * @gem: List entry to attach to the &drm_gem_objects
>>>>> +			 * gpuva list.
>>>>> +			 */
>>>>> +			struct list_head gem;
>>>>> +
>>>>> +			/**
>>>>> +			 * @evict: List entry to attach to the
>>>>> +			 * &drm_gpuva_managers evict list.
>>>>> +			 */
>>>>> +			struct list_head evict;
>>>>> +		} entry;
>>>>> +	} list;
>>>>> +};
>>>>> +
>>>>> +struct drm_gpuva_gem *
>>>>> +drm_gpuva_gem_obtain(struct drm_gpuva_manager *mgr,
>>>>> +		     struct drm_gem_object *obj);
>>>>> +struct drm_gpuva_gem *
>>>>> +drm_gpuva_gem_obtain_prealloc(struct drm_gpuva_manager *mgr,
>>>>> +			      struct drm_gem_object *obj,
>>>>> +			      struct drm_gpuva_gem *__vm_bo);
>>>>> +
>>>>> +struct drm_gpuva_gem *
>>>>> +drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
>>>>> +		   struct drm_gem_object *obj);
>>>>> +
>>>>> +void drm_gpuva_gem_evict(struct drm_gem_object *obj, bool evict);
>>>>> +
>>>>> +struct drm_gpuva_gem *
>>>>> +drm_gpuva_gem_create(struct drm_gpuva_manager *mgr,
>>>>> +		     struct drm_gem_object *obj);
>>>>> +void drm_gpuva_gem_destroy(struct kref *kref);
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_gem_get() - acquire a struct drm_gpuva_gem reference
>>>>> + * @vm_bo: the &drm_gpuva_gem to acquire the reference of
>>>>> + *
>>>>> + * This function acquires an additional reference to @vm_bo. It is illegal to
>>>>> + * call this without already holding a reference. No locks required.
>>>>> + */
>>>>> +static inline struct drm_gpuva_gem *
>>>>> +drm_gpuva_gem_get(struct drm_gpuva_gem *vm_bo)
>>>>> +{
>>>>> +	kref_get(&vm_bo->kref);
>>>>> +	return vm_bo;
>>>>> +}
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_gem_put() - drop a struct drm_gpuva_gem reference
>>>>> + * @vm_bo: the &drm_gpuva_gem to release the reference of
>>>>> + *
>>>>> + * This releases a reference to @vm_bo.
>>>>> + */
>>>>> +static inline void
>>>>> +drm_gpuva_gem_put(struct drm_gpuva_gem *vm_bo)
>>>>> +{
>>>>> +	kref_put(&vm_bo->kref, drm_gpuva_gem_destroy);
>>>>> +}
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_gem_for_each_va() - iterator to walk over a list of &drm_gpuva
>>>>> + * @va__: &drm_gpuva structure to assign to in each iteration step
>>>>> + * @vm_bo__: the &drm_gpuva_gem the &drm_gpuva to walk are associated with
>>>>> + *
>>>>> + * This iterator walks over all &drm_gpuva structures associated with the
>>>>> + * &drm_gpuva_gem.
>>>>> + */
>>>>> +#define drm_gpuva_gem_for_each_va(va__, vm_bo__) \
>>>>> +	list_for_each_entry(va__, &(vm_bo)->list.gpuva, gem.entry)
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_gem_for_each_va_safe() - iterator to safely walk over a list of
>>>>> + * &drm_gpuva
>>>>> + * @va__: &drm_gpuva structure to assign to in each iteration step
>>>>> + * @next__: &next &drm_gpuva to store the next step
>>>>> + * @vm_bo__: the &drm_gpuva_gem the &drm_gpuva to walk are associated with
>>>>> + *
>>>>> + * This iterator walks over all &drm_gpuva structures associated with the
>>>>> + * &drm_gpuva_gem. It is implemented with list_for_each_entry_safe(), hence
>>>>> + * it is safe against removal of elements.
>>>>> + */
>>>>> +#define drm_gpuva_gem_for_each_va_safe(va__, next__, vm_bo__) \
>>>>> +	list_for_each_entry_safe(va__, next__, &(vm_bo)->list.gpuva, gem.entry)
>>>>> +
>>>>>     /**
>>>>>      * enum drm_gpuva_op_type - GPU VA operation type
>>>>>      *
>>>>> @@ -641,6 +899,30 @@ struct drm_gpuva_fn_ops {
>>>>>     	 */
>>>>>     	void (*op_free)(struct drm_gpuva_op *op);
>>>>> +	/**
>>>>> +	 * @vm_bo_alloc: called when the &drm_gpuva_manager allocates
>>>>> +	 * a struct drm_gpuva_gem
>>>>> +	 *
>>>>> +	 * Some drivers may want to embed struct drm_gpuva_gem into driver
>>>>> +	 * specific structures. By implementing this callback drivers can
>>>>> +	 * allocate memory accordingly.
>>>>> +	 *
>>>>> +	 * This callback is optional.
>>>>> +	 */
>>>>> +	struct drm_gpuva_gem *(*vm_bo_alloc)(void);
>>>>> +
>>>>> +	/**
>>>>> +	 * @vm_bo_free: called when the &drm_gpuva_manager frees a
>>>>> +	 * struct drm_gpuva_gem
>>>>> +	 *
>>>>> +	 * Some drivers may want to embed struct drm_gpuva_gem into driver
>>>>> +	 * specific structures. By implementing this callback drivers can
>>>>> +	 * free the previously allocated memory accordingly.
>>>>> +	 *
>>>>> +	 * This callback is optional.
>>>>> +	 */
>>>>> +	void (*vm_bo_free)(struct drm_gpuva_gem *vm_bo);
>>>>> +
>>>>>     	/**
>>>>>     	 * @sm_step_map: called from &drm_gpuva_sm_map to finally insert the
>>>>>     	 * mapping once all previous steps were completed
>>>>> @@ -684,6 +966,17 @@ struct drm_gpuva_fn_ops {
>>>>>     	 * used.
>>>>>     	 */
>>>>>     	int (*sm_step_unmap)(struct drm_gpuva_op *op, void *priv);
>>>>> +
>>>>> +	/**
>>>>> +	 * @bo_validate: called from drm_gpuva_manager_validate()
>>>>> +	 *
>>>>> +	 * Drivers receive this callback for every evicted &drm_gem_object being
>>>>> +	 * mapped in the corresponding &drm_gpuva_manager.
>>>>> +	 *
>>>>> +	 * Typically, drivers would call their driver specific variant of
>>>>> +	 * ttm_bo_validate() from within this callback.
>>>>> +	 */
>>>>> +	int (*bo_validate)(struct drm_gem_object *obj);
>>>>>     };
>>>>>     int drm_gpuva_sm_map(struct drm_gpuva_manager *mgr, void *priv,
>>>>> @@ -696,11 +989,18 @@ int drm_gpuva_sm_unmap(struct drm_gpuva_manager *mgr, void *priv,
>>>>>     void drm_gpuva_map(struct drm_gpuva_manager *mgr,
>>>>>     		   struct drm_gpuva *va,
>>>>>     		   struct drm_gpuva_op_map *op);
>>>>> +void drm_gpuva_map_get(struct drm_gpuva_manager *mgr,
>>>>> +		       struct drm_gpuva *va,
>>>>> +		       struct drm_gpuva_op_map *op);
>>>>>     void drm_gpuva_remap(struct drm_gpuva *prev,
>>>>>     		     struct drm_gpuva *next,
>>>>>     		     struct drm_gpuva_op_remap *op);
>>>>> +void drm_gpuva_remap_get(struct drm_gpuva *prev,
>>>>> +			 struct drm_gpuva *next,
>>>>> +			 struct drm_gpuva_op_remap *op);
>>>>>     void drm_gpuva_unmap(struct drm_gpuva_op_unmap *op);
>>>>> +void drm_gpuva_unmap_put(struct drm_gpuva_op_unmap *op);
>>>>>     #endif /* __DRM_GPUVA_MGR_H__ */

^ permalink raw reply	[flat|nested] 88+ messages in thread

* Re: [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
@ 2023-08-31  9:04             ` Thomas Hellström (Intel)
  0 siblings, 0 replies; 88+ messages in thread
From: Thomas Hellström (Intel) @ 2023-08-31  9:04 UTC (permalink / raw)
  To: Danilo Krummrich
  Cc: matthew.brost, thomas.hellstrom, sarah.walker, nouveau,
	dri-devel, linux-kernel, Liam.Howlett, boris.brezillon,
	donald.robson, christian.koenig, faith.ekstrand, bskeggs

Hi!

On 8/30/23 17:00, Danilo Krummrich wrote:
> On Wed, Aug 30, 2023 at 03:42:08PM +0200, Thomas Hellström (Intel) wrote:
>> On 8/30/23 14:49, Danilo Krummrich wrote:
>>> Hi Thomas,
>>>
>>> thanks for having a look!
>>>
>>> On Wed, Aug 30, 2023 at 09:27:45AM +0200, Thomas Hellström (Intel) wrote:
>>>> Hi, Danilo.
>>>>
>>>> Some quick comments since I'm doing some Xe work in this area. Will probably
>>>> get back with more.
>>>>
>>>> On 8/20/23 23:53, Danilo Krummrich wrote:
>>>>> So far the DRM GPUVA manager offers common infrastructure to track GPU VA
>>>>> allocations and mappings, generically connect GPU VA mappings to their
>>>>> backing buffers and perform more complex mapping operations on the GPU VA
>>>>> space.
>>>>>
>>>>> However, there are more design patterns commonly used by drivers, which
>>>>> can potentially be generalized in order to make the DRM GPUVA manager
>>>>> represent a basic GPU-VM implementation. In this context, this patch aims
>>>>> at generalizing the following elements.
>>>>>
>>>>> 1) Provide a common dma-resv for GEM objects not being used outside of
>>>>>       this GPU-VM.
>>>>>
>>>>> 2) Provide tracking of external GEM objects (GEM objects which are
>>>>>       shared with other GPU-VMs).
>>>>>
>>>>> 3) Provide functions to efficiently lock all GEM objects dma-resv the
>>>>>       GPU-VM contains mappings of.
>>>>>
>>>>> 4) Provide tracking of evicted GEM objects the GPU-VM contains mappings
>>>>>       of, such that validation of evicted GEM objects is accelerated.
>>>>>
>>>>> 5) Provide some convenience functions for common patterns.
>>>>>
>>>>> Rather than being designed as a "framework", the target is to make all
>>>>> features appear as a collection of optional helper functions, such that
>>>>> drivers are free to make use of the DRM GPUVA managers basic
>>>>> functionality and opt-in for other features without setting any feature
>>>>> flags, just by making use of the corresponding functions.
>>>>>
>>>>> Signed-off-by: Danilo Krummrich <dakr@redhat.com>
>>>>> ---
>>>>>     drivers/gpu/drm/drm_gpuva_mgr.c | 688 +++++++++++++++++++++++++++++++-
>>>>>     include/drm/drm_gem.h           |  48 ++-
>>>>>     include/drm/drm_gpuva_mgr.h     | 302 +++++++++++++-
>>>>>     3 files changed, 1010 insertions(+), 28 deletions(-)
>>>>>
>>>>> diff --git a/drivers/gpu/drm/drm_gpuva_mgr.c b/drivers/gpu/drm/drm_gpuva_mgr.c
>>>>> index f86bfad74ff8..69872b205961 100644
>>>>> --- a/drivers/gpu/drm/drm_gpuva_mgr.c
>>>>> +++ b/drivers/gpu/drm/drm_gpuva_mgr.c
>>>>> @@ -655,6 +655,7 @@ drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
>>>>>     /**
>>>>>      * drm_gpuva_manager_init() - initialize a &drm_gpuva_manager
>>>>>      * @mgr: pointer to the &drm_gpuva_manager to initialize
>>>>> + * @drm: the drivers &drm_device
>>>>>      * @name: the name of the GPU VA space
>>>>>      * @start_offset: the start offset of the GPU VA space
>>>>>      * @range: the size of the GPU VA space
>>>>> @@ -669,6 +670,7 @@ drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
>>>>>      */
>>>>>     void
>>>>>     drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
>>>>> +		       struct drm_device *drm,
>>>>>     		       const char *name,
>>>>>     		       u64 start_offset, u64 range,
>>>>>     		       u64 reserve_offset, u64 reserve_range,
>>>>> @@ -677,6 +679,11 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
>>>>>     	mgr->rb.tree = RB_ROOT_CACHED;
>>>>>     	INIT_LIST_HEAD(&mgr->rb.list);
>>>>> +	mt_init(&mgr->mt_ext);
>>>>> +
>>>>> +	INIT_LIST_HEAD(&mgr->evict.list);
>>>>> +	spin_lock_init(&mgr->evict.lock);
>>>>> +
>>>>>     	drm_gpuva_check_overflow(start_offset, range);
>>>>>     	mgr->mm_start = start_offset;
>>>>>     	mgr->mm_range = range;
>>>>> @@ -694,6 +701,9 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
>>>>>     						     reserve_range)))
>>>>>     			__drm_gpuva_insert(mgr, &mgr->kernel_alloc_node);
>>>>>     	}
>>>>> +
>>>>> +	drm_gem_private_object_init(drm, &mgr->d_obj, 0);
>>>>> +	mgr->resv = mgr->d_obj.resv;
>>>>>     }
>>>>>     EXPORT_SYMBOL_GPL(drm_gpuva_manager_init);
>>>>> @@ -713,10 +723,575 @@ drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr)
>>>>>     		__drm_gpuva_remove(&mgr->kernel_alloc_node);
>>>>>     	WARN(!RB_EMPTY_ROOT(&mgr->rb.tree.rb_root),
>>>>> -	     "GPUVA tree is not empty, potentially leaking memory.");
>>>>> +	     "GPUVA tree is not empty, potentially leaking memory.\n");
>>>>> +
>>>>> +	mtree_destroy(&mgr->mt_ext);
>>>>> +	WARN(!list_empty(&mgr->evict.list), "Evict list should be empty.\n");
>>>>> +
>>>>> +	drm_gem_private_object_fini(&mgr->d_obj);
>>>>>     }
>>>>>     EXPORT_SYMBOL_GPL(drm_gpuva_manager_destroy);
>>>>> +/**
>>>>> + * drm_gpuva_manager_prepare_objects() - prepare all associated BOs
>>>>> + * @mgr: the &drm_gpuva_manager
>>>>> + * @num_fences: the amount of &dma_fences to reserve
>>>>> + *
>>>>> + * Calls drm_exec_prepare_obj() for all &drm_gem_objects the given
>>>>> + * &drm_gpuva_manager contains mappings of.
>>>>> + *
>>>>> + * Drivers can obtain the corresponding &drm_exec instance through
>>>>> + * DRM_GPUVA_EXEC(). It is the drivers responsibility to call drm_exec_init()
>>>>> + * and drm_exec_fini() accordingly.
>>>>> + *
>>>>> + * Returns: 0 on success, negative error code on failure.
>>>>> + */
>>>>> +int
>>>>> +drm_gpuva_manager_prepare_objects(struct drm_gpuva_manager *mgr,
>>>>> +				  unsigned int num_fences)
>>>>> +{
>>>>> +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
>>>>> +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
>>>>> +	union {
>>>>> +		void *ptr;
>>>>> +		uintptr_t cnt;
>>>>> +	} ref;
>>>>> +	int ret;
>>>>> +
>>>>> +	ret = drm_exec_prepare_obj(exec, &mgr->d_obj, num_fences);
>>>>> +	if (ret)
>>>>> +		goto out;
>>>>> +
>>>>> +	rcu_read_lock();
>>>> In xe we're protecting the external object list with an outer lock (same as
>>>> protecting the mgr itself). Do we need a separate lock for this? In theory,
>>>> as outlined in the VM_BIND locking document draft, one could probably even
>>>> use the mgr resv for this, but with more complicated code I guess. Also see
>>>> the comment below about the data structure chosen.
>>> The idea is to protect this list with the GPU-VM lock. The locking here is more
>>> of an implication of the maple tree. Either you use the internal lock of the
>>> maple tree or RCU respectively, or you give the maple tree an external lock to
>>> perform lockdep checks on (mt_set_external_lock()). Basically same as here:
>>>
>>> https://elixir.bootlin.com/linux/latest/source/drivers/base/regmap/regcache-maple.c#L124
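
(As a minimal sketch, assuming a hypothetical driver-side struct my_vm that
embeds the manager and a GPU-VM mutex, the externally locked variant would look
roughly like the below; mt_set_external_lock() only wires up the lockdep
annotation:)

static void my_vm_init_extobj_tree(struct my_vm *vm)
{
	/* Let lockdep check the tree against the outer GPU-VM lock
	 * instead of relying on the internal spinlock / RCU. */
	mt_init_flags(&vm->mgr.mt_ext, MT_FLAGS_LOCK_EXTERN);
	mt_set_external_lock(&vm->mgr.mt_ext, &vm->lock);
}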
>> Ah, I suspected it was something along those lines.
>>
>>
>>>>> +	mas_for_each(&mas, ref.ptr, ULONG_MAX) {
>>>>> +		struct drm_gem_object *obj;
>>>>> +
>>>>> +		mas_pause(&mas);
>>>>> +		rcu_read_unlock();
>>>>> +
>>>>> +		obj = (struct drm_gem_object *)(uintptr_t)mas.index;
>>>>> +		ret = drm_exec_prepare_obj(exec, obj, num_fences);
>>>>> +		if (ret)
>>>>> +			goto out;
>>>>> +
>>>>> +		rcu_read_lock();
>>>>> +	}
>>>>> +	rcu_read_unlock();
>>>>> +
>>>>> +out:
>>>>> +	return ret;
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_prepare_objects);
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_manager_lock_extra() - lock all dma-resv of all associated BOs
>>>>> + * @mgr: the &drm_gpuva_manager
>>>>> + * @fn: callback received by the driver to lock additional dma-resv
>>>>> + * @priv: private driver data passed to @fn
>>>>> + * @num_fences: the amount of &dma_fences to reserve
>>>>> + * @interruptible: sleep interruptible if waiting
>>>>> + *
>>>>> + * Acquires all dma-resv locks of all &drm_gem_objects the given
>>>>> + * &drm_gpuva_manager contains mappings of.
>>>>> + *
>>>>> + * Additionally, when calling this function the driver receives the given @fn
>>>>> + * callback to lock additional dma-resv in the context of the
>>>>> + * &drm_gpuva_managers &drm_exec instance. Typically, drivers would call
>>>>> + * drm_exec_prepare_obj() from within this callback.
>>>>> + *
>>>>> + * Returns: 0 on success, negative error code on failure.
>>>>> + */
>>>>> +int
>>>>> +drm_gpuva_manager_lock_extra(struct drm_gpuva_manager *mgr,
>>>>> +			     int (*fn)(struct drm_gpuva_manager *mgr,
>>>>> +				       void *priv, unsigned int num_fences),
>>>>> +			     void *priv,
>>>>> +			     unsigned int num_fences,
>>>>> +			     bool interruptible)
>>>>> +{
>>>>> +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
>>>>> +	uint32_t flags;
>>>>> +	int ret;
>>>>> +
>>>>> +	flags = interruptible ? DRM_EXEC_INTERRUPTIBLE_WAIT : 0 |
>>>>> +		DRM_EXEC_IGNORE_DUPLICATES;
>>>>> +
>>>>> +	drm_exec_init(exec, flags);
>>>>> +
>>>>> +	drm_exec_until_all_locked(exec) {
>>>>> +		ret = drm_gpuva_manager_prepare_objects(mgr, num_fences);
>>>>> +		drm_exec_retry_on_contention(exec);
>>>>> +		if (ret)
>>>>> +			goto err;
>>>>> +
>>>>> +		if (fn) {
>>>>> +			ret = fn(mgr, priv, num_fences);
>>>>> +			drm_exec_retry_on_contention(exec);
>>>>> +			if (ret)
>>>>> +				goto err;
>>>>> +		}
>>>>> +	}
>>>>> +
>>>>> +	return 0;
>>>>> +
>>>>> +err:
>>>>> +	drm_exec_fini(exec);
>>>>> +	return ret;
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_lock_extra);
>>>>> +
>>>>> +static int
>>>>> +fn_lock_array(struct drm_gpuva_manager *mgr, void *priv,
>>>>> +				unsigned int num_fences)
>>>>> +{
>>>>> +	struct {
>>>>> +		struct drm_gem_object **objs;
>>>>> +		unsigned int num_objs;
>>>>> +	} *args = priv;
>>>>> +
>>>>> +	return drm_exec_prepare_array(DRM_GPUVA_EXEC(mgr), args->objs,
>>>>> +				      args->num_objs, num_fences);
>>>>> +}
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_manager_lock_array() - lock all dma-resv of all associated BOs
>>>>> + * @mgr: the &drm_gpuva_manager
>>>>> + * @objs: additional &drm_gem_objects to lock
>>>>> + * @num_objs: the number of additional &drm_gem_objects to lock
>>>>> + * @num_fences: the amount of &dma_fences to reserve
>>>>> + * @interruptible: sleep interruptible if waiting
>>>>> + *
>>>>> + * Acquires all dma-resv locks of all &drm_gem_objects the given
>>>>> + * &drm_gpuva_manager contains mappings of, plus the ones given through @objs.
>>>>> + *
>>>>> + * Returns: 0 on success, negative error code on failure.
>>>>> + */
>>>>> +int
>>>>> +drm_gpuva_manager_lock_array(struct drm_gpuva_manager *mgr,
>>>>> +			     struct drm_gem_object **objs,
>>>>> +			     unsigned int num_objs,
>>>>> +			     unsigned int num_fences,
>>>>> +			     bool interruptible)
>>>>> +{
>>>>> +	struct {
>>>>> +		struct drm_gem_object **objs;
>>>>> +		unsigned int num_objs;
>>>>> +	} args;
>>>>> +
>>>>> +	args.objs = objs;
>>>>> +	args.num_objs = num_objs;
>>>>> +
>>>>> +	return drm_gpuva_manager_lock_extra(mgr, fn_lock_array, &args,
>>>>> +					    num_fences, interruptible);
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_lock_array);
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_manager_validate() - validate all BOs marked as evicted
>>>>> + * @mgr: the &drm_gpuva_manager to validate evicted BOs
>>>>> + *
>>>>> + * Calls the &drm_gpuva_fn_ops.bo_validate callback for all evicted buffer
>>>>> + * objects being mapped in the given &drm_gpuva_manager.
>>>>> + *
>>>>> + * Returns: 0 on success, negative error code on failure.
>>>>> + */
>>>>> +int
>>>>> +drm_gpuva_manager_validate(struct drm_gpuva_manager *mgr)
>>>>> +{
>>>>> +	const struct drm_gpuva_fn_ops *ops = mgr->ops;
>>>>> +	struct drm_gpuva_gem *vm_bo;
>>>>> +	int ret;
>>>>> +
>>>>> +	if (unlikely(!ops || !ops->bo_validate))
>>>>> +		return -ENOTSUPP;
>>>>> +
>>>>> +	/* At this point we should hold all dma-resv locks of all GEM objects
>>>>> +	 * associated with this GPU-VM, hence it is safe to walk the list.
>>>>> +	 */
>>>>> +	list_for_each_entry(vm_bo, &mgr->evict.list, list.entry.evict) {
>>>>> +		dma_resv_assert_held(vm_bo->obj->resv);
>>>>> +
>>>>> +		ret = ops->bo_validate(vm_bo->obj);
>>>>> +		if (ret)
>>>>> +			return ret;
>>>>> +	}
>>>>> +
>>>>> +	return 0;
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_validate);
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_manager_resv_add_fence - add fence to private and all extobj
>>>>> + * dma-resv
>>>>> + * @mgr: the &drm_gpuva_manager to add a fence to
>>>>> + * @fence: fence to add
>>>>> + * @private_usage: private dma-resv usage
>>>>> + * @extobj_usage: extobj dma-resv usage
>>>>> + */
>>>>> +void
>>>>> +drm_gpuva_manager_resv_add_fence(struct drm_gpuva_manager *mgr,
>>>>> +				 struct dma_fence *fence,
>>>>> +				 enum dma_resv_usage private_usage,
>>>>> +				 enum dma_resv_usage extobj_usage)
>>>>> +{
>>>>> +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
>>>>> +	struct drm_gem_object *obj;
>>>>> +	unsigned long index;
>>>>> +
>>>>> +	drm_exec_for_each_locked_object(exec, index, obj) {
>>>>> +			dma_resv_assert_held(obj->resv);
>>>>> +			dma_resv_add_fence(obj->resv, fence,
>>>>> +					   drm_gpuva_is_extobj(mgr, obj) ?
>>>>> +					   private_usage : extobj_usage);
>>>>> +	}
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_resv_add_fence);
>>>>> +
>>>>> +static struct drm_gpuva_gem *
>>>>> +__drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
>>>>> +		     struct drm_gem_object *obj)
>>>>> +{
>>>>> +	struct drm_gpuva_gem *vm_bo;
>>>>> +
>>>>> +	drm_gem_gpuva_assert_lock_held(obj);
>>>>> +
>>>>> +	drm_gem_for_each_gpuva_gem(vm_bo, obj)
>>>>> +		if (vm_bo->mgr == mgr)
>>>>> +			return vm_bo;
>>>>> +
>>>>> +	return NULL;
>>>>> +}
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_gem_create() - create a new instance of struct drm_gpuva_gem
>>>>> + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
>>>>> + * @obj: The &drm_gem_object being mapped in the @mgr.
>>>>> + *
>>>>> + * If provided by the driver, this function uses the &drm_gpuva_fn_ops
>>>>> + * vm_bo_alloc() callback to allocate.
>>>>> + *
>>>>> + * Returns: a pointer to the &drm_gpuva_gem on success, NULL on failure
>>>>> + */
>>>>> +struct drm_gpuva_gem *
>>>>> +drm_gpuva_gem_create(struct drm_gpuva_manager *mgr,
>>>>> +		     struct drm_gem_object *obj)
>>>>> +{
>>>>> +	const struct drm_gpuva_fn_ops *ops = mgr->ops;
>>>>> +	struct drm_gpuva_gem *vm_bo;
>>>>> +
>>>>> +	if (ops && ops->vm_bo_alloc)
>>>>> +		vm_bo = ops->vm_bo_alloc();
>>>>> +	else
>>>>> +		vm_bo = kzalloc(sizeof(*vm_bo), GFP_KERNEL);
>>>>> +
>>>>> +	if (unlikely(!vm_bo))
>>>>> +		return NULL;
>>>>> +
>>>>> +	vm_bo->mgr = mgr;
>>>>> +	vm_bo->obj = obj;
>>>>> +
>>>>> +	kref_init(&vm_bo->kref);
>>>>> +	INIT_LIST_HEAD(&vm_bo->list.gpuva);
>>>>> +	INIT_LIST_HEAD(&vm_bo->list.entry.gem);
>>>>> +	INIT_LIST_HEAD(&vm_bo->list.entry.evict);
>>>>> +
>>>>> +	drm_gem_object_get(obj);
>>>>> +
>>>>> +	return vm_bo;
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_create);
>>>>> +
>>>>> +void
>>>>> +drm_gpuva_gem_destroy(struct kref *kref)
>>>>> +{
>>>>> +	struct drm_gpuva_gem *vm_bo = container_of(kref, struct drm_gpuva_gem,
>>>>> +						   kref);
>>>>> +	const struct drm_gpuva_fn_ops *ops = vm_bo->mgr->ops;
>>>>> +
>>>>> +	drm_gem_object_put(vm_bo->obj);
>>>>> +
>>>>> +	if (ops && ops->vm_bo_free)
>>>>> +		ops->vm_bo_free(vm_bo);
>>>>> +	else
>>>>> +		kfree(vm_bo);
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_destroy);
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_gem_find() - find the &drm_gpuva_gem for the given
>>>>> + * &drm_gpuva_manager and &drm_gem_object
>>>>> + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
>>>>> + * @obj: The &drm_gem_object being mapped in the @mgr.
>>>>> + *
>>>>> + * Find the &drm_gpuva_gem representing the combination of the given
>>>>> + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
>>>>> + * count of the &drm_gpuva_gem accordingly.
>>>>> + *
>>>>> + * Returns: a pointer to the &drm_gpuva_gem on success, NULL on failure
>>>>> + */
>>>>> +struct drm_gpuva_gem *
>>>>> +drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
>>>>> +		   struct drm_gem_object *obj)
>>>>> +{
>>>>> +	struct drm_gpuva_gem *vm_bo = __drm_gpuva_gem_find(mgr, obj);
>>>>> +
>>>>> +	return vm_bo ? drm_gpuva_gem_get(vm_bo) : NULL;
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_find);
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_gem_obtain() - obtains an instance of the &drm_gpuva_gem for the
>>>>> + * given &drm_gpuva_manager and &drm_gem_object
>>>>> + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
>>>>> + * @obj: The &drm_gem_object being mapped in the @mgr.
>>>>> + *
>>>>> + * Find the &drm_gpuva_gem representing the combination of the given
>>>>> + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
>>>>> + * count of the &drm_gpuva_gem accordingly. If not found, allocates a new
>>>>> + * &drm_gpuva_gem.
>>>>> + *
>>>>> + * Returns: a pointer to the &drm_gpuva_gem on success, an ERR_PTR on failure
>>>>> + */
>>>>> +struct drm_gpuva_gem *
>>>>> +drm_gpuva_gem_obtain(struct drm_gpuva_manager *mgr,
>>>>> +		     struct drm_gem_object *obj)
>>>>> +{
>>>>> +	struct drm_gpuva_gem *vm_bo;
>>>>> +
>>>>> +	vm_bo = drm_gpuva_gem_find(mgr, obj);
>>>>> +	if (vm_bo)
>>>>> +		return vm_bo;
>>>>> +
>>>>> +	vm_bo = drm_gpuva_gem_create(mgr, obj);
>>>>> +	if (!vm_bo)
>>>>> +		return ERR_PTR(-ENOMEM);
>>>>> +
>>>>> +	return vm_bo;
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_obtain);
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_gem_obtain_prealloc() - obtains an instance of the &drm_gpuva_gem
>>>>> + * for the given &drm_gpuva_manager and &drm_gem_object
>>>>> + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
>>>>> + * @obj: The &drm_gem_object being mapped in the @mgr.
>>>>> + *
>>>>> + * Find the &drm_gpuva_gem representing the combination of the given
>>>>> + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
>>>>> + * count of the found &drm_gpuva_gem accordingly, while the @__vm_bo reference
>>>>> + * count is decreased. If not found @__vm_bo is returned.
>>>>> + *
>>>>> + * Returns: a pointer to the found &drm_gpuva_gem or @__vm_bo if no existing
>>>>> + * &drm_gpuva_gem was found
>>>>> + */
>>>>> +struct drm_gpuva_gem *
>>>>> +drm_gpuva_gem_obtain_prealloc(struct drm_gpuva_manager *mgr,
>>>>> +			      struct drm_gem_object *obj,
>>>>> +			      struct drm_gpuva_gem *__vm_bo)
>>>>> +{
>>>>> +	struct drm_gpuva_gem *vm_bo;
>>>>> +
>>>>> +	vm_bo = drm_gpuva_gem_find(mgr, obj);
>>>>> +	if (vm_bo) {
>>>>> +		drm_gpuva_gem_put(__vm_bo);
>>>>> +		return vm_bo;
>>>>> +	}
>>>>> +
>>>>> +	return __vm_bo;
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_obtain_prealloc);
>>>>> +
>>>>> +static int
>>>>> +__drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
>>>>> +			  struct drm_gem_object *obj,
>>>>> +			  gfp_t gfp)
>>>>> +{
>>>>> +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
>>>>> +	union {
>>>>> +		struct drm_gem_object *obj;
>>>>> +		uintptr_t index;
>>>>> +	} gem;
>>>>> +	union {
>>>>> +		void *ptr;
>>>>> +		uintptr_t cnt;
>>>>> +	} ref;
>>>>> +	int ret = 0;
>>>>> +
>>>>> +	gem.obj = obj;
>>>>> +	mas_set(&mas, gem.index);
>>>>> +
>>>>> +	mas_lock(&mas);
>>>>> +	ref.ptr = mas_walk(&mas);
>>>>> +	if (ref.ptr) {
>>>>> +		++ref.cnt;
>>>>> +		mas_store(&mas, ref.ptr);
>>>>> +	} else {
>>>>> +		if (unlikely(!gfp)) {
>>>>> +			ret = -EINVAL;
>>>>> +			goto out;
>>>>> +		}
>>>>> +
>>>>> +		mas_set(&mas, gem.index);
>>>>> +		ref.cnt = 1;
>>>>> +		ret = mas_store_gfp(&mas, ref.ptr, gfp);
>>>>> +		if (likely(!ret))
>>>>> +			drm_gem_object_get(obj);
>>>>> +	}
>>>>> +out:
>>>>> +	mas_unlock(&mas);
>>>>> +	return ret;
>>>>> +}
>>>>> +
>>>>> +static void
>>>>> +__drm_gpuva_extobj_remove(struct drm_gpuva_manager *mgr,
>>>>> +			  struct drm_gem_object *obj)
>>>>> +{
>>>>> +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
>>>>> +	union {
>>>>> +		struct drm_gem_object *obj;
>>>>> +		uintptr_t index;
>>>>> +	} gem;
>>>>> +	union {
>>>>> +		void *ptr;
>>>>> +		uintptr_t cnt;
>>>>> +	} ref;
>>>>> +
>>>>> +	gem.obj = obj;
>>>>> +	mas_set(&mas, gem.index);
>>>>> +
>>>>> +	mas_lock(&mas);
>>>>> +	if (unlikely(!(ref.ptr = mas_walk(&mas))))
>>>>> +		goto out;
>>>>> +
>>>>> +	if (!--ref.cnt) {
>>>>> +		mas_erase(&mas);
>>>>> +		drm_gem_object_put(obj);
>>>>> +	} else {
>>>>> +		mas_store(&mas, ref.ptr);
>>>>> +	}
>>>>> +out:
>>>>> +	mas_unlock(&mas);
>>>>> +}
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_extobj_insert - insert an external &drm_gem_object
>>>>> + * @mgr: the &drm_gpuva_manager to insert into
>>>>> + * @obj: the &drm_gem_object to insert as extobj
>>>>> + *
>>>>> + * Insert a &drm_gem_object into the &drm_gpuva_managers external object tree.
>>>>> + * If the &drm_gem_object already exists in the tree, the reference counter
>>>>> + * of this external object is increased by one.
>>>>> + *
>>>>> + * Drivers should insert the external &drm_gem_object before the dma-fence
>>>>> + * signalling critical section, e.g. when submitting the job, and before
>>>>> + * locking all &drm_gem_objects of a GPU-VM, e.g. with drm_gpuva_manager_lock()
>>>>> + * or its derivatives.
>>>>> + *
>>>>> + * Returns: 0 on success, negative error code on failure.
>>>>> + */
>>>>> +int
>>>>> +drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
>>>>> +			struct drm_gem_object *obj)
>>>>> +{
>>>>> +	return drm_gpuva_is_extobj(mgr, obj) ?
>>>>> +		__drm_gpuva_extobj_insert(mgr, obj, GFP_KERNEL) : 0;
>>>>> +
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_insert);
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_extobj_get - increase the reference count of an external
>>>>> + * &drm_gem_object
>>>>> + * @mgr: the &drm_gpuva_manager storing the extobj
>>>>> + * @obj: the &drm_gem_object representing the extobj
>>>>> + *
>>>>> + * Increases the reference count of the extobj represented by @obj.
>>>>> + *
>>>>> + * Drivers should call this for every &drm_gpuva backed by a &drm_gem_object
>>>>> + * being inserted.
>>>>> + *
>>>>> + * For &drm_gpuva_op_remap operations drivers should make sure to only take an
>>>>> + * additional reference if the re-map operation splits an existing &drm_gpuva
>>>>> + * into two separate ones.
>>>>> + *
>>>>> + * See also drm_gpuva_map_get() and drm_gpuva_remap_get().
>>>>> + *
>>>>> + * Returns: 0 on success, negative error code on failure.
>>>>> + */
>>>>> +void
>>>>> +drm_gpuva_extobj_get(struct drm_gpuva_manager *mgr,
>>>>> +		     struct drm_gem_object *obj)
>>>>> +{
>>>>> +	if (drm_gpuva_is_extobj(mgr, obj))
>>>>> +		WARN(__drm_gpuva_extobj_insert(mgr, obj, 0),
>>>>> +		     "Can't increase ref-count of non-existent extobj.");
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_get);
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_extobj_put - decrease the reference count of an external
>>>>> + * &drm_gem_object
>>>>> + * @mgr: the &drm_gpuva_manager storing the extobj
>>>>> + * @obj: the &drm_gem_object representing the extobj
>>>>> + *
>>>>> + * Decreases the reference count of the extobj represented by @obj.
>>>>> + *
>>>>> + * Drivers should call this for every &drm_gpuva backed by a &drm_gem_object
>>>>> + * being removed from the GPU VA space.
>>>>> + *
>>>>> + * See also drm_gpuva_unmap_put().
>>>>> + *
>>>>> + * Returns: 0 on success, negative error code on failure.
>>>>> + */
>>>>> +void
>>>>> +drm_gpuva_extobj_put(struct drm_gpuva_manager *mgr,
>>>>> +		     struct drm_gem_object *obj)
>>>>> +{
>>>>> +	if (drm_gpuva_is_extobj(mgr, obj))
>>>>> +		__drm_gpuva_extobj_remove(mgr, obj);
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_put);
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_gem_evict() - add / remove a &drm_gem_object to / from a
>>>>> + * &drm_gpuva_managers evicted list
>>>>> + * @obj: the &drm_gem_object to add or remove
>>>>> + * @evict: indicates whether the object is evicted
>>>>> + *
>>>>> + * Adds a &drm_gem_object to or removes it from all &drm_gpuva_managers evicted
>>>>> + * list containing a mapping of this &drm_gem_object.
>>>>> + */
>>>>> +void
>>>>> +drm_gpuva_gem_evict(struct drm_gem_object *obj, bool evict)
>>>>> +{
>>>>> +	struct drm_gpuva_gem *vm_bo;
>>>>> +
>>>>> +	/* Required for iterating the GEMs GPUVA GEM list. If no driver specific
>>>>> +	 * lock has been set, the list is protected with the GEMs dma-resv lock.
>>>>> +	 */
>>>>> +	drm_gem_gpuva_assert_lock_held(obj);
>>>>> +
>>>>> +	/* Required to protect the GPUVA managers evict list against concurrent
>>>>> +	 * access through drm_gpuva_manager_validate(). Concurrent insertions to
>>>>> +	 * the evict list through different GEM object evictions are protected
>>>>> +	 * by the GPUVA managers evict lock.
>>>>> +	 */
>>>>> +	dma_resv_assert_held(obj->resv);
>>>>> +
>>>>> +	drm_gem_for_each_gpuva_gem(vm_bo, obj) {
>>>>> +		struct drm_gpuva_manager *mgr = vm_bo->mgr;
>>>>> +
>>>>> +		spin_lock(&mgr->evict.lock);
>>>>> +		if (evict)
>>>>> +			list_add_tail(&vm_bo->list.entry.evict,
>>>>> +				      &mgr->evict.list);
>>>>> +		else
>>>>> +			list_del_init(&vm_bo->list.entry.evict);
>>>>> +		spin_unlock(&mgr->evict.lock);
>>>>> +	}
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_evict);
>>>>> +
>>>>>     static int
>>>>>     __drm_gpuva_insert(struct drm_gpuva_manager *mgr,
>>>>>     		   struct drm_gpuva *va)
>>>>> @@ -806,15 +1381,20 @@ EXPORT_SYMBOL_GPL(drm_gpuva_remove);
>>>>>     /**
>>>>>      * drm_gpuva_link() - link a &drm_gpuva
>>>>>      * @va: the &drm_gpuva to link
>>>>> + * @vm_bo: the &drm_gpuva_gem to add the &drm_gpuva to
>>>>>      *
>>>>> - * This adds the given &va to the GPU VA list of the &drm_gem_object it is
>>>>> - * associated with.
>>>>> + * This adds the given &va to the GPU VA list of the &drm_gpuva_gem and the
>>>>> + * &drm_gpuva_gem to the &drm_gem_object it is associated with.
>>>>> + *
>>>>> + * For every &drm_gpuva entry added to the &drm_gpuva_gem an additional
>>>>> + * reference of the latter is taken.
>>>>>      *
>>>>>      * This function expects the caller to protect the GEM's GPUVA list against
>>>>> - * concurrent access using the GEMs dma_resv lock.
>>>>> + * concurrent access using either the GEMs dma_resv lock or a driver specific
>>>>> + * lock set through drm_gem_gpuva_set_lock().
>>>>>      */
>>>>>     void
>>>>> -drm_gpuva_link(struct drm_gpuva *va)
>>>>> +drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo)
>>>>>     {
>>>>>     	struct drm_gem_object *obj = va->gem.obj;
>>>>> @@ -823,7 +1403,10 @@ drm_gpuva_link(struct drm_gpuva *va)
>>>>>     	drm_gem_gpuva_assert_lock_held(obj);
>>>>> -	list_add_tail(&va->gem.entry, &obj->gpuva.list);
>>>>> +	drm_gpuva_gem_get(vm_bo);
>>>>> +	list_add_tail(&va->gem.entry, &vm_bo->list.gpuva);
>>>>> +	if (list_empty(&vm_bo->list.entry.gem))
>>>>> +		list_add_tail(&vm_bo->list.entry.gem, &obj->gpuva.list);
>>>>>     }
>>>>>     EXPORT_SYMBOL_GPL(drm_gpuva_link);
>>>>> @@ -834,20 +1417,39 @@ EXPORT_SYMBOL_GPL(drm_gpuva_link);
>>>>>      * This removes the given &va from the GPU VA list of the &drm_gem_object it is
>>>>>      * associated with.
>>>>>      *
>>>>> + * This removes the given &va from the GPU VA list of the &drm_gpuva_gem and
>>>>> + * the &drm_gpuva_gem from the &drm_gem_object it is associated with in case
>>>>> + * this call unlinks the last &drm_gpuva from the &drm_gpuva_gem.
>>>>> + *
>>>>> + * For every &drm_gpuva entry removed from the &drm_gpuva_gem a reference of
>>>>> + * the latter is dropped.
>>>>> + *
>>>>>      * This function expects the caller to protect the GEM's GPUVA list against
>>>>> - * concurrent access using the GEMs dma_resv lock.
>>>>> + * concurrent access using either the GEMs dma_resv lock or a driver specific
>>>>> + * lock set through drm_gem_gpuva_set_lock().
>>>>>      */
>>>>>     void
>>>>>     drm_gpuva_unlink(struct drm_gpuva *va)
>>>>>     {
>>>>>     	struct drm_gem_object *obj = va->gem.obj;
>>>>> +	struct drm_gpuva_gem *vm_bo;
>>>>>     	if (unlikely(!obj))
>>>>>     		return;
>>>>>     	drm_gem_gpuva_assert_lock_held(obj);
>>>>> +	vm_bo = __drm_gpuva_gem_find(va->mgr, obj);
>>>>> +	if (WARN(!vm_bo, "GPUVA doesn't seem to be linked.\n"))
>>>>> +		return;
>>>>> +
>>>>>     	list_del_init(&va->gem.entry);
>>>>> +
>>>>> +	if (list_empty(&vm_bo->list.gpuva)) {
>>>>> +		list_del_init(&vm_bo->list.entry.gem);
>>>>> +		list_del_init(&vm_bo->list.entry.evict);
>>>>> +	}
>>>>> +	drm_gpuva_gem_put(vm_bo);
>>>>>     }
>>>>>     EXPORT_SYMBOL_GPL(drm_gpuva_unlink);
>>>>> @@ -977,6 +1579,26 @@ drm_gpuva_map(struct drm_gpuva_manager *mgr,
>>>>>     }
>>>>>     EXPORT_SYMBOL_GPL(drm_gpuva_map);
>>>>> +/**
>>>>> + * drm_gpuva_map_get() - helper to insert a &drm_gpuva according to a
>>>>> + * &drm_gpuva_op_map
>>>>> + * @mgr: the &drm_gpuva_manager
>>>>> + * @va: the &drm_gpuva to insert
>>>>> + * @op: the &drm_gpuva_op_map to initialize @va with
>>>>> + *
>>>>> + * Initializes the @va from the @op and inserts it into the given @mgr and
>>>>> + * increases the reference count of the corresponding extobj.
>>>>> + */
>>>>> +void
>>>>> +drm_gpuva_map_get(struct drm_gpuva_manager *mgr,
>>>>> +		  struct drm_gpuva *va,
>>>>> +		  struct drm_gpuva_op_map *op)
>>>>> +{
>>>>> +	drm_gpuva_map(mgr, va, op);
>>>>> +	drm_gpuva_extobj_get(mgr, va->gem.obj);
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_map_get);
>>>>> +
>>>>>     /**
>>>>>      * drm_gpuva_remap() - helper to remap a &drm_gpuva according to a
>>>>>      * &drm_gpuva_op_remap
>>>>> @@ -992,10 +1614,10 @@ drm_gpuva_remap(struct drm_gpuva *prev,
>>>>>     		struct drm_gpuva *next,
>>>>>     		struct drm_gpuva_op_remap *op)
>>>>>     {
>>>>> -	struct drm_gpuva *curr = op->unmap->va;
>>>>> -	struct drm_gpuva_manager *mgr = curr->mgr;
>>>>> +	struct drm_gpuva *va = op->unmap->va;
>>>>> +	struct drm_gpuva_manager *mgr = va->mgr;
>>>>> -	drm_gpuva_remove(curr);
>>>>> +	drm_gpuva_remove(va);
>>>>>     	if (op->prev) {
>>>>>     		drm_gpuva_init_from_op(prev, op->prev);
>>>>> @@ -1009,6 +1631,31 @@ drm_gpuva_remap(struct drm_gpuva *prev,
>>>>>     }
>>>>>     EXPORT_SYMBOL_GPL(drm_gpuva_remap);
>>>>> +/**
>>>>> + * drm_gpuva_remap_get() - helper to remap a &drm_gpuva according to a
>>>>> + * &drm_gpuva_op_remap
>>>>> + * @prev: the &drm_gpuva to remap when keeping the start of a mapping
>>>>> + * @next: the &drm_gpuva to remap when keeping the end of a mapping
>>>>> + * @op: the &drm_gpuva_op_remap to initialize @prev and @next with
>>>>> + *
>>>>> + * Removes the currently mapped &drm_gpuva and remaps it using @prev and/or
>>>>> + * @next. Additionally, if the re-map splits the existing &drm_gpuva into two
>>>>> + * separate mappings, increases the reference count of the corresponding extobj.
>>>>> + */
>>>>> +void
>>>>> +drm_gpuva_remap_get(struct drm_gpuva *prev,
>>>>> +		    struct drm_gpuva *next,
>>>>> +		    struct drm_gpuva_op_remap *op)
>>>>> +{
>>>>> +	struct drm_gpuva *va = op->unmap->va;
>>>>> +	struct drm_gpuva_manager *mgr = va->mgr;
>>>>> +
>>>>> +	drm_gpuva_remap(prev, next, op);
>>>>> +	if (op->prev && op->next)
>>>>> +		drm_gpuva_extobj_get(mgr, va->gem.obj);
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_remap_get);
>>>>> +
>>>>>     /**
>>>>>      * drm_gpuva_unmap() - helper to remove a &drm_gpuva according to a
>>>>>      * &drm_gpuva_op_unmap
>>>>> @@ -1023,6 +1670,24 @@ drm_gpuva_unmap(struct drm_gpuva_op_unmap *op)
>>>>>     }
>>>>>     EXPORT_SYMBOL_GPL(drm_gpuva_unmap);
>>>>> +/**
>>>>> + * drm_gpuva_unmap_put() - helper to remove a &drm_gpuva according to a
>>>>> + * &drm_gpuva_op_unmap
>>>>> + * @op: the &drm_gpuva_op_unmap specifying the &drm_gpuva to remove
>>>>> + *
>>>>> + * Removes the &drm_gpuva associated with the &drm_gpuva_op_unmap and decreases
>>>>> + * the reference count of the corresponding extobj.
>>>>> + */
>>>>> +void
>>>>> +drm_gpuva_unmap_put(struct drm_gpuva_op_unmap *op)
>>>>> +{
>>>>> +	struct drm_gpuva *va = op->va;
>>>>> +
>>>>> +	drm_gpuva_unmap(op);
>>>>> +	drm_gpuva_extobj_put(va->mgr, va->gem.obj);
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_unmap_put);
>>>>> +
>>>>>     static int
>>>>>     op_map_cb(const struct drm_gpuva_fn_ops *fn, void *priv,
>>>>>     	  u64 addr, u64 range,
>>>>> @@ -1663,6 +2328,7 @@ drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
>>>>>     {
>>>>>     	struct drm_gpuva_ops *ops;
>>>>>     	struct drm_gpuva_op *op;
>>>>> +	struct drm_gpuva_gem *vm_bo;
>>>>>     	struct drm_gpuva *va;
>>>>>     	int ret;
>>>>> @@ -1674,7 +2340,7 @@ drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
>>>>>     	INIT_LIST_HEAD(&ops->list);
>>>>> -	drm_gem_for_each_gpuva(va, obj) {
>>>>> +	drm_gem_for_each_gpuva(va, vm_bo, mgr, obj) {
>>>>>     		op = gpuva_op_alloc(mgr);
>>>>>     		if (!op) {
>>>>>     			ret = -ENOMEM;
>>>>> diff --git a/include/drm/drm_gem.h b/include/drm/drm_gem.h
>>>>> index bc9f6aa2f3fe..783ed3ab440d 100644
>>>>> --- a/include/drm/drm_gem.h
>>>>> +++ b/include/drm/drm_gem.h
>>>>> @@ -571,7 +571,7 @@ int drm_gem_evict(struct drm_gem_object *obj);
>>>>>      * drm_gem_gpuva_init() - initialize the gpuva list of a GEM object
>>>>>      * @obj: the &drm_gem_object
>>>>>      *
>>>>> - * This initializes the &drm_gem_object's &drm_gpuva list.
>>>>> + * This initializes the &drm_gem_object's &drm_gpuva_gem list.
>>>>>      *
>>>>>      * Calling this function is only necessary for drivers intending to support the
>>>>>      * &drm_driver_feature DRIVER_GEM_GPUVA.
>>>>> @@ -584,28 +584,44 @@ static inline void drm_gem_gpuva_init(struct drm_gem_object *obj)
>>>>>     }
>>>>>     /**
>>>>> - * drm_gem_for_each_gpuva() - iternator to walk over a list of gpuvas
>>>>> - * @entry__: &drm_gpuva structure to assign to in each iteration step
>>>>> - * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
>>>>> + * drm_gem_for_each_gpuva_gem() - iterator to walk over a list of &drm_gpuva_gem
>>>>> + * @entry__: &drm_gpuva_gem structure to assign to in each iteration step
>>>>> + * @obj__: the &drm_gem_object the &drm_gpuva_gem to walk are associated with
>>>>>      *
>>>>> - * This iterator walks over all &drm_gpuva structures associated with the
>>>>> - * &drm_gpuva_manager.
>>>>> + * This iterator walks over all &drm_gpuva_gem structures associated with the
>>>>> + * &drm_gem_object.
>>>>>      */
>>>>> -#define drm_gem_for_each_gpuva(entry__, obj__) \
>>>>> -	list_for_each_entry(entry__, &(obj__)->gpuva.list, gem.entry)
>>>>> +#define drm_gem_for_each_gpuva_gem(entry__, obj__) \
>>>>> +	list_for_each_entry(entry__, &(obj__)->gpuva.list, list.entry.gem)
>>>>>     /**
>>>>> - * drm_gem_for_each_gpuva_safe() - iternator to safely walk over a list of
>>>>> - * gpuvas
>>>>> - * @entry__: &drm_gpuva structure to assign to in each iteration step
>>>>> - * @next__: &next &drm_gpuva to store the next step
>>>>> - * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
>>>>> + * drm_gem_for_each_gpuva_gem_safe() - iterator to safely walk over a list of
>>>>> + * &drm_gpuva_gem
>>>>> + * @entry__: &drm_gpuva_gem structure to assign to in each iteration step
>>>>> + * @next__: &next &drm_gpuva_gem to store the next step
>>>>> + * @obj__: the &drm_gem_object the &drm_gpuva_gem to walk are associated with
>>>>>      *
>>>>> - * This iterator walks over all &drm_gpuva structures associated with the
>>>>> + * This iterator walks over all &drm_gpuva_gem structures associated with the
>>>>>      * &drm_gem_object. It is implemented with list_for_each_entry_safe(), hence
>>>>>      * it is save against removal of elements.
>>>>>      */
>>>>> -#define drm_gem_for_each_gpuva_safe(entry__, next__, obj__) \
>>>>> -	list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list, gem.entry)
>>>>> +#define drm_gem_for_each_gpuva_gem_safe(entry__, next__, obj__) \
>>>>> +	list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list, list.entry.gem)
>>>>> +
>>>>> +/**
>>>>> + * drm_gem_for_each_gpuva() - iterator to walk over a list of &drm_gpuva
>>>>> + * @va__: &drm_gpuva structure to assign to in each iteration step
>>>>> + * @vm_bo__: the &drm_gpuva_gem representing the @mgr__ and @obj__ combination
>>>>> + * @mgr__: the &drm_gpuva_manager the &drm_gpuvas to walk are associated with
>>>>> + * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
>>>>> + *
>>>>> + * This iterator walks over all &drm_gpuva structures associated with the
>>>>> + * &drm_gpuva_manager and &drm_gem_object.
>>>>> + */
>>>>> +#define drm_gem_for_each_gpuva(va__, vm_bo__, mgr__, obj__) \
>>>>> +	for (vm_bo__ = drm_gpuva_gem_find(mgr__, obj__), \
>>>>> +	     va__ = vm_bo__ ? list_first_entry(&vm_bo__->list.gpuva, typeof(*va__), gem.entry) : NULL; \
>>>>> +	     va__ && !list_entry_is_head(va__, &vm_bo__->list.gpuva, gem.entry); \
>>>>> +	     va__ = list_next_entry(va__, gem.entry))
>>>>>     #endif /* __DRM_GEM_H__ */
>>>>> diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h
>>>>> index ed8d50200cc3..693e2da3f425 100644
>>>>> --- a/include/drm/drm_gpuva_mgr.h
>>>>> +++ b/include/drm/drm_gpuva_mgr.h
>>>>> @@ -26,12 +26,16 @@
>>>>>      */
>>>>>     #include <linux/list.h>
>>>>> +#include <linux/dma-resv.h>
>>>>> +#include <linux/maple_tree.h>
>>>>>     #include <linux/rbtree.h>
>>>>>     #include <linux/types.h>
>>>>>     #include <drm/drm_gem.h>
>>>>> +#include <drm/drm_exec.h>
>>>>>     struct drm_gpuva_manager;
>>>>> +struct drm_gpuva_gem;
>>>>>     struct drm_gpuva_fn_ops;
>>>>>     /**
>>>>> @@ -140,7 +144,7 @@ struct drm_gpuva {
>>>>>     int drm_gpuva_insert(struct drm_gpuva_manager *mgr, struct drm_gpuva *va);
>>>>>     void drm_gpuva_remove(struct drm_gpuva *va);
>>>>> -void drm_gpuva_link(struct drm_gpuva *va);
>>>>> +void drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo);
>>>>>     void drm_gpuva_unlink(struct drm_gpuva *va);
>>>>>     struct drm_gpuva *drm_gpuva_find(struct drm_gpuva_manager *mgr,
>>>>> @@ -240,15 +244,137 @@ struct drm_gpuva_manager {
>>>>>     	 * @ops: &drm_gpuva_fn_ops providing the split/merge steps to drivers
>>>>>     	 */
>>>>>     	const struct drm_gpuva_fn_ops *ops;
>>>>> +
>>>>> +	/**
>>>>> +	 * @d_obj: Dummy GEM object; used internally to pass the GPU VMs
>>>>> +	 * dma-resv to &drm_exec.
>>>>> +	 */
>>>>> +	struct drm_gem_object d_obj;
>>>>> +
>>>>> +	/**
>>>>> +	 * @resv: the &dma_resv for &drm_gem_objects mapped in this GPU VA
>>>>> +	 * space
>>>>> +	 */
>>>>> +	struct dma_resv *resv;
>>>>> +
>>>>> +	/**
>>>>> +	 * @exec: the &drm_exec helper to lock external &drm_gem_objects
>>>>> +	 */
>>>>> +	struct drm_exec exec;
>>>>> +
>>>>> +	/**
>>>>> +	 * @mt_ext: &maple_tree storing external &drm_gem_objects
>>>>> +	 */
>>>>> +	struct maple_tree mt_ext;
>>>> Why are you using a maple tree here? Insertion and removal is O(log(n))
>>>> instead of O(1) for a list?
>>>>
>>> Having a list of drm_gem_objects directly wouldn't work, as multiple GPU-VMs
>>> could have mappings of the same extobj.
>>>
>>> I considered using the VM_BO abstraction (struct drm_gpuva_gem) as list entry
>>> instead, which also seems to be the obvious choice. However, there is a locking
>>> conflict.
>>>
>>> A drm_gem_object keeps a list of drm_gpuva_gems, while each drm_gpuva_gem keeps
>>> a list of drm_gpuvas. Both lists are either protected with the dma-resv lock of
>>> the corresponding drm_gem_object, or with an external lock provided by the
>>> driver (see drm_gem_gpuva_set_lock()). The latter is used by drivers performing
>>> changes on the GPUVA space directly from the fence signalling path.
>>>
>>> Now, similar to what drm_gpuva_link() and drm_gpuva_unlink() are doing already,
>>> we'd want to add a drm_gpuva_gem to the extobj list for the first mapping being
>>> linked and we'd want to remove it for the last one being unlinked.
>>>
>>> (Actually we'd want to add the drm_gpuva_gem object to the extobj list even
>>> before, because otherwise we'd not acquire this GEM object's dma-resv lock
>>> through drm_gpuva_manager_lock(). But that's trivial, we could do that when we
>>> create the drm_gpuva_gem, which we need to do anyway.)
>>>
>>> Anyway, we'd probably want to keep removing the drm_gpuva_gem from the extobj
>>> list from drm_gpuva_unlink() when the last mapping of this BO is unlinked. In
>>> order to do so, we'd (as discussed above) either need to hold the outer GPU-VM
>>> lock or the GPU-VMs dma-resv lock. Both would be illegal in the case
>>> drm_gpuva_unlink() is called from within the fence signalling path. For drivers
>>> like XE or Nouveau, we'd at least need to make sure to not mess up the locking
>>> hierarchy of GPU-VM lock and dma-resv lock of the corresponding BO.
>>>
>>> Considering all that, I thought it's probably better to track extobjs separate
>>> from the drm_gpuva_gem, hence the maple tree choice.
>> Hm. OK, in Xe we're having a list of the xe_vmas (drm_gpuvas) that point to
>> external objects, or in the case of multiple mappings to the same gem
>> object, only one of the drm_gpuvas is in the list. These are protected by
>> the GPU-VM lock. I don't see a problem with removing those from the fence
>> signalling path, though?
> I intentionally tried to avoid keeping a list of drm_gpuvas to track extobjs,
> since this is generic code and I don't know how many mappings of an external object
> the corresponding driver potentially creates. This could become a pretty large
> list to iterate. Another reason was that I want to keep the drm_gpuva structure
> as small as possible, hence avoiding another list_head.

Yes, the list might be pretty large, but OTOH you never iterate to 
access a single list element. When you need to iterate the whole list 
you need to do that regardless of the data structure used. As for the 
list head, it might perhaps be aliased (union) with an upcoming userptr 
list head?
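
Just to illustrate the aliasing idea (purely hypothetical field names, and
assuming a given gpuva never sits on the extobj list and a userptr list at
the same time), the extra list head in struct drm_gpuva could be something
like:

	struct drm_gpuva {
		/* ... existing members ... */
		union {
			/* entry in the VM's external object list */
			struct list_head extobj;
			/* entry in a future userptr list */
			struct list_head userptr;
		} vm_entry;
	};

so the size cost of the additional list_head would only be paid once.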

>
> Now, it sounds like in XE you're doing some kind of optimization just keeping a
> single mapping of an extobj in the list? How do you know when to remove it? What
> if the mapping from the extobj list gets unmapped, but there is still another
> one left in the GPU-VM being backed by the same BO?
When removing from the lists, we iterate through the object's list of 
vmas, and if there is one matching the same vm, we replace the old one 
with the new one. A similar iteration is done when adding to avoid 
adding one that is already on the list.
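
To sketch the pattern (pseudo-code only - my_vm / my_vma stand in for the
driver's VM and mapping structs, all field names are made up, this is not the
actual Xe code):

	/* Called with the VM lock held. */
	static void extobj_list_remove(struct my_vm *vm, struct my_vma *vma)
	{
		struct my_vma *other;

		if (list_empty(&vma->extobj_link))
			return;

		list_del_init(&vma->extobj_link);

		/* If another mapping of the same BO exists in the same VM, let
		 * it take over the slot on the VM's extobj list so the object
		 * stays tracked.
		 */
		list_for_each_entry(other, &vma->bo->vma_list, bo_link)
			if (other != vma && other->vm == vm) {
				list_add(&other->extobj_link, &vm->extobj_list);
				break;
			}
	}

The add path does the mirror image: only add the new vma if no other vma of
the same VM is on the list already.
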
> Although assuming that's a no-go for GPUVA wouldn't an XArray be a better
> choice, keeping O(1)?
> When tracking extobjs, the address of the drm_gem_object is the key while the
> reference count is the value. I was thinking of an XArray as well, but I was
> worried that the corresponding indices could be too sparsely distributed for an
> XArray to still be efficient. Now that I think about it, it's probably not that
> bad.
>
> Btw., while I agree trying to make things as efficient as possible, what is the
> magnitude of extobjs to be tracked, do we need to worry about the O(log(n))?

Not sure yet, TBH, but I think one of our UMDs can only use external 
objects, because they don't know at creation time which ones need 
exporting. However if this turns out to be too bad, there are various 
flavours of "clever but complicated" optimizations that we could think 
of to reduce the list size. Still in our case, we opted for the vma list 
head for now.
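
For completeness, the XArray variant you mention above would probably look
roughly like the below (untested sketch, using the object pointer as index
and the reference count encoded as an XArray value; mgr->xa_ext is a made-up
member and the caller is assumed to hold the outer GPU-VM lock):

	static int extobj_xa_get(struct drm_gpuva_manager *mgr,
				 struct drm_gem_object *obj)
	{
		unsigned long index = (unsigned long)obj;
		void *entry;

		entry = xa_load(&mgr->xa_ext, index);
		if (entry)
			entry = xa_mk_value(xa_to_value(entry) + 1);
		else
			entry = xa_mk_value(1);

		return xa_err(xa_store(&mgr->xa_ext, index, entry, GFP_KERNEL));
	}

Lookups stay cheap, although with pointers as indices the per-entry node
overhead is probably no better than with the maple tree.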

/Thomas


>
>>>>> +
>>>>> +	/**
>>>>> +	 * @evict: structure holding the evict list and evict list lock
>>>>> +	 */
>>>>> +	struct {
>>>>> +		/**
>>>>> +		 * @list: &list_head storing &drm_gem_objects currently being
>>>>> +		 * evicted
>>>>> +		 */
>>>>> +		struct list_head list;
>>>>> +
>>>>> +		/**
>>>>> +		 * @lock: spinlock to protect the evict list against concurrent
>>>>> +		 * insertion / removal of different &drm_gpuva_gems
>>>>> +		 */
>>>>> +		spinlock_t lock;
>>>>> +	} evict;
>>>>>     };
>>>>>     void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
>>>>> +			    struct drm_device *drm,
>>>>>     			    const char *name,
>>>>>     			    u64 start_offset, u64 range,
>>>>>     			    u64 reserve_offset, u64 reserve_range,
>>>>>     			    const struct drm_gpuva_fn_ops *ops);
>>>>>     void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr);
>>>>> +/**
>>>>> + * DRM_GPUVA_EXEC - returns the &drm_gpuva_managers &drm_exec instance
>>>>> + * @mgr: the &drm_gpuva_manager to return the &drm_exec instance for
>>>>> + */
>>>>> +#define DRM_GPUVA_EXEC(mgr)	&(mgr)->exec
>>>> A struct ww_acquire_ctx and thus a drm_exec is fundamentally per task and
>>>> should typically be allocated on the stack. Otherwise you'd need to protect
>>>> the mgr->exec member with an exclusive lock throughout the locking process,
>>>> and that's not what we want.
>>> Oh, good point. I think it works in Nouveau, because there it's implicitly
>>> protected with the job submission lock.
>>>
>>>> Did you consider subclassing a drm_exec for drm_gpuva purposes and add
>>>> needed ops to it: Like so:
>>> That's a good idea, will take this into V2.
>> Actually, I'm not fully sure that was a good idea: I now have a working
>> version of Xe ported over to drm_exec, having these helpers in mind and with
>> the intention to start using them as they mature. What I found, though is
>> that open-coding the drm_exec loop is not all that bad, but that building
>> blocks that can be called from within the loop are useful:
>>
>> Like drm_gpuva_prepare_objects() and an imaginary
>> drm_gpuva_prepare_gpuva() that locks the vm resv and the resv of the object
>> the gpuva points to (if different). And
>> drm_gpuva_prepare_array() although we don't use it within Xe. That means you
>> can use these building blocks like helpers and avoid the fn() callback by
>> instead open-coding.
>>
>> But I guess YMMV.
> That's exactly why those building blocks are exported, I already had in mind
> that there might be drivers which still want to open-code the drm_exec loop,
> while others might just want a simple interface to lock everything.
>
> I still think it is a good idea, but I'd keep that as simple as possible. And
> for everything else just let the driver open-code it and use the "building
> blocks" - will also expand the bulding blocks to what you mentioned above.
>
>>>> struct drm_gpuva_exec_ops {
>>>>       int (*fn) (struct drm_gpuva_exec *exec, int num_fences);
>>> Is this the fn argument from drm_gpuva_manager_lock_extra()?
>>>
>>>>       int (*bo_validate) (struct drm_gpuva_exec *exec, struct drm_gem_object
>>>> *obj);
>>> I guess we could also keep that within the drm_gpuva_fn_ops? This should always
>>> be the same callback, right?
>>>
>>>> };
>>>>
>>>> struct drm_gpuva_exec {
>>>>       const struct drm_gpuva_exec_ops *ops;
>>>>       struct drm_exec exec;
>>>>       struct drm_gpuva_manager *mgr;
>>>> };
>>>>
>>>> Although I'd actually expect bo_validate to be part of fn in the typical
>>>> case. The drm_gpuva_exec would then be allocated by the caller on the stack.
>>> This doesn't sound like my assumption about fn() above is correct.
>> Well, one important thing in our conversion is that ttm_bo_validate() needs
>> to be in the until_all_locked() loop. We want to soon be able to use
>> sleeping locks for eviction, so an xe_bo_validate() would, at least
>> temporarily, add locked objects to the drm_exec list of locked objects. That
>> means everything that may end up calling validate deep within the call chain
>> needs to be part of the until_all_locked() loop, so our
>> drm_gpuva_manager_lock_extra() fn callback would include those validates and
>> look different all the time. Hence that's why open-coding isn't all that
>> bad...
> Oh, I see. You indeed want to call validate() from within until_all_locked().
>
>> /Thomas
>>
>>
>>>>> +
>>>>> +int drm_gpuva_manager_lock_extra(struct drm_gpuva_manager *mgr,
>>>>> +				 int (*fn)(struct drm_gpuva_manager *mgr,
>>>>> +					   void *priv, unsigned int num_fences),
>>>>> +				 void *priv,
>>>>> +				 unsigned int num_fences,
>>>>> +				 bool interruptible);
>>>>> +
>>>>> +int drm_gpuva_manager_lock_array(struct drm_gpuva_manager *mgr,
>>>>> +				 struct drm_gem_object **objs,
>>>>> +				 unsigned int num_objs,
>>>>> +				 unsigned int num_fences,
>>>>> +				 bool interruptible);
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_manager_lock() - lock all dma-resv of all associated BOs
>>>>> + * @mgr: the &drm_gpuva_manager
>>>>> + * @num_fences: the amount of &dma_fences to reserve
>>>>> + * @interruptible: sleep interruptible if waiting
>>>>> + *
>>>>> + * Acquires all dma-resv locks of all &drm_gem_objects the given
>>>>> + * &drm_gpuva_manager contains mappings of.
>>>>> + *
>>>>> + * Returns: 0 on success, negative error code on failure.
>>>>> + */
>>>>> +static inline int
>>>>> +drm_gpuva_manager_lock(struct drm_gpuva_manager *mgr,
>>>>> +		       unsigned int num_fences,
>>>>> +		       bool interruptible)
>>>>> +{
>>>>> +	return drm_gpuva_manager_lock_extra(mgr, NULL, NULL, num_fences,
>>>>> +					    interruptible);
>>>>> +}
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_manager_unlock() - unlock all dma-resv of all associated BOs
>>>>> + * @mgr: the &drm_gpuva_manager
>>>>> + *
>>>>> + * Releases all dma-resv locks of all &drm_gem_objects previously acquired
>>>>> + * through drm_gpuva_manager_lock() or its variants.
>>>>> + */
>>>>> +static inline void
>>>>> +drm_gpuva_manager_unlock(struct drm_gpuva_manager *mgr)
>>>>> +{
>>>>> +	drm_exec_fini(&mgr->exec);
>>>>> +}
>>>>> +
>>>>> +int drm_gpuva_manager_validate(struct drm_gpuva_manager *mgr);
>>>>> +void drm_gpuva_manager_resv_add_fence(struct drm_gpuva_manager *mgr,
>>>>> +				      struct dma_fence *fence,
>>>>> +				      enum dma_resv_usage private_usage,
>>>>> +				      enum dma_resv_usage extobj_usage);
>>>>> +
>>>>> +int drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
>>>>> +			    struct drm_gem_object *obj);
>>>>> +void drm_gpuva_extobj_get(struct drm_gpuva_manager *mgr,
>>>>> +			  struct drm_gem_object *obj);
>>>>> +void drm_gpuva_extobj_put(struct drm_gpuva_manager *mgr,
>>>>> +			  struct drm_gem_object *obj);
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_is_extobj() - indicates whether the given &drm_gem_object is an
>>>>> + * external object
>>>>> + * @mgr: the &drm_gpuva_manager to check
>>>>> + * @obj: the &drm_gem_object to check
>>>>> + *
>>>>> + * Returns: true if the &drm_gem_object &dma_resv differs from the
>>>>> + * &drm_gpuva_managers &dma_resv, false otherwise
>>>>> + */
>>>>> +static inline bool drm_gpuva_is_extobj(struct drm_gpuva_manager *mgr,
>>>>> +				       struct drm_gem_object *obj)
>>>>> +{
>>>>> +	return obj && obj->resv != mgr->resv;
>>>>> +}
>>>>> +
>>>>>     static inline struct drm_gpuva *
>>>>>     __drm_gpuva_next(struct drm_gpuva *va)
>>>>>     {
>>>>> @@ -327,6 +453,138 @@ __drm_gpuva_next(struct drm_gpuva *va)
>>>>>     #define drm_gpuva_for_each_va_safe(va__, next__, mgr__) \
>>>>>     	list_for_each_entry_safe(va__, next__, &(mgr__)->rb.list, rb.entry)
>>>>> +/**
>>>>> + * struct drm_gpuva_gem - structure representing a &drm_gpuva_manager and
>>>>> + * &drm_gem_object combination
>>>>> + *
>>>>> + * This structure is an abstraction representing a &drm_gpuva_manager and
>>>>> + * &drm_gem_object combination. It serves as an indirection to accelerate
>>>>> + * iterating all &drm_gpuvas within a &drm_gpuva_manager backed by the same
>>>>> + * &drm_gem_object.
>>>>> + *
>>>>> + * Furthermore, it is used to cache evicted GEM objects for a certain GPU-VM to
>>>>> + * accelerate validation.
>>>>> + *
>>>>> + * Typically, drivers want to create an instance of a struct drm_gpuva_gem once
>>>>> + * a GEM object is first mapped in a GPU-VM and release the instance once the
>>>>> + * last mapping of the GEM object in this GPU-VM is unmapped.
>>>>> + */
>>>>> +struct drm_gpuva_gem {
>>>>> +
>>>>> +	/**
>>>>> +	 * @mgr: The &drm_gpuva_manager the @obj is mapped in.
>>>>> +	 */
>>>>> +	struct drm_gpuva_manager *mgr;
>>>>> +
>>>>> +	/**
>>>>> +	 * @obj: The &drm_gem_object being mapped in the @mgr.
>>>>> +	 */
>>>>> +	struct drm_gem_object *obj;
>>>>> +
>>>>> +	/**
>>>>> +	 * @kref: The reference count for this &drm_gpuva_gem.
>>>>> +	 */
>>>>> +	struct kref kref;
>>>>> +
>>>>> +	/**
>>>>> +	 * @list: Structure containing all &list_heads.
>>>>> +	 */
>>>>> +	struct {
>>>>> +		/**
>>>>> +		 * @gpuva: The list of linked &drm_gpuvas.
>>>>> +		 */
>>>>> +		struct list_head gpuva;
>>>>> +
>>>>> +		/**
>>>>> +		 * @entry: Structure containing all &list_heads serving as
>>>>> +		 * entry.
>>>>> +		 */
>>>>> +		struct {
>>>>> +			/**
>>>>> +			 * @gem: List entry to attach to the &drm_gem_objects
>>>>> +			 * gpuva list.
>>>>> +			 */
>>>>> +			struct list_head gem;
>>>>> +
>>>>> +			/**
>>>>> +			 * @evict: List entry to attach to the
>>>>> +			 * &drm_gpuva_managers evict list.
>>>>> +			 */
>>>>> +			struct list_head evict;
>>>>> +		} entry;
>>>>> +	} list;
>>>>> +};
>>>>> +
>>>>> +struct drm_gpuva_gem *
>>>>> +drm_gpuva_gem_obtain(struct drm_gpuva_manager *mgr,
>>>>> +		     struct drm_gem_object *obj);
>>>>> +struct drm_gpuva_gem *
>>>>> +drm_gpuva_gem_obtain_prealloc(struct drm_gpuva_manager *mgr,
>>>>> +			      struct drm_gem_object *obj,
>>>>> +			      struct drm_gpuva_gem *__vm_bo);
>>>>> +
>>>>> +struct drm_gpuva_gem *
>>>>> +drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
>>>>> +		   struct drm_gem_object *obj);
>>>>> +
>>>>> +void drm_gpuva_gem_evict(struct drm_gem_object *obj, bool evict);
>>>>> +
>>>>> +struct drm_gpuva_gem *
>>>>> +drm_gpuva_gem_create(struct drm_gpuva_manager *mgr,
>>>>> +		     struct drm_gem_object *obj);
>>>>> +void drm_gpuva_gem_destroy(struct kref *kref);
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_gem_get() - acquire a struct drm_gpuva_gem reference
>>>>> + * @vm_bo: the &drm_gpuva_gem to acquire the reference of
>>>>> + *
>>>>> + * This function acquires an additional reference to @vm_bo. It is illegal to
>>>>> + * call this without already holding a reference. No locks required.
>>>>> + */
>>>>> +static inline struct drm_gpuva_gem *
>>>>> +drm_gpuva_gem_get(struct drm_gpuva_gem *vm_bo)
>>>>> +{
>>>>> +	kref_get(&vm_bo->kref);
>>>>> +	return vm_bo;
>>>>> +}
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_gem_put() - drop a struct drm_gpuva_gem reference
>>>>> + * @vm_bo: the &drm_gpuva_gem to release the reference of
>>>>> + *
>>>>> + * This releases a reference to @vm_bo.
>>>>> + */
>>>>> +static inline void
>>>>> +drm_gpuva_gem_put(struct drm_gpuva_gem *vm_bo)
>>>>> +{
>>>>> +	kref_put(&vm_bo->kref, drm_gpuva_gem_destroy);
>>>>> +}
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_gem_for_each_va() - iterator to walk over a list of &drm_gpuva
>>>>> + * @va__: &drm_gpuva structure to assign to in each iteration step
>>>>> + * @vm_bo__: the &drm_gpuva_gem the &drm_gpuva to walk are associated with
>>>>> + *
>>>>> + * This iterator walks over all &drm_gpuva structures associated with the
>>>>> + * &drm_gpuva_gem.
>>>>> + */
>>>>> +#define drm_gpuva_gem_for_each_va(va__, vm_bo__) \
>>>>> +	list_for_each_entry(va__, &(vm_bo__)->list.gpuva, gem.entry)
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_gem_for_each_va_safe() - iterator to safely walk over a list of
>>>>> + * &drm_gpuva
>>>>> + * @va__: &drm_gpuva structure to assign to in each iteration step
>>>>> + * @next__: &next &drm_gpuva to store the next step
>>>>> + * @vm_bo__: the &drm_gpuva_gem the &drm_gpuva to walk are associated with
>>>>> + *
>>>>> + * This iterator walks over all &drm_gpuva structures associated with the
>>>>> + * &drm_gpuva_gem. It is implemented with list_for_each_entry_safe(), hence
>>>>> + * it is save against removal of elements.
>>>>> + */
>>>>> +#define drm_gpuva_gem_for_each_va_safe(va__, next__, vm_bo__) \
>>>>> +	list_for_each_entry_safe(va__, next__, &(vm_bo__)->list.gpuva, gem.entry)
>>>>> +
>>>>>     /**
>>>>>      * enum drm_gpuva_op_type - GPU VA operation type
>>>>>      *
>>>>> @@ -641,6 +899,30 @@ struct drm_gpuva_fn_ops {
>>>>>     	 */
>>>>>     	void (*op_free)(struct drm_gpuva_op *op);
>>>>> +	/**
>>>>> +	 * @vm_bo_alloc: called when the &drm_gpuva_manager allocates
>>>>> +	 * a struct drm_gpuva_gem
>>>>> +	 *
>>>>> +	 * Some drivers may want to embed struct drm_gpuva_gem into driver
>>>>> +	 * specific structures. By implementing this callback drivers can
>>>>> +	 * allocate memory accordingly.
>>>>> +	 *
>>>>> +	 * This callback is optional.
>>>>> +	 */
>>>>> +	struct drm_gpuva_gem *(*vm_bo_alloc)(void);
>>>>> +
>>>>> +	/**
>>>>> +	 * @vm_bo_free: called when the &drm_gpuva_manager frees a
>>>>> +	 * struct drm_gpuva_gem
>>>>> +	 *
>>>>> +	 * Some drivers may want to embed struct drm_gpuva_gem into driver
>>>>> +	 * specific structures. By implementing this callback drivers can
>>>>> +	 * free the previously allocated memory accordingly.
>>>>> +	 *
>>>>> +	 * This callback is optional.
>>>>> +	 */
>>>>> +	void (*vm_bo_free)(struct drm_gpuva_gem *vm_bo);
>>>>> +
>>>>>     	/**
>>>>>     	 * @sm_step_map: called from &drm_gpuva_sm_map to finally insert the
>>>>>     	 * mapping once all previous steps were completed
>>>>> @@ -684,6 +966,17 @@ struct drm_gpuva_fn_ops {
>>>>>     	 * used.
>>>>>     	 */
>>>>>     	int (*sm_step_unmap)(struct drm_gpuva_op *op, void *priv);
>>>>> +
>>>>> +	/**
>>>>> +	 * @bo_validate: called from drm_gpuva_manager_validate()
>>>>> +	 *
>>>>> +	 * Drivers receive this callback for every evicted &drm_gem_object being
>>>>> +	 * mapped in the corresponding &drm_gpuva_manager.
>>>>> +	 *
>>>>> +	 * Typically, drivers would call their driver specific variant of
>>>>> +	 * ttm_bo_validate() from within this callback.
>>>>> +	 */
>>>>> +	int (*bo_validate)(struct drm_gem_object *obj);
>>>>>     };
>>>>>     int drm_gpuva_sm_map(struct drm_gpuva_manager *mgr, void *priv,
>>>>> @@ -696,11 +989,18 @@ int drm_gpuva_sm_unmap(struct drm_gpuva_manager *mgr, void *priv,
>>>>>     void drm_gpuva_map(struct drm_gpuva_manager *mgr,
>>>>>     		   struct drm_gpuva *va,
>>>>>     		   struct drm_gpuva_op_map *op);
>>>>> +void drm_gpuva_map_get(struct drm_gpuva_manager *mgr,
>>>>> +		       struct drm_gpuva *va,
>>>>> +		       struct drm_gpuva_op_map *op);
>>>>>     void drm_gpuva_remap(struct drm_gpuva *prev,
>>>>>     		     struct drm_gpuva *next,
>>>>>     		     struct drm_gpuva_op_remap *op);
>>>>> +void drm_gpuva_remap_get(struct drm_gpuva *prev,
>>>>> +			 struct drm_gpuva *next,
>>>>> +			 struct drm_gpuva_op_remap *op);
>>>>>     void drm_gpuva_unmap(struct drm_gpuva_op_unmap *op);
>>>>> +void drm_gpuva_unmap_put(struct drm_gpuva_op_unmap *op);
>>>>>     #endif /* __DRM_GPUVA_MGR_H__ */

^ permalink raw reply	[flat|nested] 88+ messages in thread

* Re: [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
@ 2023-08-31  9:04             ` Thomas Hellström (Intel)
  0 siblings, 0 replies; 88+ messages in thread
From: Thomas Hellström (Intel) @ 2023-08-31  9:04 UTC (permalink / raw)
  To: Danilo Krummrich
  Cc: airlied, daniel, matthew.brost, thomas.hellstrom, sarah.walker,
	donald.robson, boris.brezillon, christian.koenig, faith.ekstrand,
	bskeggs, Liam.Howlett, nouveau, linux-kernel, dri-devel

Hi!

On 8/30/23 17:00, Danilo Krummrich wrote:
> On Wed, Aug 30, 2023 at 03:42:08PM +0200, Thomas Hellström (Intel) wrote:
>> On 8/30/23 14:49, Danilo Krummrich wrote:
>>> Hi Thomas,
>>>
>>> thanks for having a look!
>>>
>>> On Wed, Aug 30, 2023 at 09:27:45AM +0200, Thomas Hellström (Intel) wrote:
>>>> Hi, Danilo.
>>>>
>>>> Some quick comments since I'm doing some Xe work in this area. Will probably
>>>> get back with more.
>>>>
>>>> On 8/20/23 23:53, Danilo Krummrich wrote:
>>>>> So far the DRM GPUVA manager offers common infrastructure to track GPU VA
>>>>> allocations and mappings, generically connect GPU VA mappings to their
>>>>> backing buffers and perform more complex mapping operations on the GPU VA
>>>>> space.
>>>>>
>>>>> However, there are more design patterns commonly used by drivers, which
>>>>> can potentially be generalized in order to make the DRM GPUVA manager
>>>>> represent a basic GPU-VM implementation. In this context, this patch aims
>>>>> at generalizing the following elements.
>>>>>
>>>>> 1) Provide a common dma-resv for GEM objects not being used outside of
>>>>>       this GPU-VM.
>>>>>
>>>>> 2) Provide tracking of external GEM objects (GEM objects which are
>>>>>       shared with other GPU-VMs).
>>>>>
>>>>> 3) Provide functions to efficiently lock all GEM objects dma-resv the
>>>>>       GPU-VM contains mappings of.
>>>>>
>>>>> 4) Provide tracking of evicted GEM objects the GPU-VM contains mappings
>>>>>       of, such that validation of evicted GEM objects is accelerated.
>>>>>
>>>>> 5) Provide some convenience functions for common patterns.
>>>>>
>>>>> Rather than being designed as a "framework", the target is to make all
>>>>> features appear as a collection of optional helper functions, such that
>>>>> drivers are free to make use of the DRM GPUVA managers basic
>>>>> functionality and opt-in for other features without setting any feature
>>>>> flags, just by making use of the corresponding functions.
>>>>>
>>>>> Signed-off-by: Danilo Krummrich <dakr@redhat.com>
>>>>> ---
>>>>>     drivers/gpu/drm/drm_gpuva_mgr.c | 688 +++++++++++++++++++++++++++++++-
>>>>>     include/drm/drm_gem.h           |  48 ++-
>>>>>     include/drm/drm_gpuva_mgr.h     | 302 +++++++++++++-
>>>>>     3 files changed, 1010 insertions(+), 28 deletions(-)
>>>>>
>>>>> diff --git a/drivers/gpu/drm/drm_gpuva_mgr.c b/drivers/gpu/drm/drm_gpuva_mgr.c
>>>>> index f86bfad74ff8..69872b205961 100644
>>>>> --- a/drivers/gpu/drm/drm_gpuva_mgr.c
>>>>> +++ b/drivers/gpu/drm/drm_gpuva_mgr.c
>>>>> @@ -655,6 +655,7 @@ drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
>>>>>     /**
>>>>>      * drm_gpuva_manager_init() - initialize a &drm_gpuva_manager
>>>>>      * @mgr: pointer to the &drm_gpuva_manager to initialize
>>>>> + * @drm: the drivers &drm_device
>>>>>      * @name: the name of the GPU VA space
>>>>>      * @start_offset: the start offset of the GPU VA space
>>>>>      * @range: the size of the GPU VA space
>>>>> @@ -669,6 +670,7 @@ drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
>>>>>      */
>>>>>     void
>>>>>     drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
>>>>> +		       struct drm_device *drm,
>>>>>     		       const char *name,
>>>>>     		       u64 start_offset, u64 range,
>>>>>     		       u64 reserve_offset, u64 reserve_range,
>>>>> @@ -677,6 +679,11 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
>>>>>     	mgr->rb.tree = RB_ROOT_CACHED;
>>>>>     	INIT_LIST_HEAD(&mgr->rb.list);
>>>>> +	mt_init(&mgr->mt_ext);
>>>>> +
>>>>> +	INIT_LIST_HEAD(&mgr->evict.list);
>>>>> +	spin_lock_init(&mgr->evict.lock);
>>>>> +
>>>>>     	drm_gpuva_check_overflow(start_offset, range);
>>>>>     	mgr->mm_start = start_offset;
>>>>>     	mgr->mm_range = range;
>>>>> @@ -694,6 +701,9 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
>>>>>     						     reserve_range)))
>>>>>     			__drm_gpuva_insert(mgr, &mgr->kernel_alloc_node);
>>>>>     	}
>>>>> +
>>>>> +	drm_gem_private_object_init(drm, &mgr->d_obj, 0);
>>>>> +	mgr->resv = mgr->d_obj.resv;
>>>>>     }
>>>>>     EXPORT_SYMBOL_GPL(drm_gpuva_manager_init);
>>>>> @@ -713,10 +723,575 @@ drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr)
>>>>>     		__drm_gpuva_remove(&mgr->kernel_alloc_node);
>>>>>     	WARN(!RB_EMPTY_ROOT(&mgr->rb.tree.rb_root),
>>>>> -	     "GPUVA tree is not empty, potentially leaking memory.");
>>>>> +	     "GPUVA tree is not empty, potentially leaking memory.\n");
>>>>> +
>>>>> +	mtree_destroy(&mgr->mt_ext);
>>>>> +	WARN(!list_empty(&mgr->evict.list), "Evict list should be empty.\n");
>>>>> +
>>>>> +	drm_gem_private_object_fini(&mgr->d_obj);
>>>>>     }
>>>>>     EXPORT_SYMBOL_GPL(drm_gpuva_manager_destroy);
>>>>> +/**
>>>>> + * drm_gpuva_manager_prepare_objects() - prepare all associated BOs
>>>>> + * @mgr: the &drm_gpuva_manager
>>>>> + * @num_fences: the amount of &dma_fences to reserve
>>>>> + *
>>>>> + * Calls drm_exec_prepare_obj() for all &drm_gem_objects the given
>>>>> + * &drm_gpuva_manager contains mappings of.
>>>>> + *
>>>>> + * Drivers can obtain the corresponding &drm_exec instance through
>>>>> + * DRM_GPUVA_EXEC(). It is the drivers responsibility to call drm_exec_init()
>>>>> + * and drm_exec_fini() accordingly.
>>>>> + *
>>>>> + * Returns: 0 on success, negative error code on failure.
>>>>> + */
>>>>> +int
>>>>> +drm_gpuva_manager_prepare_objects(struct drm_gpuva_manager *mgr,
>>>>> +				  unsigned int num_fences)
>>>>> +{
>>>>> +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
>>>>> +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
>>>>> +	union {
>>>>> +		void *ptr;
>>>>> +		uintptr_t cnt;
>>>>> +	} ref;
>>>>> +	int ret;
>>>>> +
>>>>> +	ret = drm_exec_prepare_obj(exec, &mgr->d_obj, num_fences);
>>>>> +	if (ret)
>>>>> +		goto out;
>>>>> +
>>>>> +	rcu_read_lock();
>>>> In xe we're protecting the external object list with an outer lock (same as
>>>> protecting the mgr itself). Do we need a separate lock for this? In theory,
>>>> as outlined in the VM_BIND locking document draft, one could probably even
>>>> use the mgr resv for this, but with more complicated code I guess. Also see
>>>> the comment below about the data structure chosen.
>>> The idea is to protect this list with the GPU-VM lock. The locking here is more
>>> of an implication of the maple tree. Either you use the internal lock of the
>>> maple tree or RCU respectively, or you give the maple tree an external lock to
>>> perform lockdep checks on (mt_set_external_lock()). Basically same as here:
>>>
>>> https://elixir.bootlin.com/linux/latest/source/drivers/base/regmap/regcache-maple.c#L124
>> Ah, I suspected it was something along those lines.
>>
>>
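
FWIW, if the outer GPU-VM lock ends up being what protects this, the
lockdep-annotated setup would be something along the lines of (sketch;
gpuvm_lock is a stand-in for whatever outer lock the driver uses):

	mt_init_flags(&mgr->mt_ext, MT_FLAGS_LOCK_EXTERN);
	mt_set_external_lock(&mgr->mt_ext, &gpuvm_lock);

with all modifications then done under that lock instead of the internal
spinlock / RCU.
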
>>>>> +	mas_for_each(&mas, ref.ptr, ULONG_MAX) {
>>>>> +		struct drm_gem_object *obj;
>>>>> +
>>>>> +		mas_pause(&mas);
>>>>> +		rcu_read_unlock();
>>>>> +
>>>>> +		obj = (struct drm_gem_object *)(uintptr_t)mas.index;
>>>>> +		ret = drm_exec_prepare_obj(exec, obj, num_fences);
>>>>> +		if (ret)
>>>>> +			goto out;
>>>>> +
>>>>> +		rcu_read_lock();
>>>>> +	}
>>>>> +	rcu_read_unlock();
>>>>> +
>>>>> +out:
>>>>> +	return ret;
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_prepare_objects);
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_manager_lock_extra() - lock all dma-resv of all associated BOs
>>>>> + * @mgr: the &drm_gpuva_manager
>>>>> + * @fn: callback received by the driver to lock additional dma-resv
>>>>> + * @priv: private driver data passed to @fn
>>>>> + * @num_fences: the amount of &dma_fences to reserve
>>>>> + * @interruptible: sleep interruptible if waiting
>>>>> + *
>>>>> + * Acquires all dma-resv locks of all &drm_gem_objects the given
>>>>> + * &drm_gpuva_manager contains mappings of.
>>>>> + *
>>>>> + * Additionally, when calling this function the driver receives the given @fn
>>>>> + * callback to lock additional dma-resv in the context of the
>>>>> + * &drm_gpuva_managers &drm_exec instance. Typically, drivers would call
>>>>> + * drm_exec_prepare_obj() from within this callback.
>>>>> + *
>>>>> + * Returns: 0 on success, negative error code on failure.
>>>>> + */
>>>>> +int
>>>>> +drm_gpuva_manager_lock_extra(struct drm_gpuva_manager *mgr,
>>>>> +			     int (*fn)(struct drm_gpuva_manager *mgr,
>>>>> +				       void *priv, unsigned int num_fences),
>>>>> +			     void *priv,
>>>>> +			     unsigned int num_fences,
>>>>> +			     bool interruptible)
>>>>> +{
>>>>> +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
>>>>> +	uint32_t flags;
>>>>> +	int ret;
>>>>> +
>>>>> +	flags = (interruptible ? DRM_EXEC_INTERRUPTIBLE_WAIT : 0) |
>>>>> +		DRM_EXEC_IGNORE_DUPLICATES;
>>>>> +
>>>>> +	drm_exec_init(exec, flags);
>>>>> +
>>>>> +	drm_exec_until_all_locked(exec) {
>>>>> +		ret = drm_gpuva_manager_prepare_objects(mgr, num_fences);
>>>>> +		drm_exec_retry_on_contention(exec);
>>>>> +		if (ret)
>>>>> +			goto err;
>>>>> +
>>>>> +		if (fn) {
>>>>> +			ret = fn(mgr, priv, num_fences);
>>>>> +			drm_exec_retry_on_contention(exec);
>>>>> +			if (ret)
>>>>> +				goto err;
>>>>> +		}
>>>>> +	}
>>>>> +
>>>>> +	return 0;
>>>>> +
>>>>> +err:
>>>>> +	drm_exec_fini(exec);
>>>>> +	return ret;
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_lock_extra);
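
Side note, mostly to check my understanding of the intended usage: a driver
submission path would use this roughly like below, right? (sketch; my_job,
my_prepare_job() and the extra object are made up)

	static int my_prepare_job(struct drm_gpuva_manager *mgr, void *priv,
				  unsigned int num_fences)
	{
		struct my_job *job = priv;

		/* lock additional, job-local objects in the same drm_exec loop */
		return drm_exec_prepare_obj(DRM_GPUVA_EXEC(mgr), job->extra_obj,
					    num_fences);
	}

	/* in the submission path: */
	ret = drm_gpuva_manager_lock_extra(mgr, my_prepare_job, job, 1, true);
	if (ret)
		return ret;

	/* ... validate, push the job to the hardware, add fences ... */

	drm_gpuva_manager_unlock(mgr);
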
>>>>> +
>>>>> +static int
>>>>> +fn_lock_array(struct drm_gpuva_manager *mgr, void *priv,
>>>>> +				unsigned int num_fences)
>>>>> +{
>>>>> +	struct {
>>>>> +		struct drm_gem_object **objs;
>>>>> +		unsigned int num_objs;
>>>>> +	} *args = priv;
>>>>> +
>>>>> +	return drm_exec_prepare_array(DRM_GPUVA_EXEC(mgr), args->objs,
>>>>> +				      args->num_objs, num_fences);
>>>>> +}
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_manager_lock_array() - lock all dma-resv of all associated BOs
>>>>> + * @mgr: the &drm_gpuva_manager
>>>>> + * @objs: additional &drm_gem_objects to lock
>>>>> + * @num_objs: the number of additional &drm_gem_objects to lock
>>>>> + * @num_fences: the amount of &dma_fences to reserve
>>>>> + * @interruptible: sleep interruptible if waiting
>>>>> + *
>>>>> + * Acquires all dma-resv locks of all &drm_gem_objects the given
>>>>> + * &drm_gpuva_manager contains mappings of, plus the ones given through @objs.
>>>>> + *
>>>>> + * Returns: 0 on success, negative error code on failure.
>>>>> + */
>>>>> +int
>>>>> +drm_gpuva_manager_lock_array(struct drm_gpuva_manager *mgr,
>>>>> +			     struct drm_gem_object **objs,
>>>>> +			     unsigned int num_objs,
>>>>> +			     unsigned int num_fences,
>>>>> +			     bool interruptible)
>>>>> +{
>>>>> +	struct {
>>>>> +		struct drm_gem_object **objs;
>>>>> +		unsigned int num_objs;
>>>>> +	} args;
>>>>> +
>>>>> +	args.objs = objs;
>>>>> +	args.num_objs = num_objs;
>>>>> +
>>>>> +	return drm_gpuva_manager_lock_extra(mgr, fn_lock_array, &args,
>>>>> +					    num_fences, interruptible);
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_lock_array);
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_manager_validate() - validate all BOs marked as evicted
>>>>> + * @mgr: the &drm_gpuva_manager to validate evicted BOs
>>>>> + *
>>>>> + * Calls the &drm_gpuva_fn_ops.bo_validate callback for all evicted buffer
>>>>> + * objects being mapped in the given &drm_gpuva_manager.
>>>>> + *
>>>>> + * Returns: 0 on success, negative error code on failure.
>>>>> + */
>>>>> +int
>>>>> +drm_gpuva_manager_validate(struct drm_gpuva_manager *mgr)
>>>>> +{
>>>>> +	const struct drm_gpuva_fn_ops *ops = mgr->ops;
>>>>> +	struct drm_gpuva_gem *vm_bo;
>>>>> +	int ret;
>>>>> +
>>>>> +	if (unlikely(!ops || !ops->bo_validate))
>>>>> +		return -ENOTSUPP;
>>>>> +
>>>>> +	/* At this point we should hold all dma-resv locks of all GEM objects
>>>>> +	 * associated with this GPU-VM, hence it is safe to walk the list.
>>>>> +	 */
>>>>> +	list_for_each_entry(vm_bo, &mgr->evict.list, list.entry.evict) {
>>>>> +		dma_resv_assert_held(vm_bo->obj->resv);
>>>>> +
>>>>> +		ret = ops->bo_validate(vm_bo->obj);
>>>>> +		if (ret)
>>>>> +			return ret;
>>>>> +	}
>>>>> +
>>>>> +	return 0;
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_validate);
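
For a TTM-based driver I'd expect the bo_validate callback to be a thin
wrapper, roughly like this (sketch; my_get_placement() is a stand-in for the
driver's placement choice):

	static int my_bo_validate(struct drm_gem_object *obj)
	{
		struct ttm_buffer_object *bo =
			container_of(obj, struct ttm_buffer_object, base);
		struct ttm_operation_ctx ctx = { .interruptible = true };

		return ttm_bo_validate(bo, my_get_placement(bo), &ctx);
	}
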
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_manager_resv_add_fence() - add fence to private and all extobj
>>>>> + * dma-resv
>>>>> + * @mgr: the &drm_gpuva_manager to add a fence to
>>>>> + * @fence: fence to add
>>>>> + * @private_usage: private dma-resv usage
>>>>> + * @extobj_usage: extobj dma-resv usage
>>>>> + */
>>>>> +void
>>>>> +drm_gpuva_manager_resv_add_fence(struct drm_gpuva_manager *mgr,
>>>>> +				 struct dma_fence *fence,
>>>>> +				 enum dma_resv_usage private_usage,
>>>>> +				 enum dma_resv_usage extobj_usage)
>>>>> +{
>>>>> +	struct drm_exec *exec = DRM_GPUVA_EXEC(mgr);
>>>>> +	struct drm_gem_object *obj;
>>>>> +	unsigned long index;
>>>>> +
>>>>> +	drm_exec_for_each_locked_object(exec, index, obj) {
>>>>> +		dma_resv_assert_held(obj->resv);
>>>>> +		dma_resv_add_fence(obj->resv, fence,
>>>>> +				   drm_gpuva_is_extobj(mgr, obj) ?
>>>>> +				   extobj_usage : private_usage);
>>>>> +	}
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_manager_resv_add_fence);
>>>>> +
>>>>> +static struct drm_gpuva_gem *
>>>>> +__drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
>>>>> +		     struct drm_gem_object *obj)
>>>>> +{
>>>>> +	struct drm_gpuva_gem *vm_bo;
>>>>> +
>>>>> +	drm_gem_gpuva_assert_lock_held(obj);
>>>>> +
>>>>> +	drm_gem_for_each_gpuva_gem(vm_bo, obj)
>>>>> +		if (vm_bo->mgr == mgr)
>>>>> +			return vm_bo;
>>>>> +
>>>>> +	return NULL;
>>>>> +}
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_gem_create() - create a new instance of struct drm_gpuva_gem
>>>>> + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
>>>>> + * @obj: The &drm_gem_object being mapped in the @mgr.
>>>>> + *
>>>>> + * If provided by the driver, this function uses the &drm_gpuva_fn_ops
>>>>> + * vm_bo_alloc() callback to allocate.
>>>>> + *
>>>>> + * Returns: a pointer to the &drm_gpuva_gem on success, NULL on failure
>>>>> + */
>>>>> +struct drm_gpuva_gem *
>>>>> +drm_gpuva_gem_create(struct drm_gpuva_manager *mgr,
>>>>> +		     struct drm_gem_object *obj)
>>>>> +{
>>>>> +	const struct drm_gpuva_fn_ops *ops = mgr->ops;
>>>>> +	struct drm_gpuva_gem *vm_bo;
>>>>> +
>>>>> +	if (ops && ops->vm_bo_alloc)
>>>>> +		vm_bo = ops->vm_bo_alloc();
>>>>> +	else
>>>>> +		vm_bo = kzalloc(sizeof(*vm_bo), GFP_KERNEL);
>>>>> +
>>>>> +	if (unlikely(!vm_bo))
>>>>> +		return NULL;
>>>>> +
>>>>> +	vm_bo->mgr = mgr;
>>>>> +	vm_bo->obj = obj;
>>>>> +
>>>>> +	kref_init(&vm_bo->kref);
>>>>> +	INIT_LIST_HEAD(&vm_bo->list.gpuva);
>>>>> +	INIT_LIST_HEAD(&vm_bo->list.entry.gem);
>>>>> +	INIT_LIST_HEAD(&vm_bo->list.entry.evict);
>>>>> +
>>>>> +	drm_gem_object_get(obj);
>>>>> +
>>>>> +	return vm_bo;
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_create);
>>>>> +
>>>>> +void
>>>>> +drm_gpuva_gem_destroy(struct kref *kref)
>>>>> +{
>>>>> +	struct drm_gpuva_gem *vm_bo = container_of(kref, struct drm_gpuva_gem,
>>>>> +						   kref);
>>>>> +	const struct drm_gpuva_fn_ops *ops = vm_bo->mgr->ops;
>>>>> +
>>>>> +	drm_gem_object_put(vm_bo->obj);
>>>>> +
>>>>> +	if (ops && ops->vm_bo_free)
>>>>> +		ops->vm_bo_free(vm_bo);
>>>>> +	else
>>>>> +		kfree(vm_bo);
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_destroy);
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_gem_find() - find the &drm_gpuva_gem for the given
>>>>> + * &drm_gpuva_manager and &drm_gem_object
>>>>> + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
>>>>> + * @obj: The &drm_gem_object being mapped in the @mgr.
>>>>> + *
>>>>> + * Find the &drm_gpuva_gem representing the combination of the given
>>>>> + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
>>>>> + * count of the &drm_gpuva_gem accordingly.
>>>>> + *
>>>>> + * Returns: a pointer to the &drm_gpuva_gem on success, NULL on failure
>>>>> + */
>>>>> +struct drm_gpuva_gem *
>>>>> +drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
>>>>> +		   struct drm_gem_object *obj)
>>>>> +{
>>>>> +	struct drm_gpuva_gem *vm_bo = __drm_gpuva_gem_find(mgr, obj);
>>>>> +
>>>>> +	return vm_bo ? drm_gpuva_gem_get(vm_bo) : NULL;
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_find);
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_gem_obtain() - obtains an instance of the &drm_gpuva_gem for the
>>>>> + * given &drm_gpuva_manager and &drm_gem_object
>>>>> + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
>>>>> + * @obj: The &drm_gem_object being mapped in the @mgr.
>>>>> + *
>>>>> + * Find the &drm_gpuva_gem representing the combination of the given
>>>>> + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
>>>>> + * count of the &drm_gpuva_gem accordingly. If not found, allocates a new
>>>>> + * &drm_gpuva_gem.
>>>>> + *
>>>>> + * Returns: a pointer to the &drm_gpuva_gem on success, an ERR_PTR on failure
>>>>> + */
>>>>> +struct drm_gpuva_gem *
>>>>> +drm_gpuva_gem_obtain(struct drm_gpuva_manager *mgr,
>>>>> +		     struct drm_gem_object *obj)
>>>>> +{
>>>>> +	struct drm_gpuva_gem *vm_bo;
>>>>> +
>>>>> +	vm_bo = drm_gpuva_gem_find(mgr, obj);
>>>>> +	if (vm_bo)
>>>>> +		return vm_bo;
>>>>> +
>>>>> +	vm_bo = drm_gpuva_gem_create(mgr, obj);
>>>>> +	if (!vm_bo)
>>>>> +		return ERR_PTR(-ENOMEM);
>>>>> +
>>>>> +	return vm_bo;
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_obtain);
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_gem_obtain_prealloc() - obtains an instance of the &drm_gpuva_gem
>>>>> + * for the given &drm_gpuva_manager and &drm_gem_object
>>>>> + * @mgr: The &drm_gpuva_manager the @obj is mapped in.
>>>>> + * @obj: The &drm_gem_object being mapped in the @mgr.
>>>>> + *
>>>>> + * Find the &drm_gpuva_gem representing the combination of the given
>>>>> + * &drm_gpuva_manager and &drm_gem_object. If found, increases the reference
>>>>> + * count of the found &drm_gpuva_gem accordingly, while the @__vm_bo reference
>>>>> + * count is decreased. If not found @__vm_bo is returned.
>>>>> + *
>>>>> + * Returns: a pointer to the found &drm_gpuva_gem or @__vm_bo if no existing
>>>>> + * &drm_gpuva_gem was found
>>>>> + */
>>>>> +struct drm_gpuva_gem *
>>>>> +drm_gpuva_gem_obtain_prealloc(struct drm_gpuva_manager *mgr,
>>>>> +			      struct drm_gem_object *obj,
>>>>> +			      struct drm_gpuva_gem *__vm_bo)
>>>>> +{
>>>>> +	struct drm_gpuva_gem *vm_bo;
>>>>> +
>>>>> +	vm_bo = drm_gpuva_gem_find(mgr, obj);
>>>>> +	if (vm_bo) {
>>>>> +		drm_gpuva_gem_put(__vm_bo);
>>>>> +		return vm_bo;
>>>>> +	}
>>>>> +
>>>>> +	return __vm_bo;
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_obtain_prealloc);
>>>>> +
>>>>> +static int
>>>>> +__drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
>>>>> +			  struct drm_gem_object *obj,
>>>>> +			  gfp_t gfp)
>>>>> +{
>>>>> +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
>>>>> +	union {
>>>>> +		struct drm_gem_object *obj;
>>>>> +		uintptr_t index;
>>>>> +	} gem;
>>>>> +	union {
>>>>> +		void *ptr;
>>>>> +		uintptr_t cnt;
>>>>> +	} ref;
>>>>> +	int ret = 0;
>>>>> +
>>>>> +	gem.obj = obj;
>>>>> +	mas_set(&mas, gem.index);
>>>>> +
>>>>> +	mas_lock(&mas);
>>>>> +	ref.ptr = mas_walk(&mas);
>>>>> +	if (ref.ptr) {
>>>>> +		++ref.cnt;
>>>>> +		mas_store(&mas, ref.ptr);
>>>>> +	} else {
>>>>> +		if (unlikely(!gfp)) {
>>>>> +			ret = -EINVAL;
>>>>> +			goto out;
>>>>> +		}
>>>>> +
>>>>> +		mas_set(&mas, gem.index);
>>>>> +		ref.cnt = 1;
>>>>> +		ret = mas_store_gfp(&mas, ref.ptr, gfp);
>>>>> +		if (likely(!ret))
>>>>> +			drm_gem_object_get(obj);
>>>>> +	}
>>>>> +out:
>>>>> +	mas_unlock(&mas);
>>>>> +	return ret;
>>>>> +}
>>>>> +
>>>>> +static void
>>>>> +__drm_gpuva_extobj_remove(struct drm_gpuva_manager *mgr,
>>>>> +			  struct drm_gem_object *obj)
>>>>> +{
>>>>> +	MA_STATE(mas, &mgr->mt_ext, 0, 0);
>>>>> +	union {
>>>>> +		struct drm_gem_object *obj;
>>>>> +		uintptr_t index;
>>>>> +	} gem;
>>>>> +	union {
>>>>> +		void *ptr;
>>>>> +		uintptr_t cnt;
>>>>> +	} ref;
>>>>> +
>>>>> +	gem.obj = obj;
>>>>> +	mas_set(&mas, gem.index);
>>>>> +
>>>>> +	mas_lock(&mas);
>>>>> +	if (unlikely(!(ref.ptr = mas_walk(&mas))))
>>>>> +		goto out;
>>>>> +
>>>>> +	if (!--ref.cnt) {
>>>>> +		mas_erase(&mas);
>>>>> +		drm_gem_object_put(obj);
>>>>> +	} else {
>>>>> +		mas_store(&mas, ref.ptr);
>>>>> +	}
>>>>> +out:
>>>>> +	mas_unlock(&mas);
>>>>> +}
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_extobj_insert() - insert an external &drm_gem_object
>>>>> + * @mgr: the &drm_gpuva_manager to insert into
>>>>> + * @obj: the &drm_gem_object to insert as extobj
>>>>> + *
>>>>> + * Insert a &drm_gem_object into the &drm_gpuva_managers external object tree.
>>>>> + * If the &drm_gem_object already exists in the tree, the reference counter
>>>>> + * of this external object is increased by one.
>>>>> + *
>>>>> + * Drivers should insert the external &drm_gem_object before the dma-fence
>>>>> + * signalling critical section, e.g. when submitting the job, and before
>>>>> + * locking all &drm_gem_objects of a GPU-VM, e.g. with drm_gpuva_manager_lock()
>>>>> + * or its variants.
>>>>> + *
>>>>> + * Returns: 0 on success, negative error code on failure.
>>>>> + */
>>>>> +int
>>>>> +drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
>>>>> +			struct drm_gem_object *obj)
>>>>> +{
>>>>> +	return drm_gpuva_is_extobj(mgr, obj) ?
>>>>> +		__drm_gpuva_extobj_insert(mgr, obj, GFP_KERNEL) : 0;
>>>>> +
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_insert);
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_extobj_get() - increase the reference count of an external
>>>>> + * &drm_gem_object
>>>>> + * @mgr: the &drm_gpuva_manager storing the extobj
>>>>> + * @obj: the &drm_gem_object representing the extobj
>>>>> + *
>>>>> + * Increases the reference count of the extobj represented by @obj.
>>>>> + *
>>>>> + * Drivers should call this for every &drm_gpuva backed by a &drm_gem_object
>>>>> + * being inserted.
>>>>> + *
>>>>> + * For &drm_gpuva_op_remap operations drivers should make sure to only take an
>>>>> + * additional reference if the re-map operation splits an existing &drm_gpuva
>>>>> + * into two separate ones.
>>>>> + *
>>>>> + * See also drm_gpuva_map_get() and drm_gpuva_remap_get().
>>>>> + */
>>>>> +void
>>>>> +drm_gpuva_extobj_get(struct drm_gpuva_manager *mgr,
>>>>> +		     struct drm_gem_object *obj)
>>>>> +{
>>>>> +	if (drm_gpuva_is_extobj(mgr, obj))
>>>>> +		WARN(__drm_gpuva_extobj_insert(mgr, obj, 0),
>>>>> +		     "Can't increase ref-count of non-existent extobj.");
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_get);
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_extobj_put() - decrease the reference count of an external
>>>>> + * &drm_gem_object
>>>>> + * @mgr: the &drm_gpuva_manager storing the extobj
>>>>> + * @obj: the &drm_gem_object representing the extobj
>>>>> + *
>>>>> + * Decreases the reference count of the extobj represented by @obj.
>>>>> + *
>>>>> + * Drivers should call this for every &drm_gpuva backed by a &drm_gem_object
>>>>> + * being removed from the GPU VA space.
>>>>> + *
>>>>> + * See also drm_gpuva_unmap_put().
>>>>> + */
>>>>> +void
>>>>> +drm_gpuva_extobj_put(struct drm_gpuva_manager *mgr,
>>>>> +		     struct drm_gem_object *obj)
>>>>> +{
>>>>> +	if (drm_gpuva_is_extobj(mgr, obj))
>>>>> +		__drm_gpuva_extobj_remove(mgr, obj);
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_extobj_put);
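
As a reader aid, this is where I'd expect these to be called from a driver,
matching the kerneldoc above (sketch):

	/* before the dma-fence signalling critical section, e.g. at job submission */
	ret = drm_gpuva_extobj_insert(mgr, obj);

	/* from the driver's &drm_gpuva_fn_ops.sm_step_map callback */
	drm_gpuva_map_get(mgr, va, &op->map);

	/* from the driver's &drm_gpuva_fn_ops.sm_step_unmap callback */
	drm_gpuva_unmap_put(&op->unmap);
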
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_gem_evict() - add / remove a &drm_gem_object to / from a
>>>>> + * &drm_gpuva_managers evicted list
>>>>> + * @obj: the &drm_gem_object to add or remove
>>>>> + * @evict: indicates whether the object is evicted
>>>>> + *
>>>>> + * Adds a &drm_gem_object to or removes it from the evict list of all
>>>>> + * &drm_gpuva_managers containing a mapping of this &drm_gem_object.
>>>>> + */
>>>>> +void
>>>>> +drm_gpuva_gem_evict(struct drm_gem_object *obj, bool evict)
>>>>> +{
>>>>> +	struct drm_gpuva_gem *vm_bo;
>>>>> +
>>>>> +	/* Required for iterating the GEMs GPUVA GEM list. If no driver specific
>>>>> +	 * lock has been set, the list is protected with the GEMs dma-resv lock.
>>>>> +	 */
>>>>> +	drm_gem_gpuva_assert_lock_held(obj);
>>>>> +
>>>>> +	/* Required to protect the GPUVA managers evict list against concurrent
>>>>> +	 * access through drm_gpuva_manager_validate(). Concurrent insertions to
>>>>> +	 * the evict list through different GEM object evictions are protected
>>>>> +	 * by the GPUVA managers evict lock.
>>>>> +	 */
>>>>> +	dma_resv_assert_held(obj->resv);
>>>>> +
>>>>> +	drm_gem_for_each_gpuva_gem(vm_bo, obj) {
>>>>> +		struct drm_gpuva_manager *mgr = vm_bo->mgr;
>>>>> +
>>>>> +		spin_lock(&mgr->evict.lock);
>>>>> +		if (evict)
>>>>> +			list_add_tail(&vm_bo->list.entry.evict,
>>>>> +				      &mgr->evict.list);
>>>>> +		else
>>>>> +			list_del_init(&vm_bo->list.entry.evict);
>>>>> +		spin_unlock(&mgr->evict.lock);
>>>>> +	}
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_gem_evict);
>>>>> +
>>>>>     static int
>>>>>     __drm_gpuva_insert(struct drm_gpuva_manager *mgr,
>>>>>     		   struct drm_gpuva *va)
>>>>> @@ -806,15 +1381,20 @@ EXPORT_SYMBOL_GPL(drm_gpuva_remove);
>>>>>     /**
>>>>>      * drm_gpuva_link() - link a &drm_gpuva
>>>>>      * @va: the &drm_gpuva to link
>>>>> + * @vm_bo: the &drm_gpuva_gem to add the &drm_gpuva to
>>>>>      *
>>>>> - * This adds the given &va to the GPU VA list of the &drm_gem_object it is
>>>>> - * associated with.
>>>>> + * This adds the given &va to the GPU VA list of the &drm_gpuva_gem and the
>>>>> + * &drm_gpuva_gem to the &drm_gem_object it is associated with.
>>>>> + *
>>>>> + * For every &drm_gpuva entry added to the &drm_gpuva_gem an additional
>>>>> + * reference of the latter is taken.
>>>>>      *
>>>>>      * This function expects the caller to protect the GEM's GPUVA list against
>>>>> - * concurrent access using the GEMs dma_resv lock.
>>>>> + * concurrent access using either the GEMs dma_resv lock or a driver specific
>>>>> + * lock set through drm_gem_gpuva_set_lock().
>>>>>      */
>>>>>     void
>>>>> -drm_gpuva_link(struct drm_gpuva *va)
>>>>> +drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo)
>>>>>     {
>>>>>     	struct drm_gem_object *obj = va->gem.obj;
>>>>> @@ -823,7 +1403,10 @@ drm_gpuva_link(struct drm_gpuva *va)
>>>>>     	drm_gem_gpuva_assert_lock_held(obj);
>>>>> -	list_add_tail(&va->gem.entry, &obj->gpuva.list);
>>>>> +	drm_gpuva_gem_get(vm_bo);
>>>>> +	list_add_tail(&va->gem.entry, &vm_bo->list.gpuva);
>>>>> +	if (list_empty(&vm_bo->list.entry.gem))
>>>>> +		list_add_tail(&vm_bo->list.entry.gem, &obj->gpuva.list);
>>>>>     }
>>>>>     EXPORT_SYMBOL_GPL(drm_gpuva_link);
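
Related to the reference counting: if I read the lifetime rules right, the
typical driver pattern around linking would then be something like (sketch):

	vm_bo = drm_gpuva_gem_obtain(mgr, obj);
	if (IS_ERR(vm_bo))
		return PTR_ERR(vm_bo);

	drm_gpuva_link(va, vm_bo);

	/* drm_gpuva_link() took its own reference, drop the obtain reference */
	drm_gpuva_gem_put(vm_bo);
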
>>>>> @@ -834,20 +1417,39 @@ EXPORT_SYMBOL_GPL(drm_gpuva_link);
>>>>>      * This removes the given &va from the GPU VA list of the &drm_gem_object it is
>>>>>      * associated with.
>>>>>      *
>>>>> + * This removes the given &va from the GPU VA list of the &drm_gpuva_gem and
>>>>> + * the &drm_gpuva_gem from the &drm_gem_object it is associated with in case
>>>>> + * this call unlinks the last &drm_gpuva from the &drm_gpuva_gem.
>>>>> + *
>>>>> + * For every &drm_gpuva entry removed from the &drm_gpuva_gem a reference of
>>>>> + * the latter is dropped.
>>>>> + *
>>>>>      * This function expects the caller to protect the GEM's GPUVA list against
>>>>> - * concurrent access using the GEMs dma_resv lock.
>>>>> + * concurrent access using either the GEMs dma_resv lock or a driver specific
>>>>> + * lock set through drm_gem_gpuva_set_lock().
>>>>>      */
>>>>>     void
>>>>>     drm_gpuva_unlink(struct drm_gpuva *va)
>>>>>     {
>>>>>     	struct drm_gem_object *obj = va->gem.obj;
>>>>> +	struct drm_gpuva_gem *vm_bo;
>>>>>     	if (unlikely(!obj))
>>>>>     		return;
>>>>>     	drm_gem_gpuva_assert_lock_held(obj);
>>>>> +	vm_bo = __drm_gpuva_gem_find(va->mgr, obj);
>>>>> +	if (WARN(!vm_bo, "GPUVA doesn't seem to be linked.\n"))
>>>>> +		return;
>>>>> +
>>>>>     	list_del_init(&va->gem.entry);
>>>>> +
>>>>> +	if (list_empty(&vm_bo->list.gpuva)) {
>>>>> +		list_del_init(&vm_bo->list.entry.gem);
>>>>> +		list_del_init(&vm_bo->list.entry.evict);
>>>>> +	}
>>>>> +	drm_gpuva_gem_put(vm_bo);
>>>>>     }
>>>>>     EXPORT_SYMBOL_GPL(drm_gpuva_unlink);
>>>>> @@ -977,6 +1579,26 @@ drm_gpuva_map(struct drm_gpuva_manager *mgr,
>>>>>     }
>>>>>     EXPORT_SYMBOL_GPL(drm_gpuva_map);
>>>>> +/**
>>>>> + * drm_gpuva_map_get() - helper to insert a &drm_gpuva according to a
>>>>> + * &drm_gpuva_op_map
>>>>> + * @mgr: the &drm_gpuva_manager
>>>>> + * @va: the &drm_gpuva to insert
>>>>> + * @op: the &drm_gpuva_op_map to initialize @va with
>>>>> + *
>>>>> + * Initializes the @va from the @op and inserts it into the given @mgr and
>>>>> + * increases the reference count of the corresponding extobj.
>>>>> + */
>>>>> +void
>>>>> +drm_gpuva_map_get(struct drm_gpuva_manager *mgr,
>>>>> +		  struct drm_gpuva *va,
>>>>> +		  struct drm_gpuva_op_map *op)
>>>>> +{
>>>>> +	drm_gpuva_map(mgr, va, op);
>>>>> +	drm_gpuva_extobj_get(mgr, va->gem.obj);
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_map_get);
>>>>> +
>>>>>     /**
>>>>>      * drm_gpuva_remap() - helper to remap a &drm_gpuva according to a
>>>>>      * &drm_gpuva_op_remap
>>>>> @@ -992,10 +1614,10 @@ drm_gpuva_remap(struct drm_gpuva *prev,
>>>>>     		struct drm_gpuva *next,
>>>>>     		struct drm_gpuva_op_remap *op)
>>>>>     {
>>>>> -	struct drm_gpuva *curr = op->unmap->va;
>>>>> -	struct drm_gpuva_manager *mgr = curr->mgr;
>>>>> +	struct drm_gpuva *va = op->unmap->va;
>>>>> +	struct drm_gpuva_manager *mgr = va->mgr;
>>>>> -	drm_gpuva_remove(curr);
>>>>> +	drm_gpuva_remove(va);
>>>>>     	if (op->prev) {
>>>>>     		drm_gpuva_init_from_op(prev, op->prev);
>>>>> @@ -1009,6 +1631,31 @@ drm_gpuva_remap(struct drm_gpuva *prev,
>>>>>     }
>>>>>     EXPORT_SYMBOL_GPL(drm_gpuva_remap);
>>>>> +/**
>>>>> + * drm_gpuva_remap_get() - helper to remap a &drm_gpuva according to a
>>>>> + * &drm_gpuva_op_remap
>>>>> + * @prev: the &drm_gpuva to remap when keeping the start of a mapping
>>>>> + * @next: the &drm_gpuva to remap when keeping the end of a mapping
>>>>> + * @op: the &drm_gpuva_op_remap to initialize @prev and @next with
>>>>> + *
>>>>> + * Removes the currently mapped &drm_gpuva and remaps it using @prev and/or
>>>>> + * @next. Additionally, if the re-map splits the existing &drm_gpuva into two
>>>>> + * separate mappings, increases the reference count of the corresponding extobj.
>>>>> + */
>>>>> +void
>>>>> +drm_gpuva_remap_get(struct drm_gpuva *prev,
>>>>> +		    struct drm_gpuva *next,
>>>>> +		    struct drm_gpuva_op_remap *op)
>>>>> +{
>>>>> +	struct drm_gpuva *va = op->unmap->va;
>>>>> +	struct drm_gpuva_manager *mgr = va->mgr;
>>>>> +
>>>>> +	drm_gpuva_remap(prev, next, op);
>>>>> +	if (op->prev && op->next)
>>>>> +		drm_gpuva_extobj_get(mgr, va->gem.obj);
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_remap_get);
>>>>> +
>>>>>     /**
>>>>>      * drm_gpuva_unmap() - helper to remove a &drm_gpuva according to a
>>>>>      * &drm_gpuva_op_unmap
>>>>> @@ -1023,6 +1670,24 @@ drm_gpuva_unmap(struct drm_gpuva_op_unmap *op)
>>>>>     }
>>>>>     EXPORT_SYMBOL_GPL(drm_gpuva_unmap);
>>>>> +/**
>>>>> + * drm_gpuva_unmap_put() - helper to remove a &drm_gpuva according to a
>>>>> + * &drm_gpuva_op_unmap
>>>>> + * @op: the &drm_gpuva_op_unmap specifying the &drm_gpuva to remove
>>>>> + *
>>>>> + * Removes the &drm_gpuva associated with the &drm_gpuva_op_unmap and decreases
>>>>> + * the reference count of the corresponding extobj.
>>>>> + */
>>>>> +void
>>>>> +drm_gpuva_unmap_put(struct drm_gpuva_op_unmap *op)
>>>>> +{
>>>>> +	struct drm_gpuva *va = op->va;
>>>>> +
>>>>> +	drm_gpuva_unmap(op);
>>>>> +	drm_gpuva_extobj_put(va->mgr, va->gem.obj);
>>>>> +}
>>>>> +EXPORT_SYMBOL_GPL(drm_gpuva_unmap_put);
>>>>> +
>>>>>     static int
>>>>>     op_map_cb(const struct drm_gpuva_fn_ops *fn, void *priv,
>>>>>     	  u64 addr, u64 range,
>>>>> @@ -1663,6 +2328,7 @@ drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
>>>>>     {
>>>>>     	struct drm_gpuva_ops *ops;
>>>>>     	struct drm_gpuva_op *op;
>>>>> +	struct drm_gpuva_gem *vm_bo;
>>>>>     	struct drm_gpuva *va;
>>>>>     	int ret;
>>>>> @@ -1674,7 +2340,7 @@ drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
>>>>>     	INIT_LIST_HEAD(&ops->list);
>>>>> -	drm_gem_for_each_gpuva(va, obj) {
>>>>> +	drm_gem_for_each_gpuva(va, vm_bo, mgr, obj) {
>>>>>     		op = gpuva_op_alloc(mgr);
>>>>>     		if (!op) {
>>>>>     			ret = -ENOMEM;
>>>>> diff --git a/include/drm/drm_gem.h b/include/drm/drm_gem.h
>>>>> index bc9f6aa2f3fe..783ed3ab440d 100644
>>>>> --- a/include/drm/drm_gem.h
>>>>> +++ b/include/drm/drm_gem.h
>>>>> @@ -571,7 +571,7 @@ int drm_gem_evict(struct drm_gem_object *obj);
>>>>>      * drm_gem_gpuva_init() - initialize the gpuva list of a GEM object
>>>>>      * @obj: the &drm_gem_object
>>>>>      *
>>>>> - * This initializes the &drm_gem_object's &drm_gpuva list.
>>>>> + * This initializes the &drm_gem_object's &drm_gpuva_gem list.
>>>>>      *
>>>>>      * Calling this function is only necessary for drivers intending to support the
>>>>>      * &drm_driver_feature DRIVER_GEM_GPUVA.
>>>>> @@ -584,28 +584,44 @@ static inline void drm_gem_gpuva_init(struct drm_gem_object *obj)
>>>>>     }
>>>>>     /**
>>>>> - * drm_gem_for_each_gpuva() - iternator to walk over a list of gpuvas
>>>>> - * @entry__: &drm_gpuva structure to assign to in each iteration step
>>>>> - * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
>>>>> + * drm_gem_for_each_gpuva_gem() - iterator to walk over a list of &drm_gpuva_gem
>>>>> + * @entry__: &drm_gpuva_gem structure to assign to in each iteration step
>>>>> + * @obj__: the &drm_gem_object the &drm_gpuva_gem to walk are associated with
>>>>>      *
>>>>> - * This iterator walks over all &drm_gpuva structures associated with the
>>>>> - * &drm_gpuva_manager.
>>>>> + * This iterator walks over all &drm_gpuva_gem structures associated with the
>>>>> + * &drm_gem_object.
>>>>>      */
>>>>> -#define drm_gem_for_each_gpuva(entry__, obj__) \
>>>>> -	list_for_each_entry(entry__, &(obj__)->gpuva.list, gem.entry)
>>>>> +#define drm_gem_for_each_gpuva_gem(entry__, obj__) \
>>>>> +	list_for_each_entry(entry__, &(obj__)->gpuva.list, list.entry.gem)
>>>>>     /**
>>>>> - * drm_gem_for_each_gpuva_safe() - iternator to safely walk over a list of
>>>>> - * gpuvas
>>>>> - * @entry__: &drm_gpuva structure to assign to in each iteration step
>>>>> - * @next__: &next &drm_gpuva to store the next step
>>>>> - * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
>>>>> + * drm_gem_for_each_gpuva_gem_safe() - iterator to safely walk over a list of
>>>>> + * &drm_gpuva_gem
>>>>> + * @entry__: &drm_gpuva_gem structure to assign to in each iteration step
>>>>> + * @next__: &next &drm_gpuva_gem to store the next step
>>>>> + * @obj__: the &drm_gem_object the &drm_gpuva_gem to walk are associated with
>>>>>      *
>>>>> - * This iterator walks over all &drm_gpuva structures associated with the
>>>>> + * This iterator walks over all &drm_gpuva_gem structures associated with the
>>>>>      * &drm_gem_object. It is implemented with list_for_each_entry_safe(), hence
>>>>>      * it is save against removal of elements.
>>>>>      */
>>>>> -#define drm_gem_for_each_gpuva_safe(entry__, next__, obj__) \
>>>>> -	list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list, gem.entry)
>>>>> +#define drm_gem_for_each_gpuva_gem_safe(entry__, next__, obj__) \
>>>>> +	list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list, list.entry.gem)
>>>>> +
>>>>> +/**
>>>>> + * drm_gem_for_each_gpuva() - iterator to walk over a list of &drm_gpuva
>>>>> + * @va__: &drm_gpuva structure to assign to in each iteration step
>>>>> + * @vm_bo__: the &drm_gpuva_gem representing the @mgr__ and @obj__ combination
>>>>> + * @mgr__: the &drm_gpuva_manager the &drm_gpuvas to walk are associated with
>>>>> + * @obj__: the &drm_gem_object the &drm_gpuvas to walk are associated with
>>>>> + *
>>>>> + * This iterator walks over all &drm_gpuva structures associated with the
>>>>> + * &drm_gpuva_manager and &drm_gem_object.
>>>>> + */
>>>>> +#define drm_gem_for_each_gpuva(va__, vm_bo__, mgr__, obj__) \
>>>>> +	for (vm_bo__ = drm_gpuva_gem_find(mgr__, obj__), \
>>>>> +	     va__ = vm_bo__ ? list_first_entry(&vm_bo__->list.gpuva, typeof(*va__), gem.entry) : NULL; \
>>>>> +	     va__ && !list_entry_is_head(va__, &vm_bo__->list.gpuva, gem.entry); \
>>>>> +	     va__ = list_next_entry(va__, gem.entry))
>>>>>     #endif /* __DRM_GEM_H__ */
>>>>> diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h
>>>>> index ed8d50200cc3..693e2da3f425 100644
>>>>> --- a/include/drm/drm_gpuva_mgr.h
>>>>> +++ b/include/drm/drm_gpuva_mgr.h
>>>>> @@ -26,12 +26,16 @@
>>>>>      */
>>>>>     #include <linux/list.h>
>>>>> +#include <linux/dma-resv.h>
>>>>> +#include <linux/maple_tree.h>
>>>>>     #include <linux/rbtree.h>
>>>>>     #include <linux/types.h>
>>>>>     #include <drm/drm_gem.h>
>>>>> +#include <drm/drm_exec.h>
>>>>>     struct drm_gpuva_manager;
>>>>> +struct drm_gpuva_gem;
>>>>>     struct drm_gpuva_fn_ops;
>>>>>     /**
>>>>> @@ -140,7 +144,7 @@ struct drm_gpuva {
>>>>>     int drm_gpuva_insert(struct drm_gpuva_manager *mgr, struct drm_gpuva *va);
>>>>>     void drm_gpuva_remove(struct drm_gpuva *va);
>>>>> -void drm_gpuva_link(struct drm_gpuva *va);
>>>>> +void drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo);
>>>>>     void drm_gpuva_unlink(struct drm_gpuva *va);
>>>>>     struct drm_gpuva *drm_gpuva_find(struct drm_gpuva_manager *mgr,
>>>>> @@ -240,15 +244,137 @@ struct drm_gpuva_manager {
>>>>>     	 * @ops: &drm_gpuva_fn_ops providing the split/merge steps to drivers
>>>>>     	 */
>>>>>     	const struct drm_gpuva_fn_ops *ops;
>>>>> +
>>>>> +	/**
>>>>> +	 * @d_obj: Dummy GEM object; used internally to pass the GPU VMs
>>>>> +	 * dma-resv to &drm_exec.
>>>>> +	 */
>>>>> +	struct drm_gem_object d_obj;
>>>>> +
>>>>> +	/**
>>>>> +	 * @resv: the &dma_resv for &drm_gem_objects mapped in this GPU VA
>>>>> +	 * space
>>>>> +	 */
>>>>> +	struct dma_resv *resv;
>>>>> +
>>>>> +	/**
>>>>> +	 * @exec: the &drm_exec helper to lock external &drm_gem_objects
>>>>> +	 */
>>>>> +	struct drm_exec exec;
>>>>> +
>>>>> +	/**
>>>>> +	 * @mt_ext: &maple_tree storing external &drm_gem_objects
>>>>> +	 */
>>>>> +	struct maple_tree mt_ext;
>>>> Why are you using a maple tree here? Insertion and removal is O(log(n))
>>>> instead of O(1) for a list?
>>>>
>>> Having a list of drm_gem_objects directly wouldn't work, as multiple GPU-VMs
>>> could have mappings of the same extobj.
>>>
>>> I considered using the VM_BO abstraction (struct drm_gpuva_gem) as list entry
>>> instead, which also seems to be the obvious choice. However, there is a locking
>>> conflict.
>>>
>>> A drm_gem_object keeps a list of drm_gpuva_gems, while each drm_gpuva_gem keeps
>>> a list of drm_gpuvas. Both lists are either protected with the dma-resv lock of
>>> the corresponding drm_gem_object, or with an external lock provided by the
>>> driver (see drm_gem_gpuva_set_lock()). The latter is used by drivers performing
>>> changes on the GPUVA space directly from the fence signalling path.
>>>
>>> Now, similar to what drm_gpuva_link() and drm_gpuva_unlink() are doing already,
>>> we'd want to add a drm_gpuva_gem to the extobj list for the first mapping being
>>> linked and we'd want to remove it for the last one being unlinked.
>>>
>>> (Actually we'd want to add the drm_gpuva_gem object to the extobj list even
>>> before, because otherwise we'd not acquire the dma-resv lock of this GEM object
>>> through drm_gpuva_manager_lock(). But that's trivial, we could do that when we
>>> create the drm_gpuva_gem, which we need to do anyways.)
>>>
>>> Anyway, we'd probably want to keep removing the drm_gpuva_gem from the extobj
>>> list from drm_gpuva_unlink() when the last mapping of this BO is unlinked. In
>>> order to do so, we'd (as discussed above) either need to hold the outer GPU-VM
>>> lock or the GPU-VMs dma-resv lock. Both would be illegal in the case
>>> drm_gpuva_unlink() is called from within the fence signalling path. For drivers
>>> like XE or Nouveau, we'd at least need to make sure to not mess up the locking
>>> hierarchy of GPU-VM lock and dma-resv lock of the corresponding BO.
>>>
>>> Considering all that, I thought it's probably better to track extobjs separate
>>> from the drm_gpuva_gem, hence the maple tree choice.
>> Hm. OK, in Xe we're having a list of the xe_vmas (drm_gpuvas) that point to
>> external objects, or in the case of multiple mappings to the same gem
>> object, only one of the drm_gpuvas is in the list. These are protected by
>> the GPU-VM lock. I don't see a problem with removing those from the fence
>> signalling path, though?
> I intentionally tried to avoid keeping a list of drm_gpuvas to track extobjs,
> since this is generic code I don't know how many mappings of an external object
> the corresponding driver potentially creates. This could become a pretty large
> list to iterate. Another reason was that I want to keep the drm_gpuva structure
> as small as possible, hence avoiding another list_head.

Yes, the list might be pretty large, but OTOH you never iterate to 
access a single list element. When you need to iterate the whole list 
you need to do that regardless of the data structure used. As for the 
list head, it might perhaps be aliased (union) with an upcoming userptr 
list head?
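
To make the aliasing idea a bit more concrete, a rough sketch could look like
the below. The union and member names are made up for illustration; they are
neither part of this series nor of Xe.

struct drm_gpuva {
	/* ... existing members ... */

	union {
		/* entry on the GPU-VM's list of external object mappings */
		struct list_head extobj;
		/* entry on a future userptr list, reusing the same storage */
		struct list_head userptr;
	} vm_entry;
};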

>
> Now, it sounds like in XE you're doing some kind of optimization just keeping a
> single mapping of an extobj in the list? How do you know when to remove it? What
> if the mapping from the extobj list gets unmapped, but there is still another
> one left in the GPU-VM being backed by the same BO?
When removing from the lists, we iterate through the object's list of 
vmas, and if there is one matching the same vm, we replace the old one 
with the new one. A similar iteration is done when adding to avoid 
adding one that is already on the list.
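
A rough sketch of what such a replace-on-add scheme could look like is below.
All names (xe_vm, xe_vma, vma_list, obj_link, extobj_link) are placeholders
picked for illustration, not the actual Xe implementation.

static void vm_extobj_add(struct xe_vm *vm, struct xe_vma *vma)
{
	struct xe_vma *tmp;

	lockdep_assert_held(&vm->lock);

	/* Don't add a second representative for the same (vm, obj) pair. */
	list_for_each_entry(tmp, &vma->obj->vma_list, obj_link)
		if (tmp->vm == vm && !list_empty(&tmp->extobj_link))
			return;

	list_add_tail(&vma->extobj_link, &vm->extobj_list);
}

static void vm_extobj_del(struct xe_vm *vm, struct xe_vma *vma)
{
	struct xe_vma *tmp;

	lockdep_assert_held(&vm->lock);

	if (list_empty(&vma->extobj_link))
		return;

	list_del_init(&vma->extobj_link);

	/* Hand the extobj list entry over to another mapping of the same
	 * object in this VM, if one is left.
	 */
	list_for_each_entry(tmp, &vma->obj->vma_list, obj_link)
		if (tmp != vma && tmp->vm == vm) {
			list_add_tail(&tmp->extobj_link, &vm->extobj_list);
			break;
		}
}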
> Although assuming that's a no-go for GPUVA wouldn't an XArray be a better
> choice, keeping O(1)?
> When tracking extobjs, the address of the drm_gem_object is the key while the
> reference count is the value. I was thinking of an XArray as well, but I was
> worried that the corresponding indices could be too widely distributed for an
> XArray to still be efficient. Now that I think about it, it's probably not that
> bad.
>
> Btw., while I agree with trying to make things as efficient as possible, what is the
> magnitude of extobjs to be tracked; do we need to worry about the O(log(n))?

Not sure yet, TBH, but I think one of our UMDs can only use external 
objects, because they don't know at creation time which ones need 
exporting. However if this turns out to be too bad, there are various 
flavours of "clever but complicated" optimizations that we could think 
of to reduce the list size. Still in our case, we opted for the vma list 
head for now.

/Thomas


>
>>>>> +
>>>>> +	/**
>>>>> +	 * @evict: structure holding the evict list and evict list lock
>>>>> +	 */
>>>>> +	struct {
>>>>> +		/**
>>>>> +		 * @list: &list_head storing &drm_gem_objects currently being
>>>>> +		 * evicted
>>>>> +		 */
>>>>> +		struct list_head list;
>>>>> +
>>>>> +		/**
>>>>> +		 * @lock: spinlock to protect the evict list against concurrent
>>>>> +		 * insertion / removal of different &drm_gpuva_gems
>>>>> +		 */
>>>>> +		spinlock_t lock;
>>>>> +	} evict;
>>>>>     };
>>>>>     void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
>>>>> +			    struct drm_device *drm,
>>>>>     			    const char *name,
>>>>>     			    u64 start_offset, u64 range,
>>>>>     			    u64 reserve_offset, u64 reserve_range,
>>>>>     			    const struct drm_gpuva_fn_ops *ops);
>>>>>     void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr);
>>>>> +/**
>>>>> + * DRM_GPUVA_EXEC - returns the &drm_gpuva_managers &drm_exec instance
>>>>> + * @mgr: the &drm_gpuva_managers to return the &drm_exec instance for
>>>>> + */
>>>>> +#define DRM_GPUVA_EXEC(mgr)	&(mgr)->exec
>>>> A struct ww_acquire_ctx and thus a drm_exec is fundamentally per task and
>>>> should typically be allocated on the stack. Otherwise you'd need to protect
>>>> the mgr->exec member with an exclusive lock throughout the locking process,
>>>> and that's not what we want.
>>> Oh, good point. I think it works in Nouveau, because there it's implicitly
>>> protected with the job submission lock.
>>>
>>>> Did you consider subclassing a drm_exec for drm_gpuva purposes and add
>>>> needed ops to it: Like so:
>>> That's a good idea, will take this into V2.
>> Actually, I'm not fully sure that was a good idea: I now have a working
>> version of Xe ported over to drm_exec, having these helpers in mind and with
>> the intention to start using them as they mature. What I found, though is
>> that open-coding the drm_exec loop is not all that bad, but that building
>> blocks that can be called from within the loop are useful:
>>
>> Like the drm_gpuva_prepare_objects() and an imaginary
>> drm_gpuva_prepare_gpuva() that locks the vm resv and the resv of the object
>> (if different and the gpuva points to the object). And
>> drm_gpuva_prepare_array() although we don't use it within Xe. That means you
>> can use these building blocks like helpers and avoid the fn() callback by
>> instead open-coding.
>>
>> But I guess YMMV.
> That's exactly why those building blocks are exported, I already had in mind
> that there might be drivers which still want to open-code the drm_exec loop,
> while others might just want a simple interface to lock everything.
>
> I still think it is a good idea, but I'd keep that as simple as possible. And
> for everything else just let the driver open-code it and use the "building
> blocks" - will also expand the bulding blocks to what you mentioned above.
>
>>>> struct drm_gpuva_exec_ops {
>>>>       int (*fn) (struct drm_gpuva_exec *exec, int num_fences);
>>> Is this the fn argument from drm_gpuva_manager_lock_extra()?
>>>
>>>>       int (*bo_validate) (struct drm_gpuva_exec *exec, struct drm_gem_object
>>>> *obj);
>>> I guess we could also keep that within the drm_gpuva_fn_ops? This should always
>>> be the same callback, right?
>>>
>>>> };
>>>>
>>>> struct drm_gpuva_exec {
>>>>       const struct drm_gpuva_exec_ops *ops;
>>>>       struct drm_exec exec;
>>>>       struct drm_gpuva_manager *mgr;
>>>> };
>>>>
>>>> Although I'd actually expect bo_validate to be part of fn in the typical
>>>> case. The drm_gpuva_exec would then be allocated by the caller on the stack.
>>> This doesn't sound like my assumption about fn() above is correct.
>> Well one important thing in our conversion is that ttm_bo_validate () needs
>> to be in the until_all_locked() loop. We want to be able soon to use
>> sleeping locks for eviction, so a xe_bo_validate() would, at least
>> temporarily, add locked objects to the drm_exec list of locked objects. That
>> means everything that may end up calling validate deep within the call chain
>> needs to be part of the until_all_locked() loop, so our
>> drm_gpuva_manager_lock_extra() fn callback would include those validates and
>> look different all the time. Hence that's why open-coding isn't all that
>> bad...
> Oh, I see. You indeed want to call validate() from within until_all_locked().
>
>> /Thomas
>>
>>
>>>>> +
>>>>> +int drm_gpuva_manager_lock_extra(struct drm_gpuva_manager *mgr,
>>>>> +				 int (*fn)(struct drm_gpuva_manager *mgr,
>>>>> +					   void *priv, unsigned int num_fences),
>>>>> +				 void *priv,
>>>>> +				 unsigned int num_fences,
>>>>> +				 bool interruptible);
>>>>> +
>>>>> +int drm_gpuva_manager_lock_array(struct drm_gpuva_manager *mgr,
>>>>> +				 struct drm_gem_object **objs,
>>>>> +				 unsigned int num_objs,
>>>>> +				 unsigned int num_fences,
>>>>> +				 bool interruptible);
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_manager_lock() - lock all dma-resv of all associated BOs
>>>>> + * @mgr: the &drm_gpuva_manager
>>>>> + * @num_fences: the amount of &dma_fences to reserve
>>>>> + * @interruptible: sleep interruptible if waiting
>>>>> + *
>>>>> + * Acquires all dma-resv locks of all &drm_gem_objects the given
>>>>> + * &drm_gpuva_manager contains mappings of.
>>>>> + *
>>>>> + * Returns: 0 on success, negative error code on failure.
>>>>> + */
>>>>> +static inline int
>>>>> +drm_gpuva_manager_lock(struct drm_gpuva_manager *mgr,
>>>>> +		       unsigned int num_fences,
>>>>> +		       bool interruptible)
>>>>> +{
>>>>> +	return drm_gpuva_manager_lock_extra(mgr, NULL, NULL, num_fences,
>>>>> +					    interruptible);
>>>>> +}
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_manager_unlock() - unlock all dma-resv of all associated BOs
>>>>> + * @mgr: the &drm_gpuva_manager
>>>>> + *
>>>>> + * Releases all dma-resv locks of all &drm_gem_objects previously acquired
>>>>> + * through drm_gpuva_manager_lock() or its variants.
>>>>> + */
>>>>> +static inline void
>>>>> +drm_gpuva_manager_unlock(struct drm_gpuva_manager *mgr)
>>>>> +{
>>>>> +	drm_exec_fini(&mgr->exec);
>>>>> +}
>>>>> +
>>>>> +int drm_gpuva_manager_validate(struct drm_gpuva_manager *mgr);
>>>>> +void drm_gpuva_manager_resv_add_fence(struct drm_gpuva_manager *mgr,
>>>>> +				      struct dma_fence *fence,
>>>>> +				      enum dma_resv_usage private_usage,
>>>>> +				      enum dma_resv_usage extobj_usage);
>>>>> +
>>>>> +int drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
>>>>> +			    struct drm_gem_object *obj);
>>>>> +void drm_gpuva_extobj_get(struct drm_gpuva_manager *mgr,
>>>>> +			  struct drm_gem_object *obj);
>>>>> +void drm_gpuva_extobj_put(struct drm_gpuva_manager *mgr,
>>>>> +			  struct drm_gem_object *obj);
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_is_extobj() - indicates whether the given &drm_gem_object is an
>>>>> + * external object
>>>>> + * @mgr: the &drm_gpuva_manager to check
>>>>> + * @obj: the &drm_gem_object to check
>>>>> + *
>>>>> + * Returns: true if the &drm_gem_object &dma_resv differs from the
>>>>> + * &drm_gpuva_managers &dma_resv, false otherwise
>>>>> + */
>>>>> +static inline bool drm_gpuva_is_extobj(struct drm_gpuva_manager *mgr,
>>>>> +				       struct drm_gem_object *obj)
>>>>> +{
>>>>> +	return obj && obj->resv != mgr->resv;
>>>>> +}
>>>>> +
>>>>>     static inline struct drm_gpuva *
>>>>>     __drm_gpuva_next(struct drm_gpuva *va)
>>>>>     {
>>>>> @@ -327,6 +453,138 @@ __drm_gpuva_next(struct drm_gpuva *va)
>>>>>     #define drm_gpuva_for_each_va_safe(va__, next__, mgr__) \
>>>>>     	list_for_each_entry_safe(va__, next__, &(mgr__)->rb.list, rb.entry)
>>>>> +/**
>>>>> + * struct drm_gpuva_gem - structure representing a &drm_gpuva_manager and
>>>>> + * &drm_gem_object combination
>>>>> + *
>>>>> + * This structure is an abstraction representing a &drm_gpuva_manager and
>>>>> + * &drm_gem_object combination. It serves as an indirection to accelerate
>>>>> + * iterating all &drm_gpuvas within a &drm_gpuva_manager backed by the same
>>>>> + * &drm_gem_object.
>>>>> + *
>>>>> + * Furthermore, it is used to cache evicted GEM objects for a certain GPU-VM to
>>>>> + * accelerate validation.
>>>>> + *
>>>>> + * Typically, drivers want to create an instance of a struct drm_gpuva_gem once
>>>>> + * a GEM object is mapped first in a GPU-VM and release the instance once the
>>>>> + * last mapping of the GEM object in this GPU-VM is unmapped.
>>>>> + */
>>>>> +struct drm_gpuva_gem {
>>>>> +
>>>>> +	/**
>>>>> +	 * @mgr: The &drm_gpuva_manager the @obj is mapped in.
>>>>> +	 */
>>>>> +	struct drm_gpuva_manager *mgr;
>>>>> +
>>>>> +	/**
>>>>> +	 * @obj: The &drm_gem_object being mapped in the @mgr.
>>>>> +	 */
>>>>> +	struct drm_gem_object *obj;
>>>>> +
>>>>> +	/**
>>>>> +	 * @kref: The reference count for this &drm_gpuva_gem.
>>>>> +	 */
>>>>> +	struct kref kref;
>>>>> +
>>>>> +	/**
>>>>> +	 * @list: Structure containing all &list_heads.
>>>>> +	 */
>>>>> +	struct {
>>>>> +		/**
>>>>> +		 * @gpuva: The list of linked &drm_gpuvas.
>>>>> +		 */
>>>>> +		struct list_head gpuva;
>>>>> +
>>>>> +		/**
>>>>> +		 * @entry: Structure containing all &list_heads serving as
>>>>> +		 * entry.
>>>>> +		 */
>>>>> +		struct {
>>>>> +			/**
>>>>> +			 * @gem: List entry to attach to the &drm_gem_objects
>>>>> +			 * gpuva list.
>>>>> +			 */
>>>>> +			struct list_head gem;
>>>>> +
>>>>> +			/**
>>>>> +			 * @evict: List entry to attach to the
>>>>> +			 * &drm_gpuva_managers evict list.
>>>>> +			 */
>>>>> +			struct list_head evict;
>>>>> +		} entry;
>>>>> +	} list;
>>>>> +};
>>>>> +
>>>>> +struct drm_gpuva_gem *
>>>>> +drm_gpuva_gem_obtain(struct drm_gpuva_manager *mgr,
>>>>> +		     struct drm_gem_object *obj);
>>>>> +struct drm_gpuva_gem *
>>>>> +drm_gpuva_gem_obtain_prealloc(struct drm_gpuva_manager *mgr,
>>>>> +			      struct drm_gem_object *obj,
>>>>> +			      struct drm_gpuva_gem *__vm_bo);
>>>>> +
>>>>> +struct drm_gpuva_gem *
>>>>> +drm_gpuva_gem_find(struct drm_gpuva_manager *mgr,
>>>>> +		   struct drm_gem_object *obj);
>>>>> +
>>>>> +void drm_gpuva_gem_evict(struct drm_gem_object *obj, bool evict);
>>>>> +
>>>>> +struct drm_gpuva_gem *
>>>>> +drm_gpuva_gem_create(struct drm_gpuva_manager *mgr,
>>>>> +		     struct drm_gem_object *obj);
>>>>> +void drm_gpuva_gem_destroy(struct kref *kref);
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_gem_get() - acquire a struct drm_gpuva_gem reference
>>>>> + * @vm_bo: the &drm_gpuva_gem to acquire the reference of
>>>>> + *
>>>>> + * This function acquires an additional reference to @vm_bo. It is illegal to
>>>>> + * call this without already holding a reference. No locks required.
>>>>> + */
>>>>> +static inline struct drm_gpuva_gem *
>>>>> +drm_gpuva_gem_get(struct drm_gpuva_gem *vm_bo)
>>>>> +{
>>>>> +	kref_get(&vm_bo->kref);
>>>>> +	return vm_bo;
>>>>> +}
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_gem_put() - drop a struct drm_gpuva_gem reference
>>>>> + * @vm_bo: the &drm_gpuva_gem to release the reference of
>>>>> + *
>>>>> + * This releases a reference to @vm_bo.
>>>>> + */
>>>>> +static inline void
>>>>> +drm_gpuva_gem_put(struct drm_gpuva_gem *vm_bo)
>>>>> +{
>>>>> +	kref_put(&vm_bo->kref, drm_gpuva_gem_destroy);
>>>>> +}
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_gem_for_each_va() - iterator to walk over a list of &drm_gpuva
>>>>> + * @va__: &drm_gpuva structure to assign to in each iteration step
>>>>> + * @vm_bo__: the &drm_gpuva_gem the &drm_gpuva to walk are associated with
>>>>> + *
>>>>> + * This iterator walks over all &drm_gpuva structures associated with the
>>>>> + * &drm_gpuva_gem.
>>>>> + */
>>>>> +#define drm_gpuva_gem_for_each_va(va__, vm_bo__) \
>>>>> +	list_for_each_entry(va__, &(vm_bo__)->list.gpuva, gem.entry)
>>>>> +
>>>>> +/**
>>>>> + * drm_gpuva_gem_for_each_va_safe() - iterator to safely walk over a list of
>>>>> + * &drm_gpuva
>>>>> + * @va__: &drm_gpuva structure to assign to in each iteration step
>>>>> + * @next__: &next &drm_gpuva to store the next step
>>>>> + * @vm_bo__: the &drm_gpuva_gem the &drm_gpuva to walk are associated with
>>>>> + *
>>>>> + * This iterator walks over all &drm_gpuva structures associated with the
>>>>> + * &drm_gpuva_gem. It is implemented with list_for_each_entry_safe(), hence
>>>>> + * it is safe against removal of elements.
>>>>> + */
>>>>> +#define drm_gpuva_gem_for_each_va_safe(va__, next__, vm_bo__) \
>>>>> +	list_for_each_entry_safe(va__, next__, &(vm_bo__)->list.gpuva, gem.entry)
>>>>> +
>>>>>     /**
>>>>>      * enum drm_gpuva_op_type - GPU VA operation type
>>>>>      *
>>>>> @@ -641,6 +899,30 @@ struct drm_gpuva_fn_ops {
>>>>>     	 */
>>>>>     	void (*op_free)(struct drm_gpuva_op *op);
>>>>> +	/**
>>>>> +	 * @vm_bo_alloc: called when the &drm_gpuva_manager allocates
>>>>> +	 * a struct drm_gpuva_gem
>>>>> +	 *
>>>>> +	 * Some drivers may want to embed struct drm_gpuva_gem into driver
>>>>> +	 * specific structures. By implementing this callback drivers can
>>>>> +	 * allocate memory accordingly.
>>>>> +	 *
>>>>> +	 * This callback is optional.
>>>>> +	 */
>>>>> +	struct drm_gpuva_gem *(*vm_bo_alloc)(void);
>>>>> +
>>>>> +	/**
>>>>> +	 * @vm_bo_free: called when the &drm_gpuva_manager frees a
>>>>> +	 * struct drm_gpuva_gem
>>>>> +	 *
>>>>> +	 * Some drivers may want to embed struct drm_gpuva_gem into driver
>>>>> +	 * specific structures. By implementing this callback drivers can
>>>>> +	 * free the previously allocated memory accordingly.
>>>>> +	 *
>>>>> +	 * This callback is optional.
>>>>> +	 */
>>>>> +	void (*vm_bo_free)(struct drm_gpuva_gem *vm_bo);
>>>>> +
>>>>>     	/**
>>>>>     	 * @sm_step_map: called from &drm_gpuva_sm_map to finally insert the
>>>>>     	 * mapping once all previous steps were completed
>>>>> @@ -684,6 +966,17 @@ struct drm_gpuva_fn_ops {
>>>>>     	 * used.
>>>>>     	 */
>>>>>     	int (*sm_step_unmap)(struct drm_gpuva_op *op, void *priv);
>>>>> +
>>>>> +	/**
>>>>> +	 * @bo_validate: called from drm_gpuva_manager_validate()
>>>>> +	 *
>>>>> +	 * Drivers receive this callback for every evicted &drm_gem_object being
>>>>> +	 * mapped in the corresponding &drm_gpuva_manager.
>>>>> +	 *
>>>>> +	 * Typically, drivers would call their driver specific variant of
>>>>> +	 * ttm_bo_validate() from within this callback.
>>>>> +	 */
>>>>> +	int (*bo_validate)(struct drm_gem_object *obj);
>>>>>     };
>>>>>     int drm_gpuva_sm_map(struct drm_gpuva_manager *mgr, void *priv,
>>>>> @@ -696,11 +989,18 @@ int drm_gpuva_sm_unmap(struct drm_gpuva_manager *mgr, void *priv,
>>>>>     void drm_gpuva_map(struct drm_gpuva_manager *mgr,
>>>>>     		   struct drm_gpuva *va,
>>>>>     		   struct drm_gpuva_op_map *op);
>>>>> +void drm_gpuva_map_get(struct drm_gpuva_manager *mgr,
>>>>> +		       struct drm_gpuva *va,
>>>>> +		       struct drm_gpuva_op_map *op);
>>>>>     void drm_gpuva_remap(struct drm_gpuva *prev,
>>>>>     		     struct drm_gpuva *next,
>>>>>     		     struct drm_gpuva_op_remap *op);
>>>>> +void drm_gpuva_remap_get(struct drm_gpuva *prev,
>>>>> +			 struct drm_gpuva *next,
>>>>> +			 struct drm_gpuva_op_remap *op);
>>>>>     void drm_gpuva_unmap(struct drm_gpuva_op_unmap *op);
>>>>> +void drm_gpuva_unmap_put(struct drm_gpuva_op_unmap *op);
>>>>>     #endif /* __DRM_GPUVA_MGR_H__ */

^ permalink raw reply	[flat|nested] 88+ messages in thread

* Re: [Nouveau] [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
  2023-08-31  9:04             ` Thomas Hellström (Intel)
  (?)
@ 2023-08-31 11:18               ` Danilo Krummrich
  -1 siblings, 0 replies; 88+ messages in thread
From: Danilo Krummrich @ 2023-08-31 11:18 UTC (permalink / raw)
  To: Thomas Hellström (Intel)
  Cc: matthew.brost, thomas.hellstrom, sarah.walker, nouveau,
	dri-devel, linux-kernel, Liam.Howlett, boris.brezillon,
	donald.robson, daniel, christian.koenig, faith.ekstrand, bskeggs

On Thu, Aug 31, 2023 at 11:04:06AM +0200, Thomas Hellström (Intel) wrote:
> Hi!
> 
> On 8/30/23 17:00, Danilo Krummrich wrote:
> > On Wed, Aug 30, 2023 at 03:42:08PM +0200, Thomas Hellström (Intel) wrote:
> > > On 8/30/23 14:49, Danilo Krummrich wrote:
> > > > Hi Thomas,
> > > > 
> > > > thanks for having a look!
> > > > 
> > > > On Wed, Aug 30, 2023 at 09:27:45AM +0200, Thomas Hellström (Intel) wrote:
> > > > > Hi, Danilo.
> > > > > 
> > > > > Some quick comments since I'm doing some Xe work in this area. Will probably
> > > > > get back with more.
> > > > > 
> > > > > On 8/20/23 23:53, Danilo Krummrich wrote:

<snip>

> > > > > > diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h
> > > > > > index ed8d50200cc3..693e2da3f425 100644
> > > > > > --- a/include/drm/drm_gpuva_mgr.h
> > > > > > +++ b/include/drm/drm_gpuva_mgr.h
> > > > > > @@ -26,12 +26,16 @@
> > > > > >      */
> > > > > >     #include <linux/list.h>
> > > > > > +#include <linux/dma-resv.h>
> > > > > > +#include <linux/maple_tree.h>
> > > > > >     #include <linux/rbtree.h>
> > > > > >     #include <linux/types.h>
> > > > > >     #include <drm/drm_gem.h>
> > > > > > +#include <drm/drm_exec.h>
> > > > > >     struct drm_gpuva_manager;
> > > > > > +struct drm_gpuva_gem;
> > > > > >     struct drm_gpuva_fn_ops;
> > > > > >     /**
> > > > > > @@ -140,7 +144,7 @@ struct drm_gpuva {
> > > > > >     int drm_gpuva_insert(struct drm_gpuva_manager *mgr, struct drm_gpuva *va);
> > > > > >     void drm_gpuva_remove(struct drm_gpuva *va);
> > > > > > -void drm_gpuva_link(struct drm_gpuva *va);
> > > > > > +void drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo);
> > > > > >     void drm_gpuva_unlink(struct drm_gpuva *va);
> > > > > >     struct drm_gpuva *drm_gpuva_find(struct drm_gpuva_manager *mgr,
> > > > > > @@ -240,15 +244,137 @@ struct drm_gpuva_manager {
> > > > > >     	 * @ops: &drm_gpuva_fn_ops providing the split/merge steps to drivers
> > > > > >     	 */
> > > > > >     	const struct drm_gpuva_fn_ops *ops;
> > > > > > +
> > > > > > +	/**
> > > > > > +	 * @d_obj: Dummy GEM object; used internally to pass the GPU VMs
> > > > > > +	 * dma-resv to &drm_exec.
> > > > > > +	 */
> > > > > > +	struct drm_gem_object d_obj;
> > > > > > +
> > > > > > +	/**
> > > > > > +	 * @resv: the &dma_resv for &drm_gem_objects mapped in this GPU VA
> > > > > > +	 * space
> > > > > > +	 */
> > > > > > +	struct dma_resv *resv;
> > > > > > +
> > > > > > +	/**
> > > > > > +	 * @exec: the &drm_exec helper to lock external &drm_gem_objects
> > > > > > +	 */
> > > > > > +	struct drm_exec exec;
> > > > > > +
> > > > > > +	/**
> > > > > > +	 * @mt_ext: &maple_tree storing external &drm_gem_objects
> > > > > > +	 */
> > > > > > +	struct maple_tree mt_ext;
> > > > > Why are you using a maple tree here? Insertion and removal is O(log(n))
> > > > > instead of O(1) for a list?
> > > > > 
> > > > Having a list of drm_gem_objects directly wouldn't work, as multiple GPU-VMs
> > > > could have mappings of the same extobj.
> > > > 
> > > > I considered using the VM_BO abstraction (struct drm_gpuva_gem) as list entry
> > > > instead, which also seems to be the obvious choice. However, there is a locking
> > > > conflict.
> > > > 
> > > > A drm_gem_object keeps a list of drm_gpuva_gems, while each drm_gpuva_gem keeps
> > > > a list of drm_gpuvas. Both lists are either protected with the dma-resv lock of
> > > > the corresponding drm_gem_object, or with an external lock provided by the
> > > > driver (see drm_gem_gpuva_set_lock()). The latter is used by drivers performing
> > > > changes on the GPUVA space directly from the fence signalling path.
> > > > 
> > > > Now, similar to what drm_gpuva_link() and drm_gpuva_unlink() are doing already,
> > > > we'd want to add a drm_gpuva_gem to the extobj list for the first mapping being
> > > > linked and we'd want to remove it for the last one being unlinked.
> > > > 
> > > > (Actually we'd want to add the drm_gpuva_gem object to the extobj list even
> > > > before, because otherwise we'd not acquire the dma-resv lock of this GEM object
> > > > through drm_gpuva_manager_lock(). But that's trivial, we could do that when we
> > > > create the drm_gpuva_gem, which we need to do anyways.)
> > > > 
> > > > Anyway, we'd probably want to keep removing the drm_gpuva_gem from the extobj
> > > > list from drm_gpuva_unlink() when the last mapping of this BO is unlinked. In
> > > > order to do so, we'd (as discussed above) either need to hold the outer GPU-VM
> > > > lock or the GPU-VMs dma-resv lock. Both would be illegal in the case
> > > > drm_gpuva_unlink() is called from within the fence signalling path. For drivers
> > > > like XE or Nouveau, we'd at least need to make sure to not mess up the locking
> > > > hierarchy of GPU-VM lock and dma-resv lock of the corresponding BO.
> > > > 
> > > > Considering all that, I thought it's probably better to track extobjs separate
> > > > from the drm_gpuva_gem, hence the maple tree choice.
> > > Hm. OK, in Xe we're having a list of the xe_vmas (drm_gpuvas) that point to
> > > external objects, or in the case of multiple mappings to the same gem
> > > object, only one of the drm_gpuvas is in the list. These are protected by
> > > the GPU-VM lock. I don't see a problem with removing those from the fence
> > > signalling path, though?
> > I intentionally tried to avoid keeping a list of drm_gpuvas to track extobjs,
> > since this is generic code I don't know how many mappings of an external object
> > the corresponding driver potentially creates. This could become a pretty large
> > list to iterate. Another reason was that I want to keep the drm_gpuva structure
> > as small as possible, hence avoiding another list_head.
> 
> Yes, the list might be pretty large, but OTOH you never iterate to access a
> single list element. When you need to iterate the whole list you need to do
> that regardless of the data structure used. As for the list head, it might
> perhaps be aliased (union) with an upcoming userptr list head?
> 

Oh, I did not mean that I'm concerned about the size of a list of extobjs in
general, that would indeed be the same for every data structure chosen. But I
would be concerned about keeping a list of *all* mappings being backed by an
extobj.

> > 
> > Now, it sounds like in XE you're doing some kind of optimization just keeping a
> > single mapping of an extobj in the list? How do you know when to remove it? What
> > if the mapping from the extobj list gets unmapped, but there is still another
> > one left in the GPU-VM being backed by the same BO?
> When removing from the lists, we iterate through the object's list of vmas,
> and if there is one matching the same vm, we replace the old one with the
> new one. A similar iteration is done when adding to avoid adding one that is
> already on the list.

I see, but wouldn't this be O(n) on insertion and O(m) on removal of an extobj,
while using the maple tree is O(log(n))?
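
Just to have something concrete to compare against, a minimal sketch of the
maple tree side could look like the below, keyed on the GEM object address
with a refcounted entry as value. This is an illustration under those
assumptions, not the code from this series.

struct extobj_entry {
	struct drm_gem_object *obj;
	refcount_t refcount;
};

static int extobj_get(struct drm_gpuva_manager *mgr, struct drm_gem_object *obj)
{
	struct extobj_entry *entry;

	/* O(log(n)) lookup, keyed on the object's address. */
	entry = mtree_load(&mgr->mt_ext, (unsigned long)obj);
	if (entry) {
		refcount_inc(&entry->refcount);
		return 0;
	}

	entry = kzalloc(sizeof(*entry), GFP_KERNEL);
	if (!entry)
		return -ENOMEM;

	entry->obj = obj;
	refcount_set(&entry->refcount, 1);

	/* O(log(n)) insertion. */
	return mtree_store(&mgr->mt_ext, (unsigned long)obj, entry, GFP_KERNEL);
}

static void extobj_put(struct drm_gpuva_manager *mgr, struct drm_gem_object *obj)
{
	struct extobj_entry *entry;

	entry = mtree_load(&mgr->mt_ext, (unsigned long)obj);
	if (entry && refcount_dec_and_test(&entry->refcount)) {
		mtree_erase(&mgr->mt_ext, (unsigned long)obj);
		kfree(entry);
	}
}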

> > Although assuming that's a no-go for GPUVA wouldn't an XArray be a better
> > choice, keeping O(1)?
> > When tracking extobjs, the address of the drm_gem_object is the key while the
> > reference count is the value. I was thinking of an XArray as well, but I was
> > worried that the corresponding indices could be too widely distributed for an
> > XArray to still be efficient. Now that I think about it, it's probably not that
> > bad.
> > 
> > Btw., while I agree with trying to make things as efficient as possible, what is the
> > magnitude of extobjs to be tracked; do we need to worry about the O(log(n))?
> 
> Not sure yet, TBH, but I think one of our UMDs can only use external objects,
> because they don't know at creation time which ones need exporting. However
> if this turns out to be too bad, there are various flavours of "clever but
> complicated" optimizations that we could think of to reduce the list size.
> Still in our case, we opted for the vma list head for now.

Considering the above, I would guess that if your current approach is good
enough, a maple tree will work as well.

Otherwise, if you want, I could do some experiments with an XArray and see how
that works out compared to using a maple tree.
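
As an illustration of that alternative (untested, and the xa_ext member is
hypothetical): an XArray keyed on the GEM object address, with the reference
count stored as a value entry.

static int extobj_get_xa(struct drm_gpuva_manager *mgr,
			 struct drm_gem_object *obj)
{
	unsigned long index = (unsigned long)obj;
	unsigned long cnt;
	void *entry;

	xa_lock(&mgr->xa_ext);
	entry = xa_load(&mgr->xa_ext, index);
	cnt = entry ? xa_to_value(entry) : 0;
	entry = __xa_store(&mgr->xa_ext, index, xa_mk_value(cnt + 1),
			   GFP_ATOMIC);
	xa_unlock(&mgr->xa_ext);

	return xa_err(entry);
}

static void extobj_put_xa(struct drm_gpuva_manager *mgr,
			  struct drm_gem_object *obj)
{
	unsigned long index = (unsigned long)obj;
	unsigned long cnt;
	void *entry;

	xa_lock(&mgr->xa_ext);
	entry = xa_load(&mgr->xa_ext, index);
	cnt = entry ? xa_to_value(entry) : 0;
	if (cnt <= 1)
		__xa_erase(&mgr->xa_ext, index);
	else
		__xa_store(&mgr->xa_ext, index, xa_mk_value(cnt - 1),
			   GFP_ATOMIC);
	xa_unlock(&mgr->xa_ext);
}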

Btw. another nice thing about using an XArray or maple tree for that is that
drivers updating the VA space from the fence signalling path don't need to
hold a GPU-VM lock to update the extobj list. Actually, they might not need
a GPU-VM lock at all.

> 
> /Thomas
> 
> 
> > 
> > > > > > +
> > > > > > +	/**
> > > > > > +	 * @evict: structure holding the evict list and evict list lock
> > > > > > +	 */
> > > > > > +	struct {
> > > > > > +		/**
> > > > > > +		 * @list: &list_head storing &drm_gem_objects currently being
> > > > > > +		 * evicted
> > > > > > +		 */
> > > > > > +		struct list_head list;
> > > > > > +
> > > > > > +		/**
> > > > > > +		 * @lock: spinlock to protect the evict list against concurrent
> > > > > > +		 * insertion / removal of different &drm_gpuva_gems
> > > > > > +		 */
> > > > > > +		spinlock_t lock;
> > > > > > +	} evict;
> > > > > >     };
> > > > > >     void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> > > > > > +			    struct drm_device *drm,
> > > > > >     			    const char *name,
> > > > > >     			    u64 start_offset, u64 range,
> > > > > >     			    u64 reserve_offset, u64 reserve_range,
> > > > > >     			    const struct drm_gpuva_fn_ops *ops);
> > > > > >     void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr);
> > > > > > +/**
> > > > > > + * DRM_GPUVA_EXEC - returns the &drm_gpuva_managers &drm_exec instance
> > > > > > + * @mgr: the &drm_gpuva_managers to return the &drm_exec instance for
> > > > > > + */
> > > > > > +#define DRM_GPUVA_EXEC(mgr)	&(mgr)->exec
> > > > > A struct ww_acquire_ctx and thus a drm_exec is fundamentally per task and
> > > > > should typically be allocated on the stack. Otherwise you'd need to protect
> > > > > the mgr->exec member with an exclusive lock throughout the locking process,
> > > > > and that's not what we want.
> > > > Oh, good point. I think it works in Nouveau, because there it's implicitly
> > > > protected with the job submission lock.
> > > > 
> > > > > Did you consider subclassing a drm_exec for drm_gpuva purposes and add
> > > > > needed ops to it: Like so:
> > > > That's a good idea, will take this into V2.
> > > Actually, I'm not fully sure that was a good idea: I now have a working
> > > version of Xe ported over to drm_exec, having these helpers in mind and with
> > > the intention to start using them as they mature. What I found, though is
> > > that open-coding the drm_exec loop is not all that bad, but that building
> > > blocks that can be called from within the loop are useful:
> > > 
> > > Like the drm_gpuva_prepare_objects() and an imaginary
> > > drm_gpuva_prepare_gpuva() that locks the vm resv and the resv of the object
> > > (if different and the gpuva points to the object). And
> > > drm_gpuva_prepare_array() although we don't use it within Xe. That means you
> > > can use these building blocks like helpers and avoid the fn() callback by
> > > instead open-coding.
> > > 
> > > But I guess YMMV.
> > That's exactly why those building blocks are exported, I already had in mind
> > that there might be drivers which still want to open-code the drm_exec loop,
> > while others might just want a simple interface to lock everything.
> > 
> > I still think it is a good idea, but I'd keep that as simple as possible. And
> > for everything else just let the driver open-code it and use the "building
> > blocks" - will also expand the bulding blocks to what you mentioned above.
> > 
> > > > > struct drm_gpuva_exec_ops {
> > > > >       int (*fn) (struct drm_gpuva_exec *exec, int num_fences);
> > > > Is this the fn argument from drm_gpuva_manager_lock_extra()?
> > > > 
> > > > >       int (*bo_validate) (struct drm_gpuva_exec *exec, struct drm_gem_object
> > > > > *obj);
> > > > I guess we could also keep that within the drm_gpuva_fn_ops? This should always
> > > > be the same callback, right?
> > > > 
> > > > > };
> > > > > 
> > > > > struct drm_gpuva_exec {
> > > > >       const struct drm_gpuva_exec_ops *ops;
> > > > >       struct drm_exec exec;
> > > > >       struct drm_gpuva_manager *mgr;
> > > > > };
> > > > > 
> > > > > Although I'd actually expect bo_validate to be part of fn in the typical
> > > > > case. The drm_gpuva_exec would then be allocated by the caller on the stack.
> > > > This doesn't sound like my assumption about fn() above is correct.
> > > Well one important thing in our conversion is that ttm_bo_validate () needs
> > > to be in the until_all_locked() loop. We want to be able soon to use
> > > sleeping locks for eviction, so a xe_bo_validate() would, at least
> > > temporarily, add locked objects to the drm_exec list of locked objects. That
> > > means everything that may end up calling validate deep within the call chain
> > > needs to be part of the until_all_locked() loop, so our
> > > drm_gpuva_manager_lock_extra() fn callback would include those validates and
> > > look different all the time. Hence that's why open-coding isn't all that
> > > bad...
> > Oh, I see. You indeed want to call validate() from within until_all_locked().
> > 
> > > /Thomas
> > > 
> > > 

<snip>


^ permalink raw reply	[flat|nested] 88+ messages in thread

* Re: [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
@ 2023-08-31 11:18               ` Danilo Krummrich
  0 siblings, 0 replies; 88+ messages in thread
From: Danilo Krummrich @ 2023-08-31 11:18 UTC (permalink / raw)
  To: Thomas Hellström (Intel)
  Cc: matthew.brost, thomas.hellstrom, sarah.walker, nouveau,
	dri-devel, linux-kernel, Liam.Howlett, boris.brezillon,
	donald.robson, christian.koenig, faith.ekstrand, bskeggs

On Thu, Aug 31, 2023 at 11:04:06AM +0200, Thomas Hellström (Intel) wrote:
> Hi!
> 
> On 8/30/23 17:00, Danilo Krummrich wrote:
> > On Wed, Aug 30, 2023 at 03:42:08PM +0200, Thomas Hellström (Intel) wrote:
> > > On 8/30/23 14:49, Danilo Krummrich wrote:
> > > > Hi Thomas,
> > > > 
> > > > thanks for having a look!
> > > > 
> > > > On Wed, Aug 30, 2023 at 09:27:45AM +0200, Thomas Hellström (Intel) wrote:
> > > > > Hi, Danilo.
> > > > > 
> > > > > Some quick comments since I'm doing some Xe work in this area. Will probably
> > > > > get back with more.
> > > > > 
> > > > > On 8/20/23 23:53, Danilo Krummrich wrote:

<snip>

> > > > > > diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h
> > > > > > index ed8d50200cc3..693e2da3f425 100644
> > > > > > --- a/include/drm/drm_gpuva_mgr.h
> > > > > > +++ b/include/drm/drm_gpuva_mgr.h
> > > > > > @@ -26,12 +26,16 @@
> > > > > >      */
> > > > > >     #include <linux/list.h>
> > > > > > +#include <linux/dma-resv.h>
> > > > > > +#include <linux/maple_tree.h>
> > > > > >     #include <linux/rbtree.h>
> > > > > >     #include <linux/types.h>
> > > > > >     #include <drm/drm_gem.h>
> > > > > > +#include <drm/drm_exec.h>
> > > > > >     struct drm_gpuva_manager;
> > > > > > +struct drm_gpuva_gem;
> > > > > >     struct drm_gpuva_fn_ops;
> > > > > >     /**
> > > > > > @@ -140,7 +144,7 @@ struct drm_gpuva {
> > > > > >     int drm_gpuva_insert(struct drm_gpuva_manager *mgr, struct drm_gpuva *va);
> > > > > >     void drm_gpuva_remove(struct drm_gpuva *va);
> > > > > > -void drm_gpuva_link(struct drm_gpuva *va);
> > > > > > +void drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo);
> > > > > >     void drm_gpuva_unlink(struct drm_gpuva *va);
> > > > > >     struct drm_gpuva *drm_gpuva_find(struct drm_gpuva_manager *mgr,
> > > > > > @@ -240,15 +244,137 @@ struct drm_gpuva_manager {
> > > > > >     	 * @ops: &drm_gpuva_fn_ops providing the split/merge steps to drivers
> > > > > >     	 */
> > > > > >     	const struct drm_gpuva_fn_ops *ops;
> > > > > > +
> > > > > > +	/**
> > > > > > +	 * @d_obj: Dummy GEM object; used internally to pass the GPU VMs
> > > > > > +	 * dma-resv to &drm_exec.
> > > > > > +	 */
> > > > > > +	struct drm_gem_object d_obj;
> > > > > > +
> > > > > > +	/**
> > > > > > +	 * @resv: the &dma_resv for &drm_gem_objects mapped in this GPU VA
> > > > > > +	 * space
> > > > > > +	 */
> > > > > > +	struct dma_resv *resv;
> > > > > > +
> > > > > > +	/**
> > > > > > +	 * @exec: the &drm_exec helper to lock external &drm_gem_objects
> > > > > > +	 */
> > > > > > +	struct drm_exec exec;
> > > > > > +
> > > > > > +	/**
> > > > > > +	 * @mt_ext: &maple_tree storing external &drm_gem_objects
> > > > > > +	 */
> > > > > > +	struct maple_tree mt_ext;
> > > > > Why are you using a maple tree here? Insertion and removal is O(log(n))
> > > > > instead of O(1) for a list?
> > > > > 
> > > > Having a list of drm_gem_objects directly wouldn't work, as multiple GPU-VMs
> > > > could have mappings of the same extobj.
> > > > 
> > > > I considered using the VM_BO abstraction (struct drm_gpuva_gem) as list entry
> > > > instead, which also seems to be the obvious choice. However, there is a locking
> > > > conflict.
> > > > 
> > > > A drm_gem_object keeps a list of drm_gpuva_gems, while each drm_gpuva_gem keeps
> > > > a list of drm_gpuvas. Both lists are either protected with the dma-resv lock of
> > > > the corresponding drm_gem_object, or with an external lock provided by the
> > > > driver (see drm_gem_gpuva_set_lock()). The latter is used by drivers performing
> > > > changes on the GPUVA space directly from the fence signalling path.
> > > > 
> > > > Now, similar to what drm_gpuva_link() and drm_gpuva_unlink() are doing already,
> > > > we'd want to add a drm_gpuva_gem to the extobj list for the first mapping being
> > > > linked and we'd want to remove it for the last one being unlinked.
> > > > 
> > > > (Actually we'd want to add the drm_gpuva_gem object to the extobj list even
> > > > before, because otherwise we'd not acquire it's dma-resv lock of this GEM object
> > > > through drm_gpuva_manager_lock(). But that's trival, we could do that when we
> > > > create the drm_gpuva_gem, which we need to do anyways.)
> > > > 
> > > > Anyway, we'd probably want to keep removing the drm_gpuva_gem from the extobj
> > > > list from drm_gpuva_unlink() when the last mapping of this BO is unlinked. In
> > > > order to do so, we'd (as discussed above) either need to hold the outer GPU-VM
> > > > lock or the GPU-VMs dma-resv lock. Both would be illegal in the case
> > > > drm_gpuva_unlink() is called from within the fence signalling path. For drivers
> > > > like XE or Nouveau, we'd at least need to make sure to not mess up the locking
> > > > hierarchy of GPU-VM lock and dma-resv lock of the corresponding BO.
> > > > 
> > > > Considering all that, I thought it's probably better to track extobjs separate
> > > > from the drm_gpuva_gem, hence the maple tree choice.
> > > Hm. OK, in Xe we're having a list of the xe_vmas (drm_gpuvas) that point to
> > > external objects, or in the case of multiple mappings to the same gem
> > > object, only one of the drm_gpuvas is in the list. These are protected by
> > > the GPU-VM lock. I don't see a problem with removing those from the fence
> > > signalling path, though?
> > I intentionally tried to avoid keeping a list of drm_gpuvas to track extobjs,
> > since this is generic code I don't know how much mappings of an external object
> > the corresponding driver potentially creates. This could become a pretty large
> > list to iterate. Another reason was, that I want to keep the drm_gpuva structure
> > as small as possible, hence avoiding another list_head.
> 
> Yes, the list might be pretty large, but OTOH you never iterate to access a
> single list element. When you need to iterate the whole list you need to do
> that regardless of the data structure used. As for the list head, it might
> perhaps be aliased (union) with an upcoming userptr list head?
> 

Oh, I did not mean that I'm concerned about the size of a list of extobjs in
general, that would indeed be the same for every data structure chosen. But I
would be concerned about keeping a list of *all* mappings being backed by an
extobj.

> > 
> > Now, it sounds like in XE you're doing some kind of optimization just keeping a
> > single mapping of an extobj in the list? How do you know when to remove it? What
> > if the mapping from the extobj list gets unmapped, but there is still another
> > one left in the GPU-VM being backed by the same BO?
> When removing from the lists, we iterate through the object's list of vmas,
> and if there is one matching the same vm, we replace the old one with the
> new one. A similar iteration is done when adding to avoid adding one that is
> already on the list.

I see, but wouldn't this be O(n) on insertion and O(m) on removal of an extobj,
while using the maple tree is O(log(n))?

> > Although assuming that's a no-go for GPUVA wouldn't an XArray be a better
> > choice, keeping O(1)?
> > When tracking extobjs, the address of the drm_gem_object is the key while the
> > reference count is the value. I was thinking of an XArray as well, but I was
> > worried that the corresponding indices could be too much distributed for an
> > XArray to still be efficient. Now that I think about it, it's probably not that
> > bad.
> > 
> > Btw., while I agree trying to make things as efficient as possible, what is the
> > magnitue for extobjs to be tracked, do we need to worry about the O(log(n))?
> 
> Not sure yet, TBH, but I think one of our UMDs can only use external object,
> because they don't know at creation time which ones need exporting. However
> if this turns out to be too bad, there are various flavours of "clever but
> complicated" optimizations that we could think of to reduce the list size.
> Still in our case, we opted for the vma list head for now.

Considering the above, I would guess that if your current approach is good
enough, a maple tree will work as well.

Otherwise, if you want, I could do some experiments with Xarray and see how
that works out compared to using a maple tree.

Btw. another nice thing about using Xarray or maple tree for that is that
drivers updating the VA space from the fence signalling path don't need to
hold a GPU-VM lock to update the extobj list. Actually, they might not need
a GPU-VM lock at all.

> 
> /Thomas
> 
> 
> > 
> > > > > > +
> > > > > > +	/**
> > > > > > +	 * @evict: structure holding the evict list and evict list lock
> > > > > > +	 */
> > > > > > +	struct {
> > > > > > +		/**
> > > > > > +		 * @list: &list_head storing &drm_gem_objects currently being
> > > > > > +		 * evicted
> > > > > > +		 */
> > > > > > +		struct list_head list;
> > > > > > +
> > > > > > +		/**
> > > > > > +		 * @lock: spinlock to protect the evict list against concurrent
> > > > > > +		 * insertion / removal of different &drm_gpuva_gems
> > > > > > +		 */
> > > > > > +		spinlock_t lock;
> > > > > > +	} evict;
> > > > > >     };
> > > > > >     void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> > > > > > +			    struct drm_device *drm,
> > > > > >     			    const char *name,
> > > > > >     			    u64 start_offset, u64 range,
> > > > > >     			    u64 reserve_offset, u64 reserve_range,
> > > > > >     			    const struct drm_gpuva_fn_ops *ops);
> > > > > >     void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr);
> > > > > > +/**
> > > > > > + * DRM_GPUVA_EXEC - returns the &drm_gpuva_managers &drm_exec instance
> > > > > > + * @mgr: the &drm_gpuva_managers to return the &drm_exec instance for
> > > > > > + */
> > > > > > +#define DRM_GPUVA_EXEC(mgr)	&(mgr)->exec
> > > > > A struct ww_acquire_ctx and thus a drm_exec is fundamentally per task and
> > > > > should typically be allocated on the stack. Otherwise you'd need to protect
> > > > > the mgr->exec member with an exclusive lock throughout the locking process,
> > > > > and that's not what we want.
> > > > Oh, good point. I think it works in Nouveau, because there it's implicitly
> > > > protected with the job submission lock.
> > > > 
> > > > > Did you consider subclassing a drm_exec for drm_gpuva purposes and add
> > > > > needed ops to it: Like so:
> > > > That's a good idea, will take this into V2.
> > > Actually, I'm not fully sure that was a good idea: I've now have a working
> > > version of Xe ported over to drm_exec, having these helpers in mind and with
> > > the intention to start using them as they mature. What I found, though is
> > > that open-coding the drm_exec loop is not all that bad, but that building
> > > blocks that can be called from within the loop are useful:
> > > 
> > > Like the drm_gpuva_prepare_objects() and an imaginary
> > > drm_gpuva_prepare_gpuva() that locks the vm resv and the resv of the object
> > > (if different and the gpuva points to the object. And
> > > drm_gpuva_prepare_array() although we don't use it within Xe. That means you
> > > can use these building blocks like helpers and avoid the fn() callback by
> > > instead open-coding.
> > > 
> > > But I guess YMMV.
> > That's exactly why those building blocks are exported, I already had in mind
> > that there might be drivers which still want to open-code the drm_exec loop,
> > while others might just want a simple interface to lock everything.
> > 
> > I still think it is a good idea, but I'd keep that as simple as possible. And
> > for everything else just let the driver open-code it and use the "building
> > blocks" - will also expand the bulding blocks to what you mentioned above.
> > 
> > > > > struct drm_gpuva_exec_ops {
> > > > >       int (*fn) (struct drm_gpuva_exec *exec, int num_fences);
> > > > Is this the fn argument from drm_gpuva_manager_lock_extra()?
> > > > 
> > > > >       int (*bo_validate) (struct drm_gpuva_exec *exec, struct drm_gem_object
> > > > > *obj);
> > > > I guess we could also keep that within the drm_gpuva_fn_ops? This should always
> > > > be the same callback, right?
> > > > 
> > > > > };
> > > > > 
> > > > > struct drm_gpuva_exec {
> > > > >       const struct drm_gpuva_exec_ops *ops;
> > > > >       struct drm_exec exec;
> > > > >       struct drm_gpuva_manager *mgr;
> > > > > };
> > > > > 
> > > > > Although I'd actually expect bo_validate to be part of fn in the typical
> > > > > case. The drm_gpuva_exec would then be allocated by the caller on the stack.
> > > > This doesn't sound like my assumption about fn() above is correct.
> > > Well one important thing in our conversion is that ttm_bo_validate () needs
> > > to be in the until_all_locked() loop. We want to be able soon to use
> > > sleeping locks for eviction, so a xe_bo_validate() would, at least
> > > temporarily, add locked objects to the drm_exec list of locked objects. That
> > > means everything that may end up calling validate deep within the call chain
> > > needs to be part of the until_all_locked() loop, so our
> > > drm_gpuva_manager_lock_extra() fn callback would include those validates and
> > > look different all the time. Hence that's why open-coding isn't all that
> > > bad...
> > Oh, I see. You indeed want to call validate() from within until_all_locked().
> > 
> > > /Thomas
> > > 
> > > 

<snip>


^ permalink raw reply	[flat|nested] 88+ messages in thread

* Re: [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
@ 2023-08-31 11:18               ` Danilo Krummrich
  0 siblings, 0 replies; 88+ messages in thread
From: Danilo Krummrich @ 2023-08-31 11:18 UTC (permalink / raw)
  To: Thomas Hellström (Intel)
  Cc: airlied, daniel, matthew.brost, thomas.hellstrom, sarah.walker,
	donald.robson, boris.brezillon, christian.koenig, faith.ekstrand,
	bskeggs, Liam.Howlett, nouveau, linux-kernel, dri-devel

On Thu, Aug 31, 2023 at 11:04:06AM +0200, Thomas Hellström (Intel) wrote:
> Hi!
> 
> On 8/30/23 17:00, Danilo Krummrich wrote:
> > On Wed, Aug 30, 2023 at 03:42:08PM +0200, Thomas Hellström (Intel) wrote:
> > > On 8/30/23 14:49, Danilo Krummrich wrote:
> > > > Hi Thomas,
> > > > 
> > > > thanks for having a look!
> > > > 
> > > > On Wed, Aug 30, 2023 at 09:27:45AM +0200, Thomas Hellström (Intel) wrote:
> > > > > Hi, Danilo.
> > > > > 
> > > > > Some quick comments since I'm doing some Xe work in this area. Will probably
> > > > > get back with more.
> > > > > 
> > > > > On 8/20/23 23:53, Danilo Krummrich wrote:

<snip>

> > > > > > diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h
> > > > > > index ed8d50200cc3..693e2da3f425 100644
> > > > > > --- a/include/drm/drm_gpuva_mgr.h
> > > > > > +++ b/include/drm/drm_gpuva_mgr.h
> > > > > > @@ -26,12 +26,16 @@
> > > > > >      */
> > > > > >     #include <linux/list.h>
> > > > > > +#include <linux/dma-resv.h>
> > > > > > +#include <linux/maple_tree.h>
> > > > > >     #include <linux/rbtree.h>
> > > > > >     #include <linux/types.h>
> > > > > >     #include <drm/drm_gem.h>
> > > > > > +#include <drm/drm_exec.h>
> > > > > >     struct drm_gpuva_manager;
> > > > > > +struct drm_gpuva_gem;
> > > > > >     struct drm_gpuva_fn_ops;
> > > > > >     /**
> > > > > > @@ -140,7 +144,7 @@ struct drm_gpuva {
> > > > > >     int drm_gpuva_insert(struct drm_gpuva_manager *mgr, struct drm_gpuva *va);
> > > > > >     void drm_gpuva_remove(struct drm_gpuva *va);
> > > > > > -void drm_gpuva_link(struct drm_gpuva *va);
> > > > > > +void drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo);
> > > > > >     void drm_gpuva_unlink(struct drm_gpuva *va);
> > > > > >     struct drm_gpuva *drm_gpuva_find(struct drm_gpuva_manager *mgr,
> > > > > > @@ -240,15 +244,137 @@ struct drm_gpuva_manager {
> > > > > >     	 * @ops: &drm_gpuva_fn_ops providing the split/merge steps to drivers
> > > > > >     	 */
> > > > > >     	const struct drm_gpuva_fn_ops *ops;
> > > > > > +
> > > > > > +	/**
> > > > > > +	 * @d_obj: Dummy GEM object; used internally to pass the GPU VMs
> > > > > > +	 * dma-resv to &drm_exec.
> > > > > > +	 */
> > > > > > +	struct drm_gem_object d_obj;
> > > > > > +
> > > > > > +	/**
> > > > > > +	 * @resv: the &dma_resv for &drm_gem_objects mapped in this GPU VA
> > > > > > +	 * space
> > > > > > +	 */
> > > > > > +	struct dma_resv *resv;
> > > > > > +
> > > > > > +	/**
> > > > > > +	 * @exec: the &drm_exec helper to lock external &drm_gem_objects
> > > > > > +	 */
> > > > > > +	struct drm_exec exec;
> > > > > > +
> > > > > > +	/**
> > > > > > +	 * @mt_ext: &maple_tree storing external &drm_gem_objects
> > > > > > +	 */
> > > > > > +	struct maple_tree mt_ext;
> > > > > Why are you using a maple tree here? Insertion and removal is O(log(n))
> > > > > instead of O(1) for a list?
> > > > > 
> > > > Having a list of drm_gem_objects directly wouldn't work, as multiple GPU-VMs
> > > > could have mappings of the same extobj.
> > > > 
> > > > I considered using the VM_BO abstraction (struct drm_gpuva_gem) as list entry
> > > > instead, which also seems to be the obvious choice. However, there is a locking
> > > > conflict.
> > > > 
> > > > A drm_gem_object keeps a list of drm_gpuva_gems, while each drm_gpuva_gem keeps
> > > > a list of drm_gpuvas. Both lists are either protected with the dma-resv lock of
> > > > the corresponding drm_gem_object, or with an external lock provided by the
> > > > driver (see drm_gem_gpuva_set_lock()). The latter is used by drivers performing
> > > > changes on the GPUVA space directly from the fence signalling path.
> > > > 
> > > > Now, similar to what drm_gpuva_link() and drm_gpuva_unlink() are doing already,
> > > > we'd want to add a drm_gpuva_gem to the extobj list for the first mapping being
> > > > linked and we'd want to remove it for the last one being unlinked.
> > > > 
> > > > (Actually we'd want to add the drm_gpuva_gem object to the extobj list even
> > > > before, because otherwise we'd not acquire its dma-resv lock of this GEM object
> > > > through drm_gpuva_manager_lock(). But that's trivial, we could do that when we
> > > > create the drm_gpuva_gem, which we need to do anyways.)
> > > > 
> > > > Anyway, we'd probably want to keep removing the drm_gpuva_gem from the extobj
> > > > list from drm_gpuva_unlink() when the last mapping of this BO is unlinked. In
> > > > order to do so, we'd (as discussed above) either need to hold the outer GPU-VM
> > > > lock or the GPU-VMs dma-resv lock. Both would be illegal in the case
> > > > drm_gpuva_unlink() is called from within the fence signalling path. For drivers
> > > > like XE or Nouveau, we'd at least need to make sure to not mess up the locking
> > > > hierarchy of GPU-VM lock and dma-resv lock of the corresponding BO.
> > > > 
> > > > Considering all that, I thought it's probably better to track extobjs separate
> > > > from the drm_gpuva_gem, hence the maple tree choice.
> > > Hm. OK, in Xe we're having a list of the xe_vmas (drm_gpuvas) that point to
> > > external objects, or in the case of multiple mappings to the same gem
> > > object, only one of the drm_gpuvas is in the list. These are protected by
> > > the GPU-VM lock. I don't see a problem with removing those from the fence
> > > signalling path, though?
> > I intentionally tried to avoid keeping a list of drm_gpuvas to track extobjs,
> > since this is generic code I don't know how many mappings of an external object
> > the corresponding driver potentially creates. This could become a pretty large
> > list to iterate. Another reason was that I want to keep the drm_gpuva structure
> > as small as possible, hence avoiding another list_head.
> 
> Yes, the list might be pretty large, but OTOH you never iterate to access a
> single list element. When you need to iterate the whole list you need to do
> that regardless of the data structure used. As for the list head, it might
> perhaps be aliased (union) with an upcoming userptr list head?
> 

Oh, I did not mean that I'm concerned about the size of a list of extobjs in
general, that would indeed be the same for every data structure chosen. But I
would be concerned about keeping a list of *all* mappings being backed by an
extobj.

> > 
> > Now, it sounds like in XE you're doing some kind of optimization just keeping a
> > single mapping of an extobj in the list? How do you know when to remove it? What
> > if the mapping from the extobj list gets unmapped, but there is still another
> > one left in the GPU-VM being backed by the same BO?
> When removing from the lists, we iterate through the object's list of vmas,
> and if there is one matching the same vm, we replace the old one with the
> new one. A similar iteration is done when adding to avoid adding one that is
> already on the list.

I see, but wouldn't this be O(n) on insertion and O(m) on removal of an extobj,
while using the maple tree is O(log(n))?

> > Although assuming that's a no-go for GPUVA wouldn't an XArray be a better
> > choice, keeping O(1)?
> > When tracking extobjs, the address of the drm_gem_object is the key while the
> > reference count is the value. I was thinking of an XArray as well, but I was
> > worried that the corresponding indices could be too much distributed for an
> > XArray to still be efficient. Now that I think about it, it's probably not that
> > bad.
> > 
> > Btw., while I agree with trying to make things as efficient as possible, what is the
> > magnitude for extobjs to be tracked, do we need to worry about the O(log(n))?
> 
> Not sure yet, TBH, but I think one of our UMDs can only use external objects,
> because they don't know at creation time which ones need exporting. However
> if this turns out to be too bad, there are various flavours of "clever but
> complicated" optimizations that we could think of to reduce the list size.
> Still in our case, we opted for the vma list head for now.

Considering the above, I would guess that if your current approach is good
enough, a maple tree will work as well.

Otherwise, if you want, I could do some experiments with Xarray and see how
that works out compared to using a maple tree.

Btw. another nice thing about using Xarray or maple tree for that is that
drivers updating the VA space from the fence signalling path don't need to
hold a GPU-VM lock to update the extobj list. Actually, they might not need
a GPU-VM lock at all.
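
To illustrate what I mean - a minimal sketch only, using an xarray and a
made-up xa_ext field instead of the RFC's mt_ext, with the GEM object
address as index and the per-VM mapping count as value:

/* Sketch, not part of the series: mgr->xa_ext is a hypothetical xarray
 * replacing mt_ext; the drm_gem_object address is the index, the number
 * of mappings this VM has of the object is the value.
 */
static int extobj_get(struct drm_gpuva_manager *mgr,
		      struct drm_gem_object *obj)
{
	unsigned long index = (unsigned long)obj;
	void *entry;
	int ret = 0;

	xa_lock(&mgr->xa_ext);
	entry = xa_load(&mgr->xa_ext, index);
	if (entry) {
		/* Already tracked, just bump the mapping count. */
		__xa_store(&mgr->xa_ext, index,
			   xa_mk_value(xa_to_value(entry) + 1), GFP_ATOMIC);
	} else {
		ret = xa_err(__xa_store(&mgr->xa_ext, index,
					xa_mk_value(1), GFP_ATOMIC));
	}
	xa_unlock(&mgr->xa_ext);

	return ret;
}

Since only the xarray's internal lock is taken here, this could be called
from drm_gpuva_link() without holding a GPU-VM wide lock.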

> 
> /Thomas
> 
> 
> > 
> > > > > > +
> > > > > > +	/**
> > > > > > +	 * @evict: structure holding the evict list and evict list lock
> > > > > > +	 */
> > > > > > +	struct {
> > > > > > +		/**
> > > > > > +		 * @list: &list_head storing &drm_gem_objects currently being
> > > > > > +		 * evicted
> > > > > > +		 */
> > > > > > +		struct list_head list;
> > > > > > +
> > > > > > +		/**
> > > > > > +		 * @lock: spinlock to protect the evict list against concurrent
> > > > > > +		 * insertion / removal of different &drm_gpuva_gems
> > > > > > +		 */
> > > > > > +		spinlock_t lock;
> > > > > > +	} evict;
> > > > > >     };
> > > > > >     void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> > > > > > +			    struct drm_device *drm,
> > > > > >     			    const char *name,
> > > > > >     			    u64 start_offset, u64 range,
> > > > > >     			    u64 reserve_offset, u64 reserve_range,
> > > > > >     			    const struct drm_gpuva_fn_ops *ops);
> > > > > >     void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr);
> > > > > > +/**
> > > > > > + * DRM_GPUVA_EXEC - returns the &drm_gpuva_managers &drm_exec instance
> > > > > > + * @mgr: the &drm_gpuva_managers to return the &drm_exec instance for
> > > > > > + */
> > > > > > +#define DRM_GPUVA_EXEC(mgr)	&(mgr)->exec
> > > > > A struct ww_acquire_ctx and thus a drm_exec is fundamentally per task and
> > > > > should typically be allocated on the stack. Otherwise you'd need to protect
> > > > > the mgr->exec member with an exclusive lock throughout the locking process,
> > > > > and that's not what we want.
> > > > Oh, good point. I think it works in Nouveau, because there it's implicitly
> > > > protected with the job submission lock.
> > > > 
> > > > > Did you consider subclassing a drm_exec for drm_gpuva purposes and add
> > > > > needed ops to it: Like so:
> > > > That's a good idea, will take this into V2.
> > > Actually, I'm not fully sure that was a good idea: I now have a working
> > > version of Xe ported over to drm_exec, having these helpers in mind and with
> > > the intention to start using them as they mature. What I found, though is
> > > that open-coding the drm_exec loop is not all that bad, but that building
> > > blocks that can be called from within the loop are useful:
> > > 
> > > Like the drm_gpuva_prepare_objects() and an imaginary
> > > drm_gpuva_prepare_gpuva() that locks the vm resv and the resv of the object
> > > (if different and the gpuva points to the object). And
> > > drm_gpuva_prepare_array() although we don't use it within Xe. That means you
> > > can use these building blocks like helpers and avoid the fn() callback by
> > > instead open-coding.
> > > 
> > > But I guess YMMV.
> > That's exactly why those building blocks are exported, I already had in mind
> > that there might be drivers which still want to open-code the drm_exec loop,
> > while others might just want a simple interface to lock everything.
> > 
> > I still think it is a good idea, but I'd keep that as simple as possible. And
> > for everything else just let the driver open-code it and use the "building
> > blocks" - will also expand the bulding blocks to what you mentioned above.
> > 
> > > > > struct drm_gpuva_exec_ops {
> > > > >       int (*fn) (struct drm_gpuva_exec *exec, int num_fences);
> > > > Is this the fn argument from drm_gpuva_manager_lock_extra()?
> > > > 
> > > > >       int (*bo_validate) (struct drm_gpuva_exec *exec, struct drm_gem_object
> > > > > *obj);
> > > > I guess we could also keep that within the drm_gpuva_fn_ops? This should always
> > > > be the same callback, right?
> > > > 
> > > > > };
> > > > > 
> > > > > struct drm_gpuva_exec {
> > > > >       const struct drm_gpuva_exec_ops *ops;
> > > > >       struct drm_exec exec;
> > > > >       struct drm_gpuva_manager *mgr;
> > > > > };
> > > > > 
> > > > > Although I'd actually expect bo_validate to be part of fn in the typical
> > > > > case. The drm_gpuva_exec would then be allocated by the caller on the stack.
> > > > This doesn't sound like my assumption about fn() above is correct.
> > > Well one important thing in our conversion is that ttm_bo_validate () needs
> > > to be in the until_all_locked() loop. We want to be able soon to use
> > > sleeping locks for eviction, so a xe_bo_validate() would, at least
> > > temporarily, add locked objects to the drm_exec list of locked objects. That
> > > means everything that may end up calling validate deep within the call chain
> > > needs to be part of the until_all_locked() loop, so our
> > > drm_gpuva_manager_lock_extra() fn callback would include those validates and
> > > look different all the time. Hence that's why open-coding isn't all that
> > > bad...
> > Oh, I see. You indeed want to call validate() from within until_all_locked().
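
I.e. something along the lines of the following open-coded loop - again just
a sketch, the Xe-side names (xe_vm, xe_vm_validate_rebind()) are made up and
the drm_gpuva_prepare_objects() signature is assumed to take the drm_exec
explicitly:

/* Sketch: lock the VM resv and all extobj resvs, then validate evicted
 * BOs from within the same until_all_locked() loop, since validation may
 * itself lock (and hence add) further objects.
 */
static int vm_lock_and_validate(struct drm_gpuva_manager *mgr,
				struct xe_vm *vm, unsigned int num_fences)
{
	struct drm_exec exec;
	int ret = 0;

	drm_exec_init(&exec, DRM_EXEC_INTERRUPTIBLE_WAIT);
	drm_exec_until_all_locked(&exec) {
		ret = drm_gpuva_prepare_objects(mgr, &exec, num_fences);
		drm_exec_retry_on_contention(&exec);
		if (ret)
			break;

		ret = xe_vm_validate_rebind(vm, &exec);
		drm_exec_retry_on_contention(&exec);
		if (ret)
			break;
	}
	drm_exec_fini(&exec);

	return ret;
}
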
> > 
> > > /Thomas
> > > 
> > > 

<snip>


^ permalink raw reply	[flat|nested] 88+ messages in thread

* Re: [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
  2023-08-31 11:18               ` Danilo Krummrich
  (?)
@ 2023-08-31 16:53                 ` Thomas Hellström (Intel)
  -1 siblings, 0 replies; 88+ messages in thread
From: Thomas Hellström (Intel) @ 2023-08-31 16:53 UTC (permalink / raw)
  To: Danilo Krummrich
  Cc: airlied, daniel, matthew.brost, thomas.hellstrom, sarah.walker,
	donald.robson, boris.brezillon, christian.koenig, faith.ekstrand,
	bskeggs, Liam.Howlett, nouveau, linux-kernel, dri-devel

Hi,

On 8/31/23 13:18, Danilo Krummrich wrote:
> On Thu, Aug 31, 2023 at 11:04:06AM +0200, Thomas Hellström (Intel) wrote:
>> Hi!
>>
>> On 8/30/23 17:00, Danilo Krummrich wrote:
>>> On Wed, Aug 30, 2023 at 03:42:08PM +0200, Thomas Hellström (Intel) wrote:
>>>> On 8/30/23 14:49, Danilo Krummrich wrote:
>>>>> Hi Thomas,
>>>>>
>>>>> thanks for having a look!
>>>>>
>>>>> On Wed, Aug 30, 2023 at 09:27:45AM +0200, Thomas Hellström (Intel) wrote:
>>>>>> Hi, Danilo.
>>>>>>
>>>>>> Some quick comments since I'm doing some Xe work in this area. Will probably
>>>>>> get back with more.
>>>>>>
>>>>>> On 8/20/23 23:53, Danilo Krummrich wrote:
> <snip>
>
>>>>>>> diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h
>>>>>>> index ed8d50200cc3..693e2da3f425 100644
>>>>>>> --- a/include/drm/drm_gpuva_mgr.h
>>>>>>> +++ b/include/drm/drm_gpuva_mgr.h
>>>>>>> @@ -26,12 +26,16 @@
>>>>>>>       */
>>>>>>>      #include <linux/list.h>
>>>>>>> +#include <linux/dma-resv.h>
>>>>>>> +#include <linux/maple_tree.h>
>>>>>>>      #include <linux/rbtree.h>
>>>>>>>      #include <linux/types.h>
>>>>>>>      #include <drm/drm_gem.h>
>>>>>>> +#include <drm/drm_exec.h>
>>>>>>>      struct drm_gpuva_manager;
>>>>>>> +struct drm_gpuva_gem;
>>>>>>>      struct drm_gpuva_fn_ops;
>>>>>>>      /**
>>>>>>> @@ -140,7 +144,7 @@ struct drm_gpuva {
>>>>>>>      int drm_gpuva_insert(struct drm_gpuva_manager *mgr, struct drm_gpuva *va);
>>>>>>>      void drm_gpuva_remove(struct drm_gpuva *va);
>>>>>>> -void drm_gpuva_link(struct drm_gpuva *va);
>>>>>>> +void drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo);
>>>>>>>      void drm_gpuva_unlink(struct drm_gpuva *va);
>>>>>>>      struct drm_gpuva *drm_gpuva_find(struct drm_gpuva_manager *mgr,
>>>>>>> @@ -240,15 +244,137 @@ struct drm_gpuva_manager {
>>>>>>>      	 * @ops: &drm_gpuva_fn_ops providing the split/merge steps to drivers
>>>>>>>      	 */
>>>>>>>      	const struct drm_gpuva_fn_ops *ops;
>>>>>>> +
>>>>>>> +	/**
>>>>>>> +	 * @d_obj: Dummy GEM object; used internally to pass the GPU VMs
>>>>>>> +	 * dma-resv to &drm_exec.
>>>>>>> +	 */
>>>>>>> +	struct drm_gem_object d_obj;
>>>>>>> +
>>>>>>> +	/**
>>>>>>> +	 * @resv: the &dma_resv for &drm_gem_objects mapped in this GPU VA
>>>>>>> +	 * space
>>>>>>> +	 */
>>>>>>> +	struct dma_resv *resv;
>>>>>>> +
>>>>>>> +	/**
>>>>>>> +	 * @exec: the &drm_exec helper to lock external &drm_gem_objects
>>>>>>> +	 */
>>>>>>> +	struct drm_exec exec;
>>>>>>> +
>>>>>>> +	/**
>>>>>>> +	 * @mt_ext: &maple_tree storing external &drm_gem_objects
>>>>>>> +	 */
>>>>>>> +	struct maple_tree mt_ext;
>>>>>> Why are you using a maple tree here? Insertion and removal is O(log(n))
>>>>>> instead of O(1) for a list?
>>>>>>
>>>>> Having a list of drm_gem_objects directly wouldn't work, as multiple GPU-VMs
>>>>> could have mappings of the same extobj.
>>>>>
>>>>> I considered using the VM_BO abstraction (struct drm_gpuva_gem) as list entry
>>>>> instead, which also seems to be the obvious choice. However, there is a locking
>>>>> conflict.
>>>>>
>>>>> A drm_gem_object keeps a list of drm_gpuva_gems, while each drm_gpuva_gem keeps
>>>>> a list of drm_gpuvas. Both lists are either protected with the dma-resv lock of
>>>>> the corresponding drm_gem_object, or with an external lock provided by the
>>>>> driver (see drm_gem_gpuva_set_lock()). The latter is used by drivers performing
>>>>> changes on the GPUVA space directly from the fence signalling path.
>>>>>
>>>>> Now, similar to what drm_gpuva_link() and drm_gpuva_unlink() are doing already,
>>>>> we'd want to add a drm_gpuva_gem to the extobj list for the first mapping being
>>>>> linked and we'd want to remove it for the last one being unlinked.
>>>>>
>>>>> (Actually we'd want to add the drm_gpuva_gem object to the extobj list even
>>>>> before, because otherwise we'd not acquire its dma-resv lock of this GEM object
>>>>> through drm_gpuva_manager_lock(). But that's trivial, we could do that when we
>>>>> create the drm_gpuva_gem, which we need to do anyways.)
>>>>>
>>>>> Anyway, we'd probably want to keep removing the drm_gpuva_gem from the extobj
>>>>> list from drm_gpuva_unlink() when the last mapping of this BO is unlinked. In
>>>>> order to do so, we'd (as discussed above) either need to hold the outer GPU-VM
>>>>> lock or the GPU-VMs dma-resv lock. Both would be illegal in the case
>>>>> drm_gpuva_unlink() is called from within the fence signalling path. For drivers
>>>>> like XE or Nouveau, we'd at least need to make sure to not mess up the locking
>>>>> hierarchy of GPU-VM lock and dma-resv lock of the corresponding BO.
>>>>>
>>>>> Considering all that, I thought it's probably better to track extobjs separate
>>>>> from the drm_gpuva_gem, hence the maple tree choice.
>>>> Hm. OK, in Xe we're having a list of the xe_vmas (drm_gpuvas) that point to
>>>> external objects, or in the case of multiple mappings to the same gem
>>>> object, only one of the drm_gpuvas is in the list. These are protected by
>>>> the GPU-VM lock. I don't see a problem with removing those from the fence
>>>> signalling path, though?
>>> I intentionally tried to avoid keeping a list of drm_gpuvas to track extobjs,
>>> since this is generic code I don't know how many mappings of an external object
>>> the corresponding driver potentially creates. This could become a pretty large
>>> list to iterate. Another reason was that I want to keep the drm_gpuva structure
>>> as small as possible, hence avoiding another list_head.
>> Yes, the list might be pretty large, but OTOH you never iterate to access a
>> single list element. When you need to iterate the whole list you need to do
>> that regardless of the data structure used. As for the list head, it might
>> perhaps be aliased (union) with an upcoming userptr list head?
>>
> Oh, I did not mean that I'm concerned about the size of a list of extobjs in
> general, that would indeed be the same for every data structure chosen. But I
> would be concerned about keeping a list of *all* mappings being backed by an
> extobj.
>
>>> Now, it sounds like in XE you're doing some kind of optimization just keeping a
>>> single mapping of an extobj in the list? How do you know when to remove it? What
>>> if the mapping from the extobj list gets unmapped, but there is still another
>>> one left in the GPU-VM being backed by the same BO?
>> When removing from the lists, we iterate through the object's list of vmas,
>> and if there is one matching the same vm, we replace the old one with the
>> new one. A similar iteration is done when adding to avoid adding one that is
>> already on the list.
> I see, but wouldn't this be O(n) on insertion and O(m) on removal of an extobj,
> while using the maple tree is O(log(n))?

No, insertion and removal is O(m) where m is the number of vms the 
object is currently bound to. Typically a very small number.
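
Roughly like the following - names made up for the sake of illustration, not
actual Xe code, and it runs under the GPU-VM lock:

/* Sketch: walk the object's list of vmas; if some vma of this vm already
 * represents the object on the vm's extobj list, don't add another one.
 */
static void vm_extobj_add(struct xe_vm *vm, struct xe_vma *vma)
{
	struct xe_vma *tmp;

	list_for_each_entry(tmp, &vma->bo->vmas, bo_link)
		if (tmp->vm == vm && !list_empty(&tmp->extobj_link))
			return;

	list_add_tail(&vma->extobj_link, &vm->extobj_list);
}

Removal does the mirror image: walk the same list and, if the vma being
removed was the one on the extobj list, hand that role over to another vma
of the same vm, if any.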

>
>>> Although assuming that's a no-go for GPUVA wouldn't an XArray be a better
>>> choice, keeping O(1)?
>>> When tracking extobjs, the address of the drm_gem_object is the key while the
>>> reference count is the value. I was thinking of an XArray as well, but I was
>>> worried that the corresponding indices could be too much distributed for an
>>> XArray to still be efficient. Now that I think about it, it's probably not that
>>> bad.
>>>
>>> Btw., while I agree with trying to make things as efficient as possible, what is the
>>> magnitude for extobjs to be tracked, do we need to worry about the O(log(n))?
>> Not sure yet, TBH, but I think one of our UMDs can only use external objects,
>> because they don't know at creation time which ones need exporting. However
>> if this turns out to be too bad, there are various flavours of "clever but
>> complicated" optimizations that we could think of to reduce the list size.
>> Still in our case, we opted for the vma list head for now.
> Considering the above, I would guess that if your current approach is good
> enough, a maple tree will work as well.

Hmm, yeah, it's probably a bikeshed since each drm_exec builds a
realloced array of all external objects on each exec.

>
> Otherwise, if you want, I could do some experiments with Xarray and see how
> that works out compared to using a maple tree.
>
> Btw. another nice thing about using Xarray or maple tree for that is that
> drivers updating the VA space from the fence signalling path don't need to
> hold a GPU-VM lock to update the extobj list. Actually, they might not need
> a GPU-VM lock at all.

I still don't follow why drivers would want to do that. Isn't the VA 
space / fence object list always updated synchronously from the IOCTL?

/Thomas


>
>> /Thomas
>>
>>
>>>>>>> +
>>>>>>> +	/**
>>>>>>> +	 * @evict: structure holding the evict list and evict list lock
>>>>>>> +	 */
>>>>>>> +	struct {
>>>>>>> +		/**
>>>>>>> +		 * @list: &list_head storing &drm_gem_objects currently being
>>>>>>> +		 * evicted
>>>>>>> +		 */
>>>>>>> +		struct list_head list;
>>>>>>> +
>>>>>>> +		/**
>>>>>>> +		 * @lock: spinlock to protect the evict list against concurrent
>>>>>>> +		 * insertion / removal of different &drm_gpuva_gems
>>>>>>> +		 */
>>>>>>> +		spinlock_t lock;
>>>>>>> +	} evict;
>>>>>>>      };
>>>>>>>      void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
>>>>>>> +			    struct drm_device *drm,
>>>>>>>      			    const char *name,
>>>>>>>      			    u64 start_offset, u64 range,
>>>>>>>      			    u64 reserve_offset, u64 reserve_range,
>>>>>>>      			    const struct drm_gpuva_fn_ops *ops);
>>>>>>>      void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr);
>>>>>>> +/**
>>>>>>> + * DRM_GPUVA_EXEC - returns the &drm_gpuva_managers &drm_exec instance
>>>>>>> + * @mgr: the &drm_gpuva_managers to return the &drm_exec instance for
>>>>>>> + */
>>>>>>> +#define DRM_GPUVA_EXEC(mgr)	&(mgr)->exec
>>>>>> A struct ww_acquire_ctx and thus a drm_exec is fundamentally per task and
>>>>>> should typically be allocated on the stack. Otherwise you'd need to protect
>>>>>> the mgr->exec member with an exclusive lock throughout the locking process,
>>>>>> and that's not what we want.
>>>>> Oh, good point. I think it works in Nouveau, because there it's implicitly
>>>>> protected with the job submission lock.
>>>>>
>>>>>> Did you consider subclassing a drm_exec for drm_gpuva purposes and add
>>>>>> needed ops to it: Like so:
>>>>> That's a good idea, will take this into V2.
>>>> Actually, I'm not fully sure that was a good idea: I now have a working
>>>> version of Xe ported over to drm_exec, having these helpers in mind and with
>>>> the intention to start using them as they mature. What I found, though is
>>>> that open-coding the drm_exec loop is not all that bad, but that building
>>>> blocks that can be called from within the loop are useful:
>>>>
>>>> Like the drm_gpuva_prepare_objects() and an imaginary
>>>> drm_gpuva_prepare_gpuva() that locks the vm resv and the resv of the object
>>>> (if different and the gpuva points to the object). And
>>>> drm_gpuva_prepare_array() although we don't use it within Xe. That means you
>>>> can use these building blocks like helpers and avoid the fn() callback by
>>>> instead open-coding.
>>>>
>>>> But I guess YMMV.
>>> That's exactly why those building blocks are exported, I already had in mind
>>> that there might be drivers which still want to open-code the drm_exec loop,
>>> while others might just want a simple interface to lock everything.
>>>
>>> I still think it is a good idea, but I'd keep that as simple as possible. And
>>> for everything else just let the driver open-code it and use the "building
>>> blocks" - will also expand the bulding blocks to what you mentioned above.
>>>
>>>>>> struct drm_gpuva_exec_ops {
>>>>>>        int (*fn) (struct drm_gpuva_exec *exec, int num_fences);
>>>>> Is this the fn argument from drm_gpuva_manager_lock_extra()?
>>>>>
>>>>>>        int (*bo_validate) (struct drm_gpuva_exec *exec, struct drm_gem_object
>>>>>> *obj);
>>>>> I guess we could also keep that within the drm_gpuva_fn_ops? This should always
>>>>> be the same callback, right?
>>>>>
>>>>>> };
>>>>>>
>>>>>> struct drm_gpuva_exec {
>>>>>>        const struct drm_gpuva_exec_ops *ops;
>>>>>>        struct drm_exec exec;
>>>>>>        struct drm_gpuva_manager *mgr;
>>>>>> };
>>>>>>
>>>>>> Although I'd actually expect bo_validate to be part of fn in the typical
>>>>>> case. The drm_gpuva_exec would then be allocated by the caller on the stack.
>>>>> This doesn't sound like my assumption about fn() above is correct.
>>>> Well one important thing in our conversion is that ttm_bo_validate () needs
>>>> to be in the until_all_locked() loop. We want to be able soon to use
>>>> sleeping locks for eviction, so a xe_bo_validate() would, at least
>>>> temporarily, add locked objects to the drm_exec list of locked objects. That
>>>> means everything that may end up calling validate deep within the call chain
>>>> needs to be part of the until_all_locked() loop, so our
>>>> drm_gpuva_manager_lock_extra() fn callback would include those validates and
>>>> look different all the time. Hence that's why open-coding isn't all that
>>>> bad...
>>> Oh, I see. You indeed want to call validate() from within until_all_locked().
>>>
>>>> /Thomas
>>>>
>>>>
> <snip>

^ permalink raw reply	[flat|nested] 88+ messages in thread

* Re: [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
  2023-08-31 16:53                 ` [Nouveau] " Thomas Hellström (Intel)
  (?)
@ 2023-08-31 17:23                   ` Thomas Hellström
  -1 siblings, 0 replies; 88+ messages in thread
From: Thomas Hellström @ 2023-08-31 17:23 UTC (permalink / raw)
  To: Thomas Hellström (Intel), Danilo Krummrich
  Cc: airlied, daniel, matthew.brost, sarah.walker, donald.robson,
	boris.brezillon, christian.koenig, faith.ekstrand, bskeggs,
	Liam.Howlett, nouveau, linux-kernel, dri-devel


On 8/31/23 18:53, Thomas Hellström (Intel) wrote:
> Hi,
>
> On 8/31/23 13:18, Danilo Krummrich wrote:
>> On Thu, Aug 31, 2023 at 11:04:06AM +0200, Thomas Hellström (Intel) 
>> wrote:
>>> Hi!
>>>
>>> On 8/30/23 17:00, Danilo Krummrich wrote:
>>>> On Wed, Aug 30, 2023 at 03:42:08PM +0200, Thomas Hellström (Intel) 
>>>> wrote:
>>>>> On 8/30/23 14:49, Danilo Krummrich wrote:
>>>>>> Hi Thomas,
>>>>>>
>>>>>> thanks for having a look!
>>>>>>
>>>>>> On Wed, Aug 30, 2023 at 09:27:45AM +0200, Thomas Hellström 
>>>>>> (Intel) wrote:
>>>>>>> Hi, Danilo.
>>>>>>>
>>>>>>> Some quick comments since I'm doing some Xe work in this area. 
>>>>>>> Will probably
>>>>>>> get back with more.
>>>>>>>
>>>>>>> On 8/20/23 23:53, Danilo Krummrich wrote:
>> <snip>
>>
>>>>>>>> diff --git a/include/drm/drm_gpuva_mgr.h 
>>>>>>>> b/include/drm/drm_gpuva_mgr.h
>>>>>>>> index ed8d50200cc3..693e2da3f425 100644
>>>>>>>> --- a/include/drm/drm_gpuva_mgr.h
>>>>>>>> +++ b/include/drm/drm_gpuva_mgr.h
>>>>>>>> @@ -26,12 +26,16 @@
>>>>>>>>       */
>>>>>>>>      #include <linux/list.h>
>>>>>>>> +#include <linux/dma-resv.h>
>>>>>>>> +#include <linux/maple_tree.h>
>>>>>>>>      #include <linux/rbtree.h>
>>>>>>>>      #include <linux/types.h>
>>>>>>>>      #include <drm/drm_gem.h>
>>>>>>>> +#include <drm/drm_exec.h>
>>>>>>>>      struct drm_gpuva_manager;
>>>>>>>> +struct drm_gpuva_gem;
>>>>>>>>      struct drm_gpuva_fn_ops;
>>>>>>>>      /**
>>>>>>>> @@ -140,7 +144,7 @@ struct drm_gpuva {
>>>>>>>>      int drm_gpuva_insert(struct drm_gpuva_manager *mgr, struct 
>>>>>>>> drm_gpuva *va);
>>>>>>>>      void drm_gpuva_remove(struct drm_gpuva *va);
>>>>>>>> -void drm_gpuva_link(struct drm_gpuva *va);
>>>>>>>> +void drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem 
>>>>>>>> *vm_bo);
>>>>>>>>      void drm_gpuva_unlink(struct drm_gpuva *va);
>>>>>>>>      struct drm_gpuva *drm_gpuva_find(struct drm_gpuva_manager 
>>>>>>>> *mgr,
>>>>>>>> @@ -240,15 +244,137 @@ struct drm_gpuva_manager {
>>>>>>>>           * @ops: &drm_gpuva_fn_ops providing the split/merge 
>>>>>>>> steps to drivers
>>>>>>>>           */
>>>>>>>>          const struct drm_gpuva_fn_ops *ops;
>>>>>>>> +
>>>>>>>> +    /**
>>>>>>>> +     * @d_obj: Dummy GEM object; used internally to pass the 
>>>>>>>> GPU VMs
>>>>>>>> +     * dma-resv to &drm_exec.
>>>>>>>> +     */
>>>>>>>> +    struct drm_gem_object d_obj;
>>>>>>>> +
>>>>>>>> +    /**
>>>>>>>> +     * @resv: the &dma_resv for &drm_gem_objects mapped in 
>>>>>>>> this GPU VA
>>>>>>>> +     * space
>>>>>>>> +     */
>>>>>>>> +    struct dma_resv *resv;
>>>>>>>> +
>>>>>>>> +    /**
>>>>>>>> +     * @exec: the &drm_exec helper to lock external 
>>>>>>>> &drm_gem_objects
>>>>>>>> +     */
>>>>>>>> +    struct drm_exec exec;
>>>>>>>> +
>>>>>>>> +    /**
>>>>>>>> +     * @mt_ext: &maple_tree storing external &drm_gem_objects
>>>>>>>> +     */
>>>>>>>> +    struct maple_tree mt_ext;
>>>>>>> Why are you using a maple tree here? Insertion and removal is 
>>>>>>> O(log(n))
>>>>>>> instead of O(1) for a list?
>>>>>>>
>>>>>> Having a list of drm_gem_objects directly wouldn't work, as 
>>>>>> multiple GPU-VMs
>>>>>> could have mappings of the same extobj.
>>>>>>
>>>>>> I considered using the VM_BO abstraction (struct drm_gpuva_gem) 
>>>>>> as list entry
>>>>>> instead, which also seems to be the obvious choice. However, 
>>>>>> there is a locking
>>>>>> conflict.
>>>>>>
>>>>>> A drm_gem_object keeps a list of drm_gpuva_gems, while each 
>>>>>> drm_gpuva_gem keeps
>>>>>> a list of drm_gpuvas. Both lists are either protected with the 
>>>>>> dma-resv lock of
>>>>>> the corresponding drm_gem_object, or with an external lock 
>>>>>> provided by the
>>>>>> driver (see drm_gem_gpuva_set_lock()). The latter is used by 
>>>>>> drivers performing
>>>>>> changes on the GPUVA space directly from the fence signalling path.
>>>>>>
>>>>>> Now, similar to what drm_gpuva_link() and drm_gpuva_unlink() are 
>>>>>> doing already,
>>>>>> we'd want to add a drm_gpuva_gem to the extobj list for the first 
>>>>>> mapping being
>>>>>> linked and we'd want to remove it for the last one being unlinked.
>>>>>>
>>>>>> (Actually we'd want to add the drm_gpuva_gem object to the extobj 
>>>>>> list even
>>>>>> before, because otherwise we'd not acquire its dma-resv lock of 
>>>>>> this GEM object
>>>>>> through drm_gpuva_manager_lock(). But that's trivial, we could do 
>>>>>> that when we
>>>>>> create the drm_gpuva_gem, which we need to do anyway.)
>>>>>>
>>>>>> Anyway, we'd probably want to keep removing the drm_gpuva_gem 
>>>>>> from the extobj
>>>>>> list from drm_gpuva_unlink() when the last mapping of this BO is 
>>>>>> unlinked. In
>>>>>> order to do so, we'd (as discussed above) either need to hold the 
>>>>>> outer GPU-VM
>>>>>> lock or the GPU-VMs dma-resv lock. Both would be illegal in the case
>>>>>> drm_gpuva_unlink() is called from within the fence signalling 
>>>>>> path. For drivers
>>>>>> like XE or Nouveau, we'd at least need to make sure to not mess 
>>>>>> up the locking
>>>>>> hierarchy of GPU-VM lock and dma-resv lock of the corresponding BO.
>>>>>>
>>>>>> Considering all that, I thought it's probably better to track 
>>>>>> extobjs separate
>>>>>> from the drm_gpuva_gem, hence the maple tree choice.
>>>>> Hm. OK, in Xe we're having a list of the xe_vmas (drm_gpuvas) that 
>>>>> point to
>>>>> external objects, or in the case of multiple mappings to the same gem
>>>>> object, only one of the drm_gpuvas is in the list. These are 
>>>>> protected by
>>>>> the GPU-VM lock. I don't see a problem with removing those from 
>>>>> the fence
>>>>> signalling path, though?
>>>> I intentionally tried to avoid keeping a list of drm_gpuvas to 
>>>> track extobjs,
>>>> since this is generic code I don't know how many mappings of an 
>>>> external object
>>>> the corresponding driver potentially creates. This could become a 
>>>> pretty large
>>>> list to iterate. Another reason was that I want to keep the 
>>>> drm_gpuva structure
>>>> as small as possible, hence avoiding another list_head.
>>> Yes, the list might be pretty large, but OTOH you never iterate to 
>>> access a
>>> single list element. When you need to iterate the whole list you 
>>> need to do
>>> that regardless of the data structure used. As for the list head, it 
>>> might
>>> perhaps be aliased (union) with an upcoming userptr list head?
>>>
>> Oh, I did not mean that I'm concerned about the size of a list of 
>> extobjs in
>> general, that would indeed be the same for every data structure 
>> chosen. But I
>> would be concerned about keeping a list of *all* mappings being 
>> backed by an
>> extobj.
>>
>>>> Now, it sounds like in XE you're doing some kind of optimization 
>>>> just keeping a
>>>> single mapping of an extobj in the list? How do you know when to 
>>>> remove it? What
>>>> if the mapping from the extobj list gets unmapped, but there is 
>>>> still another
>>>> one left in the GPU-VM being backed by the same BO?
>>> When removing from the lists, we iterate through the object's list 
>>> of vmas,
>>> and if there is one matching the same vm, we replace the old one 
>>> with the
>>> new one. A similar iteration is done when adding to avoid adding one 
>>> that is
>>> already on the list.
>> I see, but wouldn't this be O(n) on insertion and O(m) on removal of 
>> an extobj,
>> while using the maple tree is O(log(n))?
>
> No, insertion and removal are O(m) where m is the number of vms the 
> object is currently bound to. Typically a very small number.
>
>>
>>>> Although, assuming that's a no-go for GPUVA, wouldn't an XArray be a 
>>>> better
>>>> choice, keeping O(1)?
>>>> When tracking extobjs, the address of the drm_gem_object is the key 
>>>> while the
>>>> reference count is the value. I was thinking of an XArray as well, 
>>>> but I was
>>>> worried that the corresponding indices could be too widely 
>>>> distributed for an
>>>> XArray to still be efficient. Now that I think about it, it's 
>>>> probably not that
>>>> bad.
>>>>
>>>> Btw., while I agree with trying to make things as efficient as possible, 
>>>> what is the
>>>> magnitude for extobjs to be tracked, do we need to worry about the 
>>>> O(log(n))?
>>> Not sure yet, TBH, but I think one of our UMDs can only use external 
>>> objects,
>>> because they don't know at creation time which ones need exporting. 
>>> However
>>> if this turns out to be too bad, there are various flavours of 
>>> "clever but
>>> complicated" optimizations that we could think of to reduce the list 
>>> size.
>>> Still in our case, we opted for the vma list head for now.
>> Considering the above, I would guess that if your current approach is 
>> good
>> enough, a maple tree will work as well.
>
> Hmm, yeah, it's probably a bikeshed since each drm_exec builds a 
> realloced array of all external objects on each exec.
>
>>
>> Otherwise, if you want, I could do some experiments with Xarray and 
>> see how
>> that works out compared to using a maple tree.
>>
>> Btw. another nice thing about using Xarray or maple tree for that is 
>> that
>> drivers updating the VA space from the fence signalling path don't 
>> need to
>> hold a GPU-VM lock to update the extobj list. Actually, they might 
>> not need
>> a GPU-VM lock at all.
>
> I still don't follow why drivers would want to do that. Isn't the VA 
> space / fence object list always updated synchronously from the IOCTL?

Meaning the external object list, of course. :)

/Thomas


>
> /Thomas
>
>
>>
>>> /Thomas
>>>
>>>
>>>>>>>> +
>>>>>>>> +    /**
>>>>>>>> +     * @evict: structure holding the evict list and evict list 
>>>>>>>> lock
>>>>>>>> +     */
>>>>>>>> +    struct {
>>>>>>>> +        /**
>>>>>>>> +         * @list: &list_head storing &drm_gem_objects 
>>>>>>>> currently being
>>>>>>>> +         * evicted
>>>>>>>> +         */
>>>>>>>> +        struct list_head list;
>>>>>>>> +
>>>>>>>> +        /**
>>>>>>>> +         * @lock: spinlock to protect the evict list against 
>>>>>>>> concurrent
>>>>>>>> +         * insertion / removal of different &drm_gpuva_gems
>>>>>>>> +         */
>>>>>>>> +        spinlock_t lock;
>>>>>>>> +    } evict;
>>>>>>>>      };
>>>>>>>>      void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
>>>>>>>> +                struct drm_device *drm,
>>>>>>>>                      const char *name,
>>>>>>>>                      u64 start_offset, u64 range,
>>>>>>>>                      u64 reserve_offset, u64 reserve_range,
>>>>>>>>                      const struct drm_gpuva_fn_ops *ops);
>>>>>>>>      void drm_gpuva_manager_destroy(struct drm_gpuva_manager 
>>>>>>>> *mgr);
>>>>>>>> +/**
>>>>>>>> + * DRM_GPUVA_EXEC - returns the &drm_gpuva_managers &drm_exec 
>>>>>>>> instance
>>>>>>>> + * @mgr: the &drm_gpuva_managers to return the &drm_exec 
>>>>>>>> instance for
>>>>>>>> + */
>>>>>>>> +#define DRM_GPUVA_EXEC(mgr)    &(mgr)->exec
>>>>>>> A struct ww_acquire_ctx and thus a drm_exec is fundamentally per 
>>>>>>> task and
>>>>>>> should typically be allocated on the stack. Otherwise you'd need 
>>>>>>> to protect
>>>>>>> the mgr->exec member with an exclusive lock throughout the 
>>>>>>> locking process,
>>>>>>> and that's not what we want.
>>>>>> Oh, good point. I think it works in Nouveau, because there it's 
>>>>>> implicitly
>>>>>> protected with the job submission lock.
>>>>>>
>>>>>>> Did you consider subclassing a drm_exec for drm_gpuva purposes 
>>>>>>> and add
>>>>>>> needed ops to it: Like so:
>>>>>> That's a good idea, will take this into V2.
>>>>> Actually, I'm not fully sure that was a good idea: I now have a 
>>>>> working
>>>>> version of Xe ported over to drm_exec, having these helpers in 
>>>>> mind and with
>>>>> the intention to start using them as they mature. What I found, 
>>>>> though is
>>>>> that open-coding the drm_exec loop is not all that bad, but that 
>>>>> building
>>>>> blocks that can be called from within the loop are useful:
>>>>>
>>>>> Like the drm_gpuva_prepare_objects() and an imaginary
>>>>> drm_gpuva_prepare_gpuva() that locks the vm resv and the resv of 
>>>>> the object
>>>>> (if different and the gpuva points to the object). And
>>>>> drm_gpuva_prepare_array() although we don't use it within Xe. That 
>>>>> means you
>>>>> can use these building blocks like helpers and avoid the fn() 
>>>>> callback by
>>>>> instead open-coding.
>>>>>
>>>>> But I guess YMMV.
>>>> That's exactly why those building blocks are exported, I already 
>>>> had in mind
>>>> that there might be drivers which still want to open-code the 
>>>> drm_exec loop,
>>>> while others might just want a simple interface to lock everything.
>>>>
>>>> I still think it is a good idea, but I'd keep that as simple as 
>>>> possible. And
>>>> for everything else just let the driver open-code it and use the 
>>>> "building
>>>> blocks" - will also expand the bulding blocks to what you mentioned 
>>>> above.
>>>>
>>>>>>> struct drm_gpuva_exec_ops {
>>>>>>>        int (*fn) (struct drm_gpuva_exec *exec, int num_fences);
>>>>>> Is this the fn argument from drm_gpuva_manager_lock_extra()?
>>>>>>
>>>>>>>        int (*bo_validate) (struct drm_gpuva_exec *exec, struct 
>>>>>>> drm_gem_object
>>>>>>> *obj);
>>>>>> I guess we could also keep that within the drm_gpuva_fn_ops? This 
>>>>>> should always
>>>>>> be the same callback, right?
>>>>>>
>>>>>>> };
>>>>>>>
>>>>>>> struct drm_gpuva_exec {
>>>>>>>        const struct drm_gpuva_exec_ops *ops;
>>>>>>>        struct drm_exec exec;
>>>>>>>        struct drm_gpuva_manager *mgr;
>>>>>>> };
>>>>>>>
>>>>>>> Although I'd actually expect bo_validate to be part of fn in the 
>>>>>>> typical
>>>>>>> case. The drm_gpuva_exec would then be allocated by the caller 
>>>>>>> on the stack.
>>>>>> This doesn't sound like my assumption about fn() above is correct.
>>>>> Well one important thing in our conversion is that ttm_bo_validate()
>>>>> needs
>>>>> to be in the until_all_locked() loop. We want to soon be able to use
>>>>> sleeping locks for eviction, so a xe_bo_validate() would, at least
>>>>> temporarily, add locked objects to the drm_exec list of locked 
>>>>> objects. That
>>>>> means everything that may end up calling validate deep within the 
>>>>> call chain
>>>>> needs to be part of the until_all_locked() loop, so our
>>>>> drm_gpuva_manager_lock_extra() fn callback would include those 
>>>>> validates and
>>>>> look different all the time. Hence that's why open-coding isn't 
>>>>> all that
>>>>> bad...
>>>> Oh, I see. You indeed want to call validate() from within 
>>>> until_all_locked().
>>>>
>>>>> /Thomas
>>>>>
>>>>>
>> <snip>
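
[Editor's note: a minimal sketch of the extobj tracking scheme discussed in this
message: the maple tree keyed by the drm_gem_object's address, storing a
reference count as the value, so an external object is only dropped from the
GPU-VM once its last mapping goes away. Only struct drm_gpuva_manager and its
mt_ext member come from the RFC; the helper names are made up, and the
load/store pair is assumed to be serialized by an outer lock, e.g. around
drm_gpuva_link()/drm_gpuva_unlink().]

/* Needs <linux/maple_tree.h> and <linux/xarray.h> for xa_mk_value(). */

static int drm_gpuva_extobj_get(struct drm_gpuva_manager *mgr,
				struct drm_gem_object *obj)
{
	unsigned long index = (unsigned long)obj;
	void *entry = mtree_load(&mgr->mt_ext, index);
	unsigned long refcount = entry ? xa_to_value(entry) : 0;

	/* Store the bumped reference count as a tagged value (no allocation). */
	return mtree_store(&mgr->mt_ext, index, xa_mk_value(refcount + 1),
			   GFP_KERNEL);
}

static void drm_gpuva_extobj_put(struct drm_gpuva_manager *mgr,
				 struct drm_gem_object *obj)
{
	unsigned long index = (unsigned long)obj;
	void *entry = mtree_load(&mgr->mt_ext, index);
	unsigned long refcount = entry ? xa_to_value(entry) : 0;

	if (refcount <= 1)
		mtree_erase(&mgr->mt_ext, index);	/* last mapping is gone */
	else
		mtree_store(&mgr->mt_ext, index, xa_mk_value(refcount - 1),
			    GFP_KERNEL);
}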

^ permalink raw reply	[flat|nested] 88+ messages in thread

* Re: [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
  2023-08-31 16:53                 ` [Nouveau] " Thomas Hellström (Intel)
  (?)
@ 2023-08-31 19:07                   ` Danilo Krummrich
  -1 siblings, 0 replies; 88+ messages in thread
From: Danilo Krummrich @ 2023-08-31 19:07 UTC (permalink / raw)
  To: Thomas Hellström (Intel)
  Cc: airlied, daniel, matthew.brost, thomas.hellstrom, sarah.walker,
	donald.robson, boris.brezillon, christian.koenig, faith.ekstrand,
	bskeggs, Liam.Howlett, nouveau, linux-kernel, dri-devel

On Thu, Aug 31, 2023 at 06:53:01PM +0200, Thomas Hellström (Intel) wrote:
> Hi,
> 
> On 8/31/23 13:18, Danilo Krummrich wrote:
> > On Thu, Aug 31, 2023 at 11:04:06AM +0200, Thomas Hellström (Intel) wrote:
> > > Hi!
> > > 
> > > On 8/30/23 17:00, Danilo Krummrich wrote:
> > > > On Wed, Aug 30, 2023 at 03:42:08PM +0200, Thomas Hellström (Intel) wrote:
> > > > > On 8/30/23 14:49, Danilo Krummrich wrote:
> > > > > > Hi Thomas,
> > > > > > 
> > > > > > thanks for having a look!
> > > > > > 
> > > > > > On Wed, Aug 30, 2023 at 09:27:45AM +0200, Thomas Hellström (Intel) wrote:
> > > > > > > Hi, Danilo.
> > > > > > > 
> > > > > > > Some quick comments since I'm doing some Xe work in this area. Will probably
> > > > > > > get back with more.
> > > > > > > 
> > > > > > > On 8/20/23 23:53, Danilo Krummrich wrote:
> > <snip>
> > 
> > > > > > > > diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h
> > > > > > > > index ed8d50200cc3..693e2da3f425 100644
> > > > > > > > --- a/include/drm/drm_gpuva_mgr.h
> > > > > > > > +++ b/include/drm/drm_gpuva_mgr.h
> > > > > > > > @@ -26,12 +26,16 @@
> > > > > > > >       */
> > > > > > > >      #include <linux/list.h>
> > > > > > > > +#include <linux/dma-resv.h>
> > > > > > > > +#include <linux/maple_tree.h>
> > > > > > > >      #include <linux/rbtree.h>
> > > > > > > >      #include <linux/types.h>
> > > > > > > >      #include <drm/drm_gem.h>
> > > > > > > > +#include <drm/drm_exec.h>
> > > > > > > >      struct drm_gpuva_manager;
> > > > > > > > +struct drm_gpuva_gem;
> > > > > > > >      struct drm_gpuva_fn_ops;
> > > > > > > >      /**
> > > > > > > > @@ -140,7 +144,7 @@ struct drm_gpuva {
> > > > > > > >      int drm_gpuva_insert(struct drm_gpuva_manager *mgr, struct drm_gpuva *va);
> > > > > > > >      void drm_gpuva_remove(struct drm_gpuva *va);
> > > > > > > > -void drm_gpuva_link(struct drm_gpuva *va);
> > > > > > > > +void drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo);
> > > > > > > >      void drm_gpuva_unlink(struct drm_gpuva *va);
> > > > > > > >      struct drm_gpuva *drm_gpuva_find(struct drm_gpuva_manager *mgr,
> > > > > > > > @@ -240,15 +244,137 @@ struct drm_gpuva_manager {
> > > > > > > >      	 * @ops: &drm_gpuva_fn_ops providing the split/merge steps to drivers
> > > > > > > >      	 */
> > > > > > > >      	const struct drm_gpuva_fn_ops *ops;
> > > > > > > > +
> > > > > > > > +	/**
> > > > > > > > +	 * @d_obj: Dummy GEM object; used internally to pass the GPU VMs
> > > > > > > > +	 * dma-resv to &drm_exec.
> > > > > > > > +	 */
> > > > > > > > +	struct drm_gem_object d_obj;
> > > > > > > > +
> > > > > > > > +	/**
> > > > > > > > +	 * @resv: the &dma_resv for &drm_gem_objects mapped in this GPU VA
> > > > > > > > +	 * space
> > > > > > > > +	 */
> > > > > > > > +	struct dma_resv *resv;
> > > > > > > > +
> > > > > > > > +	/**
> > > > > > > > +	 * @exec: the &drm_exec helper to lock external &drm_gem_objects
> > > > > > > > +	 */
> > > > > > > > +	struct drm_exec exec;
> > > > > > > > +
> > > > > > > > +	/**
> > > > > > > > +	 * @mt_ext: &maple_tree storing external &drm_gem_objects
> > > > > > > > +	 */
> > > > > > > > +	struct maple_tree mt_ext;
> > > > > > > Why are you using a maple tree here? Insertion and removal is O(log(n))
> > > > > > > instead of O(1) for a list?
> > > > > > > 
> > > > > > Having a list of drm_gem_objects directly wouldn't work, as multiple GPU-VMs
> > > > > > could have mappings of the same extobj.
> > > > > > 
> > > > > > I considered using the VM_BO abstraction (struct drm_gpuva_gem) as list entry
> > > > > > instead, which also seems to be the obvious choice. However, there is a locking
> > > > > > conflict.
> > > > > > 
> > > > > > A drm_gem_object keeps a list of drm_gpuva_gems, while each drm_gpuva_gem keeps
> > > > > > a list of drm_gpuvas. Both lists are either protected with the dma-resv lock of
> > > > > > the corresponding drm_gem_object, or with an external lock provided by the
> > > > > > driver (see drm_gem_gpuva_set_lock()). The latter is used by drivers performing
> > > > > > changes on the GPUVA space directly from the fence signalling path.
> > > > > > 
> > > > > > Now, similar to what drm_gpuva_link() and drm_gpuva_unlink() are doing already,
> > > > > > we'd want to add a drm_gpuva_gem to the extobj list for the first mapping being
> > > > > > linked and we'd want to remove it for the last one being unlinked.
> > > > > > 
> > > > > > (Actually we'd want to add the drm_gpuva_gem object to the extobj list even
> > > > > > before, because otherwise we'd not acquire its dma-resv lock of this GEM object
> > > > > > through drm_gpuva_manager_lock(). But that's trivial, we could do that when we
> > > > > > create the drm_gpuva_gem, which we need to do anyway.)
> > > > > > 
> > > > > > Anyway, we'd probably want to keep removing the drm_gpuva_gem from the extobj
> > > > > > list from drm_gpuva_unlink() when the last mapping of this BO is unlinked. In
> > > > > > order to do so, we'd (as discussed above) either need to hold the outer GPU-VM
> > > > > > lock or the GPU-VMs dma-resv lock. Both would be illegal in the case
> > > > > > drm_gpuva_unlink() is called from within the fence signalling path. For drivers
> > > > > > like XE or Nouveau, we'd at least need to make sure to not mess up the locking
> > > > > > hierarchy of GPU-VM lock and dma-resv lock of the corresponding BO.
> > > > > > 
> > > > > > Considering all that, I thought it's probably better to track extobjs separate
> > > > > > from the drm_gpuva_gem, hence the maple tree choice.
> > > > > Hm. OK, in Xe we're having a list of the xe_vmas (drm_gpuvas) that point to
> > > > > external objects, or in the case of multiple mappings to the same gem
> > > > > object, only one of the drm_gpuvas is in the list. These are protected by
> > > > > the GPU-VM lock. I don't see a problem with removing those from the fence
> > > > > signalling path, though?
> > > > I intentionally tried to avoid keeping a list of drm_gpuvas to track extobjs,
> > > > since this is generic code I don't know how many mappings of an external object
> > > > the corresponding driver potentially creates. This could become a pretty large
> > > > list to iterate. Another reason was that I want to keep the drm_gpuva structure
> > > > as small as possible, hence avoiding another list_head.
> > > Yes, the list might be pretty large, but OTOH you never iterate to access a
> > > single list element. When you need to iterate the whole list you need to do
> > > that regardless of the data structure used. As for the list head, it might
> > > perhaps be aliased (union) with an upcoming userptr list head?
> > > 
> > Oh, I did not mean that I'm concerned about the size of a list of extobjs in
> > general, that would indeed be the same for every data structure chosen. But I
> > would be concerned about keeping a list of *all* mappings being backed by an
> > extobj.
> > 
> > > > Now, it sounds like in XE you're doing some kind of optimization just keeping a
> > > > single mapping of an extobj in the list? How do you know when to remove it? What
> > > > if the mapping from the extobj list gets unmapped, but there is still another
> > > > one left in the GPU-VM being backed by the same BO?
> > > When removing from the lists, we iterate through the object's list of vmas,
> > > and if there is one matching the same vm, we replace the old one with the
> > > new one. A similar iteration is done when adding to avoid adding one that is
> > > already on the list.
> > I see, but wouldn't this be O(n) on insertion and O(m) on removal of an extobj,
> > while using the maple tree is O(log(n))?
> 
> No, insertion and removal are O(m) where m is the number of vms the object is
> currently bound to. Typically a very small number.

Ok, my guess was that on insertion you'd actually walk the extobj list and see
if there's a vma backed by the same BO already, while on removal you said you're
walking the BO's vma list. So I guess on insertion you're also walking the BO's
vma list and checking whether there's already a mapping for this VM?

In your case that might make sense if you expect the extobj list to typically be
larger than the BO's vma list. In general I don't think this is true.

> 
> > 
> > > > Although, assuming that's a no-go for GPUVA, wouldn't an XArray be a better
> > > > choice, keeping O(1)?
> > > > When tracking extobjs, the address of the drm_gem_object is the key while the
> > > > reference count is the value. I was thinking of an XArray as well, but I was
> > > > worried that the corresponding indices could be too widely distributed for an
> > > > XArray to still be efficient. Now that I think about it, it's probably not that
> > > > bad.
> > > > 
> > > > Btw., while I agree with trying to make things as efficient as possible, what is the
> > > > magnitude for extobjs to be tracked, do we need to worry about the O(log(n))?
> > > Not sure yet, TBH, but I think one of our UMDs can only use external objects,
> > > because they don't know at creation time which ones need exporting. However
> > > if this turns out to be too bad, there are various flavours of "clever but
> > > complicated" optimizations that we could think of to reduce the list size.
> > > Still in our case, we opted for the vma list head for now.
> > Considering the above, I would guess that if your current approach is good
> > enough, a maple tree will work as well.
> 
> Hmm, yeah, it's probably a bikeshed since each drm_exec builds a realloced
> array of all external objects on each exec.

I did a quick, rough benchmark, which is probably good enough. In a maple tree
with 0xFFFF - 1 existing entries, insertion of a random (non-existent) entry
took ~530ns on average over 1k iterations.

The average insertion time for each entry to build up a tree with 0xFFFF - 1
entries in the first place was ~1.3us. That's expected, since it should hit
memory allocations more often than in the previous case. The peak was ~10us.
Inserting already existing entries took ~300ns.

That's probably good enough.
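
[Editor's note: a rough sketch of the kind of micro-benchmark described above,
timing a single insertion of a random, not-yet-present index into a
pre-populated maple tree. Purely illustrative; the numbers quoted above come
from the author's own measurement, not from this snippet.]

/* Needs <linux/maple_tree.h>, <linux/xarray.h>, <linux/ktime.h>, <linux/random.h>. */

static DEFINE_MTREE(bench_mt);

/* Pre-populate the tree with 0xFFFF - 1 entries at indices 1..0xFFFE. */
static int bench_mtree_populate(void)
{
	unsigned long i;
	int ret;

	for (i = 1; i < 0xFFFF; i++) {
		ret = mtree_insert(&bench_mt, i, xa_mk_value(i), GFP_KERNEL);
		if (ret)
			return ret;
	}

	return 0;
}

/* Time one insertion of a random index outside the pre-filled range. */
static u64 bench_mtree_insert_ns(void)
{
	unsigned long index = 0x10000UL + get_random_u32_below(0x10000);
	ktime_t start, end;
	int ret;

	start = ktime_get();
	ret = mtree_insert(&bench_mt, index, xa_mk_value(index), GFP_KERNEL);
	end = ktime_get();

	return ret ? 0 : ktime_to_ns(ktime_sub(end, start));
}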

> 
> > 
> > Otherwise, if you want, I could do some experiments with Xarray and see how
> > that works out compared to using a maple tree.
> > 
> > Btw. another nice thing about using Xarray or maple tree for that is that
> > drivers updating the VA space from the fence signalling path don't need to
> > hold a GPU-VM lock to update the extobj list. Actually, they might not need
> > a GPU-VM lock at all.
> 
> I still don't follow why drivers would want to do that. Isn't the VA space /
> fence object list always updated synchronously from the IOCTL?

For the extobj list I don't see any advantage in not doing that in the IOCTL right
away. For the VA space there are a few advantages to doing it in the fence
signalling path.

(1) No need to allocate drm_gpuva_ops at all. For a given map / unmap request
    the driver can receive the callbacks for map / remap / unmap directly (a
    rough sketch of this follows below).
(2) No need to unwind VA space updates on failure, and no need for any other
    unwind tricks.
(3) Synchronous bind jobs can be injected at any point in time and don't need to
    be queued up in the scheduler to preserve ordering.
(4) Potentially less error-prone resource management, although I admit this is
    partly just a consequence of (1) and (2).

Actually, once I get the page table management prepared for that, I'd like to
move Nouveau over to this approach.
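
[Editor's note: a hedged sketch of point (1) above, assuming the GPUVA
manager's existing drm_gpuva_sm_map() interface and its sm_step_* callbacks in
struct drm_gpuva_fn_ops; the driver_* names and the bind job structure are
made up for illustration. Instead of pre-building a drm_gpuva_ops list in the
IOCTL, the driver receives the map / remap / unmap steps as callbacks directly
from the scheduler's run_job(), i.e. the fence signalling path.]

struct driver_bind_job {
	struct drm_gpuva_manager *mgr;
	struct drm_gem_object *obj;
	u64 addr, range, bo_offset;
};

static int driver_sm_step_map(struct drm_gpuva_op *op, void *priv)
{
	/*
	 * Program the page tables for op->map.va.addr / op->map.va.range and
	 * insert the new drm_gpuva; no drm_gpuva_ops list was allocated.
	 */
	return 0;
}

static int driver_sm_step_remap(struct drm_gpuva_op *op, void *priv)
{
	/* Split the pre-existing mapping described by op->remap. */
	return 0;
}

static int driver_sm_step_unmap(struct drm_gpuva_op *op, void *priv)
{
	/* Tear down the mapping described by op->unmap. */
	return 0;
}

/* Passed to drm_gpuva_manager_init() as the manager's fn_ops. */
static const struct drm_gpuva_fn_ops driver_gpuva_ops = {
	.sm_step_map	= driver_sm_step_map,
	.sm_step_remap	= driver_sm_step_remap,
	.sm_step_unmap	= driver_sm_step_unmap,
};

/* Called from the scheduler's run_job(), i.e. the fence signalling path. */
static int driver_bind_run_job(struct driver_bind_job *job)
{
	return drm_gpuva_sm_map(job->mgr, job, job->addr, job->range,
				job->obj, job->bo_offset);
}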

> 
> /Thomas
> 
> 
> > 
> > > /Thomas
> > > 
> > > 
> > > > > > > > +
> > > > > > > > +	/**
> > > > > > > > +	 * @evict: structure holding the evict list and evict list lock
> > > > > > > > +	 */
> > > > > > > > +	struct {
> > > > > > > > +		/**
> > > > > > > > +		 * @list: &list_head storing &drm_gem_objects currently being
> > > > > > > > +		 * evicted
> > > > > > > > +		 */
> > > > > > > > +		struct list_head list;
> > > > > > > > +
> > > > > > > > +		/**
> > > > > > > > +		 * @lock: spinlock to protect the evict list against concurrent
> > > > > > > > +		 * insertion / removal of different &drm_gpuva_gems
> > > > > > > > +		 */
> > > > > > > > +		spinlock_t lock;
> > > > > > > > +	} evict;
> > > > > > > >      };
> > > > > > > >      void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> > > > > > > > +			    struct drm_device *drm,
> > > > > > > >      			    const char *name,
> > > > > > > >      			    u64 start_offset, u64 range,
> > > > > > > >      			    u64 reserve_offset, u64 reserve_range,
> > > > > > > >      			    const struct drm_gpuva_fn_ops *ops);
> > > > > > > >      void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr);
> > > > > > > > +/**
> > > > > > > > + * DRM_GPUVA_EXEC - returns the &drm_gpuva_managers &drm_exec instance
> > > > > > > > + * @mgr: the &drm_gpuva_managers to return the &drm_exec instance for
> > > > > > > > + */
> > > > > > > > +#define DRM_GPUVA_EXEC(mgr)	&(mgr)->exec
> > > > > > > A struct ww_acquire_ctx and thus a drm_exec is fundamentally per task and
> > > > > > > should typically be allocated on the stack. Otherwise you'd need to protect
> > > > > > > the mgr->exec member with an exclusive lock throughout the locking process,
> > > > > > > and that's not what we want.
> > > > > > Oh, good point. I think it works in Nouveau, because there it's implicitly
> > > > > > protected with the job submission lock.
> > > > > > 
> > > > > > > Did you consider subclassing a drm_exec for drm_gpuva purposes and add
> > > > > > > needed ops to it: Like so:
> > > > > > That's a good idea, will take this into V2.
> > > > > Actually, I'm not fully sure that was a good idea: I now have a working
> > > > > version of Xe ported over to drm_exec, having these helpers in mind and with
> > > > > the intention to start using them as they mature. What I found, though is
> > > > > that open-coding the drm_exec loop is not all that bad, but that building
> > > > > blocks that can be called from within the loop are useful:
> > > > > 
> > > > > Like the drm_gpuva_prepare_objects() and an imaginary
> > > > > drm_gpuva_prepare_gpuva() that locks the vm resv and the resv of the object
> > > > > (if different and the gpuva points to the object). And
> > > > > drm_gpuva_prepare_array() although we don't use it within Xe. That means you
> > > > > can use these building blocks like helpers and avoid the fn() callback by
> > > > > instead open-coding.
> > > > > 
> > > > > But I guess YMMV.
> > > > That's exactly why those building blocks are exported, I already had in mind
> > > > that there might be drivers which still want to open-code the drm_exec loop,
> > > > while others might just want a simple interface to lock everything.
> > > > 
> > > > I still think it is a good idea, but I'd keep that as simple as possible. And
> > > > for everything else just let the driver open-code it and use the "building
> > > > blocks" - will also expand the bulding blocks to what you mentioned above.
> > > > 
> > > > > > > struct drm_gpuva_exec_ops {
> > > > > > >        int (*fn) (struct drm_gpuva_exec *exec, int num_fences);
> > > > > > Is this the fn argument from drm_gpuva_manager_lock_extra()?
> > > > > > 
> > > > > > >        int (*bo_validate) (struct drm_gpuva_exec *exec, struct drm_gem_object
> > > > > > > *obj);
> > > > > > I guess we could also keep that within the drm_gpuva_fn_ops? This should always
> > > > > > be the same callback, right?
> > > > > > 
> > > > > > > };
> > > > > > > 
> > > > > > > struct drm_gpuva_exec {
> > > > > > >        const struct drm_gpuva_exec_ops *ops;
> > > > > > >        struct drm_exec exec;
> > > > > > >        struct drm_gpuva_manager *mgr;
> > > > > > > };
> > > > > > > 
> > > > > > > Although I'd actually expect bo_validate to be part of fn in the typical
> > > > > > > case. The drm_gpuva_exec would then be allocated by the caller on the stack.
> > > > > > This doesn't sound like my assumption about fn() above is correct.
> > > > > Well one important thing in our conversion is that ttm_bo_validate () needs
> > > > > to be in the until_all_locked() loop. We want to be able soon to use
> > > > > sleeping locks for eviction, so a xe_bo_validate() would, at least
> > > > > temporarily, add locked objects to the drm_exec list of locked objects. That
> > > > > means everything that may end up calling validate deep within the call chain
> > > > > needs to be part of the until_all_locked() loop, so our
> > > > > drm_gpuva_manager_lock_extra() fn callback would include those validates and
> > > > > look different all the time. Hence that's why open-coding isn't all that
> > > > > bad...
> > > > Oh, I see. You indeed want to call validate() from within until_all_locked().
> > > > 
> > > > > /Thomas
> > > > > 
> > > > > 
> > <snip>
> 
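
To illustrate the validate-inside-until_all_locked() point above: an open-coded
locking loop could look roughly like the sketch below. drm_gpuva_prepare_objects()
is the building block proposed in this series (its exact signature is assumed
here), and driver_validate_evicted() is a made-up placeholder for the driver's
(e.g. TTM based) validation of evicted BOs.

#include <drm/drm_exec.h>
#include <drm/drm_gpuva_mgr.h>

/* Driver specific validation of evicted BOs, implemented elsewhere; may
 * itself lock (and add) objects and hence may hit contention. */
int driver_validate_evicted(struct drm_gpuva_manager *mgr,
                            struct drm_exec *exec);

static int driver_vm_lock_and_validate(struct drm_gpuva_manager *mgr)
{
        struct drm_exec exec;
        int ret;

        drm_exec_init(&exec, DRM_EXEC_INTERRUPTIBLE_WAIT);
        drm_exec_until_all_locked(&exec) {
                /* Lock the VM's dma-resv and the resvs of all external
                 * objects the VM has mappings of. */
                ret = drm_gpuva_prepare_objects(mgr, &exec, 1);
                drm_exec_retry_on_contention(&exec);
                if (ret)
                        goto err;

                /* Validation must run inside until_all_locked() as well,
                 * since it may temporarily add locked objects. */
                ret = driver_validate_evicted(mgr, &exec);
                drm_exec_retry_on_contention(&exec);
                if (ret)
                        goto err;
        }

        /* Everything is locked and valid; job submission would go here. */
        drm_exec_fini(&exec);
        return 0;

err:
        drm_exec_fini(&exec);
        return ret;
}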


^ permalink raw reply	[flat|nested] 88+ messages in thread

* Re: [Nouveau] [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
@ 2023-08-31 19:07                   ` Danilo Krummrich
  0 siblings, 0 replies; 88+ messages in thread
From: Danilo Krummrich @ 2023-08-31 19:07 UTC (permalink / raw)
  To: Thomas Hellström (Intel)
  Cc: matthew.brost, thomas.hellstrom, sarah.walker, nouveau,
	dri-devel, linux-kernel, Liam.Howlett, boris.brezillon,
	donald.robson, daniel, christian.koenig, faith.ekstrand, bskeggs

On Thu, Aug 31, 2023 at 06:53:01PM +0200, Thomas Hellström (Intel) wrote:
> Hi,
> 
> On 8/31/23 13:18, Danilo Krummrich wrote:
> > On Thu, Aug 31, 2023 at 11:04:06AM +0200, Thomas Hellström (Intel) wrote:
> > > Hi!
> > > 
> > > On 8/30/23 17:00, Danilo Krummrich wrote:
> > > > On Wed, Aug 30, 2023 at 03:42:08PM +0200, Thomas Hellström (Intel) wrote:
> > > > > On 8/30/23 14:49, Danilo Krummrich wrote:
> > > > > > Hi Thomas,
> > > > > > 
> > > > > > thanks for having a look!
> > > > > > 
> > > > > > On Wed, Aug 30, 2023 at 09:27:45AM +0200, Thomas Hellström (Intel) wrote:
> > > > > > > Hi, Danilo.
> > > > > > > 
> > > > > > > Some quick comments since I'm doing some Xe work in this area. Will probably
> > > > > > > get back with more.
> > > > > > > 
> > > > > > > On 8/20/23 23:53, Danilo Krummrich wrote:
> > <snip>
> > 
> > > > > > > > diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h
> > > > > > > > index ed8d50200cc3..693e2da3f425 100644
> > > > > > > > --- a/include/drm/drm_gpuva_mgr.h
> > > > > > > > +++ b/include/drm/drm_gpuva_mgr.h
> > > > > > > > @@ -26,12 +26,16 @@
> > > > > > > >       */
> > > > > > > >      #include <linux/list.h>
> > > > > > > > +#include <linux/dma-resv.h>
> > > > > > > > +#include <linux/maple_tree.h>
> > > > > > > >      #include <linux/rbtree.h>
> > > > > > > >      #include <linux/types.h>
> > > > > > > >      #include <drm/drm_gem.h>
> > > > > > > > +#include <drm/drm_exec.h>
> > > > > > > >      struct drm_gpuva_manager;
> > > > > > > > +struct drm_gpuva_gem;
> > > > > > > >      struct drm_gpuva_fn_ops;
> > > > > > > >      /**
> > > > > > > > @@ -140,7 +144,7 @@ struct drm_gpuva {
> > > > > > > >      int drm_gpuva_insert(struct drm_gpuva_manager *mgr, struct drm_gpuva *va);
> > > > > > > >      void drm_gpuva_remove(struct drm_gpuva *va);
> > > > > > > > -void drm_gpuva_link(struct drm_gpuva *va);
> > > > > > > > +void drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo);
> > > > > > > >      void drm_gpuva_unlink(struct drm_gpuva *va);
> > > > > > > >      struct drm_gpuva *drm_gpuva_find(struct drm_gpuva_manager *mgr,
> > > > > > > > @@ -240,15 +244,137 @@ struct drm_gpuva_manager {
> > > > > > > >      	 * @ops: &drm_gpuva_fn_ops providing the split/merge steps to drivers
> > > > > > > >      	 */
> > > > > > > >      	const struct drm_gpuva_fn_ops *ops;
> > > > > > > > +
> > > > > > > > +	/**
> > > > > > > > +	 * @d_obj: Dummy GEM object; used internally to pass the GPU VMs
> > > > > > > > +	 * dma-resv to &drm_exec.
> > > > > > > > +	 */
> > > > > > > > +	struct drm_gem_object d_obj;
> > > > > > > > +
> > > > > > > > +	/**
> > > > > > > > +	 * @resv: the &dma_resv for &drm_gem_objects mapped in this GPU VA
> > > > > > > > +	 * space
> > > > > > > > +	 */
> > > > > > > > +	struct dma_resv *resv;
> > > > > > > > +
> > > > > > > > +	/**
> > > > > > > > +	 * @exec: the &drm_exec helper to lock external &drm_gem_objects
> > > > > > > > +	 */
> > > > > > > > +	struct drm_exec exec;
> > > > > > > > +
> > > > > > > > +	/**
> > > > > > > > +	 * @mt_ext: &maple_tree storing external &drm_gem_objects
> > > > > > > > +	 */
> > > > > > > > +	struct maple_tree mt_ext;
> > > > > > > Why are you using a maple tree here? Insertion and removal is O(log(n))
> > > > > > > instead of O(1) for a list?
> > > > > > > 
> > > > > > Having a list of drm_gem_objects directly wouldn't work, as multiple GPU-VMs
> > > > > > could have mappings of the same extobj.
> > > > > > 
> > > > > > I considered using the VM_BO abstraction (struct drm_gpuva_gem) as list entry
> > > > > > instead, which also seems to be the obvious choice. However, there is a locking
> > > > > > conflict.
> > > > > > 
> > > > > > A drm_gem_object keeps a list of drm_gpuva_gems, while each drm_gpuva_gem keeps
> > > > > > a list of drm_gpuvas. Both lists are either protected with the dma-resv lock of
> > > > > > the corresponding drm_gem_object, or with an external lock provided by the
> > > > > > driver (see drm_gem_gpuva_set_lock()). The latter is used by drivers performing
> > > > > > changes on the GPUVA space directly from the fence signalling path.
> > > > > > 
> > > > > > Now, similar to what drm_gpuva_link() and drm_gpuva_unlink() are doing already,
> > > > > > we'd want to add a drm_gpuva_gem to the extobj list for the first mapping being
> > > > > > linked and we'd want to remove it for the last one being unlinked.
> > > > > > 
> > > > > > (Actually we'd want to add the drm_gpuva_gem object to the extobj list even
> > > > > > before, because otherwise we'd not acquire this GEM object's dma-resv lock
> > > > > > through drm_gpuva_manager_lock(). But that's trivial, we could do that when we
> > > > > > create the drm_gpuva_gem, which we need to do anyways.)
> > > > > > 
> > > > > > Anyway, we'd probably want to keep removing the drm_gpuva_gem from the extobj
> > > > > > list from drm_gpuva_unlink() when the last mapping of this BO is unlinked. In
> > > > > > order to do so, we'd (as discussed above) either need to hold the outer GPU-VM
> > > > > > lock or the GPU-VMs dma-resv lock. Both would be illegal in the case
> > > > > > drm_gpuva_unlink() is called from within the fence signalling path. For drivers
> > > > > > like XE or Nouveau, we'd at least need to make sure to not mess up the locking
> > > > > > hierarchy of GPU-VM lock and dma-resv lock of the corresponding BO.
> > > > > > 
> > > > > > Considering all that, I thought it's probably better to track extobjs separate
> > > > > > from the drm_gpuva_gem, hence the maple tree choice.
> > > > > Hm. OK, in Xe we're having a list of the xe_vmas (drm_gpuvas) that point to
> > > > > external objects, or in the case of multiple mappings to the same gem
> > > > > object, only one of the drm_gpuvas is in the list. These are protected by
> > > > > the GPU-VM lock. I don't see a problem with removing those from the fence
> > > > > signalling path, though?
> > > > I intentionally tried to avoid keeping a list of drm_gpuvas to track extobjs,
> > > > since this is generic code I don't know how many mappings of an external object
> > > > the corresponding driver potentially creates. This could become a pretty large
> > > > list to iterate. Another reason was that I want to keep the drm_gpuva structure
> > > > as small as possible, hence avoiding another list_head.
> > > Yes, the list might be pretty large, but OTOH you never iterate to access a
> > > single list element. When you need to iterate the whole list you need to do
> > > that regardless of the data structure used. As for the list head, it might
> > > perhaps be aliased (union) with an upcoming userptr list head?
> > > 
> > Oh, I did not mean that I'm concerned about the size of a list of extobjs in
> > general, that would indeed be the same for every data structure chosen. But I
> > would be concerned about keeping a list of *all* mappings being backed by an
> > extobj.
> > 
> > > > Now, it sounds like in XE you're doing some kind of optimization just keeping a
> > > > single mapping of an extobj in the list? How do you know when to remove it? What
> > > > if the mapping from the extobj list gets unmapped, but there is still another
> > > > one left in the GPU-VM being backed by the same BO?
> > > When removing from the lists, we iterate through the object's list of vmas,
> > > and if there is one matching the same vm, we replace the old one with the
> > > new one. A similar iteration is done when adding to avoid adding one that is
> > > already on the list.
> > I see, but wouldn't this be O(n) on insertion and O(m) on removal of an extobj,
> > while using the maple tree is O(log(n))?
> 
> No, insertion and removal is O(m) where m is the number of vms the object is
> currently bound to. Typically a very small number.

Ok, my guess was that on insertion you'd actually walk the extobj list to see
whether there's already a vma backed by the same BO, while on removal you said
you're walking the BO's vma list. So I guess on insertion you're also walking
the BO's vma list to check whether there's already a mapping for this VM?

In your case that might make sense if you typically expect the extobj list to be
larger than the BO's vma list. In general I don't think this is true, though.
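
Purely to make sure I picture this right (all struct and member names below are
made up), I'd imagine that insertion-time check roughly like this:

#include <linux/list.h>

struct example_vm {
        struct list_head extobj_list;   /* VMAs representing extobjs */
};

struct example_bo {
        struct list_head vma_list;      /* all VMAs mapping this BO */
};

struct example_vma {
        struct example_vm *vm;
        struct example_bo *bo;
        struct list_head bo_link;       /* entry in bo->vma_list */
        struct list_head extobj_link;   /* entry in vm->extobj_list */
};

/* Walk the BO's list of VMAs; only add the new VMA to the VM's extobj
 * list if no other VMA already represents this BO on this VM. */
static void vm_extobj_add(struct example_vm *vm, struct example_vma *new)
{
        struct example_vma *vma;

        list_for_each_entry(vma, &new->bo->vma_list, bo_link) {
                if (vma == new)
                        continue;
                if (vma->vm == vm && !list_empty(&vma->extobj_link))
                        return;
        }

        list_add_tail(&new->extobj_link, &vm->extobj_list);
}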

> 
> > 
> > > > Although assuming that's a no-go for GPUVA wouldn't an XArray be a better
> > > > choice, keeping O(1)?
> > > > When tracking extobjs, the address of the drm_gem_object is the key while the
> > > > reference count is the value. I was thinking of an XArray as well, but I was
> > > > worried that the corresponding indices could be too much distributed for an
> > > > XArray to still be efficient. Now that I think about it, it's probably not that
> > > > bad.
> > > > 
> > > > Btw., while I agree trying to make things as efficient as possible, what is the
> > > > magnitude for extobjs to be tracked, do we need to worry about the O(log(n))?
> > > Not sure yet, TBH, but I think one of our UMDs can only use external objects,
> > > because they don't know at creation time which ones need exporting. However
> > > if this turns out to be too bad, there are various flavours of "clever but
> > > complicated" optimizations that we could think of to reduce the list size.
> > > Still in our case, we opted for the vma list head for now.
> > Considering the above, I would guess that if your current approach is good
> > enough, a maple tree will work as well.
> 
> Hmm, yeah, it's probably a bikeshed since each drm_exec builds a realloced
> array of all external objects on each exec.

I did a quick, sketchy benchmark, which is probably good enough. In a maple tree
with 0xFFFF - 1 existing entries, insertion of a random (non-existent) entry
took ~530ns on average over 1k iterations.

The average insertion time per entry to build up a tree with 0xFFFF - 1 entries
in the first place was ~1.3us. That's expected, since it hits memory allocations
more often than the previous case. The peak was ~10us. Inserting already
existing entries took ~300ns.

That's probably good enough.
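
For reference, that roughly corresponds to the pattern below, keyed by the GEM
object's address. This is a simplified, standalone sketch; it just stores the
object pointer as the value, whereas one would probably want a reference count
there, and in the actual patch the tree of course lives in the
drm_gpuva_manager.

#include <linux/maple_tree.h>
#include <drm/drm_gem.h>

static DEFINE_MTREE(ext_objs);

/* Track an external BO, keyed by its address; returns 0 on success or
 * -EEXIST if the BO is already tracked. */
static int extobj_track(struct drm_gem_object *obj)
{
        return mtree_insert(&ext_objs, (unsigned long)obj, obj, GFP_KERNEL);
}

/* Stop tracking an external BO. */
static void extobj_untrack(struct drm_gem_object *obj)
{
        mtree_erase(&ext_objs, (unsigned long)obj);
}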

> 
> > 
> > Otherwise, if you want, I could do some experiments with Xarray and see how
> > that works out compared to using a maple tree.
> > 
> > Btw. another nice thing about using Xarray or maple tree for that is that
> > drivers updating the VA space from the fence signalling path don't need to
> > hold a GPU-VM lock to update the extobj list. Actually, they might not need
> > a GPU-VM lock at all.
> 
> I still don't follow why drivers would want to do that. Isn't the VA space /
> fence object list always updated sync from the IOCTL?

For the extobj list I don't see any advantage in not doing that in the IOCTL
right away. For the VA space there are a few advantages to doing it in the fence
signalling path.

(1) No need to allocate drm_gpuva_ops at all. For a given map / unmap request
    the driver can receive the callbacks for map / remap / unmap directly (see
    the sketch below).
(2) No need to unwind VA space updates on failure, also no need for any other
    unwind tricks.
(3) Synchronous bind jobs can be injected at any point in time and don't need to
    be queued up in the scheduler to preserve ordering.
(4) Potentially less error-prone resource management, although I admit this is
    partly just a consequence of (1) and (2).

Actually, once I get the page table management prepared for that, I'd like to
move Nouveau over to this approach.
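
To make (1) a bit more concrete, here's a rough sketch based on the existing
drm_gpuva_sm_map() / drm_gpuva_fn_ops interface; the driver_* names and the job
fields are made up and the exact signatures may differ:

#include <drm/drm_gpuva_mgr.h>

struct driver_bind_job {
        struct drm_gpuva_manager *mgr;
        struct drm_gem_object *obj;
        u64 addr, range, offset;
};

/* Driver specific PTE update for a single map step, implemented elsewhere. */
int driver_map_pages(struct driver_bind_job *job, struct drm_gpuva_op_map *op);

static int driver_sm_step_map(struct drm_gpuva_op *op, void *priv)
{
        struct driver_bind_job *job = priv;

        /* Program the page tables for op->map right away; there is no
         * pre-allocated ops list and nothing to unwind on failure. */
        return driver_map_pages(job, &op->map);
}

/* Passed to drm_gpuva_manager_init(). */
static const struct drm_gpuva_fn_ops driver_gpuva_ops = {
        .sm_step_map = driver_sm_step_map,
        /* .sm_step_remap and .sm_step_unmap analogous */
};

/* Called from the bind job's run_job(), i.e. the fence signalling path. */
static int driver_run_bind_job(struct driver_bind_job *job)
{
        return drm_gpuva_sm_map(job->mgr, job, job->addr, job->range,
                                job->obj, job->offset);
}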

> 
> /Thomas
> 
> 
> > 
> > > /Thomas
> > > 
> > > 
> > > > > > > > +
> > > > > > > > +	/**
> > > > > > > > +	 * @evict: structure holding the evict list and evict list lock
> > > > > > > > +	 */
> > > > > > > > +	struct {
> > > > > > > > +		/**
> > > > > > > > +		 * @list: &list_head storing &drm_gem_objects currently being
> > > > > > > > +		 * evicted
> > > > > > > > +		 */
> > > > > > > > +		struct list_head list;
> > > > > > > > +
> > > > > > > > +		/**
> > > > > > > > +		 * @lock: spinlock to protect the evict list against concurrent
> > > > > > > > +		 * insertion / removal of different &drm_gpuva_gems
> > > > > > > > +		 */
> > > > > > > > +		spinlock_t lock;
> > > > > > > > +	} evict;
> > > > > > > >      };
> > > > > > > >      void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> > > > > > > > +			    struct drm_device *drm,
> > > > > > > >      			    const char *name,
> > > > > > > >      			    u64 start_offset, u64 range,
> > > > > > > >      			    u64 reserve_offset, u64 reserve_range,
> > > > > > > >      			    const struct drm_gpuva_fn_ops *ops);
> > > > > > > >      void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr);
> > > > > > > > +/**
> > > > > > > > + * DRM_GPUVA_EXEC - returns the &drm_gpuva_managers &drm_exec instance
> > > > > > > > + * @mgr: the &drm_gpuva_managers to return the &drm_exec instance for
> > > > > > > > + */
> > > > > > > > +#define DRM_GPUVA_EXEC(mgr)	&(mgr)->exec
> > > > > > > A struct ww_acquire_ctx and thus a drm_exec is fundamentally per task and
> > > > > > > should typically be allocated on the stack. Otherwise you'd need to protect
> > > > > > > the mgr->exec member with an exclusive lock throughout the locking process,
> > > > > > > and that's not what we want.
> > > > > > Oh, good point. I think it works in Nouveau, because there it's implicitly
> > > > > > protected with the job submission lock.
> > > > > > 
> > > > > > > Did you consider subclassing a drm_exec for drm_gpuva purposes and add
> > > > > > > needed ops to it: Like so:
> > > > > > That's a good idea, will take this into V2.
> > > > > Actually, I'm not fully sure that was a good idea: I now have a working
> > > > > version of Xe ported over to drm_exec, having these helpers in mind and with
> > > > > the intention to start using them as they mature. What I found, though is
> > > > > that open-coding the drm_exec loop is not all that bad, but that building
> > > > > blocks that can be called from within the loop are useful:
> > > > > 
> > > > > Like the drm_gpuva_prepare_objects() and an imaginary
> > > > > drm_gpuva_prepare_gpuva() that locks the vm resv and the resv of the object
> > > > > (if different and the gpuva points to the object. And
> > > > > drm_gpuva_prepare_array() although we don't use it within Xe. That means you
> > > > > can use these building blocks like helpers and avoid the fn() callback by
> > > > > instead open-coding.
> > > > > 
> > > > > But I guess YMMV.
> > > > That's exactly why those building blocks are exported, I already had in mind
> > > > that there might be drivers which still want to open-code the drm_exec loop,
> > > > while others might just want a simple interface to lock everything.
> > > > 
> > > > I still think it is a good idea, but I'd keep that as simple as possible. And
> > > > for everything else just let the driver open-code it and use the "building
> > > > blocks" - will also expand the bulding blocks to what you mentioned above.
> > > > 
> > > > > > > struct drm_gpuva_exec_ops {
> > > > > > >        int (*fn) (struct drm_gpuva_exec *exec, int num_fences);
> > > > > > Is this the fn argument from drm_gpuva_manager_lock_extra()?
> > > > > > 
> > > > > > >        int (*bo_validate) (struct drm_gpuva_exec *exec, struct drm_gem_object
> > > > > > > *obj);
> > > > > > I guess we could also keep that within the drm_gpuva_fn_ops? This should always
> > > > > > be the same callback, right?
> > > > > > 
> > > > > > > };
> > > > > > > 
> > > > > > > struct drm_gpuva_exec {
> > > > > > >        const struct drm_gpuva_exec_ops *ops;
> > > > > > >        struct drm_exec exec;
> > > > > > >        struct drm_gpuva_manager *mgr;
> > > > > > > };
> > > > > > > 
> > > > > > > Although I'd actually expect bo_validate to be part of fn in the typical
> > > > > > > case. The drm_gpuva_exec would then be allocated by the caller on the stack.
> > > > > > This doesn't sound like my assumption about fn() above is correct.
> > > > > Well one important thing in our conversion is that ttm_bo_validate () needs
> > > > > to be in the until_all_locked() loop. We want to be able soon to use
> > > > > sleeping locks for eviction, so a xe_bo_validate() would, at least
> > > > > temporarily, add locked objects to the drm_exec list of locked objects. That
> > > > > means everything that may end up calling validate deep within the call chain
> > > > > needs to be part of the until_all_locked() loop, so our
> > > > > drm_gpuva_manager_lock_extra() fn callback would include those validates and
> > > > > look different all the time. Hence that's why open-coding isn't all that
> > > > > bad...
> > > > Oh, I see. You indeed want to call validate() from within until_all_locked().
> > > > 
> > > > > /Thomas
> > > > > 
> > > > > 
> > <snip>
> 


^ permalink raw reply	[flat|nested] 88+ messages in thread

* Re: [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
  2023-08-31 19:07                   ` [Nouveau] " Danilo Krummrich
  (?)
@ 2023-09-01  5:59                     ` Thomas Hellström (Intel)
  -1 siblings, 0 replies; 88+ messages in thread
From: Thomas Hellström (Intel) @ 2023-09-01  5:59 UTC (permalink / raw)
  To: Danilo Krummrich
  Cc: airlied, daniel, matthew.brost, thomas.hellstrom, sarah.walker,
	donald.robson, boris.brezillon, christian.koenig, faith.ekstrand,
	bskeggs, Liam.Howlett, nouveau, linux-kernel, dri-devel


On 8/31/23 21:07, Danilo Krummrich wrote:
> On Thu, Aug 31, 2023 at 06:53:01PM +0200, Thomas Hellström (Intel) wrote:
>> Hi,
>>
>> On 8/31/23 13:18, Danilo Krummrich wrote:
>>> On Thu, Aug 31, 2023 at 11:04:06AM +0200, Thomas Hellström (Intel) wrote:
>>>> Hi!
>>>>
>>>> On 8/30/23 17:00, Danilo Krummrich wrote:
>>>>> On Wed, Aug 30, 2023 at 03:42:08PM +0200, Thomas Hellström (Intel) wrote:
>>>>>> On 8/30/23 14:49, Danilo Krummrich wrote:
>>>>>>> Hi Thomas,
>>>>>>>
>>>>>>> thanks for having a look!
>>>>>>>
>>>>>>> On Wed, Aug 30, 2023 at 09:27:45AM +0200, Thomas Hellström (Intel) wrote:
>>>>>>>> Hi, Danilo.
>>>>>>>>
>>>>>>>> Some quick comments since I'm doing some Xe work in this area. Will probably
>>>>>>>> get back with more.
>>>>>>>>
>>>>>>>> On 8/20/23 23:53, Danilo Krummrich wrote:
>>> <snip>
>>>
>>>>>>>>> diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h
>>>>>>>>> index ed8d50200cc3..693e2da3f425 100644
>>>>>>>>> --- a/include/drm/drm_gpuva_mgr.h
>>>>>>>>> +++ b/include/drm/drm_gpuva_mgr.h
>>>>>>>>> @@ -26,12 +26,16 @@
>>>>>>>>>        */
>>>>>>>>>       #include <linux/list.h>
>>>>>>>>> +#include <linux/dma-resv.h>
>>>>>>>>> +#include <linux/maple_tree.h>
>>>>>>>>>       #include <linux/rbtree.h>
>>>>>>>>>       #include <linux/types.h>
>>>>>>>>>       #include <drm/drm_gem.h>
>>>>>>>>> +#include <drm/drm_exec.h>
>>>>>>>>>       struct drm_gpuva_manager;
>>>>>>>>> +struct drm_gpuva_gem;
>>>>>>>>>       struct drm_gpuva_fn_ops;
>>>>>>>>>       /**
>>>>>>>>> @@ -140,7 +144,7 @@ struct drm_gpuva {
>>>>>>>>>       int drm_gpuva_insert(struct drm_gpuva_manager *mgr, struct drm_gpuva *va);
>>>>>>>>>       void drm_gpuva_remove(struct drm_gpuva *va);
>>>>>>>>> -void drm_gpuva_link(struct drm_gpuva *va);
>>>>>>>>> +void drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo);
>>>>>>>>>       void drm_gpuva_unlink(struct drm_gpuva *va);
>>>>>>>>>       struct drm_gpuva *drm_gpuva_find(struct drm_gpuva_manager *mgr,
>>>>>>>>> @@ -240,15 +244,137 @@ struct drm_gpuva_manager {
>>>>>>>>>       	 * @ops: &drm_gpuva_fn_ops providing the split/merge steps to drivers
>>>>>>>>>       	 */
>>>>>>>>>       	const struct drm_gpuva_fn_ops *ops;
>>>>>>>>> +
>>>>>>>>> +	/**
>>>>>>>>> +	 * @d_obj: Dummy GEM object; used internally to pass the GPU VMs
>>>>>>>>> +	 * dma-resv to &drm_exec.
>>>>>>>>> +	 */
>>>>>>>>> +	struct drm_gem_object d_obj;
>>>>>>>>> +
>>>>>>>>> +	/**
>>>>>>>>> +	 * @resv: the &dma_resv for &drm_gem_objects mapped in this GPU VA
>>>>>>>>> +	 * space
>>>>>>>>> +	 */
>>>>>>>>> +	struct dma_resv *resv;
>>>>>>>>> +
>>>>>>>>> +	/**
>>>>>>>>> +	 * @exec: the &drm_exec helper to lock external &drm_gem_objects
>>>>>>>>> +	 */
>>>>>>>>> +	struct drm_exec exec;
>>>>>>>>> +
>>>>>>>>> +	/**
>>>>>>>>> +	 * @mt_ext: &maple_tree storing external &drm_gem_objects
>>>>>>>>> +	 */
>>>>>>>>> +	struct maple_tree mt_ext;
>>>>>>>> Why are you using a maple tree here? Insertion and removal is O(log(n))
>>>>>>>> instead of O(1) for a list?
>>>>>>>>
>>>>>>> Having a list of drm_gem_objects directly wouldn't work, as multiple GPU-VMs
>>>>>>> could have mappings of the same extobj.
>>>>>>>
>>>>>>> I considered using the VM_BO abstraction (struct drm_gpuva_gem) as list entry
>>>>>>> instead, which also seems to be the obvious choice. However, there is a locking
>>>>>>> conflict.
>>>>>>>
>>>>>>> A drm_gem_object keeps a list of drm_gpuva_gems, while each drm_gpuva_gem keeps
>>>>>>> a list of drm_gpuvas. Both lists are either protected with the dma-resv lock of
>>>>>>> the corresponding drm_gem_object, or with an external lock provided by the
>>>>>>> driver (see drm_gem_gpuva_set_lock()). The latter is used by drivers performing
>>>>>>> changes on the GPUVA space directly from the fence signalling path.
>>>>>>>
>>>>>>> Now, similar to what drm_gpuva_link() and drm_gpuva_unlink() are doing already,
>>>>>>> we'd want to add a drm_gpuva_gem to the extobj list for the first mapping being
>>>>>>> linked and we'd want to remove it for the last one being unlinked.
>>>>>>>
>>>>>>> (Actually we'd want to add the drm_gpuva_gem object to the extobj list even
>>>>>>> before, because otherwise we'd not acquire this GEM object's dma-resv lock
>>>>>>> through drm_gpuva_manager_lock(). But that's trivial, we could do that when we
>>>>>>> create the drm_gpuva_gem, which we need to do anyways.)
>>>>>>>
>>>>>>> Anyway, we'd probably want to keep removing the drm_gpuva_gem from the extobj
>>>>>>> list from drm_gpuva_unlink() when the last mapping of this BO is unlinked. In
>>>>>>> order to do so, we'd (as discussed above) either need to hold the outer GPU-VM
>>>>>>> lock or the GPU-VMs dma-resv lock. Both would be illegal in the case
>>>>>>> drm_gpuva_unlink() is called from within the fence signalling path. For drivers
>>>>>>> like XE or Nouveau, we'd at least need to make sure to not mess up the locking
>>>>>>> hierarchy of GPU-VM lock and dma-resv lock of the corresponding BO.
>>>>>>>
>>>>>>> Considering all that, I thought it's probably better to track extobjs separate
>>>>>>> from the drm_gpuva_gem, hence the maple tree choice.
>>>>>> Hm. OK, in Xe we're having a list of the xe_vmas (drm_gpuvas) that point to
>>>>>> external objects, or in the case of multiple mappings to the same gem
>>>>>> object, only one of the drm_gpuvas is in the list. These are protected by
>>>>>> the GPU-VM lock. I don't see a problem with removing those from the fence
>>>>>> signalling path, though?
>>>>> I intentionally tried to avoid keeping a list of drm_gpuvas to track extobjs,
>>>>> since this is generic code I don't know how many mappings of an external object
>>>>> the corresponding driver potentially creates. This could become a pretty large
>>>>> list to iterate. Another reason was that I want to keep the drm_gpuva structure
>>>>> as small as possible, hence avoiding another list_head.
>>>> Yes, the list might be pretty large, but OTOH you never iterate to access a
>>>> single list element. When you need to iterate the whole list you need to do
>>>> that regardless of the data structure used. As for the list head, it might
>>>> perhaps be aliased (union) with an upcoming userptr list head?
>>>>
>>> Oh, I did not mean that I'm concerned about the size of a list of extobjs in
>>> general, that would indeed be the same for every data structure chosen. But I
>>> would be concerned about keeping a list of *all* mappings being backed by an
>>> extobj.
>>>
>>>>> Now, it sounds like in XE you're doing some kind of optimization just keeping a
>>>>> single mapping of an extobj in the list? How do you know when to remove it? What
>>>>> if the mapping from the extobj list gets unmapped, but there is still another
>>>>> one left in the GPU-VM being backed by the same BO?
>>>> When removing from the lists, we iterate through the object's list of vmas,
>>>> and if there is one matching the same vm, we replace the old one with the
>>>> new one. A similar iteration is done when adding to avoid adding one that is
>>>> already on the list.
>>> I see, but wouldn't this be O(n) on insertion and O(m) on removal of an extobj,
>>> while using the maple tree is O(log(n))?
>> No, insertion and removal is O(m) where m is the number of vms the object is
>> currently bound to. Typically a very small number.
> Ok, my guess was that on insertion you'd actually walk the extobj list and see
> if there's a vma backed by the same BO already, while on removal you said you're
> walking the BO's vma list. So I guess on insertion you're also walking the BO's
> vma list and see if there's already a mapping for this VM?
>
> In your case that might make sense if you expect the extobj list to be larger
> than the BO's vma list typically. In general I don't think this is true.

I think we're then optimizing for different scenarios. Our compute 
driver will mostly use external objects only, and if shared, I don't 
foresee them being bound to many VMs. What saves us currently here is that in 
compute mode we only really traverse the extobj list after a preempt 
fence wait, or when a vm is using a new context for the first time. So 
the vm's extobj list is pretty large, while each bo's vma list will 
typically be pretty small.

Another reason for us to use the list is one possible, but not yet 
implemented, workaround for this: the "vm fence", which, when attached 
to external bos, pulls them off the extobj list and, on 
"enable_signalling()", splices its sublist of external bos back and then 
snapshots the vm's dma_resv and waits for all its fences. (The idea is 
that it should very seldom be waited for in practice and would largely 
eliminate the extobj handling.) Here a list is an ideal data structure 
for removal and splicing. TBH we really want to avoid this 
optimization, but we need to see how bad extobj handling ends up being in 
practice for the compute drivers.
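
Just to sketch what I mean (all names invented, and again, this is explicitly
not implemented):

#include <linux/dma-fence.h>
#include <linux/list.h>
#include <linux/spinlock.h>

struct example_vm {
        spinlock_t extobj_lock;
        struct list_head extobj_list;
};

struct vm_fence {
        struct dma_fence base;
        struct example_vm *vm;
        struct list_head extobjs;       /* spliced-off external BOs */
};

/* Attaching the "vm fence" moves the external BOs off the VM's extobj
 * list onto the fence's private sublist. */
static void vm_fence_attach(struct vm_fence *f, struct example_vm *vm)
{
        spin_lock(&vm->extobj_lock);
        list_splice_init(&vm->extobj_list, &f->extobjs);
        spin_unlock(&vm->extobj_lock);
}

/* enable_signaling() splices the sublist back; snapshotting the VM's
 * dma_resv fences and waiting for them is left out here. */
static bool vm_fence_enable_signaling(struct dma_fence *fence)
{
        struct vm_fence *f = container_of(fence, struct vm_fence, base);
        struct example_vm *vm = f->vm;

        spin_lock(&vm->extobj_lock);
        list_splice_tail_init(&f->extobjs, &vm->extobj_list);
        spin_unlock(&vm->extobj_lock);

        return true;
}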


>
>>>>> Although assuming that's a no-go for GPUVA wouldn't an XArray be a better
>>>>> choice, keeping O(1)?
>>>>> When tracking extobjs, the address of the drm_gem_object is the key while the
>>>>> reference count is the value. I was thinking of an XArray as well, but I was
>>>>> worried that the corresponding indices could be too much distributed for an
>>>>> XArray to still be efficient. Now that I think about it, it's probably not that
>>>>> bad.
>>>>>
>>>>> Btw., while I agree trying to make things as efficient as possible, what is the
>>>>> magnitude for extobjs to be tracked, do we need to worry about the O(log(n))?
>>>> Not sure yet, TBH, but I think one of our UMDs can only use external objects,
>>>> because they don't know at creation time which ones need exporting. However
>>>> if this turns out to be too bad, there are various flavours of "clever but
>>>> complicated" optimizations that we could think of to reduce the list size.
>>>> Still in our case, we opted for the vma list head for now.
>>> Considering the above, I would guess that if your current approach is good
>>> enough, a maple tree will work as well.
>> Hmm, yeah, it's probably a bikeshed since each drm_exec builds a realloced
>> array of all external objects on each exec.
> I did a quick sketchy benchmark, which is probably good enough. In a maple tree
> with 0xFFFF - 1 existing entries insertion of a random (non-existent) entry
> took on average ~530ns over 1k iterations.
>
> The average insertion time for each entry to build up a tree with 0xFFFF - 1
> entries in the first place was ~1.3us. That's expected since it should hit
> memory allocations more often than the previous one. The maximum peak was ~10us.
> Inserting already existing entries took ~300ns.
>
> That's probably good enough.

That's hard to tell because we have nothing to compare with. For 
drm_exec, Christian chose a realloced array because of linked-list cache 
locality issues and because XArray locking requirements caused measurable 
performance issues. Wouldn't a maple tree suffer from both of these?

In any case, if you go for the maple tree, would it be possible to hide 
the implementation in a way that makes it not too hard to replace if 
real-world workloads prove it necessary?
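
E.g. if all accesses were funneled through a couple of helpers like the ones
below (helper names made up), swapping the maple tree for an XArray, a list or
a realloced array later on would only touch this one place:

#include <linux/maple_tree.h>
#include <drm/drm_gpuva_mgr.h>

/* The container choice (here mgr->mt_ext, simply storing the object
 * pointer) stays private to these helpers. */
static int drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
                                   struct drm_gem_object *obj)
{
        return mtree_insert(&mgr->mt_ext, (unsigned long)obj, obj,
                            GFP_KERNEL);
}

static void drm_gpuva_extobj_remove(struct drm_gpuva_manager *mgr,
                                    struct drm_gem_object *obj)
{
        mtree_erase(&mgr->mt_ext, (unsigned long)obj);
}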

>
>>> Otherwise, if you want, I could do some experiments with Xarray and see how
>>> that works out compared to using a maple tree.
>>>
>>> Btw. another nice thing about using Xarray or maple tree for that is that
>>> drivers updating the VA space from the fence signalling path don't need to
>>> hold a GPU-VM lock to update the extobj list. Actually, they might not need
>>> a GPU-VM lock at all.
>> I still don't follow why drivers would want to do that. Isn't the VA space /
>> fence object list always updated sync from the IOCTL?
> For the extobj list I don't see any advantage not doing that in the IOCTL right
> away. For the VA space there are a few advantages doing it in the fence
> signalling path.
>
> (1) No need to allocate drm_gpuva_ops at all. For a given map / unmap request
>      the driver can receive the callbacks for map / remap / unmap directly.
> (2) No need to unwind VA space updates on failure, also no need for any other
>      unwind tricks.
> (3) Synchronous bind jobs can be injected at any point of time and don't need to
>      be queued up in the scheduler to preserve ordering.
> (4) Potentially less error-prone resource management. Although, I admit partly
>      this is just the consequence of (1) and (2).
>
> Actually, once I get the page table management prepared for that I'd like to
> move Nouveau over this approach.

OK. I guess I need to look at the resulting implementation to fully 
digest this.

Thanks,

Thomas


>
>> /Thomas
>>
>>
>>>> /Thomas
>>>>
>>>>
>>>>>>>>> +
>>>>>>>>> +	/**
>>>>>>>>> +	 * @evict: structure holding the evict list and evict list lock
>>>>>>>>> +	 */
>>>>>>>>> +	struct {
>>>>>>>>> +		/**
>>>>>>>>> +		 * @list: &list_head storing &drm_gem_objects currently being
>>>>>>>>> +		 * evicted
>>>>>>>>> +		 */
>>>>>>>>> +		struct list_head list;
>>>>>>>>> +
>>>>>>>>> +		/**
>>>>>>>>> +		 * @lock: spinlock to protect the evict list against concurrent
>>>>>>>>> +		 * insertion / removal of different &drm_gpuva_gems
>>>>>>>>> +		 */
>>>>>>>>> +		spinlock_t lock;
>>>>>>>>> +	} evict;
>>>>>>>>>       };
>>>>>>>>>       void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
>>>>>>>>> +			    struct drm_device *drm,
>>>>>>>>>       			    const char *name,
>>>>>>>>>       			    u64 start_offset, u64 range,
>>>>>>>>>       			    u64 reserve_offset, u64 reserve_range,
>>>>>>>>>       			    const struct drm_gpuva_fn_ops *ops);
>>>>>>>>>       void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr);
>>>>>>>>> +/**
>>>>>>>>> + * DRM_GPUVA_EXEC - returns the &drm_gpuva_managers &drm_exec instance
>>>>>>>>> + * @mgr: the &drm_gpuva_managers to return the &drm_exec instance for
>>>>>>>>> + */
>>>>>>>>> +#define DRM_GPUVA_EXEC(mgr)	&(mgr)->exec
>>>>>>>> A struct ww_acquire_ctx and thus a drm_exec is fundamentally per task and
>>>>>>>> should typically be allocated on the stack. Otherwise you'd need to protect
>>>>>>>> the mgr->exec member with an exclusive lock throughout the locking process,
>>>>>>>> and that's not what we want.
>>>>>>> Oh, good point. I think it works in Nouveau, because there it's implicitly
>>>>>>> protected with the job submission lock.
>>>>>>>
>>>>>>>> Did you consider subclassing a drm_exec for drm_gpuva purposes and add
>>>>>>>> needed ops to it: Like so:
>>>>>>> That's a good idea, will take this into V2.
>>>>>> Actually, I'm not fully sure that was a good idea: I now have a working
>>>>>> version of Xe ported over to drm_exec, having these helpers in mind and with
>>>>>> the intention to start using them as they mature. What I found, though is
>>>>>> that open-coding the drm_exec loop is not all that bad, but that building
>>>>>> blocks that can be called from within the loop are useful:
>>>>>>
>>>>>> Like the drm_gpuva_prepare_objects() and an imaginary
>>>>>> drm_gpuva_prepare_gpuva() that locks the vm resv and the resv of the object
>>>>>> (if different and the gpuva points to the object. And
>>>>>> drm_gpuva_prepare_array() although we don't use it within Xe. That means you
>>>>>> can use these building blocks like helpers and avoid the fn() callback by
>>>>>> instead open-coding.
>>>>>>
>>>>>> But I guess YMMV.
>>>>> That's exactly why those building blocks are exported, I already had in mind
>>>>> that there might be drivers which still want to open-code the drm_exec loop,
>>>>> while others might just want a simple interface to lock everything.
>>>>>
>>>>> I still think it is a good idea, but I'd keep that as simple as possible. And
>>>>> for everything else just let the driver open-code it and use the "building
>>>>> blocks" - will also expand the bulding blocks to what you mentioned above.
>>>>>
>>>>>>>> struct drm_gpuva_exec_ops {
>>>>>>>>         int (*fn) (struct drm_gpuva_exec *exec, int num_fences);
>>>>>>> Is this the fn argument from drm_gpuva_manager_lock_extra()?
>>>>>>>
>>>>>>>>         int (*bo_validate) (struct drm_gpuva_exec *exec, struct drm_gem_object
>>>>>>>> *obj);
>>>>>>> I guess we could also keep that within the drm_gpuva_fn_ops? This should always
>>>>>>> be the same callback, right?
>>>>>>>
>>>>>>>> };
>>>>>>>>
>>>>>>>> struct drm_gpuva_exec {
>>>>>>>>         const struct drm_gpuva_exec_ops *ops;
>>>>>>>>         struct drm_exec exec;
>>>>>>>>         struct drm_gpuva_manager *mgr;
>>>>>>>> };
>>>>>>>>
>>>>>>>> Although I'd actually expect bo_validate to be part of fn in the typical
>>>>>>>> case. The drm_gpuva_exec would then be allocated by the caller on the stack.
>>>>>>> This doesn't sound like my assumption about fn() above is correct.
>>>>>> Well one important thing in our conversion is that ttm_bo_validate () needs
>>>>>> to be in the until_all_locked() loop. We want to be able soon to use
>>>>>> sleeping locks for eviction, so a xe_bo_validate() would, at least
>>>>>> temporarily, add locked objects to the drm_exec list of locked objects. That
>>>>>> means everything that may end up calling validate deep within the call chain
>>>>>> needs to be part of the until_all_locked() loop, so our
>>>>>> drm_gpuva_manager_lock_extra() fn callback would include those validates and
>>>>>> look different all the time. Hence that's why open-coding isn't all that
>>>>>> bad...
>>>>> Oh, I see. You indeed want to call validate() from within until_all_locked().
>>>>>
>>>>>> /Thomas
>>>>>>
>>>>>>
>>> <snip>

^ permalink raw reply	[flat|nested] 88+ messages in thread

* Re: [Nouveau] [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
@ 2023-09-01  5:59                     ` Thomas Hellström (Intel)
  0 siblings, 0 replies; 88+ messages in thread
From: Thomas Hellström (Intel) @ 2023-09-01  5:59 UTC (permalink / raw)
  To: Danilo Krummrich
  Cc: matthew.brost, thomas.hellstrom, sarah.walker, nouveau,
	dri-devel, linux-kernel, Liam.Howlett, boris.brezillon,
	donald.robson, daniel, christian.koenig, faith.ekstrand, bskeggs


On 8/31/23 21:07, Danilo Krummrich wrote:
> On Thu, Aug 31, 2023 at 06:53:01PM +0200, Thomas Hellström (Intel) wrote:
>> Hi,
>>
>> On 8/31/23 13:18, Danilo Krummrich wrote:
>>> On Thu, Aug 31, 2023 at 11:04:06AM +0200, Thomas Hellström (Intel) wrote:
>>>> Hi!
>>>>
>>>> On 8/30/23 17:00, Danilo Krummrich wrote:
>>>>> On Wed, Aug 30, 2023 at 03:42:08PM +0200, Thomas Hellström (Intel) wrote:
>>>>>> On 8/30/23 14:49, Danilo Krummrich wrote:
>>>>>>> Hi Thomas,
>>>>>>>
>>>>>>> thanks for having a look!
>>>>>>>
>>>>>>> On Wed, Aug 30, 2023 at 09:27:45AM +0200, Thomas Hellström (Intel) wrote:
>>>>>>>> Hi, Danilo.
>>>>>>>>
>>>>>>>> Some quick comments since I'm doing some Xe work in this area. Will probably
>>>>>>>> get back with more.
>>>>>>>>
>>>>>>>> On 8/20/23 23:53, Danilo Krummrich wrote:
>>> <snip>
>>>
>>>>>>>>> diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h
>>>>>>>>> index ed8d50200cc3..693e2da3f425 100644
>>>>>>>>> --- a/include/drm/drm_gpuva_mgr.h
>>>>>>>>> +++ b/include/drm/drm_gpuva_mgr.h
>>>>>>>>> @@ -26,12 +26,16 @@
>>>>>>>>>        */
>>>>>>>>>       #include <linux/list.h>
>>>>>>>>> +#include <linux/dma-resv.h>
>>>>>>>>> +#include <linux/maple_tree.h>
>>>>>>>>>       #include <linux/rbtree.h>
>>>>>>>>>       #include <linux/types.h>
>>>>>>>>>       #include <drm/drm_gem.h>
>>>>>>>>> +#include <drm/drm_exec.h>
>>>>>>>>>       struct drm_gpuva_manager;
>>>>>>>>> +struct drm_gpuva_gem;
>>>>>>>>>       struct drm_gpuva_fn_ops;
>>>>>>>>>       /**
>>>>>>>>> @@ -140,7 +144,7 @@ struct drm_gpuva {
>>>>>>>>>       int drm_gpuva_insert(struct drm_gpuva_manager *mgr, struct drm_gpuva *va);
>>>>>>>>>       void drm_gpuva_remove(struct drm_gpuva *va);
>>>>>>>>> -void drm_gpuva_link(struct drm_gpuva *va);
>>>>>>>>> +void drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo);
>>>>>>>>>       void drm_gpuva_unlink(struct drm_gpuva *va);
>>>>>>>>>       struct drm_gpuva *drm_gpuva_find(struct drm_gpuva_manager *mgr,
>>>>>>>>> @@ -240,15 +244,137 @@ struct drm_gpuva_manager {
>>>>>>>>>       	 * @ops: &drm_gpuva_fn_ops providing the split/merge steps to drivers
>>>>>>>>>       	 */
>>>>>>>>>       	const struct drm_gpuva_fn_ops *ops;
>>>>>>>>> +
>>>>>>>>> +	/**
>>>>>>>>> +	 * @d_obj: Dummy GEM object; used internally to pass the GPU VMs
>>>>>>>>> +	 * dma-resv to &drm_exec.
>>>>>>>>> +	 */
>>>>>>>>> +	struct drm_gem_object d_obj;
>>>>>>>>> +
>>>>>>>>> +	/**
>>>>>>>>> +	 * @resv: the &dma_resv for &drm_gem_objects mapped in this GPU VA
>>>>>>>>> +	 * space
>>>>>>>>> +	 */
>>>>>>>>> +	struct dma_resv *resv;
>>>>>>>>> +
>>>>>>>>> +	/**
>>>>>>>>> +	 * @exec: the &drm_exec helper to lock external &drm_gem_objects
>>>>>>>>> +	 */
>>>>>>>>> +	struct drm_exec exec;
>>>>>>>>> +
>>>>>>>>> +	/**
>>>>>>>>> +	 * @mt_ext: &maple_tree storing external &drm_gem_objects
>>>>>>>>> +	 */
>>>>>>>>> +	struct maple_tree mt_ext;
>>>>>>>> Why are you using a maple tree here? Insertion and removal is O(log(n))
>>>>>>>> instead of O(1) for a list?
>>>>>>>>
>>>>>>> Having a list of drm_gem_objects directly wouldn't work, as multiple GPU-VMs
>>>>>>> could have mappings of the same extobj.
>>>>>>>
>>>>>>> I considered using the VM_BO abstraction (struct drm_gpuva_gem) as list entry
>>>>>>> instead, which also seems to be the obvious choice. However, there is a locking
>>>>>>> conflict.
>>>>>>>
>>>>>>> A drm_gem_object keeps a list of drm_gpuva_gems, while each drm_gpuva_gem keeps
>>>>>>> a list of drm_gpuvas. Both lists are either protected with the dma-resv lock of
>>>>>>> the corresponding drm_gem_object, or with an external lock provided by the
>>>>>>> driver (see drm_gem_gpuva_set_lock()). The latter is used by drivers performing
>>>>>>> changes on the GPUVA space directly from the fence signalling path.
>>>>>>>
>>>>>>> Now, similar to what drm_gpuva_link() and drm_gpuva_unlink() are doing already,
>>>>>>> we'd want to add a drm_gpuva_gem to the extobj list for the first mapping being
>>>>>>> linked and we'd want to remove it for the last one being unlinked.
>>>>>>>
>>>>>>> (Actually we'd want to add the drm_gpuva_gem object to the extobj list even
>>>>>>> before, because otherwise we'd not acquire the dma-resv lock of this GEM object
>>>>>>> through drm_gpuva_manager_lock(). But that's trivial, we could do that when we
>>>>>>> create the drm_gpuva_gem, which we need to do anyways.)
>>>>>>>
>>>>>>> Anyway, we'd probably want to keep removing the drm_gpuva_gem from the extobj
>>>>>>> list from drm_gpuva_unlink() when the last mapping of this BO is unlinked. In
>>>>>>> order to do so, we'd (as discussed above) either need to hold the outer GPU-VM
>>>>>>> lock or the GPU-VMs dma-resv lock. Both would be illegal in the case
>>>>>>> drm_gpuva_unlink() is called from within the fence signalling path. For drivers
>>>>>>> like XE or Nouveau, we'd at least need to make sure to not mess up the locking
>>>>>>> hierarchy of GPU-VM lock and dma-resv lock of the corresponding BO.
>>>>>>>
>>>>>>> Considering all that, I thought it's probably better to track extobjs separate
>>>>>>> from the drm_gpuva_gem, hence the maple tree choice.
>>>>>> Hm. OK, in Xe we have a list of the xe_vmas (drm_gpuvas) that point to
>>>>>> external objects, or in the case of multiple mappings to the same gem
>>>>>> object, only one of the drm_gpuvas is in the list. These are protected by
>>>>>> the GPU-VM lock. I don't see a problem with removing those from the fence
>>>>>> signalling path, though?
>>>>> I intentionally tried to avoid keeping a list of drm_gpuvas to track extobjs,
>>>>> since this is generic code I don't know how many mappings of an external object
>>>>> the corresponding driver potentially creates. This could become a pretty large
>>>>> list to iterate. Another reason was that I want to keep the drm_gpuva structure
>>>>> as small as possible, hence avoiding another list_head.
>>>> Yes, the list might be pretty large, but OTOH you never iterate to access a
>>>> single list element. When you need to iterate the whole list you need to do
>>>> that regardless of the data structure used. As for the list head, it might
>>>> perhaps be aliased (union) with an upcoming userptr list head?
>>>>
>>> Oh, I did not mean that I'm concerned about the size of a list of extobjs in
>>> general, that would indeed be the same for every data structure chosen. But I
>>> would be concerned about keeping a list of *all* mappings being backed by an
>>> extobj.
>>>
>>>>> Now, it sounds like in XE you're doing some kind of optimization just keeping a
>>>>> single mapping of an extobj in the list? How do you know when to remove it? What
>>>>> if the mapping from the extobj list gets unmapped, but there is still another
>>>>> one left in the GPU-VM being backed by the same BO?
>>>> When removing from the lists, we iterate through the object's list of vmas,
>>>> and if there is one matching the same vm, we replace the old one with the
>>>> new one. A similar iteration is done when adding to avoid adding one that is
>>>> already on the list.
>>> I see, but wouldn't this be O(n) on insertion and O(m) on removal of an extobj,
>>> while using the maple tree is O(log(n))?
>> No, insertion and removal is O(m) where m is the number of vms the object is
>> currently bound to. Typically a very small number.
> Ok, my guess was that on insertion you'd actually walk the extobj list and see
> if there's a vma backed by the same BO already, while on removal you said you're
> walking the BO's vma list. So I guess on insertion you're also walking the BO's
> vma list and see if there's already a mapping for this VM?
>
> In your case that might make sense if you expect the extobj list to be larger
> than the BO's vma list typically. In general I don't think this is true.

I think we're then optimizing for different scenarios. Our compute 
driver will use mostly external objects only, and if shared, I don't 
foresee them bound to many VMs. What saves us currently here is that in 
compute mode we only really traverse the extobj list after a preempt 
fence wait, or when a vm is using a new context for the first time. So 
vm's extobj list is pretty large. Each bo's vma list will typically be 
pretty small.

Another reason for us to use the list is that one possible, but not yet 
implemented, workaround for this is the "vm fence", which when attached 
to external bos pulls them off the extobj list and on 
"enable_signalling()" splices its sublist of external bos back, and then 
snapshots the vm's dma_resv and waits for all its fences. (The idea is 
that it should very seldom be waited for in practice, and largely 
eliminate the extobj handling). Here a list is an ideal data structure 
for list removal and splicing. TBH we really want to avoid this 
optimization but we need to see how bad extobj handling ends up in 
practice for the compute drivers.
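
As a rough illustration only of the splicing idea above (the "vm fence" is
explicitly not implemented, and all names, the list layout and the locking
below are made up):

#include <linux/list.h>
#include <linux/spinlock.h>

struct vm_fence {
	/* external BOs temporarily pulled off the VM's extobj list */
	struct list_head extobjs;
	/* the VM's extobj list head and the lock protecting it */
	struct list_head *vm_extobjs;
	spinlock_t *vm_lock;
};

/* On enable_signaling(): splice the external BOs back onto the VM's list. */
static void vm_fence_enable_signaling(struct vm_fence *vf)
{
	spin_lock(vf->vm_lock);
	list_splice_tail_init(&vf->extobjs, vf->vm_extobjs);
	spin_unlock(vf->vm_lock);

	/* ...then snapshot the VM's dma_resv and wait for its fences. */
}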


>
>>>>> Although assuming that's a no-go for GPUVA wouldn't an XArray be a better
>>>>> choice, keeping O(1)?
>>>>> When tracking extobjs, the address of the drm_gem_object is the key while the
>>>>> reference count is the value. I was thinking of an XArray as well, but I was
>>>>> worried that the corresponding indices could be too much distributed for an
>>>>> XArray to still be efficient. Now that I think about it, it's probably not that
>>>>> bad.
>>>>>
>>>>> Btw., while I agree trying to make things as efficient as possible, what is the
>>>>> magnitude for extobjs to be tracked, do we need to worry about the O(log(n))?
>>>> Not sure yet, TBH, but I think one of our UMDs can only use external objects,
>>>> because they don't know at creation time which ones need exporting. However
>>>> if this turns out to be too bad, there are various flavours of "clever but
>>>> complicated" optimizations that we could think of to reduce the list size.
>>>> Still in our case, we opted for the vma list head for now.
>>> Considering the above, I would guess that if your current approach is good
>>> enough, a maple tree will work as well.
>> Hmm, Yeah it's probably a bikeshed since each drm_exec builds a realloced
>> array of all external objects on each exec.
> I did a quick sketchy benchmark, which is probably good enough. In a maple tree
> with 0xFFFF - 1 existing entries, insertion of a random (non-existent) entry
> took on average ~530ns over 1k iterations.
>
> The average insertion time for each entry to build up a tree with 0xFFFF - 1
> entries in the first place was ~1.3us. That's expected since it should hit
> memory allocations more often than the previous one. The maximum peak was ~10us.
> Inserting already existing entries took ~300ns.
>
> That's probably good enough.

That's hard to tell because we have nothing to compare with. For 
drm_exec, Christian chose a realloced array because of linked list cache 
locality issues, and Xarray locking requirements causing measurable 
performance issues. Wouldn't a maple tree suffer from both of these?

In any case, if you go for the maple tree, would it be possible to hide 
the implementation in such a way that it's not too hard to replace if 
real-world workloads prove it necessary?
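
As an illustration of such hiding (helper names invented, locking and error
handling omitted; the key is the drm_gem_object address and the value a
reference count, as discussed above), the container could be wrapped so that
callers never see the maple tree directly:

#include <linux/maple_tree.h>
#include <linux/xarray.h>	/* xa_mk_value() / xa_to_value() */
#include <drm/drm_gem.h>

static int extobj_track(struct maple_tree *mt_ext, struct drm_gem_object *obj)
{
	unsigned long index = (unsigned long)obj;
	void *entry = mtree_load(mt_ext, index);

	if (entry)	/* already tracked, just bump the stored refcount */
		return mtree_store(mt_ext, index,
				   xa_mk_value(xa_to_value(entry) + 1),
				   GFP_KERNEL);

	return mtree_insert(mt_ext, index, xa_mk_value(1), GFP_KERNEL);
}

static void extobj_untrack(struct maple_tree *mt_ext, struct drm_gem_object *obj)
{
	unsigned long index = (unsigned long)obj;
	unsigned long refs = xa_to_value(mtree_load(mt_ext, index));

	if (refs <= 1)
		mtree_erase(mt_ext, index);
	else
		mtree_store(mt_ext, index, xa_mk_value(refs - 1), GFP_KERNEL);
}

Swapping the maple tree for an XArray or a list would then only touch these
two helpers.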

>
>>> Otherwise, if you want, I could do some experiments with Xarray and see how
>>> that works out compared to using a maple tree.
>>>
>>> Btw. another nice thing about using Xarray or maple tree for that is that
>>> drivers updating the VA space from the fence signalling path don't need to
>>> hold a GPU-VM lock to update the extobj list. Actually, they might not need
>>> a GPU-VM lock at all.
>> I still don't follow why drivers would want to do that. Isn't the VA space /
>> fence object list always updated sync from the IOCTL?
> For the extobj list I don't see any advantage not doing that in the IOCTL right
> away. For the VA space there are a few advantages doing it in the fence
> signalling path.
>
> (1) No need to allocate drm_gpuva_ops at all. For a given map / unmap request
>      the driver can receive the callbacks for map / remap / unmap directly.
> (2) No need to unwind VA space updates on failure, also no need for any other
>      unwind tricks.
> (3) Synchronous bind jobs can be injected at any point of time and don't need to
>      be queued up in the scheduler to preserve ordering.
> (4) Potentially less error-prone resource management. Although, I admit partly
>      this is just the consequence of (1) and (2).
>
> Actually, once I get the page table management prepared for that I'd like to
> move Nouveau over this approach.
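
For reference, the direct-callback path mentioned in (1) above builds on the
drm_gpuva_sm_map() / sm_step_* interface the GPUVA manager already provides;
a rough sketch only, with callback bodies, names and the priv cookie as
placeholders:

#include <drm/drm_gpuva_mgr.h>

static int sketch_step_map(struct drm_gpuva_op *op, void *priv)
{
	/* program the page tables for op->map here */
	return 0;
}

static int sketch_step_remap(struct drm_gpuva_op *op, void *priv)
{
	/* split/adjust the existing mapping described by op->remap here */
	return 0;
}

static int sketch_step_unmap(struct drm_gpuva_op *op, void *priv)
{
	/* tear down the mapping described by op->unmap here */
	return 0;
}

/* would be passed to drm_gpuva_manager_init() as the manager's ops */
static const struct drm_gpuva_fn_ops sketch_ops = {
	.sm_step_map	= sketch_step_map,
	.sm_step_remap	= sketch_step_remap,
	.sm_step_unmap	= sketch_step_unmap,
};

/* No drm_gpuva_ops list is allocated; the callbacks above are invoked
 * directly while the VA space is updated. */
static int sketch_vm_bind_map(struct drm_gpuva_manager *mgr, void *priv,
			      u64 addr, u64 range,
			      struct drm_gem_object *obj, u64 offset)
{
	return drm_gpuva_sm_map(mgr, priv, addr, range, obj, offset);
}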

OK. I guess I need to look at the resulting implementation to fully 
digest this.

Thanks,

Thomas


>
>> /Thomas
>>
>>
>>>> /Thomas
>>>>
>>>>
>>>>>>>>> +
>>>>>>>>> +	/**
>>>>>>>>> +	 * @evict: structure holding the evict list and evict list lock
>>>>>>>>> +	 */
>>>>>>>>> +	struct {
>>>>>>>>> +		/**
>>>>>>>>> +		 * @list: &list_head storing &drm_gem_objects currently being
>>>>>>>>> +		 * evicted
>>>>>>>>> +		 */
>>>>>>>>> +		struct list_head list;
>>>>>>>>> +
>>>>>>>>> +		/**
>>>>>>>>> +		 * @lock: spinlock to protect the evict list against concurrent
>>>>>>>>> +		 * insertion / removal of different &drm_gpuva_gems
>>>>>>>>> +		 */
>>>>>>>>> +		spinlock_t lock;
>>>>>>>>> +	} evict;
>>>>>>>>>       };
>>>>>>>>>       void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
>>>>>>>>> +			    struct drm_device *drm,
>>>>>>>>>       			    const char *name,
>>>>>>>>>       			    u64 start_offset, u64 range,
>>>>>>>>>       			    u64 reserve_offset, u64 reserve_range,
>>>>>>>>>       			    const struct drm_gpuva_fn_ops *ops);
>>>>>>>>>       void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr);
>>>>>>>>> +/**
>>>>>>>>> + * DRM_GPUVA_EXEC - returns the &drm_gpuva_managers &drm_exec instance
>>>>>>>>> + * @mgr: the &drm_gpuva_managers to return the &drm_exec instance for
>>>>>>>>> + */
>>>>>>>>> +#define DRM_GPUVA_EXEC(mgr)	&(mgr)->exec
>>>>>>>> A struct ww_acquire_ctx and thus a drm_exec is fundamentally per task and
>>>>>>>> should typically be allocated on the stack. Otherwise you'd need to protect
>>>>>>>> the mgr->exec member with an exclusive lock throughout the locking process,
>>>>>>>> and that's not what we want.
>>>>>>> Oh, good point. I think it works in Nouveau, because there it's implicitly
>>>>>>> protected with the job submission lock.
>>>>>>>
>>>>>>>> Did you consider subclassing a drm_exec for drm_gpuva purposes and add
>>>>>>>> needed ops to it: Like so:
>>>>>>> That's a good idea, will take this into V2.
>>>>>> Actually, I'm not fully sure that was a good idea: I now have a working
>>>>>> version of Xe ported over to drm_exec, having these helpers in mind and with
>>>>>> the intention to start using them as they mature. What I found, though is
>>>>>> that open-coding the drm_exec loop is not all that bad, but that building
>>>>>> blocks that can be called from within the loop are useful:
>>>>>>
>>>>>> Like the drm_gpuva_prepare_objects() and an imaginary
>>>>>> drm_gpuva_prepare_gpuva() that locks the vm resv and the resv of the object
>>>>>> (if different) that the gpuva points to. And
>>>>>> drm_gpuva_prepare_array(), although we don't use it within Xe. That means you
>>>>>> can use these building blocks like helpers and avoid the fn() callback by
>>>>>> instead open-coding.
>>>>>>
>>>>>> But I guess YMMV.
>>>>> That's exactly why those building blocks are exported, I already had in mind
>>>>> that there might be drivers which still want to open-code the drm_exec loop,
>>>>> while others might just want a simple interface to lock everything.
>>>>>
>>>>> I still think it is a good idea, but I'd keep that as simple as possible. And
>>>>> for everything else just let the driver open-code it and use the "building
>>>>> blocks" - will also expand the building blocks to what you mentioned above.
>>>>>
>>>>>>>> struct drm_gpuva_exec_ops {
>>>>>>>>         int (*fn) (struct drm_gpuva_exec *exec, int num_fences);
>>>>>>> Is this the fn argument from drm_gpuva_manager_lock_extra()?
>>>>>>>
>>>>>>>>         int (*bo_validate) (struct drm_gpuva_exec *exec, struct drm_gem_object
>>>>>>>> *obj);
>>>>>>> I guess we could also keep that within the drm_gpuva_fn_ops? This should always
>>>>>>> be the same callback, right?
>>>>>>>
>>>>>>>> };
>>>>>>>>
>>>>>>>> struct drm_gpuva_exec {
>>>>>>>>         const struct drm_gpuva_exec_ops *ops;
>>>>>>>>         struct drm_exec exec;
>>>>>>>>         struct drm_gpuva_manager *mgr;
>>>>>>>> };
>>>>>>>>
>>>>>>>> Although I'd actually expect bo_validate to be part of fn in the typical
>>>>>>>> case. The drm_gpuva_exec would then be allocated by the caller on the stack.
>>>>>>> This doesn't sound like my assumption about fn() above is correct.
>>>>>> Well, one important thing in our conversion is that ttm_bo_validate() needs
>>>>>> to be in the until_all_locked() loop. We want to be able soon to use
>>>>>> sleeping locks for eviction, so a xe_bo_validate() would, at least
>>>>>> temporarily, add locked objects to the drm_exec list of locked objects. That
>>>>>> means everything that may end up calling validate deep within the call chain
>>>>>> needs to be part of the until_all_locked() loop, so our
>>>>>> drm_gpuva_manager_lock_extra() fn callback would include those validates and
>>>>>> look different all the time. Hence that's why open-coding isn't all that
>>>>>> bad...
>>>>> Oh, I see. You indeed want to call validate() from within until_all_locked().
>>>>>
>>>>>> /Thomas
>>>>>>
>>>>>>
>>> <snip>
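
To make the validate-inside-until_all_locked() point above a bit more
concrete, an open-coded loop along those lines could look roughly like the
sketch below; the init flags and the driver_validate() helper are
placeholders, not part of this series:

#include <drm/drm_exec.h>
#include <drm/drm_gem.h>

/* placeholder for e.g. a ttm_bo_validate() based driver helper */
static int driver_validate(struct drm_gem_object *obj);

static int lock_and_validate(struct drm_exec *exec,
			     struct drm_gem_object **objs, unsigned int count)
{
	unsigned int i;
	int ret;

	drm_exec_init(exec, DRM_EXEC_INTERRUPTIBLE_WAIT);
	drm_exec_until_all_locked(exec) {
		for (i = 0; i < count; i++) {
			ret = drm_exec_prepare_obj(exec, objs[i], 1);
			drm_exec_retry_on_contention(exec);
			if (ret)
				goto err;

			/* may lock and add further objects to @exec */
			ret = driver_validate(objs[i]);
			drm_exec_retry_on_contention(exec);
			if (ret)
				goto err;
		}
	}

	/* success: caller releases the locks with drm_exec_fini() */
	return 0;

err:
	drm_exec_fini(exec);
	return ret;
}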

^ permalink raw reply	[flat|nested] 88+ messages in thread

* Re: [Nouveau] [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
  2023-09-01  5:59                     ` [Nouveau] " Thomas Hellström (Intel)
  (?)
@ 2023-09-01 12:10                       ` Danilo Krummrich
  -1 siblings, 0 replies; 88+ messages in thread
From: Danilo Krummrich @ 2023-09-01 12:10 UTC (permalink / raw)
  To: Thomas Hellström (Intel)
  Cc: matthew.brost, thomas.hellstrom, sarah.walker, nouveau,
	dri-devel, linux-kernel, Liam.Howlett, boris.brezillon,
	donald.robson, daniel, christian.koenig, faith.ekstrand, bskeggs

On Fri, Sep 01, 2023 at 07:59:21AM +0200, Thomas Hellström (Intel) wrote:
> 
> On 8/31/23 21:07, Danilo Krummrich wrote:
> > On Thu, Aug 31, 2023 at 06:53:01PM +0200, Thomas Hellström (Intel) wrote:
> > > Hi,
> > > 
> > > On 8/31/23 13:18, Danilo Krummrich wrote:
> > > > On Thu, Aug 31, 2023 at 11:04:06AM +0200, Thomas Hellström (Intel) wrote:
> > > > > Hi!
> > > > > 
> > > > > On 8/30/23 17:00, Danilo Krummrich wrote:
> > > > > > On Wed, Aug 30, 2023 at 03:42:08PM +0200, Thomas Hellström (Intel) wrote:
> > > > > > > On 8/30/23 14:49, Danilo Krummrich wrote:
> > > > > > > > Hi Thomas,
> > > > > > > > 
> > > > > > > > thanks for having a look!
> > > > > > > > 
> > > > > > > > On Wed, Aug 30, 2023 at 09:27:45AM +0200, Thomas Hellström (Intel) wrote:
> > > > > > > > > Hi, Danilo.
> > > > > > > > > 
> > > > > > > > > Some quick comments since I'm doing some Xe work in this area. Will probably
> > > > > > > > > get back with more.
> > > > > > > > > 
> > > > > > > > > On 8/20/23 23:53, Danilo Krummrich wrote:
> > > > <snip>
> > > > 
> > > > > > > > > > diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h
> > > > > > > > > > index ed8d50200cc3..693e2da3f425 100644
> > > > > > > > > > --- a/include/drm/drm_gpuva_mgr.h
> > > > > > > > > > +++ b/include/drm/drm_gpuva_mgr.h
> > > > > > > > > > @@ -26,12 +26,16 @@
> > > > > > > > > >        */
> > > > > > > > > >       #include <linux/list.h>
> > > > > > > > > > +#include <linux/dma-resv.h>
> > > > > > > > > > +#include <linux/maple_tree.h>
> > > > > > > > > >       #include <linux/rbtree.h>
> > > > > > > > > >       #include <linux/types.h>
> > > > > > > > > >       #include <drm/drm_gem.h>
> > > > > > > > > > +#include <drm/drm_exec.h>
> > > > > > > > > >       struct drm_gpuva_manager;
> > > > > > > > > > +struct drm_gpuva_gem;
> > > > > > > > > >       struct drm_gpuva_fn_ops;
> > > > > > > > > >       /**
> > > > > > > > > > @@ -140,7 +144,7 @@ struct drm_gpuva {
> > > > > > > > > >       int drm_gpuva_insert(struct drm_gpuva_manager *mgr, struct drm_gpuva *va);
> > > > > > > > > >       void drm_gpuva_remove(struct drm_gpuva *va);
> > > > > > > > > > -void drm_gpuva_link(struct drm_gpuva *va);
> > > > > > > > > > +void drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo);
> > > > > > > > > >       void drm_gpuva_unlink(struct drm_gpuva *va);
> > > > > > > > > >       struct drm_gpuva *drm_gpuva_find(struct drm_gpuva_manager *mgr,
> > > > > > > > > > @@ -240,15 +244,137 @@ struct drm_gpuva_manager {
> > > > > > > > > >       	 * @ops: &drm_gpuva_fn_ops providing the split/merge steps to drivers
> > > > > > > > > >       	 */
> > > > > > > > > >       	const struct drm_gpuva_fn_ops *ops;
> > > > > > > > > > +
> > > > > > > > > > +	/**
> > > > > > > > > > +	 * @d_obj: Dummy GEM object; used internally to pass the GPU VMs
> > > > > > > > > > +	 * dma-resv to &drm_exec.
> > > > > > > > > > +	 */
> > > > > > > > > > +	struct drm_gem_object d_obj;
> > > > > > > > > > +
> > > > > > > > > > +	/**
> > > > > > > > > > +	 * @resv: the &dma_resv for &drm_gem_objects mapped in this GPU VA
> > > > > > > > > > +	 * space
> > > > > > > > > > +	 */
> > > > > > > > > > +	struct dma_resv *resv;
> > > > > > > > > > +
> > > > > > > > > > +	/**
> > > > > > > > > > +	 * @exec: the &drm_exec helper to lock external &drm_gem_objects
> > > > > > > > > > +	 */
> > > > > > > > > > +	struct drm_exec exec;
> > > > > > > > > > +
> > > > > > > > > > +	/**
> > > > > > > > > > +	 * @mt_ext: &maple_tree storing external &drm_gem_objects
> > > > > > > > > > +	 */
> > > > > > > > > > +	struct maple_tree mt_ext;
> > > > > > > > > Why are you using a maple tree here? Insertion and removal is O(log(n))
> > > > > > > > > instead of O(1) for a list?
> > > > > > > > > 
> > > > > > > > Having a list of drm_gem_objects directly wouldn't work, as multiple GPU-VMs
> > > > > > > > could have mappings of the same extobj.
> > > > > > > > 
> > > > > > > > I considered using the VM_BO abstraction (struct drm_gpuva_gem) as list entry
> > > > > > > > instead, which also seems to be the obvious choice. However, there is a locking
> > > > > > > > conflict.
> > > > > > > > 
> > > > > > > > A drm_gem_object keeps a list of drm_gpuva_gems, while each drm_gpuva_gem keeps
> > > > > > > > a list of drm_gpuvas. Both lists are either protected with the dma-resv lock of
> > > > > > > > the corresponding drm_gem_object, or with an external lock provided by the
> > > > > > > > driver (see drm_gem_gpuva_set_lock()). The latter is used by drivers performing
> > > > > > > > changes on the GPUVA space directly from the fence signalling path.
> > > > > > > > 
> > > > > > > > Now, similar to what drm_gpuva_link() and drm_gpuva_unlink() are doing already,
> > > > > > > > we'd want to add a drm_gpuva_gem to the extobj list for the first mapping being
> > > > > > > > linked and we'd want to remove it for the last one being unlinked.
> > > > > > > > 
> > > > > > > > (Actually we'd want to add the drm_gpuva_gem object to the extobj list even
> > > > > > > > before, because otherwise we'd not acquire the dma-resv lock of this GEM object
> > > > > > > > through drm_gpuva_manager_lock(). But that's trivial, we could do that when we
> > > > > > > > create the drm_gpuva_gem, which we need to do anyways.)
> > > > > > > > 
> > > > > > > > Anyway, we'd probably want to keep removing the drm_gpuva_gem from the extobj
> > > > > > > > list from drm_gpuva_unlink() when the last mapping of this BO is unlinked. In
> > > > > > > > order to do so, we'd (as discussed above) either need to hold the outer GPU-VM
> > > > > > > > lock or the GPU-VMs dma-resv lock. Both would be illegal in the case
> > > > > > > > drm_gpuva_unlink() is called from within the fence signalling path. For drivers
> > > > > > > > like XE or Nouveau, we'd at least need to make sure to not mess up the locking
> > > > > > > > hierarchy of GPU-VM lock and dma-resv lock of the corresponding BO.
> > > > > > > > 
> > > > > > > > Considering all that, I thought it's probably better to track extobjs separate
> > > > > > > > from the drm_gpuva_gem, hence the maple tree choice.
> > > > > > > Hm. OK, in Xe we have a list of the xe_vmas (drm_gpuvas) that point to
> > > > > > > external objects, or in the case of multiple mappings to the same gem
> > > > > > > object, only one of the drm_gpuvas is in the list. These are protected by
> > > > > > > the GPU-VM lock. I don't see a problem with removing those from the fence
> > > > > > > signalling path, though?
> > > > > > I intentionally tried to avoid keeping a list of drm_gpuvas to track extobjs,
> > > > > > since this is generic code I don't know how many mappings of an external object
> > > > > > the corresponding driver potentially creates. This could become a pretty large
> > > > > > list to iterate. Another reason was that I want to keep the drm_gpuva structure
> > > > > > as small as possible, hence avoiding another list_head.
> > > > > Yes, the list might be pretty large, but OTOH you never iterate to access a
> > > > > single list element. When you need to iterate the whole list you need to do
> > > > > that regardless of the data structure used. As for the list head, it might
> > > > > perhaps be aliased (union) with an upcoming userptr list head?
> > > > > 
> > > > Oh, I did not mean that I'm concerned about the size of a list of extobjs in
> > > > general, that would indeed be the same for every data structure chosen. But I
> > > > would be concerned about keeping a list of *all* mappings being backed by an
> > > > extobj.
> > > > 
> > > > > > Now, it sounds like in XE you're doing some kind of optimization just keeping a
> > > > > > single mapping of an extobj in the list? How do you know when to remove it? What
> > > > > > if the mapping from the extobj list gets unmapped, but there is still another
> > > > > > one left in the GPU-VM being backed by the same BO?
> > > > > When removing from the lists, we iterate through the object's list of vmas,
> > > > > and if there is one matching the same vm, we replace the old one with the
> > > > > new one. A similar iteration is done when adding to avoid adding one that is
> > > > > already on the list.
> > > > I see, but wouldn't this be O(n) on insertion and O(m) on removal of an extobj,
> > > > while using the maple tree is O(log(n))?
> > > No, insertion and removal is O(m) where m is the number of vms the object is
> > > currently bound to. Typically a very small number.
> > Ok, my guess was that on insertion you'd actually walk the extobj list and see
> > if there's a vma backed by the same BO already, while on removal you said you're
> > walking the BO's vma list. So I guess on insertion you're also walking the BO's
> > vma list and see if there's already a mapping for this VM?
> > 
> > In your case that might make sense if you expect the extobj list to be larger
> > than the BO's vma list typically. In general I don't think this is true.
> 
> I think we're then optimizing for different scenarios. Our compute driver
> will use mostly external objects only, and if shared, I don't foresee them
> bound to many VMs. What saves us currently here is that in compute mode we
> only really traverse the extobj list after a preempt fence wait, or when a
> vm is using a new context for the first time. So vm's extobj list is pretty
> large. Each bo's vma list will typically be pretty small.

Admittedly, I did not have in mind VMs where every GEM is an extobj. However,
especially for iterating a lot of extobjs, a maple tree should perform better
than a list.
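
For illustration, iterating all tracked extobjs is then a plain walk; the
surrounding helper and what is done per object below are made up:

#include <linux/maple_tree.h>
#include <drm/drm_gem.h>

static void for_each_extobj_sketch(struct maple_tree *mt_ext)
{
	unsigned long index = 0;
	void *entry;

	mt_for_each(mt_ext, entry, index, ULONG_MAX) {
		/* the index is the drm_gem_object address, see above */
		struct drm_gem_object *obj = (struct drm_gem_object *)index;

		/* e.g. prepare @obj with drm_exec or validate it here */
		(void)obj;
	}
}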

> 
> Another reason for us to use the list is that one possible, but not yet
> implemented, workaround for this is the "vm fence", which when attached to
> external bos pulls them off the extobj list and on "enable_signalling()"
> splices its sublist of external bos back, and then snapshots the vm's
> dma_resv and waits for all its fences. (The idea is that it should very
> seldom be waited for in practice, and largely eliminate the extobj
> handling). Here a list is an ideal data structure for list removal and
> splicing. TBH we really want to avoid this optimization but we need to see
> how bad extobj handling ends up in practice for the compute drivers.

If you end up doing this I highly doubt it'd make sense to use the GPUVA
manager for that, even if it would implement extobjs as a list of drm_gpuva_gems
(VM_BOs). It'd probably be a mess. When you remove extobjs from the GPUVA
manager, not because they're actually gone, but because you want to keep them
separate, you'd need to make sure to keep the drm_gpuva_gem structure alive,
which means you would need to increase the GPUVA managers refcount for extobjs
manually. You could probably also just "steal" them silently, but that'd be
quite nasty as well.

> 
> 
> > 
> > > > > > Although assuming that's a no-go for GPUVA wouldn't an XArray be a better
> > > > > > choice, keeping O(1)?
> > > > > > When tracking extobjs, the address of the drm_gem_object is the key while the
> > > > > > reference count is the value. I was thinking of an XArray as well, but I was
> > > > > > worried that the corresponding indices could be too much distributed for an
> > > > > > XArray to still be efficient. Now that I think about it, it's probably not that
> > > > > > bad.
> > > > > > 
> > > > > > Btw., while I agree trying to make things as efficient as possible, what is the
> > > > > > > magnitude for extobjs to be tracked, do we need to worry about the O(log(n))?
> > > > > > Not sure yet, TBH, but I think one of our UMDs can only use external objects,
> > > > > because they don't know at creation time which ones need exporting. However
> > > > > if this turns out to be too bad, there are various flavours of "clever but
> > > > > complicated" optimizations that we could think of to reduce the list size.
> > > > > Still in our case, we opted for the vma list head for now.
> > > > Considering the above, I would guess that if your current approach is good
> > > > enough, a maple tree will work as well.
> > > Hmm, Yeah it's probably a bikeshed since each drm_exec builds a realloced
> > > array of all external objects on each exec.
> > I did a quick sketchy benchmark, which is probably good enough. In a maple tree
> > with 0xFFFF - 1 existing entries, insertion of a random (non-existent) entry
> > took on average ~530ns over 1k iterations.
> > 
> > The average insertion time for each entry to build up a tree with 0xFFFF - 1
> > entries in the first place was ~1.3us. That's expected since it should hit
> > memory allocations more often than the previous one. The maximum peak was ~10us.
> > Inserting already existing entries took ~300ns.
> > 
> > That's probably good enough.
> 
> That's hard to tell because we have nothing to compare with. For drm_exec,
> Christian chose a realloced array because of linked list cache locality
> issues, and Xarray locking requirements causing measurable performance
> issues. Wouldn't a maple tree suffer from both of these?

The maple tree was designed for cache-efficient traversal and to replace rbtree and
linked lists in MM because of their lack of cache efficiency. (That's also why
it is really unfortunate that we couldn't use maple tree for VMA tracking in the
GPUVA manager.)

In terms of locking, I can only imagine an issue because Xarray always seems to
use RCU and hence you can't get rid of some grace period latency? Otherwise it
should just be a spinlock.

@Christian: Or was there a different issue?

Maple tree can disable RCU entirely [1] AFAIK, hence likely we can avoid such an
issue.

[1] https://elixir.bootlin.com/linux/latest/source/include/linux/maple_tree.h#L612
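
A minimal sketch of that, assuming the flag semantics linked in [1] (not
setting MT_FLAGS_USE_RCU avoids RCU-deferred node freeing, so access is
serialized purely by the tree's spinlock, or by an external lock with
MT_FLAGS_LOCK_EXTERN):

#include <linux/maple_tree.h>

struct sketch_mgr {
	struct maple_tree mt_ext;
};

static void sketch_mgr_init(struct sketch_mgr *mgr)
{
	/* no MT_FLAGS_USE_RCU: no grace-period latency, lock-serialized access */
	mt_init_flags(&mgr->mt_ext, 0);
}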

> 
> In any case, if you go for the maple tree, would it be possible to hide the
> implementation in such a way that it's not too hard to replace if real-world
> workloads prove it necessary?

Of course, I would want to do that anyway.

> 
> > 
> > > > Otherwise, if you want, I could do some experiments with Xarray and see how
> > > > that works out compared to using a maple tree.
> > > > 
> > > > Btw. another nice thing about using Xarray or maple tree for that is that
> > > > drivers updating the VA space from the fence signalling path don't need to
> > > > hold a GPU-VM lock to update the extobj list. Actually, they might not need
> > > > a GPU-VM lock at all.
> > > I still don't follow why drivers would want to do that. Isn't the VA space /
> > > fence object list always updated sync from the IOCTL?
> > For the extobj list I don't see any advantage not doing that in the IOCTL right
> > away. For the VA space there are a few advantages doing it in the fence
> > signalling path.
> > 
> > (1) No need to allocate drm_gpuva_ops at all. For a given map / unmap request
> >      the driver can receive the callbacks for map / remap / unmap directly.
> > (2) No need to unwind VA space updates on failure, also no need for any other
> >      unwind tricks.
> > (3) Synchronous bind jobs can be injected at any point of time and don't need to
> >      be queued up in the scheduler to preserve ordering.
> > (4) Potentially less error-prone resource management. Although, I admit partly
> >      this is just the consequence of (1) and (2).
> > 
> > Actually, once I get the page table management prepared for that I'd like to
> > move Nouveau over this approach.
> 
> OK. I guess I need to look at the resulting implementation to fully digest
> this.
> 
> Thanks,
> 
> Thomas
> 
> 
> > 
> > > /Thomas
> > > 
> > > 
> > > > > /Thomas
> > > > > 
> > > > > 
> > > > > > > > > > +
> > > > > > > > > > +	/**
> > > > > > > > > > +	 * @evict: structure holding the evict list and evict list lock
> > > > > > > > > > +	 */
> > > > > > > > > > +	struct {
> > > > > > > > > > +		/**
> > > > > > > > > > +		 * @list: &list_head storing &drm_gem_objects currently being
> > > > > > > > > > +		 * evicted
> > > > > > > > > > +		 */
> > > > > > > > > > +		struct list_head list;
> > > > > > > > > > +
> > > > > > > > > > +		/**
> > > > > > > > > > +		 * @lock: spinlock to protect the evict list against concurrent
> > > > > > > > > > +		 * insertion / removal of different &drm_gpuva_gems
> > > > > > > > > > +		 */
> > > > > > > > > > +		spinlock_t lock;
> > > > > > > > > > +	} evict;
> > > > > > > > > >       };
> > > > > > > > > >       void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> > > > > > > > > > +			    struct drm_device *drm,
> > > > > > > > > >       			    const char *name,
> > > > > > > > > >       			    u64 start_offset, u64 range,
> > > > > > > > > >       			    u64 reserve_offset, u64 reserve_range,
> > > > > > > > > >       			    const struct drm_gpuva_fn_ops *ops);
> > > > > > > > > >       void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr);
> > > > > > > > > > +/**
> > > > > > > > > > + * DRM_GPUVA_EXEC - returns the &drm_gpuva_managers &drm_exec instance
> > > > > > > > > > + * @mgr: the &drm_gpuva_managers to return the &drm_exec instance for
> > > > > > > > > > + */
> > > > > > > > > > +#define DRM_GPUVA_EXEC(mgr)	&(mgr)->exec
> > > > > > > > > A struct ww_acquire_ctx and thus a drm_exec is fundamentally per task and
> > > > > > > > > should typically be allocated on the stack. Otherwise you'd need to protect
> > > > > > > > > the mgr->exec member with an exclusive lock throughout the locking process,
> > > > > > > > > and that's not what we want.
> > > > > > > > Oh, good point. I think it works in Nouveau, because there it's implicitly
> > > > > > > > protected with the job submission lock.
> > > > > > > > 
> > > > > > > > > Did you consider subclassing a drm_exec for drm_gpuva purposes and add
> > > > > > > > > needed ops to it: Like so:
> > > > > > > > That's a good idea, will take this into V2.
> > > > > > > Actually, I'm not fully sure that was a good idea: I now have a working
> > > > > > > version of Xe ported over to drm_exec, having these helpers in mind and with
> > > > > > > the intention to start using them as they mature. What I found, though is
> > > > > > > that open-coding the drm_exec loop is not all that bad, but that building
> > > > > > > blocks that can be called from within the loop are useful:
> > > > > > > 
> > > > > > > Like the drm_gpuva_prepare_objects() and an imaginary
> > > > > > > drm_gpuva_prepare_gpuva() that locks the vm resv and the resv of the object
> > > > > > > (if different) that the gpuva points to. And
> > > > > > > drm_gpuva_prepare_array(), although we don't use it within Xe. That means you
> > > > > > > can use these building blocks like helpers and avoid the fn() callback by
> > > > > > > instead open-coding.
> > > > > > > 
> > > > > > > But I guess YMMV.
> > > > > > That's exactly why those building blocks are exported, I already had in mind
> > > > > > that there might be drivers which still want to open-code the drm_exec loop,
> > > > > > while others might just want a simple interface to lock everything.
> > > > > > 
> > > > > > I still think it is a good idea, but I'd keep that as simple as possible. And
> > > > > > for everything else just let the driver open-code it and use the "building
> > > > > > blocks" - will also expand the building blocks to what you mentioned above.
> > > > > > 
> > > > > > > > > struct drm_gpuva_exec_ops {
> > > > > > > > >         int (*fn) (struct drm_gpuva_exec *exec, int num_fences);
> > > > > > > > Is this the fn argument from drm_gpuva_manager_lock_extra()?
> > > > > > > > 
> > > > > > > > >         int (*bo_validate) (struct drm_gpuva_exec *exec, struct drm_gem_object
> > > > > > > > > *obj);
> > > > > > > > I guess we could also keep that within the drm_gpuva_fn_ops? This should always
> > > > > > > > be the same callback, right?
> > > > > > > > 
> > > > > > > > > };
> > > > > > > > > 
> > > > > > > > > struct drm_gpuva_exec {
> > > > > > > > >         const struct drm_gpuva_exec_ops *ops;
> > > > > > > > >         struct drm_exec exec;
> > > > > > > > >         struct drm_gpuva_manager *mgr;
> > > > > > > > > };
> > > > > > > > > 
> > > > > > > > > Although I'd actually expect bo_validate to be part of fn in the typical
> > > > > > > > > case. The drm_gpuva_exec would then be allocated by the caller on the stack.
> > > > > > > > This doesn't sound like my assumption about fn() above is correct.
> > > > > > > Well, one important thing in our conversion is that ttm_bo_validate() needs
> > > > > > > to be in the until_all_locked() loop. We want to be able soon to use
> > > > > > > sleeping locks for eviction, so a xe_bo_validate() would, at least
> > > > > > > temporarily, add locked objects to the drm_exec list of locked objects. That
> > > > > > > means everything that may end up calling validate deep within the call chain
> > > > > > > needs to be part of the until_all_locked() loop, so our
> > > > > > > drm_gpuva_manager_lock_extra() fn callback would include those validates and
> > > > > > > look different all the time. Hence that's why open-coding isn't all that
> > > > > > > bad...
> > > > > > Oh, I see. You indeed want to call validate() from within until_all_locked().
> > > > > > 
> > > > > > > /Thomas
> > > > > > > 
> > > > > > > 
> > > > <snip>
> 


^ permalink raw reply	[flat|nested] 88+ messages in thread

* Re: [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
@ 2023-09-01 12:10                       ` Danilo Krummrich
  0 siblings, 0 replies; 88+ messages in thread
From: Danilo Krummrich @ 2023-09-01 12:10 UTC (permalink / raw)
  To: Thomas Hellström (Intel)
  Cc: matthew.brost, thomas.hellstrom, sarah.walker, nouveau,
	dri-devel, linux-kernel, Liam.Howlett, boris.brezillon,
	donald.robson, christian.koenig, faith.ekstrand, bskeggs

On Fri, Sep 01, 2023 at 07:59:21AM +0200, Thomas Hellström (Intel) wrote:
> 
> On 8/31/23 21:07, Danilo Krummrich wrote:
> > On Thu, Aug 31, 2023 at 06:53:01PM +0200, Thomas Hellström (Intel) wrote:
> > > Hi,
> > > 
> > > On 8/31/23 13:18, Danilo Krummrich wrote:
> > > > On Thu, Aug 31, 2023 at 11:04:06AM +0200, Thomas Hellström (Intel) wrote:
> > > > > Hi!
> > > > > 
> > > > > On 8/30/23 17:00, Danilo Krummrich wrote:
> > > > > > On Wed, Aug 30, 2023 at 03:42:08PM +0200, Thomas Hellström (Intel) wrote:
> > > > > > > On 8/30/23 14:49, Danilo Krummrich wrote:
> > > > > > > > Hi Thomas,
> > > > > > > > 
> > > > > > > > thanks for having a look!
> > > > > > > > 
> > > > > > > > On Wed, Aug 30, 2023 at 09:27:45AM +0200, Thomas Hellström (Intel) wrote:
> > > > > > > > > Hi, Danilo.
> > > > > > > > > 
> > > > > > > > > Some quick comments since I'm doing some Xe work in this area. Will probably
> > > > > > > > > get back with more.
> > > > > > > > > 
> > > > > > > > > On 8/20/23 23:53, Danilo Krummrich wrote:
> > > > <snip>
> > > > 
> > > > > > > > > > diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h
> > > > > > > > > > index ed8d50200cc3..693e2da3f425 100644
> > > > > > > > > > --- a/include/drm/drm_gpuva_mgr.h
> > > > > > > > > > +++ b/include/drm/drm_gpuva_mgr.h
> > > > > > > > > > @@ -26,12 +26,16 @@
> > > > > > > > > >        */
> > > > > > > > > >       #include <linux/list.h>
> > > > > > > > > > +#include <linux/dma-resv.h>
> > > > > > > > > > +#include <linux/maple_tree.h>
> > > > > > > > > >       #include <linux/rbtree.h>
> > > > > > > > > >       #include <linux/types.h>
> > > > > > > > > >       #include <drm/drm_gem.h>
> > > > > > > > > > +#include <drm/drm_exec.h>
> > > > > > > > > >       struct drm_gpuva_manager;
> > > > > > > > > > +struct drm_gpuva_gem;
> > > > > > > > > >       struct drm_gpuva_fn_ops;
> > > > > > > > > >       /**
> > > > > > > > > > @@ -140,7 +144,7 @@ struct drm_gpuva {
> > > > > > > > > >       int drm_gpuva_insert(struct drm_gpuva_manager *mgr, struct drm_gpuva *va);
> > > > > > > > > >       void drm_gpuva_remove(struct drm_gpuva *va);
> > > > > > > > > > -void drm_gpuva_link(struct drm_gpuva *va);
> > > > > > > > > > +void drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo);
> > > > > > > > > >       void drm_gpuva_unlink(struct drm_gpuva *va);
> > > > > > > > > >       struct drm_gpuva *drm_gpuva_find(struct drm_gpuva_manager *mgr,
> > > > > > > > > > @@ -240,15 +244,137 @@ struct drm_gpuva_manager {
> > > > > > > > > >       	 * @ops: &drm_gpuva_fn_ops providing the split/merge steps to drivers
> > > > > > > > > >       	 */
> > > > > > > > > >       	const struct drm_gpuva_fn_ops *ops;
> > > > > > > > > > +
> > > > > > > > > > +	/**
> > > > > > > > > > +	 * @d_obj: Dummy GEM object; used internally to pass the GPU VMs
> > > > > > > > > > +	 * dma-resv to &drm_exec.
> > > > > > > > > > +	 */
> > > > > > > > > > +	struct drm_gem_object d_obj;
> > > > > > > > > > +
> > > > > > > > > > +	/**
> > > > > > > > > > +	 * @resv: the &dma_resv for &drm_gem_objects mapped in this GPU VA
> > > > > > > > > > +	 * space
> > > > > > > > > > +	 */
> > > > > > > > > > +	struct dma_resv *resv;
> > > > > > > > > > +
> > > > > > > > > > +	/**
> > > > > > > > > > +	 * @exec: the &drm_exec helper to lock external &drm_gem_objects
> > > > > > > > > > +	 */
> > > > > > > > > > +	struct drm_exec exec;
> > > > > > > > > > +
> > > > > > > > > > +	/**
> > > > > > > > > > +	 * @mt_ext: &maple_tree storing external &drm_gem_objects
> > > > > > > > > > +	 */
> > > > > > > > > > +	struct maple_tree mt_ext;
> > > > > > > > > Why are you using a maple tree here? Insertion and removal is O(log(n))
> > > > > > > > > instead of O(1) for a list?
> > > > > > > > > 
> > > > > > > > Having a list of drm_gem_objects directly wouldn't work, as multiple GPU-VMs
> > > > > > > > could have mappings of the same extobj.
> > > > > > > > 
> > > > > > > > I considered using the VM_BO abstraction (struct drm_gpuva_gem) as list entry
> > > > > > > > instead, which also seems to be the obvious choice. However, there is a locking
> > > > > > > > conflict.
> > > > > > > > 
> > > > > > > > A drm_gem_object keeps a list of drm_gpuva_gems, while each drm_gpuva_gem keeps
> > > > > > > > a list of drm_gpuvas. Both lists are either protected with the dma-resv lock of
> > > > > > > > the corresponding drm_gem_object, or with an external lock provided by the
> > > > > > > > driver (see drm_gem_gpuva_set_lock()). The latter is used by drivers performing
> > > > > > > > changes on the GPUVA space directly from the fence signalling path.
> > > > > > > > 
> > > > > > > > Now, similar to what drm_gpuva_link() and drm_gpuva_unlink() are doing already,
> > > > > > > > we'd want to add a drm_gpuva_gem to the extobj list for the first mapping being
> > > > > > > > linked and we'd want to remove it for the last one being unlinked.
> > > > > > > > 
> > > > > > > > (Actually we'd want to add the drm_gpuva_gem object to the extobj list even
> > > > > > > > before, because otherwise we wouldn't acquire this GEM object's dma-resv lock
> > > > > > > > through drm_gpuva_manager_lock(). But that's trivial, we could do that when we
> > > > > > > > create the drm_gpuva_gem, which we need to do anyway.)
> > > > > > > > 
> > > > > > > > Anyway, we'd probably want to keep removing the drm_gpuva_gem from the extobj
> > > > > > > > list from drm_gpuva_unlink() when the last mapping of this BO is unlinked. In
> > > > > > > > order to do so, we'd (as discussed above) either need to hold the outer GPU-VM
> > > > > > > > lock or the GPU-VMs dma-resv lock. Both would be illegal in the case
> > > > > > > > drm_gpuva_unlink() is called from within the fence signalling path. For drivers
> > > > > > > > like XE or Nouveau, we'd at least need to make sure to not mess up the locking
> > > > > > > > hierarchy of GPU-VM lock and dma-resv lock of the corresponding BO.
> > > > > > > > 
> > > > > > > > Considering all that, I thought it's probably better to track extobjs separate
> > > > > > > > from the drm_gpuva_gem, hence the maple tree choice.
> > > > > > > Hm. OK, in Xe we're having a list of the xe_vmas (drm_gpuvas) that point to
> > > > > > > external objects, or in the case of multiple mappings to the same gem
> > > > > > > object, only one of the drm_gpuvas is in the list. These are protected by
> > > > > > > the GPU-VM lock. I don't see a problem with removing those from the fence
> > > > > > > signalling path, though?
> > > > > > I intentionally tried to avoid keeping a list of drm_gpuvas to track extobjs,
> > > > > > since this is generic code I don't know how many mappings of an external object
> > > > > > the corresponding driver potentially creates. This could become a pretty large
> > > > > > list to iterate. Another reason was that I want to keep the drm_gpuva structure
> > > > > > as small as possible, hence avoiding another list_head.
> > > > > Yes, the list might be pretty large, but OTOH you never iterate to access a
> > > > > single list element. When you need to iterate the whole list you need to do
> > > > > that regardless of the data structure used. As for the list head, it might
> > > > > perhaps be aliased (union) with an upcoming userptr list head?
> > > > > 
> > > > Oh, I did not mean that I'm concerned about the size of a list of extobjs in
> > > > general, that would indeed be the same for every data structure chosen. But I
> > > > would be concerned about keeping a list of *all* mappings being backed by an
> > > > extobj.
> > > > 
> > > > > > Now, it sounds like in XE you're doing some kind of optimization just keeping a
> > > > > > single mapping of an extobj in the list? How do you know when to remove it? What
> > > > > > if the mapping from the extobj list gets unmapped, but there is still another
> > > > > > one left in the GPU-VM being backed by the same BO?
> > > > > When removing from the lists, we iterate through the object's list of vmas,
> > > > > and if there is one matching the same vm, we replace the old one with the
> > > > > new one. A similar iteration is done when adding to avoid adding one that is
> > > > > already on the list.
> > > > I see, but wouldn't this be O(n) on insertion and O(m) on removal of an extobj,
> > > > while using the maple tree is O(log(n))?
> > > No, insertion and removal is O(m) where m is the number of vms the object is
> > > currently bound to. Typically a very small number.
> > Ok, my guess was that on insertion you'd actually walk the extobj list and see
> > if there's a vma backed by the same BO already, while on removal you said you're
> > walking the BO's vma list. So I guess on insertion you're also walking the BO's
> > vma list and see if there's already a mapping for this VM?
> > 
> > In your case that might make sense if you expect the extobj list to be larger
> > than the BO's vma list typically. In general I don't think this is true.
> 
> I think we're then optimizing for different scenarios. Our compute driver
> will use mostly external objects only, and if shared, I don't foresee them
> bound to many VMs. What saves us currently here is that in compute mode we
> only really traverse the extobj list after a preempt fence wait, or when a
> vm is using a new context for the first time. So vm's extobj list is pretty
> large. Each bo's vma list will typically be pretty small.

Admittedly, I did not have in mind VMs where every GEM is an extobj. However,
especially for iterating a lot of extobjs a maple tree should perform better
than a list.
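To illustrate what iterating them for locking could look like (mgr->mt_ext is the
maple tree from this series; the helper itself and using the GEM object's address
as the key are assumptions, see also the sketch further below):

/* Called from within drm_exec_until_all_locked(); the caller handles
 * -EDEADLK via drm_exec_retry_on_contention().
 */
static int prepare_extobjs(struct drm_gpuva_manager *mgr,
			   struct drm_exec *exec,
			   unsigned int num_fences)
{
	unsigned long index = 0;
	void *entry;
	int ret;

	mt_for_each(&mgr->mt_ext, entry, index, ULONG_MAX) {
		/* The GEM object's address is the key; the stored entry
		 * (refcount bookkeeping) isn't needed for locking.
		 */
		struct drm_gem_object *obj =
			(struct drm_gem_object *)index;

		ret = drm_exec_prepare_obj(exec, obj, num_fences);
		if (ret)
			return ret;
	}

	return 0;
}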

> 
> Another reason for us to use the list is that one possible, but not yet
> implemented, workaround for this is the "vm fence", which when attached to
> external bos pulls them off the extobj list and on "enable_signalling()"
> splices its sublist of external bos back, and then snapshots the vm's
> dma_resv and waits for all its fences. (The idea is that it should very
> seldom be waited for in practice, and largely eliminate the extobj
> handling). Here a list is an ideal data structure for list removal and
> splicing. TBH we really want to avoid this optimization but we need to see
> how bad extobj handling ends up in practice for the compute drivers.

If you end up doing this I highly doubt it'd make sense to use the GPUVA
manager for that, even if it would implement extobjs as a list of drm_gpuva_gems
(VM_BOs). It'd probably be a mess. When you remove extobjs from the GPUVA
manager, not because they're actually gone, but because you want to keep them
separate, you'd need to make sure to keep the drm_gpuva_gem structure alive,
which means you would need to increase the GPUVA manager's refcount for extobjs
manually. You could probably also just "steal" them silently, but that'd be
quite nasty as well.

> 
> 
> > 
> > > > > > Although assuming that's a no-go for GPUVA wouldn't an XArray be a better
> > > > > > choice, keeping O(1)?
> > > > > > When tracking extobjs, the address of the drm_gem_object is the key while the
> > > > > > reference count is the value. I was thinking of an XArray as well, but I was
> > > > > > worried that the corresponding indices could be too much distributed for an
> > > > > > XArray to still be efficient. Now that I think about it, it's probably not that
> > > > > > bad.
> > > > > > 
> > > > > > Btw., while I agree with trying to make things as efficient as possible, what is
> > > > > > the magnitude of extobjs to be tracked, do we need to worry about the O(log(n))?
> > > > > Not sure yet, TBH, but I think one of our UMDs can only use external objects,
> > > > > because they don't know at creation time which ones need exporting. However
> > > > > if this turns out to be too bad, there are various flavours of "clever but
> > > > > complicated" optimizations that we could think of to reduce the list size.
> > > > > Still in our case, we opted for the vma list head for now.
> > > > Considering the above, I would guess that if your current approach is good
> > > > enough, a maple tree will work as well.
> > > Hmm, Yeah it's probably a bikeshed since each drm_exec builds a realloced
> > > array of all external objects on each exec.
> > I did a quick sketchy benchmark, which is probably good enough. In a maple tree
> > with 0xFFFF - 1 existing entries insertion of a random (non-existent) entry
> > took on average ~530ns over 1k iterations.
> > 
> > The average insertion time for each entry to build up a tree with 0xFFFF - 1
> > entries in the first place was ~1.3us. That's expected since it should hit
> > memory allocations more often than the previous one. The maximum peak was ~10us.
> > Inserting already existing entries took ~300ns.
> > 
> > That's probably good enough.
> 
> That's hard to tell because we have nothing to compare with. For drm_exec,
> Christian chose a realloced array because of linked list cache locality
> issues, and Xarray locking requirements causing measurable performance
> issues. Wouldn't a maple tree suffer from both of these?

Maple tree was designed for cache efficient traversal and to replace rbtree and
linked lists in MM because of their lack of cache efficiency. (That's also why
it is really unfortunate that we couldn't use maple tree for VMA tracking in the
GPUVA manager.)

In terms of locking, I can only imagine an issue because Xarray always seems to
use RCU and hence you can't get rid of some grace period latency? Otherwise it
should just be a spinlock.

@Christian: Or was there a different issue?

Maple tree can disable RCU entirely [1] AFAIK, hence likely we can avoid such an
issue.

[1] https://elixir.bootlin.com/linux/latest/source/include/linux/maple_tree.h#L612
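To sketch what that tracking could look like with the GEM object's address as the
key and the reference count as the value (as described further up in this mail);
extobj_entry, extobj_track() and initializing mgr->mt_ext without MT_FLAGS_USE_RCU
are illustrative assumptions, the real implementation might store the count as a
value entry directly:

struct extobj_entry {
	struct drm_gem_object *obj;
	unsigned int refcount;		/* serialized by the GPU-VM lock */
};

static int extobj_track(struct drm_gpuva_manager *mgr,
			struct drm_gem_object *obj)
{
	unsigned long index = (unsigned long)obj;
	struct extobj_entry *entry;
	int ret;

	entry = mtree_load(&mgr->mt_ext, index);
	if (entry) {
		entry->refcount++;
		return 0;
	}

	entry = kzalloc(sizeof(*entry), GFP_KERNEL);
	if (!entry)
		return -ENOMEM;

	entry->obj = obj;
	entry->refcount = 1;

	ret = mtree_insert(&mgr->mt_ext, index, entry, GFP_KERNEL);
	if (ret)
		kfree(entry);

	return ret;
}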

> 
> In any case if you go for the maple tree would it be possible to hide the
> implementation in a way as to make it not too hard to replace if real-world
> workloads prove it necessary?

Of course, I would want to do that anyway.
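For illustration, the hiding could be as thin as a pair of wrappers like the
following (hypothetical names, not part of this series), so the backing store can
later be swapped for a list or an XArray without touching drivers:

static int drm_gpuva_extobj_insert(struct drm_gpuva_manager *mgr,
				   struct drm_gem_object *obj)
{
	/* Maple tree today (see the sketch above), maybe something else
	 * later; drivers never see the backing data structure.
	 */
	return extobj_track(mgr, obj);
}

static void drm_gpuva_extobj_remove(struct drm_gpuva_manager *mgr,
				    struct drm_gem_object *obj)
{
	unsigned long index = (unsigned long)obj;
	struct extobj_entry *entry = mtree_load(&mgr->mt_ext, index);

	if (entry && --entry->refcount == 0) {
		mtree_erase(&mgr->mt_ext, index);
		kfree(entry);
	}
}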

> 
> > 
> > > > Otherwise, if you want, I could do some experiments with Xarray and see how
> > > > that works out compared to using a maple tree.
> > > > 
> > > > Btw. another nice thing about using Xarray or maple tree for that is that
> > > > drivers updating the VA space from the fence signalling path don't need to
> > > > hold a GPU-VM lock to update the extobj list. Actually, they might not need
> > > > a GPU-VM lock at all.
> > > I still don't follow why drivers would want to do that. Isn't the VA space /
> > > fence object list always updated sync from the IOCTL?
> > For the extobj list I don't see any advantage not doing that in the IOCTL right
> > away. For the VA space there are a few advantages doing it in the fence
> > signalling path.
> > 
> > (1) No need to allocate drm_gpuva_ops at all. For a given map / unmap request
> >      the driver can receive the callbacks for map / remap / unmap directly.
> > (2) No need to unwind VA space updates on failure, also no need for any other
> >      unwind tricks.
> > (3) Synchronous bind jobs can be injected at any point of time and don't need to
> >      be queued up in the scheduler to preserve ordering.
> > (4) Potentially less error prone resource management. Although, I admit partly
> >      this is just the consequence of (1) and (2).
> > 
> > Actually, once I get the page table management prepared for that I'd like to
> > move Nouveau over this approach.
> 
> OK. I guess I need to look at the resulting implementation to fully digest
> this.
> 
> Thanks,
> 
> Thomas
> 
> 
> > 
> > > /Thomas
> > > 
> > > 
> > > > > /Thomas
> > > > > 
> > > > > 
> > > > > > > > > > +
> > > > > > > > > > +	/**
> > > > > > > > > > +	 * @evict: structure holding the evict list and evict list lock
> > > > > > > > > > +	 */
> > > > > > > > > > +	struct {
> > > > > > > > > > +		/**
> > > > > > > > > > +		 * @list: &list_head storing &drm_gem_objects currently being
> > > > > > > > > > +		 * evicted
> > > > > > > > > > +		 */
> > > > > > > > > > +		struct list_head list;
> > > > > > > > > > +
> > > > > > > > > > +		/**
> > > > > > > > > > +		 * @lock: spinlock to protect the evict list against concurrent
> > > > > > > > > > +		 * insertion / removal of different &drm_gpuva_gems
> > > > > > > > > > +		 */
> > > > > > > > > > +		spinlock_t lock;
> > > > > > > > > > +	} evict;
> > > > > > > > > >       };
> > > > > > > > > >       void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> > > > > > > > > > +			    struct drm_device *drm,
> > > > > > > > > >       			    const char *name,
> > > > > > > > > >       			    u64 start_offset, u64 range,
> > > > > > > > > >       			    u64 reserve_offset, u64 reserve_range,
> > > > > > > > > >       			    const struct drm_gpuva_fn_ops *ops);
> > > > > > > > > >       void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr);
> > > > > > > > > > +/**
> > > > > > > > > > + * DRM_GPUVA_EXEC - returns the &drm_gpuva_managers &drm_exec instance
> > > > > > > > > > + * @mgr: the &drm_gpuva_managers to return the &drm_exec instance for
> > > > > > > > > > + */
> > > > > > > > > > +#define DRM_GPUVA_EXEC(mgr)	&(mgr)->exec
> > > > > > > > > A struct ww_acquire_ctx and thus a drm_exec is fundamentally per task and
> > > > > > > > > should typically be allocated on the stack. Otherwise you'd need to protect
> > > > > > > > > the mgr->exec member with an exclusive lock throughout the locking process,
> > > > > > > > > and that's not what we want.
> > > > > > > > Oh, good point. I think it works in Nouveau, because there it's implicitly
> > > > > > > > protected with the job submission lock.
> > > > > > > > 
> > > > > > > > > Did you consider subclassing a drm_exec for drm_gpuva purposes and add
> > > > > > > > > needed ops to it: Like so:
> > > > > > > > That's a good idea, will take this into V2.
> > > > > > > Actually, I'm not fully sure that was a good idea: I now have a working
> > > > > > > version of Xe ported over to drm_exec, having these helpers in mind and with
> > > > > > > the intention to start using them as they mature. What I found, though is
> > > > > > > that open-coding the drm_exec loop is not all that bad, but that building
> > > > > > > blocks that can be called from within the loop are useful:
> > > > > > > 
> > > > > > > Like the drm_gpuva_prepare_objects() and an imaginary
> > > > > > > drm_gpuva_prepare_gpuva() that locks the vm resv and the resv of the object
> > > > > > > (if different and the gpuva points to the object). And
> > > > > > > drm_gpuva_prepare_array(), although we don't use it within Xe. That means you
> > > > > > > can use these building blocks like helpers and avoid the fn() callback by
> > > > > > > instead open-coding.
> > > > > > > 
> > > > > > > But I guess YMMV.
> > > > > > That's exactly why those building blocks are exported, I already had in mind
> > > > > > that there might be drivers which still want to open-code the drm_exec loop,
> > > > > > while others might just want a simple interface to lock everything.
> > > > > > 
> > > > > > I still think it is a good idea, but I'd keep that as simple as possible. And
> > > > > > for everything else just let the driver open-code it and use the "building
> > > > > > blocks" - will also expand the bulding blocks to what you mentioned above.
> > > > > > 
> > > > > > > > > struct drm_gpuva_exec_ops {
> > > > > > > > >         int (*fn) (struct drm_gpuva_exec *exec, int num_fences);
> > > > > > > > Is this the fn argument from drm_gpuva_manager_lock_extra()?
> > > > > > > > 
> > > > > > > > >         int (*bo_validate) (struct drm_gpuva_exec *exec, struct drm_gem_object
> > > > > > > > > *obj);
> > > > > > > > I guess we could also keep that within the drm_gpuva_fn_ops? This should always
> > > > > > > > be the same callback, right?
> > > > > > > > 
> > > > > > > > > };
> > > > > > > > > 
> > > > > > > > > struct drm_gpuva_exec {
> > > > > > > > >         const struct drm_gpuva_exec_ops *ops;
> > > > > > > > >         struct drm_exec exec;
> > > > > > > > >         struct drm_gpuva_manager *mgr;
> > > > > > > > > };
> > > > > > > > > 
> > > > > > > > > Although I'd actually expect bo_validate to be part of fn in the typical
> > > > > > > > > case. The drm_gpuva_exec would then be allocated by the caller on the stack.
> > > > > > > > This doesn't sound like my assumption about fn() above is correct.
> > > > > > > Well, one important thing in our conversion is that ttm_bo_validate() needs
> > > > > > > to be in the until_all_locked() loop. We want to be able to use sleeping
> > > > > > > locks for eviction soon, so a xe_bo_validate() would, at least temporarily,
> > > > > > > add locked objects to the drm_exec list of locked objects. That means
> > > > > > > everything that may end up calling validate deep within the call chain needs
> > > > > > > to be part of the until_all_locked() loop, so our
> > > > > > > drm_gpuva_manager_lock_extra() fn callback would include those validates and
> > > > > > > look different all the time. That's why open-coding isn't all that bad...
> > > > > > Oh, I see. You indeed want to call validate() from within until_all_locked().
> > > > > > 
> > > > > > > /Thomas
> > > > > > > 
> > > > > > > 
> > > > <snip>
> 


^ permalink raw reply	[flat|nested] 88+ messages in thread

* Re: [Nouveau] [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
  2023-09-01 12:10                       ` Danilo Krummrich
  (?)
@ 2023-09-06 14:20                         ` Danilo Krummrich
  -1 siblings, 0 replies; 88+ messages in thread
From: Danilo Krummrich @ 2023-09-06 14:20 UTC (permalink / raw)
  To: Thomas Hellström (Intel)
  Cc: matthew.brost, thomas.hellstrom, sarah.walker, nouveau,
	dri-devel, linux-kernel, Liam.Howlett, boris.brezillon,
	donald.robson, daniel, christian.koenig, faith.ekstrand, bskeggs

On 9/1/23 14:10, Danilo Krummrich wrote:
> On Fri, Sep 01, 2023 at 07:59:21AM +0200, Thomas Hellström (Intel) wrote:
>>
>> On 8/31/23 21:07, Danilo Krummrich wrote:
>>> On Thu, Aug 31, 2023 at 06:53:01PM +0200, Thomas Hellström (Intel) wrote:
>>>> Hi,
>>>>
>>>> On 8/31/23 13:18, Danilo Krummrich wrote:
>>>>> On Thu, Aug 31, 2023 at 11:04:06AM +0200, Thomas Hellström (Intel) wrote:
>>>>>> Hi!
>>>>>>
>>>>>> On 8/30/23 17:00, Danilo Krummrich wrote:
>>>>>>> On Wed, Aug 30, 2023 at 03:42:08PM +0200, Thomas Hellström (Intel) wrote:
>>>>>>>> On 8/30/23 14:49, Danilo Krummrich wrote:
>>>>>>>>> Hi Thomas,
>>>>>>>>>
>>>>>>>>> thanks for having a look!
>>>>>>>>>
>>>>>>>>> On Wed, Aug 30, 2023 at 09:27:45AM +0200, Thomas Hellström (Intel) wrote:
>>>>>>>>>> Hi, Danilo.
>>>>>>>>>>
>>>>>>>>>> Some quick comments since I'm doing some Xe work in this area. Will probably
>>>>>>>>>> get back with more.
>>>>>>>>>>
>>>>>>>>>> On 8/20/23 23:53, Danilo Krummrich wrote:
>>>>> <snip>
>>>>>
>>>>>>>>>>> diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h
>>>>>>>>>>> index ed8d50200cc3..693e2da3f425 100644
>>>>>>>>>>> --- a/include/drm/drm_gpuva_mgr.h
>>>>>>>>>>> +++ b/include/drm/drm_gpuva_mgr.h
>>>>>>>>>>> @@ -26,12 +26,16 @@
>>>>>>>>>>>         */
>>>>>>>>>>>        #include <linux/list.h>
>>>>>>>>>>> +#include <linux/dma-resv.h>
>>>>>>>>>>> +#include <linux/maple_tree.h>
>>>>>>>>>>>        #include <linux/rbtree.h>
>>>>>>>>>>>        #include <linux/types.h>
>>>>>>>>>>>        #include <drm/drm_gem.h>
>>>>>>>>>>> +#include <drm/drm_exec.h>
>>>>>>>>>>>        struct drm_gpuva_manager;
>>>>>>>>>>> +struct drm_gpuva_gem;
>>>>>>>>>>>        struct drm_gpuva_fn_ops;
>>>>>>>>>>>        /**
>>>>>>>>>>> @@ -140,7 +144,7 @@ struct drm_gpuva {
>>>>>>>>>>>        int drm_gpuva_insert(struct drm_gpuva_manager *mgr, struct drm_gpuva *va);
>>>>>>>>>>>        void drm_gpuva_remove(struct drm_gpuva *va);
>>>>>>>>>>> -void drm_gpuva_link(struct drm_gpuva *va);
>>>>>>>>>>> +void drm_gpuva_link(struct drm_gpuva *va, struct drm_gpuva_gem *vm_bo);
>>>>>>>>>>>        void drm_gpuva_unlink(struct drm_gpuva *va);
>>>>>>>>>>>        struct drm_gpuva *drm_gpuva_find(struct drm_gpuva_manager *mgr,
>>>>>>>>>>> @@ -240,15 +244,137 @@ struct drm_gpuva_manager {
>>>>>>>>>>>        	 * @ops: &drm_gpuva_fn_ops providing the split/merge steps to drivers
>>>>>>>>>>>        	 */
>>>>>>>>>>>        	const struct drm_gpuva_fn_ops *ops;
>>>>>>>>>>> +
>>>>>>>>>>> +	/**
>>>>>>>>>>> +	 * @d_obj: Dummy GEM object; used internally to pass the GPU VMs
>>>>>>>>>>> +	 * dma-resv to &drm_exec.
>>>>>>>>>>> +	 */
>>>>>>>>>>> +	struct drm_gem_object d_obj;
>>>>>>>>>>> +
>>>>>>>>>>> +	/**
>>>>>>>>>>> +	 * @resv: the &dma_resv for &drm_gem_objects mapped in this GPU VA
>>>>>>>>>>> +	 * space
>>>>>>>>>>> +	 */
>>>>>>>>>>> +	struct dma_resv *resv;
>>>>>>>>>>> +
>>>>>>>>>>> +	/**
>>>>>>>>>>> +	 * @exec: the &drm_exec helper to lock external &drm_gem_objects
>>>>>>>>>>> +	 */
>>>>>>>>>>> +	struct drm_exec exec;
>>>>>>>>>>> +
>>>>>>>>>>> +	/**
>>>>>>>>>>> +	 * @mt_ext: &maple_tree storing external &drm_gem_objects
>>>>>>>>>>> +	 */
>>>>>>>>>>> +	struct maple_tree mt_ext;
>>>>>>>>>> Why are you using a maple tree here? Insertion and removal is O(log(n))
>>>>>>>>>> instead of O(1) for a list?
>>>>>>>>>>
>>>>>>>>> Having a list of drm_gem_objects directly wouldn't work, as multiple GPU-VMs
>>>>>>>>> could have mappings of the same extobj.
>>>>>>>>>
>>>>>>>>> I considered using the VM_BO abstraction (struct drm_gpuva_gem) as list entry
>>>>>>>>> instead, which also seems to be the obvious choice. However, there is a locking
>>>>>>>>> conflict.
>>>>>>>>>
>>>>>>>>> A drm_gem_object keeps a list of drm_gpuva_gems, while each drm_gpuva_gem keeps
>>>>>>>>> a list of drm_gpuvas. Both lists are either protected with the dma-resv lock of
>>>>>>>>> the corresponding drm_gem_object, or with an external lock provided by the
>>>>>>>>> driver (see drm_gem_gpuva_set_lock()). The latter is used by drivers performing
>>>>>>>>> changes on the GPUVA space directly from the fence signalling path.
>>>>>>>>>
>>>>>>>>> Now, similar to what drm_gpuva_link() and drm_gpuva_unlink() are doing already,
>>>>>>>>> we'd want to add a drm_gpuva_gem to the extobj list for the first mapping being
>>>>>>>>> linked and we'd want to remove it for the last one being unlinked.
>>>>>>>>>
>>>>>>>>> (Actually we'd want to add the drm_gpuva_gem object to the extobj list even
>>>>>>>>> before, because otherwise we wouldn't acquire this GEM object's dma-resv lock
>>>>>>>>> through drm_gpuva_manager_lock(). But that's trivial, we could do that when we
>>>>>>>>> create the drm_gpuva_gem, which we need to do anyway.)
>>>>>>>>>
>>>>>>>>> Anyway, we'd probably want to keep removing the drm_gpuva_gem from the extobj
>>>>>>>>> list from drm_gpuva_unlink() when the last mapping of this BO is unlinked. In
>>>>>>>>> order to do so, we'd (as discussed above) either need to hold the outer GPU-VM
>>>>>>>>> lock or the GPU-VMs dma-resv lock. Both would be illegal in the case
>>>>>>>>> drm_gpuva_unlink() is called from within the fence signalling path. For drivers
>>>>>>>>> like XE or Nouveau, we'd at least need to make sure to not mess up the locking
>>>>>>>>> hierarchy of GPU-VM lock and dma-resv lock of the corresponding BO.
>>>>>>>>>
>>>>>>>>> Considering all that, I thought it's probably better to track extobjs separate
>>>>>>>>> from the drm_gpuva_gem, hence the maple tree choice.
>>>>>>>> Hm. OK, in Xe we're having a list of the xe_vmas (drm_gpuvas) that point to
>>>>>>>> external objects, or in the case of multiple mappings to the same gem
>>>>>>>> object, only one of the drm_gpuvas is in the list. These are protected by
>>>>>>>> the GPU-VM lock. I don't see a problem with removing those from the fence
>>>>>>>> signalling path, though?
>>>>>>> I intentionally tried to avoid keeping a list of drm_gpuvas to track extobjs,
>>>>>>> since this is generic code I don't know how many mappings of an external object
>>>>>>> the corresponding driver potentially creates. This could become a pretty large
>>>>>>> list to iterate. Another reason was that I want to keep the drm_gpuva structure
>>>>>>> as small as possible, hence avoiding another list_head.
>>>>>> Yes, the list might be pretty large, but OTOH you never iterate to access a
>>>>>> single list element. When you need to iterate the whole list you need to do
>>>>>> that regardless of the data structure used. As for the list head, it might
>>>>>> perhaps be aliased (union) with an upcoming userptr list head?
>>>>>>
>>>>> Oh, I did not mean that I'm concerned about the size of a list of extobjs in
>>>>> general, that would indeed be the same for every data structure chosen. But I
>>>>> would be concerned about keeping a list of *all* mappings being backed by an
>>>>> extobj.
>>>>>
>>>>>>> Now, it sounds like in XE you're doing some kind of optimization just keeping a
>>>>>>> single mapping of an extobj in the list? How do you know when to remove it? What
>>>>>>> if the mapping from the extobj list gets unmapped, but there is still another
>>>>>>> one left in the GPU-VM being backed by the same BO?
>>>>>> When removing from the lists, we iterate through the object's list of vmas,
>>>>>> and if there is one matching the same vm, we replace the old one with the
>>>>>> new one. A similar iteration is done when adding to avoid adding one that is
>>>>>> already on the list.
>>>>> I see, but wouldn't this be O(n) on insertion and O(m) on removal of an extobj,
>>>>> while using the maple tree is O(log(n))?
>>>> No, insertion and removal is O(m) where m is the number of vms the object is
>>>> currently bound to. Typically a very small number.
>>> Ok, my guess was that on insertion you'd actually walk the extobj list and see
>>> if there's a vma backed by the same BO already, while on removal you said you're
>>> walking the BO's vma list. So I guess on insertion you're also walking the BO's
>>> vma list and see if there's already a mapping for this VM?
>>>
>>> In your case that might make sense if you expect the extobj list to be larger
>>> than the BO's vma list typically. In general I don't think this is true.
>>
>> I think we're then optimizing for different scenarios. Our compute driver
>> will use mostly external objects only, and if shared, I don't forsee them
>> bound to many VMs. What saves us currently here is that in compute mode we
>> only really traverse the extobj list after a preempt fence wait, or when a
>> vm is using a new context for the first time. So vm's extobj list is pretty
>> large. Each bo's vma list will typically be pretty small.
> 
> Admittedly, I did not have in mind VMs where every GEM is an extobj. However,
> especially for iterating a lot of extobjs a maple tree should perform better
> than a list.
> 
>>
>> Another reason for us to use the list is that one possible, but not yet
>> implemented, workaround for this is the "vm fence", which when attached to
>> external bos pulls them off the extobj list and on "enable_signalling()"
>> splices its sublist of external bos back, and then snapshots the vm's
>> dma_resv and waits for all its fences. (The idea is that it should very
>> seldom be waited for in practice, and largely eliminate the extobj
>> handling). Here a list is an ideal data structure for list removal and
>> splicing. TBH we really want to avoid this optimization but we need to see
>> how bad extobj handling ends up in practice for the compute drivers.
> 
> If you end up doing this I highly doubt it'd make sense to use the GPUVA
> manager for that, even if it would implement extobjs as a list of drm_gpuva_gems
> (VM_BOs). It'd probably be a mess. When you remove extobjs from the GPUVA
> manager, not because they're actually gone, but because you want to keep them
> separate, you'd need to make sure to keep the drm_gpuva_gem structure alive,
> which means you would need to increase the GPUVA manager's refcount for extobjs
> manually. You could probably also just "steal" them silently, but that'd be
> quite nasty as well.
> 
>>
>>
>>>
>>>>>>> Although assuming that's a no-go for GPUVA wouldn't an XArray be a better
>>>>>>> choice, keeping O(1)?
>>>>>>> When tracking extobjs, the address of the drm_gem_object is the key while the
>>>>>>> reference count is the value. I was thinking of an XArray as well, but I was
>>>>>>> worried that the corresponding indices could be too much distributed for an
>>>>>>> XArray to still be efficient. Now that I think about it, it's probably not that
>>>>>>> bad.
>>>>>>>
>>>>>>> Btw., while I agree with trying to make things as efficient as possible, what is
>>>>>>> the magnitude of extobjs to be tracked, do we need to worry about the O(log(n))?
>>>>>> Not sure yet, TBH, but I think one of our UMDs can only use external objects,
>>>>>> because they don't know at creation time which ones need exporting. However
>>>>>> if this turns out to be too bad, there are various flavours of "clever but
>>>>>> complicated" optimizations that we could think of to reduce the list size.
>>>>>> Still in our case, we opted for the vma list head for now.
>>>>> Considering the above, I would guess that if your current approach is good
>>>>> enough, a maple tree will work as well.
>>>> Hmm, Yeah it's probably a bikeshed since each drm_exec builds a realloced
>>>> array of all external objects on each exec.
>>> I did a quick sketchy benchmark, which is probably good enough. In a maple tree
>>> with 0xFFFF - 1 existing entries insertion of a random (non-existent) entry
>>> took on average ~530ns over 1k iterations.
>>>
>>> The average insertion time for each entry to build up a tree with 0xFFFF - 1
>>> entries in the first place was ~1.3us. That's expected since it should hit
>>> memory allocations more often than the previous one. The maximum peak was ~10us.
>>> Inserting already existing entries took ~300ns.
>>>
>>> That's probably good enough.
>>
>> That's hard to tell because we have nothing to compare with. For drm_exec,
>> Christian chose a realloced array because of linked list cache locality
>> issues, and Xarray locking requirements causing measurable performance
>> issues. Wouldn't a maple tree suffer from both of these?
> 
> Maple tree was designed for cache efficient traversal and to replace rbtree and
> linked lists in MM because of their lack of cache efficiency. (That's also why
> it is really unfortunate that we couldn't use maple tree for VMA tracking in the
> GPUVA manager.)
> 
> In terms of locking, I can only imagine an issue because Xarray always seems to
> use RCU and hence you can't get rid of some grace period latency? Otherwise it
> should just be a spinlock.
> 
> @Christian: Or was there a different issue?
> 
> Maple tree can disable RCU entirely [1] AFAIK, hence likely we can avoid such an
> issue.
> 
> [1] https://elixir.bootlin.com/linux/latest/source/include/linux/maple_tree.h#L612
> 
>>
>> In any case if you go for the maple tree would it be possible to hide the
>> implementation in a way as to make it not too hard to replace if real-world
>> workloads prove it necessary?
> 
> Of course, I would want to do that anyway.

Just a heads-up. It looks like (with the help of Boris) I can come up with a solution
everyone should be happy with. I think we can move extobjs to a list and, API- and
locking-wise, do (almost) everything just as we would if there wasn't the use case of
updating the VA space with direct callbacks from the fence signalling path.

Drivers doing that can simply schedule work from the fence signalling path that calls
drm_gpuva_unlink() for explicit unmaps, to avoid locking issues.
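A rough sketch of that pattern; the structure and function names are made up and
only the deferral itself matters. The work item would be pre-allocated at job
submission time, since the fence signalling path must not allocate:

struct driver_unmap_work {
	struct work_struct work;
	struct drm_gpuva *va;
};

static void driver_unmap_work_fn(struct work_struct *work)
{
	struct driver_unmap_work *uwork =
		container_of(work, struct driver_unmap_work, work);

	/* Process context: taking the dma-resv lock (or the driver's external
	 * GPUVA lock) is fine here.
	 */
	drm_gpuva_unlink(uwork->va);
	kfree(uwork);
}

/* Called from the fence signalling path instead of unlinking directly. */
static void driver_defer_unlink(struct driver_unmap_work *uwork,
				struct drm_gpuva *va)
{
	uwork->va = va;
	INIT_WORK(&uwork->work, driver_unmap_work_fn);
	schedule_work(&uwork->work);
}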

Drivers relying on VA space updates in the IOCTL already aren't affected at all.

I'll probably send out a v2 today or tomorrow.

- Danilo

> 
>>
>>>
>>>>> Otherwise, if you want, I could do some experiments with Xarray and see how
>>>>> that works out compared to using a maple tree.
>>>>>
>>>>> Btw. another nice thing about using Xarray or maple tree for that is that
>>>>> drivers updating the VA space from the fence signalling path don't need to
>>>>> hold a GPU-VM lock to update the extobj list. Actually, they might not need
>>>>> a GPU-VM lock at all.
>>>> I still don't follow why drivers would want to do that. Isn't the VA space /
>>>> fence object list always updated sync from the IOCTL?
>>> For the extobj list I don't see any advantage not doing that in the IOCTL right
>>> away. For the VA space there are a few advantages doing it in the fence
>>> signalling path.
>>>
>>> (1) No need to allocate drm_gpuva_ops at all. For a given map / unmap request
>>>       the driver can receive the callbacks for map / remap / unmap directly.
>>> (2) No need to unwind VA space updates on failure, also no need for any other
>>>       unwind tricks.
>>> (3) Synchronous bind jobs can be injected at any point in time and don't need to
>>>       be queued up in the scheduler to preserve ordering.
>>> (4) Potentially less error-prone resource management. Although, I admit this is
>>>       partly just a consequence of (1) and (2).
>>>
>>> Actually, once I get the page table management prepared for that, I'd like to
>>> move Nouveau over to this approach.
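>>>
>>> To illustrate (1): a rough driver-side sketch based on the existing sm_map
>>> interface. The driver-side callback and variable names are hypothetical and
>>> error handling is omitted:
>>>
>>> 	static int driver_sm_step_map(struct drm_gpuva_op *op, void *priv)
>>> 	{
>>> 		/* Program the page tables for op->map.va.addr / op->map.va.range. */
>>> 		return 0;
>>> 	}
>>>
>>> 	static int driver_sm_step_remap(struct drm_gpuva_op *op, void *priv)
>>> 	{
>>> 		/* Split the existing mapping as described by op->remap. */
>>> 		return 0;
>>> 	}
>>>
>>> 	static int driver_sm_step_unmap(struct drm_gpuva_op *op, void *priv)
>>> 	{
>>> 		/* Tear down the page tables of the mapping op->unmap.va. */
>>> 		return 0;
>>> 	}
>>>
>>> 	/* Passed as the ops argument to drm_gpuva_manager_init(). */
>>> 	static const struct drm_gpuva_fn_ops driver_gpuva_ops = {
>>> 		.sm_step_map = driver_sm_step_map,
>>> 		.sm_step_remap = driver_sm_step_remap,
>>> 		.sm_step_unmap = driver_sm_step_unmap,
>>> 	};
>>>
>>> 	/* No drm_gpuva_ops list is allocated; the callbacks above are invoked
>>> 	 * directly while walking the VA space for the requested mapping.
>>> 	 */
>>> 	ret = drm_gpuva_sm_map(mgr, driver_priv, req_addr, req_range,
>>> 			       req_obj, req_offset);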
>>
>> OK. I guess I need to look at the resulting implementation to fully digest
>> this.
>>
>> Thanks,
>>
>> Thomas
>>
>>
>>>
>>>> /Thomas
>>>>
>>>>
>>>>>> /Thomas
>>>>>>
>>>>>>
>>>>>>>>>>> +
>>>>>>>>>>> +	/**
>>>>>>>>>>> +	 * @evict: structure holding the evict list and evict list lock
>>>>>>>>>>> +	 */
>>>>>>>>>>> +	struct {
>>>>>>>>>>> +		/**
>>>>>>>>>>> +		 * @list: &list_head storing &drm_gem_objects currently being
>>>>>>>>>>> +		 * evicted
>>>>>>>>>>> +		 */
>>>>>>>>>>> +		struct list_head list;
>>>>>>>>>>> +
>>>>>>>>>>> +		/**
>>>>>>>>>>> +		 * @lock: spinlock to protect the evict list against concurrent
>>>>>>>>>>> +		 * insertion / removal of different &drm_gpuva_gems
>>>>>>>>>>> +		 */
>>>>>>>>>>> +		spinlock_t lock;
>>>>>>>>>>> +	} evict;
>>>>>>>>>>>        };
>>>>>>>>>>>        void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
>>>>>>>>>>> +			    struct drm_device *drm,
>>>>>>>>>>>        			    const char *name,
>>>>>>>>>>>        			    u64 start_offset, u64 range,
>>>>>>>>>>>        			    u64 reserve_offset, u64 reserve_range,
>>>>>>>>>>>        			    const struct drm_gpuva_fn_ops *ops);
>>>>>>>>>>>        void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr);
>>>>>>>>>>> +/**
>>>>>>>>>>> + * DRM_GPUVA_EXEC - returns the &drm_gpuva_managers &drm_exec instance
>>>>>>>>>>> + * @mgr: the &drm_gpuva_managers to return the &drm_exec instance for
>>>>>>>>>>> + */
>>>>>>>>>>> +#define DRM_GPUVA_EXEC(mgr)	&(mgr)->exec
>>>>>>>>>> A struct ww_acquire_ctx and thus a drm_exec is fundamentally per task and
>>>>>>>>>> should typically be allocated on the stack. Otherwise you'd need to protect
>>>>>>>>>> the mgr->exec member with an exclusive lock throughout the locking process,
>>>>>>>>>> and that's not what we want.
>>>>>>>>> Oh, good point. I think it works in Nouveau, because there it's implicitly
>>>>>>>>> protected with the job submission lock.
>>>>>>>>>
>>>>>>>>>> Did you consider subclassing a drm_exec for drm_gpuva purposes and adding
>>>>>>>>>> needed ops to it? Like so:
>>>>>>>>> That's a good idea, will take this into V2.
>>>>>>>> Actually, I'm not fully sure that was a good idea: I now have a working
>>>>>>>> version of Xe ported over to drm_exec, having these helpers in mind and with
>>>>>>>> the intention to start using them as they mature. What I found, though, is
>>>>>>>> that open-coding the drm_exec loop is not all that bad, but that building
>>>>>>>> blocks that can be called from within the loop are useful:
>>>>>>>>
>>>>>>>> Like the drm_gpuva_prepare_objects() and an imaginary
>>>>>>>> drm_gpuva_prepare_gpuva() that locks the vm resv and the resv of the object
>>>>>>>> the gpuva points to (if different). And drm_gpuva_prepare_array(), although
>>>>>>>> we don't use it within Xe. That means you can use these building blocks like
>>>>>>>> helpers and avoid the fn() callback by open-coding instead.
>>>>>>>>
>>>>>>>> But I guess YMMV.
>>>>>>> That's exactly why those building blocks are exported, I already had in mind
>>>>>>> that there might be drivers which still want to open-code the drm_exec loop,
>>>>>>> while others might just want a simple interface to lock everything.
>>>>>>>
>>>>>>> I still think it is a good idea, but I'd keep that as simple as possible. And
>>>>>>> for everything else just let the driver open-code it and use the "building
>>>>>>> blocks" - will also expand the bulding blocks to what you mentioned above.
>>>>>>>
>>>>>>>>>> struct drm_gpuva_exec_ops {
>>>>>>>>>>          int (*fn) (struct drm_gpuva_exec *exec, int num_fences);
>>>>>>>>> Is this the fn argument from drm_gpuva_manager_lock_extra()?
>>>>>>>>>
>>>>>>>>>>          int (*bo_validate) (struct drm_gpuva_exec *exec, struct drm_gem_object
>>>>>>>>>> *obj);
>>>>>>>>> I guess we could also keep that within the drm_gpuva_fn_ops? This should always
>>>>>>>>> be the same callback, right?
>>>>>>>>>
>>>>>>>>>> };
>>>>>>>>>>
>>>>>>>>>> struct drm_gpuva_exec {
>>>>>>>>>>          const struct drm_gpuva_exec_ops *ops;
>>>>>>>>>>          struct drm_exec exec;
>>>>>>>>>>          struct drm_gpuva_manager *mgr;
>>>>>>>>>> };
>>>>>>>>>>
>>>>>>>>>> Although I'd actually expect bo_validate to be part of fn in the typical
>>>>>>>>>> case. The drm_gpuva_exec would then be allocated by the caller on the stack.
>>>>>>>>> This doesn't sound like my assumption about fn() above is correct.
>>>>>>>> Well, one important thing in our conversion is that ttm_bo_validate() needs
>>>>>>>> to be in the until_all_locked() loop. We want to soon be able to use
>>>>>>>> sleeping locks for eviction, so a xe_bo_validate() would, at least
>>>>>>>> temporarily, add locked objects to the drm_exec list of locked objects. That
>>>>>>>> means everything that may end up calling validate deep within the call chain
>>>>>>>> needs to be part of the until_all_locked() loop, so our
>>>>>>>> drm_gpuva_manager_lock_extra() fn callback would include those validates and
>>>>>>>> look different all the time. That's why open-coding isn't all that
>>>>>>>> bad...
>>>>>>> Oh, I see. You indeed want to call validate() from within until_all_locked().
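>>>>>>>
>>>>>>> I.e. something along the following lines. This is only a rough sketch; the
>>>>>>> signature of drm_gpuva_prepare_objects() and the driver_validate_evicted()
>>>>>>> helper are assumptions here:
>>>>>>>
>>>>>>> 	struct drm_exec exec;
>>>>>>> 	int ret = 0;
>>>>>>>
>>>>>>> 	drm_exec_init(&exec, DRM_EXEC_INTERRUPTIBLE_WAIT);
>>>>>>> 	drm_exec_until_all_locked(&exec) {
>>>>>>> 		/* Lock the VM resv and all extobj resvs. */
>>>>>>> 		ret = drm_gpuva_prepare_objects(mgr, &exec, num_fences);
>>>>>>> 		drm_exec_retry_on_contention(&exec);
>>>>>>> 		if (ret)
>>>>>>> 			goto unlock;
>>>>>>>
>>>>>>> 		/* Validation happens within the loop; BOs locked during
>>>>>>> 		 * eviction end up on the drm_exec list of locked objects too.
>>>>>>> 		 */
>>>>>>> 		ret = driver_validate_evicted(mgr, &exec);
>>>>>>> 		drm_exec_retry_on_contention(&exec);
>>>>>>> 		if (ret)
>>>>>>> 			goto unlock;
>>>>>>> 	}
>>>>>>>
>>>>>>> 	/* ... job submission, fence bookkeeping ... */
>>>>>>>
>>>>>>> unlock:
>>>>>>> 	drm_exec_fini(&exec);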
>>>>>>>
>>>>>>>> /Thomas
>>>>>>>>
>>>>>>>>
>>>>> <snip>
>>


^ permalink raw reply	[flat|nested] 88+ messages in thread

* Re: [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
  2023-09-01  5:59                     ` [Nouveau] " Thomas Hellström (Intel)
  (?)
@ 2023-10-10 20:23                       ` Dave Airlie
  -1 siblings, 0 replies; 88+ messages in thread
From: Dave Airlie @ 2023-10-10 20:23 UTC (permalink / raw)
  To: Thomas Hellström (Intel)
  Cc: Danilo Krummrich, daniel, matthew.brost, thomas.hellstrom,
	sarah.walker, donald.robson, boris.brezillon, christian.koenig,
	faith.ekstrand, bskeggs, Liam.Howlett, nouveau, linux-kernel,
	dri-devel

> I think we're then optimizing for different scenarios. Our compute
> driver will use mostly external objects only, and if shared, I don't
> foresee them bound to many VMs. What saves us currently here is that in
> compute mode we only really traverse the extobj list after a preempt
> fence wait, or when a vm is using a new context for the first time. So
> vm's extobj list is pretty large. Each bo's vma list will typically be
> pretty small.

Can I ask why we are optimising for this userspace? This seems
incredibly broken.

We've had this sort of problem in the past with Intel letting the tail
wag the horse; does anyone remember optimising relocations for a
userspace that didn't actually need to use relocations?

We need to ask why this userspace is doing this; can we get some
pointers to it? A compute driver should have no reason to use mostly
external objects; the OpenCL and level0 APIs should be good enough to
figure this out.

Dave.

^ permalink raw reply	[flat|nested] 88+ messages in thread

* Re: [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
  2023-10-10 20:23                       ` Dave Airlie
  (?)
@ 2023-10-11  7:07                         ` Christian König
  -1 siblings, 0 replies; 88+ messages in thread
From: Christian König @ 2023-10-11  7:07 UTC (permalink / raw)
  To: Dave Airlie, Thomas Hellström (Intel)
  Cc: Danilo Krummrich, daniel, matthew.brost, thomas.hellstrom,
	sarah.walker, donald.robson, boris.brezillon, faith.ekstrand,
	bskeggs, Liam.Howlett, nouveau, linux-kernel, dri-devel

On 10.10.23 at 22:23, Dave Airlie wrote:
>> I think we're then optimizing for different scenarios. Our compute
>> driver will use mostly external objects only, and if shared, I don't
>> foresee them bound to many VMs. What saves us currently here is that in
>> compute mode we only really traverse the extobj list after a preempt
>> fence wait, or when a vm is using a new context for the first time. So
>> vm's extobj list is pretty large. Each bo's vma list will typically be
>> pretty small.
> Can I ask why we are optimising for this userspace, this seems
> incredibly broken.
>
> We've had this sort of problem in the past with Intel letting the tail
> wag the horse, does anyone remember optimising relocations for a
> userspace that didn't actually need to use relocations?
>
> We need to ask why this userspace is doing this, can we get some
> pointers to it? compute driver should have no reason to use mostly
> external objects, the OpenCL and level0 APIs should be good enough to
> figure this out.

Well, that is a pretty normal use case; AMD works the same way.

In a multi-GPU compute stack you have mostly all of the data shared between
different hardware devices.

As I said before, looking at just the Vulkan use case is not a good idea
at all.

Christian.

>
> Dave.


^ permalink raw reply	[flat|nested] 88+ messages in thread

* Re: [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
  2023-10-10 20:23                       ` Dave Airlie
  (?)
@ 2023-10-11  8:22                         ` Thomas Hellström
  -1 siblings, 0 replies; 88+ messages in thread
From: Thomas Hellström @ 2023-10-11  8:22 UTC (permalink / raw)
  To: Dave Airlie, Thomas Hellström (Intel)
  Cc: Danilo Krummrich, daniel, matthew.brost, sarah.walker,
	donald.robson, boris.brezillon, christian.koenig, faith.ekstrand,
	bskeggs, Liam.Howlett, nouveau, linux-kernel, dri-devel

On Wed, 2023-10-11 at 06:23 +1000, Dave Airlie wrote:
> > I think we're then optimizing for different scenarios. Our compute
> > driver will use mostly external objects only, and if shared, I
> > don't
> > foresee them bound to many VMs. What saves us currently here is that
> > in
> > compute mode we only really traverse the extobj list after a
> > preempt
> > fence wait, or when a vm is using a new context for the first time.
> > So
> > vm's extobj list is pretty large. Each bo's vma list will typically
> > be
> > pretty small.
> 
> Can I ask why we are optimising for this userspace, this seems
> incredibly broken.

First, judging from the discussion with Christian this is not really
uncommon. There *are* ways we can play tricks of assorted cleverness in
the KMD to reduce the extobj list size, but doing that in the KMD
wouldn't be much different from accepting a large extobj list size and
doing what we can to reduce the overhead of iterating over it.

Second, the discussion here really was about whether we should be using
a lower-level lock to allow for async state updates, with a rather
complex mechanism involving weak reference counting and a requirement to
drop the locks within the loop to avoid lock inversion. If that were
a simplification with little or no overhead, all fine, but IMO it's not
a simplification?
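
To make that concrete, here is roughly what I mean by keeping it simple
(all names are placeholders, not the actual GPUVA manager interfaces; it
assumes the extobj list is protected by an outer VM lock that is held
across the whole drm_exec loop, so no weak references and no dropping of
locks mid-iteration):

struct my_vm_bo {
	struct drm_gem_object *obj;
	struct list_head extobj_link;
};

struct my_gpuvm {
	struct mutex lock;		/* outer lock, protects extobj_list */
	struct list_head extobj_list;
};

/*
 * Sketch: called with vm->lock held, from within a
 * drm_exec_until_all_locked() block; drm_exec takes care of the
 * -EDEADLK backoff and retry, so the list never has to be unlocked
 * while we iterate over it.
 */
static int my_gpuvm_prepare_extobjs(struct my_gpuvm *vm,
				    struct drm_exec *exec,
				    unsigned int num_fences)
{
	struct my_vm_bo *vm_bo;
	int ret;

	list_for_each_entry(vm_bo, &vm->extobj_list, extobj_link) {
		ret = drm_exec_prepare_obj(exec, vm_bo->obj, num_fences);
		if (ret)
			return ret;
	}

	return 0;
}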

> 
> We've had this sort of problem in the past with Intel letting the
> tail
> wag the horse, does anyone remember optimising relocations for a
> userspace that didn't actually need to use relocations?

> 
> We need to ask why this userspace is doing this, can we get some
> pointers to it? compute driver should have no reason to use mostly
> external objects, the OpenCL and level0 APIs should be good enough to
> figure this out.

TBH, for the compute UMD case I'd be prepared to drop the *performance*
argument for fine-grained locking of the extobj list, since it's really
only traversed on new contexts and on preemption. But as Christian
mentions there might be other cases. We should perhaps figure those out
and document them?

/Thomas


> 
> Dave.


^ permalink raw reply	[flat|nested] 88+ messages in thread

* Re: [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
  2023-10-11  7:07                         ` [Nouveau] " Christian König
  (?)
@ 2023-10-12 10:33                           ` Dave Airlie
  -1 siblings, 0 replies; 88+ messages in thread
From: Dave Airlie @ 2023-10-12 10:33 UTC (permalink / raw)
  To: Christian König
  Cc: Thomas Hellström (Intel),
	Danilo Krummrich, daniel, matthew.brost, thomas.hellstrom,
	sarah.walker, donald.robson, boris.brezillon, faith.ekstrand,
	bskeggs, Liam.Howlett, nouveau, linux-kernel, dri-devel

On Wed, 11 Oct 2023 at 17:07, Christian König <christian.koenig@amd.com> wrote:
>
> Am 10.10.23 um 22:23 schrieb Dave Airlie:
> >> I think we're then optimizing for different scenarios. Our compute
> >> driver will use mostly external objects only, and if shared, I don't
> >> foresee them bound to many VMs. What saves us currently here is that in
> >> compute mode we only really traverse the extobj list after a preempt
> >> fence wait, or when a vm is using a new context for the first time. So
> >> vm's extobj list is pretty large. Each bo's vma list will typically be
> >> pretty small.
> > Can I ask why we are optimising for this userspace, this seems
> > incredibly broken.
> >
> > We've had this sort of problem in the past with Intel letting the tail
> > wag the horse, does anyone remember optimising relocations for a
> > userspace that didn't actually need to use relocations?
> >
> > We need to ask why this userspace is doing this, can we get some
> > pointers to it? compute driver should have no reason to use mostly
> > external objects, the OpenCL and level0 APIs should be good enough to
> > figure this out.
>
> Well, that is a pretty normal use case; AMD works the same way.
>
> In a multi-GPU compute stack, most of the data is shared between the
> different hardware devices.
>
> As I said before, looking at just the Vulkan use case is not a good idea
> at all.
>

It's okay, I don't think anyone is doing that; some of these
use-cases are buried in server land and you guys don't communicate
them very well.

Multi-GPU compute would, I'd hope, be moving towards HMM/SVM type
solutions though?

I'm also not into looking at use-cases that used to be important but
might not be as important going forward.

Dave.


> Christian.
>
> >
> > Dave.
>

^ permalink raw reply	[flat|nested] 88+ messages in thread

* Re: [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
  2023-10-12 10:33                           ` [Nouveau] " Dave Airlie
  (?)
@ 2023-10-12 12:35                             ` Christian König
  -1 siblings, 0 replies; 88+ messages in thread
From: Christian König @ 2023-10-12 12:35 UTC (permalink / raw)
  To: Dave Airlie
  Cc: Thomas Hellström (Intel),
	Danilo Krummrich, daniel, matthew.brost, thomas.hellstrom,
	sarah.walker, donald.robson, boris.brezillon, faith.ekstrand,
	bskeggs, Liam.Howlett, nouveau, linux-kernel, dri-devel

Am 12.10.23 um 12:33 schrieb Dave Airlie:
> On Wed, 11 Oct 2023 at 17:07, Christian König <christian.koenig@amd.com> wrote:
>> Am 10.10.23 um 22:23 schrieb Dave Airlie:
>>>> I think we're then optimizing for different scenarios. Our compute
>>>> driver will use mostly external objects only, and if shared, I don't
>>>> foresee them bound to many VMs. What saves us currently here is that in
>>>> compute mode we only really traverse the extobj list after a preempt
>>>> fence wait, or when a vm is using a new context for the first time. So
>>>> vm's extobj list is pretty large. Each bo's vma list will typically be
>>>> pretty small.
>>> Can I ask why we are optimising for this userspace, this seems
>>> incredibly broken.
>>>
>>> We've had this sort of problem in the past with Intel letting the tail
>>> wag the horse, does anyone remember optimising relocations for a
>>> userspace that didn't actually need to use relocations?
>>>
>>> We need to ask why this userspace is doing this, can we get some
>>> pointers to it? compute driver should have no reason to use mostly
>>> external objects, the OpenCL and level0 APIs should be good enough to
>>> figure this out.
>> Well, that is a pretty normal use case; AMD works the same way.
>>
>> In a multi-GPU compute stack, most of the data is shared between the
>> different hardware devices.
>>
>> As I said before, looking at just the Vulkan use case is not a good idea
>> at all.
>>
> It's okay, I don't think anyone is doing that; some of these
> use-cases are buried in server land and you guys don't communicate
> them very well.

Yeah, well everybody is trying very hard to get away from those 
approaches :)

But so far there hasn't been any breakthrough.

>
> Multi-GPU compute would, I'd hope, be moving towards HMM/SVM type
> solutions though?

Unfortunately not in the foreseeable future. HMM seems more and more 
like a dead end, at least for AMD.

AMD still has hardware support in all of their MI* products, but for 
Navi the features necessary for implementing HMM have been dropped. And 
it looks more and more like they are not going to come back.

In addition to that, from the software side Felix summarized it in the HMM 
peer2peer discussion thread recently quite well. A buffer-object-based 
approach is not only simpler to handle, but also performance-wise 
multiple orders of magnitude faster.

> I'm also not into looking at use-cases that used to be important but
> might not be as important going forward.

Well multimedia applications and OpenGL are still around, but it's not 
the main focus any more.

Christian.

>
> Dave.
>
>
>> Christian.
>>
>>> Dave.


^ permalink raw reply	[flat|nested] 88+ messages in thread

* Re: [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
  2023-10-12 12:35                             ` [Nouveau] " Christian König
  (?)
@ 2023-10-12 13:15                               ` Daniel Vetter
  -1 siblings, 0 replies; 88+ messages in thread
From: Daniel Vetter @ 2023-10-12 13:15 UTC (permalink / raw)
  To: Christian König
  Cc: Dave Airlie, Thomas Hellström (Intel),
	Danilo Krummrich, daniel, matthew.brost, thomas.hellstrom,
	sarah.walker, donald.robson, boris.brezillon, faith.ekstrand,
	bskeggs, Liam.Howlett, nouveau, linux-kernel, dri-devel

On Thu, Oct 12, 2023 at 02:35:15PM +0200, Christian König wrote:
> Am 12.10.23 um 12:33 schrieb Dave Airlie:
> > On Wed, 11 Oct 2023 at 17:07, Christian König <christian.koenig@amd.com> wrote:
> > > Am 10.10.23 um 22:23 schrieb Dave Airlie:
> > > > > I think we're then optimizing for different scenarios. Our compute
> > > > > driver will use mostly external objects only, and if shared, I don't
> > > > > foresee them bound to many VMs. What saves us currently here is that in
> > > > > compute mode we only really traverse the extobj list after a preempt
> > > > > fence wait, or when a vm is using a new context for the first time. So
> > > > > vm's extobj list is pretty large. Each bo's vma list will typically be
> > > > > pretty small.
> > > > Can I ask why we are optimising for this userspace, this seems
> > > > incredibly broken.
> > > > 
> > > > We've had this sort of problem in the past with Intel letting the tail
> > > > wag the horse, does anyone remember optimising relocations for a
> > > > userspace that didn't actually need to use relocations?
> > > > 
> > > > We need to ask why this userspace is doing this, can we get some
> > > > pointers to it? compute driver should have no reason to use mostly
> > > > external objects, the OpenCL and level0 APIs should be good enough to
> > > > figure this out.
> > > Well, that is a pretty normal use case; AMD works the same way.
> > >
> > > In a multi-GPU compute stack, most of the data is shared between the
> > > different hardware devices.
> > >
> > > As I said before, looking at just the Vulkan use case is not a good idea
> > > at all.
> > > 
> > It's okay, I don't think anyone is doing that; some of these
> > use-cases are buried in server land and you guys don't communicate
> > them very well.
> 
> Yeah, well everybody is trying very hard to get away from those approaches
> :)
> 
> But so far there hasn't been any breakthrough.
> 
> > 
> > Multi-GPU compute would, I'd hope, be moving towards HMM/SVM type
> > solutions though?
> 
> Unfortunately not in the foreseeable future. HMM seems more and more like a
> dead end, at least for AMD.
> 
> AMD still has hardware support in all of their MI* products, but for Navi
> the features necessary for implementing HMM have been dropped. And it looks
> more and more like they are not going to come back.
> 
> In addition to that, from the software side Felix summarized it in the HMM
> peer2peer discussion thread recently quite well. A buffer-object-based
> approach is not only simpler to handle, but also performance-wise multiple
> orders of magnitude faster.

This matches what I'm hearing from all over. Turns out that handling page
faults in full generality in a compute/accel device (not just GPU) is just
too damn hard, at least for anyone who isn't Nvidia. Usually time-bound
preemption guarantees are the first to go, followed right after by a long
list of more fixed-function hardware blocks that outright can't cope with
page faults.

There are so many corner cases where it breaks down that I feel like
device-driver-allocated memory of one flavor or another will stick around
for a very long time.

This isn't even counting the software challenges.
-Sima

> > I'm also not into looking at use-cases that used to be important but
> > might not be as important going forward.
> 
> Well multimedia applications and OpenGL are still around, but it's not the
> main focus any more.
> 
> Christian.
> 
> > 
> > Dave.
> > 
> > 
> > > Christian.
> > > 
> > > > Dave.
> 

-- 
Daniel Vetter
Software Engineer, Intel Corporation
http://blog.ffwll.ch

^ permalink raw reply	[flat|nested] 88+ messages in thread

* Re: [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation
  2023-10-12 12:35                             ` [Nouveau] " Christian König
@ 2023-10-13  5:19                               ` Christoph Hellwig
  -1 siblings, 0 replies; 88+ messages in thread
From: Christoph Hellwig @ 2023-10-13  5:19 UTC (permalink / raw)
  To: Christian König
  Cc: Dave Airlie, Thomas Hellström (Intel),
	Danilo Krummrich, daniel, matthew.brost, thomas.hellstrom,
	sarah.walker, donald.robson, boris.brezillon, faith.ekstrand,
	bskeggs, Liam.Howlett, nouveau, linux-kernel, dri-devel

On Thu, Oct 12, 2023 at 02:35:15PM +0200, Christian König wrote:
> In addition to that, from the software side Felix summarized it in the HMM
> peer2peer discussion thread recently quite well.

Do you have a pointer to that discussion?


^ permalink raw reply	[flat|nested] 88+ messages in thread

end of thread, other threads:[~2023-10-13  5:20 UTC | newest]

Thread overview: 88+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2023-08-20 21:53 [Nouveau] [PATCH drm-misc-next 0/3] [RFC] DRM GPUVA Manager GPU-VM features Danilo Krummrich
2023-08-20 21:53 ` Danilo Krummrich
2023-08-20 21:53 ` Danilo Krummrich
2023-08-20 21:53 ` [Nouveau] [PATCH drm-misc-next 1/3] drm: drm_exec: build always builtin Danilo Krummrich
2023-08-20 21:53   ` Danilo Krummrich
2023-08-20 21:53   ` Danilo Krummrich
2023-08-21  9:49   ` Christian König
2023-08-21  9:49     ` Christian König
2023-08-21  9:49     ` [Nouveau] " Christian König
2023-08-21 19:14     ` Danilo Krummrich
2023-08-21 19:14       ` Danilo Krummrich
2023-08-21 19:14       ` Danilo Krummrich
2023-08-20 21:53 ` [Nouveau] [PATCH drm-misc-next 2/3] drm/gpuva_mgr: generalize dma_resv/extobj handling and GEM validation Danilo Krummrich
2023-08-20 21:53   ` Danilo Krummrich
2023-08-20 21:53   ` Danilo Krummrich
2023-08-22  1:31   ` kernel test robot
2023-08-22  1:31     ` kernel test robot
2023-08-22  1:31     ` [Nouveau] " kernel test robot
2023-08-22  2:18   ` kernel test robot
2023-08-22  2:18     ` kernel test robot
2023-08-22  2:18     ` [Nouveau] " kernel test robot
2023-08-22  3:01   ` kernel test robot
2023-08-22  3:01     ` kernel test robot
2023-08-22  3:01     ` [Nouveau] " kernel test robot
2023-08-30  7:27   ` Thomas Hellström (Intel)
2023-08-30  7:27     ` Thomas Hellström (Intel)
2023-08-30 12:49     ` [Nouveau] " Danilo Krummrich
2023-08-30 12:49       ` Danilo Krummrich
2023-08-30 12:49       ` Danilo Krummrich
2023-08-30 13:42       ` [Nouveau] " Thomas Hellström (Intel)
2023-08-30 13:42         ` Thomas Hellström (Intel)
2023-08-30 13:42         ` Thomas Hellström (Intel)
2023-08-30 15:00         ` [Nouveau] " Danilo Krummrich
2023-08-30 15:00           ` Danilo Krummrich
2023-08-30 15:00           ` Danilo Krummrich
2023-08-31  9:04           ` [Nouveau] " Thomas Hellström (Intel)
2023-08-31  9:04             ` Thomas Hellström (Intel)
2023-08-31  9:04             ` Thomas Hellström (Intel)
2023-08-31 11:18             ` [Nouveau] " Danilo Krummrich
2023-08-31 11:18               ` Danilo Krummrich
2023-08-31 11:18               ` Danilo Krummrich
2023-08-31 16:53               ` Thomas Hellström (Intel)
2023-08-31 16:53                 ` Thomas Hellström (Intel)
2023-08-31 16:53                 ` [Nouveau] " Thomas Hellström (Intel)
2023-08-31 17:23                 ` Thomas Hellström
2023-08-31 17:23                   ` [Nouveau] " Thomas Hellström
2023-08-31 17:23                   ` Thomas Hellström
2023-08-31 19:07                 ` Danilo Krummrich
2023-08-31 19:07                   ` Danilo Krummrich
2023-08-31 19:07                   ` [Nouveau] " Danilo Krummrich
2023-09-01  5:59                   ` Thomas Hellström (Intel)
2023-09-01  5:59                     ` Thomas Hellström (Intel)
2023-09-01  5:59                     ` [Nouveau] " Thomas Hellström (Intel)
2023-09-01 12:10                     ` Danilo Krummrich
2023-09-01 12:10                       ` Danilo Krummrich
2023-09-01 12:10                       ` Danilo Krummrich
2023-09-06 14:20                       ` [Nouveau] " Danilo Krummrich
2023-09-06 14:20                         ` Danilo Krummrich
2023-09-06 14:20                         ` Danilo Krummrich
2023-10-10 20:23                     ` Dave Airlie
2023-10-10 20:23                       ` [Nouveau] " Dave Airlie
2023-10-10 20:23                       ` Dave Airlie
2023-10-11  7:07                       ` Christian König
2023-10-11  7:07                         ` Christian König
2023-10-11  7:07                         ` [Nouveau] " Christian König
2023-10-12 10:33                         ` Dave Airlie
2023-10-12 10:33                           ` Dave Airlie
2023-10-12 10:33                           ` [Nouveau] " Dave Airlie
2023-10-12 12:35                           ` Christian König
2023-10-12 12:35                             ` Christian König
2023-10-12 12:35                             ` [Nouveau] " Christian König
2023-10-12 13:15                             ` Daniel Vetter
2023-10-12 13:15                               ` Daniel Vetter
2023-10-12 13:15                               ` [Nouveau] " Daniel Vetter
2023-10-13  5:19                             ` Christoph Hellwig
2023-10-13  5:19                               ` [Nouveau] " Christoph Hellwig
2023-10-11  8:22                       ` Thomas Hellström
2023-10-11  8:22                         ` Thomas Hellström
2023-10-11  8:22                         ` [Nouveau] " Thomas Hellström
2023-08-30  7:48   ` Christian König
2023-08-30  7:48     ` Christian König
2023-08-30  7:48     ` Christian König
2023-08-30 13:05     ` [Nouveau] " Danilo Krummrich
2023-08-30 13:05       ` Danilo Krummrich
2023-08-30 13:05       ` Danilo Krummrich
2023-08-20 21:53 ` [Nouveau] [PATCH drm-misc-next 3/3] drm/nouveau: gpuva mgr dma-resv/extobj handling, " Danilo Krummrich
2023-08-20 21:53   ` Danilo Krummrich
2023-08-20 21:53   ` Danilo Krummrich

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.