All of lore.kernel.org
 help / color / mirror / Atom feed
* [PATCH 0/5] drm/ttm, amdgpu: fix crashes due to shadow page table evictions
@ 2016-12-15 17:10 Nicolai Hähnle
       [not found] ` <1481821815-9971-1-git-send-email-nhaehnle-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
  0 siblings, 1 reply; 15+ messages in thread
From: Nicolai Hähnle @ 2016-12-15 17:10 UTC (permalink / raw)
  To: amd-gfx-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW
  Cc: dri-devel-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW

Fix a bunch of related crashes in amdgpu that occur when shadow page tables
are kicked out of the GART.

One of the issues was that during command submission, we rely on a
device-global evictions counter to skip some of the work of page-table
validation. The driver was never informed of evictions from GART, so this
series adds an evict parameter to ttm_bo_driver::move_notify.

There's still the evict parameter on ttm_bo_driver::move which is used by
radeon and nouveau for the call to ttm_bo_move_accel_cleanup. The 'evict'
parameter there should probably be more accurately called 'wait', but unless
that wait can always be avoided, the evict parameter on ttm_bo_driver::move
needs to stay.

Please review!
Nicolai
---
 drivers/gpu/drm/amd/amdgpu/amdgpu_gem.c    | 20 ++++++++++++++------
 drivers/gpu/drm/amd/amdgpu/amdgpu_object.c |  5 +++++
 drivers/gpu/drm/amd/amdgpu/amdgpu_object.h | 12 +++++++++++-
 drivers/gpu/drm/amd/amdgpu/amdgpu_ttm.c    |  4 ----
 drivers/gpu/drm/nouveau/nouveau_bo.c       |  3 ++-
 drivers/gpu/drm/qxl/qxl_ttm.c              |  1 +
 drivers/gpu/drm/radeon/radeon_object.c     |  1 +
 drivers/gpu/drm/radeon/radeon_object.h     |  1 +
 drivers/gpu/drm/ttm/ttm_bo.c               |  8 ++++----
 drivers/gpu/drm/virtio/virtgpu_ttm.c       |  1 +
 drivers/gpu/drm/vmwgfx/vmwgfx_buffer.c     |  1 +
 include/drm/ttm/ttm_bo_driver.h            | 10 ++++++++--
 12 files changed, 49 insertions(+), 18 deletions(-)

_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply	[flat|nested] 15+ messages in thread

* [PATCH 1/5] drm/ttm: add evict parameter to ttm_bo_driver::move_notify
       [not found] ` <1481821815-9971-1-git-send-email-nhaehnle-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
@ 2016-12-15 17:10   ` Nicolai Hähnle
       [not found]     ` <1481821815-9971-2-git-send-email-nhaehnle-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
  2016-12-15 17:10   ` [PATCH 2/5] drm/amd/amdgpu: move eviction counting to amdgpu_bo_move_notify Nicolai Hähnle
                     ` (3 subsequent siblings)
  4 siblings, 1 reply; 15+ messages in thread
From: Nicolai Hähnle @ 2016-12-15 17:10 UTC (permalink / raw)
  To: amd-gfx-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW
  Cc: Nicolai Hähnle, dri-devel-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW

From: Nicolai Hähnle <nicolai.haehnle@amd.com>

Ensure that the driver can listen to evictions even when they don't take the
path through ttm_bo_driver::move.

This is crucial for amdgpu, which relies on an eviction counter to skip
re-binding page tables when possible.

Signed-off-by: Nicolai Hähnle <nicolai.haehnle@amd.com>
---
 drivers/gpu/drm/amd/amdgpu/amdgpu_object.c |  1 +
 drivers/gpu/drm/amd/amdgpu/amdgpu_object.h |  3 ++-
 drivers/gpu/drm/nouveau/nouveau_bo.c       |  3 ++-
 drivers/gpu/drm/qxl/qxl_ttm.c              |  1 +
 drivers/gpu/drm/radeon/radeon_object.c     |  1 +
 drivers/gpu/drm/radeon/radeon_object.h     |  1 +
 drivers/gpu/drm/ttm/ttm_bo.c               |  8 ++++----
 drivers/gpu/drm/virtio/virtgpu_ttm.c       |  1 +
 drivers/gpu/drm/vmwgfx/vmwgfx_buffer.c     |  1 +
 include/drm/ttm/ttm_bo_driver.h            | 10 ++++++++--
 10 files changed, 22 insertions(+), 8 deletions(-)

diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_object.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_object.c
index bf79b73..c29db99 100644
--- a/drivers/gpu/drm/amd/amdgpu/amdgpu_object.c
+++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_object.c
@@ -842,20 +842,21 @@ int amdgpu_bo_get_metadata(struct amdgpu_bo *bo, void *buffer,
 
 	if (metadata_size)
 		*metadata_size = bo->metadata_size;
 	if (flags)
 		*flags = bo->metadata_flags;
 
 	return 0;
 }
 
 void amdgpu_bo_move_notify(struct ttm_buffer_object *bo,
+			   bool evict,
 			   struct ttm_mem_reg *new_mem)
 {
 	struct amdgpu_device *adev = amdgpu_ttm_adev(bo->bdev);
 	struct amdgpu_bo *abo;
 	struct ttm_mem_reg *old_mem = &bo->mem;
 
 	if (!amdgpu_ttm_bo_is_amdgpu_bo(bo))
 		return;
 
 	abo = container_of(bo, struct amdgpu_bo, tbo);
diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_object.h b/drivers/gpu/drm/amd/amdgpu/amdgpu_object.h
index 5cbf59e..4306b2f 100644
--- a/drivers/gpu/drm/amd/amdgpu/amdgpu_object.h
+++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_object.h
@@ -148,21 +148,22 @@ void amdgpu_bo_fini(struct amdgpu_device *adev);
 int amdgpu_bo_fbdev_mmap(struct amdgpu_bo *bo,
 				struct vm_area_struct *vma);
 int amdgpu_bo_set_tiling_flags(struct amdgpu_bo *bo, u64 tiling_flags);
 void amdgpu_bo_get_tiling_flags(struct amdgpu_bo *bo, u64 *tiling_flags);
 int amdgpu_bo_set_metadata (struct amdgpu_bo *bo, void *metadata,
 			    uint32_t metadata_size, uint64_t flags);
 int amdgpu_bo_get_metadata(struct amdgpu_bo *bo, void *buffer,
 			   size_t buffer_size, uint32_t *metadata_size,
 			   uint64_t *flags);
 void amdgpu_bo_move_notify(struct ttm_buffer_object *bo,
-				  struct ttm_mem_reg *new_mem);
+			   bool evict,
+			   struct ttm_mem_reg *new_mem);
 int amdgpu_bo_fault_reserve_notify(struct ttm_buffer_object *bo);
 void amdgpu_bo_fence(struct amdgpu_bo *bo, struct dma_fence *fence,
 		     bool shared);
 u64 amdgpu_bo_gpu_offset(struct amdgpu_bo *bo);
 int amdgpu_bo_backup_to_shadow(struct amdgpu_device *adev,
 			       struct amdgpu_ring *ring,
 			       struct amdgpu_bo *bo,
 			       struct reservation_object *resv,
 			       struct dma_fence **fence, bool direct);
 int amdgpu_bo_restore_from_shadow(struct amdgpu_device *adev,
diff --git a/drivers/gpu/drm/nouveau/nouveau_bo.c b/drivers/gpu/drm/nouveau/nouveau_bo.c
index e0c0007..6fa1521 100644
--- a/drivers/gpu/drm/nouveau/nouveau_bo.c
+++ b/drivers/gpu/drm/nouveau/nouveau_bo.c
@@ -1187,21 +1187,22 @@ nouveau_bo_move_flips(struct ttm_buffer_object *bo, bool evict, bool intr,
 	ret = nouveau_bo_move_m2mf(bo, true, intr, no_wait_gpu, new_mem);
 	if (ret)
 		goto out;
 
 out:
 	ttm_bo_mem_put(bo, &tmp_mem);
 	return ret;
 }
 
 static void
-nouveau_bo_move_ntfy(struct ttm_buffer_object *bo, struct ttm_mem_reg *new_mem)
+nouveau_bo_move_ntfy(struct ttm_buffer_object *bo, bool evict,
+		     struct ttm_mem_reg *new_mem)
 {
 	struct nouveau_bo *nvbo = nouveau_bo(bo);
 	struct nvkm_vma *vma;
 
 	/* ttm can now (stupidly) pass the driver bos it didn't create... */
 	if (bo->destroy != nouveau_bo_del_ttm)
 		return;
 
 	list_for_each_entry(vma, &nvbo->vma_list, head) {
 		if (new_mem && new_mem->mem_type != TTM_PL_SYSTEM &&
diff --git a/drivers/gpu/drm/qxl/qxl_ttm.c b/drivers/gpu/drm/qxl/qxl_ttm.c
index 1176133..f3939a9 100644
--- a/drivers/gpu/drm/qxl/qxl_ttm.c
+++ b/drivers/gpu/drm/qxl/qxl_ttm.c
@@ -360,20 +360,21 @@ static int qxl_bo_move(struct ttm_buffer_object *bo,
 
 	if (old_mem->mem_type == TTM_PL_SYSTEM && bo->ttm == NULL) {
 		qxl_move_null(bo, new_mem);
 		return 0;
 	}
 	return ttm_bo_move_memcpy(bo, interruptible, no_wait_gpu,
 				  new_mem);
 }
 
 static void qxl_bo_move_notify(struct ttm_buffer_object *bo,
+			       bool evict,
 			       struct ttm_mem_reg *new_mem)
 {
 	struct qxl_bo *qbo;
 	struct qxl_device *qdev;
 
 	if (!qxl_ttm_bo_is_qxl_bo(bo))
 		return;
 	qbo = to_qxl_bo(bo);
 	qdev = qbo->gem_base.dev->dev_private;
 
diff --git a/drivers/gpu/drm/radeon/radeon_object.c b/drivers/gpu/drm/radeon/radeon_object.c
index 41b72ce..74b2760 100644
--- a/drivers/gpu/drm/radeon/radeon_object.c
+++ b/drivers/gpu/drm/radeon/radeon_object.c
@@ -758,20 +758,21 @@ int radeon_bo_check_tiling(struct radeon_bo *bo, bool has_moved,
 		return 0;
 	}
 
 	if ((bo->surface_reg >= 0) && !has_moved)
 		return 0;
 
 	return radeon_bo_get_surface_reg(bo);
 }
 
 void radeon_bo_move_notify(struct ttm_buffer_object *bo,
+			   bool evict,
 			   struct ttm_mem_reg *new_mem)
 {
 	struct radeon_bo *rbo;
 
 	if (!radeon_ttm_bo_is_radeon_bo(bo))
 		return;
 
 	rbo = container_of(bo, struct radeon_bo, tbo);
 	radeon_bo_check_tiling(rbo, 0, 1);
 	radeon_vm_bo_invalidate(rbo->rdev, rbo);
diff --git a/drivers/gpu/drm/radeon/radeon_object.h b/drivers/gpu/drm/radeon/radeon_object.h
index a10bb3d..9ffd821 100644
--- a/drivers/gpu/drm/radeon/radeon_object.h
+++ b/drivers/gpu/drm/radeon/radeon_object.h
@@ -143,20 +143,21 @@ extern void radeon_bo_fini(struct radeon_device *rdev);
 extern int radeon_bo_list_validate(struct radeon_device *rdev,
 				   struct ww_acquire_ctx *ticket,
 				   struct list_head *head, int ring);
 extern int radeon_bo_set_tiling_flags(struct radeon_bo *bo,
 				u32 tiling_flags, u32 pitch);
 extern void radeon_bo_get_tiling_flags(struct radeon_bo *bo,
 				u32 *tiling_flags, u32 *pitch);
 extern int radeon_bo_check_tiling(struct radeon_bo *bo, bool has_moved,
 				bool force_drop);
 extern void radeon_bo_move_notify(struct ttm_buffer_object *bo,
+				  bool evict,
 				  struct ttm_mem_reg *new_mem);
 extern int radeon_bo_fault_reserve_notify(struct ttm_buffer_object *bo);
 extern int radeon_bo_get_surface_reg(struct radeon_bo *bo);
 extern void radeon_bo_fence(struct radeon_bo *bo, struct radeon_fence *fence,
 			    bool shared);
 
 /*
  * sub allocation
  */
 
diff --git a/drivers/gpu/drm/ttm/ttm_bo.c b/drivers/gpu/drm/ttm/ttm_bo.c
index d506361..d4973e9c 100644
--- a/drivers/gpu/drm/ttm/ttm_bo.c
+++ b/drivers/gpu/drm/ttm/ttm_bo.c
@@ -335,45 +335,45 @@ static int ttm_bo_handle_move_mem(struct ttm_buffer_object *bo,
 			goto out_err;
 
 		if (mem->mem_type != TTM_PL_SYSTEM) {
 			ret = ttm_tt_bind(bo->ttm, mem);
 			if (ret)
 				goto out_err;
 		}
 
 		if (bo->mem.mem_type == TTM_PL_SYSTEM) {
 			if (bdev->driver->move_notify)
-				bdev->driver->move_notify(bo, mem);
+				bdev->driver->move_notify(bo, evict, mem);
 			bo->mem = *mem;
 			mem->mm_node = NULL;
 			goto moved;
 		}
 	}
 
 	if (bdev->driver->move_notify)
-		bdev->driver->move_notify(bo, mem);
+		bdev->driver->move_notify(bo, evict, mem);
 
 	if (!(old_man->flags & TTM_MEMTYPE_FLAG_FIXED) &&
 	    !(new_man->flags & TTM_MEMTYPE_FLAG_FIXED))
 		ret = ttm_bo_move_ttm(bo, interruptible, no_wait_gpu, mem);
 	else if (bdev->driver->move)
 		ret = bdev->driver->move(bo, evict, interruptible,
 					 no_wait_gpu, mem);
 	else
 		ret = ttm_bo_move_memcpy(bo, interruptible, no_wait_gpu, mem);
 
 	if (ret) {
 		if (bdev->driver->move_notify) {
 			struct ttm_mem_reg tmp_mem = *mem;
 			*mem = bo->mem;
 			bo->mem = tmp_mem;
-			bdev->driver->move_notify(bo, mem);
+			bdev->driver->move_notify(bo, false, mem);
 			bo->mem = *mem;
 			*mem = tmp_mem;
 		}
 
 		goto out_err;
 	}
 
 moved:
 	if (bo->evicted) {
 		if (bdev->driver->invalidate_caches) {
@@ -407,21 +407,21 @@ static int ttm_bo_handle_move_mem(struct ttm_buffer_object *bo,
  * Call bo::reserved.
  * Will release GPU memory type usage on destruction.
  * This is the place to put in driver specific hooks to release
  * driver private resources.
  * Will release the bo::reserved lock.
  */
 
 static void ttm_bo_cleanup_memtype_use(struct ttm_buffer_object *bo)
 {
 	if (bo->bdev->driver->move_notify)
-		bo->bdev->driver->move_notify(bo, NULL);
+		bo->bdev->driver->move_notify(bo, false, NULL);
 
 	ttm_tt_destroy(bo->ttm);
 	bo->ttm = NULL;
 	ttm_bo_mem_put(bo, &bo->mem);
 
 	ww_mutex_unlock (&bo->resv->lock);
 }
 
 static void ttm_bo_flush_all_fences(struct ttm_buffer_object *bo)
 {
diff --git a/drivers/gpu/drm/virtio/virtgpu_ttm.c b/drivers/gpu/drm/virtio/virtgpu_ttm.c
index 4a1de9f..10387d7 100644
--- a/drivers/gpu/drm/virtio/virtgpu_ttm.c
+++ b/drivers/gpu/drm/virtio/virtgpu_ttm.c
@@ -379,20 +379,21 @@ static int virtio_gpu_bo_move(struct ttm_buffer_object *bo,
 
 	ret = ttm_bo_wait(bo, interruptible, no_wait_gpu);
 	if (ret)
 		return ret;
 
 	virtio_gpu_move_null(bo, new_mem);
 	return 0;
 }
 
 static void virtio_gpu_bo_move_notify(struct ttm_buffer_object *tbo,
+				      bool evict,
 				      struct ttm_mem_reg *new_mem)
 {
 	struct virtio_gpu_object *bo;
 	struct virtio_gpu_device *vgdev;
 
 	bo = container_of(tbo, struct virtio_gpu_object, tbo);
 	vgdev = (struct virtio_gpu_device *)bo->gem_base.dev->dev_private;
 
 	if (!new_mem || (new_mem->placement & TTM_PL_FLAG_SYSTEM)) {
 		if (bo->hw_res_handle)
diff --git a/drivers/gpu/drm/vmwgfx/vmwgfx_buffer.c b/drivers/gpu/drm/vmwgfx/vmwgfx_buffer.c
index c894a48..caa279b 100644
--- a/drivers/gpu/drm/vmwgfx/vmwgfx_buffer.c
+++ b/drivers/gpu/drm/vmwgfx/vmwgfx_buffer.c
@@ -818,20 +818,21 @@ static int vmw_ttm_fault_reserve_notify(struct ttm_buffer_object *bo)
  * vmw_move_notify - TTM move_notify_callback
  *
  * @bo: The TTM buffer object about to move.
  * @mem: The struct ttm_mem_reg indicating to what memory
  *       region the move is taking place.
  *
  * Calls move_notify for all subsystems needing it.
  * (currently only resources).
  */
 static void vmw_move_notify(struct ttm_buffer_object *bo,
+			    bool evict,
 			    struct ttm_mem_reg *mem)
 {
 	vmw_resource_move_notify(bo, mem);
 	vmw_query_move_notify(bo, mem);
 }
 
 
 /**
  * vmw_swap_notify - TTM move_notify_callback
  *
diff --git a/include/drm/ttm/ttm_bo_driver.h b/include/drm/ttm/ttm_bo_driver.h
index cdbdb40..8b4bec7 100644
--- a/include/drm/ttm/ttm_bo_driver.h
+++ b/include/drm/ttm/ttm_bo_driver.h
@@ -424,23 +424,29 @@ struct ttm_bo_driver {
 	 *
 	 * Called from the map / write / read methods to verify that the
 	 * caller is permitted to access the buffer object.
 	 * This member may be set to NULL, which will refuse this kind of
 	 * access for all buffer objects.
 	 * This function should return 0 if access is granted, -EPERM otherwise.
 	 */
 	int (*verify_access)(struct ttm_buffer_object *bo,
 			     struct file *filp);
 
-	/* hook to notify driver about a driver move so it
-	 * can do tiling things */
+	/**
+	 * Hook to notify driver about a driver move so it
+	 * can do tiling things and book-keeping.
+	 *
+	 * @evict: whether this move is evicting the buffer from the graphics
+	 * address space
+	 */
 	void (*move_notify)(struct ttm_buffer_object *bo,
+			    bool evict,
 			    struct ttm_mem_reg *new_mem);
 	/* notify the driver we are taking a fault on this BO
 	 * and have reserved it */
 	int (*fault_reserve_notify)(struct ttm_buffer_object *bo);
 
 	/**
 	 * notify the driver that we're about to swap out this bo
 	 */
 	void (*swap_notify)(struct ttm_buffer_object *bo);
 
-- 
2.7.4

_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply related	[flat|nested] 15+ messages in thread

* [PATCH 2/5] drm/amd/amdgpu: move eviction counting to amdgpu_bo_move_notify
       [not found] ` <1481821815-9971-1-git-send-email-nhaehnle-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
  2016-12-15 17:10   ` [PATCH 1/5] drm/ttm: add evict parameter to ttm_bo_driver::move_notify Nicolai Hähnle
@ 2016-12-15 17:10   ` Nicolai Hähnle
       [not found]     ` <1481821815-9971-3-git-send-email-nhaehnle-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
  2016-12-15 17:10   ` [PATCH 3/5] drm/amd/amdgpu: add amdgpu_bo_gpu_accessible helper function Nicolai Hähnle
                     ` (2 subsequent siblings)
  4 siblings, 1 reply; 15+ messages in thread
From: Nicolai Hähnle @ 2016-12-15 17:10 UTC (permalink / raw)
  To: amd-gfx-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW
  Cc: Nicolai Hähnle, dri-devel-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW

From: Nicolai Hähnle <nicolai.haehnle@amd.com>

This catches evictions of shadow page tables from the GART. Since shadow
page tables are always stored in system memory, amdgpu_bo_move is never
called for them.

This fixes a crash during command submission that occurs when only a shadow
page table and no other BOs were evicted since the last submission.

Fixes: 1baa439fb2f4e586 ("drm/amdgpu: allocate shadow for pd/pt bo V2")
Signed-off-by: Nicolai Hähnle <nicolai.haehnle@amd.com>
---
 drivers/gpu/drm/amd/amdgpu/amdgpu_object.c | 4 ++++
 drivers/gpu/drm/amd/amdgpu/amdgpu_ttm.c    | 4 ----
 2 files changed, 4 insertions(+), 4 deletions(-)

diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_object.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_object.c
index c29db99..d94cdef 100644
--- a/drivers/gpu/drm/amd/amdgpu/amdgpu_object.c
+++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_object.c
@@ -855,20 +855,24 @@ void amdgpu_bo_move_notify(struct ttm_buffer_object *bo,
 	struct amdgpu_device *adev = amdgpu_ttm_adev(bo->bdev);
 	struct amdgpu_bo *abo;
 	struct ttm_mem_reg *old_mem = &bo->mem;
 
 	if (!amdgpu_ttm_bo_is_amdgpu_bo(bo))
 		return;
 
 	abo = container_of(bo, struct amdgpu_bo, tbo);
 	amdgpu_vm_bo_invalidate(adev, abo);
 
+	/* remember the eviction */
+	if (evict)
+		atomic64_inc(&adev->num_evictions);
+
 	/* update statistics */
 	if (!new_mem)
 		return;
 
 	/* move_notify is called before move happens */
 	amdgpu_update_memory_usage(adev, &bo->mem, new_mem);
 
 	trace_amdgpu_ttm_bo_move(abo, new_mem->mem_type, old_mem->mem_type);
 }
 
diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ttm.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_ttm.c
index 8f18b8e..80924c2 100644
--- a/drivers/gpu/drm/amd/amdgpu/amdgpu_ttm.c
+++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_ttm.c
@@ -460,24 +460,20 @@ static int amdgpu_bo_move(struct ttm_buffer_object *bo,
 	struct ttm_mem_reg *old_mem = &bo->mem;
 	int r;
 
 	/* Can't move a pinned BO */
 	abo = container_of(bo, struct amdgpu_bo, tbo);
 	if (WARN_ON_ONCE(abo->pin_count > 0))
 		return -EINVAL;
 
 	adev = amdgpu_ttm_adev(bo->bdev);
 
-	/* remember the eviction */
-	if (evict)
-		atomic64_inc(&adev->num_evictions);
-
 	if (old_mem->mem_type == TTM_PL_SYSTEM && bo->ttm == NULL) {
 		amdgpu_move_null(bo, new_mem);
 		return 0;
 	}
 	if ((old_mem->mem_type == TTM_PL_TT &&
 	     new_mem->mem_type == TTM_PL_SYSTEM) ||
 	    (old_mem->mem_type == TTM_PL_SYSTEM &&
 	     new_mem->mem_type == TTM_PL_TT)) {
 		/* bind is enough */
 		amdgpu_move_null(bo, new_mem);
-- 
2.7.4

_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply related	[flat|nested] 15+ messages in thread

* [PATCH 3/5] drm/amd/amdgpu: add amdgpu_bo_gpu_accessible helper function
       [not found] ` <1481821815-9971-1-git-send-email-nhaehnle-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
  2016-12-15 17:10   ` [PATCH 1/5] drm/ttm: add evict parameter to ttm_bo_driver::move_notify Nicolai Hähnle
  2016-12-15 17:10   ` [PATCH 2/5] drm/amd/amdgpu: move eviction counting to amdgpu_bo_move_notify Nicolai Hähnle
@ 2016-12-15 17:10   ` Nicolai Hähnle
       [not found]     ` <1481821815-9971-4-git-send-email-nhaehnle-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
  2016-12-15 17:10   ` [PATCH 4/5] drm/amd/amdgpu: add check that shadow page directory is GPU-accessible Nicolai Hähnle
  2016-12-15 17:10   ` [PATCH 5/5] drm/amd/amdgpu: add check that shadow page tables are GPU-accessible Nicolai Hähnle
  4 siblings, 1 reply; 15+ messages in thread
From: Nicolai Hähnle @ 2016-12-15 17:10 UTC (permalink / raw)
  To: amd-gfx-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW
  Cc: Nicolai Hähnle, dri-devel-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW

From: Nicolai Hähnle <nicolai.haehnle@amd.com>

Signed-off-by: Nicolai Hähnle <nicolai.haehnle@amd.com>
---
 drivers/gpu/drm/amd/amdgpu/amdgpu_object.h | 9 +++++++++
 1 file changed, 9 insertions(+)

diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_object.h b/drivers/gpu/drm/amd/amdgpu/amdgpu_object.h
index 4306b2f..15a723a 100644
--- a/drivers/gpu/drm/amd/amdgpu/amdgpu_object.h
+++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_object.h
@@ -107,20 +107,29 @@ static inline unsigned amdgpu_bo_gpu_page_alignment(struct amdgpu_bo *bo)
  * amdgpu_bo_mmap_offset - return mmap offset of bo
  * @bo:	amdgpu object for which we query the offset
  *
  * Returns mmap offset of the object.
  */
 static inline u64 amdgpu_bo_mmap_offset(struct amdgpu_bo *bo)
 {
 	return drm_vma_node_offset_addr(&bo->tbo.vma_node);
 }
 
+/**
+ * amdgpu_bo_gpu_accessible - return whether the bo is currently in memory that
+ * is accessible to the GPU.
+ */
+static inline bool amdgpu_bo_gpu_accessible(struct amdgpu_bo *bo)
+{
+	return bo->tbo.mem.mem_type != TTM_PL_SYSTEM;
+}
+
 int amdgpu_bo_create(struct amdgpu_device *adev,
 			    unsigned long size, int byte_align,
 			    bool kernel, u32 domain, u64 flags,
 			    struct sg_table *sg,
 			    struct reservation_object *resv,
 			    struct amdgpu_bo **bo_ptr);
 int amdgpu_bo_create_restricted(struct amdgpu_device *adev,
 				unsigned long size, int byte_align,
 				bool kernel, u32 domain, u64 flags,
 				struct sg_table *sg,
-- 
2.7.4

_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply related	[flat|nested] 15+ messages in thread

* [PATCH 4/5] drm/amd/amdgpu: add check that shadow page directory is GPU-accessible
       [not found] ` <1481821815-9971-1-git-send-email-nhaehnle-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
                     ` (2 preceding siblings ...)
  2016-12-15 17:10   ` [PATCH 3/5] drm/amd/amdgpu: add amdgpu_bo_gpu_accessible helper function Nicolai Hähnle
@ 2016-12-15 17:10   ` Nicolai Hähnle
       [not found]     ` <1481821815-9971-5-git-send-email-nhaehnle-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
  2016-12-15 17:10   ` [PATCH 5/5] drm/amd/amdgpu: add check that shadow page tables are GPU-accessible Nicolai Hähnle
  4 siblings, 1 reply; 15+ messages in thread
From: Nicolai Hähnle @ 2016-12-15 17:10 UTC (permalink / raw)
  To: amd-gfx-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW
  Cc: Nicolai Hähnle, dri-devel-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW

From: Nicolai Hähnle <nicolai.haehnle@amd.com>

Skip amdgpu_gem_va_update_vm when shadow the page directory is swapped out.
Clean up the check for non-shadow BOs as well using the new helper function.

This fixes a crash with the stack trace:

amdgpu_gem_va_update_vm
-> amdgpu_vm_update_page_directory
 -> amdgpu_ttm_bind
  -> amdgpu_gtt_mgr_alloc

Signed-off-by: Nicolai Hähnle <nicolai.haehnle@amd.com>
---
 drivers/gpu/drm/amd/amdgpu/amdgpu_gem.c | 11 ++++++++---
 1 file changed, 8 insertions(+), 3 deletions(-)

diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_gem.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_gem.c
index cd62f6f..4e1eb05 100644
--- a/drivers/gpu/drm/amd/amdgpu/amdgpu_gem.c
+++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_gem.c
@@ -489,44 +489,49 @@ static int amdgpu_gem_va_check(void *param, struct amdgpu_bo *bo)
  * vital here, so they are not reported back to userspace.
  */
 static void amdgpu_gem_va_update_vm(struct amdgpu_device *adev,
 				    struct amdgpu_bo_va *bo_va,
 				    uint32_t operation)
 {
 	struct ttm_validate_buffer tv, *entry;
 	struct amdgpu_bo_list_entry vm_pd;
 	struct ww_acquire_ctx ticket;
 	struct list_head list, duplicates;
-	unsigned domain;
 	int r;
 
 	INIT_LIST_HEAD(&list);
 	INIT_LIST_HEAD(&duplicates);
 
 	tv.bo = &bo_va->bo->tbo;
 	tv.shared = true;
 	list_add(&tv.head, &list);
 
 	amdgpu_vm_get_pd_bo(bo_va->vm, &list, &vm_pd);
 
 	/* Provide duplicates to avoid -EALREADY */
 	r = ttm_eu_reserve_buffers(&ticket, &list, true, &duplicates);
 	if (r)
 		goto error_print;
 
 	list_for_each_entry(entry, &list, head) {
-		domain = amdgpu_mem_type_to_domain(entry->bo->mem.mem_type);
+		struct amdgpu_bo *bo =
+			container_of(entry->bo, struct amdgpu_bo, tbo);
+
 		/* if anything is swapped out don't swap it in here,
 		   just abort and wait for the next CS */
-		if (domain == AMDGPU_GEM_DOMAIN_CPU)
+		if (!amdgpu_bo_gpu_accessible(bo))
+			goto error_unreserve;
+
+		if (bo->shadow && !amdgpu_bo_gpu_accessible(bo->shadow))
 			goto error_unreserve;
 	}
+
 	r = amdgpu_vm_validate_pt_bos(adev, bo_va->vm, amdgpu_gem_va_check,
 				      NULL);
 	if (r)
 		goto error_unreserve;
 
 	r = amdgpu_vm_update_page_directory(adev, bo_va->vm);
 	if (r)
 		goto error_unreserve;
 
 	r = amdgpu_vm_clear_freed(adev, bo_va->vm);
-- 
2.7.4

_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply related	[flat|nested] 15+ messages in thread

* [PATCH 5/5] drm/amd/amdgpu: add check that shadow page tables are GPU-accessible
       [not found] ` <1481821815-9971-1-git-send-email-nhaehnle-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
                     ` (3 preceding siblings ...)
  2016-12-15 17:10   ` [PATCH 4/5] drm/amd/amdgpu: add check that shadow page directory is GPU-accessible Nicolai Hähnle
@ 2016-12-15 17:10   ` Nicolai Hähnle
       [not found]     ` <1481821815-9971-6-git-send-email-nhaehnle-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
  4 siblings, 1 reply; 15+ messages in thread
From: Nicolai Hähnle @ 2016-12-15 17:10 UTC (permalink / raw)
  To: amd-gfx-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW
  Cc: Nicolai Hähnle, dri-devel-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW

From: Nicolai Hähnle <nicolai.haehnle@amd.com>

Skip amdgpu_gem_va_update_vm otherwise. Also clean up the check for the
non-shadow page tables using the new helper function.

This fixes a crash with the stack trace:

amdgpu_gem_va_update_vm
-> amdgpu_vm_update_page_directory
 -> amdgpu_ttm_bind
  -> amdgpu_gtt_mgr_alloc

Signed-off-by: Nicolai Hähnle <nicolai.haehnle@amd.com>
---
 drivers/gpu/drm/amd/amdgpu/amdgpu_gem.c | 9 ++++++---
 1 file changed, 6 insertions(+), 3 deletions(-)

diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_gem.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_gem.c
index 4e1eb05..d91c80b 100644
--- a/drivers/gpu/drm/amd/amdgpu/amdgpu_gem.c
+++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_gem.c
@@ -464,26 +464,29 @@ int amdgpu_gem_metadata_ioctl(struct drm_device *dev, void *data,
 
 unreserve:
 	amdgpu_bo_unreserve(robj);
 out:
 	drm_gem_object_unreference_unlocked(gobj);
 	return r;
 }
 
 static int amdgpu_gem_va_check(void *param, struct amdgpu_bo *bo)
 {
-	unsigned domain = amdgpu_mem_type_to_domain(bo->tbo.mem.mem_type);
-
 	/* if anything is swapped out don't swap it in here,
 	   just abort and wait for the next CS */
+	if (!amdgpu_bo_gpu_accessible(bo))
+		return -ERESTARTSYS;
+
+	if (bo->shadow && !amdgpu_bo_gpu_accessible(bo))
+		return -ERESTARTSYS;
 
-	return domain == AMDGPU_GEM_DOMAIN_CPU ? -ERESTARTSYS : 0;
+	return 0;
 }
 
 /**
  * amdgpu_gem_va_update_vm -update the bo_va in its VM
  *
  * @adev: amdgpu_device pointer
  * @bo_va: bo_va to update
  *
  * Update the bo_va directly after setting it's address. Errors are not
  * vital here, so they are not reported back to userspace.
-- 
2.7.4

_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply related	[flat|nested] 15+ messages in thread

* [PATCH v2] drm/amd/amdgpu: add check that shadow page tables are GPU-accessible
       [not found]     ` <1481821815-9971-6-git-send-email-nhaehnle-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
@ 2016-12-15 17:59       ` Nicolai Hähnle
       [not found]         ` <1481824743-11139-1-git-send-email-nhaehnle-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
  2016-12-16  2:07       ` [PATCH 5/5] " zhoucm1
  1 sibling, 1 reply; 15+ messages in thread
From: Nicolai Hähnle @ 2016-12-15 17:59 UTC (permalink / raw)
  To: amd-gfx-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW
  Cc: Nicolai Hähnle, dri-devel-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW

From: Nicolai Hähnle <nicolai.haehnle@amd.com>

Skip amdgpu_gem_va_update_vm otherwise. Also clean up the check for the
non-shadow page tables using the new helper function.

This fixes a crash with the stack trace:

amdgpu_gem_va_update_vm
-> amdgpu_vm_update_page_directory
 -> amdgpu_ttm_bind
  -> amdgpu_gtt_mgr_alloc

v2: actually check bo->shadow instead of just checking bo twice

Signed-off-by: Nicolai Hähnle <nicolai.haehnle@amd.com>
---
 drivers/gpu/drm/amd/amdgpu/amdgpu_gem.c | 9 ++++++---
 1 file changed, 6 insertions(+), 3 deletions(-)

diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_gem.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_gem.c
index 4e1eb05..9bd1b4e 100644
--- a/drivers/gpu/drm/amd/amdgpu/amdgpu_gem.c
+++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_gem.c
@@ -464,26 +464,29 @@ int amdgpu_gem_metadata_ioctl(struct drm_device *dev, void *data,
 
 unreserve:
 	amdgpu_bo_unreserve(robj);
 out:
 	drm_gem_object_unreference_unlocked(gobj);
 	return r;
 }
 
 static int amdgpu_gem_va_check(void *param, struct amdgpu_bo *bo)
 {
-	unsigned domain = amdgpu_mem_type_to_domain(bo->tbo.mem.mem_type);
-
 	/* if anything is swapped out don't swap it in here,
 	   just abort and wait for the next CS */
+	if (!amdgpu_bo_gpu_accessible(bo))
+		return -ERESTARTSYS;
+
+	if (bo->shadow && !amdgpu_bo_gpu_accessible(bo->shadow))
+		return -ERESTARTSYS;
 
-	return domain == AMDGPU_GEM_DOMAIN_CPU ? -ERESTARTSYS : 0;
+	return 0;
 }
 
 /**
  * amdgpu_gem_va_update_vm -update the bo_va in its VM
  *
  * @adev: amdgpu_device pointer
  * @bo_va: bo_va to update
  *
  * Update the bo_va directly after setting it's address. Errors are not
  * vital here, so they are not reported back to userspace.
-- 
2.7.4

_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply related	[flat|nested] 15+ messages in thread

* Re: [PATCH 5/5] drm/amd/amdgpu: add check that shadow page tables are GPU-accessible
       [not found]     ` <1481821815-9971-6-git-send-email-nhaehnle-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
  2016-12-15 17:59       ` [PATCH v2] " Nicolai Hähnle
@ 2016-12-16  2:07       ` zhoucm1
  1 sibling, 0 replies; 15+ messages in thread
From: zhoucm1 @ 2016-12-16  2:07 UTC (permalink / raw)
  To: Nicolai Hähnle, amd-gfx-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW
  Cc: dri-devel-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW, Nicolai Hähnle



On 2016年12月16日 01:10, Nicolai Hähnle wrote:
> From: Nicolai Hähnle <nicolai.haehnle@amd.com>
>
> Skip amdgpu_gem_va_update_vm otherwise. Also clean up the check for the
> non-shadow page tables using the new helper function.
>
> This fixes a crash with the stack trace:
>
> amdgpu_gem_va_update_vm
> -> amdgpu_vm_update_page_directory
>   -> amdgpu_ttm_bind
>    -> amdgpu_gtt_mgr_alloc
>
> Signed-off-by: Nicolai Hähnle <nicolai.haehnle@amd.com>
> ---
>   drivers/gpu/drm/amd/amdgpu/amdgpu_gem.c | 9 ++++++---
>   1 file changed, 6 insertions(+), 3 deletions(-)
>
> diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_gem.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_gem.c
> index 4e1eb05..d91c80b 100644
> --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_gem.c
> +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_gem.c
> @@ -464,26 +464,29 @@ int amdgpu_gem_metadata_ioctl(struct drm_device *dev, void *data,
>   
>   unreserve:
>   	amdgpu_bo_unreserve(robj);
>   out:
>   	drm_gem_object_unreference_unlocked(gobj);
>   	return r;
>   }
>   
>   static int amdgpu_gem_va_check(void *param, struct amdgpu_bo *bo)
>   {
> -	unsigned domain = amdgpu_mem_type_to_domain(bo->tbo.mem.mem_type);
> -
>   	/* if anything is swapped out don't swap it in here,
>   	   just abort and wait for the next CS */
> +	if (!amdgpu_bo_gpu_accessible(bo))
> +		return -ERESTARTSYS;
> +
> +	if (bo->shadow && !amdgpu_bo_gpu_accessible(bo))

this should be 'if (bo->shadow && !amdgpu_bo_gpu_accessible(bo->shadow))'.

Regards,
David Zhou

> +		return -ERESTARTSYS;
>   
> -	return domain == AMDGPU_GEM_DOMAIN_CPU ? -ERESTARTSYS : 0;
> +	return 0;
>   }
>   
>   /**
>    * amdgpu_gem_va_update_vm -update the bo_va in its VM
>    *
>    * @adev: amdgpu_device pointer
>    * @bo_va: bo_va to update
>    *
>    * Update the bo_va directly after setting it's address. Errors are not
>    * vital here, so they are not reported back to userspace.

_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply	[flat|nested] 15+ messages in thread

* Re: [PATCH v2] drm/amd/amdgpu: add check that shadow page tables are GPU-accessible
       [not found]         ` <1481824743-11139-1-git-send-email-nhaehnle-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
@ 2016-12-16  2:48           ` zhoucm1
  0 siblings, 0 replies; 15+ messages in thread
From: zhoucm1 @ 2016-12-16  2:48 UTC (permalink / raw)
  To: Nicolai Hähnle, amd-gfx-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW
  Cc: dri-devel-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW, Nicolai Hähnle



On 2016年12月16日 01:59, Nicolai Hähnle wrote:
> From: Nicolai Hähnle <nicolai.haehnle@amd.com>
>
> Skip amdgpu_gem_va_update_vm otherwise. Also clean up the check for the
> non-shadow page tables using the new helper function.
>
> This fixes a crash with the stack trace:
>
> amdgpu_gem_va_update_vm
> -> amdgpu_vm_update_page_directory
>   -> amdgpu_ttm_bind
>    -> amdgpu_gtt_mgr_alloc
>
> v2: actually check bo->shadow instead of just checking bo twice
>
> Signed-off-by: Nicolai Hähnle <nicolai.haehnle@amd.com>
Reviewed-by: Chunming Zhou <david1.zhou@amd.com>
> ---
>   drivers/gpu/drm/amd/amdgpu/amdgpu_gem.c | 9 ++++++---
>   1 file changed, 6 insertions(+), 3 deletions(-)
>
> diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_gem.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_gem.c
> index 4e1eb05..9bd1b4e 100644
> --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_gem.c
> +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_gem.c
> @@ -464,26 +464,29 @@ int amdgpu_gem_metadata_ioctl(struct drm_device *dev, void *data,
>   
>   unreserve:
>   	amdgpu_bo_unreserve(robj);
>   out:
>   	drm_gem_object_unreference_unlocked(gobj);
>   	return r;
>   }
>   
>   static int amdgpu_gem_va_check(void *param, struct amdgpu_bo *bo)
>   {
> -	unsigned domain = amdgpu_mem_type_to_domain(bo->tbo.mem.mem_type);
> -
>   	/* if anything is swapped out don't swap it in here,
>   	   just abort and wait for the next CS */
> +	if (!amdgpu_bo_gpu_accessible(bo))
> +		return -ERESTARTSYS;
> +
> +	if (bo->shadow && !amdgpu_bo_gpu_accessible(bo->shadow))
> +		return -ERESTARTSYS;
>   
> -	return domain == AMDGPU_GEM_DOMAIN_CPU ? -ERESTARTSYS : 0;
> +	return 0;
>   }
>   
>   /**
>    * amdgpu_gem_va_update_vm -update the bo_va in its VM
>    *
>    * @adev: amdgpu_device pointer
>    * @bo_va: bo_va to update
>    *
>    * Update the bo_va directly after setting it's address. Errors are not
>    * vital here, so they are not reported back to userspace.

_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply	[flat|nested] 15+ messages in thread

* Re: [PATCH 3/5] drm/amd/amdgpu: add amdgpu_bo_gpu_accessible helper function
       [not found]     ` <1481821815-9971-4-git-send-email-nhaehnle-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
@ 2016-12-16  2:48       ` zhoucm1
  0 siblings, 0 replies; 15+ messages in thread
From: zhoucm1 @ 2016-12-16  2:48 UTC (permalink / raw)
  To: Nicolai Hähnle, amd-gfx-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW
  Cc: dri-devel-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW, Nicolai Hähnle



On 2016年12月16日 01:10, Nicolai Hähnle wrote:
> From: Nicolai Hähnle <nicolai.haehnle@amd.com>
>
> Signed-off-by: Nicolai Hähnle <nicolai.haehnle@amd.com>
Reviewed-by: Chunming Zhou <david1.zhou@amd.com>
> ---
>   drivers/gpu/drm/amd/amdgpu/amdgpu_object.h | 9 +++++++++
>   1 file changed, 9 insertions(+)
>
> diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_object.h b/drivers/gpu/drm/amd/amdgpu/amdgpu_object.h
> index 4306b2f..15a723a 100644
> --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_object.h
> +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_object.h
> @@ -107,20 +107,29 @@ static inline unsigned amdgpu_bo_gpu_page_alignment(struct amdgpu_bo *bo)
>    * amdgpu_bo_mmap_offset - return mmap offset of bo
>    * @bo:	amdgpu object for which we query the offset
>    *
>    * Returns mmap offset of the object.
>    */
>   static inline u64 amdgpu_bo_mmap_offset(struct amdgpu_bo *bo)
>   {
>   	return drm_vma_node_offset_addr(&bo->tbo.vma_node);
>   }
>   
> +/**
> + * amdgpu_bo_gpu_accessible - return whether the bo is currently in memory that
> + * is accessible to the GPU.
> + */
> +static inline bool amdgpu_bo_gpu_accessible(struct amdgpu_bo *bo)
> +{
> +	return bo->tbo.mem.mem_type != TTM_PL_SYSTEM;
> +}
> +
>   int amdgpu_bo_create(struct amdgpu_device *adev,
>   			    unsigned long size, int byte_align,
>   			    bool kernel, u32 domain, u64 flags,
>   			    struct sg_table *sg,
>   			    struct reservation_object *resv,
>   			    struct amdgpu_bo **bo_ptr);
>   int amdgpu_bo_create_restricted(struct amdgpu_device *adev,
>   				unsigned long size, int byte_align,
>   				bool kernel, u32 domain, u64 flags,
>   				struct sg_table *sg,

_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply	[flat|nested] 15+ messages in thread

* Re: [PATCH 4/5] drm/amd/amdgpu: add check that shadow page directory is GPU-accessible
       [not found]     ` <1481821815-9971-5-git-send-email-nhaehnle-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
@ 2016-12-16  2:49       ` zhoucm1
  0 siblings, 0 replies; 15+ messages in thread
From: zhoucm1 @ 2016-12-16  2:49 UTC (permalink / raw)
  To: Nicolai Hähnle, amd-gfx-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW
  Cc: dri-devel-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW, Nicolai Hähnle



On 2016年12月16日 01:10, Nicolai Hähnle wrote:
> From: Nicolai Hähnle <nicolai.haehnle@amd.com>
>
> Skip amdgpu_gem_va_update_vm when shadow the page directory is swapped out.
> Clean up the check for non-shadow BOs as well using the new helper function.
>
> This fixes a crash with the stack trace:
>
> amdgpu_gem_va_update_vm
> -> amdgpu_vm_update_page_directory
>   -> amdgpu_ttm_bind
>    -> amdgpu_gtt_mgr_alloc
>
> Signed-off-by: Nicolai Hähnle <nicolai.haehnle@amd.com>
Reviewed-by: Chunming Zhou <david1.zhou@amd.com>
> ---
>   drivers/gpu/drm/amd/amdgpu/amdgpu_gem.c | 11 ++++++++---
>   1 file changed, 8 insertions(+), 3 deletions(-)
>
> diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_gem.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_gem.c
> index cd62f6f..4e1eb05 100644
> --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_gem.c
> +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_gem.c
> @@ -489,44 +489,49 @@ static int amdgpu_gem_va_check(void *param, struct amdgpu_bo *bo)
>    * vital here, so they are not reported back to userspace.
>    */
>   static void amdgpu_gem_va_update_vm(struct amdgpu_device *adev,
>   				    struct amdgpu_bo_va *bo_va,
>   				    uint32_t operation)
>   {
>   	struct ttm_validate_buffer tv, *entry;
>   	struct amdgpu_bo_list_entry vm_pd;
>   	struct ww_acquire_ctx ticket;
>   	struct list_head list, duplicates;
> -	unsigned domain;
>   	int r;
>   
>   	INIT_LIST_HEAD(&list);
>   	INIT_LIST_HEAD(&duplicates);
>   
>   	tv.bo = &bo_va->bo->tbo;
>   	tv.shared = true;
>   	list_add(&tv.head, &list);
>   
>   	amdgpu_vm_get_pd_bo(bo_va->vm, &list, &vm_pd);
>   
>   	/* Provide duplicates to avoid -EALREADY */
>   	r = ttm_eu_reserve_buffers(&ticket, &list, true, &duplicates);
>   	if (r)
>   		goto error_print;
>   
>   	list_for_each_entry(entry, &list, head) {
> -		domain = amdgpu_mem_type_to_domain(entry->bo->mem.mem_type);
> +		struct amdgpu_bo *bo =
> +			container_of(entry->bo, struct amdgpu_bo, tbo);
> +
>   		/* if anything is swapped out don't swap it in here,
>   		   just abort and wait for the next CS */
> -		if (domain == AMDGPU_GEM_DOMAIN_CPU)
> +		if (!amdgpu_bo_gpu_accessible(bo))
> +			goto error_unreserve;
> +
> +		if (bo->shadow && !amdgpu_bo_gpu_accessible(bo->shadow))
>   			goto error_unreserve;
>   	}
> +
>   	r = amdgpu_vm_validate_pt_bos(adev, bo_va->vm, amdgpu_gem_va_check,
>   				      NULL);
>   	if (r)
>   		goto error_unreserve;
>   
>   	r = amdgpu_vm_update_page_directory(adev, bo_va->vm);
>   	if (r)
>   		goto error_unreserve;
>   
>   	r = amdgpu_vm_clear_freed(adev, bo_va->vm);

_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply	[flat|nested] 15+ messages in thread

* Re: [PATCH 1/5] drm/ttm: add evict parameter to ttm_bo_driver::move_notify
       [not found]     ` <1481821815-9971-2-git-send-email-nhaehnle-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
@ 2016-12-16  2:49       ` zhoucm1
       [not found]         ` <5853564E.5020005-5C7GfCeVMHo@public.gmane.org>
  0 siblings, 1 reply; 15+ messages in thread
From: zhoucm1 @ 2016-12-16  2:49 UTC (permalink / raw)
  To: Nicolai Hähnle, amd-gfx-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW
  Cc: dri-devel-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW, Nicolai Hähnle



On 2016年12月16日 01:10, Nicolai Hähnle wrote:
> From: Nicolai Hähnle <nicolai.haehnle@amd.com>
>
> Ensure that the driver can listen to evictions even when they don't take the
> path through ttm_bo_driver::move.
>
> This is crucial for amdgpu, which relies on an eviction counter to skip
> re-binding page tables when possible.
>
> Signed-off-by: Nicolai Hähnle <nicolai.haehnle@amd.com>
Acked-by: Chunming Zhou <david1.zhou@amd.com>
> ---
>   drivers/gpu/drm/amd/amdgpu/amdgpu_object.c |  1 +
>   drivers/gpu/drm/amd/amdgpu/amdgpu_object.h |  3 ++-
>   drivers/gpu/drm/nouveau/nouveau_bo.c       |  3 ++-
>   drivers/gpu/drm/qxl/qxl_ttm.c              |  1 +
>   drivers/gpu/drm/radeon/radeon_object.c     |  1 +
>   drivers/gpu/drm/radeon/radeon_object.h     |  1 +
>   drivers/gpu/drm/ttm/ttm_bo.c               |  8 ++++----
>   drivers/gpu/drm/virtio/virtgpu_ttm.c       |  1 +
>   drivers/gpu/drm/vmwgfx/vmwgfx_buffer.c     |  1 +
>   include/drm/ttm/ttm_bo_driver.h            | 10 ++++++++--
>   10 files changed, 22 insertions(+), 8 deletions(-)
>
> diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_object.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_object.c
> index bf79b73..c29db99 100644
> --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_object.c
> +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_object.c
> @@ -842,20 +842,21 @@ int amdgpu_bo_get_metadata(struct amdgpu_bo *bo, void *buffer,
>   
>   	if (metadata_size)
>   		*metadata_size = bo->metadata_size;
>   	if (flags)
>   		*flags = bo->metadata_flags;
>   
>   	return 0;
>   }
>   
>   void amdgpu_bo_move_notify(struct ttm_buffer_object *bo,
> +			   bool evict,
>   			   struct ttm_mem_reg *new_mem)
>   {
>   	struct amdgpu_device *adev = amdgpu_ttm_adev(bo->bdev);
>   	struct amdgpu_bo *abo;
>   	struct ttm_mem_reg *old_mem = &bo->mem;
>   
>   	if (!amdgpu_ttm_bo_is_amdgpu_bo(bo))
>   		return;
>   
>   	abo = container_of(bo, struct amdgpu_bo, tbo);
> diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_object.h b/drivers/gpu/drm/amd/amdgpu/amdgpu_object.h
> index 5cbf59e..4306b2f 100644
> --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_object.h
> +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_object.h
> @@ -148,21 +148,22 @@ void amdgpu_bo_fini(struct amdgpu_device *adev);
>   int amdgpu_bo_fbdev_mmap(struct amdgpu_bo *bo,
>   				struct vm_area_struct *vma);
>   int amdgpu_bo_set_tiling_flags(struct amdgpu_bo *bo, u64 tiling_flags);
>   void amdgpu_bo_get_tiling_flags(struct amdgpu_bo *bo, u64 *tiling_flags);
>   int amdgpu_bo_set_metadata (struct amdgpu_bo *bo, void *metadata,
>   			    uint32_t metadata_size, uint64_t flags);
>   int amdgpu_bo_get_metadata(struct amdgpu_bo *bo, void *buffer,
>   			   size_t buffer_size, uint32_t *metadata_size,
>   			   uint64_t *flags);
>   void amdgpu_bo_move_notify(struct ttm_buffer_object *bo,
> -				  struct ttm_mem_reg *new_mem);
> +			   bool evict,
> +			   struct ttm_mem_reg *new_mem);
>   int amdgpu_bo_fault_reserve_notify(struct ttm_buffer_object *bo);
>   void amdgpu_bo_fence(struct amdgpu_bo *bo, struct dma_fence *fence,
>   		     bool shared);
>   u64 amdgpu_bo_gpu_offset(struct amdgpu_bo *bo);
>   int amdgpu_bo_backup_to_shadow(struct amdgpu_device *adev,
>   			       struct amdgpu_ring *ring,
>   			       struct amdgpu_bo *bo,
>   			       struct reservation_object *resv,
>   			       struct dma_fence **fence, bool direct);
>   int amdgpu_bo_restore_from_shadow(struct amdgpu_device *adev,
> diff --git a/drivers/gpu/drm/nouveau/nouveau_bo.c b/drivers/gpu/drm/nouveau/nouveau_bo.c
> index e0c0007..6fa1521 100644
> --- a/drivers/gpu/drm/nouveau/nouveau_bo.c
> +++ b/drivers/gpu/drm/nouveau/nouveau_bo.c
> @@ -1187,21 +1187,22 @@ nouveau_bo_move_flips(struct ttm_buffer_object *bo, bool evict, bool intr,
>   	ret = nouveau_bo_move_m2mf(bo, true, intr, no_wait_gpu, new_mem);
>   	if (ret)
>   		goto out;
>   
>   out:
>   	ttm_bo_mem_put(bo, &tmp_mem);
>   	return ret;
>   }
>   
>   static void
> -nouveau_bo_move_ntfy(struct ttm_buffer_object *bo, struct ttm_mem_reg *new_mem)
> +nouveau_bo_move_ntfy(struct ttm_buffer_object *bo, bool evict,
> +		     struct ttm_mem_reg *new_mem)
>   {
>   	struct nouveau_bo *nvbo = nouveau_bo(bo);
>   	struct nvkm_vma *vma;
>   
>   	/* ttm can now (stupidly) pass the driver bos it didn't create... */
>   	if (bo->destroy != nouveau_bo_del_ttm)
>   		return;
>   
>   	list_for_each_entry(vma, &nvbo->vma_list, head) {
>   		if (new_mem && new_mem->mem_type != TTM_PL_SYSTEM &&
> diff --git a/drivers/gpu/drm/qxl/qxl_ttm.c b/drivers/gpu/drm/qxl/qxl_ttm.c
> index 1176133..f3939a9 100644
> --- a/drivers/gpu/drm/qxl/qxl_ttm.c
> +++ b/drivers/gpu/drm/qxl/qxl_ttm.c
> @@ -360,20 +360,21 @@ static int qxl_bo_move(struct ttm_buffer_object *bo,
>   
>   	if (old_mem->mem_type == TTM_PL_SYSTEM && bo->ttm == NULL) {
>   		qxl_move_null(bo, new_mem);
>   		return 0;
>   	}
>   	return ttm_bo_move_memcpy(bo, interruptible, no_wait_gpu,
>   				  new_mem);
>   }
>   
>   static void qxl_bo_move_notify(struct ttm_buffer_object *bo,
> +			       bool evict,
>   			       struct ttm_mem_reg *new_mem)
>   {
>   	struct qxl_bo *qbo;
>   	struct qxl_device *qdev;
>   
>   	if (!qxl_ttm_bo_is_qxl_bo(bo))
>   		return;
>   	qbo = to_qxl_bo(bo);
>   	qdev = qbo->gem_base.dev->dev_private;
>   
> diff --git a/drivers/gpu/drm/radeon/radeon_object.c b/drivers/gpu/drm/radeon/radeon_object.c
> index 41b72ce..74b2760 100644
> --- a/drivers/gpu/drm/radeon/radeon_object.c
> +++ b/drivers/gpu/drm/radeon/radeon_object.c
> @@ -758,20 +758,21 @@ int radeon_bo_check_tiling(struct radeon_bo *bo, bool has_moved,
>   		return 0;
>   	}
>   
>   	if ((bo->surface_reg >= 0) && !has_moved)
>   		return 0;
>   
>   	return radeon_bo_get_surface_reg(bo);
>   }
>   
>   void radeon_bo_move_notify(struct ttm_buffer_object *bo,
> +			   bool evict,
>   			   struct ttm_mem_reg *new_mem)
>   {
>   	struct radeon_bo *rbo;
>   
>   	if (!radeon_ttm_bo_is_radeon_bo(bo))
>   		return;
>   
>   	rbo = container_of(bo, struct radeon_bo, tbo);
>   	radeon_bo_check_tiling(rbo, 0, 1);
>   	radeon_vm_bo_invalidate(rbo->rdev, rbo);
> diff --git a/drivers/gpu/drm/radeon/radeon_object.h b/drivers/gpu/drm/radeon/radeon_object.h
> index a10bb3d..9ffd821 100644
> --- a/drivers/gpu/drm/radeon/radeon_object.h
> +++ b/drivers/gpu/drm/radeon/radeon_object.h
> @@ -143,20 +143,21 @@ extern void radeon_bo_fini(struct radeon_device *rdev);
>   extern int radeon_bo_list_validate(struct radeon_device *rdev,
>   				   struct ww_acquire_ctx *ticket,
>   				   struct list_head *head, int ring);
>   extern int radeon_bo_set_tiling_flags(struct radeon_bo *bo,
>   				u32 tiling_flags, u32 pitch);
>   extern void radeon_bo_get_tiling_flags(struct radeon_bo *bo,
>   				u32 *tiling_flags, u32 *pitch);
>   extern int radeon_bo_check_tiling(struct radeon_bo *bo, bool has_moved,
>   				bool force_drop);
>   extern void radeon_bo_move_notify(struct ttm_buffer_object *bo,
> +				  bool evict,
>   				  struct ttm_mem_reg *new_mem);
>   extern int radeon_bo_fault_reserve_notify(struct ttm_buffer_object *bo);
>   extern int radeon_bo_get_surface_reg(struct radeon_bo *bo);
>   extern void radeon_bo_fence(struct radeon_bo *bo, struct radeon_fence *fence,
>   			    bool shared);
>   
>   /*
>    * sub allocation
>    */
>   
> diff --git a/drivers/gpu/drm/ttm/ttm_bo.c b/drivers/gpu/drm/ttm/ttm_bo.c
> index d506361..d4973e9c 100644
> --- a/drivers/gpu/drm/ttm/ttm_bo.c
> +++ b/drivers/gpu/drm/ttm/ttm_bo.c
> @@ -335,45 +335,45 @@ static int ttm_bo_handle_move_mem(struct ttm_buffer_object *bo,
>   			goto out_err;
>   
>   		if (mem->mem_type != TTM_PL_SYSTEM) {
>   			ret = ttm_tt_bind(bo->ttm, mem);
>   			if (ret)
>   				goto out_err;
>   		}
>   
>   		if (bo->mem.mem_type == TTM_PL_SYSTEM) {
>   			if (bdev->driver->move_notify)
> -				bdev->driver->move_notify(bo, mem);
> +				bdev->driver->move_notify(bo, evict, mem);
>   			bo->mem = *mem;
>   			mem->mm_node = NULL;
>   			goto moved;
>   		}
>   	}
>   
>   	if (bdev->driver->move_notify)
> -		bdev->driver->move_notify(bo, mem);
> +		bdev->driver->move_notify(bo, evict, mem);
>   
>   	if (!(old_man->flags & TTM_MEMTYPE_FLAG_FIXED) &&
>   	    !(new_man->flags & TTM_MEMTYPE_FLAG_FIXED))
>   		ret = ttm_bo_move_ttm(bo, interruptible, no_wait_gpu, mem);
>   	else if (bdev->driver->move)
>   		ret = bdev->driver->move(bo, evict, interruptible,
>   					 no_wait_gpu, mem);
>   	else
>   		ret = ttm_bo_move_memcpy(bo, interruptible, no_wait_gpu, mem);
>   
>   	if (ret) {
>   		if (bdev->driver->move_notify) {
>   			struct ttm_mem_reg tmp_mem = *mem;
>   			*mem = bo->mem;
>   			bo->mem = tmp_mem;
> -			bdev->driver->move_notify(bo, mem);
> +			bdev->driver->move_notify(bo, false, mem);
>   			bo->mem = *mem;
>   			*mem = tmp_mem;
>   		}
>   
>   		goto out_err;
>   	}
>   
>   moved:
>   	if (bo->evicted) {
>   		if (bdev->driver->invalidate_caches) {
> @@ -407,21 +407,21 @@ static int ttm_bo_handle_move_mem(struct ttm_buffer_object *bo,
>    * Call bo::reserved.
>    * Will release GPU memory type usage on destruction.
>    * This is the place to put in driver specific hooks to release
>    * driver private resources.
>    * Will release the bo::reserved lock.
>    */
>   
>   static void ttm_bo_cleanup_memtype_use(struct ttm_buffer_object *bo)
>   {
>   	if (bo->bdev->driver->move_notify)
> -		bo->bdev->driver->move_notify(bo, NULL);
> +		bo->bdev->driver->move_notify(bo, false, NULL);
>   
>   	ttm_tt_destroy(bo->ttm);
>   	bo->ttm = NULL;
>   	ttm_bo_mem_put(bo, &bo->mem);
>   
>   	ww_mutex_unlock (&bo->resv->lock);
>   }
>   
>   static void ttm_bo_flush_all_fences(struct ttm_buffer_object *bo)
>   {
> diff --git a/drivers/gpu/drm/virtio/virtgpu_ttm.c b/drivers/gpu/drm/virtio/virtgpu_ttm.c
> index 4a1de9f..10387d7 100644
> --- a/drivers/gpu/drm/virtio/virtgpu_ttm.c
> +++ b/drivers/gpu/drm/virtio/virtgpu_ttm.c
> @@ -379,20 +379,21 @@ static int virtio_gpu_bo_move(struct ttm_buffer_object *bo,
>   
>   	ret = ttm_bo_wait(bo, interruptible, no_wait_gpu);
>   	if (ret)
>   		return ret;
>   
>   	virtio_gpu_move_null(bo, new_mem);
>   	return 0;
>   }
>   
>   static void virtio_gpu_bo_move_notify(struct ttm_buffer_object *tbo,
> +				      bool evict,
>   				      struct ttm_mem_reg *new_mem)
>   {
>   	struct virtio_gpu_object *bo;
>   	struct virtio_gpu_device *vgdev;
>   
>   	bo = container_of(tbo, struct virtio_gpu_object, tbo);
>   	vgdev = (struct virtio_gpu_device *)bo->gem_base.dev->dev_private;
>   
>   	if (!new_mem || (new_mem->placement & TTM_PL_FLAG_SYSTEM)) {
>   		if (bo->hw_res_handle)
> diff --git a/drivers/gpu/drm/vmwgfx/vmwgfx_buffer.c b/drivers/gpu/drm/vmwgfx/vmwgfx_buffer.c
> index c894a48..caa279b 100644
> --- a/drivers/gpu/drm/vmwgfx/vmwgfx_buffer.c
> +++ b/drivers/gpu/drm/vmwgfx/vmwgfx_buffer.c
> @@ -818,20 +818,21 @@ static int vmw_ttm_fault_reserve_notify(struct ttm_buffer_object *bo)
>    * vmw_move_notify - TTM move_notify_callback
>    *
>    * @bo: The TTM buffer object about to move.
>    * @mem: The struct ttm_mem_reg indicating to what memory
>    *       region the move is taking place.
>    *
>    * Calls move_notify for all subsystems needing it.
>    * (currently only resources).
>    */
>   static void vmw_move_notify(struct ttm_buffer_object *bo,
> +			    bool evict,
>   			    struct ttm_mem_reg *mem)
>   {
>   	vmw_resource_move_notify(bo, mem);
>   	vmw_query_move_notify(bo, mem);
>   }
>   
>   
>   /**
>    * vmw_swap_notify - TTM move_notify_callback
>    *
> diff --git a/include/drm/ttm/ttm_bo_driver.h b/include/drm/ttm/ttm_bo_driver.h
> index cdbdb40..8b4bec7 100644
> --- a/include/drm/ttm/ttm_bo_driver.h
> +++ b/include/drm/ttm/ttm_bo_driver.h
> @@ -424,23 +424,29 @@ struct ttm_bo_driver {
>   	 *
>   	 * Called from the map / write / read methods to verify that the
>   	 * caller is permitted to access the buffer object.
>   	 * This member may be set to NULL, which will refuse this kind of
>   	 * access for all buffer objects.
>   	 * This function should return 0 if access is granted, -EPERM otherwise.
>   	 */
>   	int (*verify_access)(struct ttm_buffer_object *bo,
>   			     struct file *filp);
>   
> -	/* hook to notify driver about a driver move so it
> -	 * can do tiling things */
> +	/**
> +	 * Hook to notify driver about a driver move so it
> +	 * can do tiling things and book-keeping.
> +	 *
> +	 * @evict: whether this move is evicting the buffer from the graphics
> +	 * address space
> +	 */
>   	void (*move_notify)(struct ttm_buffer_object *bo,
> +			    bool evict,
>   			    struct ttm_mem_reg *new_mem);
>   	/* notify the driver we are taking a fault on this BO
>   	 * and have reserved it */
>   	int (*fault_reserve_notify)(struct ttm_buffer_object *bo);
>   
>   	/**
>   	 * notify the driver that we're about to swap out this bo
>   	 */
>   	void (*swap_notify)(struct ttm_buffer_object *bo);
>   

_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply	[flat|nested] 15+ messages in thread

* Re: [PATCH 2/5] drm/amd/amdgpu: move eviction counting to amdgpu_bo_move_notify
       [not found]     ` <1481821815-9971-3-git-send-email-nhaehnle-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
@ 2016-12-16  2:50       ` zhoucm1
  0 siblings, 0 replies; 15+ messages in thread
From: zhoucm1 @ 2016-12-16  2:50 UTC (permalink / raw)
  To: Nicolai Hähnle, amd-gfx-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW
  Cc: dri-devel-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW, Nicolai Hähnle



On 2016年12月16日 01:10, Nicolai Hähnle wrote:
> From: Nicolai Hähnle <nicolai.haehnle@amd.com>
>
> This catches evictions of shadow page tables from the GART. Since shadow
> page tables are always stored in system memory, amdgpu_bo_move is never
> called for them.
>
> This fixes a crash during command submission that occurs when only a shadow
> page table and no other BOs were evicted since the last submission.
>
> Fixes: 1baa439fb2f4e586 ("drm/amdgpu: allocate shadow for pd/pt bo V2")
> Signed-off-by: Nicolai Hähnle <nicolai.haehnle@amd.com>
Acked-by: Chunming Zhou <david1.zhou@amd.com>
> ---
>   drivers/gpu/drm/amd/amdgpu/amdgpu_object.c | 4 ++++
>   drivers/gpu/drm/amd/amdgpu/amdgpu_ttm.c    | 4 ----
>   2 files changed, 4 insertions(+), 4 deletions(-)
>
> diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_object.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_object.c
> index c29db99..d94cdef 100644
> --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_object.c
> +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_object.c
> @@ -855,20 +855,24 @@ void amdgpu_bo_move_notify(struct ttm_buffer_object *bo,
>   	struct amdgpu_device *adev = amdgpu_ttm_adev(bo->bdev);
>   	struct amdgpu_bo *abo;
>   	struct ttm_mem_reg *old_mem = &bo->mem;
>   
>   	if (!amdgpu_ttm_bo_is_amdgpu_bo(bo))
>   		return;
>   
>   	abo = container_of(bo, struct amdgpu_bo, tbo);
>   	amdgpu_vm_bo_invalidate(adev, abo);
>   
> +	/* remember the eviction */
> +	if (evict)
> +		atomic64_inc(&adev->num_evictions);
> +
>   	/* update statistics */
>   	if (!new_mem)
>   		return;
>   
>   	/* move_notify is called before move happens */
>   	amdgpu_update_memory_usage(adev, &bo->mem, new_mem);
>   
>   	trace_amdgpu_ttm_bo_move(abo, new_mem->mem_type, old_mem->mem_type);
>   }
>   
> diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ttm.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_ttm.c
> index 8f18b8e..80924c2 100644
> --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_ttm.c
> +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_ttm.c
> @@ -460,24 +460,20 @@ static int amdgpu_bo_move(struct ttm_buffer_object *bo,
>   	struct ttm_mem_reg *old_mem = &bo->mem;
>   	int r;
>   
>   	/* Can't move a pinned BO */
>   	abo = container_of(bo, struct amdgpu_bo, tbo);
>   	if (WARN_ON_ONCE(abo->pin_count > 0))
>   		return -EINVAL;
>   
>   	adev = amdgpu_ttm_adev(bo->bdev);
>   
> -	/* remember the eviction */
> -	if (evict)
> -		atomic64_inc(&adev->num_evictions);
> -
>   	if (old_mem->mem_type == TTM_PL_SYSTEM && bo->ttm == NULL) {
>   		amdgpu_move_null(bo, new_mem);
>   		return 0;
>   	}
>   	if ((old_mem->mem_type == TTM_PL_TT &&
>   	     new_mem->mem_type == TTM_PL_SYSTEM) ||
>   	    (old_mem->mem_type == TTM_PL_SYSTEM &&
>   	     new_mem->mem_type == TTM_PL_TT)) {
>   		/* bind is enough */
>   		amdgpu_move_null(bo, new_mem);

_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply	[flat|nested] 15+ messages in thread

* Re: [PATCH 1/5] drm/ttm: add evict parameter to ttm_bo_driver::move_notify
       [not found]         ` <5853564E.5020005-5C7GfCeVMHo@public.gmane.org>
@ 2016-12-21 15:12           ` Nicolai Hähnle
       [not found]             ` <8f68adf1-52bc-b2ef-2fa7-79a8f9695164-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
  0 siblings, 1 reply; 15+ messages in thread
From: Nicolai Hähnle @ 2016-12-21 15:12 UTC (permalink / raw)
  To: zhoucm1, amd-gfx-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW
  Cc: dri-devel-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW, Nicolai Hähnle

On 16.12.2016 03:49, zhoucm1 wrote:
> On 2016年12月16日 01:10, Nicolai Hähnle wrote:
>> From: Nicolai Hähnle <nicolai.haehnle@amd.com>
>>
>> Ensure that the driver can listen to evictions even when they don't
>> take the
>> path through ttm_bo_driver::move.
>>
>> This is crucial for amdgpu, which relies on an eviction counter to skip
>> re-binding page tables when possible.
>>
>> Signed-off-by: Nicolai Hähnle <nicolai.haehnle@amd.com>
> Acked-by: Chunming Zhou <david1.zhou@amd.com>

Thanks. Ping for feedback from non-AMD people?

Nicolai


>> ---
>>   drivers/gpu/drm/amd/amdgpu/amdgpu_object.c |  1 +
>>   drivers/gpu/drm/amd/amdgpu/amdgpu_object.h |  3 ++-
>>   drivers/gpu/drm/nouveau/nouveau_bo.c       |  3 ++-
>>   drivers/gpu/drm/qxl/qxl_ttm.c              |  1 +
>>   drivers/gpu/drm/radeon/radeon_object.c     |  1 +
>>   drivers/gpu/drm/radeon/radeon_object.h     |  1 +
>>   drivers/gpu/drm/ttm/ttm_bo.c               |  8 ++++----
>>   drivers/gpu/drm/virtio/virtgpu_ttm.c       |  1 +
>>   drivers/gpu/drm/vmwgfx/vmwgfx_buffer.c     |  1 +
>>   include/drm/ttm/ttm_bo_driver.h            | 10 ++++++++--
>>   10 files changed, 22 insertions(+), 8 deletions(-)
>>
>> diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_object.c
>> b/drivers/gpu/drm/amd/amdgpu/amdgpu_object.c
>> index bf79b73..c29db99 100644
>> --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_object.c
>> +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_object.c
>> @@ -842,20 +842,21 @@ int amdgpu_bo_get_metadata(struct amdgpu_bo *bo,
>> void *buffer,
>>         if (metadata_size)
>>           *metadata_size = bo->metadata_size;
>>       if (flags)
>>           *flags = bo->metadata_flags;
>>         return 0;
>>   }
>>     void amdgpu_bo_move_notify(struct ttm_buffer_object *bo,
>> +               bool evict,
>>                  struct ttm_mem_reg *new_mem)
>>   {
>>       struct amdgpu_device *adev = amdgpu_ttm_adev(bo->bdev);
>>       struct amdgpu_bo *abo;
>>       struct ttm_mem_reg *old_mem = &bo->mem;
>>         if (!amdgpu_ttm_bo_is_amdgpu_bo(bo))
>>           return;
>>         abo = container_of(bo, struct amdgpu_bo, tbo);
>> diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_object.h
>> b/drivers/gpu/drm/amd/amdgpu/amdgpu_object.h
>> index 5cbf59e..4306b2f 100644
>> --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_object.h
>> +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_object.h
>> @@ -148,21 +148,22 @@ void amdgpu_bo_fini(struct amdgpu_device *adev);
>>   int amdgpu_bo_fbdev_mmap(struct amdgpu_bo *bo,
>>                   struct vm_area_struct *vma);
>>   int amdgpu_bo_set_tiling_flags(struct amdgpu_bo *bo, u64 tiling_flags);
>>   void amdgpu_bo_get_tiling_flags(struct amdgpu_bo *bo, u64
>> *tiling_flags);
>>   int amdgpu_bo_set_metadata (struct amdgpu_bo *bo, void *metadata,
>>                   uint32_t metadata_size, uint64_t flags);
>>   int amdgpu_bo_get_metadata(struct amdgpu_bo *bo, void *buffer,
>>                  size_t buffer_size, uint32_t *metadata_size,
>>                  uint64_t *flags);
>>   void amdgpu_bo_move_notify(struct ttm_buffer_object *bo,
>> -                  struct ttm_mem_reg *new_mem);
>> +               bool evict,
>> +               struct ttm_mem_reg *new_mem);
>>   int amdgpu_bo_fault_reserve_notify(struct ttm_buffer_object *bo);
>>   void amdgpu_bo_fence(struct amdgpu_bo *bo, struct dma_fence *fence,
>>                bool shared);
>>   u64 amdgpu_bo_gpu_offset(struct amdgpu_bo *bo);
>>   int amdgpu_bo_backup_to_shadow(struct amdgpu_device *adev,
>>                      struct amdgpu_ring *ring,
>>                      struct amdgpu_bo *bo,
>>                      struct reservation_object *resv,
>>                      struct dma_fence **fence, bool direct);
>>   int amdgpu_bo_restore_from_shadow(struct amdgpu_device *adev,
>> diff --git a/drivers/gpu/drm/nouveau/nouveau_bo.c
>> b/drivers/gpu/drm/nouveau/nouveau_bo.c
>> index e0c0007..6fa1521 100644
>> --- a/drivers/gpu/drm/nouveau/nouveau_bo.c
>> +++ b/drivers/gpu/drm/nouveau/nouveau_bo.c
>> @@ -1187,21 +1187,22 @@ nouveau_bo_move_flips(struct ttm_buffer_object
>> *bo, bool evict, bool intr,
>>       ret = nouveau_bo_move_m2mf(bo, true, intr, no_wait_gpu, new_mem);
>>       if (ret)
>>           goto out;
>>     out:
>>       ttm_bo_mem_put(bo, &tmp_mem);
>>       return ret;
>>   }
>>     static void
>> -nouveau_bo_move_ntfy(struct ttm_buffer_object *bo, struct ttm_mem_reg
>> *new_mem)
>> +nouveau_bo_move_ntfy(struct ttm_buffer_object *bo, bool evict,
>> +             struct ttm_mem_reg *new_mem)
>>   {
>>       struct nouveau_bo *nvbo = nouveau_bo(bo);
>>       struct nvkm_vma *vma;
>>         /* ttm can now (stupidly) pass the driver bos it didn't
>> create... */
>>       if (bo->destroy != nouveau_bo_del_ttm)
>>           return;
>>         list_for_each_entry(vma, &nvbo->vma_list, head) {
>>           if (new_mem && new_mem->mem_type != TTM_PL_SYSTEM &&
>> diff --git a/drivers/gpu/drm/qxl/qxl_ttm.c
>> b/drivers/gpu/drm/qxl/qxl_ttm.c
>> index 1176133..f3939a9 100644
>> --- a/drivers/gpu/drm/qxl/qxl_ttm.c
>> +++ b/drivers/gpu/drm/qxl/qxl_ttm.c
>> @@ -360,20 +360,21 @@ static int qxl_bo_move(struct ttm_buffer_object
>> *bo,
>>         if (old_mem->mem_type == TTM_PL_SYSTEM && bo->ttm == NULL) {
>>           qxl_move_null(bo, new_mem);
>>           return 0;
>>       }
>>       return ttm_bo_move_memcpy(bo, interruptible, no_wait_gpu,
>>                     new_mem);
>>   }
>>     static void qxl_bo_move_notify(struct ttm_buffer_object *bo,
>> +                   bool evict,
>>                      struct ttm_mem_reg *new_mem)
>>   {
>>       struct qxl_bo *qbo;
>>       struct qxl_device *qdev;
>>         if (!qxl_ttm_bo_is_qxl_bo(bo))
>>           return;
>>       qbo = to_qxl_bo(bo);
>>       qdev = qbo->gem_base.dev->dev_private;
>>   diff --git a/drivers/gpu/drm/radeon/radeon_object.c
>> b/drivers/gpu/drm/radeon/radeon_object.c
>> index 41b72ce..74b2760 100644
>> --- a/drivers/gpu/drm/radeon/radeon_object.c
>> +++ b/drivers/gpu/drm/radeon/radeon_object.c
>> @@ -758,20 +758,21 @@ int radeon_bo_check_tiling(struct radeon_bo *bo,
>> bool has_moved,
>>           return 0;
>>       }
>>         if ((bo->surface_reg >= 0) && !has_moved)
>>           return 0;
>>         return radeon_bo_get_surface_reg(bo);
>>   }
>>     void radeon_bo_move_notify(struct ttm_buffer_object *bo,
>> +               bool evict,
>>                  struct ttm_mem_reg *new_mem)
>>   {
>>       struct radeon_bo *rbo;
>>         if (!radeon_ttm_bo_is_radeon_bo(bo))
>>           return;
>>         rbo = container_of(bo, struct radeon_bo, tbo);
>>       radeon_bo_check_tiling(rbo, 0, 1);
>>       radeon_vm_bo_invalidate(rbo->rdev, rbo);
>> diff --git a/drivers/gpu/drm/radeon/radeon_object.h
>> b/drivers/gpu/drm/radeon/radeon_object.h
>> index a10bb3d..9ffd821 100644
>> --- a/drivers/gpu/drm/radeon/radeon_object.h
>> +++ b/drivers/gpu/drm/radeon/radeon_object.h
>> @@ -143,20 +143,21 @@ extern void radeon_bo_fini(struct radeon_device
>> *rdev);
>>   extern int radeon_bo_list_validate(struct radeon_device *rdev,
>>                      struct ww_acquire_ctx *ticket,
>>                      struct list_head *head, int ring);
>>   extern int radeon_bo_set_tiling_flags(struct radeon_bo *bo,
>>                   u32 tiling_flags, u32 pitch);
>>   extern void radeon_bo_get_tiling_flags(struct radeon_bo *bo,
>>                   u32 *tiling_flags, u32 *pitch);
>>   extern int radeon_bo_check_tiling(struct radeon_bo *bo, bool has_moved,
>>                   bool force_drop);
>>   extern void radeon_bo_move_notify(struct ttm_buffer_object *bo,
>> +                  bool evict,
>>                     struct ttm_mem_reg *new_mem);
>>   extern int radeon_bo_fault_reserve_notify(struct ttm_buffer_object
>> *bo);
>>   extern int radeon_bo_get_surface_reg(struct radeon_bo *bo);
>>   extern void radeon_bo_fence(struct radeon_bo *bo, struct
>> radeon_fence *fence,
>>                   bool shared);
>>     /*
>>    * sub allocation
>>    */
>>   diff --git a/drivers/gpu/drm/ttm/ttm_bo.c
>> b/drivers/gpu/drm/ttm/ttm_bo.c
>> index d506361..d4973e9c 100644
>> --- a/drivers/gpu/drm/ttm/ttm_bo.c
>> +++ b/drivers/gpu/drm/ttm/ttm_bo.c
>> @@ -335,45 +335,45 @@ static int ttm_bo_handle_move_mem(struct
>> ttm_buffer_object *bo,
>>               goto out_err;
>>             if (mem->mem_type != TTM_PL_SYSTEM) {
>>               ret = ttm_tt_bind(bo->ttm, mem);
>>               if (ret)
>>                   goto out_err;
>>           }
>>             if (bo->mem.mem_type == TTM_PL_SYSTEM) {
>>               if (bdev->driver->move_notify)
>> -                bdev->driver->move_notify(bo, mem);
>> +                bdev->driver->move_notify(bo, evict, mem);
>>               bo->mem = *mem;
>>               mem->mm_node = NULL;
>>               goto moved;
>>           }
>>       }
>>         if (bdev->driver->move_notify)
>> -        bdev->driver->move_notify(bo, mem);
>> +        bdev->driver->move_notify(bo, evict, mem);
>>         if (!(old_man->flags & TTM_MEMTYPE_FLAG_FIXED) &&
>>           !(new_man->flags & TTM_MEMTYPE_FLAG_FIXED))
>>           ret = ttm_bo_move_ttm(bo, interruptible, no_wait_gpu, mem);
>>       else if (bdev->driver->move)
>>           ret = bdev->driver->move(bo, evict, interruptible,
>>                        no_wait_gpu, mem);
>>       else
>>           ret = ttm_bo_move_memcpy(bo, interruptible, no_wait_gpu, mem);
>>         if (ret) {
>>           if (bdev->driver->move_notify) {
>>               struct ttm_mem_reg tmp_mem = *mem;
>>               *mem = bo->mem;
>>               bo->mem = tmp_mem;
>> -            bdev->driver->move_notify(bo, mem);
>> +            bdev->driver->move_notify(bo, false, mem);
>>               bo->mem = *mem;
>>               *mem = tmp_mem;
>>           }
>>             goto out_err;
>>       }
>>     moved:
>>       if (bo->evicted) {
>>           if (bdev->driver->invalidate_caches) {
>> @@ -407,21 +407,21 @@ static int ttm_bo_handle_move_mem(struct
>> ttm_buffer_object *bo,
>>    * Call bo::reserved.
>>    * Will release GPU memory type usage on destruction.
>>    * This is the place to put in driver specific hooks to release
>>    * driver private resources.
>>    * Will release the bo::reserved lock.
>>    */
>>     static void ttm_bo_cleanup_memtype_use(struct ttm_buffer_object *bo)
>>   {
>>       if (bo->bdev->driver->move_notify)
>> -        bo->bdev->driver->move_notify(bo, NULL);
>> +        bo->bdev->driver->move_notify(bo, false, NULL);
>>         ttm_tt_destroy(bo->ttm);
>>       bo->ttm = NULL;
>>       ttm_bo_mem_put(bo, &bo->mem);
>>         ww_mutex_unlock (&bo->resv->lock);
>>   }
>>     static void ttm_bo_flush_all_fences(struct ttm_buffer_object *bo)
>>   {
>> diff --git a/drivers/gpu/drm/virtio/virtgpu_ttm.c
>> b/drivers/gpu/drm/virtio/virtgpu_ttm.c
>> index 4a1de9f..10387d7 100644
>> --- a/drivers/gpu/drm/virtio/virtgpu_ttm.c
>> +++ b/drivers/gpu/drm/virtio/virtgpu_ttm.c
>> @@ -379,20 +379,21 @@ static int virtio_gpu_bo_move(struct
>> ttm_buffer_object *bo,
>>         ret = ttm_bo_wait(bo, interruptible, no_wait_gpu);
>>       if (ret)
>>           return ret;
>>         virtio_gpu_move_null(bo, new_mem);
>>       return 0;
>>   }
>>     static void virtio_gpu_bo_move_notify(struct ttm_buffer_object *tbo,
>> +                      bool evict,
>>                         struct ttm_mem_reg *new_mem)
>>   {
>>       struct virtio_gpu_object *bo;
>>       struct virtio_gpu_device *vgdev;
>>         bo = container_of(tbo, struct virtio_gpu_object, tbo);
>>       vgdev = (struct virtio_gpu_device *)bo->gem_base.dev->dev_private;
>>         if (!new_mem || (new_mem->placement & TTM_PL_FLAG_SYSTEM)) {
>>           if (bo->hw_res_handle)
>> diff --git a/drivers/gpu/drm/vmwgfx/vmwgfx_buffer.c
>> b/drivers/gpu/drm/vmwgfx/vmwgfx_buffer.c
>> index c894a48..caa279b 100644
>> --- a/drivers/gpu/drm/vmwgfx/vmwgfx_buffer.c
>> +++ b/drivers/gpu/drm/vmwgfx/vmwgfx_buffer.c
>> @@ -818,20 +818,21 @@ static int vmw_ttm_fault_reserve_notify(struct
>> ttm_buffer_object *bo)
>>    * vmw_move_notify - TTM move_notify_callback
>>    *
>>    * @bo: The TTM buffer object about to move.
>>    * @mem: The struct ttm_mem_reg indicating to what memory
>>    *       region the move is taking place.
>>    *
>>    * Calls move_notify for all subsystems needing it.
>>    * (currently only resources).
>>    */
>>   static void vmw_move_notify(struct ttm_buffer_object *bo,
>> +                bool evict,
>>                   struct ttm_mem_reg *mem)
>>   {
>>       vmw_resource_move_notify(bo, mem);
>>       vmw_query_move_notify(bo, mem);
>>   }
>>       /**
>>    * vmw_swap_notify - TTM move_notify_callback
>>    *
>> diff --git a/include/drm/ttm/ttm_bo_driver.h
>> b/include/drm/ttm/ttm_bo_driver.h
>> index cdbdb40..8b4bec7 100644
>> --- a/include/drm/ttm/ttm_bo_driver.h
>> +++ b/include/drm/ttm/ttm_bo_driver.h
>> @@ -424,23 +424,29 @@ struct ttm_bo_driver {
>>        *
>>        * Called from the map / write / read methods to verify that the
>>        * caller is permitted to access the buffer object.
>>        * This member may be set to NULL, which will refuse this kind of
>>        * access for all buffer objects.
>>        * This function should return 0 if access is granted, -EPERM
>> otherwise.
>>        */
>>       int (*verify_access)(struct ttm_buffer_object *bo,
>>                    struct file *filp);
>>   -    /* hook to notify driver about a driver move so it
>> -     * can do tiling things */
>> +    /**
>> +     * Hook to notify driver about a driver move so it
>> +     * can do tiling things and book-keeping.
>> +     *
>> +     * @evict: whether this move is evicting the buffer from the
>> graphics
>> +     * address space
>> +     */
>>       void (*move_notify)(struct ttm_buffer_object *bo,
>> +                bool evict,
>>                   struct ttm_mem_reg *new_mem);
>>       /* notify the driver we are taking a fault on this BO
>>        * and have reserved it */
>>       int (*fault_reserve_notify)(struct ttm_buffer_object *bo);
>>         /**
>>        * notify the driver that we're about to swap out this bo
>>        */
>>       void (*swap_notify)(struct ttm_buffer_object *bo);
>>
>
_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply	[flat|nested] 15+ messages in thread

* Re: [PATCH 1/5] drm/ttm: add evict parameter to ttm_bo_driver::move_notify
       [not found]             ` <8f68adf1-52bc-b2ef-2fa7-79a8f9695164-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
@ 2017-01-03 11:42               ` Christian König
  0 siblings, 0 replies; 15+ messages in thread
From: Christian König @ 2017-01-03 11:42 UTC (permalink / raw)
  To: Nicolai Hähnle, zhoucm1, amd-gfx-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW
  Cc: Nicolai Hähnle, dri-devel-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW

Am 21.12.2016 um 16:12 schrieb Nicolai Hähnle:
> On 16.12.2016 03:49, zhoucm1 wrote:
>> On 2016年12月16日 01:10, Nicolai Hähnle wrote:
>>> From: Nicolai Hähnle <nicolai.haehnle@amd.com>
>>>
>>> Ensure that the driver can listen to evictions even when they don't
>>> take the
>>> path through ttm_bo_driver::move.
>>>
>>> This is crucial for amdgpu, which relies on an eviction counter to skip
>>> re-binding page tables when possible.
>>>
>>> Signed-off-by: Nicolai Hähnle <nicolai.haehnle@amd.com>
>> Acked-by: Chunming Zhou <david1.zhou@amd.com>

Feel free to add my Reviewed-by: Christian König 
<christian.koenig@amd.com> to patch #1-#4 and V2 of patch #5 as well.

>
> Thanks. Ping for feedback from non-AMD people?

Ping once more. Would be nice if somebody else can take a look as well.

Christian.

>
> Nicolai
>
>
>>> ---
>>>   drivers/gpu/drm/amd/amdgpu/amdgpu_object.c |  1 +
>>>   drivers/gpu/drm/amd/amdgpu/amdgpu_object.h |  3 ++-
>>>   drivers/gpu/drm/nouveau/nouveau_bo.c       |  3 ++-
>>>   drivers/gpu/drm/qxl/qxl_ttm.c              |  1 +
>>>   drivers/gpu/drm/radeon/radeon_object.c     |  1 +
>>>   drivers/gpu/drm/radeon/radeon_object.h     |  1 +
>>>   drivers/gpu/drm/ttm/ttm_bo.c               |  8 ++++----
>>>   drivers/gpu/drm/virtio/virtgpu_ttm.c       |  1 +
>>>   drivers/gpu/drm/vmwgfx/vmwgfx_buffer.c     |  1 +
>>>   include/drm/ttm/ttm_bo_driver.h            | 10 ++++++++--
>>>   10 files changed, 22 insertions(+), 8 deletions(-)
>>>
>>> diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_object.c
>>> b/drivers/gpu/drm/amd/amdgpu/amdgpu_object.c
>>> index bf79b73..c29db99 100644
>>> --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_object.c
>>> +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_object.c
>>> @@ -842,20 +842,21 @@ int amdgpu_bo_get_metadata(struct amdgpu_bo *bo,
>>> void *buffer,
>>>         if (metadata_size)
>>>           *metadata_size = bo->metadata_size;
>>>       if (flags)
>>>           *flags = bo->metadata_flags;
>>>         return 0;
>>>   }
>>>     void amdgpu_bo_move_notify(struct ttm_buffer_object *bo,
>>> +               bool evict,
>>>                  struct ttm_mem_reg *new_mem)
>>>   {
>>>       struct amdgpu_device *adev = amdgpu_ttm_adev(bo->bdev);
>>>       struct amdgpu_bo *abo;
>>>       struct ttm_mem_reg *old_mem = &bo->mem;
>>>         if (!amdgpu_ttm_bo_is_amdgpu_bo(bo))
>>>           return;
>>>         abo = container_of(bo, struct amdgpu_bo, tbo);
>>> diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_object.h
>>> b/drivers/gpu/drm/amd/amdgpu/amdgpu_object.h
>>> index 5cbf59e..4306b2f 100644
>>> --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_object.h
>>> +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_object.h
>>> @@ -148,21 +148,22 @@ void amdgpu_bo_fini(struct amdgpu_device *adev);
>>>   int amdgpu_bo_fbdev_mmap(struct amdgpu_bo *bo,
>>>                   struct vm_area_struct *vma);
>>>   int amdgpu_bo_set_tiling_flags(struct amdgpu_bo *bo, u64 
>>> tiling_flags);
>>>   void amdgpu_bo_get_tiling_flags(struct amdgpu_bo *bo, u64
>>> *tiling_flags);
>>>   int amdgpu_bo_set_metadata (struct amdgpu_bo *bo, void *metadata,
>>>                   uint32_t metadata_size, uint64_t flags);
>>>   int amdgpu_bo_get_metadata(struct amdgpu_bo *bo, void *buffer,
>>>                  size_t buffer_size, uint32_t *metadata_size,
>>>                  uint64_t *flags);
>>>   void amdgpu_bo_move_notify(struct ttm_buffer_object *bo,
>>> -                  struct ttm_mem_reg *new_mem);
>>> +               bool evict,
>>> +               struct ttm_mem_reg *new_mem);
>>>   int amdgpu_bo_fault_reserve_notify(struct ttm_buffer_object *bo);
>>>   void amdgpu_bo_fence(struct amdgpu_bo *bo, struct dma_fence *fence,
>>>                bool shared);
>>>   u64 amdgpu_bo_gpu_offset(struct amdgpu_bo *bo);
>>>   int amdgpu_bo_backup_to_shadow(struct amdgpu_device *adev,
>>>                      struct amdgpu_ring *ring,
>>>                      struct amdgpu_bo *bo,
>>>                      struct reservation_object *resv,
>>>                      struct dma_fence **fence, bool direct);
>>>   int amdgpu_bo_restore_from_shadow(struct amdgpu_device *adev,
>>> diff --git a/drivers/gpu/drm/nouveau/nouveau_bo.c
>>> b/drivers/gpu/drm/nouveau/nouveau_bo.c
>>> index e0c0007..6fa1521 100644
>>> --- a/drivers/gpu/drm/nouveau/nouveau_bo.c
>>> +++ b/drivers/gpu/drm/nouveau/nouveau_bo.c
>>> @@ -1187,21 +1187,22 @@ nouveau_bo_move_flips(struct ttm_buffer_object
>>> *bo, bool evict, bool intr,
>>>       ret = nouveau_bo_move_m2mf(bo, true, intr, no_wait_gpu, new_mem);
>>>       if (ret)
>>>           goto out;
>>>     out:
>>>       ttm_bo_mem_put(bo, &tmp_mem);
>>>       return ret;
>>>   }
>>>     static void
>>> -nouveau_bo_move_ntfy(struct ttm_buffer_object *bo, struct ttm_mem_reg
>>> *new_mem)
>>> +nouveau_bo_move_ntfy(struct ttm_buffer_object *bo, bool evict,
>>> +             struct ttm_mem_reg *new_mem)
>>>   {
>>>       struct nouveau_bo *nvbo = nouveau_bo(bo);
>>>       struct nvkm_vma *vma;
>>>         /* ttm can now (stupidly) pass the driver bos it didn't
>>> create... */
>>>       if (bo->destroy != nouveau_bo_del_ttm)
>>>           return;
>>>         list_for_each_entry(vma, &nvbo->vma_list, head) {
>>>           if (new_mem && new_mem->mem_type != TTM_PL_SYSTEM &&
>>> diff --git a/drivers/gpu/drm/qxl/qxl_ttm.c
>>> b/drivers/gpu/drm/qxl/qxl_ttm.c
>>> index 1176133..f3939a9 100644
>>> --- a/drivers/gpu/drm/qxl/qxl_ttm.c
>>> +++ b/drivers/gpu/drm/qxl/qxl_ttm.c
>>> @@ -360,20 +360,21 @@ static int qxl_bo_move(struct ttm_buffer_object
>>> *bo,
>>>         if (old_mem->mem_type == TTM_PL_SYSTEM && bo->ttm == NULL) {
>>>           qxl_move_null(bo, new_mem);
>>>           return 0;
>>>       }
>>>       return ttm_bo_move_memcpy(bo, interruptible, no_wait_gpu,
>>>                     new_mem);
>>>   }
>>>     static void qxl_bo_move_notify(struct ttm_buffer_object *bo,
>>> +                   bool evict,
>>>                      struct ttm_mem_reg *new_mem)
>>>   {
>>>       struct qxl_bo *qbo;
>>>       struct qxl_device *qdev;
>>>         if (!qxl_ttm_bo_is_qxl_bo(bo))
>>>           return;
>>>       qbo = to_qxl_bo(bo);
>>>       qdev = qbo->gem_base.dev->dev_private;
>>>   diff --git a/drivers/gpu/drm/radeon/radeon_object.c
>>> b/drivers/gpu/drm/radeon/radeon_object.c
>>> index 41b72ce..74b2760 100644
>>> --- a/drivers/gpu/drm/radeon/radeon_object.c
>>> +++ b/drivers/gpu/drm/radeon/radeon_object.c
>>> @@ -758,20 +758,21 @@ int radeon_bo_check_tiling(struct radeon_bo *bo,
>>> bool has_moved,
>>>           return 0;
>>>       }
>>>         if ((bo->surface_reg >= 0) && !has_moved)
>>>           return 0;
>>>         return radeon_bo_get_surface_reg(bo);
>>>   }
>>>     void radeon_bo_move_notify(struct ttm_buffer_object *bo,
>>> +               bool evict,
>>>                  struct ttm_mem_reg *new_mem)
>>>   {
>>>       struct radeon_bo *rbo;
>>>         if (!radeon_ttm_bo_is_radeon_bo(bo))
>>>           return;
>>>         rbo = container_of(bo, struct radeon_bo, tbo);
>>>       radeon_bo_check_tiling(rbo, 0, 1);
>>>       radeon_vm_bo_invalidate(rbo->rdev, rbo);
>>> diff --git a/drivers/gpu/drm/radeon/radeon_object.h
>>> b/drivers/gpu/drm/radeon/radeon_object.h
>>> index a10bb3d..9ffd821 100644
>>> --- a/drivers/gpu/drm/radeon/radeon_object.h
>>> +++ b/drivers/gpu/drm/radeon/radeon_object.h
>>> @@ -143,20 +143,21 @@ extern void radeon_bo_fini(struct radeon_device
>>> *rdev);
>>>   extern int radeon_bo_list_validate(struct radeon_device *rdev,
>>>                      struct ww_acquire_ctx *ticket,
>>>                      struct list_head *head, int ring);
>>>   extern int radeon_bo_set_tiling_flags(struct radeon_bo *bo,
>>>                   u32 tiling_flags, u32 pitch);
>>>   extern void radeon_bo_get_tiling_flags(struct radeon_bo *bo,
>>>                   u32 *tiling_flags, u32 *pitch);
>>>   extern int radeon_bo_check_tiling(struct radeon_bo *bo, bool 
>>> has_moved,
>>>                   bool force_drop);
>>>   extern void radeon_bo_move_notify(struct ttm_buffer_object *bo,
>>> +                  bool evict,
>>>                     struct ttm_mem_reg *new_mem);
>>>   extern int radeon_bo_fault_reserve_notify(struct ttm_buffer_object
>>> *bo);
>>>   extern int radeon_bo_get_surface_reg(struct radeon_bo *bo);
>>>   extern void radeon_bo_fence(struct radeon_bo *bo, struct
>>> radeon_fence *fence,
>>>                   bool shared);
>>>     /*
>>>    * sub allocation
>>>    */
>>>   diff --git a/drivers/gpu/drm/ttm/ttm_bo.c
>>> b/drivers/gpu/drm/ttm/ttm_bo.c
>>> index d506361..d4973e9c 100644
>>> --- a/drivers/gpu/drm/ttm/ttm_bo.c
>>> +++ b/drivers/gpu/drm/ttm/ttm_bo.c
>>> @@ -335,45 +335,45 @@ static int ttm_bo_handle_move_mem(struct
>>> ttm_buffer_object *bo,
>>>               goto out_err;
>>>             if (mem->mem_type != TTM_PL_SYSTEM) {
>>>               ret = ttm_tt_bind(bo->ttm, mem);
>>>               if (ret)
>>>                   goto out_err;
>>>           }
>>>             if (bo->mem.mem_type == TTM_PL_SYSTEM) {
>>>               if (bdev->driver->move_notify)
>>> -                bdev->driver->move_notify(bo, mem);
>>> +                bdev->driver->move_notify(bo, evict, mem);
>>>               bo->mem = *mem;
>>>               mem->mm_node = NULL;
>>>               goto moved;
>>>           }
>>>       }
>>>         if (bdev->driver->move_notify)
>>> -        bdev->driver->move_notify(bo, mem);
>>> +        bdev->driver->move_notify(bo, evict, mem);
>>>         if (!(old_man->flags & TTM_MEMTYPE_FLAG_FIXED) &&
>>>           !(new_man->flags & TTM_MEMTYPE_FLAG_FIXED))
>>>           ret = ttm_bo_move_ttm(bo, interruptible, no_wait_gpu, mem);
>>>       else if (bdev->driver->move)
>>>           ret = bdev->driver->move(bo, evict, interruptible,
>>>                        no_wait_gpu, mem);
>>>       else
>>>           ret = ttm_bo_move_memcpy(bo, interruptible, no_wait_gpu, 
>>> mem);
>>>         if (ret) {
>>>           if (bdev->driver->move_notify) {
>>>               struct ttm_mem_reg tmp_mem = *mem;
>>>               *mem = bo->mem;
>>>               bo->mem = tmp_mem;
>>> -            bdev->driver->move_notify(bo, mem);
>>> +            bdev->driver->move_notify(bo, false, mem);
>>>               bo->mem = *mem;
>>>               *mem = tmp_mem;
>>>           }
>>>             goto out_err;
>>>       }
>>>     moved:
>>>       if (bo->evicted) {
>>>           if (bdev->driver->invalidate_caches) {
>>> @@ -407,21 +407,21 @@ static int ttm_bo_handle_move_mem(struct
>>> ttm_buffer_object *bo,
>>>    * Call bo::reserved.
>>>    * Will release GPU memory type usage on destruction.
>>>    * This is the place to put in driver specific hooks to release
>>>    * driver private resources.
>>>    * Will release the bo::reserved lock.
>>>    */
>>>     static void ttm_bo_cleanup_memtype_use(struct ttm_buffer_object 
>>> *bo)
>>>   {
>>>       if (bo->bdev->driver->move_notify)
>>> -        bo->bdev->driver->move_notify(bo, NULL);
>>> +        bo->bdev->driver->move_notify(bo, false, NULL);
>>>         ttm_tt_destroy(bo->ttm);
>>>       bo->ttm = NULL;
>>>       ttm_bo_mem_put(bo, &bo->mem);
>>>         ww_mutex_unlock (&bo->resv->lock);
>>>   }
>>>     static void ttm_bo_flush_all_fences(struct ttm_buffer_object *bo)
>>>   {
>>> diff --git a/drivers/gpu/drm/virtio/virtgpu_ttm.c
>>> b/drivers/gpu/drm/virtio/virtgpu_ttm.c
>>> index 4a1de9f..10387d7 100644
>>> --- a/drivers/gpu/drm/virtio/virtgpu_ttm.c
>>> +++ b/drivers/gpu/drm/virtio/virtgpu_ttm.c
>>> @@ -379,20 +379,21 @@ static int virtio_gpu_bo_move(struct
>>> ttm_buffer_object *bo,
>>>         ret = ttm_bo_wait(bo, interruptible, no_wait_gpu);
>>>       if (ret)
>>>           return ret;
>>>         virtio_gpu_move_null(bo, new_mem);
>>>       return 0;
>>>   }
>>>     static void virtio_gpu_bo_move_notify(struct ttm_buffer_object 
>>> *tbo,
>>> +                      bool evict,
>>>                         struct ttm_mem_reg *new_mem)
>>>   {
>>>       struct virtio_gpu_object *bo;
>>>       struct virtio_gpu_device *vgdev;
>>>         bo = container_of(tbo, struct virtio_gpu_object, tbo);
>>>       vgdev = (struct virtio_gpu_device 
>>> *)bo->gem_base.dev->dev_private;
>>>         if (!new_mem || (new_mem->placement & TTM_PL_FLAG_SYSTEM)) {
>>>           if (bo->hw_res_handle)
>>> diff --git a/drivers/gpu/drm/vmwgfx/vmwgfx_buffer.c
>>> b/drivers/gpu/drm/vmwgfx/vmwgfx_buffer.c
>>> index c894a48..caa279b 100644
>>> --- a/drivers/gpu/drm/vmwgfx/vmwgfx_buffer.c
>>> +++ b/drivers/gpu/drm/vmwgfx/vmwgfx_buffer.c
>>> @@ -818,20 +818,21 @@ static int vmw_ttm_fault_reserve_notify(struct
>>> ttm_buffer_object *bo)
>>>    * vmw_move_notify - TTM move_notify_callback
>>>    *
>>>    * @bo: The TTM buffer object about to move.
>>>    * @mem: The struct ttm_mem_reg indicating to what memory
>>>    *       region the move is taking place.
>>>    *
>>>    * Calls move_notify for all subsystems needing it.
>>>    * (currently only resources).
>>>    */
>>>   static void vmw_move_notify(struct ttm_buffer_object *bo,
>>> +                bool evict,
>>>                   struct ttm_mem_reg *mem)
>>>   {
>>>       vmw_resource_move_notify(bo, mem);
>>>       vmw_query_move_notify(bo, mem);
>>>   }
>>>       /**
>>>    * vmw_swap_notify - TTM move_notify_callback
>>>    *
>>> diff --git a/include/drm/ttm/ttm_bo_driver.h
>>> b/include/drm/ttm/ttm_bo_driver.h
>>> index cdbdb40..8b4bec7 100644
>>> --- a/include/drm/ttm/ttm_bo_driver.h
>>> +++ b/include/drm/ttm/ttm_bo_driver.h
>>> @@ -424,23 +424,29 @@ struct ttm_bo_driver {
>>>        *
>>>        * Called from the map / write / read methods to verify that the
>>>        * caller is permitted to access the buffer object.
>>>        * This member may be set to NULL, which will refuse this kind of
>>>        * access for all buffer objects.
>>>        * This function should return 0 if access is granted, -EPERM
>>> otherwise.
>>>        */
>>>       int (*verify_access)(struct ttm_buffer_object *bo,
>>>                    struct file *filp);
>>>   -    /* hook to notify driver about a driver move so it
>>> -     * can do tiling things */
>>> +    /**
>>> +     * Hook to notify driver about a driver move so it
>>> +     * can do tiling things and book-keeping.
>>> +     *
>>> +     * @evict: whether this move is evicting the buffer from the
>>> graphics
>>> +     * address space
>>> +     */
>>>       void (*move_notify)(struct ttm_buffer_object *bo,
>>> +                bool evict,
>>>                   struct ttm_mem_reg *new_mem);
>>>       /* notify the driver we are taking a fault on this BO
>>>        * and have reserved it */
>>>       int (*fault_reserve_notify)(struct ttm_buffer_object *bo);
>>>         /**
>>>        * notify the driver that we're about to swap out this bo
>>>        */
>>>       void (*swap_notify)(struct ttm_buffer_object *bo);
>>>
>>
> _______________________________________________
> amd-gfx mailing list
> amd-gfx@lists.freedesktop.org
> https://lists.freedesktop.org/mailman/listinfo/amd-gfx


_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply	[flat|nested] 15+ messages in thread

end of thread, other threads:[~2017-01-03 11:42 UTC | newest]

Thread overview: 15+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2016-12-15 17:10 [PATCH 0/5] drm/ttm, amdgpu: fix crashes due to shadow page table evictions Nicolai Hähnle
     [not found] ` <1481821815-9971-1-git-send-email-nhaehnle-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
2016-12-15 17:10   ` [PATCH 1/5] drm/ttm: add evict parameter to ttm_bo_driver::move_notify Nicolai Hähnle
     [not found]     ` <1481821815-9971-2-git-send-email-nhaehnle-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
2016-12-16  2:49       ` zhoucm1
     [not found]         ` <5853564E.5020005-5C7GfCeVMHo@public.gmane.org>
2016-12-21 15:12           ` Nicolai Hähnle
     [not found]             ` <8f68adf1-52bc-b2ef-2fa7-79a8f9695164-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
2017-01-03 11:42               ` Christian König
2016-12-15 17:10   ` [PATCH 2/5] drm/amd/amdgpu: move eviction counting to amdgpu_bo_move_notify Nicolai Hähnle
     [not found]     ` <1481821815-9971-3-git-send-email-nhaehnle-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
2016-12-16  2:50       ` zhoucm1
2016-12-15 17:10   ` [PATCH 3/5] drm/amd/amdgpu: add amdgpu_bo_gpu_accessible helper function Nicolai Hähnle
     [not found]     ` <1481821815-9971-4-git-send-email-nhaehnle-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
2016-12-16  2:48       ` zhoucm1
2016-12-15 17:10   ` [PATCH 4/5] drm/amd/amdgpu: add check that shadow page directory is GPU-accessible Nicolai Hähnle
     [not found]     ` <1481821815-9971-5-git-send-email-nhaehnle-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
2016-12-16  2:49       ` zhoucm1
2016-12-15 17:10   ` [PATCH 5/5] drm/amd/amdgpu: add check that shadow page tables are GPU-accessible Nicolai Hähnle
     [not found]     ` <1481821815-9971-6-git-send-email-nhaehnle-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
2016-12-15 17:59       ` [PATCH v2] " Nicolai Hähnle
     [not found]         ` <1481824743-11139-1-git-send-email-nhaehnle-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
2016-12-16  2:48           ` zhoucm1
2016-12-16  2:07       ` [PATCH 5/5] " zhoucm1

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.