[PATCH 0/1] Fix gem_ctx_persistence failures with GuC submission

dri-devel.lists.freedesktop.org archive mirror
 help / color / mirror / Atom feed

* [PATCH 0/1] Fix gem_ctx_persistence failures with GuC submission
@ 2021-07-29  0:33 Matthew Brost
  2021-07-29  0:34 ` [PATCH 1/1] drm/i915: Check if engine has heartbeat when closing a context Matthew Brost
  2021-08-10  6:38 ` [PATCH 0/1] Fix gem_ctx_persistence failures with GuC submission Daniel Vetter
  0 siblings, 2 replies; 16+ messages in thread
From: Matthew Brost @ 2021-07-29  0:33 UTC (permalink / raw)
  To: intel-gfx, dri-devel

Should fix below failures with GuC submission for the following tests:
gem_exec_balancer --r noheartbeat
gem_ctx_persistence --r heartbeat-close

Not going to fix:
gem_ctx_persistence --r heartbeat-many
gem_ctx_persistence --r heartbeat-stop

As the above tests change the heartbeat value to 0 (off) after the
context is closed and we have no way to detect that with GuC submission
unless we keep a list of closed but running contexts which seems like
overkill for a non-real world use case. We likely should just skip these
tests with GuC submission.

Signed-off-by: Matthew Brost <matthew.brost@intel.com>

Matthew Brost (1):
  drm/i915: Check if engine has heartbeat when closing a context

 drivers/gpu/drm/i915/gem/i915_gem_context.c   |  5 +++--
 drivers/gpu/drm/i915/gt/intel_context_types.h |  2 ++
 drivers/gpu/drm/i915/gt/intel_engine.h        | 21 ++-----------------
 .../drm/i915/gt/intel_execlists_submission.c  | 14 +++++++++++++
 .../gpu/drm/i915/gt/uc/intel_guc_submission.c |  6 +++++-
 .../gpu/drm/i915/gt/uc/intel_guc_submission.h |  2 --
 6 files changed, 26 insertions(+), 24 deletions(-)

-- 
2.28.0


^ permalink raw reply	[flat|nested] 16+ messages in thread

* [PATCH 1/1] drm/i915: Check if engine has heartbeat when closing a context
  2021-07-29  0:33 [PATCH 0/1] Fix gem_ctx_persistence failures with GuC submission Matthew Brost
@ 2021-07-29  0:34 ` Matthew Brost
  2021-07-30  0:13   ` [Intel-gfx] " John Harrison
  2021-08-10  6:38 ` [PATCH 0/1] Fix gem_ctx_persistence failures with GuC submission Daniel Vetter
  1 sibling, 1 reply; 16+ messages in thread
From: Matthew Brost @ 2021-07-29  0:34 UTC (permalink / raw)
  To: intel-gfx, dri-devel

If an engine associated with a context does not have a heartbeat, ban it
immediately. This is needed for GuC submission as a idle pulse doesn't
kick the context off the hardware where it then can check for a
heartbeat and ban the context.

This patch also updates intel_engine_has_heartbeat to be a vfunc as we
now need to call this function on execlists virtual engines too.  

Signed-off-by: Matthew Brost <matthew.brost@intel.com>
---
 drivers/gpu/drm/i915/gem/i915_gem_context.c   |  5 +++--
 drivers/gpu/drm/i915/gt/intel_context_types.h |  2 ++
 drivers/gpu/drm/i915/gt/intel_engine.h        | 21 ++-----------------
 .../drm/i915/gt/intel_execlists_submission.c  | 14 +++++++++++++
 .../gpu/drm/i915/gt/uc/intel_guc_submission.c |  6 +++++-
 .../gpu/drm/i915/gt/uc/intel_guc_submission.h |  2 --
 6 files changed, 26 insertions(+), 24 deletions(-)

diff --git a/drivers/gpu/drm/i915/gem/i915_gem_context.c b/drivers/gpu/drm/i915/gem/i915_gem_context.c
index 9c3672bac0e2..b8e01c5ba9e5 100644
--- a/drivers/gpu/drm/i915/gem/i915_gem_context.c
+++ b/drivers/gpu/drm/i915/gem/i915_gem_context.c
@@ -1090,8 +1090,9 @@ static void kill_engines(struct i915_gem_engines *engines, bool ban)
 	 */
 	for_each_gem_engine(ce, engines, it) {
 		struct intel_engine_cs *engine;
+		bool local_ban = ban || !intel_engine_has_heartbeat(ce->engine);
 
-		if (ban && intel_context_ban(ce, NULL))
+		if (local_ban && intel_context_ban(ce, NULL))
 			continue;
 
 		/*
@@ -1104,7 +1105,7 @@ static void kill_engines(struct i915_gem_engines *engines, bool ban)
 		engine = active_engine(ce);
 
 		/* First attempt to gracefully cancel the context */
-		if (engine && !__cancel_engine(engine) && ban)
+		if (engine && !__cancel_engine(engine) && local_ban)
 			/*
 			 * If we are unable to send a preemptive pulse to bump
 			 * the context from the GPU, we have to resort to a full
diff --git a/drivers/gpu/drm/i915/gt/intel_context_types.h b/drivers/gpu/drm/i915/gt/intel_context_types.h
index e54351a170e2..65f2eb2a78e4 100644
--- a/drivers/gpu/drm/i915/gt/intel_context_types.h
+++ b/drivers/gpu/drm/i915/gt/intel_context_types.h
@@ -55,6 +55,8 @@ struct intel_context_ops {
 	void (*reset)(struct intel_context *ce);
 	void (*destroy)(struct kref *kref);
 
+	bool (*has_heartbeat)(const struct intel_engine_cs *engine);
+
 	/* virtual engine/context interface */
 	struct intel_context *(*create_virtual)(struct intel_engine_cs **engine,
 						unsigned int count);
diff --git a/drivers/gpu/drm/i915/gt/intel_engine.h b/drivers/gpu/drm/i915/gt/intel_engine.h
index c2a5640ae055..1b11a808acc4 100644
--- a/drivers/gpu/drm/i915/gt/intel_engine.h
+++ b/drivers/gpu/drm/i915/gt/intel_engine.h
@@ -283,28 +283,11 @@ struct intel_context *
 intel_engine_create_virtual(struct intel_engine_cs **siblings,
 			    unsigned int count);
 
-static inline bool
-intel_virtual_engine_has_heartbeat(const struct intel_engine_cs *engine)
-{
-	/*
-	 * For non-GuC submission we expect the back-end to look at the
-	 * heartbeat status of the actual physical engine that the work
-	 * has been (or is being) scheduled on, so we should only reach
-	 * here with GuC submission enabled.
-	 */
-	GEM_BUG_ON(!intel_engine_uses_guc(engine));
-
-	return intel_guc_virtual_engine_has_heartbeat(engine);
-}
-
 static inline bool
 intel_engine_has_heartbeat(const struct intel_engine_cs *engine)
 {
-	if (!IS_ACTIVE(CONFIG_DRM_I915_HEARTBEAT_INTERVAL))
-		return false;
-
-	if (intel_engine_is_virtual(engine))
-		return intel_virtual_engine_has_heartbeat(engine);
+	if (engine->cops->has_heartbeat)
+		return engine->cops->has_heartbeat(engine);
 	else
 		return READ_ONCE(engine->props.heartbeat_interval_ms);
 }
diff --git a/drivers/gpu/drm/i915/gt/intel_execlists_submission.c b/drivers/gpu/drm/i915/gt/intel_execlists_submission.c
index de5f9c86b9a4..18005b5546b6 100644
--- a/drivers/gpu/drm/i915/gt/intel_execlists_submission.c
+++ b/drivers/gpu/drm/i915/gt/intel_execlists_submission.c
@@ -3619,6 +3619,18 @@ virtual_get_sibling(struct intel_engine_cs *engine, unsigned int sibling)
 	return ve->siblings[sibling];
 }
 
+static bool virtual_engine_has_heartbeat(const struct intel_engine_cs *ve)
+{
+	struct intel_engine_cs *engine;
+	intel_engine_mask_t tmp, mask = ve->mask;
+
+	for_each_engine_masked(engine, ve->gt, mask, tmp)
+		if (READ_ONCE(engine->props.heartbeat_interval_ms))
+			return true;
+
+	return false;
+}
+
 static const struct intel_context_ops virtual_context_ops = {
 	.flags = COPS_HAS_INFLIGHT,
 
@@ -3634,6 +3646,8 @@ static const struct intel_context_ops virtual_context_ops = {
 	.enter = virtual_context_enter,
 	.exit = virtual_context_exit,
 
+	.has_heartbeat = virtual_engine_has_heartbeat,
+
 	.destroy = virtual_context_destroy,
 
 	.get_sibling = virtual_get_sibling,
diff --git a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c
index 89ff0e4b4bc7..ae70bff3605f 100644
--- a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c
+++ b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c
@@ -2168,6 +2168,8 @@ static int guc_virtual_context_alloc(struct intel_context *ce)
 	return lrc_alloc(ce, engine);
 }
 
+static bool guc_virtual_engine_has_heartbeat(const struct intel_engine_cs *ve);
+
 static const struct intel_context_ops virtual_guc_context_ops = {
 	.alloc = guc_virtual_context_alloc,
 
@@ -2183,6 +2185,8 @@ static const struct intel_context_ops virtual_guc_context_ops = {
 	.enter = guc_virtual_context_enter,
 	.exit = guc_virtual_context_exit,
 
+	.has_heartbeat = guc_virtual_engine_has_heartbeat,
+
 	.sched_disable = guc_context_sched_disable,
 
 	.destroy = guc_context_destroy,
@@ -3029,7 +3033,7 @@ guc_create_virtual(struct intel_engine_cs **siblings, unsigned int count)
 	return ERR_PTR(err);
 }
 
-bool intel_guc_virtual_engine_has_heartbeat(const struct intel_engine_cs *ve)
+static bool guc_virtual_engine_has_heartbeat(const struct intel_engine_cs *ve)
 {
 	struct intel_engine_cs *engine;
 	intel_engine_mask_t tmp, mask = ve->mask;
diff --git a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h
index c7ef44fa0c36..c2afc3b88fd8 100644
--- a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h
+++ b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h
@@ -29,8 +29,6 @@ void intel_guc_dump_active_requests(struct intel_engine_cs *engine,
 				    struct i915_request *hung_rq,
 				    struct drm_printer *m);
 
-bool intel_guc_virtual_engine_has_heartbeat(const struct intel_engine_cs *ve);
-
 int intel_guc_wait_for_pending_msg(struct intel_guc *guc,
 				   atomic_t *wait_var,
 				   bool interruptible,
-- 
2.28.0


^ permalink raw reply related	[flat|nested] 16+ messages in thread

* Re: [Intel-gfx] [PATCH 1/1] drm/i915: Check if engine has heartbeat when closing a context
  2021-07-29  0:34 ` [PATCH 1/1] drm/i915: Check if engine has heartbeat when closing a context Matthew Brost
@ 2021-07-30  0:13   ` John Harrison
  2021-07-30  9:49     ` Tvrtko Ursulin
  0 siblings, 1 reply; 16+ messages in thread
From: John Harrison @ 2021-07-30  0:13 UTC (permalink / raw)
  To: Matthew Brost, intel-gfx, dri-devel

On 7/28/2021 17:34, Matthew Brost wrote:
> If an engine associated with a context does not have a heartbeat, ban it
> immediately. This is needed for GuC submission as a idle pulse doesn't
> kick the context off the hardware where it then can check for a
> heartbeat and ban the context.
It's worse than this. If the engine in question is an individual 
physical engine then sending a pulse (with sufficiently high priority) 
will pre-empt the engine and kick the context off. However, the GuC 
scheduler does not have hacks in it to check the state of the heartbeat 
or whether a context is actually a zombie or not. Thus, the context will 
get resubmitted to the hardware after the pulse completes and 
effectively nothing will have happened.

I would assume that the DRM scheduler which we are meant to be switching 
to for execlist as well as GuC submission is also unlikely to have hacks 
for zombie contexts and tests for whether the i915 specific heartbeat 
has been disabled since the context became a zombie. So when that switch 
happens, this test will also fail in execlist mode as well as GuC mode.

The choices I see here are to simply remove persistence completely (it 
is a basically a bug that became UAPI because it wasn't caught soon 
enough!) or to implement it in a way that does not require hacks in the 
back end scheduler. Apparently, the DRM scheduler is expected to allow 
zombie contexts to persist until the DRM file handle is closed. So 
presumably we will have to go with option two.

That means flagging a context as being a zombie when it is closed but 
still active. The driver would then add it to a zombie list owned by the 
DRM client object. When that client object is closed, i915 would go 
through the list and genuinely kill all the contexts. No back end 
scheduler hacks required and no intimate knowledge of the i915 heartbeat 
mechanism required either.

John.


>
> This patch also updates intel_engine_has_heartbeat to be a vfunc as we
> now need to call this function on execlists virtual engines too.
>
> Signed-off-by: Matthew Brost <matthew.brost@intel.com>
> ---
>   drivers/gpu/drm/i915/gem/i915_gem_context.c   |  5 +++--
>   drivers/gpu/drm/i915/gt/intel_context_types.h |  2 ++
>   drivers/gpu/drm/i915/gt/intel_engine.h        | 21 ++-----------------
>   .../drm/i915/gt/intel_execlists_submission.c  | 14 +++++++++++++
>   .../gpu/drm/i915/gt/uc/intel_guc_submission.c |  6 +++++-
>   .../gpu/drm/i915/gt/uc/intel_guc_submission.h |  2 --
>   6 files changed, 26 insertions(+), 24 deletions(-)
>
> diff --git a/drivers/gpu/drm/i915/gem/i915_gem_context.c b/drivers/gpu/drm/i915/gem/i915_gem_context.c
> index 9c3672bac0e2..b8e01c5ba9e5 100644
> --- a/drivers/gpu/drm/i915/gem/i915_gem_context.c
> +++ b/drivers/gpu/drm/i915/gem/i915_gem_context.c
> @@ -1090,8 +1090,9 @@ static void kill_engines(struct i915_gem_engines *engines, bool ban)
>   	 */
>   	for_each_gem_engine(ce, engines, it) {
>   		struct intel_engine_cs *engine;
> +		bool local_ban = ban || !intel_engine_has_heartbeat(ce->engine);
>   
> -		if (ban && intel_context_ban(ce, NULL))
> +		if (local_ban && intel_context_ban(ce, NULL))
>   			continue;
>   
>   		/*
> @@ -1104,7 +1105,7 @@ static void kill_engines(struct i915_gem_engines *engines, bool ban)
>   		engine = active_engine(ce);
>   
>   		/* First attempt to gracefully cancel the context */
> -		if (engine && !__cancel_engine(engine) && ban)
> +		if (engine && !__cancel_engine(engine) && local_ban)
>   			/*
>   			 * If we are unable to send a preemptive pulse to bump
>   			 * the context from the GPU, we have to resort to a full
> diff --git a/drivers/gpu/drm/i915/gt/intel_context_types.h b/drivers/gpu/drm/i915/gt/intel_context_types.h
> index e54351a170e2..65f2eb2a78e4 100644
> --- a/drivers/gpu/drm/i915/gt/intel_context_types.h
> +++ b/drivers/gpu/drm/i915/gt/intel_context_types.h
> @@ -55,6 +55,8 @@ struct intel_context_ops {
>   	void (*reset)(struct intel_context *ce);
>   	void (*destroy)(struct kref *kref);
>   
> +	bool (*has_heartbeat)(const struct intel_engine_cs *engine);
> +
>   	/* virtual engine/context interface */
>   	struct intel_context *(*create_virtual)(struct intel_engine_cs **engine,
>   						unsigned int count);
> diff --git a/drivers/gpu/drm/i915/gt/intel_engine.h b/drivers/gpu/drm/i915/gt/intel_engine.h
> index c2a5640ae055..1b11a808acc4 100644
> --- a/drivers/gpu/drm/i915/gt/intel_engine.h
> +++ b/drivers/gpu/drm/i915/gt/intel_engine.h
> @@ -283,28 +283,11 @@ struct intel_context *
>   intel_engine_create_virtual(struct intel_engine_cs **siblings,
>   			    unsigned int count);
>   
> -static inline bool
> -intel_virtual_engine_has_heartbeat(const struct intel_engine_cs *engine)
> -{
> -	/*
> -	 * For non-GuC submission we expect the back-end to look at the
> -	 * heartbeat status of the actual physical engine that the work
> -	 * has been (or is being) scheduled on, so we should only reach
> -	 * here with GuC submission enabled.
> -	 */
> -	GEM_BUG_ON(!intel_engine_uses_guc(engine));
> -
> -	return intel_guc_virtual_engine_has_heartbeat(engine);
> -}
> -
>   static inline bool
>   intel_engine_has_heartbeat(const struct intel_engine_cs *engine)
>   {
> -	if (!IS_ACTIVE(CONFIG_DRM_I915_HEARTBEAT_INTERVAL))
> -		return false;
> -
> -	if (intel_engine_is_virtual(engine))
> -		return intel_virtual_engine_has_heartbeat(engine);
> +	if (engine->cops->has_heartbeat)
> +		return engine->cops->has_heartbeat(engine);
>   	else
>   		return READ_ONCE(engine->props.heartbeat_interval_ms);
>   }
> diff --git a/drivers/gpu/drm/i915/gt/intel_execlists_submission.c b/drivers/gpu/drm/i915/gt/intel_execlists_submission.c
> index de5f9c86b9a4..18005b5546b6 100644
> --- a/drivers/gpu/drm/i915/gt/intel_execlists_submission.c
> +++ b/drivers/gpu/drm/i915/gt/intel_execlists_submission.c
> @@ -3619,6 +3619,18 @@ virtual_get_sibling(struct intel_engine_cs *engine, unsigned int sibling)
>   	return ve->siblings[sibling];
>   }
>   
> +static bool virtual_engine_has_heartbeat(const struct intel_engine_cs *ve)
> +{
> +	struct intel_engine_cs *engine;
> +	intel_engine_mask_t tmp, mask = ve->mask;
> +
> +	for_each_engine_masked(engine, ve->gt, mask, tmp)
> +		if (READ_ONCE(engine->props.heartbeat_interval_ms))
> +			return true;
> +
> +	return false;
> +}
> +
>   static const struct intel_context_ops virtual_context_ops = {
>   	.flags = COPS_HAS_INFLIGHT,
>   
> @@ -3634,6 +3646,8 @@ static const struct intel_context_ops virtual_context_ops = {
>   	.enter = virtual_context_enter,
>   	.exit = virtual_context_exit,
>   
> +	.has_heartbeat = virtual_engine_has_heartbeat,
> +
>   	.destroy = virtual_context_destroy,
>   
>   	.get_sibling = virtual_get_sibling,
> diff --git a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c
> index 89ff0e4b4bc7..ae70bff3605f 100644
> --- a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c
> +++ b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c
> @@ -2168,6 +2168,8 @@ static int guc_virtual_context_alloc(struct intel_context *ce)
>   	return lrc_alloc(ce, engine);
>   }
>   
> +static bool guc_virtual_engine_has_heartbeat(const struct intel_engine_cs *ve);
> +
>   static const struct intel_context_ops virtual_guc_context_ops = {
>   	.alloc = guc_virtual_context_alloc,
>   
> @@ -2183,6 +2185,8 @@ static const struct intel_context_ops virtual_guc_context_ops = {
>   	.enter = guc_virtual_context_enter,
>   	.exit = guc_virtual_context_exit,
>   
> +	.has_heartbeat = guc_virtual_engine_has_heartbeat,
> +
>   	.sched_disable = guc_context_sched_disable,
>   
>   	.destroy = guc_context_destroy,
> @@ -3029,7 +3033,7 @@ guc_create_virtual(struct intel_engine_cs **siblings, unsigned int count)
>   	return ERR_PTR(err);
>   }
>   
> -bool intel_guc_virtual_engine_has_heartbeat(const struct intel_engine_cs *ve)
> +static bool guc_virtual_engine_has_heartbeat(const struct intel_engine_cs *ve)
>   {
>   	struct intel_engine_cs *engine;
>   	intel_engine_mask_t tmp, mask = ve->mask;
> diff --git a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h
> index c7ef44fa0c36..c2afc3b88fd8 100644
> --- a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h
> +++ b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h
> @@ -29,8 +29,6 @@ void intel_guc_dump_active_requests(struct intel_engine_cs *engine,
>   				    struct i915_request *hung_rq,
>   				    struct drm_printer *m);
>   
> -bool intel_guc_virtual_engine_has_heartbeat(const struct intel_engine_cs *ve);
> -
>   int intel_guc_wait_for_pending_msg(struct intel_guc *guc,
>   				   atomic_t *wait_var,
>   				   bool interruptible,


^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: [Intel-gfx] [PATCH 1/1] drm/i915: Check if engine has heartbeat when closing a context
  2021-07-30  0:13   ` [Intel-gfx] " John Harrison
@ 2021-07-30  9:49     ` Tvrtko Ursulin
  2021-07-30 18:13       ` John Harrison
  2021-07-30 18:13       ` Matthew Brost
  0 siblings, 2 replies; 16+ messages in thread
From: Tvrtko Ursulin @ 2021-07-30  9:49 UTC (permalink / raw)
  To: John Harrison, Matthew Brost, intel-gfx, dri-devel


On 30/07/2021 01:13, John Harrison wrote:
> On 7/28/2021 17:34, Matthew Brost wrote:
>> If an engine associated with a context does not have a heartbeat, ban it
>> immediately. This is needed for GuC submission as a idle pulse doesn't
>> kick the context off the hardware where it then can check for a
>> heartbeat and ban the context.

Pulse, that is a request with I915_PRIORITY_BARRIER, does not preempt a 
running normal priority context?

Why does it matter then whether or not heartbeats are enabled - when 
heartbeat just ends up sending the same engine pulse (eventually, with 
raising priority)?

> It's worse than this. If the engine in question is an individual 
> physical engine then sending a pulse (with sufficiently high priority) 
> will pre-empt the engine and kick the context off. However, the GuC 

Why it is different for physical vs virtual, aren't both just 
schedulable contexts with different engine masks for what GuC is 
concerned? Oh, is it a matter of needing to send pulses to all engines 
which comprise a virtual one?

> scheduler does not have hacks in it to check the state of the heartbeat 
> or whether a context is actually a zombie or not. Thus, the context will 
> get resubmitted to the hardware after the pulse completes and 
> effectively nothing will have happened.
> 
> I would assume that the DRM scheduler which we are meant to be switching 
> to for execlist as well as GuC submission is also unlikely to have hacks 
> for zombie contexts and tests for whether the i915 specific heartbeat 
> has been disabled since the context became a zombie. So when that switch 
> happens, this test will also fail in execlist mode as well as GuC mode.
> 
> The choices I see here are to simply remove persistence completely (it 
> is a basically a bug that became UAPI because it wasn't caught soon 
> enough!) or to implement it in a way that does not require hacks in the 
> back end scheduler. Apparently, the DRM scheduler is expected to allow 
> zombie contexts to persist until the DRM file handle is closed. So 
> presumably we will have to go with option two.
> 
> That means flagging a context as being a zombie when it is closed but 
> still active. The driver would then add it to a zombie list owned by the 
> DRM client object. When that client object is closed, i915 would go 
> through the list and genuinely kill all the contexts. No back end 
> scheduler hacks required and no intimate knowledge of the i915 heartbeat 
> mechanism required either.
> 
> John.
> 
> 
>>
>> This patch also updates intel_engine_has_heartbeat to be a vfunc as we
>> now need to call this function on execlists virtual engines too.
>>
>> Signed-off-by: Matthew Brost <matthew.brost@intel.com>
>> ---
>>   drivers/gpu/drm/i915/gem/i915_gem_context.c   |  5 +++--
>>   drivers/gpu/drm/i915/gt/intel_context_types.h |  2 ++
>>   drivers/gpu/drm/i915/gt/intel_engine.h        | 21 ++-----------------
>>   .../drm/i915/gt/intel_execlists_submission.c  | 14 +++++++++++++
>>   .../gpu/drm/i915/gt/uc/intel_guc_submission.c |  6 +++++-
>>   .../gpu/drm/i915/gt/uc/intel_guc_submission.h |  2 --
>>   6 files changed, 26 insertions(+), 24 deletions(-)
>>
>> diff --git a/drivers/gpu/drm/i915/gem/i915_gem_context.c 
>> b/drivers/gpu/drm/i915/gem/i915_gem_context.c
>> index 9c3672bac0e2..b8e01c5ba9e5 100644
>> --- a/drivers/gpu/drm/i915/gem/i915_gem_context.c
>> +++ b/drivers/gpu/drm/i915/gem/i915_gem_context.c
>> @@ -1090,8 +1090,9 @@ static void kill_engines(struct i915_gem_engines 
>> *engines, bool ban)
>>        */
>>       for_each_gem_engine(ce, engines, it) {
>>           struct intel_engine_cs *engine;
>> +        bool local_ban = ban || !intel_engine_has_heartbeat(ce->engine);

In any case (pending me understanding what's really going on there), why 
would this check not be in kill_context with currently does this:

	bool ban = (!i915_gem_context_is_persistent(ctx) ||
		    !ctx->i915->params.enable_hangcheck);
...
		kill_engines(pos, ban);

So whether to ban decision would be consolidated to one place.

In fact, decision on whether to allow persistent is tied to 
enable_hangcheck, which also drives hearbeat emission. So perhaps one 
part of the correct fix is to extend the above (kill_context) ban 
criteria to include hearbeat values anyway. Otherwise isn't it a simple 
miss that this check fails to account to hearbeat disablement via sysfs?

Regards,

Tvrtko

>> -        if (ban && intel_context_ban(ce, NULL))
>> +        if (local_ban && intel_context_ban(ce, NULL))
>>               continue;
>>           /*
>> @@ -1104,7 +1105,7 @@ static void kill_engines(struct i915_gem_engines 
>> *engines, bool ban)
>>           engine = active_engine(ce);
>>           /* First attempt to gracefully cancel the context */
>> -        if (engine && !__cancel_engine(engine) && ban)
>> +        if (engine && !__cancel_engine(engine) && local_ban)
>>               /*
>>                * If we are unable to send a preemptive pulse to bump
>>                * the context from the GPU, we have to resort to a full
>> diff --git a/drivers/gpu/drm/i915/gt/intel_context_types.h 
>> b/drivers/gpu/drm/i915/gt/intel_context_types.h
>> index e54351a170e2..65f2eb2a78e4 100644
>> --- a/drivers/gpu/drm/i915/gt/intel_context_types.h
>> +++ b/drivers/gpu/drm/i915/gt/intel_context_types.h
>> @@ -55,6 +55,8 @@ struct intel_context_ops {
>>       void (*reset)(struct intel_context *ce);
>>       void (*destroy)(struct kref *kref);
>> +    bool (*has_heartbeat)(const struct intel_engine_cs *engine);
>> +
>>       /* virtual engine/context interface */
>>       struct intel_context *(*create_virtual)(struct intel_engine_cs 
>> **engine,
>>                           unsigned int count);
>> diff --git a/drivers/gpu/drm/i915/gt/intel_engine.h 
>> b/drivers/gpu/drm/i915/gt/intel_engine.h
>> index c2a5640ae055..1b11a808acc4 100644
>> --- a/drivers/gpu/drm/i915/gt/intel_engine.h
>> +++ b/drivers/gpu/drm/i915/gt/intel_engine.h
>> @@ -283,28 +283,11 @@ struct intel_context *
>>   intel_engine_create_virtual(struct intel_engine_cs **siblings,
>>                   unsigned int count);
>> -static inline bool
>> -intel_virtual_engine_has_heartbeat(const struct intel_engine_cs *engine)
>> -{
>> -    /*
>> -     * For non-GuC submission we expect the back-end to look at the
>> -     * heartbeat status of the actual physical engine that the work
>> -     * has been (or is being) scheduled on, so we should only reach
>> -     * here with GuC submission enabled.
>> -     */
>> -    GEM_BUG_ON(!intel_engine_uses_guc(engine));
>> -
>> -    return intel_guc_virtual_engine_has_heartbeat(engine);
>> -}
>> -
>>   static inline bool
>>   intel_engine_has_heartbeat(const struct intel_engine_cs *engine)
>>   {
>> -    if (!IS_ACTIVE(CONFIG_DRM_I915_HEARTBEAT_INTERVAL))
>> -        return false;
>> -
>> -    if (intel_engine_is_virtual(engine))
>> -        return intel_virtual_engine_has_heartbeat(engine);
>> +    if (engine->cops->has_heartbeat)
>> +        return engine->cops->has_heartbeat(engine);
>>       else
>>           return READ_ONCE(engine->props.heartbeat_interval_ms);
>>   }
>> diff --git a/drivers/gpu/drm/i915/gt/intel_execlists_submission.c 
>> b/drivers/gpu/drm/i915/gt/intel_execlists_submission.c
>> index de5f9c86b9a4..18005b5546b6 100644
>> --- a/drivers/gpu/drm/i915/gt/intel_execlists_submission.c
>> +++ b/drivers/gpu/drm/i915/gt/intel_execlists_submission.c
>> @@ -3619,6 +3619,18 @@ virtual_get_sibling(struct intel_engine_cs 
>> *engine, unsigned int sibling)
>>       return ve->siblings[sibling];
>>   }
>> +static bool virtual_engine_has_heartbeat(const struct intel_engine_cs 
>> *ve)
>> +{
>> +    struct intel_engine_cs *engine;
>> +    intel_engine_mask_t tmp, mask = ve->mask;
>> +
>> +    for_each_engine_masked(engine, ve->gt, mask, tmp)
>> +        if (READ_ONCE(engine->props.heartbeat_interval_ms))
>> +            return true;
>> +
>> +    return false;
>> +}
>> +
>>   static const struct intel_context_ops virtual_context_ops = {
>>       .flags = COPS_HAS_INFLIGHT,
>> @@ -3634,6 +3646,8 @@ static const struct intel_context_ops 
>> virtual_context_ops = {
>>       .enter = virtual_context_enter,
>>       .exit = virtual_context_exit,
>> +    .has_heartbeat = virtual_engine_has_heartbeat,
>> +
>>       .destroy = virtual_context_destroy,
>>       .get_sibling = virtual_get_sibling,
>> diff --git a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c 
>> b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c
>> index 89ff0e4b4bc7..ae70bff3605f 100644
>> --- a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c
>> +++ b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c
>> @@ -2168,6 +2168,8 @@ static int guc_virtual_context_alloc(struct 
>> intel_context *ce)
>>       return lrc_alloc(ce, engine);
>>   }
>> +static bool guc_virtual_engine_has_heartbeat(const struct 
>> intel_engine_cs *ve);
>> +
>>   static const struct intel_context_ops virtual_guc_context_ops = {
>>       .alloc = guc_virtual_context_alloc,
>> @@ -2183,6 +2185,8 @@ static const struct intel_context_ops 
>> virtual_guc_context_ops = {
>>       .enter = guc_virtual_context_enter,
>>       .exit = guc_virtual_context_exit,
>> +    .has_heartbeat = guc_virtual_engine_has_heartbeat,
>> +
>>       .sched_disable = guc_context_sched_disable,
>>       .destroy = guc_context_destroy,
>> @@ -3029,7 +3033,7 @@ guc_create_virtual(struct intel_engine_cs 
>> **siblings, unsigned int count)
>>       return ERR_PTR(err);
>>   }
>> -bool intel_guc_virtual_engine_has_heartbeat(const struct 
>> intel_engine_cs *ve)
>> +static bool guc_virtual_engine_has_heartbeat(const struct 
>> intel_engine_cs *ve)
>>   {
>>       struct intel_engine_cs *engine;
>>       intel_engine_mask_t tmp, mask = ve->mask;
>> diff --git a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h 
>> b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h
>> index c7ef44fa0c36..c2afc3b88fd8 100644
>> --- a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h
>> +++ b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h
>> @@ -29,8 +29,6 @@ void intel_guc_dump_active_requests(struct 
>> intel_engine_cs *engine,
>>                       struct i915_request *hung_rq,
>>                       struct drm_printer *m);
>> -bool intel_guc_virtual_engine_has_heartbeat(const struct 
>> intel_engine_cs *ve);
>> -
>>   int intel_guc_wait_for_pending_msg(struct intel_guc *guc,
>>                      atomic_t *wait_var,
>>                      bool interruptible,
> 
> _______________________________________________
> Intel-gfx mailing list
> Intel-gfx@lists.freedesktop.org
> https://lists.freedesktop.org/mailman/listinfo/intel-gfx

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: [Intel-gfx] [PATCH 1/1] drm/i915: Check if engine has heartbeat when closing a context
  2021-07-30  9:49     ` Tvrtko Ursulin
@ 2021-07-30 18:13       ` John Harrison
  2021-08-02  9:40         ` Tvrtko Ursulin
  2021-07-30 18:13       ` Matthew Brost
  1 sibling, 1 reply; 16+ messages in thread
From: John Harrison @ 2021-07-30 18:13 UTC (permalink / raw)
  To: Tvrtko Ursulin, Matthew Brost, intel-gfx, dri-devel

On 7/30/2021 02:49, Tvrtko Ursulin wrote:
> On 30/07/2021 01:13, John Harrison wrote:
>> On 7/28/2021 17:34, Matthew Brost wrote:
>>> If an engine associated with a context does not have a heartbeat, 
>>> ban it
>>> immediately. This is needed for GuC submission as a idle pulse doesn't
>>> kick the context off the hardware where it then can check for a
>>> heartbeat and ban the context.
>
> Pulse, that is a request with I915_PRIORITY_BARRIER, does not preempt 
> a running normal priority context?
>
> Why does it matter then whether or not heartbeats are enabled - when 
> heartbeat just ends up sending the same engine pulse (eventually, with 
> raising priority)?
The point is that the pulse is pointless. See the rest of my comments 
below, specifically "the context will get resubmitted to the hardware 
after the pulse completes". To re-iterate...

Yes, it preempts the context. Yes, it does so whether heartbeats are 
enabled or not. But so what? Who cares? You have preempted a context. It 
is no longer running on the hardware. BUT IT IS STILL A VALID CONTEXT. 
The backend scheduler will just resubmit it to the hardware as soon as 
the pulse completes. The only reason this works at all is because of the 
horrid hack in the execlist scheduler's back end implementation (in 
__execlists_schedule_in):
         if (unlikely(intel_context_is_closed(ce) &&
                      !intel_engine_has_heartbeat(engine)))
                 intel_context_set_banned(ce);

The actual back end scheduler is saying "Is this a zombie context? Is 
the heartbeat disabled? Then ban it". No other scheduler backend is 
going to have knowledge of zombie context status or of the heartbeat 
status. Nor are they going to call back into the higher levels of the 
i915 driver to trigger a ban operation. Certainly a hardware implemented 
scheduler is not going to be looking at private i915 driver information 
to decide whether to submit a context or whether to tell the OS to kill 
it off instead.

For persistence to work with a hardware scheduler (or a non-Intel 
specific scheduler such as the DRM one), the handling of zombie 
contexts, banning, etc. *must* be done entirely in the front end. It 
cannot rely on any backend hacks. That means you can't rely on any fancy 
behaviour of pulses.

If you want to ban a context then you must explicitly ban that context. 
If you want to ban it at some later point then you need to track it at 
the top level as a zombie and then explicitly ban that zombie at 
whatever later point.


>
>> It's worse than this. If the engine in question is an individual 
>> physical engine then sending a pulse (with sufficiently high 
>> priority) will pre-empt the engine and kick the context off. However, 
>> the GuC 
>
> Why it is different for physical vs virtual, aren't both just 
> schedulable contexts with different engine masks for what GuC is 
> concerned? Oh, is it a matter of needing to send pulses to all engines 
> which comprise a virtual one?
It isn't different. It is totally broken for both. It is potentially 
more broken for virtual engines because of the question of which engine 
to pulse. But as stated above, the pulse is pointless anyway so the 
which engine question doesn't even matter.

John.


>
>> scheduler does not have hacks in it to check the state of the 
>> heartbeat or whether a context is actually a zombie or not. Thus, the 
>> context will get resubmitted to the hardware after the pulse 
>> completes and effectively nothing will have happened.
>>
>> I would assume that the DRM scheduler which we are meant to be 
>> switching to for execlist as well as GuC submission is also unlikely 
>> to have hacks for zombie contexts and tests for whether the i915 
>> specific heartbeat has been disabled since the context became a 
>> zombie. So when that switch happens, this test will also fail in 
>> execlist mode as well as GuC mode.
>>
>> The choices I see here are to simply remove persistence completely 
>> (it is a basically a bug that became UAPI because it wasn't caught 
>> soon enough!) or to implement it in a way that does not require hacks 
>> in the back end scheduler. Apparently, the DRM scheduler is expected 
>> to allow zombie contexts to persist until the DRM file handle is 
>> closed. So presumably we will have to go with option two.
>>
>> That means flagging a context as being a zombie when it is closed but 
>> still active. The driver would then add it to a zombie list owned by 
>> the DRM client object. When that client object is closed, i915 would 
>> go through the list and genuinely kill all the contexts. No back end 
>> scheduler hacks required and no intimate knowledge of the i915 
>> heartbeat mechanism required either.
>>
>> John.
>>
>>
>>>
>>> This patch also updates intel_engine_has_heartbeat to be a vfunc as we
>>> now need to call this function on execlists virtual engines too.
>>>
>>> Signed-off-by: Matthew Brost <matthew.brost@intel.com>
>>> ---
>>>   drivers/gpu/drm/i915/gem/i915_gem_context.c   |  5 +++--
>>>   drivers/gpu/drm/i915/gt/intel_context_types.h |  2 ++
>>>   drivers/gpu/drm/i915/gt/intel_engine.h        | 21 
>>> ++-----------------
>>>   .../drm/i915/gt/intel_execlists_submission.c  | 14 +++++++++++++
>>>   .../gpu/drm/i915/gt/uc/intel_guc_submission.c |  6 +++++-
>>>   .../gpu/drm/i915/gt/uc/intel_guc_submission.h |  2 --
>>>   6 files changed, 26 insertions(+), 24 deletions(-)
>>>
>>> diff --git a/drivers/gpu/drm/i915/gem/i915_gem_context.c 
>>> b/drivers/gpu/drm/i915/gem/i915_gem_context.c
>>> index 9c3672bac0e2..b8e01c5ba9e5 100644
>>> --- a/drivers/gpu/drm/i915/gem/i915_gem_context.c
>>> +++ b/drivers/gpu/drm/i915/gem/i915_gem_context.c
>>> @@ -1090,8 +1090,9 @@ static void kill_engines(struct 
>>> i915_gem_engines *engines, bool ban)
>>>        */
>>>       for_each_gem_engine(ce, engines, it) {
>>>           struct intel_engine_cs *engine;
>>> +        bool local_ban = ban || 
>>> !intel_engine_has_heartbeat(ce->engine);
>
> In any case (pending me understanding what's really going on there), 
> why would this check not be in kill_context with currently does this:
>
>     bool ban = (!i915_gem_context_is_persistent(ctx) ||
>             !ctx->i915->params.enable_hangcheck);
> ...
>         kill_engines(pos, ban);
>
> So whether to ban decision would be consolidated to one place.
>
> In fact, decision on whether to allow persistent is tied to 
> enable_hangcheck, which also drives hearbeat emission. So perhaps one 
> part of the correct fix is to extend the above (kill_context) ban 
> criteria to include hearbeat values anyway. Otherwise isn't it a 
> simple miss that this check fails to account to hearbeat disablement 
> via sysfs?
>
> Regards,
>
> Tvrtko
>
>>> -        if (ban && intel_context_ban(ce, NULL))
>>> +        if (local_ban && intel_context_ban(ce, NULL))
>>>               continue;
>>>           /*
>>> @@ -1104,7 +1105,7 @@ static void kill_engines(struct 
>>> i915_gem_engines *engines, bool ban)
>>>           engine = active_engine(ce);
>>>           /* First attempt to gracefully cancel the context */
>>> -        if (engine && !__cancel_engine(engine) && ban)
>>> +        if (engine && !__cancel_engine(engine) && local_ban)
>>>               /*
>>>                * If we are unable to send a preemptive pulse to bump
>>>                * the context from the GPU, we have to resort to a full
>>> diff --git a/drivers/gpu/drm/i915/gt/intel_context_types.h 
>>> b/drivers/gpu/drm/i915/gt/intel_context_types.h
>>> index e54351a170e2..65f2eb2a78e4 100644
>>> --- a/drivers/gpu/drm/i915/gt/intel_context_types.h
>>> +++ b/drivers/gpu/drm/i915/gt/intel_context_types.h
>>> @@ -55,6 +55,8 @@ struct intel_context_ops {
>>>       void (*reset)(struct intel_context *ce);
>>>       void (*destroy)(struct kref *kref);
>>> +    bool (*has_heartbeat)(const struct intel_engine_cs *engine);
>>> +
>>>       /* virtual engine/context interface */
>>>       struct intel_context *(*create_virtual)(struct intel_engine_cs 
>>> **engine,
>>>                           unsigned int count);
>>> diff --git a/drivers/gpu/drm/i915/gt/intel_engine.h 
>>> b/drivers/gpu/drm/i915/gt/intel_engine.h
>>> index c2a5640ae055..1b11a808acc4 100644
>>> --- a/drivers/gpu/drm/i915/gt/intel_engine.h
>>> +++ b/drivers/gpu/drm/i915/gt/intel_engine.h
>>> @@ -283,28 +283,11 @@ struct intel_context *
>>>   intel_engine_create_virtual(struct intel_engine_cs **siblings,
>>>                   unsigned int count);
>>> -static inline bool
>>> -intel_virtual_engine_has_heartbeat(const struct intel_engine_cs 
>>> *engine)
>>> -{
>>> -    /*
>>> -     * For non-GuC submission we expect the back-end to look at the
>>> -     * heartbeat status of the actual physical engine that the work
>>> -     * has been (or is being) scheduled on, so we should only reach
>>> -     * here with GuC submission enabled.
>>> -     */
>>> -    GEM_BUG_ON(!intel_engine_uses_guc(engine));
>>> -
>>> -    return intel_guc_virtual_engine_has_heartbeat(engine);
>>> -}
>>> -
>>>   static inline bool
>>>   intel_engine_has_heartbeat(const struct intel_engine_cs *engine)
>>>   {
>>> -    if (!IS_ACTIVE(CONFIG_DRM_I915_HEARTBEAT_INTERVAL))
>>> -        return false;
>>> -
>>> -    if (intel_engine_is_virtual(engine))
>>> -        return intel_virtual_engine_has_heartbeat(engine);
>>> +    if (engine->cops->has_heartbeat)
>>> +        return engine->cops->has_heartbeat(engine);
>>>       else
>>>           return READ_ONCE(engine->props.heartbeat_interval_ms);
>>>   }
>>> diff --git a/drivers/gpu/drm/i915/gt/intel_execlists_submission.c 
>>> b/drivers/gpu/drm/i915/gt/intel_execlists_submission.c
>>> index de5f9c86b9a4..18005b5546b6 100644
>>> --- a/drivers/gpu/drm/i915/gt/intel_execlists_submission.c
>>> +++ b/drivers/gpu/drm/i915/gt/intel_execlists_submission.c
>>> @@ -3619,6 +3619,18 @@ virtual_get_sibling(struct intel_engine_cs 
>>> *engine, unsigned int sibling)
>>>       return ve->siblings[sibling];
>>>   }
>>> +static bool virtual_engine_has_heartbeat(const struct 
>>> intel_engine_cs *ve)
>>> +{
>>> +    struct intel_engine_cs *engine;
>>> +    intel_engine_mask_t tmp, mask = ve->mask;
>>> +
>>> +    for_each_engine_masked(engine, ve->gt, mask, tmp)
>>> +        if (READ_ONCE(engine->props.heartbeat_interval_ms))
>>> +            return true;
>>> +
>>> +    return false;
>>> +}
>>> +
>>>   static const struct intel_context_ops virtual_context_ops = {
>>>       .flags = COPS_HAS_INFLIGHT,
>>> @@ -3634,6 +3646,8 @@ static const struct intel_context_ops 
>>> virtual_context_ops = {
>>>       .enter = virtual_context_enter,
>>>       .exit = virtual_context_exit,
>>> +    .has_heartbeat = virtual_engine_has_heartbeat,
>>> +
>>>       .destroy = virtual_context_destroy,
>>>       .get_sibling = virtual_get_sibling,
>>> diff --git a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c 
>>> b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c
>>> index 89ff0e4b4bc7..ae70bff3605f 100644
>>> --- a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c
>>> +++ b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c
>>> @@ -2168,6 +2168,8 @@ static int guc_virtual_context_alloc(struct 
>>> intel_context *ce)
>>>       return lrc_alloc(ce, engine);
>>>   }
>>> +static bool guc_virtual_engine_has_heartbeat(const struct 
>>> intel_engine_cs *ve);
>>> +
>>>   static const struct intel_context_ops virtual_guc_context_ops = {
>>>       .alloc = guc_virtual_context_alloc,
>>> @@ -2183,6 +2185,8 @@ static const struct intel_context_ops 
>>> virtual_guc_context_ops = {
>>>       .enter = guc_virtual_context_enter,
>>>       .exit = guc_virtual_context_exit,
>>> +    .has_heartbeat = guc_virtual_engine_has_heartbeat,
>>> +
>>>       .sched_disable = guc_context_sched_disable,
>>>       .destroy = guc_context_destroy,
>>> @@ -3029,7 +3033,7 @@ guc_create_virtual(struct intel_engine_cs 
>>> **siblings, unsigned int count)
>>>       return ERR_PTR(err);
>>>   }
>>> -bool intel_guc_virtual_engine_has_heartbeat(const struct 
>>> intel_engine_cs *ve)
>>> +static bool guc_virtual_engine_has_heartbeat(const struct 
>>> intel_engine_cs *ve)
>>>   {
>>>       struct intel_engine_cs *engine;
>>>       intel_engine_mask_t tmp, mask = ve->mask;
>>> diff --git a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h 
>>> b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h
>>> index c7ef44fa0c36..c2afc3b88fd8 100644
>>> --- a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h
>>> +++ b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h
>>> @@ -29,8 +29,6 @@ void intel_guc_dump_active_requests(struct 
>>> intel_engine_cs *engine,
>>>                       struct i915_request *hung_rq,
>>>                       struct drm_printer *m);
>>> -bool intel_guc_virtual_engine_has_heartbeat(const struct 
>>> intel_engine_cs *ve);
>>> -
>>>   int intel_guc_wait_for_pending_msg(struct intel_guc *guc,
>>>                      atomic_t *wait_var,
>>>                      bool interruptible,
>>
>> _______________________________________________
>> Intel-gfx mailing list
>> Intel-gfx@lists.freedesktop.org
>> https://lists.freedesktop.org/mailman/listinfo/intel-gfx


^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: [Intel-gfx] [PATCH 1/1] drm/i915: Check if engine has heartbeat when closing a context
  2021-07-30  9:49     ` Tvrtko Ursulin
  2021-07-30 18:13       ` John Harrison
@ 2021-07-30 18:13       ` Matthew Brost
  1 sibling, 0 replies; 16+ messages in thread
From: Matthew Brost @ 2021-07-30 18:13 UTC (permalink / raw)
  To: Tvrtko Ursulin; +Cc: John Harrison, intel-gfx, dri-devel

On Fri, Jul 30, 2021 at 10:49:01AM +0100, Tvrtko Ursulin wrote:
> 
> On 30/07/2021 01:13, John Harrison wrote:
> > On 7/28/2021 17:34, Matthew Brost wrote:
> > > If an engine associated with a context does not have a heartbeat, ban it
> > > immediately. This is needed for GuC submission as a idle pulse doesn't
> > > kick the context off the hardware where it then can check for a
> > > heartbeat and ban the context.
> 
> Pulse, that is a request with I915_PRIORITY_BARRIER, does not preempt a
> running normal priority context?
> 

Yes, in both execlists and GuC submission the contexts gets preempted.
With execlists the i915 see the preempt CSB while with GuC submission
the GUC sees it.

> Why does it matter then whether or not heartbeats are enabled - when
> heartbeat just ends up sending the same engine pulse (eventually, with
> raising priority)?
>

With execlists when the request gets resubmitted, there is check if the
context is closed and the heartbeat is disabled. If this is true, the
context gets banned. See __execlists_schedule_in.

With the Guc sense it owns the CSB / resubmission, the heartbeat /
closed check doesn't exist to ban the context. 

> > It's worse than this. If the engine in question is an individual
> > physical engine then sending a pulse (with sufficiently high priority)
> > will pre-empt the engine and kick the context off. However, the GuC
> 
> Why it is different for physical vs virtual, aren't both just schedulable
> contexts with different engine masks for what GuC is concerned? Oh, is it a
> matter of needing to send pulses to all engines which comprise a virtual
> one?

Yes. The whole idle pulse thing is kinda junk. It really makes an
assumption that the backend is execlists. We likely have a bit more work
here.

> 
> > scheduler does not have hacks in it to check the state of the heartbeat
> > or whether a context is actually a zombie or not. Thus, the context will
> > get resubmitted to the hardware after the pulse completes and
> > effectively nothing will have happened.
> > 
> > I would assume that the DRM scheduler which we are meant to be switching
> > to for execlist as well as GuC submission is also unlikely to have hacks
> > for zombie contexts and tests for whether the i915 specific heartbeat
> > has been disabled since the context became a zombie. So when that switch
> > happens, this test will also fail in execlist mode as well as GuC mode.
> > 
> > The choices I see here are to simply remove persistence completely (it
> > is a basically a bug that became UAPI because it wasn't caught soon
> > enough!) or to implement it in a way that does not require hacks in the
> > back end scheduler. Apparently, the DRM scheduler is expected to allow
> > zombie contexts to persist until the DRM file handle is closed. So
> > presumably we will have to go with option two.
> > 
> > That means flagging a context as being a zombie when it is closed but
> > still active. The driver would then add it to a zombie list owned by the
> > DRM client object. When that client object is closed, i915 would go
> > through the list and genuinely kill all the contexts. No back end
> > scheduler hacks required and no intimate knowledge of the i915 heartbeat
> > mechanism required either.
> > 
> > John.
> > 
> > 
> > > 
> > > This patch also updates intel_engine_has_heartbeat to be a vfunc as we
> > > now need to call this function on execlists virtual engines too.
> > > 
> > > Signed-off-by: Matthew Brost <matthew.brost@intel.com>
> > > ---
> > >   drivers/gpu/drm/i915/gem/i915_gem_context.c   |  5 +++--
> > >   drivers/gpu/drm/i915/gt/intel_context_types.h |  2 ++
> > >   drivers/gpu/drm/i915/gt/intel_engine.h        | 21 ++-----------------
> > >   .../drm/i915/gt/intel_execlists_submission.c  | 14 +++++++++++++
> > >   .../gpu/drm/i915/gt/uc/intel_guc_submission.c |  6 +++++-
> > >   .../gpu/drm/i915/gt/uc/intel_guc_submission.h |  2 --
> > >   6 files changed, 26 insertions(+), 24 deletions(-)
> > > 
> > > diff --git a/drivers/gpu/drm/i915/gem/i915_gem_context.c
> > > b/drivers/gpu/drm/i915/gem/i915_gem_context.c
> > > index 9c3672bac0e2..b8e01c5ba9e5 100644
> > > --- a/drivers/gpu/drm/i915/gem/i915_gem_context.c
> > > +++ b/drivers/gpu/drm/i915/gem/i915_gem_context.c
> > > @@ -1090,8 +1090,9 @@ static void kill_engines(struct
> > > i915_gem_engines *engines, bool ban)
> > >        */
> > >       for_each_gem_engine(ce, engines, it) {
> > >           struct intel_engine_cs *engine;
> > > +        bool local_ban = ban || !intel_engine_has_heartbeat(ce->engine);
> 
> In any case (pending me understanding what's really going on there), why
> would this check not be in kill_context with currently does this:
> 
> 	bool ban = (!i915_gem_context_is_persistent(ctx) ||
> 		    !ctx->i915->params.enable_hangcheck);
> ...

This gem_context level check, while the other check is per
intel_context. We don't have the intel_context here.

> 		kill_engines(pos, ban);
> 
> So whether to ban decision would be consolidated to one place.
> 
> In fact, decision on whether to allow persistent is tied to
> enable_hangcheck, which also drives hearbeat emission. So perhaps one part
> of the correct fix is to extend the above (kill_context) ban criteria to
> include hearbeat values anyway. Otherwise isn't it a simple miss that this
> check fails to account to hearbeat disablement via sysfs?
> 

The execlists has that check in the resubmission path which doesn't
exist for the GuC (explained above). This code just moves this check to
a place where it works with GuC submission.

Matt

> Regards,
> 
> Tvrtko
> 
> > > -        if (ban && intel_context_ban(ce, NULL))
> > > +        if (local_ban && intel_context_ban(ce, NULL))
> > >               continue;
> > >           /*
> > > @@ -1104,7 +1105,7 @@ static void kill_engines(struct
> > > i915_gem_engines *engines, bool ban)
> > >           engine = active_engine(ce);
> > >           /* First attempt to gracefully cancel the context */
> > > -        if (engine && !__cancel_engine(engine) && ban)
> > > +        if (engine && !__cancel_engine(engine) && local_ban)
> > >               /*
> > >                * If we are unable to send a preemptive pulse to bump
> > >                * the context from the GPU, we have to resort to a full
> > > diff --git a/drivers/gpu/drm/i915/gt/intel_context_types.h
> > > b/drivers/gpu/drm/i915/gt/intel_context_types.h
> > > index e54351a170e2..65f2eb2a78e4 100644
> > > --- a/drivers/gpu/drm/i915/gt/intel_context_types.h
> > > +++ b/drivers/gpu/drm/i915/gt/intel_context_types.h
> > > @@ -55,6 +55,8 @@ struct intel_context_ops {
> > >       void (*reset)(struct intel_context *ce);
> > >       void (*destroy)(struct kref *kref);
> > > +    bool (*has_heartbeat)(const struct intel_engine_cs *engine);
> > > +
> > >       /* virtual engine/context interface */
> > >       struct intel_context *(*create_virtual)(struct intel_engine_cs
> > > **engine,
> > >                           unsigned int count);
> > > diff --git a/drivers/gpu/drm/i915/gt/intel_engine.h
> > > b/drivers/gpu/drm/i915/gt/intel_engine.h
> > > index c2a5640ae055..1b11a808acc4 100644
> > > --- a/drivers/gpu/drm/i915/gt/intel_engine.h
> > > +++ b/drivers/gpu/drm/i915/gt/intel_engine.h
> > > @@ -283,28 +283,11 @@ struct intel_context *
> > >   intel_engine_create_virtual(struct intel_engine_cs **siblings,
> > >                   unsigned int count);
> > > -static inline bool
> > > -intel_virtual_engine_has_heartbeat(const struct intel_engine_cs *engine)
> > > -{
> > > -    /*
> > > -     * For non-GuC submission we expect the back-end to look at the
> > > -     * heartbeat status of the actual physical engine that the work
> > > -     * has been (or is being) scheduled on, so we should only reach
> > > -     * here with GuC submission enabled.
> > > -     */
> > > -    GEM_BUG_ON(!intel_engine_uses_guc(engine));
> > > -
> > > -    return intel_guc_virtual_engine_has_heartbeat(engine);
> > > -}
> > > -
> > >   static inline bool
> > >   intel_engine_has_heartbeat(const struct intel_engine_cs *engine)
> > >   {
> > > -    if (!IS_ACTIVE(CONFIG_DRM_I915_HEARTBEAT_INTERVAL))
> > > -        return false;
> > > -
> > > -    if (intel_engine_is_virtual(engine))
> > > -        return intel_virtual_engine_has_heartbeat(engine);
> > > +    if (engine->cops->has_heartbeat)
> > > +        return engine->cops->has_heartbeat(engine);
> > >       else
> > >           return READ_ONCE(engine->props.heartbeat_interval_ms);
> > >   }
> > > diff --git a/drivers/gpu/drm/i915/gt/intel_execlists_submission.c
> > > b/drivers/gpu/drm/i915/gt/intel_execlists_submission.c
> > > index de5f9c86b9a4..18005b5546b6 100644
> > > --- a/drivers/gpu/drm/i915/gt/intel_execlists_submission.c
> > > +++ b/drivers/gpu/drm/i915/gt/intel_execlists_submission.c
> > > @@ -3619,6 +3619,18 @@ virtual_get_sibling(struct intel_engine_cs
> > > *engine, unsigned int sibling)
> > >       return ve->siblings[sibling];
> > >   }
> > > +static bool virtual_engine_has_heartbeat(const struct
> > > intel_engine_cs *ve)
> > > +{
> > > +    struct intel_engine_cs *engine;
> > > +    intel_engine_mask_t tmp, mask = ve->mask;
> > > +
> > > +    for_each_engine_masked(engine, ve->gt, mask, tmp)
> > > +        if (READ_ONCE(engine->props.heartbeat_interval_ms))
> > > +            return true;
> > > +
> > > +    return false;
> > > +}
> > > +
> > >   static const struct intel_context_ops virtual_context_ops = {
> > >       .flags = COPS_HAS_INFLIGHT,
> > > @@ -3634,6 +3646,8 @@ static const struct intel_context_ops
> > > virtual_context_ops = {
> > >       .enter = virtual_context_enter,
> > >       .exit = virtual_context_exit,
> > > +    .has_heartbeat = virtual_engine_has_heartbeat,
> > > +
> > >       .destroy = virtual_context_destroy,
> > >       .get_sibling = virtual_get_sibling,
> > > diff --git a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c
> > > b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c
> > > index 89ff0e4b4bc7..ae70bff3605f 100644
> > > --- a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c
> > > +++ b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c
> > > @@ -2168,6 +2168,8 @@ static int guc_virtual_context_alloc(struct
> > > intel_context *ce)
> > >       return lrc_alloc(ce, engine);
> > >   }
> > > +static bool guc_virtual_engine_has_heartbeat(const struct
> > > intel_engine_cs *ve);
> > > +
> > >   static const struct intel_context_ops virtual_guc_context_ops = {
> > >       .alloc = guc_virtual_context_alloc,
> > > @@ -2183,6 +2185,8 @@ static const struct intel_context_ops
> > > virtual_guc_context_ops = {
> > >       .enter = guc_virtual_context_enter,
> > >       .exit = guc_virtual_context_exit,
> > > +    .has_heartbeat = guc_virtual_engine_has_heartbeat,
> > > +
> > >       .sched_disable = guc_context_sched_disable,
> > >       .destroy = guc_context_destroy,
> > > @@ -3029,7 +3033,7 @@ guc_create_virtual(struct intel_engine_cs
> > > **siblings, unsigned int count)
> > >       return ERR_PTR(err);
> > >   }
> > > -bool intel_guc_virtual_engine_has_heartbeat(const struct
> > > intel_engine_cs *ve)
> > > +static bool guc_virtual_engine_has_heartbeat(const struct
> > > intel_engine_cs *ve)
> > >   {
> > >       struct intel_engine_cs *engine;
> > >       intel_engine_mask_t tmp, mask = ve->mask;
> > > diff --git a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h
> > > b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h
> > > index c7ef44fa0c36..c2afc3b88fd8 100644
> > > --- a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h
> > > +++ b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h
> > > @@ -29,8 +29,6 @@ void intel_guc_dump_active_requests(struct
> > > intel_engine_cs *engine,
> > >                       struct i915_request *hung_rq,
> > >                       struct drm_printer *m);
> > > -bool intel_guc_virtual_engine_has_heartbeat(const struct
> > > intel_engine_cs *ve);
> > > -
> > >   int intel_guc_wait_for_pending_msg(struct intel_guc *guc,
> > >                      atomic_t *wait_var,
> > >                      bool interruptible,
> > 
> > _______________________________________________
> > Intel-gfx mailing list
> > Intel-gfx@lists.freedesktop.org
> > https://lists.freedesktop.org/mailman/listinfo/intel-gfx

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: [Intel-gfx] [PATCH 1/1] drm/i915: Check if engine has heartbeat when closing a context
  2021-07-30 18:13       ` John Harrison
@ 2021-08-02  9:40         ` Tvrtko Ursulin
  2021-08-06 18:00           ` John Harrison
  0 siblings, 1 reply; 16+ messages in thread
From: Tvrtko Ursulin @ 2021-08-02  9:40 UTC (permalink / raw)
  To: John Harrison, Matthew Brost, intel-gfx, dri-devel


On 30/07/2021 19:13, John Harrison wrote:
> On 7/30/2021 02:49, Tvrtko Ursulin wrote:
>> On 30/07/2021 01:13, John Harrison wrote:
>>> On 7/28/2021 17:34, Matthew Brost wrote:
>>>> If an engine associated with a context does not have a heartbeat, 
>>>> ban it
>>>> immediately. This is needed for GuC submission as a idle pulse doesn't
>>>> kick the context off the hardware where it then can check for a
>>>> heartbeat and ban the context.
>>
>> Pulse, that is a request with I915_PRIORITY_BARRIER, does not preempt 
>> a running normal priority context?
>>
>> Why does it matter then whether or not heartbeats are enabled - when 
>> heartbeat just ends up sending the same engine pulse (eventually, with 
>> raising priority)?
> The point is that the pulse is pointless. See the rest of my comments 
> below, specifically "the context will get resubmitted to the hardware 
> after the pulse completes". To re-iterate...
> 
> Yes, it preempts the context. Yes, it does so whether heartbeats are 
> enabled or not. But so what? Who cares? You have preempted a context. It 
> is no longer running on the hardware. BUT IT IS STILL A VALID CONTEXT. 

It is valid yes, and it even may be the current ABI so another question 
is whether it is okay to change that.

> The backend scheduler will just resubmit it to the hardware as soon as 
> the pulse completes. The only reason this works at all is because of the 
> horrid hack in the execlist scheduler's back end implementation (in 
> __execlists_schedule_in):
>          if (unlikely(intel_context_is_closed(ce) &&
>                       !intel_engine_has_heartbeat(engine)))
>                  intel_context_set_banned(ce);

Right, is the above code then needed with this patch - when ban is 
immediately applied on the higher level?

> The actual back end scheduler is saying "Is this a zombie context? Is 
> the heartbeat disabled? Then ban it". No other scheduler backend is 
> going to have knowledge of zombie context status or of the heartbeat 
> status. Nor are they going to call back into the higher levels of the 
> i915 driver to trigger a ban operation. Certainly a hardware implemented 
> scheduler is not going to be looking at private i915 driver information 
> to decide whether to submit a context or whether to tell the OS to kill 
> it off instead.
> 
> For persistence to work with a hardware scheduler (or a non-Intel 
> specific scheduler such as the DRM one), the handling of zombie 
> contexts, banning, etc. *must* be done entirely in the front end. It 
> cannot rely on any backend hacks. That means you can't rely on any fancy 
> behaviour of pulses.
> 
> If you want to ban a context then you must explicitly ban that context. 
> If you want to ban it at some later point then you need to track it at 
> the top level as a zombie and then explicitly ban that zombie at 
> whatever later point.

I am still trying to understand it all. If I go by the commit message:

"""
This is needed for GuC submission as a idle pulse doesn't
kick the context off the hardware where it then can check for a
heartbeat and ban the context.
"""

That did not explain things for me. Sentence does not appear to make 
sense. Now, it seems "kick off the hardware" is meant as revoke and not 
just preempt. Which is fine, perhaps just needs to be written more 
explicitly. But the part of checking for heartbeat after idle pulse does 
not compute for me. It is the heartbeat which emits idle pulses, not 
idle pulse emitting heartbeats.

But anyway, I can buy the handling at the front end story completely. It 
makes sense. We just need to agree that a) it is okay to change the ABI 
and b) remove the backend check from execlists if it is not needed any 
longer.

And if ABI change is okay then commit message needs to talk about it 
loudly and clearly.

Or perhaps there is no ABI change? I am not really clear how does 
setting banned status propagate to the GuC backend. I mean at which 
point does i915 ends up passing that info to the firmware?

Regards,

Tvrtko

> 
> 
>>
>>> It's worse than this. If the engine in question is an individual 
>>> physical engine then sending a pulse (with sufficiently high 
>>> priority) will pre-empt the engine and kick the context off. However, 
>>> the GuC 
>>
>> Why it is different for physical vs virtual, aren't both just 
>> schedulable contexts with different engine masks for what GuC is 
>> concerned? Oh, is it a matter of needing to send pulses to all engines 
>> which comprise a virtual one?
> It isn't different. It is totally broken for both. It is potentially 
> more broken for virtual engines because of the question of which engine 
> to pulse. But as stated above, the pulse is pointless anyway so the 
> which engine question doesn't even matter.
> 
> John.
> 
> 
>>
>>> scheduler does not have hacks in it to check the state of the 
>>> heartbeat or whether a context is actually a zombie or not. Thus, the 
>>> context will get resubmitted to the hardware after the pulse 
>>> completes and effectively nothing will have happened.
>>>
>>> I would assume that the DRM scheduler which we are meant to be 
>>> switching to for execlist as well as GuC submission is also unlikely 
>>> to have hacks for zombie contexts and tests for whether the i915 
>>> specific heartbeat has been disabled since the context became a 
>>> zombie. So when that switch happens, this test will also fail in 
>>> execlist mode as well as GuC mode.
>>>
>>> The choices I see here are to simply remove persistence completely 
>>> (it is a basically a bug that became UAPI because it wasn't caught 
>>> soon enough!) or to implement it in a way that does not require hacks 
>>> in the back end scheduler. Apparently, the DRM scheduler is expected 
>>> to allow zombie contexts to persist until the DRM file handle is 
>>> closed. So presumably we will have to go with option two.
>>>
>>> That means flagging a context as being a zombie when it is closed but 
>>> still active. The driver would then add it to a zombie list owned by 
>>> the DRM client object. When that client object is closed, i915 would 
>>> go through the list and genuinely kill all the contexts. No back end 
>>> scheduler hacks required and no intimate knowledge of the i915 
>>> heartbeat mechanism required either.
>>>
>>> John.
>>>
>>>
>>>>
>>>> This patch also updates intel_engine_has_heartbeat to be a vfunc as we
>>>> now need to call this function on execlists virtual engines too.
>>>>
>>>> Signed-off-by: Matthew Brost <matthew.brost@intel.com>
>>>> ---
>>>>   drivers/gpu/drm/i915/gem/i915_gem_context.c   |  5 +++--
>>>>   drivers/gpu/drm/i915/gt/intel_context_types.h |  2 ++
>>>>   drivers/gpu/drm/i915/gt/intel_engine.h        | 21 
>>>> ++-----------------
>>>>   .../drm/i915/gt/intel_execlists_submission.c  | 14 +++++++++++++
>>>>   .../gpu/drm/i915/gt/uc/intel_guc_submission.c |  6 +++++-
>>>>   .../gpu/drm/i915/gt/uc/intel_guc_submission.h |  2 --
>>>>   6 files changed, 26 insertions(+), 24 deletions(-)
>>>>
>>>> diff --git a/drivers/gpu/drm/i915/gem/i915_gem_context.c 
>>>> b/drivers/gpu/drm/i915/gem/i915_gem_context.c
>>>> index 9c3672bac0e2..b8e01c5ba9e5 100644
>>>> --- a/drivers/gpu/drm/i915/gem/i915_gem_context.c
>>>> +++ b/drivers/gpu/drm/i915/gem/i915_gem_context.c
>>>> @@ -1090,8 +1090,9 @@ static void kill_engines(struct 
>>>> i915_gem_engines *engines, bool ban)
>>>>        */
>>>>       for_each_gem_engine(ce, engines, it) {
>>>>           struct intel_engine_cs *engine;
>>>> +        bool local_ban = ban || 
>>>> !intel_engine_has_heartbeat(ce->engine);
>>
>> In any case (pending me understanding what's really going on there), 
>> why would this check not be in kill_context with currently does this:
>>
>>     bool ban = (!i915_gem_context_is_persistent(ctx) ||
>>             !ctx->i915->params.enable_hangcheck);
>> ...
>>         kill_engines(pos, ban);
>>
>> So whether to ban decision would be consolidated to one place.
>>
>> In fact, decision on whether to allow persistent is tied to 
>> enable_hangcheck, which also drives hearbeat emission. So perhaps one 
>> part of the correct fix is to extend the above (kill_context) ban 
>> criteria to include hearbeat values anyway. Otherwise isn't it a 
>> simple miss that this check fails to account to hearbeat disablement 
>> via sysfs?
>>
>> Regards,
>>
>> Tvrtko
>>
>>>> -        if (ban && intel_context_ban(ce, NULL))
>>>> +        if (local_ban && intel_context_ban(ce, NULL))
>>>>               continue;
>>>>           /*
>>>> @@ -1104,7 +1105,7 @@ static void kill_engines(struct 
>>>> i915_gem_engines *engines, bool ban)
>>>>           engine = active_engine(ce);
>>>>           /* First attempt to gracefully cancel the context */
>>>> -        if (engine && !__cancel_engine(engine) && ban)
>>>> +        if (engine && !__cancel_engine(engine) && local_ban)
>>>>               /*
>>>>                * If we are unable to send a preemptive pulse to bump
>>>>                * the context from the GPU, we have to resort to a full
>>>> diff --git a/drivers/gpu/drm/i915/gt/intel_context_types.h 
>>>> b/drivers/gpu/drm/i915/gt/intel_context_types.h
>>>> index e54351a170e2..65f2eb2a78e4 100644
>>>> --- a/drivers/gpu/drm/i915/gt/intel_context_types.h
>>>> +++ b/drivers/gpu/drm/i915/gt/intel_context_types.h
>>>> @@ -55,6 +55,8 @@ struct intel_context_ops {
>>>>       void (*reset)(struct intel_context *ce);
>>>>       void (*destroy)(struct kref *kref);
>>>> +    bool (*has_heartbeat)(const struct intel_engine_cs *engine);
>>>> +
>>>>       /* virtual engine/context interface */
>>>>       struct intel_context *(*create_virtual)(struct intel_engine_cs 
>>>> **engine,
>>>>                           unsigned int count);
>>>> diff --git a/drivers/gpu/drm/i915/gt/intel_engine.h 
>>>> b/drivers/gpu/drm/i915/gt/intel_engine.h
>>>> index c2a5640ae055..1b11a808acc4 100644
>>>> --- a/drivers/gpu/drm/i915/gt/intel_engine.h
>>>> +++ b/drivers/gpu/drm/i915/gt/intel_engine.h
>>>> @@ -283,28 +283,11 @@ struct intel_context *
>>>>   intel_engine_create_virtual(struct intel_engine_cs **siblings,
>>>>                   unsigned int count);
>>>> -static inline bool
>>>> -intel_virtual_engine_has_heartbeat(const struct intel_engine_cs 
>>>> *engine)
>>>> -{
>>>> -    /*
>>>> -     * For non-GuC submission we expect the back-end to look at the
>>>> -     * heartbeat status of the actual physical engine that the work
>>>> -     * has been (or is being) scheduled on, so we should only reach
>>>> -     * here with GuC submission enabled.
>>>> -     */
>>>> -    GEM_BUG_ON(!intel_engine_uses_guc(engine));
>>>> -
>>>> -    return intel_guc_virtual_engine_has_heartbeat(engine);
>>>> -}
>>>> -
>>>>   static inline bool
>>>>   intel_engine_has_heartbeat(const struct intel_engine_cs *engine)
>>>>   {
>>>> -    if (!IS_ACTIVE(CONFIG_DRM_I915_HEARTBEAT_INTERVAL))
>>>> -        return false;
>>>> -
>>>> -    if (intel_engine_is_virtual(engine))
>>>> -        return intel_virtual_engine_has_heartbeat(engine);
>>>> +    if (engine->cops->has_heartbeat)
>>>> +        return engine->cops->has_heartbeat(engine);
>>>>       else
>>>>           return READ_ONCE(engine->props.heartbeat_interval_ms);
>>>>   }
>>>> diff --git a/drivers/gpu/drm/i915/gt/intel_execlists_submission.c 
>>>> b/drivers/gpu/drm/i915/gt/intel_execlists_submission.c
>>>> index de5f9c86b9a4..18005b5546b6 100644
>>>> --- a/drivers/gpu/drm/i915/gt/intel_execlists_submission.c
>>>> +++ b/drivers/gpu/drm/i915/gt/intel_execlists_submission.c
>>>> @@ -3619,6 +3619,18 @@ virtual_get_sibling(struct intel_engine_cs 
>>>> *engine, unsigned int sibling)
>>>>       return ve->siblings[sibling];
>>>>   }
>>>> +static bool virtual_engine_has_heartbeat(const struct 
>>>> intel_engine_cs *ve)
>>>> +{
>>>> +    struct intel_engine_cs *engine;
>>>> +    intel_engine_mask_t tmp, mask = ve->mask;
>>>> +
>>>> +    for_each_engine_masked(engine, ve->gt, mask, tmp)
>>>> +        if (READ_ONCE(engine->props.heartbeat_interval_ms))
>>>> +            return true;
>>>> +
>>>> +    return false;
>>>> +}
>>>> +
>>>>   static const struct intel_context_ops virtual_context_ops = {
>>>>       .flags = COPS_HAS_INFLIGHT,
>>>> @@ -3634,6 +3646,8 @@ static const struct intel_context_ops 
>>>> virtual_context_ops = {
>>>>       .enter = virtual_context_enter,
>>>>       .exit = virtual_context_exit,
>>>> +    .has_heartbeat = virtual_engine_has_heartbeat,
>>>> +
>>>>       .destroy = virtual_context_destroy,
>>>>       .get_sibling = virtual_get_sibling,
>>>> diff --git a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c 
>>>> b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c
>>>> index 89ff0e4b4bc7..ae70bff3605f 100644
>>>> --- a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c
>>>> +++ b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c
>>>> @@ -2168,6 +2168,8 @@ static int guc_virtual_context_alloc(struct 
>>>> intel_context *ce)
>>>>       return lrc_alloc(ce, engine);
>>>>   }
>>>> +static bool guc_virtual_engine_has_heartbeat(const struct 
>>>> intel_engine_cs *ve);
>>>> +
>>>>   static const struct intel_context_ops virtual_guc_context_ops = {
>>>>       .alloc = guc_virtual_context_alloc,
>>>> @@ -2183,6 +2185,8 @@ static const struct intel_context_ops 
>>>> virtual_guc_context_ops = {
>>>>       .enter = guc_virtual_context_enter,
>>>>       .exit = guc_virtual_context_exit,
>>>> +    .has_heartbeat = guc_virtual_engine_has_heartbeat,
>>>> +
>>>>       .sched_disable = guc_context_sched_disable,
>>>>       .destroy = guc_context_destroy,
>>>> @@ -3029,7 +3033,7 @@ guc_create_virtual(struct intel_engine_cs 
>>>> **siblings, unsigned int count)
>>>>       return ERR_PTR(err);
>>>>   }
>>>> -bool intel_guc_virtual_engine_has_heartbeat(const struct 
>>>> intel_engine_cs *ve)
>>>> +static bool guc_virtual_engine_has_heartbeat(const struct 
>>>> intel_engine_cs *ve)
>>>>   {
>>>>       struct intel_engine_cs *engine;
>>>>       intel_engine_mask_t tmp, mask = ve->mask;
>>>> diff --git a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h 
>>>> b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h
>>>> index c7ef44fa0c36..c2afc3b88fd8 100644
>>>> --- a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h
>>>> +++ b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h
>>>> @@ -29,8 +29,6 @@ void intel_guc_dump_active_requests(struct 
>>>> intel_engine_cs *engine,
>>>>                       struct i915_request *hung_rq,
>>>>                       struct drm_printer *m);
>>>> -bool intel_guc_virtual_engine_has_heartbeat(const struct 
>>>> intel_engine_cs *ve);
>>>> -
>>>>   int intel_guc_wait_for_pending_msg(struct intel_guc *guc,
>>>>                      atomic_t *wait_var,
>>>>                      bool interruptible,
>>>
>>> _______________________________________________
>>> Intel-gfx mailing list
>>> Intel-gfx@lists.freedesktop.org
>>> https://lists.freedesktop.org/mailman/listinfo/intel-gfx
> 

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: [Intel-gfx] [PATCH 1/1] drm/i915: Check if engine has heartbeat when closing a context
  2021-08-02  9:40         ` Tvrtko Ursulin
@ 2021-08-06 18:00           ` John Harrison
  2021-08-06 19:46             ` Daniel Vetter
  0 siblings, 1 reply; 16+ messages in thread
From: John Harrison @ 2021-08-06 18:00 UTC (permalink / raw)
  To: Tvrtko Ursulin, Matthew Brost, intel-gfx, dri-devel

On 8/2/2021 02:40, Tvrtko Ursulin wrote:
> On 30/07/2021 19:13, John Harrison wrote:
>> On 7/30/2021 02:49, Tvrtko Ursulin wrote:
>>> On 30/07/2021 01:13, John Harrison wrote:
>>>> On 7/28/2021 17:34, Matthew Brost wrote:
>>>>> If an engine associated with a context does not have a heartbeat, 
>>>>> ban it
>>>>> immediately. This is needed for GuC submission as a idle pulse 
>>>>> doesn't
>>>>> kick the context off the hardware where it then can check for a
>>>>> heartbeat and ban the context.
>>>
>>> Pulse, that is a request with I915_PRIORITY_BARRIER, does not 
>>> preempt a running normal priority context?
>>>
>>> Why does it matter then whether or not heartbeats are enabled - when 
>>> heartbeat just ends up sending the same engine pulse (eventually, 
>>> with raising priority)?
>> The point is that the pulse is pointless. See the rest of my comments 
>> below, specifically "the context will get resubmitted to the hardware 
>> after the pulse completes". To re-iterate...
>>
>> Yes, it preempts the context. Yes, it does so whether heartbeats are 
>> enabled or not. But so what? Who cares? You have preempted a context. 
>> It is no longer running on the hardware. BUT IT IS STILL A VALID 
>> CONTEXT. 
>
> It is valid yes, and it even may be the current ABI so another 
> question is whether it is okay to change that.
>
>> The backend scheduler will just resubmit it to the hardware as soon 
>> as the pulse completes. The only reason this works at all is because 
>> of the horrid hack in the execlist scheduler's back end 
>> implementation (in __execlists_schedule_in):
>>          if (unlikely(intel_context_is_closed(ce) &&
>>                       !intel_engine_has_heartbeat(engine)))
>>                  intel_context_set_banned(ce);
>
> Right, is the above code then needed with this patch - when ban is 
> immediately applied on the higher level?
>
>> The actual back end scheduler is saying "Is this a zombie context? Is 
>> the heartbeat disabled? Then ban it". No other scheduler backend is 
>> going to have knowledge of zombie context status or of the heartbeat 
>> status. Nor are they going to call back into the higher levels of the 
>> i915 driver to trigger a ban operation. Certainly a hardware 
>> implemented scheduler is not going to be looking at private i915 
>> driver information to decide whether to submit a context or whether 
>> to tell the OS to kill it off instead.
>>
>> For persistence to work with a hardware scheduler (or a non-Intel 
>> specific scheduler such as the DRM one), the handling of zombie 
>> contexts, banning, etc. *must* be done entirely in the front end. It 
>> cannot rely on any backend hacks. That means you can't rely on any 
>> fancy behaviour of pulses.
>>
>> If you want to ban a context then you must explicitly ban that 
>> context. If you want to ban it at some later point then you need to 
>> track it at the top level as a zombie and then explicitly ban that 
>> zombie at whatever later point.
>
> I am still trying to understand it all. If I go by the commit message:
>
> """
> This is needed for GuC submission as a idle pulse doesn't
> kick the context off the hardware where it then can check for a
> heartbeat and ban the context.
> """
>
> That did not explain things for me. Sentence does not appear to make 
> sense. Now, it seems "kick off the hardware" is meant as revoke and 
> not just preempt. Which is fine, perhaps just needs to be written more 
> explicitly. But the part of checking for heartbeat after idle pulse 
> does not compute for me. It is the heartbeat which emits idle pulses, 
> not idle pulse emitting heartbeats.
I am in agreement that the commit message is confusing and does not 
explain either the problem or the solution.


>
>
> But anyway, I can buy the handling at the front end story completely. 
> It makes sense. We just need to agree that a) it is okay to change the 
> ABI and b) remove the backend check from execlists if it is not needed 
> any longer.
>
> And if ABI change is okay then commit message needs to talk about it 
> loudly and clearly.
I don't think we have a choice. The current ABI is not and cannot ever 
be compatible with any scheduler external to i915. It cannot be 
implemented with a hardware scheduler such as the GuC and it cannot be 
implemented with an external software scheduler such as the DRM one.

My view is that any implementation involving knowledge of the heartbeat 
is fundamentally broken.

According to Daniel Vetter, the DRM ABI on this subject is that an 
actively executing context should persist until the DRM file handle is 
closed. That seems like a much more plausible and simple ABI than one 
that says 'if the heartbeat is running then a context will persist 
forever, if the heartbeat is not running then it will be killed 
immediately, if the heart was running but then stops running then the 
context will be killed on the next context switch, ...'. And if I 
understand it correctly, the current ABI allows a badly written user app 
to cause a denial of service by leaving contexts permanently running an 
infinite loop on the hardware even after the app has been killed! How 
can that ever be considered a good idea?

Therefore, the context close implementation should be to add an active 
context to a zombie list. If a context is in zombie state and its last 
request completes then the context can be immediately killed at that 
point. Otherwise, on DRM handle close, we go through the zombie list and 
immediately kill all contexts.

Simple, clean, no back-end scheduler hacks, no reliance on heartbeats or 
pulses. Also no opportunity for rogue (or just badly written) user 
processes to leave zombie contexts running on the hardware forever and 
causing a denial of service attack. If the host process is killed, all 
of its GPU processes are also killed irrespective of what dodgy context 
flags they may or may not have set.

John.


>
> Or perhaps there is no ABI change? I am not really clear how does 
> setting banned status propagate to the GuC backend. I mean at which 
> point does i915 ends up passing that info to the firmware?
>
> Regards,
>
> Tvrtko
>
>>
>>
>>>
>>>> It's worse than this. If the engine in question is an individual 
>>>> physical engine then sending a pulse (with sufficiently high 
>>>> priority) will pre-empt the engine and kick the context off. 
>>>> However, the GuC 
>>>
>>> Why it is different for physical vs virtual, aren't both just 
>>> schedulable contexts with different engine masks for what GuC is 
>>> concerned? Oh, is it a matter of needing to send pulses to all 
>>> engines which comprise a virtual one?
>> It isn't different. It is totally broken for both. It is potentially 
>> more broken for virtual engines because of the question of which 
>> engine to pulse. But as stated above, the pulse is pointless anyway 
>> so the which engine question doesn't even matter.
>>
>> John.
>>
>>
>>>
>>>> scheduler does not have hacks in it to check the state of the 
>>>> heartbeat or whether a context is actually a zombie or not. Thus, 
>>>> the context will get resubmitted to the hardware after the pulse 
>>>> completes and effectively nothing will have happened.
>>>>
>>>> I would assume that the DRM scheduler which we are meant to be 
>>>> switching to for execlist as well as GuC submission is also 
>>>> unlikely to have hacks for zombie contexts and tests for whether 
>>>> the i915 specific heartbeat has been disabled since the context 
>>>> became a zombie. So when that switch happens, this test will also 
>>>> fail in execlist mode as well as GuC mode.
>>>>
>>>> The choices I see here are to simply remove persistence completely 
>>>> (it is a basically a bug that became UAPI because it wasn't caught 
>>>> soon enough!) or to implement it in a way that does not require 
>>>> hacks in the back end scheduler. Apparently, the DRM scheduler is 
>>>> expected to allow zombie contexts to persist until the DRM file 
>>>> handle is closed. So presumably we will have to go with option two.
>>>>
>>>> That means flagging a context as being a zombie when it is closed 
>>>> but still active. The driver would then add it to a zombie list 
>>>> owned by the DRM client object. When that client object is closed, 
>>>> i915 would go through the list and genuinely kill all the contexts. 
>>>> No back end scheduler hacks required and no intimate knowledge of 
>>>> the i915 heartbeat mechanism required either.
>>>>
>>>> John.
>>>>
>>>>
>>>>>
>>>>> This patch also updates intel_engine_has_heartbeat to be a vfunc 
>>>>> as we
>>>>> now need to call this function on execlists virtual engines too.
>>>>>
>>>>> Signed-off-by: Matthew Brost <matthew.brost@intel.com>
>>>>> ---
>>>>>   drivers/gpu/drm/i915/gem/i915_gem_context.c   |  5 +++--
>>>>>   drivers/gpu/drm/i915/gt/intel_context_types.h |  2 ++
>>>>>   drivers/gpu/drm/i915/gt/intel_engine.h        | 21 
>>>>> ++-----------------
>>>>>   .../drm/i915/gt/intel_execlists_submission.c  | 14 +++++++++++++
>>>>>   .../gpu/drm/i915/gt/uc/intel_guc_submission.c |  6 +++++-
>>>>>   .../gpu/drm/i915/gt/uc/intel_guc_submission.h |  2 --
>>>>>   6 files changed, 26 insertions(+), 24 deletions(-)
>>>>>
>>>>> diff --git a/drivers/gpu/drm/i915/gem/i915_gem_context.c 
>>>>> b/drivers/gpu/drm/i915/gem/i915_gem_context.c
>>>>> index 9c3672bac0e2..b8e01c5ba9e5 100644
>>>>> --- a/drivers/gpu/drm/i915/gem/i915_gem_context.c
>>>>> +++ b/drivers/gpu/drm/i915/gem/i915_gem_context.c
>>>>> @@ -1090,8 +1090,9 @@ static void kill_engines(struct 
>>>>> i915_gem_engines *engines, bool ban)
>>>>>        */
>>>>>       for_each_gem_engine(ce, engines, it) {
>>>>>           struct intel_engine_cs *engine;
>>>>> +        bool local_ban = ban || 
>>>>> !intel_engine_has_heartbeat(ce->engine);
>>>
>>> In any case (pending me understanding what's really going on there), 
>>> why would this check not be in kill_context with currently does this:
>>>
>>>     bool ban = (!i915_gem_context_is_persistent(ctx) ||
>>>             !ctx->i915->params.enable_hangcheck);
>>> ...
>>>         kill_engines(pos, ban);
>>>
>>> So whether to ban decision would be consolidated to one place.
>>>
>>> In fact, decision on whether to allow persistent is tied to 
>>> enable_hangcheck, which also drives hearbeat emission. So perhaps 
>>> one part of the correct fix is to extend the above (kill_context) 
>>> ban criteria to include hearbeat values anyway. Otherwise isn't it a 
>>> simple miss that this check fails to account to hearbeat disablement 
>>> via sysfs?
>>>
>>> Regards,
>>>
>>> Tvrtko
>>>
>>>>> -        if (ban && intel_context_ban(ce, NULL))
>>>>> +        if (local_ban && intel_context_ban(ce, NULL))
>>>>>               continue;
>>>>>           /*
>>>>> @@ -1104,7 +1105,7 @@ static void kill_engines(struct 
>>>>> i915_gem_engines *engines, bool ban)
>>>>>           engine = active_engine(ce);
>>>>>           /* First attempt to gracefully cancel the context */
>>>>> -        if (engine && !__cancel_engine(engine) && ban)
>>>>> +        if (engine && !__cancel_engine(engine) && local_ban)
>>>>>               /*
>>>>>                * If we are unable to send a preemptive pulse to bump
>>>>>                * the context from the GPU, we have to resort to a 
>>>>> full
>>>>> diff --git a/drivers/gpu/drm/i915/gt/intel_context_types.h 
>>>>> b/drivers/gpu/drm/i915/gt/intel_context_types.h
>>>>> index e54351a170e2..65f2eb2a78e4 100644
>>>>> --- a/drivers/gpu/drm/i915/gt/intel_context_types.h
>>>>> +++ b/drivers/gpu/drm/i915/gt/intel_context_types.h
>>>>> @@ -55,6 +55,8 @@ struct intel_context_ops {
>>>>>       void (*reset)(struct intel_context *ce);
>>>>>       void (*destroy)(struct kref *kref);
>>>>> +    bool (*has_heartbeat)(const struct intel_engine_cs *engine);
>>>>> +
>>>>>       /* virtual engine/context interface */
>>>>>       struct intel_context *(*create_virtual)(struct 
>>>>> intel_engine_cs **engine,
>>>>>                           unsigned int count);
>>>>> diff --git a/drivers/gpu/drm/i915/gt/intel_engine.h 
>>>>> b/drivers/gpu/drm/i915/gt/intel_engine.h
>>>>> index c2a5640ae055..1b11a808acc4 100644
>>>>> --- a/drivers/gpu/drm/i915/gt/intel_engine.h
>>>>> +++ b/drivers/gpu/drm/i915/gt/intel_engine.h
>>>>> @@ -283,28 +283,11 @@ struct intel_context *
>>>>>   intel_engine_create_virtual(struct intel_engine_cs **siblings,
>>>>>                   unsigned int count);
>>>>> -static inline bool
>>>>> -intel_virtual_engine_has_heartbeat(const struct intel_engine_cs 
>>>>> *engine)
>>>>> -{
>>>>> -    /*
>>>>> -     * For non-GuC submission we expect the back-end to look at the
>>>>> -     * heartbeat status of the actual physical engine that the work
>>>>> -     * has been (or is being) scheduled on, so we should only reach
>>>>> -     * here with GuC submission enabled.
>>>>> -     */
>>>>> -    GEM_BUG_ON(!intel_engine_uses_guc(engine));
>>>>> -
>>>>> -    return intel_guc_virtual_engine_has_heartbeat(engine);
>>>>> -}
>>>>> -
>>>>>   static inline bool
>>>>>   intel_engine_has_heartbeat(const struct intel_engine_cs *engine)
>>>>>   {
>>>>> -    if (!IS_ACTIVE(CONFIG_DRM_I915_HEARTBEAT_INTERVAL))
>>>>> -        return false;
>>>>> -
>>>>> -    if (intel_engine_is_virtual(engine))
>>>>> -        return intel_virtual_engine_has_heartbeat(engine);
>>>>> +    if (engine->cops->has_heartbeat)
>>>>> +        return engine->cops->has_heartbeat(engine);
>>>>>       else
>>>>>           return READ_ONCE(engine->props.heartbeat_interval_ms);
>>>>>   }
>>>>> diff --git a/drivers/gpu/drm/i915/gt/intel_execlists_submission.c 
>>>>> b/drivers/gpu/drm/i915/gt/intel_execlists_submission.c
>>>>> index de5f9c86b9a4..18005b5546b6 100644
>>>>> --- a/drivers/gpu/drm/i915/gt/intel_execlists_submission.c
>>>>> +++ b/drivers/gpu/drm/i915/gt/intel_execlists_submission.c
>>>>> @@ -3619,6 +3619,18 @@ virtual_get_sibling(struct intel_engine_cs 
>>>>> *engine, unsigned int sibling)
>>>>>       return ve->siblings[sibling];
>>>>>   }
>>>>> +static bool virtual_engine_has_heartbeat(const struct 
>>>>> intel_engine_cs *ve)
>>>>> +{
>>>>> +    struct intel_engine_cs *engine;
>>>>> +    intel_engine_mask_t tmp, mask = ve->mask;
>>>>> +
>>>>> +    for_each_engine_masked(engine, ve->gt, mask, tmp)
>>>>> +        if (READ_ONCE(engine->props.heartbeat_interval_ms))
>>>>> +            return true;
>>>>> +
>>>>> +    return false;
>>>>> +}
>>>>> +
>>>>>   static const struct intel_context_ops virtual_context_ops = {
>>>>>       .flags = COPS_HAS_INFLIGHT,
>>>>> @@ -3634,6 +3646,8 @@ static const struct intel_context_ops 
>>>>> virtual_context_ops = {
>>>>>       .enter = virtual_context_enter,
>>>>>       .exit = virtual_context_exit,
>>>>> +    .has_heartbeat = virtual_engine_has_heartbeat,
>>>>> +
>>>>>       .destroy = virtual_context_destroy,
>>>>>       .get_sibling = virtual_get_sibling,
>>>>> diff --git a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c 
>>>>> b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c
>>>>> index 89ff0e4b4bc7..ae70bff3605f 100644
>>>>> --- a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c
>>>>> +++ b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c
>>>>> @@ -2168,6 +2168,8 @@ static int guc_virtual_context_alloc(struct 
>>>>> intel_context *ce)
>>>>>       return lrc_alloc(ce, engine);
>>>>>   }
>>>>> +static bool guc_virtual_engine_has_heartbeat(const struct 
>>>>> intel_engine_cs *ve);
>>>>> +
>>>>>   static const struct intel_context_ops virtual_guc_context_ops = {
>>>>>       .alloc = guc_virtual_context_alloc,
>>>>> @@ -2183,6 +2185,8 @@ static const struct intel_context_ops 
>>>>> virtual_guc_context_ops = {
>>>>>       .enter = guc_virtual_context_enter,
>>>>>       .exit = guc_virtual_context_exit,
>>>>> +    .has_heartbeat = guc_virtual_engine_has_heartbeat,
>>>>> +
>>>>>       .sched_disable = guc_context_sched_disable,
>>>>>       .destroy = guc_context_destroy,
>>>>> @@ -3029,7 +3033,7 @@ guc_create_virtual(struct intel_engine_cs 
>>>>> **siblings, unsigned int count)
>>>>>       return ERR_PTR(err);
>>>>>   }
>>>>> -bool intel_guc_virtual_engine_has_heartbeat(const struct 
>>>>> intel_engine_cs *ve)
>>>>> +static bool guc_virtual_engine_has_heartbeat(const struct 
>>>>> intel_engine_cs *ve)
>>>>>   {
>>>>>       struct intel_engine_cs *engine;
>>>>>       intel_engine_mask_t tmp, mask = ve->mask;
>>>>> diff --git a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h 
>>>>> b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h
>>>>> index c7ef44fa0c36..c2afc3b88fd8 100644
>>>>> --- a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h
>>>>> +++ b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h
>>>>> @@ -29,8 +29,6 @@ void intel_guc_dump_active_requests(struct 
>>>>> intel_engine_cs *engine,
>>>>>                       struct i915_request *hung_rq,
>>>>>                       struct drm_printer *m);
>>>>> -bool intel_guc_virtual_engine_has_heartbeat(const struct 
>>>>> intel_engine_cs *ve);
>>>>> -
>>>>>   int intel_guc_wait_for_pending_msg(struct intel_guc *guc,
>>>>>                      atomic_t *wait_var,
>>>>>                      bool interruptible,
>>>>
>>>> _______________________________________________
>>>> Intel-gfx mailing list
>>>> Intel-gfx@lists.freedesktop.org
>>>> https://lists.freedesktop.org/mailman/listinfo/intel-gfx
>>


^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: [Intel-gfx] [PATCH 1/1] drm/i915: Check if engine has heartbeat when closing a context
  2021-08-06 18:00           ` John Harrison
@ 2021-08-06 19:46             ` Daniel Vetter
  2021-08-09 23:12               ` John Harrison
  0 siblings, 1 reply; 16+ messages in thread
From: Daniel Vetter @ 2021-08-06 19:46 UTC (permalink / raw)
  To: John Harrison; +Cc: Tvrtko Ursulin, Matthew Brost, intel-gfx, dri-devel

Seen this fly by and figured I dropped a few thoughts in here. At the
likely cost of looking a bit out of whack :-)

On Fri, Aug 6, 2021 at 8:01 PM John Harrison <john.c.harrison@intel.com> wrote:
> On 8/2/2021 02:40, Tvrtko Ursulin wrote:
> > On 30/07/2021 19:13, John Harrison wrote:
> >> On 7/30/2021 02:49, Tvrtko Ursulin wrote:
> >>> On 30/07/2021 01:13, John Harrison wrote:
> >>>> On 7/28/2021 17:34, Matthew Brost wrote:
> >>>>> If an engine associated with a context does not have a heartbeat,
> >>>>> ban it
> >>>>> immediately. This is needed for GuC submission as a idle pulse
> >>>>> doesn't
> >>>>> kick the context off the hardware where it then can check for a
> >>>>> heartbeat and ban the context.
> >>>
> >>> Pulse, that is a request with I915_PRIORITY_BARRIER, does not
> >>> preempt a running normal priority context?
> >>>
> >>> Why does it matter then whether or not heartbeats are enabled - when
> >>> heartbeat just ends up sending the same engine pulse (eventually,
> >>> with raising priority)?
> >> The point is that the pulse is pointless. See the rest of my comments
> >> below, specifically "the context will get resubmitted to the hardware
> >> after the pulse completes". To re-iterate...
> >>
> >> Yes, it preempts the context. Yes, it does so whether heartbeats are
> >> enabled or not. But so what? Who cares? You have preempted a context.
> >> It is no longer running on the hardware. BUT IT IS STILL A VALID
> >> CONTEXT.
> >
> > It is valid yes, and it even may be the current ABI so another
> > question is whether it is okay to change that.
> >
> >> The backend scheduler will just resubmit it to the hardware as soon
> >> as the pulse completes. The only reason this works at all is because
> >> of the horrid hack in the execlist scheduler's back end
> >> implementation (in __execlists_schedule_in):
> >>          if (unlikely(intel_context_is_closed(ce) &&
> >>                       !intel_engine_has_heartbeat(engine)))
> >>                  intel_context_set_banned(ce);
> >
> > Right, is the above code then needed with this patch - when ban is
> > immediately applied on the higher level?
> >
> >> The actual back end scheduler is saying "Is this a zombie context? Is
> >> the heartbeat disabled? Then ban it". No other scheduler backend is
> >> going to have knowledge of zombie context status or of the heartbeat
> >> status. Nor are they going to call back into the higher levels of the
> >> i915 driver to trigger a ban operation. Certainly a hardware
> >> implemented scheduler is not going to be looking at private i915
> >> driver information to decide whether to submit a context or whether
> >> to tell the OS to kill it off instead.
> >>
> >> For persistence to work with a hardware scheduler (or a non-Intel
> >> specific scheduler such as the DRM one), the handling of zombie
> >> contexts, banning, etc. *must* be done entirely in the front end. It
> >> cannot rely on any backend hacks. That means you can't rely on any
> >> fancy behaviour of pulses.
> >>
> >> If you want to ban a context then you must explicitly ban that
> >> context. If you want to ban it at some later point then you need to
> >> track it at the top level as a zombie and then explicitly ban that
> >> zombie at whatever later point.
> >
> > I am still trying to understand it all. If I go by the commit message:
> >
> > """
> > This is needed for GuC submission as a idle pulse doesn't
> > kick the context off the hardware where it then can check for a
> > heartbeat and ban the context.
> > """
> >
> > That did not explain things for me. Sentence does not appear to make
> > sense. Now, it seems "kick off the hardware" is meant as revoke and
> > not just preempt. Which is fine, perhaps just needs to be written more
> > explicitly. But the part of checking for heartbeat after idle pulse
> > does not compute for me. It is the heartbeat which emits idle pulses,
> > not idle pulse emitting heartbeats.
> I am in agreement that the commit message is confusing and does not
> explain either the problem or the solution.
>
>
> >
> >
> > But anyway, I can buy the handling at the front end story completely.
> > It makes sense. We just need to agree that a) it is okay to change the
> > ABI and b) remove the backend check from execlists if it is not needed
> > any longer.
> >
> > And if ABI change is okay then commit message needs to talk about it
> > loudly and clearly.
> I don't think we have a choice. The current ABI is not and cannot ever
> be compatible with any scheduler external to i915. It cannot be
> implemented with a hardware scheduler such as the GuC and it cannot be
> implemented with an external software scheduler such as the DRM one.

So generally on linux we implement helper libraries, which means
massive flexibility everywhere.

https://blog.ffwll.ch/2016/12/midlayers-once-more-with-feeling.html

So it shouldn't be an insurmountable problem to make this happen even
with drm/scheduler, we can patch it up.

Whether that's justified is another question.

> My view is that any implementation involving knowledge of the heartbeat
> is fundamentally broken.
>
> According to Daniel Vetter, the DRM ABI on this subject is that an
> actively executing context should persist until the DRM file handle is
> closed. That seems like a much more plausible and simple ABI than one

DRM ABI is maybe a bit an overkill statement. It's more "what other
drivers do", but it's generally a good idea to not ignore that :-)

> that says 'if the heartbeat is running then a context will persist
> forever, if the heartbeat is not running then it will be killed
> immediately, if the heart was running but then stops running then the
> context will be killed on the next context switch, ...'. And if I
> understand it correctly, the current ABI allows a badly written user app
> to cause a denial of service by leaving contexts permanently running an
> infinite loop on the hardware even after the app has been killed! How
> can that ever be considered a good idea?

We're not going to support changing all these settings at runtime.
There's just not point in trying to make that work race-free, it
either adds complexity to the code for no reason, or it adds overhead
to the code for no reason.

Yes I know existing customers and all that, but
- they can change this stuff, and when they change it while anyting is
in-flight they get to keep the pieces. These options taint the kernel
for a reason (and if they don't, that should be fixed)
- quite a few around heartbeat and compute support as we've merged a
while ago hang by design when trying to smash them into drm rules.
We're not going to fix that, and we should not use any existing such
assumptions as justification for code changes.

Wrt infinitely running: Right now nothing is allowed to run forever,
because hangcheck will step in and kill that job. Once we add compute
mode ctx flag we'll require killing on process exit to stop escape.

> Therefore, the context close implementation should be to add an active
> context to a zombie list. If a context is in zombie state and its last
> request completes then the context can be immediately killed at that
> point. Otherwise, on DRM handle close, we go through the zombie list and
> immediately kill all contexts.
>
> Simple, clean, no back-end scheduler hacks, no reliance on heartbeats or
> pulses. Also no opportunity for rogue (or just badly written) user
> processes to leave zombie contexts running on the hardware forever and
> causing a denial of service attack. If the host process is killed, all
> of its GPU processes are also killed irrespective of what dodgy context
> flags they may or may not have set.

Uh, the intel_context state machine is already a bit too complex, and
the implementation lacks a bunch of barriers at least from the cursor
look I've given it thus far.

So if we really need to make that more complex with more states then I
think someone needs to come up with an actual clean design, with
proper state transitions and all the barriers (or really, a design
which doesn't need barriers). This is going to be work.
-Daniel

>
> John.
>
>
> >
> > Or perhaps there is no ABI change? I am not really clear how does
> > setting banned status propagate to the GuC backend. I mean at which
> > point does i915 ends up passing that info to the firmware?
> >
> > Regards,
> >
> > Tvrtko
> >
> >>
> >>
> >>>
> >>>> It's worse than this. If the engine in question is an individual
> >>>> physical engine then sending a pulse (with sufficiently high
> >>>> priority) will pre-empt the engine and kick the context off.
> >>>> However, the GuC
> >>>
> >>> Why it is different for physical vs virtual, aren't both just
> >>> schedulable contexts with different engine masks for what GuC is
> >>> concerned? Oh, is it a matter of needing to send pulses to all
> >>> engines which comprise a virtual one?
> >> It isn't different. It is totally broken for both. It is potentially
> >> more broken for virtual engines because of the question of which
> >> engine to pulse. But as stated above, the pulse is pointless anyway
> >> so the which engine question doesn't even matter.
> >>
> >> John.
> >>
> >>
> >>>
> >>>> scheduler does not have hacks in it to check the state of the
> >>>> heartbeat or whether a context is actually a zombie or not. Thus,
> >>>> the context will get resubmitted to the hardware after the pulse
> >>>> completes and effectively nothing will have happened.
> >>>>
> >>>> I would assume that the DRM scheduler which we are meant to be
> >>>> switching to for execlist as well as GuC submission is also
> >>>> unlikely to have hacks for zombie contexts and tests for whether
> >>>> the i915 specific heartbeat has been disabled since the context
> >>>> became a zombie. So when that switch happens, this test will also
> >>>> fail in execlist mode as well as GuC mode.
> >>>>
> >>>> The choices I see here are to simply remove persistence completely
> >>>> (it is a basically a bug that became UAPI because it wasn't caught
> >>>> soon enough!) or to implement it in a way that does not require
> >>>> hacks in the back end scheduler. Apparently, the DRM scheduler is
> >>>> expected to allow zombie contexts to persist until the DRM file
> >>>> handle is closed. So presumably we will have to go with option two.
> >>>>
> >>>> That means flagging a context as being a zombie when it is closed
> >>>> but still active. The driver would then add it to a zombie list
> >>>> owned by the DRM client object. When that client object is closed,
> >>>> i915 would go through the list and genuinely kill all the contexts.
> >>>> No back end scheduler hacks required and no intimate knowledge of
> >>>> the i915 heartbeat mechanism required either.
> >>>>
> >>>> John.
> >>>>
> >>>>
> >>>>>
> >>>>> This patch also updates intel_engine_has_heartbeat to be a vfunc
> >>>>> as we
> >>>>> now need to call this function on execlists virtual engines too.
> >>>>>
> >>>>> Signed-off-by: Matthew Brost <matthew.brost@intel.com>
> >>>>> ---
> >>>>>   drivers/gpu/drm/i915/gem/i915_gem_context.c   |  5 +++--
> >>>>>   drivers/gpu/drm/i915/gt/intel_context_types.h |  2 ++
> >>>>>   drivers/gpu/drm/i915/gt/intel_engine.h        | 21
> >>>>> ++-----------------
> >>>>>   .../drm/i915/gt/intel_execlists_submission.c  | 14 +++++++++++++
> >>>>>   .../gpu/drm/i915/gt/uc/intel_guc_submission.c |  6 +++++-
> >>>>>   .../gpu/drm/i915/gt/uc/intel_guc_submission.h |  2 --
> >>>>>   6 files changed, 26 insertions(+), 24 deletions(-)
> >>>>>
> >>>>> diff --git a/drivers/gpu/drm/i915/gem/i915_gem_context.c
> >>>>> b/drivers/gpu/drm/i915/gem/i915_gem_context.c
> >>>>> index 9c3672bac0e2..b8e01c5ba9e5 100644
> >>>>> --- a/drivers/gpu/drm/i915/gem/i915_gem_context.c
> >>>>> +++ b/drivers/gpu/drm/i915/gem/i915_gem_context.c
> >>>>> @@ -1090,8 +1090,9 @@ static void kill_engines(struct
> >>>>> i915_gem_engines *engines, bool ban)
> >>>>>        */
> >>>>>       for_each_gem_engine(ce, engines, it) {
> >>>>>           struct intel_engine_cs *engine;
> >>>>> +        bool local_ban = ban ||
> >>>>> !intel_engine_has_heartbeat(ce->engine);
> >>>
> >>> In any case (pending me understanding what's really going on there),
> >>> why would this check not be in kill_context with currently does this:
> >>>
> >>>     bool ban = (!i915_gem_context_is_persistent(ctx) ||
> >>>             !ctx->i915->params.enable_hangcheck);
> >>> ...
> >>>         kill_engines(pos, ban);
> >>>
> >>> So whether to ban decision would be consolidated to one place.
> >>>
> >>> In fact, decision on whether to allow persistent is tied to
> >>> enable_hangcheck, which also drives hearbeat emission. So perhaps
> >>> one part of the correct fix is to extend the above (kill_context)
> >>> ban criteria to include hearbeat values anyway. Otherwise isn't it a
> >>> simple miss that this check fails to account to hearbeat disablement
> >>> via sysfs?
> >>>
> >>> Regards,
> >>>
> >>> Tvrtko
> >>>
> >>>>> -        if (ban && intel_context_ban(ce, NULL))
> >>>>> +        if (local_ban && intel_context_ban(ce, NULL))
> >>>>>               continue;
> >>>>>           /*
> >>>>> @@ -1104,7 +1105,7 @@ static void kill_engines(struct
> >>>>> i915_gem_engines *engines, bool ban)
> >>>>>           engine = active_engine(ce);
> >>>>>           /* First attempt to gracefully cancel the context */
> >>>>> -        if (engine && !__cancel_engine(engine) && ban)
> >>>>> +        if (engine && !__cancel_engine(engine) && local_ban)
> >>>>>               /*
> >>>>>                * If we are unable to send a preemptive pulse to bump
> >>>>>                * the context from the GPU, we have to resort to a
> >>>>> full
> >>>>> diff --git a/drivers/gpu/drm/i915/gt/intel_context_types.h
> >>>>> b/drivers/gpu/drm/i915/gt/intel_context_types.h
> >>>>> index e54351a170e2..65f2eb2a78e4 100644
> >>>>> --- a/drivers/gpu/drm/i915/gt/intel_context_types.h
> >>>>> +++ b/drivers/gpu/drm/i915/gt/intel_context_types.h
> >>>>> @@ -55,6 +55,8 @@ struct intel_context_ops {
> >>>>>       void (*reset)(struct intel_context *ce);
> >>>>>       void (*destroy)(struct kref *kref);
> >>>>> +    bool (*has_heartbeat)(const struct intel_engine_cs *engine);
> >>>>> +
> >>>>>       /* virtual engine/context interface */
> >>>>>       struct intel_context *(*create_virtual)(struct
> >>>>> intel_engine_cs **engine,
> >>>>>                           unsigned int count);
> >>>>> diff --git a/drivers/gpu/drm/i915/gt/intel_engine.h
> >>>>> b/drivers/gpu/drm/i915/gt/intel_engine.h
> >>>>> index c2a5640ae055..1b11a808acc4 100644
> >>>>> --- a/drivers/gpu/drm/i915/gt/intel_engine.h
> >>>>> +++ b/drivers/gpu/drm/i915/gt/intel_engine.h
> >>>>> @@ -283,28 +283,11 @@ struct intel_context *
> >>>>>   intel_engine_create_virtual(struct intel_engine_cs **siblings,
> >>>>>                   unsigned int count);
> >>>>> -static inline bool
> >>>>> -intel_virtual_engine_has_heartbeat(const struct intel_engine_cs
> >>>>> *engine)
> >>>>> -{
> >>>>> -    /*
> >>>>> -     * For non-GuC submission we expect the back-end to look at the
> >>>>> -     * heartbeat status of the actual physical engine that the work
> >>>>> -     * has been (or is being) scheduled on, so we should only reach
> >>>>> -     * here with GuC submission enabled.
> >>>>> -     */
> >>>>> -    GEM_BUG_ON(!intel_engine_uses_guc(engine));
> >>>>> -
> >>>>> -    return intel_guc_virtual_engine_has_heartbeat(engine);
> >>>>> -}
> >>>>> -
> >>>>>   static inline bool
> >>>>>   intel_engine_has_heartbeat(const struct intel_engine_cs *engine)
> >>>>>   {
> >>>>> -    if (!IS_ACTIVE(CONFIG_DRM_I915_HEARTBEAT_INTERVAL))
> >>>>> -        return false;
> >>>>> -
> >>>>> -    if (intel_engine_is_virtual(engine))
> >>>>> -        return intel_virtual_engine_has_heartbeat(engine);
> >>>>> +    if (engine->cops->has_heartbeat)
> >>>>> +        return engine->cops->has_heartbeat(engine);
> >>>>>       else
> >>>>>           return READ_ONCE(engine->props.heartbeat_interval_ms);
> >>>>>   }
> >>>>> diff --git a/drivers/gpu/drm/i915/gt/intel_execlists_submission.c
> >>>>> b/drivers/gpu/drm/i915/gt/intel_execlists_submission.c
> >>>>> index de5f9c86b9a4..18005b5546b6 100644
> >>>>> --- a/drivers/gpu/drm/i915/gt/intel_execlists_submission.c
> >>>>> +++ b/drivers/gpu/drm/i915/gt/intel_execlists_submission.c
> >>>>> @@ -3619,6 +3619,18 @@ virtual_get_sibling(struct intel_engine_cs
> >>>>> *engine, unsigned int sibling)
> >>>>>       return ve->siblings[sibling];
> >>>>>   }
> >>>>> +static bool virtual_engine_has_heartbeat(const struct
> >>>>> intel_engine_cs *ve)
> >>>>> +{
> >>>>> +    struct intel_engine_cs *engine;
> >>>>> +    intel_engine_mask_t tmp, mask = ve->mask;
> >>>>> +
> >>>>> +    for_each_engine_masked(engine, ve->gt, mask, tmp)
> >>>>> +        if (READ_ONCE(engine->props.heartbeat_interval_ms))
> >>>>> +            return true;
> >>>>> +
> >>>>> +    return false;
> >>>>> +}
> >>>>> +
> >>>>>   static const struct intel_context_ops virtual_context_ops = {
> >>>>>       .flags = COPS_HAS_INFLIGHT,
> >>>>> @@ -3634,6 +3646,8 @@ static const struct intel_context_ops
> >>>>> virtual_context_ops = {
> >>>>>       .enter = virtual_context_enter,
> >>>>>       .exit = virtual_context_exit,
> >>>>> +    .has_heartbeat = virtual_engine_has_heartbeat,
> >>>>> +
> >>>>>       .destroy = virtual_context_destroy,
> >>>>>       .get_sibling = virtual_get_sibling,
> >>>>> diff --git a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c
> >>>>> b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c
> >>>>> index 89ff0e4b4bc7..ae70bff3605f 100644
> >>>>> --- a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c
> >>>>> +++ b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c
> >>>>> @@ -2168,6 +2168,8 @@ static int guc_virtual_context_alloc(struct
> >>>>> intel_context *ce)
> >>>>>       return lrc_alloc(ce, engine);
> >>>>>   }
> >>>>> +static bool guc_virtual_engine_has_heartbeat(const struct
> >>>>> intel_engine_cs *ve);
> >>>>> +
> >>>>>   static const struct intel_context_ops virtual_guc_context_ops = {
> >>>>>       .alloc = guc_virtual_context_alloc,
> >>>>> @@ -2183,6 +2185,8 @@ static const struct intel_context_ops
> >>>>> virtual_guc_context_ops = {
> >>>>>       .enter = guc_virtual_context_enter,
> >>>>>       .exit = guc_virtual_context_exit,
> >>>>> +    .has_heartbeat = guc_virtual_engine_has_heartbeat,
> >>>>> +
> >>>>>       .sched_disable = guc_context_sched_disable,
> >>>>>       .destroy = guc_context_destroy,
> >>>>> @@ -3029,7 +3033,7 @@ guc_create_virtual(struct intel_engine_cs
> >>>>> **siblings, unsigned int count)
> >>>>>       return ERR_PTR(err);
> >>>>>   }
> >>>>> -bool intel_guc_virtual_engine_has_heartbeat(const struct
> >>>>> intel_engine_cs *ve)
> >>>>> +static bool guc_virtual_engine_has_heartbeat(const struct
> >>>>> intel_engine_cs *ve)
> >>>>>   {
> >>>>>       struct intel_engine_cs *engine;
> >>>>>       intel_engine_mask_t tmp, mask = ve->mask;
> >>>>> diff --git a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h
> >>>>> b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h
> >>>>> index c7ef44fa0c36..c2afc3b88fd8 100644
> >>>>> --- a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h
> >>>>> +++ b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h
> >>>>> @@ -29,8 +29,6 @@ void intel_guc_dump_active_requests(struct
> >>>>> intel_engine_cs *engine,
> >>>>>                       struct i915_request *hung_rq,
> >>>>>                       struct drm_printer *m);
> >>>>> -bool intel_guc_virtual_engine_has_heartbeat(const struct
> >>>>> intel_engine_cs *ve);
> >>>>> -
> >>>>>   int intel_guc_wait_for_pending_msg(struct intel_guc *guc,
> >>>>>                      atomic_t *wait_var,
> >>>>>                      bool interruptible,
> >>>>
> >>>> _______________________________________________
> >>>> Intel-gfx mailing list
> >>>> Intel-gfx@lists.freedesktop.org
> >>>> https://lists.freedesktop.org/mailman/listinfo/intel-gfx
> >>
>


-- 
Daniel Vetter
Software Engineer, Intel Corporation
http://blog.ffwll.ch

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: [Intel-gfx] [PATCH 1/1] drm/i915: Check if engine has heartbeat when closing a context
  2021-08-06 19:46             ` Daniel Vetter
@ 2021-08-09 23:12               ` John Harrison
  2021-08-10  6:36                 ` Daniel Vetter
  0 siblings, 1 reply; 16+ messages in thread
From: John Harrison @ 2021-08-09 23:12 UTC (permalink / raw)
  To: Daniel Vetter; +Cc: Tvrtko Ursulin, Matthew Brost, intel-gfx, dri-devel

On 8/6/2021 12:46, Daniel Vetter wrote:
> Seen this fly by and figured I dropped a few thoughts in here. At the
> likely cost of looking a bit out of whack :-)
>
> On Fri, Aug 6, 2021 at 8:01 PM John Harrison <john.c.harrison@intel.com> wrote:
>> On 8/2/2021 02:40, Tvrtko Ursulin wrote:
>>> On 30/07/2021 19:13, John Harrison wrote:
>>>> On 7/30/2021 02:49, Tvrtko Ursulin wrote:
>>>>> On 30/07/2021 01:13, John Harrison wrote:
>>>>>> On 7/28/2021 17:34, Matthew Brost wrote:
>>>>>>> If an engine associated with a context does not have a heartbeat,
>>>>>>> ban it
>>>>>>> immediately. This is needed for GuC submission as a idle pulse
>>>>>>> doesn't
>>>>>>> kick the context off the hardware where it then can check for a
>>>>>>> heartbeat and ban the context.
>>>>> Pulse, that is a request with I915_PRIORITY_BARRIER, does not
>>>>> preempt a running normal priority context?
>>>>>
>>>>> Why does it matter then whether or not heartbeats are enabled - when
>>>>> heartbeat just ends up sending the same engine pulse (eventually,
>>>>> with raising priority)?
>>>> The point is that the pulse is pointless. See the rest of my comments
>>>> below, specifically "the context will get resubmitted to the hardware
>>>> after the pulse completes". To re-iterate...
>>>>
>>>> Yes, it preempts the context. Yes, it does so whether heartbeats are
>>>> enabled or not. But so what? Who cares? You have preempted a context.
>>>> It is no longer running on the hardware. BUT IT IS STILL A VALID
>>>> CONTEXT.
>>> It is valid yes, and it even may be the current ABI so another
>>> question is whether it is okay to change that.
>>>
>>>> The backend scheduler will just resubmit it to the hardware as soon
>>>> as the pulse completes. The only reason this works at all is because
>>>> of the horrid hack in the execlist scheduler's back end
>>>> implementation (in __execlists_schedule_in):
>>>>           if (unlikely(intel_context_is_closed(ce) &&
>>>>                        !intel_engine_has_heartbeat(engine)))
>>>>                   intel_context_set_banned(ce);
>>> Right, is the above code then needed with this patch - when ban is
>>> immediately applied on the higher level?
>>>
>>>> The actual back end scheduler is saying "Is this a zombie context? Is
>>>> the heartbeat disabled? Then ban it". No other scheduler backend is
>>>> going to have knowledge of zombie context status or of the heartbeat
>>>> status. Nor are they going to call back into the higher levels of the
>>>> i915 driver to trigger a ban operation. Certainly a hardware
>>>> implemented scheduler is not going to be looking at private i915
>>>> driver information to decide whether to submit a context or whether
>>>> to tell the OS to kill it off instead.
>>>>
>>>> For persistence to work with a hardware scheduler (or a non-Intel
>>>> specific scheduler such as the DRM one), the handling of zombie
>>>> contexts, banning, etc. *must* be done entirely in the front end. It
>>>> cannot rely on any backend hacks. That means you can't rely on any
>>>> fancy behaviour of pulses.
>>>>
>>>> If you want to ban a context then you must explicitly ban that
>>>> context. If you want to ban it at some later point then you need to
>>>> track it at the top level as a zombie and then explicitly ban that
>>>> zombie at whatever later point.
>>> I am still trying to understand it all. If I go by the commit message:
>>>
>>> """
>>> This is needed for GuC submission as a idle pulse doesn't
>>> kick the context off the hardware where it then can check for a
>>> heartbeat and ban the context.
>>> """
>>>
>>> That did not explain things for me. Sentence does not appear to make
>>> sense. Now, it seems "kick off the hardware" is meant as revoke and
>>> not just preempt. Which is fine, perhaps just needs to be written more
>>> explicitly. But the part of checking for heartbeat after idle pulse
>>> does not compute for me. It is the heartbeat which emits idle pulses,
>>> not idle pulse emitting heartbeats.
>> I am in agreement that the commit message is confusing and does not
>> explain either the problem or the solution.
>>
>>
>>>
>>> But anyway, I can buy the handling at the front end story completely.
>>> It makes sense. We just need to agree that a) it is okay to change the
>>> ABI and b) remove the backend check from execlists if it is not needed
>>> any longer.
>>>
>>> And if ABI change is okay then commit message needs to talk about it
>>> loudly and clearly.
>> I don't think we have a choice. The current ABI is not and cannot ever
>> be compatible with any scheduler external to i915. It cannot be
>> implemented with a hardware scheduler such as the GuC and it cannot be
>> implemented with an external software scheduler such as the DRM one.
> So generally on linux we implement helper libraries, which means
> massive flexibility everywhere.
>
> https://blog.ffwll.ch/2016/12/midlayers-once-more-with-feeling.html
>
> So it shouldn't be an insurmountable problem to make this happen even
> with drm/scheduler, we can patch it up.
>
> Whether that's justified is another question.
Helper libraries won't work with a hardware scheduler.

>
>> My view is that any implementation involving knowledge of the heartbeat
>> is fundamentally broken.
>>
>> According to Daniel Vetter, the DRM ABI on this subject is that an
>> actively executing context should persist until the DRM file handle is
>> closed. That seems like a much more plausible and simple ABI than one
> DRM ABI is maybe a bit an overkill statement. It's more "what other
> drivers do", but it's generally a good idea to not ignore that :-)
>
>> that says 'if the heartbeat is running then a context will persist
>> forever, if the heartbeat is not running then it will be killed
>> immediately, if the heart was running but then stops running then the
>> context will be killed on the next context switch, ...'. And if I
>> understand it correctly, the current ABI allows a badly written user app
>> to cause a denial of service by leaving contexts permanently running an
>> infinite loop on the hardware even after the app has been killed! How
>> can that ever be considered a good idea?
> We're not going to support changing all these settings at runtime.
> There's just not point in trying to make that work race-free, it
> either adds complexity to the code for no reason, or it adds overhead
> to the code for no reason.
>
> Yes I know existing customers and all that, but
> - they can change this stuff, and when they change it while anyting is
> in-flight they get to keep the pieces. These options taint the kernel
> for a reason (and if they don't, that should be fixed)
> - quite a few around heartbeat and compute support as we've merged a
> while ago hang by design when trying to smash them into drm rules.
> We're not going to fix that, and we should not use any existing such
> assumptions as justification for code changes.
>
> Wrt infinitely running: Right now nothing is allowed to run forever,
> because hangcheck will step in and kill that job. Once we add compute
> mode ctx flag we'll require killing on process exit to stop escape.
If the infinite loop is pre-emptible then the heartbeat won't kill it 
off. It will just run forever. Okay, it won't be a huge denial of 
service because other work can pre-empt and run. However, you are down 
one timeslice execution slot at that priority level. You have also 
permanently lost whatever memory is allocated and in use by that workload.


>
>> Therefore, the context close implementation should be to add an active
>> context to a zombie list. If a context is in zombie state and its last
>> request completes then the context can be immediately killed at that
>> point. Otherwise, on DRM handle close, we go through the zombie list and
>> immediately kill all contexts.
>>
>> Simple, clean, no back-end scheduler hacks, no reliance on heartbeats or
>> pulses. Also no opportunity for rogue (or just badly written) user
>> processes to leave zombie contexts running on the hardware forever and
>> causing a denial of service attack. If the host process is killed, all
>> of its GPU processes are also killed irrespective of what dodgy context
>> flags they may or may not have set.
> Uh, the intel_context state machine is already a bit too complex, and
> the implementation lacks a bunch of barriers at least from the cursor
> look I've given it thus far.
>
> So if we really need to make that more complex with more states then I
> think someone needs to come up with an actual clean design, with
> proper state transitions and all the barriers (or really, a design
> which doesn't need barriers). This is going to be work.
> -Daniel
Personally, I would rather just drop the whole persistence/zombie idea 
completely. If you close your context then you should expect that 
context to be destroyed and any outstanding workloads killed off. If you 
wanted the results then you should have waited for them.

If we do have to support some level of persistence then it doesn't seem 
like tracking closed contexts should be especially complex. Not sure why 
it would need special barriers either.

John.

>> John.
>>
>>
>>> Or perhaps there is no ABI change? I am not really clear how does
>>> setting banned status propagate to the GuC backend. I mean at which
>>> point does i915 ends up passing that info to the firmware?
>>>
>>> Regards,
>>>
>>> Tvrtko
>>>
>>>>
>>>>>> It's worse than this. If the engine in question is an individual
>>>>>> physical engine then sending a pulse (with sufficiently high
>>>>>> priority) will pre-empt the engine and kick the context off.
>>>>>> However, the GuC
>>>>> Why it is different for physical vs virtual, aren't both just
>>>>> schedulable contexts with different engine masks for what GuC is
>>>>> concerned? Oh, is it a matter of needing to send pulses to all
>>>>> engines which comprise a virtual one?
>>>> It isn't different. It is totally broken for both. It is potentially
>>>> more broken for virtual engines because of the question of which
>>>> engine to pulse. But as stated above, the pulse is pointless anyway
>>>> so the which engine question doesn't even matter.
>>>>
>>>> John.
>>>>
>>>>
>>>>>> scheduler does not have hacks in it to check the state of the
>>>>>> heartbeat or whether a context is actually a zombie or not. Thus,
>>>>>> the context will get resubmitted to the hardware after the pulse
>>>>>> completes and effectively nothing will have happened.
>>>>>>
>>>>>> I would assume that the DRM scheduler which we are meant to be
>>>>>> switching to for execlist as well as GuC submission is also
>>>>>> unlikely to have hacks for zombie contexts and tests for whether
>>>>>> the i915 specific heartbeat has been disabled since the context
>>>>>> became a zombie. So when that switch happens, this test will also
>>>>>> fail in execlist mode as well as GuC mode.
>>>>>>
>>>>>> The choices I see here are to simply remove persistence completely
>>>>>> (it is a basically a bug that became UAPI because it wasn't caught
>>>>>> soon enough!) or to implement it in a way that does not require
>>>>>> hacks in the back end scheduler. Apparently, the DRM scheduler is
>>>>>> expected to allow zombie contexts to persist until the DRM file
>>>>>> handle is closed. So presumably we will have to go with option two.
>>>>>>
>>>>>> That means flagging a context as being a zombie when it is closed
>>>>>> but still active. The driver would then add it to a zombie list
>>>>>> owned by the DRM client object. When that client object is closed,
>>>>>> i915 would go through the list and genuinely kill all the contexts.
>>>>>> No back end scheduler hacks required and no intimate knowledge of
>>>>>> the i915 heartbeat mechanism required either.
>>>>>>
>>>>>> John.
>>>>>>
>>>>>>
>>>>>>> This patch also updates intel_engine_has_heartbeat to be a vfunc
>>>>>>> as we
>>>>>>> now need to call this function on execlists virtual engines too.
>>>>>>>
>>>>>>> Signed-off-by: Matthew Brost <matthew.brost@intel.com>
>>>>>>> ---
>>>>>>>    drivers/gpu/drm/i915/gem/i915_gem_context.c   |  5 +++--
>>>>>>>    drivers/gpu/drm/i915/gt/intel_context_types.h |  2 ++
>>>>>>>    drivers/gpu/drm/i915/gt/intel_engine.h        | 21
>>>>>>> ++-----------------
>>>>>>>    .../drm/i915/gt/intel_execlists_submission.c  | 14 +++++++++++++
>>>>>>>    .../gpu/drm/i915/gt/uc/intel_guc_submission.c |  6 +++++-
>>>>>>>    .../gpu/drm/i915/gt/uc/intel_guc_submission.h |  2 --
>>>>>>>    6 files changed, 26 insertions(+), 24 deletions(-)
>>>>>>>
>>>>>>> diff --git a/drivers/gpu/drm/i915/gem/i915_gem_context.c
>>>>>>> b/drivers/gpu/drm/i915/gem/i915_gem_context.c
>>>>>>> index 9c3672bac0e2..b8e01c5ba9e5 100644
>>>>>>> --- a/drivers/gpu/drm/i915/gem/i915_gem_context.c
>>>>>>> +++ b/drivers/gpu/drm/i915/gem/i915_gem_context.c
>>>>>>> @@ -1090,8 +1090,9 @@ static void kill_engines(struct
>>>>>>> i915_gem_engines *engines, bool ban)
>>>>>>>         */
>>>>>>>        for_each_gem_engine(ce, engines, it) {
>>>>>>>            struct intel_engine_cs *engine;
>>>>>>> +        bool local_ban = ban ||
>>>>>>> !intel_engine_has_heartbeat(ce->engine);
>>>>> In any case (pending me understanding what's really going on there),
>>>>> why would this check not be in kill_context with currently does this:
>>>>>
>>>>>      bool ban = (!i915_gem_context_is_persistent(ctx) ||
>>>>>              !ctx->i915->params.enable_hangcheck);
>>>>> ...
>>>>>          kill_engines(pos, ban);
>>>>>
>>>>> So whether to ban decision would be consolidated to one place.
>>>>>
>>>>> In fact, decision on whether to allow persistent is tied to
>>>>> enable_hangcheck, which also drives hearbeat emission. So perhaps
>>>>> one part of the correct fix is to extend the above (kill_context)
>>>>> ban criteria to include hearbeat values anyway. Otherwise isn't it a
>>>>> simple miss that this check fails to account to hearbeat disablement
>>>>> via sysfs?
>>>>>
>>>>> Regards,
>>>>>
>>>>> Tvrtko
>>>>>
>>>>>>> -        if (ban && intel_context_ban(ce, NULL))
>>>>>>> +        if (local_ban && intel_context_ban(ce, NULL))
>>>>>>>                continue;
>>>>>>>            /*
>>>>>>> @@ -1104,7 +1105,7 @@ static void kill_engines(struct
>>>>>>> i915_gem_engines *engines, bool ban)
>>>>>>>            engine = active_engine(ce);
>>>>>>>            /* First attempt to gracefully cancel the context */
>>>>>>> -        if (engine && !__cancel_engine(engine) && ban)
>>>>>>> +        if (engine && !__cancel_engine(engine) && local_ban)
>>>>>>>                /*
>>>>>>>                 * If we are unable to send a preemptive pulse to bump
>>>>>>>                 * the context from the GPU, we have to resort to a
>>>>>>> full
>>>>>>> diff --git a/drivers/gpu/drm/i915/gt/intel_context_types.h
>>>>>>> b/drivers/gpu/drm/i915/gt/intel_context_types.h
>>>>>>> index e54351a170e2..65f2eb2a78e4 100644
>>>>>>> --- a/drivers/gpu/drm/i915/gt/intel_context_types.h
>>>>>>> +++ b/drivers/gpu/drm/i915/gt/intel_context_types.h
>>>>>>> @@ -55,6 +55,8 @@ struct intel_context_ops {
>>>>>>>        void (*reset)(struct intel_context *ce);
>>>>>>>        void (*destroy)(struct kref *kref);
>>>>>>> +    bool (*has_heartbeat)(const struct intel_engine_cs *engine);
>>>>>>> +
>>>>>>>        /* virtual engine/context interface */
>>>>>>>        struct intel_context *(*create_virtual)(struct
>>>>>>> intel_engine_cs **engine,
>>>>>>>                            unsigned int count);
>>>>>>> diff --git a/drivers/gpu/drm/i915/gt/intel_engine.h
>>>>>>> b/drivers/gpu/drm/i915/gt/intel_engine.h
>>>>>>> index c2a5640ae055..1b11a808acc4 100644
>>>>>>> --- a/drivers/gpu/drm/i915/gt/intel_engine.h
>>>>>>> +++ b/drivers/gpu/drm/i915/gt/intel_engine.h
>>>>>>> @@ -283,28 +283,11 @@ struct intel_context *
>>>>>>>    intel_engine_create_virtual(struct intel_engine_cs **siblings,
>>>>>>>                    unsigned int count);
>>>>>>> -static inline bool
>>>>>>> -intel_virtual_engine_has_heartbeat(const struct intel_engine_cs
>>>>>>> *engine)
>>>>>>> -{
>>>>>>> -    /*
>>>>>>> -     * For non-GuC submission we expect the back-end to look at the
>>>>>>> -     * heartbeat status of the actual physical engine that the work
>>>>>>> -     * has been (or is being) scheduled on, so we should only reach
>>>>>>> -     * here with GuC submission enabled.
>>>>>>> -     */
>>>>>>> -    GEM_BUG_ON(!intel_engine_uses_guc(engine));
>>>>>>> -
>>>>>>> -    return intel_guc_virtual_engine_has_heartbeat(engine);
>>>>>>> -}
>>>>>>> -
>>>>>>>    static inline bool
>>>>>>>    intel_engine_has_heartbeat(const struct intel_engine_cs *engine)
>>>>>>>    {
>>>>>>> -    if (!IS_ACTIVE(CONFIG_DRM_I915_HEARTBEAT_INTERVAL))
>>>>>>> -        return false;
>>>>>>> -
>>>>>>> -    if (intel_engine_is_virtual(engine))
>>>>>>> -        return intel_virtual_engine_has_heartbeat(engine);
>>>>>>> +    if (engine->cops->has_heartbeat)
>>>>>>> +        return engine->cops->has_heartbeat(engine);
>>>>>>>        else
>>>>>>>            return READ_ONCE(engine->props.heartbeat_interval_ms);
>>>>>>>    }
>>>>>>> diff --git a/drivers/gpu/drm/i915/gt/intel_execlists_submission.c
>>>>>>> b/drivers/gpu/drm/i915/gt/intel_execlists_submission.c
>>>>>>> index de5f9c86b9a4..18005b5546b6 100644
>>>>>>> --- a/drivers/gpu/drm/i915/gt/intel_execlists_submission.c
>>>>>>> +++ b/drivers/gpu/drm/i915/gt/intel_execlists_submission.c
>>>>>>> @@ -3619,6 +3619,18 @@ virtual_get_sibling(struct intel_engine_cs
>>>>>>> *engine, unsigned int sibling)
>>>>>>>        return ve->siblings[sibling];
>>>>>>>    }
>>>>>>> +static bool virtual_engine_has_heartbeat(const struct
>>>>>>> intel_engine_cs *ve)
>>>>>>> +{
>>>>>>> +    struct intel_engine_cs *engine;
>>>>>>> +    intel_engine_mask_t tmp, mask = ve->mask;
>>>>>>> +
>>>>>>> +    for_each_engine_masked(engine, ve->gt, mask, tmp)
>>>>>>> +        if (READ_ONCE(engine->props.heartbeat_interval_ms))
>>>>>>> +            return true;
>>>>>>> +
>>>>>>> +    return false;
>>>>>>> +}
>>>>>>> +
>>>>>>>    static const struct intel_context_ops virtual_context_ops = {
>>>>>>>        .flags = COPS_HAS_INFLIGHT,
>>>>>>> @@ -3634,6 +3646,8 @@ static const struct intel_context_ops
>>>>>>> virtual_context_ops = {
>>>>>>>        .enter = virtual_context_enter,
>>>>>>>        .exit = virtual_context_exit,
>>>>>>> +    .has_heartbeat = virtual_engine_has_heartbeat,
>>>>>>> +
>>>>>>>        .destroy = virtual_context_destroy,
>>>>>>>        .get_sibling = virtual_get_sibling,
>>>>>>> diff --git a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c
>>>>>>> b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c
>>>>>>> index 89ff0e4b4bc7..ae70bff3605f 100644
>>>>>>> --- a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c
>>>>>>> +++ b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c
>>>>>>> @@ -2168,6 +2168,8 @@ static int guc_virtual_context_alloc(struct
>>>>>>> intel_context *ce)
>>>>>>>        return lrc_alloc(ce, engine);
>>>>>>>    }
>>>>>>> +static bool guc_virtual_engine_has_heartbeat(const struct
>>>>>>> intel_engine_cs *ve);
>>>>>>> +
>>>>>>>    static const struct intel_context_ops virtual_guc_context_ops = {
>>>>>>>        .alloc = guc_virtual_context_alloc,
>>>>>>> @@ -2183,6 +2185,8 @@ static const struct intel_context_ops
>>>>>>> virtual_guc_context_ops = {
>>>>>>>        .enter = guc_virtual_context_enter,
>>>>>>>        .exit = guc_virtual_context_exit,
>>>>>>> +    .has_heartbeat = guc_virtual_engine_has_heartbeat,
>>>>>>> +
>>>>>>>        .sched_disable = guc_context_sched_disable,
>>>>>>>        .destroy = guc_context_destroy,
>>>>>>> @@ -3029,7 +3033,7 @@ guc_create_virtual(struct intel_engine_cs
>>>>>>> **siblings, unsigned int count)
>>>>>>>        return ERR_PTR(err);
>>>>>>>    }
>>>>>>> -bool intel_guc_virtual_engine_has_heartbeat(const struct
>>>>>>> intel_engine_cs *ve)
>>>>>>> +static bool guc_virtual_engine_has_heartbeat(const struct
>>>>>>> intel_engine_cs *ve)
>>>>>>>    {
>>>>>>>        struct intel_engine_cs *engine;
>>>>>>>        intel_engine_mask_t tmp, mask = ve->mask;
>>>>>>> diff --git a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h
>>>>>>> b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h
>>>>>>> index c7ef44fa0c36..c2afc3b88fd8 100644
>>>>>>> --- a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h
>>>>>>> +++ b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h
>>>>>>> @@ -29,8 +29,6 @@ void intel_guc_dump_active_requests(struct
>>>>>>> intel_engine_cs *engine,
>>>>>>>                        struct i915_request *hung_rq,
>>>>>>>                        struct drm_printer *m);
>>>>>>> -bool intel_guc_virtual_engine_has_heartbeat(const struct
>>>>>>> intel_engine_cs *ve);
>>>>>>> -
>>>>>>>    int intel_guc_wait_for_pending_msg(struct intel_guc *guc,
>>>>>>>                       atomic_t *wait_var,
>>>>>>>                       bool interruptible,
>>>>>> _______________________________________________
>>>>>> Intel-gfx mailing list
>>>>>> Intel-gfx@lists.freedesktop.org
>>>>>> https://lists.freedesktop.org/mailman/listinfo/intel-gfx
>


^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: [Intel-gfx] [PATCH 1/1] drm/i915: Check if engine has heartbeat when closing a context
  2021-08-09 23:12               ` John Harrison
@ 2021-08-10  6:36                 ` Daniel Vetter
  2021-08-18  0:28                   ` John Harrison
  0 siblings, 1 reply; 16+ messages in thread
From: Daniel Vetter @ 2021-08-10  6:36 UTC (permalink / raw)
  To: John Harrison
  Cc: Daniel Vetter, Tvrtko Ursulin, Matthew Brost, intel-gfx, dri-devel

On Mon, Aug 09, 2021 at 04:12:52PM -0700, John Harrison wrote:
> On 8/6/2021 12:46, Daniel Vetter wrote:
> > Seen this fly by and figured I dropped a few thoughts in here. At the
> > likely cost of looking a bit out of whack :-)
> > 
> > On Fri, Aug 6, 2021 at 8:01 PM John Harrison <john.c.harrison@intel.com> wrote:
> > > On 8/2/2021 02:40, Tvrtko Ursulin wrote:
> > > > On 30/07/2021 19:13, John Harrison wrote:
> > > > > On 7/30/2021 02:49, Tvrtko Ursulin wrote:
> > > > > > On 30/07/2021 01:13, John Harrison wrote:
> > > > > > > On 7/28/2021 17:34, Matthew Brost wrote:
> > > > > > > > If an engine associated with a context does not have a heartbeat,
> > > > > > > > ban it
> > > > > > > > immediately. This is needed for GuC submission as a idle pulse
> > > > > > > > doesn't
> > > > > > > > kick the context off the hardware where it then can check for a
> > > > > > > > heartbeat and ban the context.
> > > > > > Pulse, that is a request with I915_PRIORITY_BARRIER, does not
> > > > > > preempt a running normal priority context?
> > > > > > 
> > > > > > Why does it matter then whether or not heartbeats are enabled - when
> > > > > > heartbeat just ends up sending the same engine pulse (eventually,
> > > > > > with raising priority)?
> > > > > The point is that the pulse is pointless. See the rest of my comments
> > > > > below, specifically "the context will get resubmitted to the hardware
> > > > > after the pulse completes". To re-iterate...
> > > > > 
> > > > > Yes, it preempts the context. Yes, it does so whether heartbeats are
> > > > > enabled or not. But so what? Who cares? You have preempted a context.
> > > > > It is no longer running on the hardware. BUT IT IS STILL A VALID
> > > > > CONTEXT.
> > > > It is valid yes, and it even may be the current ABI so another
> > > > question is whether it is okay to change that.
> > > > 
> > > > > The backend scheduler will just resubmit it to the hardware as soon
> > > > > as the pulse completes. The only reason this works at all is because
> > > > > of the horrid hack in the execlist scheduler's back end
> > > > > implementation (in __execlists_schedule_in):
> > > > >           if (unlikely(intel_context_is_closed(ce) &&
> > > > >                        !intel_engine_has_heartbeat(engine)))
> > > > >                   intel_context_set_banned(ce);
> > > > Right, is the above code then needed with this patch - when ban is
> > > > immediately applied on the higher level?
> > > > 
> > > > > The actual back end scheduler is saying "Is this a zombie context? Is
> > > > > the heartbeat disabled? Then ban it". No other scheduler backend is
> > > > > going to have knowledge of zombie context status or of the heartbeat
> > > > > status. Nor are they going to call back into the higher levels of the
> > > > > i915 driver to trigger a ban operation. Certainly a hardware
> > > > > implemented scheduler is not going to be looking at private i915
> > > > > driver information to decide whether to submit a context or whether
> > > > > to tell the OS to kill it off instead.
> > > > > 
> > > > > For persistence to work with a hardware scheduler (or a non-Intel
> > > > > specific scheduler such as the DRM one), the handling of zombie
> > > > > contexts, banning, etc. *must* be done entirely in the front end. It
> > > > > cannot rely on any backend hacks. That means you can't rely on any
> > > > > fancy behaviour of pulses.
> > > > > 
> > > > > If you want to ban a context then you must explicitly ban that
> > > > > context. If you want to ban it at some later point then you need to
> > > > > track it at the top level as a zombie and then explicitly ban that
> > > > > zombie at whatever later point.
> > > > I am still trying to understand it all. If I go by the commit message:
> > > > 
> > > > """
> > > > This is needed for GuC submission as a idle pulse doesn't
> > > > kick the context off the hardware where it then can check for a
> > > > heartbeat and ban the context.
> > > > """
> > > > 
> > > > That did not explain things for me. Sentence does not appear to make
> > > > sense. Now, it seems "kick off the hardware" is meant as revoke and
> > > > not just preempt. Which is fine, perhaps just needs to be written more
> > > > explicitly. But the part of checking for heartbeat after idle pulse
> > > > does not compute for me. It is the heartbeat which emits idle pulses,
> > > > not idle pulse emitting heartbeats.
> > > I am in agreement that the commit message is confusing and does not
> > > explain either the problem or the solution.
> > > 
> > > 
> > > > 
> > > > But anyway, I can buy the handling at the front end story completely.
> > > > It makes sense. We just need to agree that a) it is okay to change the
> > > > ABI and b) remove the backend check from execlists if it is not needed
> > > > any longer.
> > > > 
> > > > And if ABI change is okay then commit message needs to talk about it
> > > > loudly and clearly.
> > > I don't think we have a choice. The current ABI is not and cannot ever
> > > be compatible with any scheduler external to i915. It cannot be
> > > implemented with a hardware scheduler such as the GuC and it cannot be
> > > implemented with an external software scheduler such as the DRM one.
> > So generally on linux we implement helper libraries, which means
> > massive flexibility everywhere.
> > 
> > https://blog.ffwll.ch/2016/12/midlayers-once-more-with-feeling.html
> > 
> > So it shouldn't be an insurmountable problem to make this happen even
> > with drm/scheduler, we can patch it up.
> > 
> > Whether that's justified is another question.
> Helper libraries won't work with a hardware scheduler.

Hm I guess I misunderstood then what exactly the hold-up is. This entire
discussion feels at least a bit like "heartbeat is unchangeable and guc
must fit", which is pretty much the midlayer mistake. We need to figure
out an implementation that works with GuC of the goals of the uapi,
instead of assuming that the current heartbeat is the only possible way to
achieve that.

Or I'm just very confused about what the problem is.

> > > My view is that any implementation involving knowledge of the heartbeat
> > > is fundamentally broken.
> > > 
> > > According to Daniel Vetter, the DRM ABI on this subject is that an
> > > actively executing context should persist until the DRM file handle is
> > > closed. That seems like a much more plausible and simple ABI than one
> > DRM ABI is maybe a bit an overkill statement. It's more "what other
> > drivers do", but it's generally a good idea to not ignore that :-)
> > 
> > > that says 'if the heartbeat is running then a context will persist
> > > forever, if the heartbeat is not running then it will be killed
> > > immediately, if the heart was running but then stops running then the
> > > context will be killed on the next context switch, ...'. And if I
> > > understand it correctly, the current ABI allows a badly written user app
> > > to cause a denial of service by leaving contexts permanently running an
> > > infinite loop on the hardware even after the app has been killed! How
> > > can that ever be considered a good idea?
> > We're not going to support changing all these settings at runtime.
> > There's just not point in trying to make that work race-free, it
> > either adds complexity to the code for no reason, or it adds overhead
> > to the code for no reason.
> > 
> > Yes I know existing customers and all that, but
> > - they can change this stuff, and when they change it while anyting is
> > in-flight they get to keep the pieces. These options taint the kernel
> > for a reason (and if they don't, that should be fixed)
> > - quite a few around heartbeat and compute support as we've merged a
> > while ago hang by design when trying to smash them into drm rules.
> > We're not going to fix that, and we should not use any existing such
> > assumptions as justification for code changes.
> > 
> > Wrt infinitely running: Right now nothing is allowed to run forever,
> > because hangcheck will step in and kill that job. Once we add compute
> > mode ctx flag we'll require killing on process exit to stop escape.
> If the infinite loop is pre-emptible then the heartbeat won't kill it off.
> It will just run forever. Okay, it won't be a huge denial of service because
> other work can pre-empt and run. However, you are down one timeslice
> execution slot at that priority level. You have also permanently lost
> whatever memory is allocated and in use by that workload.

Ok I think I'm definitely lost.

Right now, in upstream, you can't run forever without regularly calling
execbuf to stuff new work in. So it will die out, it wont be persistent
for very long.

> > > Therefore, the context close implementation should be to add an active
> > > context to a zombie list. If a context is in zombie state and its last
> > > request completes then the context can be immediately killed at that
> > > point. Otherwise, on DRM handle close, we go through the zombie list and
> > > immediately kill all contexts.
> > > 
> > > Simple, clean, no back-end scheduler hacks, no reliance on heartbeats or
> > > pulses. Also no opportunity for rogue (or just badly written) user
> > > processes to leave zombie contexts running on the hardware forever and
> > > causing a denial of service attack. If the host process is killed, all
> > > of its GPU processes are also killed irrespective of what dodgy context
> > > flags they may or may not have set.
> > Uh, the intel_context state machine is already a bit too complex, and
> > the implementation lacks a bunch of barriers at least from the cursor
> > look I've given it thus far.
> > 
> > So if we really need to make that more complex with more states then I
> > think someone needs to come up with an actual clean design, with
> > proper state transitions and all the barriers (or really, a design
> > which doesn't need barriers). This is going to be work.
> > -Daniel
> Personally, I would rather just drop the whole persistence/zombie idea
> completely. If you close your context then you should expect that context to
> be destroyed and any outstanding workloads killed off. If you wanted the
> results then you should have waited for them.
> 
> If we do have to support some level of persistence then it doesn't seem like
> tracking closed contexts should be especially complex. Not sure why it would
> need special barriers either.

Frankly I think I'm lost, and I think the confusion (for me at least)
starts with what the current uapi is.

Can someone please document that, with kerneldoc in the uapi header
ideally? Once we have that defined I think we can have an actual
discussion about what exactly this should look like with GuC (and also
eventually with drm/scheduler), and which parts of the uapi are just
artifacts of the current implementation, and which parts actually matter.

Otherwise I think we're just spinning wheels a bit much here.
-Daniel

> 
> John.
> 
> > > John.
> > > 
> > > 
> > > > Or perhaps there is no ABI change? I am not really clear how does
> > > > setting banned status propagate to the GuC backend. I mean at which
> > > > point does i915 ends up passing that info to the firmware?
> > > > 
> > > > Regards,
> > > > 
> > > > Tvrtko
> > > > 
> > > > > 
> > > > > > > It's worse than this. If the engine in question is an individual
> > > > > > > physical engine then sending a pulse (with sufficiently high
> > > > > > > priority) will pre-empt the engine and kick the context off.
> > > > > > > However, the GuC
> > > > > > Why it is different for physical vs virtual, aren't both just
> > > > > > schedulable contexts with different engine masks for what GuC is
> > > > > > concerned? Oh, is it a matter of needing to send pulses to all
> > > > > > engines which comprise a virtual one?
> > > > > It isn't different. It is totally broken for both. It is potentially
> > > > > more broken for virtual engines because of the question of which
> > > > > engine to pulse. But as stated above, the pulse is pointless anyway
> > > > > so the which engine question doesn't even matter.
> > > > > 
> > > > > John.
> > > > > 
> > > > > 
> > > > > > > scheduler does not have hacks in it to check the state of the
> > > > > > > heartbeat or whether a context is actually a zombie or not. Thus,
> > > > > > > the context will get resubmitted to the hardware after the pulse
> > > > > > > completes and effectively nothing will have happened.
> > > > > > > 
> > > > > > > I would assume that the DRM scheduler which we are meant to be
> > > > > > > switching to for execlist as well as GuC submission is also
> > > > > > > unlikely to have hacks for zombie contexts and tests for whether
> > > > > > > the i915 specific heartbeat has been disabled since the context
> > > > > > > became a zombie. So when that switch happens, this test will also
> > > > > > > fail in execlist mode as well as GuC mode.
> > > > > > > 
> > > > > > > The choices I see here are to simply remove persistence completely
> > > > > > > (it is a basically a bug that became UAPI because it wasn't caught
> > > > > > > soon enough!) or to implement it in a way that does not require
> > > > > > > hacks in the back end scheduler. Apparently, the DRM scheduler is
> > > > > > > expected to allow zombie contexts to persist until the DRM file
> > > > > > > handle is closed. So presumably we will have to go with option two.
> > > > > > > 
> > > > > > > That means flagging a context as being a zombie when it is closed
> > > > > > > but still active. The driver would then add it to a zombie list
> > > > > > > owned by the DRM client object. When that client object is closed,
> > > > > > > i915 would go through the list and genuinely kill all the contexts.
> > > > > > > No back end scheduler hacks required and no intimate knowledge of
> > > > > > > the i915 heartbeat mechanism required either.
> > > > > > > 
> > > > > > > John.
> > > > > > > 
> > > > > > > 
> > > > > > > > This patch also updates intel_engine_has_heartbeat to be a vfunc
> > > > > > > > as we
> > > > > > > > now need to call this function on execlists virtual engines too.
> > > > > > > > 
> > > > > > > > Signed-off-by: Matthew Brost <matthew.brost@intel.com>
> > > > > > > > ---
> > > > > > > >    drivers/gpu/drm/i915/gem/i915_gem_context.c   |  5 +++--
> > > > > > > >    drivers/gpu/drm/i915/gt/intel_context_types.h |  2 ++
> > > > > > > >    drivers/gpu/drm/i915/gt/intel_engine.h        | 21
> > > > > > > > ++-----------------
> > > > > > > >    .../drm/i915/gt/intel_execlists_submission.c  | 14 +++++++++++++
> > > > > > > >    .../gpu/drm/i915/gt/uc/intel_guc_submission.c |  6 +++++-
> > > > > > > >    .../gpu/drm/i915/gt/uc/intel_guc_submission.h |  2 --
> > > > > > > >    6 files changed, 26 insertions(+), 24 deletions(-)
> > > > > > > > 
> > > > > > > > diff --git a/drivers/gpu/drm/i915/gem/i915_gem_context.c
> > > > > > > > b/drivers/gpu/drm/i915/gem/i915_gem_context.c
> > > > > > > > index 9c3672bac0e2..b8e01c5ba9e5 100644
> > > > > > > > --- a/drivers/gpu/drm/i915/gem/i915_gem_context.c
> > > > > > > > +++ b/drivers/gpu/drm/i915/gem/i915_gem_context.c
> > > > > > > > @@ -1090,8 +1090,9 @@ static void kill_engines(struct
> > > > > > > > i915_gem_engines *engines, bool ban)
> > > > > > > >         */
> > > > > > > >        for_each_gem_engine(ce, engines, it) {
> > > > > > > >            struct intel_engine_cs *engine;
> > > > > > > > +        bool local_ban = ban ||
> > > > > > > > !intel_engine_has_heartbeat(ce->engine);
> > > > > > In any case (pending me understanding what's really going on there),
> > > > > > why would this check not be in kill_context with currently does this:
> > > > > > 
> > > > > >      bool ban = (!i915_gem_context_is_persistent(ctx) ||
> > > > > >              !ctx->i915->params.enable_hangcheck);
> > > > > > ...
> > > > > >          kill_engines(pos, ban);
> > > > > > 
> > > > > > So whether to ban decision would be consolidated to one place.
> > > > > > 
> > > > > > In fact, decision on whether to allow persistent is tied to
> > > > > > enable_hangcheck, which also drives hearbeat emission. So perhaps
> > > > > > one part of the correct fix is to extend the above (kill_context)
> > > > > > ban criteria to include hearbeat values anyway. Otherwise isn't it a
> > > > > > simple miss that this check fails to account to hearbeat disablement
> > > > > > via sysfs?
> > > > > > 
> > > > > > Regards,
> > > > > > 
> > > > > > Tvrtko
> > > > > > 
> > > > > > > > -        if (ban && intel_context_ban(ce, NULL))
> > > > > > > > +        if (local_ban && intel_context_ban(ce, NULL))
> > > > > > > >                continue;
> > > > > > > >            /*
> > > > > > > > @@ -1104,7 +1105,7 @@ static void kill_engines(struct
> > > > > > > > i915_gem_engines *engines, bool ban)
> > > > > > > >            engine = active_engine(ce);
> > > > > > > >            /* First attempt to gracefully cancel the context */
> > > > > > > > -        if (engine && !__cancel_engine(engine) && ban)
> > > > > > > > +        if (engine && !__cancel_engine(engine) && local_ban)
> > > > > > > >                /*
> > > > > > > >                 * If we are unable to send a preemptive pulse to bump
> > > > > > > >                 * the context from the GPU, we have to resort to a
> > > > > > > > full
> > > > > > > > diff --git a/drivers/gpu/drm/i915/gt/intel_context_types.h
> > > > > > > > b/drivers/gpu/drm/i915/gt/intel_context_types.h
> > > > > > > > index e54351a170e2..65f2eb2a78e4 100644
> > > > > > > > --- a/drivers/gpu/drm/i915/gt/intel_context_types.h
> > > > > > > > +++ b/drivers/gpu/drm/i915/gt/intel_context_types.h
> > > > > > > > @@ -55,6 +55,8 @@ struct intel_context_ops {
> > > > > > > >        void (*reset)(struct intel_context *ce);
> > > > > > > >        void (*destroy)(struct kref *kref);
> > > > > > > > +    bool (*has_heartbeat)(const struct intel_engine_cs *engine);
> > > > > > > > +
> > > > > > > >        /* virtual engine/context interface */
> > > > > > > >        struct intel_context *(*create_virtual)(struct
> > > > > > > > intel_engine_cs **engine,
> > > > > > > >                            unsigned int count);
> > > > > > > > diff --git a/drivers/gpu/drm/i915/gt/intel_engine.h
> > > > > > > > b/drivers/gpu/drm/i915/gt/intel_engine.h
> > > > > > > > index c2a5640ae055..1b11a808acc4 100644
> > > > > > > > --- a/drivers/gpu/drm/i915/gt/intel_engine.h
> > > > > > > > +++ b/drivers/gpu/drm/i915/gt/intel_engine.h
> > > > > > > > @@ -283,28 +283,11 @@ struct intel_context *
> > > > > > > >    intel_engine_create_virtual(struct intel_engine_cs **siblings,
> > > > > > > >                    unsigned int count);
> > > > > > > > -static inline bool
> > > > > > > > -intel_virtual_engine_has_heartbeat(const struct intel_engine_cs
> > > > > > > > *engine)
> > > > > > > > -{
> > > > > > > > -    /*
> > > > > > > > -     * For non-GuC submission we expect the back-end to look at the
> > > > > > > > -     * heartbeat status of the actual physical engine that the work
> > > > > > > > -     * has been (or is being) scheduled on, so we should only reach
> > > > > > > > -     * here with GuC submission enabled.
> > > > > > > > -     */
> > > > > > > > -    GEM_BUG_ON(!intel_engine_uses_guc(engine));
> > > > > > > > -
> > > > > > > > -    return intel_guc_virtual_engine_has_heartbeat(engine);
> > > > > > > > -}
> > > > > > > > -
> > > > > > > >    static inline bool
> > > > > > > >    intel_engine_has_heartbeat(const struct intel_engine_cs *engine)
> > > > > > > >    {
> > > > > > > > -    if (!IS_ACTIVE(CONFIG_DRM_I915_HEARTBEAT_INTERVAL))
> > > > > > > > -        return false;
> > > > > > > > -
> > > > > > > > -    if (intel_engine_is_virtual(engine))
> > > > > > > > -        return intel_virtual_engine_has_heartbeat(engine);
> > > > > > > > +    if (engine->cops->has_heartbeat)
> > > > > > > > +        return engine->cops->has_heartbeat(engine);
> > > > > > > >        else
> > > > > > > >            return READ_ONCE(engine->props.heartbeat_interval_ms);
> > > > > > > >    }
> > > > > > > > diff --git a/drivers/gpu/drm/i915/gt/intel_execlists_submission.c
> > > > > > > > b/drivers/gpu/drm/i915/gt/intel_execlists_submission.c
> > > > > > > > index de5f9c86b9a4..18005b5546b6 100644
> > > > > > > > --- a/drivers/gpu/drm/i915/gt/intel_execlists_submission.c
> > > > > > > > +++ b/drivers/gpu/drm/i915/gt/intel_execlists_submission.c
> > > > > > > > @@ -3619,6 +3619,18 @@ virtual_get_sibling(struct intel_engine_cs
> > > > > > > > *engine, unsigned int sibling)
> > > > > > > >        return ve->siblings[sibling];
> > > > > > > >    }
> > > > > > > > +static bool virtual_engine_has_heartbeat(const struct
> > > > > > > > intel_engine_cs *ve)
> > > > > > > > +{
> > > > > > > > +    struct intel_engine_cs *engine;
> > > > > > > > +    intel_engine_mask_t tmp, mask = ve->mask;
> > > > > > > > +
> > > > > > > > +    for_each_engine_masked(engine, ve->gt, mask, tmp)
> > > > > > > > +        if (READ_ONCE(engine->props.heartbeat_interval_ms))
> > > > > > > > +            return true;
> > > > > > > > +
> > > > > > > > +    return false;
> > > > > > > > +}
> > > > > > > > +
> > > > > > > >    static const struct intel_context_ops virtual_context_ops = {
> > > > > > > >        .flags = COPS_HAS_INFLIGHT,
> > > > > > > > @@ -3634,6 +3646,8 @@ static const struct intel_context_ops
> > > > > > > > virtual_context_ops = {
> > > > > > > >        .enter = virtual_context_enter,
> > > > > > > >        .exit = virtual_context_exit,
> > > > > > > > +    .has_heartbeat = virtual_engine_has_heartbeat,
> > > > > > > > +
> > > > > > > >        .destroy = virtual_context_destroy,
> > > > > > > >        .get_sibling = virtual_get_sibling,
> > > > > > > > diff --git a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c
> > > > > > > > b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c
> > > > > > > > index 89ff0e4b4bc7..ae70bff3605f 100644
> > > > > > > > --- a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c
> > > > > > > > +++ b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c
> > > > > > > > @@ -2168,6 +2168,8 @@ static int guc_virtual_context_alloc(struct
> > > > > > > > intel_context *ce)
> > > > > > > >        return lrc_alloc(ce, engine);
> > > > > > > >    }
> > > > > > > > +static bool guc_virtual_engine_has_heartbeat(const struct
> > > > > > > > intel_engine_cs *ve);
> > > > > > > > +
> > > > > > > >    static const struct intel_context_ops virtual_guc_context_ops = {
> > > > > > > >        .alloc = guc_virtual_context_alloc,
> > > > > > > > @@ -2183,6 +2185,8 @@ static const struct intel_context_ops
> > > > > > > > virtual_guc_context_ops = {
> > > > > > > >        .enter = guc_virtual_context_enter,
> > > > > > > >        .exit = guc_virtual_context_exit,
> > > > > > > > +    .has_heartbeat = guc_virtual_engine_has_heartbeat,
> > > > > > > > +
> > > > > > > >        .sched_disable = guc_context_sched_disable,
> > > > > > > >        .destroy = guc_context_destroy,
> > > > > > > > @@ -3029,7 +3033,7 @@ guc_create_virtual(struct intel_engine_cs
> > > > > > > > **siblings, unsigned int count)
> > > > > > > >        return ERR_PTR(err);
> > > > > > > >    }
> > > > > > > > -bool intel_guc_virtual_engine_has_heartbeat(const struct
> > > > > > > > intel_engine_cs *ve)
> > > > > > > > +static bool guc_virtual_engine_has_heartbeat(const struct
> > > > > > > > intel_engine_cs *ve)
> > > > > > > >    {
> > > > > > > >        struct intel_engine_cs *engine;
> > > > > > > >        intel_engine_mask_t tmp, mask = ve->mask;
> > > > > > > > diff --git a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h
> > > > > > > > b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h
> > > > > > > > index c7ef44fa0c36..c2afc3b88fd8 100644
> > > > > > > > --- a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h
> > > > > > > > +++ b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h
> > > > > > > > @@ -29,8 +29,6 @@ void intel_guc_dump_active_requests(struct
> > > > > > > > intel_engine_cs *engine,
> > > > > > > >                        struct i915_request *hung_rq,
> > > > > > > >                        struct drm_printer *m);
> > > > > > > > -bool intel_guc_virtual_engine_has_heartbeat(const struct
> > > > > > > > intel_engine_cs *ve);
> > > > > > > > -
> > > > > > > >    int intel_guc_wait_for_pending_msg(struct intel_guc *guc,
> > > > > > > >                       atomic_t *wait_var,
> > > > > > > >                       bool interruptible,
> > > > > > > _______________________________________________
> > > > > > > Intel-gfx mailing list
> > > > > > > Intel-gfx@lists.freedesktop.org
> > > > > > > https://lists.freedesktop.org/mailman/listinfo/intel-gfx
> > 
> 

-- 
Daniel Vetter
Software Engineer, Intel Corporation
http://blog.ffwll.ch

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: [PATCH 0/1] Fix gem_ctx_persistence failures with GuC submission
  2021-07-29  0:33 [PATCH 0/1] Fix gem_ctx_persistence failures with GuC submission Matthew Brost
  2021-07-29  0:34 ` [PATCH 1/1] drm/i915: Check if engine has heartbeat when closing a context Matthew Brost
@ 2021-08-10  6:38 ` Daniel Vetter
  2021-08-18  0:08   ` [Intel-gfx] " John Harrison
  1 sibling, 1 reply; 16+ messages in thread
From: Daniel Vetter @ 2021-08-10  6:38 UTC (permalink / raw)
  To: Matthew Brost; +Cc: intel-gfx, dri-devel

On Wed, Jul 28, 2021 at 05:33:59PM -0700, Matthew Brost wrote:
> Should fix below failures with GuC submission for the following tests:
> gem_exec_balancer --r noheartbeat
> gem_ctx_persistence --r heartbeat-close
> 
> Not going to fix:
> gem_ctx_persistence --r heartbeat-many
> gem_ctx_persistence --r heartbeat-stop

After looking at that big thread and being very confused: Are we fixing an
actual use-case here, or is this another case of blindly following igts
tests just because they exist?

I'm leaning towards that we should stall on this, and first document what
exactly is the actual intention behind all this, and then fix up the tests
to match (if needed). And only then fix up GuC to match whatever we
actually want to do.
-Daniel

> 
> As the above tests change the heartbeat value to 0 (off) after the
> context is closed and we have no way to detect that with GuC submission
> unless we keep a list of closed but running contexts which seems like
> overkill for a non-real world use case. We likely should just skip these
> tests with GuC submission.
> 
> Signed-off-by: Matthew Brost <matthew.brost@intel.com>
> 
> Matthew Brost (1):
>   drm/i915: Check if engine has heartbeat when closing a context
> 
>  drivers/gpu/drm/i915/gem/i915_gem_context.c   |  5 +++--
>  drivers/gpu/drm/i915/gt/intel_context_types.h |  2 ++
>  drivers/gpu/drm/i915/gt/intel_engine.h        | 21 ++-----------------
>  .../drm/i915/gt/intel_execlists_submission.c  | 14 +++++++++++++
>  .../gpu/drm/i915/gt/uc/intel_guc_submission.c |  6 +++++-
>  .../gpu/drm/i915/gt/uc/intel_guc_submission.h |  2 --
>  6 files changed, 26 insertions(+), 24 deletions(-)
> 
> -- 
> 2.28.0
> 

-- 
Daniel Vetter
Software Engineer, Intel Corporation
http://blog.ffwll.ch

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: [Intel-gfx] [PATCH 0/1] Fix gem_ctx_persistence failures with GuC submission
  2021-08-10  6:38 ` [PATCH 0/1] Fix gem_ctx_persistence failures with GuC submission Daniel Vetter
@ 2021-08-18  0:08   ` John Harrison
  2021-08-18  9:49     ` Daniel Vetter
  0 siblings, 1 reply; 16+ messages in thread
From: John Harrison @ 2021-08-18  0:08 UTC (permalink / raw)
  To: Daniel Vetter, Matthew Brost; +Cc: intel-gfx, dri-devel

On 8/9/2021 23:38, Daniel Vetter wrote:
> On Wed, Jul 28, 2021 at 05:33:59PM -0700, Matthew Brost wrote:
>> Should fix below failures with GuC submission for the following tests:
>> gem_exec_balancer --r noheartbeat
>> gem_ctx_persistence --r heartbeat-close
>>
>> Not going to fix:
>> gem_ctx_persistence --r heartbeat-many
>> gem_ctx_persistence --r heartbeat-stop
> After looking at that big thread and being very confused: Are we fixing an
> actual use-case here, or is this another case of blindly following igts
> tests just because they exist?
My understanding is that this is established behaviour and therefore 
must be maintained because the UAPI (whether documented or not) is 
inviolate. Therefore IGTs have been written to validate this past 
behaviour and now we must conform to the IGTs in order to keep the 
existing behaviour unchanged.

Whether anybody actually makes use of this behaviour or not is another 
matter entirely. I am certainly not aware of any vital use case. Others 
might have more recollection. I do know that we tell the UMD teams to 
explicitly disable persistence on every context they create.

>
> I'm leaning towards that we should stall on this, and first document what
> exactly is the actual intention behind all this, and then fix up the tests
I'm not sure there ever was an 'intention'. The rumour I heard way back 
when was that persistence was a bug on earlier platforms (or possibly we 
didn't have hardware support for doing engine resets?). But once the bug 
was realised (or the hardware support was added), it was too late to 
change the default behaviour because existing kernel behaviour must 
never change on pain of painful things. Thus the persistence flag was 
added so that people could opt out of the broken, leaky behaviour and 
have their contexts clean up properly.

Feel free to document what you believe should be the behaviour from a 
software architect point of view. Any documentation I produce is 
basically going to be created by reverse engineering the existing code. 
That is the only 'spec' that I am aware of and as I keep saying, I 
personally think it is a totally broken concept that should just be removed.

> to match (if needed). And only then fix up GuC to match whatever we
> actually want to do.
I also still maintain there is no 'fix up the GuC'. This is not 
behaviour we should be adding to a hardware scheduler. It is behaviour 
that should be implemented at the front end not the back end. If we 
absolutely need to do this then we need to do it solely at the context 
management level not at the back end submission level. And the solution 
should work by default on any submission back end.

John.


> -Daniel
>
>> As the above tests change the heartbeat value to 0 (off) after the
>> context is closed and we have no way to detect that with GuC submission
>> unless we keep a list of closed but running contexts which seems like
>> overkill for a non-real world use case. We likely should just skip these
>> tests with GuC submission.
>>
>> Signed-off-by: Matthew Brost <matthew.brost@intel.com>
>>
>> Matthew Brost (1):
>>    drm/i915: Check if engine has heartbeat when closing a context
>>
>>   drivers/gpu/drm/i915/gem/i915_gem_context.c   |  5 +++--
>>   drivers/gpu/drm/i915/gt/intel_context_types.h |  2 ++
>>   drivers/gpu/drm/i915/gt/intel_engine.h        | 21 ++-----------------
>>   .../drm/i915/gt/intel_execlists_submission.c  | 14 +++++++++++++
>>   .../gpu/drm/i915/gt/uc/intel_guc_submission.c |  6 +++++-
>>   .../gpu/drm/i915/gt/uc/intel_guc_submission.h |  2 --
>>   6 files changed, 26 insertions(+), 24 deletions(-)
>>
>> -- 
>> 2.28.0
>>


^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: [Intel-gfx] [PATCH 1/1] drm/i915: Check if engine has heartbeat when closing a context
  2021-08-10  6:36                 ` Daniel Vetter
@ 2021-08-18  0:28                   ` John Harrison
  2021-08-18  9:26                     ` Daniel Vetter
  0 siblings, 1 reply; 16+ messages in thread
From: John Harrison @ 2021-08-18  0:28 UTC (permalink / raw)
  To: Daniel Vetter; +Cc: Tvrtko Ursulin, Matthew Brost, intel-gfx, dri-devel

On 8/9/2021 23:36, Daniel Vetter wrote:
> On Mon, Aug 09, 2021 at 04:12:52PM -0700, John Harrison wrote:
>> On 8/6/2021 12:46, Daniel Vetter wrote:
>>> Seen this fly by and figured I dropped a few thoughts in here. At the
>>> likely cost of looking a bit out of whack :-)
>>>
>>> On Fri, Aug 6, 2021 at 8:01 PM John Harrison <john.c.harrison@intel.com> wrote:
>>>> On 8/2/2021 02:40, Tvrtko Ursulin wrote:
>>>>> On 30/07/2021 19:13, John Harrison wrote:
>>>>>> On 7/30/2021 02:49, Tvrtko Ursulin wrote:
>>>>>>> On 30/07/2021 01:13, John Harrison wrote:
>>>>>>>> On 7/28/2021 17:34, Matthew Brost wrote:
>>>>>>>>> If an engine associated with a context does not have a heartbeat,
>>>>>>>>> ban it
>>>>>>>>> immediately. This is needed for GuC submission as a idle pulse
>>>>>>>>> doesn't
>>>>>>>>> kick the context off the hardware where it then can check for a
>>>>>>>>> heartbeat and ban the context.
>>>>>>> Pulse, that is a request with I915_PRIORITY_BARRIER, does not
>>>>>>> preempt a running normal priority context?
>>>>>>>
>>>>>>> Why does it matter then whether or not heartbeats are enabled - when
>>>>>>> heartbeat just ends up sending the same engine pulse (eventually,
>>>>>>> with raising priority)?
>>>>>> The point is that the pulse is pointless. See the rest of my comments
>>>>>> below, specifically "the context will get resubmitted to the hardware
>>>>>> after the pulse completes". To re-iterate...
>>>>>>
>>>>>> Yes, it preempts the context. Yes, it does so whether heartbeats are
>>>>>> enabled or not. But so what? Who cares? You have preempted a context.
>>>>>> It is no longer running on the hardware. BUT IT IS STILL A VALID
>>>>>> CONTEXT.
>>>>> It is valid yes, and it even may be the current ABI so another
>>>>> question is whether it is okay to change that.
>>>>>
>>>>>> The backend scheduler will just resubmit it to the hardware as soon
>>>>>> as the pulse completes. The only reason this works at all is because
>>>>>> of the horrid hack in the execlist scheduler's back end
>>>>>> implementation (in __execlists_schedule_in):
>>>>>>            if (unlikely(intel_context_is_closed(ce) &&
>>>>>>                         !intel_engine_has_heartbeat(engine)))
>>>>>>                    intel_context_set_banned(ce);
>>>>> Right, is the above code then needed with this patch - when ban is
>>>>> immediately applied on the higher level?
>>>>>
>>>>>> The actual back end scheduler is saying "Is this a zombie context? Is
>>>>>> the heartbeat disabled? Then ban it". No other scheduler backend is
>>>>>> going to have knowledge of zombie context status or of the heartbeat
>>>>>> status. Nor are they going to call back into the higher levels of the
>>>>>> i915 driver to trigger a ban operation. Certainly a hardware
>>>>>> implemented scheduler is not going to be looking at private i915
>>>>>> driver information to decide whether to submit a context or whether
>>>>>> to tell the OS to kill it off instead.
>>>>>>
>>>>>> For persistence to work with a hardware scheduler (or a non-Intel
>>>>>> specific scheduler such as the DRM one), the handling of zombie
>>>>>> contexts, banning, etc. *must* be done entirely in the front end. It
>>>>>> cannot rely on any backend hacks. That means you can't rely on any
>>>>>> fancy behaviour of pulses.
>>>>>>
>>>>>> If you want to ban a context then you must explicitly ban that
>>>>>> context. If you want to ban it at some later point then you need to
>>>>>> track it at the top level as a zombie and then explicitly ban that
>>>>>> zombie at whatever later point.
>>>>> I am still trying to understand it all. If I go by the commit message:
>>>>>
>>>>> """
>>>>> This is needed for GuC submission as a idle pulse doesn't
>>>>> kick the context off the hardware where it then can check for a
>>>>> heartbeat and ban the context.
>>>>> """
>>>>>
>>>>> That did not explain things for me. Sentence does not appear to make
>>>>> sense. Now, it seems "kick off the hardware" is meant as revoke and
>>>>> not just preempt. Which is fine, perhaps just needs to be written more
>>>>> explicitly. But the part of checking for heartbeat after idle pulse
>>>>> does not compute for me. It is the heartbeat which emits idle pulses,
>>>>> not idle pulse emitting heartbeats.
>>>> I am in agreement that the commit message is confusing and does not
>>>> explain either the problem or the solution.
>>>>
>>>>
>>>>> But anyway, I can buy the handling at the front end story completely.
>>>>> It makes sense. We just need to agree that a) it is okay to change the
>>>>> ABI and b) remove the backend check from execlists if it is not needed
>>>>> any longer.
>>>>>
>>>>> And if ABI change is okay then commit message needs to talk about it
>>>>> loudly and clearly.
>>>> I don't think we have a choice. The current ABI is not and cannot ever
>>>> be compatible with any scheduler external to i915. It cannot be
>>>> implemented with a hardware scheduler such as the GuC and it cannot be
>>>> implemented with an external software scheduler such as the DRM one.
>>> So generally on linux we implement helper libraries, which means
>>> massive flexibility everywhere.
>>>
>>> https://blog.ffwll.ch/2016/12/midlayers-once-more-with-feeling.html
>>>
>>> So it shouldn't be an insurmountable problem to make this happen even
>>> with drm/scheduler, we can patch it up.
>>>
>>> Whether that's justified is another question.
>> Helper libraries won't work with a hardware scheduler.
> Hm I guess I misunderstood then what exactly the hold-up is. This entire
> discussion feels at least a bit like "heartbeat is unchangeable and guc
> must fit", which is pretty much the midlayer mistake. We need to figure
> out an implementation that works with GuC of the goals of the uapi,
> instead of assuming that the current heartbeat is the only possible way to
> achieve that.
>
> Or I'm just very confused about what the problem is.

What I mean is that you can't add helper callback hook things into a 
hardware scheduler. It's hardware. It does what it does. Sure, the GuC 
is firmware but it is very limited in what it can do. It certainly can't 
peek into internal KMD state such as the heartbeat. Nor can it call back 
to i915 to execute code every time it wants to make a scheduling 
decision. That would be defeating the whole point of it being a CPU 
offload accelerator thing.

Also, what I'm arguing is that the heartbeat should not be involved in 
the management of persistent contexts in the first place. It is way over 
complicated, unnecessary and not intuitive to an end user in the slightest.

>
>>>> My view is that any implementation involving knowledge of the heartbeat
>>>> is fundamentally broken.
>>>>
>>>> According to Daniel Vetter, the DRM ABI on this subject is that an
>>>> actively executing context should persist until the DRM file handle is
>>>> closed. That seems like a much more plausible and simple ABI than one
>>> DRM ABI is maybe a bit an overkill statement. It's more "what other
>>> drivers do", but it's generally a good idea to not ignore that :-)
>>>
>>>> that says 'if the heartbeat is running then a context will persist
>>>> forever, if the heartbeat is not running then it will be killed
>>>> immediately, if the heart was running but then stops running then the
>>>> context will be killed on the next context switch, ...'. And if I
>>>> understand it correctly, the current ABI allows a badly written user app
>>>> to cause a denial of service by leaving contexts permanently running an
>>>> infinite loop on the hardware even after the app has been killed! How
>>>> can that ever be considered a good idea?
>>> We're not going to support changing all these settings at runtime.
>>> There's just not point in trying to make that work race-free, it
>>> either adds complexity to the code for no reason, or it adds overhead
>>> to the code for no reason.
>>>
>>> Yes I know existing customers and all that, but
>>> - they can change this stuff, and when they change it while anyting is
>>> in-flight they get to keep the pieces. These options taint the kernel
>>> for a reason (and if they don't, that should be fixed)
>>> - quite a few around heartbeat and compute support as we've merged a
>>> while ago hang by design when trying to smash them into drm rules.
>>> We're not going to fix that, and we should not use any existing such
>>> assumptions as justification for code changes.
>>>
>>> Wrt infinitely running: Right now nothing is allowed to run forever,
>>> because hangcheck will step in and kill that job. Once we add compute
>>> mode ctx flag we'll require killing on process exit to stop escape.
>> If the infinite loop is pre-emptible then the heartbeat won't kill it off.
>> It will just run forever. Okay, it won't be a huge denial of service because
>> other work can pre-empt and run. However, you are down one timeslice
>> execution slot at that priority level. You have also permanently lost
>> whatever memory is allocated and in use by that workload.
> Ok I think I'm definitely lost.
>
> Right now, in upstream, you can't run forever without regularly calling
> execbuf to stuff new work in. So it will die out, it wont be persistent
> for very long.
It is possible to write an infinite loop batch buffer that is 
pre-emptible. Once you set that running, no amount of heartbeats will 
kill it off. The heartbeat will happily pre-empt it and tell you that 
the system as a whole is still running just fine. And then the scheduler 
will set the infinite loop task running again because it still has more 
'work' to do.


>
>>>> Therefore, the context close implementation should be to add an active
>>>> context to a zombie list. If a context is in zombie state and its last
>>>> request completes then the context can be immediately killed at that
>>>> point. Otherwise, on DRM handle close, we go through the zombie list and
>>>> immediately kill all contexts.
>>>>
>>>> Simple, clean, no back-end scheduler hacks, no reliance on heartbeats or
>>>> pulses. Also no opportunity for rogue (or just badly written) user
>>>> processes to leave zombie contexts running on the hardware forever and
>>>> causing a denial of service attack. If the host process is killed, all
>>>> of its GPU processes are also killed irrespective of what dodgy context
>>>> flags they may or may not have set.
>>> Uh, the intel_context state machine is already a bit too complex, and
>>> the implementation lacks a bunch of barriers at least from the cursor
>>> look I've given it thus far.
>>>
>>> So if we really need to make that more complex with more states then I
>>> think someone needs to come up with an actual clean design, with
>>> proper state transitions and all the barriers (or really, a design
>>> which doesn't need barriers). This is going to be work.
>>> -Daniel
>> Personally, I would rather just drop the whole persistence/zombie idea
>> completely. If you close your context then you should expect that context to
>> be destroyed and any outstanding workloads killed off. If you wanted the
>> results then you should have waited for them.
>>
>> If we do have to support some level of persistence then it doesn't seem like
>> tracking closed contexts should be especially complex. Not sure why it would
>> need special barriers either.
> Frankly I think I'm lost, and I think the confusion (for me at least)
> starts with what the current uapi is.
>
> Can someone please document that, with kerneldoc in the uapi header
> ideally? Once we have that defined I think we can have an actual
> discussion about what exactly this should look like with GuC (and also
> eventually with drm/scheduler), and which parts of the uapi are just
> artifacts of the current implementation, and which parts actually matter.
>
> Otherwise I think we're just spinning wheels a bit much here.
> -Daniel
See other branch of this thread - feel free to write it yourself or 
elect someone who actually knows the history/reasons behind this to 
write it up. All I can do is reverse engineer the code and document what 
it currently does and what is required to pass the IGT test.

If you want documentation about what the interface *should* be then I 
can offer two options:

1. No persistence at all.
If you close a context (whether explicitly through a close context call 
or implicitly through closing the DRM file handle, being killed, etc.) 
then that context is destroyed immediately. All outstanding work is 
discarded.

2. Persistence until DRM handle closure.
You can close a context and have it keep running previously submitted 
work. However, as soon as the DRM file handle is closed (either 
explicitly or by being killed, etc.) then all contexts are immediately 
destroyed and all outstanding work is discarded.

Simple. Concise. Sensible. No long discussions about what the heartbeat 
enable state was when the context was closed versus what that state is 
at some future point. No platform specific caveats or interactions. And 
no opportunity to cause denial of service attacks either deliberately or 
accidentally (and no opportunity for hideously complex KMD 
implementations to introduce potential DOS bugs either).

John.


>
>> John.
>>
>>>> John.
>>>>
>>>>
>>>>> Or perhaps there is no ABI change? I am not really clear how does
>>>>> setting banned status propagate to the GuC backend. I mean at which
>>>>> point does i915 ends up passing that info to the firmware?
>>>>>
>>>>> Regards,
>>>>>
>>>>> Tvrtko
>>>>>
>>>>>>>> It's worse than this. If the engine in question is an individual
>>>>>>>> physical engine then sending a pulse (with sufficiently high
>>>>>>>> priority) will pre-empt the engine and kick the context off.
>>>>>>>> However, the GuC
>>>>>>> Why it is different for physical vs virtual, aren't both just
>>>>>>> schedulable contexts with different engine masks for what GuC is
>>>>>>> concerned? Oh, is it a matter of needing to send pulses to all
>>>>>>> engines which comprise a virtual one?
>>>>>> It isn't different. It is totally broken for both. It is potentially
>>>>>> more broken for virtual engines because of the question of which
>>>>>> engine to pulse. But as stated above, the pulse is pointless anyway
>>>>>> so the which engine question doesn't even matter.
>>>>>>
>>>>>> John.
>>>>>>
>>>>>>
>>>>>>>> scheduler does not have hacks in it to check the state of the
>>>>>>>> heartbeat or whether a context is actually a zombie or not. Thus,
>>>>>>>> the context will get resubmitted to the hardware after the pulse
>>>>>>>> completes and effectively nothing will have happened.
>>>>>>>>
>>>>>>>> I would assume that the DRM scheduler which we are meant to be
>>>>>>>> switching to for execlist as well as GuC submission is also
>>>>>>>> unlikely to have hacks for zombie contexts and tests for whether
>>>>>>>> the i915 specific heartbeat has been disabled since the context
>>>>>>>> became a zombie. So when that switch happens, this test will also
>>>>>>>> fail in execlist mode as well as GuC mode.
>>>>>>>>
>>>>>>>> The choices I see here are to simply remove persistence completely
>>>>>>>> (it is a basically a bug that became UAPI because it wasn't caught
>>>>>>>> soon enough!) or to implement it in a way that does not require
>>>>>>>> hacks in the back end scheduler. Apparently, the DRM scheduler is
>>>>>>>> expected to allow zombie contexts to persist until the DRM file
>>>>>>>> handle is closed. So presumably we will have to go with option two.
>>>>>>>>
>>>>>>>> That means flagging a context as being a zombie when it is closed
>>>>>>>> but still active. The driver would then add it to a zombie list
>>>>>>>> owned by the DRM client object. When that client object is closed,
>>>>>>>> i915 would go through the list and genuinely kill all the contexts.
>>>>>>>> No back end scheduler hacks required and no intimate knowledge of
>>>>>>>> the i915 heartbeat mechanism required either.
>>>>>>>>
>>>>>>>> John.
>>>>>>>>
>>>>>>>>
>>>>>>>>> This patch also updates intel_engine_has_heartbeat to be a vfunc
>>>>>>>>> as we
>>>>>>>>> now need to call this function on execlists virtual engines too.
>>>>>>>>>
>>>>>>>>> Signed-off-by: Matthew Brost <matthew.brost@intel.com>
>>>>>>>>> ---
>>>>>>>>>     drivers/gpu/drm/i915/gem/i915_gem_context.c   |  5 +++--
>>>>>>>>>     drivers/gpu/drm/i915/gt/intel_context_types.h |  2 ++
>>>>>>>>>     drivers/gpu/drm/i915/gt/intel_engine.h        | 21
>>>>>>>>> ++-----------------
>>>>>>>>>     .../drm/i915/gt/intel_execlists_submission.c  | 14 +++++++++++++
>>>>>>>>>     .../gpu/drm/i915/gt/uc/intel_guc_submission.c |  6 +++++-
>>>>>>>>>     .../gpu/drm/i915/gt/uc/intel_guc_submission.h |  2 --
>>>>>>>>>     6 files changed, 26 insertions(+), 24 deletions(-)
>>>>>>>>>
>>>>>>>>> diff --git a/drivers/gpu/drm/i915/gem/i915_gem_context.c
>>>>>>>>> b/drivers/gpu/drm/i915/gem/i915_gem_context.c
>>>>>>>>> index 9c3672bac0e2..b8e01c5ba9e5 100644
>>>>>>>>> --- a/drivers/gpu/drm/i915/gem/i915_gem_context.c
>>>>>>>>> +++ b/drivers/gpu/drm/i915/gem/i915_gem_context.c
>>>>>>>>> @@ -1090,8 +1090,9 @@ static void kill_engines(struct
>>>>>>>>> i915_gem_engines *engines, bool ban)
>>>>>>>>>          */
>>>>>>>>>         for_each_gem_engine(ce, engines, it) {
>>>>>>>>>             struct intel_engine_cs *engine;
>>>>>>>>> +        bool local_ban = ban ||
>>>>>>>>> !intel_engine_has_heartbeat(ce->engine);
>>>>>>> In any case (pending me understanding what's really going on there),
>>>>>>> why would this check not be in kill_context with currently does this:
>>>>>>>
>>>>>>>       bool ban = (!i915_gem_context_is_persistent(ctx) ||
>>>>>>>               !ctx->i915->params.enable_hangcheck);
>>>>>>> ...
>>>>>>>           kill_engines(pos, ban);
>>>>>>>
>>>>>>> So whether to ban decision would be consolidated to one place.
>>>>>>>
>>>>>>> In fact, decision on whether to allow persistent is tied to
>>>>>>> enable_hangcheck, which also drives hearbeat emission. So perhaps
>>>>>>> one part of the correct fix is to extend the above (kill_context)
>>>>>>> ban criteria to include hearbeat values anyway. Otherwise isn't it a
>>>>>>> simple miss that this check fails to account to hearbeat disablement
>>>>>>> via sysfs?
>>>>>>>
>>>>>>> Regards,
>>>>>>>
>>>>>>> Tvrtko
>>>>>>>
>>>>>>>>> -        if (ban && intel_context_ban(ce, NULL))
>>>>>>>>> +        if (local_ban && intel_context_ban(ce, NULL))
>>>>>>>>>                 continue;
>>>>>>>>>             /*
>>>>>>>>> @@ -1104,7 +1105,7 @@ static void kill_engines(struct
>>>>>>>>> i915_gem_engines *engines, bool ban)
>>>>>>>>>             engine = active_engine(ce);
>>>>>>>>>             /* First attempt to gracefully cancel the context */
>>>>>>>>> -        if (engine && !__cancel_engine(engine) && ban)
>>>>>>>>> +        if (engine && !__cancel_engine(engine) && local_ban)
>>>>>>>>>                 /*
>>>>>>>>>                  * If we are unable to send a preemptive pulse to bump
>>>>>>>>>                  * the context from the GPU, we have to resort to a
>>>>>>>>> full
>>>>>>>>> diff --git a/drivers/gpu/drm/i915/gt/intel_context_types.h
>>>>>>>>> b/drivers/gpu/drm/i915/gt/intel_context_types.h
>>>>>>>>> index e54351a170e2..65f2eb2a78e4 100644
>>>>>>>>> --- a/drivers/gpu/drm/i915/gt/intel_context_types.h
>>>>>>>>> +++ b/drivers/gpu/drm/i915/gt/intel_context_types.h
>>>>>>>>> @@ -55,6 +55,8 @@ struct intel_context_ops {
>>>>>>>>>         void (*reset)(struct intel_context *ce);
>>>>>>>>>         void (*destroy)(struct kref *kref);
>>>>>>>>> +    bool (*has_heartbeat)(const struct intel_engine_cs *engine);
>>>>>>>>> +
>>>>>>>>>         /* virtual engine/context interface */
>>>>>>>>>         struct intel_context *(*create_virtual)(struct
>>>>>>>>> intel_engine_cs **engine,
>>>>>>>>>                             unsigned int count);
>>>>>>>>> diff --git a/drivers/gpu/drm/i915/gt/intel_engine.h
>>>>>>>>> b/drivers/gpu/drm/i915/gt/intel_engine.h
>>>>>>>>> index c2a5640ae055..1b11a808acc4 100644
>>>>>>>>> --- a/drivers/gpu/drm/i915/gt/intel_engine.h
>>>>>>>>> +++ b/drivers/gpu/drm/i915/gt/intel_engine.h
>>>>>>>>> @@ -283,28 +283,11 @@ struct intel_context *
>>>>>>>>>     intel_engine_create_virtual(struct intel_engine_cs **siblings,
>>>>>>>>>                     unsigned int count);
>>>>>>>>> -static inline bool
>>>>>>>>> -intel_virtual_engine_has_heartbeat(const struct intel_engine_cs
>>>>>>>>> *engine)
>>>>>>>>> -{
>>>>>>>>> -    /*
>>>>>>>>> -     * For non-GuC submission we expect the back-end to look at the
>>>>>>>>> -     * heartbeat status of the actual physical engine that the work
>>>>>>>>> -     * has been (or is being) scheduled on, so we should only reach
>>>>>>>>> -     * here with GuC submission enabled.
>>>>>>>>> -     */
>>>>>>>>> -    GEM_BUG_ON(!intel_engine_uses_guc(engine));
>>>>>>>>> -
>>>>>>>>> -    return intel_guc_virtual_engine_has_heartbeat(engine);
>>>>>>>>> -}
>>>>>>>>> -
>>>>>>>>>     static inline bool
>>>>>>>>>     intel_engine_has_heartbeat(const struct intel_engine_cs *engine)
>>>>>>>>>     {
>>>>>>>>> -    if (!IS_ACTIVE(CONFIG_DRM_I915_HEARTBEAT_INTERVAL))
>>>>>>>>> -        return false;
>>>>>>>>> -
>>>>>>>>> -    if (intel_engine_is_virtual(engine))
>>>>>>>>> -        return intel_virtual_engine_has_heartbeat(engine);
>>>>>>>>> +    if (engine->cops->has_heartbeat)
>>>>>>>>> +        return engine->cops->has_heartbeat(engine);
>>>>>>>>>         else
>>>>>>>>>             return READ_ONCE(engine->props.heartbeat_interval_ms);
>>>>>>>>>     }
>>>>>>>>> diff --git a/drivers/gpu/drm/i915/gt/intel_execlists_submission.c
>>>>>>>>> b/drivers/gpu/drm/i915/gt/intel_execlists_submission.c
>>>>>>>>> index de5f9c86b9a4..18005b5546b6 100644
>>>>>>>>> --- a/drivers/gpu/drm/i915/gt/intel_execlists_submission.c
>>>>>>>>> +++ b/drivers/gpu/drm/i915/gt/intel_execlists_submission.c
>>>>>>>>> @@ -3619,6 +3619,18 @@ virtual_get_sibling(struct intel_engine_cs
>>>>>>>>> *engine, unsigned int sibling)
>>>>>>>>>         return ve->siblings[sibling];
>>>>>>>>>     }
>>>>>>>>> +static bool virtual_engine_has_heartbeat(const struct
>>>>>>>>> intel_engine_cs *ve)
>>>>>>>>> +{
>>>>>>>>> +    struct intel_engine_cs *engine;
>>>>>>>>> +    intel_engine_mask_t tmp, mask = ve->mask;
>>>>>>>>> +
>>>>>>>>> +    for_each_engine_masked(engine, ve->gt, mask, tmp)
>>>>>>>>> +        if (READ_ONCE(engine->props.heartbeat_interval_ms))
>>>>>>>>> +            return true;
>>>>>>>>> +
>>>>>>>>> +    return false;
>>>>>>>>> +}
>>>>>>>>> +
>>>>>>>>>     static const struct intel_context_ops virtual_context_ops = {
>>>>>>>>>         .flags = COPS_HAS_INFLIGHT,
>>>>>>>>> @@ -3634,6 +3646,8 @@ static const struct intel_context_ops
>>>>>>>>> virtual_context_ops = {
>>>>>>>>>         .enter = virtual_context_enter,
>>>>>>>>>         .exit = virtual_context_exit,
>>>>>>>>> +    .has_heartbeat = virtual_engine_has_heartbeat,
>>>>>>>>> +
>>>>>>>>>         .destroy = virtual_context_destroy,
>>>>>>>>>         .get_sibling = virtual_get_sibling,
>>>>>>>>> diff --git a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c
>>>>>>>>> b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c
>>>>>>>>> index 89ff0e4b4bc7..ae70bff3605f 100644
>>>>>>>>> --- a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c
>>>>>>>>> +++ b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c
>>>>>>>>> @@ -2168,6 +2168,8 @@ static int guc_virtual_context_alloc(struct
>>>>>>>>> intel_context *ce)
>>>>>>>>>         return lrc_alloc(ce, engine);
>>>>>>>>>     }
>>>>>>>>> +static bool guc_virtual_engine_has_heartbeat(const struct
>>>>>>>>> intel_engine_cs *ve);
>>>>>>>>> +
>>>>>>>>>     static const struct intel_context_ops virtual_guc_context_ops = {
>>>>>>>>>         .alloc = guc_virtual_context_alloc,
>>>>>>>>> @@ -2183,6 +2185,8 @@ static const struct intel_context_ops
>>>>>>>>> virtual_guc_context_ops = {
>>>>>>>>>         .enter = guc_virtual_context_enter,
>>>>>>>>>         .exit = guc_virtual_context_exit,
>>>>>>>>> +    .has_heartbeat = guc_virtual_engine_has_heartbeat,
>>>>>>>>> +
>>>>>>>>>         .sched_disable = guc_context_sched_disable,
>>>>>>>>>         .destroy = guc_context_destroy,
>>>>>>>>> @@ -3029,7 +3033,7 @@ guc_create_virtual(struct intel_engine_cs
>>>>>>>>> **siblings, unsigned int count)
>>>>>>>>>         return ERR_PTR(err);
>>>>>>>>>     }
>>>>>>>>> -bool intel_guc_virtual_engine_has_heartbeat(const struct
>>>>>>>>> intel_engine_cs *ve)
>>>>>>>>> +static bool guc_virtual_engine_has_heartbeat(const struct
>>>>>>>>> intel_engine_cs *ve)
>>>>>>>>>     {
>>>>>>>>>         struct intel_engine_cs *engine;
>>>>>>>>>         intel_engine_mask_t tmp, mask = ve->mask;
>>>>>>>>> diff --git a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h
>>>>>>>>> b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h
>>>>>>>>> index c7ef44fa0c36..c2afc3b88fd8 100644
>>>>>>>>> --- a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h
>>>>>>>>> +++ b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h
>>>>>>>>> @@ -29,8 +29,6 @@ void intel_guc_dump_active_requests(struct
>>>>>>>>> intel_engine_cs *engine,
>>>>>>>>>                         struct i915_request *hung_rq,
>>>>>>>>>                         struct drm_printer *m);
>>>>>>>>> -bool intel_guc_virtual_engine_has_heartbeat(const struct
>>>>>>>>> intel_engine_cs *ve);
>>>>>>>>> -
>>>>>>>>>     int intel_guc_wait_for_pending_msg(struct intel_guc *guc,
>>>>>>>>>                        atomic_t *wait_var,
>>>>>>>>>                        bool interruptible,
>>>>>>>> _______________________________________________
>>>>>>>> Intel-gfx mailing list
>>>>>>>> Intel-gfx@lists.freedesktop.org
>>>>>>>> https://lists.freedesktop.org/mailman/listinfo/intel-gfx


^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: [Intel-gfx] [PATCH 1/1] drm/i915: Check if engine has heartbeat when closing a context
  2021-08-18  0:28                   ` John Harrison
@ 2021-08-18  9:26                     ` Daniel Vetter
  0 siblings, 0 replies; 16+ messages in thread
From: Daniel Vetter @ 2021-08-18  9:26 UTC (permalink / raw)
  To: John Harrison; +Cc: Tvrtko Ursulin, Matthew Brost, intel-gfx, dri-devel

On Wed, Aug 18, 2021 at 2:28 AM John Harrison <john.c.harrison@intel.com> wrote:
> On 8/9/2021 23:36, Daniel Vetter wrote:
> > On Mon, Aug 09, 2021 at 04:12:52PM -0700, John Harrison wrote:
> >> On 8/6/2021 12:46, Daniel Vetter wrote:
> >>> Seen this fly by and figured I dropped a few thoughts in here. At the
> >>> likely cost of looking a bit out of whack :-)
> >>>
> >>> On Fri, Aug 6, 2021 at 8:01 PM John Harrison <john.c.harrison@intel.com> wrote:
> >>>> On 8/2/2021 02:40, Tvrtko Ursulin wrote:
> >>>>> On 30/07/2021 19:13, John Harrison wrote:
> >>>>>> On 7/30/2021 02:49, Tvrtko Ursulin wrote:
> >>>>>>> On 30/07/2021 01:13, John Harrison wrote:
> >>>>>>>> On 7/28/2021 17:34, Matthew Brost wrote:
> >>>>>>>>> If an engine associated with a context does not have a heartbeat,
> >>>>>>>>> ban it
> >>>>>>>>> immediately. This is needed for GuC submission as a idle pulse
> >>>>>>>>> doesn't
> >>>>>>>>> kick the context off the hardware where it then can check for a
> >>>>>>>>> heartbeat and ban the context.
> >>>>>>> Pulse, that is a request with I915_PRIORITY_BARRIER, does not
> >>>>>>> preempt a running normal priority context?
> >>>>>>>
> >>>>>>> Why does it matter then whether or not heartbeats are enabled - when
> >>>>>>> heartbeat just ends up sending the same engine pulse (eventually,
> >>>>>>> with raising priority)?
> >>>>>> The point is that the pulse is pointless. See the rest of my comments
> >>>>>> below, specifically "the context will get resubmitted to the hardware
> >>>>>> after the pulse completes". To re-iterate...
> >>>>>>
> >>>>>> Yes, it preempts the context. Yes, it does so whether heartbeats are
> >>>>>> enabled or not. But so what? Who cares? You have preempted a context.
> >>>>>> It is no longer running on the hardware. BUT IT IS STILL A VALID
> >>>>>> CONTEXT.
> >>>>> It is valid yes, and it even may be the current ABI so another
> >>>>> question is whether it is okay to change that.
> >>>>>
> >>>>>> The backend scheduler will just resubmit it to the hardware as soon
> >>>>>> as the pulse completes. The only reason this works at all is because
> >>>>>> of the horrid hack in the execlist scheduler's back end
> >>>>>> implementation (in __execlists_schedule_in):
> >>>>>>            if (unlikely(intel_context_is_closed(ce) &&
> >>>>>>                         !intel_engine_has_heartbeat(engine)))
> >>>>>>                    intel_context_set_banned(ce);
> >>>>> Right, is the above code then needed with this patch - when ban is
> >>>>> immediately applied on the higher level?
> >>>>>
> >>>>>> The actual back end scheduler is saying "Is this a zombie context? Is
> >>>>>> the heartbeat disabled? Then ban it". No other scheduler backend is
> >>>>>> going to have knowledge of zombie context status or of the heartbeat
> >>>>>> status. Nor are they going to call back into the higher levels of the
> >>>>>> i915 driver to trigger a ban operation. Certainly a hardware
> >>>>>> implemented scheduler is not going to be looking at private i915
> >>>>>> driver information to decide whether to submit a context or whether
> >>>>>> to tell the OS to kill it off instead.
> >>>>>>
> >>>>>> For persistence to work with a hardware scheduler (or a non-Intel
> >>>>>> specific scheduler such as the DRM one), the handling of zombie
> >>>>>> contexts, banning, etc. *must* be done entirely in the front end. It
> >>>>>> cannot rely on any backend hacks. That means you can't rely on any
> >>>>>> fancy behaviour of pulses.
> >>>>>>
> >>>>>> If you want to ban a context then you must explicitly ban that
> >>>>>> context. If you want to ban it at some later point then you need to
> >>>>>> track it at the top level as a zombie and then explicitly ban that
> >>>>>> zombie at whatever later point.
> >>>>> I am still trying to understand it all. If I go by the commit message:
> >>>>>
> >>>>> """
> >>>>> This is needed for GuC submission as a idle pulse doesn't
> >>>>> kick the context off the hardware where it then can check for a
> >>>>> heartbeat and ban the context.
> >>>>> """
> >>>>>
> >>>>> That did not explain things for me. Sentence does not appear to make
> >>>>> sense. Now, it seems "kick off the hardware" is meant as revoke and
> >>>>> not just preempt. Which is fine, perhaps just needs to be written more
> >>>>> explicitly. But the part of checking for heartbeat after idle pulse
> >>>>> does not compute for me. It is the heartbeat which emits idle pulses,
> >>>>> not idle pulse emitting heartbeats.
> >>>> I am in agreement that the commit message is confusing and does not
> >>>> explain either the problem or the solution.
> >>>>
> >>>>
> >>>>> But anyway, I can buy the handling at the front end story completely.
> >>>>> It makes sense. We just need to agree that a) it is okay to change the
> >>>>> ABI and b) remove the backend check from execlists if it is not needed
> >>>>> any longer.
> >>>>>
> >>>>> And if ABI change is okay then commit message needs to talk about it
> >>>>> loudly and clearly.
> >>>> I don't think we have a choice. The current ABI is not and cannot ever
> >>>> be compatible with any scheduler external to i915. It cannot be
> >>>> implemented with a hardware scheduler such as the GuC and it cannot be
> >>>> implemented with an external software scheduler such as the DRM one.
> >>> So generally on linux we implement helper libraries, which means
> >>> massive flexibility everywhere.
> >>>
> >>> https://blog.ffwll.ch/2016/12/midlayers-once-more-with-feeling.html
> >>>
> >>> So it shouldn't be an insurmountable problem to make this happen even
> >>> with drm/scheduler, we can patch it up.
> >>>
> >>> Whether that's justified is another question.
> >> Helper libraries won't work with a hardware scheduler.
> > Hm I guess I misunderstood then what exactly the hold-up is. This entire
> > discussion feels at least a bit like "heartbeat is unchangeable and guc
> > must fit", which is pretty much the midlayer mistake. We need to figure
> > out an implementation that works with GuC of the goals of the uapi,
> > instead of assuming that the current heartbeat is the only possible way to
> > achieve that.
> >
> > Or I'm just very confused about what the problem is.
>
> What I mean is that you can't add helper callback hook things into a
> hardware scheduler. It's hardware. It does what it does. Sure, the GuC
> is firmware but it is very limited in what it can do. It certainly can't
> peek into internal KMD state such as the heartbeat. Nor can it call back
> to i915 to execute code every time it wants to make a scheduling
> decision. That would be defeating the whole point of it being a CPU
> offload accelerator thing.
>
> Also, what I'm arguing is that the heartbeat should not be involved in
> the management of persistent contexts in the first place. It is way over
> complicated, unnecessary and not intuitive to an end user in the slightest.

Yeah so heartbeat was also the attempt to support long-running compute
jobs without changing the uapi. That part is reverted, and now it's
essentially just a tool to make sure the gpu keeps preempting when we
expect it too.

Which also I guess should be GuC's job now, so why do we need
heartbeat even still with the guc backend? This is the part where I
meant we're looking at this way too strictly, you most definitely
_can_ change anything in the i915 kmd and igt test suite that doesn't
fit. We're maybe saying the same thing really, dunno.

Orthogonal issue, the current code trying to support changing
heartbeat status while the driver is running is also bonkers, we don't
support that. That should simplify at least the decision making a lot,
becuse we can safely assume that a persistent or non-persistent
context was only created when we thought it's ok to do so.

> >>>> My view is that any implementation involving knowledge of the heartbeat
> >>>> is fundamentally broken.
> >>>>
> >>>> According to Daniel Vetter, the DRM ABI on this subject is that an
> >>>> actively executing context should persist until the DRM file handle is
> >>>> closed. That seems like a much more plausible and simple ABI than one
> >>> DRM ABI is maybe a bit an overkill statement. It's more "what other
> >>> drivers do", but it's generally a good idea to not ignore that :-)
> >>>
> >>>> that says 'if the heartbeat is running then a context will persist
> >>>> forever, if the heartbeat is not running then it will be killed
> >>>> immediately, if the heart was running but then stops running then the
> >>>> context will be killed on the next context switch, ...'. And if I
> >>>> understand it correctly, the current ABI allows a badly written user app
> >>>> to cause a denial of service by leaving contexts permanently running an
> >>>> infinite loop on the hardware even after the app has been killed! How
> >>>> can that ever be considered a good idea?
> >>> We're not going to support changing all these settings at runtime.
> >>> There's just not point in trying to make that work race-free, it
> >>> either adds complexity to the code for no reason, or it adds overhead
> >>> to the code for no reason.
> >>>
> >>> Yes I know existing customers and all that, but
> >>> - they can change this stuff, and when they change it while anyting is
> >>> in-flight they get to keep the pieces. These options taint the kernel
> >>> for a reason (and if they don't, that should be fixed)
> >>> - quite a few around heartbeat and compute support as we've merged a
> >>> while ago hang by design when trying to smash them into drm rules.
> >>> We're not going to fix that, and we should not use any existing such
> >>> assumptions as justification for code changes.
> >>>
> >>> Wrt infinitely running: Right now nothing is allowed to run forever,
> >>> because hangcheck will step in and kill that job. Once we add compute
> >>> mode ctx flag we'll require killing on process exit to stop escape.
> >> If the infinite loop is pre-emptible then the heartbeat won't kill it off.
> >> It will just run forever. Okay, it won't be a huge denial of service because
> >> other work can pre-empt and run. However, you are down one timeslice
> >> execution slot at that priority level. You have also permanently lost
> >> whatever memory is allocated and in use by that workload.
> > Ok I think I'm definitely lost.
> >
> > Right now, in upstream, you can't run forever without regularly calling
> > execbuf to stuff new work in. So it will die out, it wont be persistent
> > for very long.
> It is possible to write an infinite loop batch buffer that is
> pre-emptible. Once you set that running, no amount of heartbeats will
> kill it off. The heartbeat will happily pre-empt it and tell you that
> the system as a whole is still running just fine. And then the scheduler
> will set the infinite loop task running again because it still has more
> 'work' to do.

There is a hangcheck timeout which kills you after 20s (which is
probably about 15s too long, but that's another bikeshed). This is
part of the contract that we can't remove, but we did (I think it's
still not yet in DII, not sure about status) and took quite long to
restore that.

So no, your scenario doesn't happen.

> >>>> Therefore, the context close implementation should be to add an active
> >>>> context to a zombie list. If a context is in zombie state and its last
> >>>> request completes then the context can be immediately killed at that
> >>>> point. Otherwise, on DRM handle close, we go through the zombie list and
> >>>> immediately kill all contexts.
> >>>>
> >>>> Simple, clean, no back-end scheduler hacks, no reliance on heartbeats or
> >>>> pulses. Also no opportunity for rogue (or just badly written) user
> >>>> processes to leave zombie contexts running on the hardware forever and
> >>>> causing a denial of service attack. If the host process is killed, all
> >>>> of its GPU processes are also killed irrespective of what dodgy context
> >>>> flags they may or may not have set.
> >>> Uh, the intel_context state machine is already a bit too complex, and
> >>> the implementation lacks a bunch of barriers at least from the cursor
> >>> look I've given it thus far.
> >>>
> >>> So if we really need to make that more complex with more states then I
> >>> think someone needs to come up with an actual clean design, with
> >>> proper state transitions and all the barriers (or really, a design
> >>> which doesn't need barriers). This is going to be work.
> >>> -Daniel
> >> Personally, I would rather just drop the whole persistence/zombie idea
> >> completely. If you close your context then you should expect that context to
> >> be destroyed and any outstanding workloads killed off. If you wanted the
> >> results then you should have waited for them.
> >>
> >> If we do have to support some level of persistence then it doesn't seem like
> >> tracking closed contexts should be especially complex. Not sure why it would
> >> need special barriers either.
> > Frankly I think I'm lost, and I think the confusion (for me at least)
> > starts with what the current uapi is.
> >
> > Can someone please document that, with kerneldoc in the uapi header
> > ideally? Once we have that defined I think we can have an actual
> > discussion about what exactly this should look like with GuC (and also
> > eventually with drm/scheduler), and which parts of the uapi are just
> > artifacts of the current implementation, and which parts actually matter.
> >
> > Otherwise I think we're just spinning wheels a bit much here.
> > -Daniel
> See other branch of this thread - feel free to write it yourself or
> elect someone who actually knows the history/reasons behind this to
> write it up. All I can do is reverse engineer the code and document what
> it currently does and what is required to pass the IGT test.
>
> If you want documentation about what the interface *should* be then I
> can offer two options:
>
> 1. No persistence at all.
> If you close a context (whether explicitly through a close context call
> or implicitly through closing the DRM file handle, being killed, etc.)
> then that context is destroyed immediately. All outstanding work is
> discarded.
>
> 2. Persistence until DRM handle closure.
> You can close a context and have it keep running previously submitted
> work. However, as soon as the DRM file handle is closed (either
> explicitly or by being killed, etc.) then all contexts are immediately
> destroyed and all outstanding work is discarded.

This one is pretty close to what I think drm/sched does too. We might
need a slight change in that userspace which explicitly asked for
non-persistent context to kill those immediately in all cases.

> Simple. Concise. Sensible. No long discussions about what the heartbeat
> enable state was when the context was closed versus what that state is
> at some future point. No platform specific caveats or interactions. And
> no opportunity to cause denial of service attacks either deliberately or
> accidentally (and no opportunity for hideously complex KMD
> implementations to introduce potential DOS bugs either).

That's another thing: That implementation just needs to be simplified.
It supports a lot of things that make little to no sense, and
especially if soemthing is in the way we should just remove it.
-Daniel

>
> John.
>
>
> >
> >> John.
> >>
> >>>> John.
> >>>>
> >>>>
> >>>>> Or perhaps there is no ABI change? I am not really clear how does
> >>>>> setting banned status propagate to the GuC backend. I mean at which
> >>>>> point does i915 ends up passing that info to the firmware?
> >>>>>
> >>>>> Regards,
> >>>>>
> >>>>> Tvrtko
> >>>>>
> >>>>>>>> It's worse than this. If the engine in question is an individual
> >>>>>>>> physical engine then sending a pulse (with sufficiently high
> >>>>>>>> priority) will pre-empt the engine and kick the context off.
> >>>>>>>> However, the GuC
> >>>>>>> Why it is different for physical vs virtual, aren't both just
> >>>>>>> schedulable contexts with different engine masks for what GuC is
> >>>>>>> concerned? Oh, is it a matter of needing to send pulses to all
> >>>>>>> engines which comprise a virtual one?
> >>>>>> It isn't different. It is totally broken for both. It is potentially
> >>>>>> more broken for virtual engines because of the question of which
> >>>>>> engine to pulse. But as stated above, the pulse is pointless anyway
> >>>>>> so the which engine question doesn't even matter.
> >>>>>>
> >>>>>> John.
> >>>>>>
> >>>>>>
> >>>>>>>> scheduler does not have hacks in it to check the state of the
> >>>>>>>> heartbeat or whether a context is actually a zombie or not. Thus,
> >>>>>>>> the context will get resubmitted to the hardware after the pulse
> >>>>>>>> completes and effectively nothing will have happened.
> >>>>>>>>
> >>>>>>>> I would assume that the DRM scheduler which we are meant to be
> >>>>>>>> switching to for execlist as well as GuC submission is also
> >>>>>>>> unlikely to have hacks for zombie contexts and tests for whether
> >>>>>>>> the i915 specific heartbeat has been disabled since the context
> >>>>>>>> became a zombie. So when that switch happens, this test will also
> >>>>>>>> fail in execlist mode as well as GuC mode.
> >>>>>>>>
> >>>>>>>> The choices I see here are to simply remove persistence completely
> >>>>>>>> (it is a basically a bug that became UAPI because it wasn't caught
> >>>>>>>> soon enough!) or to implement it in a way that does not require
> >>>>>>>> hacks in the back end scheduler. Apparently, the DRM scheduler is
> >>>>>>>> expected to allow zombie contexts to persist until the DRM file
> >>>>>>>> handle is closed. So presumably we will have to go with option two.
> >>>>>>>>
> >>>>>>>> That means flagging a context as being a zombie when it is closed
> >>>>>>>> but still active. The driver would then add it to a zombie list
> >>>>>>>> owned by the DRM client object. When that client object is closed,
> >>>>>>>> i915 would go through the list and genuinely kill all the contexts.
> >>>>>>>> No back end scheduler hacks required and no intimate knowledge of
> >>>>>>>> the i915 heartbeat mechanism required either.
> >>>>>>>>
> >>>>>>>> John.
> >>>>>>>>
> >>>>>>>>
> >>>>>>>>> This patch also updates intel_engine_has_heartbeat to be a vfunc
> >>>>>>>>> as we
> >>>>>>>>> now need to call this function on execlists virtual engines too.
> >>>>>>>>>
> >>>>>>>>> Signed-off-by: Matthew Brost <matthew.brost@intel.com>
> >>>>>>>>> ---
> >>>>>>>>>     drivers/gpu/drm/i915/gem/i915_gem_context.c   |  5 +++--
> >>>>>>>>>     drivers/gpu/drm/i915/gt/intel_context_types.h |  2 ++
> >>>>>>>>>     drivers/gpu/drm/i915/gt/intel_engine.h        | 21
> >>>>>>>>> ++-----------------
> >>>>>>>>>     .../drm/i915/gt/intel_execlists_submission.c  | 14 +++++++++++++
> >>>>>>>>>     .../gpu/drm/i915/gt/uc/intel_guc_submission.c |  6 +++++-
> >>>>>>>>>     .../gpu/drm/i915/gt/uc/intel_guc_submission.h |  2 --
> >>>>>>>>>     6 files changed, 26 insertions(+), 24 deletions(-)
> >>>>>>>>>
> >>>>>>>>> diff --git a/drivers/gpu/drm/i915/gem/i915_gem_context.c
> >>>>>>>>> b/drivers/gpu/drm/i915/gem/i915_gem_context.c
> >>>>>>>>> index 9c3672bac0e2..b8e01c5ba9e5 100644
> >>>>>>>>> --- a/drivers/gpu/drm/i915/gem/i915_gem_context.c
> >>>>>>>>> +++ b/drivers/gpu/drm/i915/gem/i915_gem_context.c
> >>>>>>>>> @@ -1090,8 +1090,9 @@ static void kill_engines(struct
> >>>>>>>>> i915_gem_engines *engines, bool ban)
> >>>>>>>>>          */
> >>>>>>>>>         for_each_gem_engine(ce, engines, it) {
> >>>>>>>>>             struct intel_engine_cs *engine;
> >>>>>>>>> +        bool local_ban = ban ||
> >>>>>>>>> !intel_engine_has_heartbeat(ce->engine);
> >>>>>>> In any case (pending me understanding what's really going on there),
> >>>>>>> why would this check not be in kill_context with currently does this:
> >>>>>>>
> >>>>>>>       bool ban = (!i915_gem_context_is_persistent(ctx) ||
> >>>>>>>               !ctx->i915->params.enable_hangcheck);
> >>>>>>> ...
> >>>>>>>           kill_engines(pos, ban);
> >>>>>>>
> >>>>>>> So whether to ban decision would be consolidated to one place.
> >>>>>>>
> >>>>>>> In fact, decision on whether to allow persistent is tied to
> >>>>>>> enable_hangcheck, which also drives hearbeat emission. So perhaps
> >>>>>>> one part of the correct fix is to extend the above (kill_context)
> >>>>>>> ban criteria to include hearbeat values anyway. Otherwise isn't it a
> >>>>>>> simple miss that this check fails to account to hearbeat disablement
> >>>>>>> via sysfs?
> >>>>>>>
> >>>>>>> Regards,
> >>>>>>>
> >>>>>>> Tvrtko
> >>>>>>>
> >>>>>>>>> -        if (ban && intel_context_ban(ce, NULL))
> >>>>>>>>> +        if (local_ban && intel_context_ban(ce, NULL))
> >>>>>>>>>                 continue;
> >>>>>>>>>             /*
> >>>>>>>>> @@ -1104,7 +1105,7 @@ static void kill_engines(struct
> >>>>>>>>> i915_gem_engines *engines, bool ban)
> >>>>>>>>>             engine = active_engine(ce);
> >>>>>>>>>             /* First attempt to gracefully cancel the context */
> >>>>>>>>> -        if (engine && !__cancel_engine(engine) && ban)
> >>>>>>>>> +        if (engine && !__cancel_engine(engine) && local_ban)
> >>>>>>>>>                 /*
> >>>>>>>>>                  * If we are unable to send a preemptive pulse to bump
> >>>>>>>>>                  * the context from the GPU, we have to resort to a
> >>>>>>>>> full
> >>>>>>>>> diff --git a/drivers/gpu/drm/i915/gt/intel_context_types.h
> >>>>>>>>> b/drivers/gpu/drm/i915/gt/intel_context_types.h
> >>>>>>>>> index e54351a170e2..65f2eb2a78e4 100644
> >>>>>>>>> --- a/drivers/gpu/drm/i915/gt/intel_context_types.h
> >>>>>>>>> +++ b/drivers/gpu/drm/i915/gt/intel_context_types.h
> >>>>>>>>> @@ -55,6 +55,8 @@ struct intel_context_ops {
> >>>>>>>>>         void (*reset)(struct intel_context *ce);
> >>>>>>>>>         void (*destroy)(struct kref *kref);
> >>>>>>>>> +    bool (*has_heartbeat)(const struct intel_engine_cs *engine);
> >>>>>>>>> +
> >>>>>>>>>         /* virtual engine/context interface */
> >>>>>>>>>         struct intel_context *(*create_virtual)(struct
> >>>>>>>>> intel_engine_cs **engine,
> >>>>>>>>>                             unsigned int count);
> >>>>>>>>> diff --git a/drivers/gpu/drm/i915/gt/intel_engine.h
> >>>>>>>>> b/drivers/gpu/drm/i915/gt/intel_engine.h
> >>>>>>>>> index c2a5640ae055..1b11a808acc4 100644
> >>>>>>>>> --- a/drivers/gpu/drm/i915/gt/intel_engine.h
> >>>>>>>>> +++ b/drivers/gpu/drm/i915/gt/intel_engine.h
> >>>>>>>>> @@ -283,28 +283,11 @@ struct intel_context *
> >>>>>>>>>     intel_engine_create_virtual(struct intel_engine_cs **siblings,
> >>>>>>>>>                     unsigned int count);
> >>>>>>>>> -static inline bool
> >>>>>>>>> -intel_virtual_engine_has_heartbeat(const struct intel_engine_cs
> >>>>>>>>> *engine)
> >>>>>>>>> -{
> >>>>>>>>> -    /*
> >>>>>>>>> -     * For non-GuC submission we expect the back-end to look at the
> >>>>>>>>> -     * heartbeat status of the actual physical engine that the work
> >>>>>>>>> -     * has been (or is being) scheduled on, so we should only reach
> >>>>>>>>> -     * here with GuC submission enabled.
> >>>>>>>>> -     */
> >>>>>>>>> -    GEM_BUG_ON(!intel_engine_uses_guc(engine));
> >>>>>>>>> -
> >>>>>>>>> -    return intel_guc_virtual_engine_has_heartbeat(engine);
> >>>>>>>>> -}
> >>>>>>>>> -
> >>>>>>>>>     static inline bool
> >>>>>>>>>     intel_engine_has_heartbeat(const struct intel_engine_cs *engine)
> >>>>>>>>>     {
> >>>>>>>>> -    if (!IS_ACTIVE(CONFIG_DRM_I915_HEARTBEAT_INTERVAL))
> >>>>>>>>> -        return false;
> >>>>>>>>> -
> >>>>>>>>> -    if (intel_engine_is_virtual(engine))
> >>>>>>>>> -        return intel_virtual_engine_has_heartbeat(engine);
> >>>>>>>>> +    if (engine->cops->has_heartbeat)
> >>>>>>>>> +        return engine->cops->has_heartbeat(engine);
> >>>>>>>>>         else
> >>>>>>>>>             return READ_ONCE(engine->props.heartbeat_interval_ms);
> >>>>>>>>>     }
> >>>>>>>>> diff --git a/drivers/gpu/drm/i915/gt/intel_execlists_submission.c
> >>>>>>>>> b/drivers/gpu/drm/i915/gt/intel_execlists_submission.c
> >>>>>>>>> index de5f9c86b9a4..18005b5546b6 100644
> >>>>>>>>> --- a/drivers/gpu/drm/i915/gt/intel_execlists_submission.c
> >>>>>>>>> +++ b/drivers/gpu/drm/i915/gt/intel_execlists_submission.c
> >>>>>>>>> @@ -3619,6 +3619,18 @@ virtual_get_sibling(struct intel_engine_cs
> >>>>>>>>> *engine, unsigned int sibling)
> >>>>>>>>>         return ve->siblings[sibling];
> >>>>>>>>>     }
> >>>>>>>>> +static bool virtual_engine_has_heartbeat(const struct
> >>>>>>>>> intel_engine_cs *ve)
> >>>>>>>>> +{
> >>>>>>>>> +    struct intel_engine_cs *engine;
> >>>>>>>>> +    intel_engine_mask_t tmp, mask = ve->mask;
> >>>>>>>>> +
> >>>>>>>>> +    for_each_engine_masked(engine, ve->gt, mask, tmp)
> >>>>>>>>> +        if (READ_ONCE(engine->props.heartbeat_interval_ms))
> >>>>>>>>> +            return true;
> >>>>>>>>> +
> >>>>>>>>> +    return false;
> >>>>>>>>> +}
> >>>>>>>>> +
> >>>>>>>>>     static const struct intel_context_ops virtual_context_ops = {
> >>>>>>>>>         .flags = COPS_HAS_INFLIGHT,
> >>>>>>>>> @@ -3634,6 +3646,8 @@ static const struct intel_context_ops
> >>>>>>>>> virtual_context_ops = {
> >>>>>>>>>         .enter = virtual_context_enter,
> >>>>>>>>>         .exit = virtual_context_exit,
> >>>>>>>>> +    .has_heartbeat = virtual_engine_has_heartbeat,
> >>>>>>>>> +
> >>>>>>>>>         .destroy = virtual_context_destroy,
> >>>>>>>>>         .get_sibling = virtual_get_sibling,
> >>>>>>>>> diff --git a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c
> >>>>>>>>> b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c
> >>>>>>>>> index 89ff0e4b4bc7..ae70bff3605f 100644
> >>>>>>>>> --- a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c
> >>>>>>>>> +++ b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c
> >>>>>>>>> @@ -2168,6 +2168,8 @@ static int guc_virtual_context_alloc(struct
> >>>>>>>>> intel_context *ce)
> >>>>>>>>>         return lrc_alloc(ce, engine);
> >>>>>>>>>     }
> >>>>>>>>> +static bool guc_virtual_engine_has_heartbeat(const struct
> >>>>>>>>> intel_engine_cs *ve);
> >>>>>>>>> +
> >>>>>>>>>     static const struct intel_context_ops virtual_guc_context_ops = {
> >>>>>>>>>         .alloc = guc_virtual_context_alloc,
> >>>>>>>>> @@ -2183,6 +2185,8 @@ static const struct intel_context_ops
> >>>>>>>>> virtual_guc_context_ops = {
> >>>>>>>>>         .enter = guc_virtual_context_enter,
> >>>>>>>>>         .exit = guc_virtual_context_exit,
> >>>>>>>>> +    .has_heartbeat = guc_virtual_engine_has_heartbeat,
> >>>>>>>>> +
> >>>>>>>>>         .sched_disable = guc_context_sched_disable,
> >>>>>>>>>         .destroy = guc_context_destroy,
> >>>>>>>>> @@ -3029,7 +3033,7 @@ guc_create_virtual(struct intel_engine_cs
> >>>>>>>>> **siblings, unsigned int count)
> >>>>>>>>>         return ERR_PTR(err);
> >>>>>>>>>     }
> >>>>>>>>> -bool intel_guc_virtual_engine_has_heartbeat(const struct
> >>>>>>>>> intel_engine_cs *ve)
> >>>>>>>>> +static bool guc_virtual_engine_has_heartbeat(const struct
> >>>>>>>>> intel_engine_cs *ve)
> >>>>>>>>>     {
> >>>>>>>>>         struct intel_engine_cs *engine;
> >>>>>>>>>         intel_engine_mask_t tmp, mask = ve->mask;
> >>>>>>>>> diff --git a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h
> >>>>>>>>> b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h
> >>>>>>>>> index c7ef44fa0c36..c2afc3b88fd8 100644
> >>>>>>>>> --- a/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h
> >>>>>>>>> +++ b/drivers/gpu/drm/i915/gt/uc/intel_guc_submission.h
> >>>>>>>>> @@ -29,8 +29,6 @@ void intel_guc_dump_active_requests(struct
> >>>>>>>>> intel_engine_cs *engine,
> >>>>>>>>>                         struct i915_request *hung_rq,
> >>>>>>>>>                         struct drm_printer *m);
> >>>>>>>>> -bool intel_guc_virtual_engine_has_heartbeat(const struct
> >>>>>>>>> intel_engine_cs *ve);
> >>>>>>>>> -
> >>>>>>>>>     int intel_guc_wait_for_pending_msg(struct intel_guc *guc,
> >>>>>>>>>                        atomic_t *wait_var,
> >>>>>>>>>                        bool interruptible,
> >>>>>>>> _______________________________________________
> >>>>>>>> Intel-gfx mailing list
> >>>>>>>> Intel-gfx@lists.freedesktop.org
> >>>>>>>> https://lists.freedesktop.org/mailman/listinfo/intel-gfx
>


-- 
Daniel Vetter
Software Engineer, Intel Corporation
http://blog.ffwll.ch

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: [Intel-gfx] [PATCH 0/1] Fix gem_ctx_persistence failures with GuC submission
  2021-08-18  0:08   ` [Intel-gfx] " John Harrison
@ 2021-08-18  9:49     ` Daniel Vetter
  0 siblings, 0 replies; 16+ messages in thread
From: Daniel Vetter @ 2021-08-18  9:49 UTC (permalink / raw)
  To: John Harrison; +Cc: Daniel Vetter, Matthew Brost, intel-gfx, dri-devel

On Tue, Aug 17, 2021 at 05:08:02PM -0700, John Harrison wrote:
> On 8/9/2021 23:38, Daniel Vetter wrote:
> > On Wed, Jul 28, 2021 at 05:33:59PM -0700, Matthew Brost wrote:
> > > Should fix below failures with GuC submission for the following tests:
> > > gem_exec_balancer --r noheartbeat
> > > gem_ctx_persistence --r heartbeat-close
> > > 
> > > Not going to fix:
> > > gem_ctx_persistence --r heartbeat-many
> > > gem_ctx_persistence --r heartbeat-stop
> > After looking at that big thread and being very confused: Are we fixing an
> > actual use-case here, or is this another case of blindly following igts
> > tests just because they exist?
> My understanding is that this is established behaviour and therefore must be
> maintained because the UAPI (whether documented or not) is inviolate.
> Therefore IGTs have been written to validate this past behaviour and now we
> must conform to the IGTs in order to keep the existing behaviour unchanged.

No, we do not need to blindly conform to igts. We've found enough examples
in the past few months where the igt tests where just testing stuff
because it's possible, not because any UMD actually needs the behaviour.

And drm subsystem rules are very clear that low-level tests do _not_
qualify as userspace, so if they're wrong we just have to fix them.

> Whether anybody actually makes use of this behaviour or not is another
> matter entirely. I am certainly not aware of any vital use case. Others
> might have more recollection. I do know that we tell the UMD teams to
> explicitly disable persistence on every context they create.

Does that include mesa?

> > I'm leaning towards that we should stall on this, and first document what
> > exactly is the actual intention behind all this, and then fix up the tests
> I'm not sure there ever was an 'intention'. The rumour I heard way back when
> was that persistence was a bug on earlier platforms (or possibly we didn't
> have hardware support for doing engine resets?). But once the bug was
> realised (or the hardware support was added), it was too late to change the
> default behaviour because existing kernel behaviour must never change on
> pain of painful things. Thus the persistence flag was added so that people
> could opt out of the broken, leaky behaviour and have their contexts clean
> up properly.
> 
> Feel free to document what you believe should be the behaviour from a
> software architect point of view. Any documentation I produce is basically
> going to be created by reverse engineering the existing code. That is the
> only 'spec' that I am aware of and as I keep saying, I personally think it
> is a totally broken concept that should just be removed.

There is most likely no spec except "what does current userspace actually
expect". Yes this sucks. Also if you expect me to do this, I'm backlogged
by a few months on random studies here, and largely this boils down to
checking all the umds and checking what they actually need.

Important: What igt does doesn't matter if there's not a corresponding
real world umd use-case.

> > to match (if needed). And only then fix up GuC to match whatever we
> > actually want to do.
> I also still maintain there is no 'fix up the GuC'. This is not behaviour we
> should be adding to a hardware scheduler. It is behaviour that should be
> implemented at the front end not the back end. If we absolutely need to do
> this then we need to do it solely at the context management level not at the
> back end submission level. And the solution should work by default on any
> submission back end.

With "Fix up GuC" I dont mean necessarily the guc fw, but our entire
backend. We can very much fix that to fix most anything reasonable.

Also we don't actually need the same solution on all backends, because the
uapi can have slight differences across platforms. That's why changing the
defaults is so hard once they're set in stone.
-Daniel

> 
> John.
> 
> 
> > -Daniel
> > 
> > > As the above tests change the heartbeat value to 0 (off) after the
> > > context is closed and we have no way to detect that with GuC submission
> > > unless we keep a list of closed but running contexts which seems like
> > > overkill for a non-real world use case. We likely should just skip these
> > > tests with GuC submission.
> > > 
> > > Signed-off-by: Matthew Brost <matthew.brost@intel.com>
> > > 
> > > Matthew Brost (1):
> > >    drm/i915: Check if engine has heartbeat when closing a context
> > > 
> > >   drivers/gpu/drm/i915/gem/i915_gem_context.c   |  5 +++--
> > >   drivers/gpu/drm/i915/gt/intel_context_types.h |  2 ++
> > >   drivers/gpu/drm/i915/gt/intel_engine.h        | 21 ++-----------------
> > >   .../drm/i915/gt/intel_execlists_submission.c  | 14 +++++++++++++
> > >   .../gpu/drm/i915/gt/uc/intel_guc_submission.c |  6 +++++-
> > >   .../gpu/drm/i915/gt/uc/intel_guc_submission.h |  2 --
> > >   6 files changed, 26 insertions(+), 24 deletions(-)
> > > 
> > > -- 
> > > 2.28.0
> > > 
> 

-- 
Daniel Vetter
Software Engineer, Intel Corporation
http://blog.ffwll.ch

^ permalink raw reply	[flat|nested] 16+ messages in thread

end of thread, other threads:[~2021-08-18  9:49 UTC | newest]

Thread overview: 16+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2021-07-29  0:33 [PATCH 0/1] Fix gem_ctx_persistence failures with GuC submission Matthew Brost
2021-07-29  0:34 ` [PATCH 1/1] drm/i915: Check if engine has heartbeat when closing a context Matthew Brost
2021-07-30  0:13   ` [Intel-gfx] " John Harrison
2021-07-30  9:49     ` Tvrtko Ursulin
2021-07-30 18:13       ` John Harrison
2021-08-02  9:40         ` Tvrtko Ursulin
2021-08-06 18:00           ` John Harrison
2021-08-06 19:46             ` Daniel Vetter
2021-08-09 23:12               ` John Harrison
2021-08-10  6:36                 ` Daniel Vetter
2021-08-18  0:28                   ` John Harrison
2021-08-18  9:26                     ` Daniel Vetter
2021-07-30 18:13       ` Matthew Brost
2021-08-10  6:38 ` [PATCH 0/1] Fix gem_ctx_persistence failures with GuC submission Daniel Vetter
2021-08-18  0:08   ` [Intel-gfx] " John Harrison
2021-08-18  9:49     ` Daniel Vetter

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).