dri-devel.lists.freedesktop.org archive mirror
* drm/i915 GPU hang
@ 2020-01-18 20:28 Piper Fowler-Wright
  2020-01-20 13:27 ` Chris Wilson
From: Piper Fowler-Wright @ 2020-01-18 20:28 UTC (permalink / raw)
  To: dri-devel

I have recently (since the New Year) been experiencing regular GPU hangs
which typically render the system unusable. 

During the hangs, the kernel log buffer fills with messages of the form

[ 8269.599926] [drm:gen8_reset_engines [i915]] *ERROR* rcs0 reset request timed out: {request: 00000001, RESET_CTL: 00000001}
[ 8269.600022] i915 0000:00:02.0: Resetting chip for hang on rcs0
[ 8269.601827] [drm:gen8_reset_engines [i915]] *ERROR* rcs0 reset request timed out: {request: 00000001, RESET_CTL: 00000001}
[ 8269.602595] [drm:gen8_reset_engines [i915]] *ERROR* rcs0 reset request timed out: {request: 00000001, RESET_CTL: 00000001}
[ 8277.705805] i915 0000:00:02.0: Resetting rcs0 for hang on rcs0

etc.
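
For anyone triaging a similar log, the timeout lines above can be counted per engine with a short script. This is only a sketch: the regular expression is inferred from the sample lines quoted here, and the function name and sample constant are made up for illustration.

```python
import re
from collections import Counter

# Pattern inferred from the sample dmesg lines in this report; other kernel
# versions may word the message differently.
TIMEOUT_RE = re.compile(
    r"\[\s*(?P<ts>\d+\.\d+)\]\s+\[drm:gen8_reset_engines \[i915\]\] \*ERROR\* "
    r"(?P<engine>\w+) reset request timed out"
)


def count_reset_timeouts(log_text):
    """Return a Counter mapping engine name to the number of timed-out reset requests."""
    counts = Counter()
    for line in log_text.splitlines():
        match = TIMEOUT_RE.search(line)
        if match:
            counts[match.group("engine")] += 1
    return counts


# The five lines quoted in the report, used as sample input.
SAMPLE_LOG = """\
[ 8269.599926] [drm:gen8_reset_engines [i915]] *ERROR* rcs0 reset request timed out: {request: 00000001, RESET_CTL: 00000001}
[ 8269.600022] i915 0000:00:02.0: Resetting chip for hang on rcs0
[ 8269.601827] [drm:gen8_reset_engines [i915]] *ERROR* rcs0 reset request timed out: {request: 00000001, RESET_CTL: 00000001}
[ 8269.602595] [drm:gen8_reset_engines [i915]] *ERROR* rcs0 reset request timed out: {request: 00000001, RESET_CTL: 00000001}
[ 8277.705805] i915 0000:00:02.0: Resetting rcs0 for hang on rcs0
"""

if __name__ == "__main__":
    for engine, n in count_reset_timeouts(SAMPLE_LOG).items():
        print(f"{engine}: {n} reset request timeouts")
```

In practice the input would be the saved output of dmesg or the journal rather than the embedded sample.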

Most recently the following message was displayed

[12796.753277] i915 0000:00:02.0: GPU HANG: ecode 9:1:0x00000000, hang on rcs0
[12796.753281] GPU hangs can indicate a bug anywhere in the entire gfx stack, including userspace.
[12796.753282] Please file a _new_ bug report on bugs.freedesktop.org against DRI -> DRM/Intel
[12796.753283] drm/i915 developers can then reassign to the right component if it's not a kernel issue.
[12796.753285] The GPU crash dump is required to analyze GPU hangs, so please always attach it.
[12796.753286] GPU crash dump saved to /sys/class/drm/card0/error
[12796.753304] i915 0000:00:02.0: Resetting rcs0 for hang on rcs0

Unfortunately, the /sys/class/drm/card0/error file contained only "No error
state collected". 

bugs.freedesktop.org is no longer in operation, so I decided to post here. Please
redirect me to the correct list if this one is not appropriate.

My system info is:
CPU/GPU: Intel i5-8250U/Intel UHD Graphics 620
Kernel: 5.4.12-arch1-1
Video Driver: xf86-video-intel 1:2.99.917+899+gf66d3954-1 (mesa 19.3.2-2)

Let me know if any other information may be helpful.

_______________________________________________
dri-devel mailing list
dri-devel@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/dri-devel


* Re: drm/i915 GPU hang
  2020-01-18 20:28 drm/i915 GPU hang Piper Fowler-Wright
@ 2020-01-20 13:27 ` Chris Wilson
From: Chris Wilson @ 2020-01-20 13:27 UTC (permalink / raw)
  To: Piper Fowler-Wright, dri-devel

Quoting Piper Fowler-Wright (2020-01-18 20:28:42)
> I have recently (since the New Year) been experiencing regular GPU hangs
> which typically render the system unusable. 
> 
> During the hangs the kernel buffer is filled with messages of the form
> 
> [ 8269.599926] [drm:gen8_reset_engines [i915]] *ERROR* rcs0 reset request timed out: {request: 00000001, RESET_CTL: 00000001}
> [ 8269.600022] i915 0000:00:02.0: Resetting chip for hang on rcs0
> [ 8269.601827] [drm:gen8_reset_engines [i915]] *ERROR* rcs0 reset request timed out: {request: 00000001, RESET_CTL: 00000001}
> [ 8269.602595] [drm:gen8_reset_engines [i915]] *ERROR* rcs0 reset request timed out: {request: 00000001, RESET_CTL: 00000001}
> [ 8277.705805] i915 0000:00:02.0: Resetting rcs0 for hang on rcs0
> 

Sadly it is known and the backport of the fix seems to have slipped
through the stable@ cracks.

It should be fixed in 5.5, which is in -rc7 already so should be usable.
On the other hand, if the problem reoccurs, we need to use drm-tip as a
known baseline for patching anyway.
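
As a rough illustration of the version cut-off mentioned above, a kernel release string can be compared against 5.5 as follows. The helper names are hypothetical, and the check looks only at the major.minor number: it cannot tell whether a particular stable or distribution kernel carries a backport of the fix.

```python
import re


def kernel_version(release):
    """Parse the leading 'major.minor' of a release string such as '5.4.12-arch1-1'."""
    match = re.match(r"(\d+)\.(\d+)", release)
    if match is None:
        raise ValueError(f"unrecognised kernel release: {release!r}")
    return int(match.group(1)), int(match.group(2))


def at_least(release, minimum=(5, 5)):
    """True if the release is at or above the given (major, minor) version."""
    return kernel_version(release) >= minimum


if __name__ == "__main__":
    import platform

    release = platform.release()  # e.g. '5.4.12-arch1-1' on the reporter's system
    print(release, "is 5.5 or newer:", at_least(release))
```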

> etc.
> 
> Most recently the following message was displayed
> 
> [12796.753277] i915 0000:00:02.0: GPU HANG: ecode 9:1:0x00000000, hang on rcs0
> [12796.753281] GPU hangs can indicate a bug anywhere in the entire gfx stack, including userspace.
> [12796.753282] Please file a _new_ bug report on bugs.freedesktop.org against DRI -> DRM/Intel
> [12796.753283] drm/i915 developers can then reassign to the right component if it's not a kernel issue.
> [12796.753285] The GPU crash dump is required to analyze GPU hangs, so please always attach it.
> [12796.753286] GPU crash dump saved to /sys/class/drm/card0/error
> [12796.753304] i915 0000:00:02.0: Resetting rcs0 for hang on rcs0
> 
> Unfortunately, the /sys/class/drm/card0/error file contained only "No error
> state collected". 

It's only valid until the next reboot (since it's only kept in memory).
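
Since the error state lives only in memory, it has to be copied somewhere persistent before rebooting. A minimal sketch of that, assuming the sysfs path from the log above and treating the "No error state collected" text as the empty marker (the function name and destination file are illustrative, and reading the file usually needs root):

```python
from pathlib import Path

# Path taken from the kernel message quoted in this thread.
ERROR_STATE = "/sys/class/drm/card0/error"
# Text the file contained when no dump was available, per the report.
EMPTY_MARKER = "No error state collected"


def save_error_state(src=ERROR_STATE, dest="i915-error-state.txt"):
    """Copy the GPU error state to dest; return True if a dump was saved."""
    data = Path(src).read_text(errors="replace")
    stripped = data.strip()
    if not stripped or stripped.startswith(EMPTY_MARKER):
        return False  # nothing captured, leave dest untouched
    Path(dest).write_text(data)
    return True


if __name__ == "__main__":
    if save_error_state():
        print("error state saved")
    else:
        print("no error state available")
```

Running this right after a hang, before any reboot, keeps a copy that can be attached to a bug report.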
 
> bugs.freedesktop.org is no longer in operation, so I decided to post here. Please
> redirect me to the correct list if this one is not appropriate.

FYI, the bug tracker is at gitlab.freedesktop.org/drm/intel now.
-Chris
