linux-kernel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* GPU hangs and X shot down with 4.11-rc6
@ 2017-04-25 16:41 Michal Hocko
  2017-04-25 20:03 ` [Intel-gfx] " Chris Wilson
  0 siblings, 1 reply; 3+ messages in thread
From: Michal Hocko @ 2017-04-25 16:41 UTC (permalink / raw)
  To: Daniel Vetter, Jani Nikula, David Airlie
  Cc: intel-gfx, dri-devel, linux-kernel

[-- Attachment #1: Type: text/plain, Size: 1946 bytes --]

Hi,
I have just experienced X being shut down once with 4.11-rc2 and 2 times
with 4.11-rc6 kernel.  I do not remember seeing something like this
before but it is quite possible I was just lucky to not trigger this
issue before. It always happened while I was working on a presentation
in LibreOffice which I do very seldom. The kernel log contains:

[ 7456.721893] [drm] GPU HANG: ecode 9:0:0x86dffffd, in Xorg [3594], reason: Hang on render ring, action: reset
[ 7456.721897] [drm] GPU hangs can indicate a bug anywhere in the entire gfx stack, including userspace.
[ 7456.721898] [drm] Please file a _new_ bug report on bugs.freedesktop.org against DRI -> DRM/Intel
[ 7456.721900] [drm] drm/i915 developers can then reassign to the right component if it's not a kernel issue.
[ 7456.721901] [drm] The gpu crash dump is required to analyze gpu hangs, so please always attach it.
[ 7456.721902] [drm] GPU crash dump saved to /sys/class/drm/card0/error
[ 7456.721925] drm/i915: Resetting chip after gpu hang
[ 7456.722117] [drm] RC6 on
[ 7456.734588] [drm] GuC firmware load skipped
[ 7464.686209] drm/i915: Resetting chip after gpu hang
[ 7464.686284] [drm] RC6 on
[ 7464.702469] [drm] GuC firmware load skipped
[ 7472.686180] drm/i915: Resetting chip after gpu hang
[ 7472.686241] [drm] RC6 on
[ 7472.704565] [drm] GuC firmware load skipped
[ 7480.686179] drm/i915: Resetting chip after gpu hang
[ 7480.686241] [drm] RC6 on
[ 7480.704583] [drm] GuC firmware load skipped
[ 7493.678130] drm/i915: Resetting chip after gpu hang
[ 7493.678206] [drm] RC6 on
[ 7493.696505] [drm] GuC firmware load skipped

The kernel message tells that the problem might be anywhere and I should
report to freedesktop but I haven't changed the userspace recently so it
smells more like a kernel bug to me. Does this ring bells? The GPU crash
dump is attached in case it is useful.

Let me know if you need additional information.

Thanks!
-- 
Michal Hocko
SUSE Labs

[-- Attachment #2: gpu_dump.gz --]
[-- Type: application/gzip, Size: 14952 bytes --]

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: [Intel-gfx] GPU hangs and X shot down with 4.11-rc6
  2017-04-25 16:41 GPU hangs and X shot down with 4.11-rc6 Michal Hocko
@ 2017-04-25 20:03 ` Chris Wilson
  2017-04-26  9:28   ` Michal Hocko
  0 siblings, 1 reply; 3+ messages in thread
From: Chris Wilson @ 2017-04-25 20:03 UTC (permalink / raw)
  To: Michal Hocko
  Cc: Daniel Vetter, Jani Nikula, David Airlie, intel-gfx,
	linux-kernel, dri-devel

On Tue, Apr 25, 2017 at 06:41:20PM +0200, Michal Hocko wrote:
> Hi,
> I have just experienced X being shut down once with 4.11-rc2 and 2 times
> with 4.11-rc6 kernel.  I do not remember seeing something like this
> before but it is quite possible I was just lucky to not trigger this
> issue before. It always happened while I was working on a presentation
> in LibreOffice which I do very seldom. The kernel log contains:
> 
> [ 7456.721893] [drm] GPU HANG: ecode 9:0:0x86dffffd, in Xorg [3594], reason: Hang on render ring, action: reset
> [ 7456.721897] [drm] GPU hangs can indicate a bug anywhere in the entire gfx stack, including userspace.
> [ 7456.721898] [drm] Please file a _new_ bug report on bugs.freedesktop.org against DRI -> DRM/Intel
> [ 7456.721900] [drm] drm/i915 developers can then reassign to the right component if it's not a kernel issue.
> [ 7456.721901] [drm] The gpu crash dump is required to analyze gpu hangs, so please always attach it.
> [ 7456.721902] [drm] GPU crash dump saved to /sys/class/drm/card0/error
> [ 7456.721925] drm/i915: Resetting chip after gpu hang
> [ 7456.722117] [drm] RC6 on
> [ 7456.734588] [drm] GuC firmware load skipped
> [ 7464.686209] drm/i915: Resetting chip after gpu hang
> [ 7464.686284] [drm] RC6 on
> [ 7464.702469] [drm] GuC firmware load skipped
> [ 7472.686180] drm/i915: Resetting chip after gpu hang
> [ 7472.686241] [drm] RC6 on
> [ 7472.704565] [drm] GuC firmware load skipped
> [ 7480.686179] drm/i915: Resetting chip after gpu hang
> [ 7480.686241] [drm] RC6 on
> [ 7480.704583] [drm] GuC firmware load skipped
> [ 7493.678130] drm/i915: Resetting chip after gpu hang
> [ 7493.678206] [drm] RC6 on
> [ 7493.696505] [drm] GuC firmware load skipped
> 
> The kernel message tells that the problem might be anywhere and I should
> report to freedesktop but I haven't changed the userspace recently so it
> smells more like a kernel bug to me. Does this ring bells? The GPU crash
> dump is attached in case it is useful.

There are lots of very similar GPU hangs for mesa across a wide range of
kernels, with several reports noting a correlation with libreoffice.

At first glance, I would say you were just unlucky to hit it.
-Chris

-- 
Chris Wilson, Intel Open Source Technology Centre

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: [Intel-gfx] GPU hangs and X shot down with 4.11-rc6
  2017-04-25 20:03 ` [Intel-gfx] " Chris Wilson
@ 2017-04-26  9:28   ` Michal Hocko
  0 siblings, 0 replies; 3+ messages in thread
From: Michal Hocko @ 2017-04-26  9:28 UTC (permalink / raw)
  To: Chris Wilson
  Cc: Daniel Vetter, Jani Nikula, David Airlie, intel-gfx,
	linux-kernel, dri-devel

On Tue 25-04-17 21:03:32, Chris Wilson wrote:
> On Tue, Apr 25, 2017 at 06:41:20PM +0200, Michal Hocko wrote:
> > Hi,
> > I have just experienced X being shut down once with 4.11-rc2 and 2 times
> > with 4.11-rc6 kernel.  I do not remember seeing something like this
> > before but it is quite possible I was just lucky to not trigger this
> > issue before. It always happened while I was working on a presentation
> > in LibreOffice which I do very seldom. The kernel log contains:
> > 
> > [ 7456.721893] [drm] GPU HANG: ecode 9:0:0x86dffffd, in Xorg [3594], reason: Hang on render ring, action: reset
> > [ 7456.721897] [drm] GPU hangs can indicate a bug anywhere in the entire gfx stack, including userspace.
> > [ 7456.721898] [drm] Please file a _new_ bug report on bugs.freedesktop.org against DRI -> DRM/Intel
> > [ 7456.721900] [drm] drm/i915 developers can then reassign to the right component if it's not a kernel issue.
> > [ 7456.721901] [drm] The gpu crash dump is required to analyze gpu hangs, so please always attach it.
> > [ 7456.721902] [drm] GPU crash dump saved to /sys/class/drm/card0/error
> > [ 7456.721925] drm/i915: Resetting chip after gpu hang
> > [ 7456.722117] [drm] RC6 on
> > [ 7456.734588] [drm] GuC firmware load skipped
> > [ 7464.686209] drm/i915: Resetting chip after gpu hang
> > [ 7464.686284] [drm] RC6 on
> > [ 7464.702469] [drm] GuC firmware load skipped
> > [ 7472.686180] drm/i915: Resetting chip after gpu hang
> > [ 7472.686241] [drm] RC6 on
> > [ 7472.704565] [drm] GuC firmware load skipped
> > [ 7480.686179] drm/i915: Resetting chip after gpu hang
> > [ 7480.686241] [drm] RC6 on
> > [ 7480.704583] [drm] GuC firmware load skipped
> > [ 7493.678130] drm/i915: Resetting chip after gpu hang
> > [ 7493.678206] [drm] RC6 on
> > [ 7493.696505] [drm] GuC firmware load skipped
> > 
> > The kernel message tells that the problem might be anywhere and I should
> > report to freedesktop but I haven't changed the userspace recently so it
> > smells more like a kernel bug to me. Does this ring bells? The GPU crash
> > dump is attached in case it is useful.
> 
> There are lots of very similar GPU hangs for mesa across a wide range of
> kernels, with several reports noting a correlation with libreoffice.
> 
> At first glance, I would say you were just unlucky to hit it.

OK, good to know. Thanks!
-- 
Michal Hocko
SUSE Labs

^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2017-04-26  9:47 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2017-04-25 16:41 GPU hangs and X shot down with 4.11-rc6 Michal Hocko
2017-04-25 20:03 ` [Intel-gfx] " Chris Wilson
2017-04-26  9:28   ` Michal Hocko

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).