From: bugzilla-daemon@freedesktop.org
To: dri-devel@lists.freedesktop.org
Subject: [Bug 105425] 3D & games produce periodic GPU crashes (Radeon R7 370)
Date: Tue, 17 Apr 2018 09:27:54 +0000
List-Id: dri-devel@lists.freedesktop.org

https://bugs.freedesktop.org/show_bug.cgi?id=105425

Comment #31 on bug 105425 from iive@yahoo.com
(In reply to MirceaKitsune from comment #29)
> For the first time ever, I might finally have some very good news on this
> issue! It will take several more days to confirm, then possibly another
> month to pinpoint the exact option responsible. However it's possible I may
> have found something that finally gets rid of the crash.
>
> The issue appears to go away when playing Xonotic with those parameters:
>
> export LIBGL_DEBUG=true LIBGL_NO_DRAWARRAYS=true LIBGL_DRI3_DISABLE=true
> MESA_DEBUG=true MESA_NO_ASM=true MESA_NO_MMX=true MESA_NO_3DNOW=true
> MESA_NO_SSE=true MESA_NO_ERROR=true MESA_GLSL_CACHE_DISABLE=true
> MESA_NO_MINMAX_CACHE=true RADEON_NO_TCL=true DRAW_NO_FSE=true DRAW_USE_LLVM=0
>
> I additionally disabled the cvar "r_shadows 2" which I forgot I had on for a
> while now, as it enabled a shadowing system that might have itself been the
> culprit.
>
> With these two changes, I was able to clock up to 120 minutes of continuous
> gameplay last night, followed by an outstanding 200 minutes today! That's
> over 2 respectively 3 hours with no system freeze whatsoever. I need to
> repeat this test several times to be 100% sure there's not still some
> obscure chance of it happening, but in any case there is definitely a major
> difference visible.

"MESA_NO_ASM=true" supersedes the other "MESA_NO_MMX=true MESA_NO_3DNOW=true
MESA_NO_SSE=true", so you don't need to make combinations with all of them.
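
With those three flags dropped, eleven variables are left from the quoted list. A minimal sketch of how they could be tried one at a time is below. It assumes the game is started with a command like `xonotic-glx`, which is only a placeholder for whatever launcher is actually used, and it prints the command lines instead of running them:

```shell
#!/bin/sh
# Candidate variables from the quoted comment, minus the three that
# MESA_NO_ASM=true already covers.
CANDIDATES="LIBGL_DEBUG=true LIBGL_NO_DRAWARRAYS=true LIBGL_DRI3_DISABLE=true
MESA_DEBUG=true MESA_NO_ASM=true MESA_NO_ERROR=true
MESA_GLSL_CACHE_DISABLE=true MESA_NO_MINMAX_CACHE=true
RADEON_NO_TCL=true DRAW_NO_FSE=true DRAW_USE_LLVM=0"

# Hypothetical launcher name (an assumption); replace with the real command.
GAME_CMD="${GAME_CMD:-xonotic-glx}"

# Print one launch line per variable so each can be tried in isolation.
print_single_var_runs() {
    for assignment in $CANDIDATES; do
        printf 'env %s %s\n' "$assignment" "$GAME_CMD"
    done
}

print_single_var_runs
```

Each printed line sets exactly one variable for one run, so a session that survives points at that variable without changing the global environment.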


Also, I don't see you testing `export mesa_glthread=false`. Race conditions are
among the hardest bugs to catch and reproduce.
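
The glthread setting can also be applied to a single run instead of being exported for the whole session. A small sketch of that shell mechanism follows; `env` stands in for the game launcher here and only shows what the child process would see:

```shell
#!/bin/sh
# Prefixing a command with VAR=value sets the variable for that process only.
# `env` is a stand-in for the game launcher (an assumption for illustration).
mesa_glthread=false env | grep '^mesa_glthread='
```

Replacing `env | grep ...` with the real launcher runs the game with glthread off and leaves the rest of the session untouched.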

If you think that 'r_shadows' could quickly and "reliably" trigger a hang, then
I would ask you to focus on it first.
1. Read about sysrq and make sure you have it enabled in the kernel and that it
works. Make sure you have a text console, as you might need it.
2. Re-enable "r_shadows 2".
3. Use apitrace to capture a hang while playing the game.
4. Try to reboot gracefully, using sysrq to sync and reboot, or switch to a text
console and restart.
5. Test whether the recorded trace can reproduce the crash reliably.
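
The steps above could be sketched roughly as follows. This is an illustration under assumptions, not a tested recipe: `xonotic-glx` and `xonotic.trace` are placeholder names, and the apitrace commands are printed rather than executed so that nothing is launched by the script itself.

```shell
#!/bin/sh
# Hypothetical launcher name (an assumption); replace with the real command.
GAME_CMD="${GAME_CMD:-xonotic-glx}"
TRACE_FILE="xonotic.trace"   # arbitrary file name chosen for this sketch

# Step 1: succeed if a kernel.sysrq value permits both sync and reboot.
# 1 enables every function; otherwise the value is a bitmask in which
# 16 allows sync and 128 allows reboot/poweroff.
sysrq_allows_sync_reboot() {
    value=$1
    [ "$value" -eq 1 ] && return 0
    [ $(( value & 16 )) -ne 0 ] && [ $(( value & 128 )) -ne 0 ]
}

# Report the current setting if the kernel exposes it.
if [ -r /proc/sys/kernel/sysrq ]; then
    current=$(cat /proc/sys/kernel/sysrq)
    if sysrq_allows_sync_reboot "$current"; then
        echo "sysrq=$current: sync and reboot are allowed"
    else
        echo "sysrq=$current: enable with 'sysctl kernel.sysrq=1' (as root)"
    fi
fi

# Steps 3 and 5: printed rather than executed, so nothing is launched here.
print_trace_steps() {
    printf 'apitrace trace --output=%s %s\n' "$TRACE_FILE" "$GAME_CMD"
    printf 'apitrace replay %s\n' "$TRACE_FILE"
}
print_trace_steps
```

For step 4, once sysrq is enabled, Alt+SysRq+S requests a sync and Alt+SysRq+B reboots immediately; syncing first gives the trace file a better chance of reaching the disk.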

If the trace seems complete and it cannot reproduce the bug, then maybe it does
capture everything, but the bug is not a simple infinite loop in the shader.
(Those seem to be a common cause of hangs.)

If the bug can be reliably reproduced, it will be fixed.

Good luck.

