From: bugzilla-daemon@freedesktop.org
To: dri-devel@lists.freedesktop.org
Subject: [Bug 111481] AMD Navi GPU frequent freezes on both Manjaro/Ubuntu with kernel 5.3 and mesa 19.2 -git/llvm9
Date: Wed, 23 Oct 2019 17:18:44 +0000
Message-ID: <bug-111481-502-poZhpghs7K@http.bugs.freedesktop.org/>
In-Reply-To: <bug-111481-502@http.bugs.freedesktop.org/>


https://bugs.freedesktop.org/show_bug.cgi?id=111481

--- Comment #128 from yamagi@yamagi.org ---
(In reply to yamagi from comment #124)
> Interestingly I've got the problem the other way round. My 5700XT was
> running fine since I got it about two weeks ago. This is Arch Linux, I've
> run Mesa 19.2.1 and llvm-libs 9.0.0 since day one. The card was stable with
> 5.4-RC2 and 5.4-RC3, not a single hang in about 10 hours The Witcher 3 under
> wine + dxvk and Yamagi Quake II with OpenGL 3.2 renderer. After I upgraded
> to 5.4-RC4 I've seen several GPU hangs. The last one, and the only one
> that's still in the logs was:
> 
> [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma0 timeout, signaled
> seq=85270, emitted seq=85272
> 
> That one was in Yamagi Quake II, but I had hangs on the desktop and in The
> Witcher 3. I have no umr reports so far. I've just compiled the tool and
> will see if I can get some.


As promised, some more information:

For me the crash is fairly easy to reproduce with Linux 5.4-RC4. All it takes
is Yamagi Quake II (Revision 1232289, can be found at
https://github.com/yquake2/yquake2) with the OpenGL 3.2 renderer. The old
OpenGL 1.4 renderer doesn't trigger it. Start the game (it's a good idea to set
timedemo mode to 1) and just let it cycle through the demo loop until it
crashes. I used './quake +set timedemo 1 +set vid_renderer gl3'. I had never
experienced this crash in the wild with Linux 5.4-RC3 until I learned that I
can trigger it with the Quake II demo loop. With Linux 5.4-RC3 it usually takes
somewhere between 20 and 30 cycles through the loop to trigger, with 5.4-RC4
only 5 to 10 cycles. So something changed between RC3 and RC4 that made it more
likely.

I suspect some kind of timing issue. The demo loop is deterministic, it
generates exactly the same API calls each time it's run. While the crash always
happens while a loading screen is up, it never occurs at the same one.
Sometimes it's in the fifth iteration, the next time in the 12th and so on.
Putting apitrace (adds some latency!) onto it makes it much less likely to
occur, to the point that I thought it's a heisenbug. The same goes for cycling
through the loop without timedemo mode enabled (~20 FPS in normal mode, ~1000
FPS in timedemo mode).

I made an apitrace for easier reproduction. It's a little bit big for bugzilla,
so I've uploaded it here: https://deponie.yamagi.org/temp/quake2.trace.xz
Replaying it usually triggers the crash during the first or second run.
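
The replay can be repeated automatically until the hang shows up. Below is a
minimal Python sketch of such a loop, not something from the original setup. It
assumes that glretrace is on PATH, that the downloaded .xz file has already
been decompressed to quake2.trace, and that the current user may read the
kernel log with dmesg. The helper names and the 300 second timeout are made up
for illustration.

```python
import subprocess

def replay_until_hang(run_once, hang_detected, max_runs=20):
    """Call run_once() up to max_runs times.

    Returns the 1-based number of the run after which hang_detected()
    was true, or None if no hang was seen within max_runs.
    """
    for run in range(1, max_runs + 1):
        run_once()
        if hang_detected():
            return run
    return None

def run_glretrace(trace_path="quake2.trace", timeout_s=300):
    # Assumption: glretrace is on PATH and trace_path is the decompressed trace.
    # A hung GPU can leave glretrace blocked, so give up after timeout_s.
    try:
        subprocess.run(["glretrace", trace_path], check=False, timeout=timeout_s)
    except subprocess.TimeoutExpired:
        pass  # treat as "possibly hung"; the kernel log check decides

def kernel_log_has_timeout():
    # Assumption: the current user is allowed to read the kernel log.
    out = subprocess.run(["dmesg"], capture_output=True, text=True).stdout
    return "amdgpu_job_timedout" in out

# Usage on the affected machine:
#   replay_until_hang(run_glretrace, kernel_log_has_timeout)
```

The check only looks for the function name in the whole kernel log, so it
should be started from a fresh boot. An older timeout in the log would
otherwise be counted as a hit on the first run.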

The exact software versions were:
* Linux 5.4-RC4 with https://bugzilla.freedesktop.org/attachment.cgi?id=145323
and https://bugzilla.freedesktop.org/attachment.cgi?id=145734 applied.
* Mesa 19.2.1-2
* LLVM 9.0.0

dmesg output after a crash in Quake II's demo loop is:
[  122.294181] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma0 timeout, signaled seq=177737, emitted seq=177739
[  122.294256] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process glretrace pid 1302 thread glretrace:cs0 pid 1303
[  122.294257] [drm] GPU recovery disabled.

dmesg output after a crash by replaying the apitrace is:
[  266.695388] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma0 timeout, signaled seq=27598, emitted seq=27600
[  266.695463] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process glretrace pid 1372 thread glretrace:cs0 pid 1373
[  266.695465] [drm] GPU recovery disabled.
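
To compare hangs across runs it can help to pull the ring name and the two
sequence numbers out of the log. The following is a small Python sketch for
that, assuming the message format stays exactly as quoted above. Reading
"emitted minus signaled" as the number of jobs submitted but not yet completed
is my interpretation, not something the log states.

```python
import re

# Matches the amdgpu timeout message quoted above. \s+ also tolerates the
# line wrap that mail clients insert between "timeout," and "signaled".
TIMEOUT_RE = re.compile(
    r"ring\s+(?P<ring>\S+)\s+timeout,\s+"
    r"signaled\s+seq=(?P<signaled>\d+),\s+"
    r"emitted\s+seq=(?P<emitted>\d+)"
)

def parse_timeouts(log_text):
    """Return one dict per ring timeout message found in log_text."""
    results = []
    for m in TIMEOUT_RE.finditer(log_text):
        signaled = int(m.group("signaled"))
        emitted = int(m.group("emitted"))
        results.append({
            "ring": m.group("ring"),
            "signaled": signaled,
            "emitted": emitted,
            # Assumed meaning: jobs emitted but not yet signaled.
            "outstanding": emitted - signaled,
        })
    return results

# The two timeout lines from this comment, wrapped as in the mail.
SAMPLE = """\
[  122.294181] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma0 timeout,
signaled seq=177737, emitted seq=177739
[  266.695388] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma0 timeout,
signaled seq=27598, emitted seq=27600
"""

if __name__ == "__main__":
    for entry in parse_timeouts(SAMPLE):
        print(entry)
```

For both logs above this reports ring sdma0 with a difference of 2 between the
emitted and the signaled sequence number.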

I'm attaching the state of sdma0 in both cases.

I hope this helps to find the root cause of this. If I can provide more
information, don't hesitate to ask.
