* [Bug 112266] [Navi] Pathfinder: Kingmaker is causing a GPU hang: flip_done timed out error
From: bugzilla-daemon @ 2019-11-14  1:54 UTC (permalink / raw)
  To: dri-devel



https://bugs.freedesktop.org/show_bug.cgi?id=112266

            Bug ID: 112266
           Summary: [Navi] Pathfinder: Kingmaker is causing a GPU hang:
                    flip_done timed out error
           Product: DRI
           Version: unspecified
          Hardware: x86-64 (AMD64)
                OS: Linux (All)
            Status: NEW
          Severity: normal
          Priority: not set
         Component: DRM/AMDgpu
          Assignee: dri-devel@lists.freedesktop.org
          Reporter: shtetldik@gmail.com

When running Pathfinder: Kingmaker (latest GOG release, which should be the
same as the latest Steam one) on a Sapphire Pulse RX 5700 XT, the GPU hangs
with a flip_done timed out error (see below for the detailed log). This
doesn't look like the common shader hangs with a ring gfx_0.0.0 timeout or the
common sdma hangs.

The game uses OpenGL. I run it on Debian testing with this configuration:

kernel: 5.4-rc7
radeonsi: Mesa-master / llvm10:

OpenGL renderer string: AMD NAVI10 (DRM 3.35.0, 5.4.0-rc7, LLVM 10.0.0)
OpenGL core profile version string: 4.5 (Core Profile) Mesa 20.0.0-devel
(git-eb6352162d)

llvm: 10~+201911120943210600592dd459242
from this llvm10 snapshot:
https://tracker.debian.org/news/1079513/accepted-llvm-toolchain-snapshot-110201911120943210600592dd459242-1exp1-source-into-experimental/


DE: KDE Plasma 5.14.5 (X session).
GPU: Sapphire Pulse RX 5700 XT
Monitor: LG 27GL85-B (2560x1440, 144 Hz, DisplayPort 1.4 connection, adaptive
sync activated in Xorg configuration).

I launch the game with AMD_DEBUG=nodma,nongg set in the environment.
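
For anyone trying to reproduce with the same flags, here is a minimal sketch
of starting a program with AMD_DEBUG set. The helper names build_env and
launch, and the launcher path in the usage comment, are hypothetical and not
part of the original report; only the AMD_DEBUG=nodma,nongg value comes from
it.

```python
import os
import subprocess


def build_env(base=None, flags=("nodma", "nongg")):
    """Return a copy of the environment with AMD_DEBUG set to the given flags."""
    env = dict(os.environ if base is None else base)
    env["AMD_DEBUG"] = ",".join(flags)
    return env


def launch(command, flags=("nodma", "nongg")):
    """Start a program with the AMD_DEBUG flags applied (does not wait for it)."""
    return subprocess.Popen(command, env=build_env(flags=flags))


# Hypothetical usage; the launcher path is an assumption, adjust to your install:
# launch(["./start.sh"])
```

Copying the environment rather than editing os.environ keeps the flags scoped
to the launched process only.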

Recording an apitrace doesn't help, since replaying it does not reproduce the
hang. So could it be an amdgpu issue? Please let me know what additional
info would help you narrow it down. The hang is quite reproducible, and you
can try it yourself with Pathfinder: Kingmaker.
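
When reproducing, a small filter over saved dmesg output can pick out the
flip_done timeouts and the DRM object each one names. The following is only a
sketch: the function names are made up for illustration, and the regular
expressions assume the message format seen in the log in this report. Since
mail archives wrap long log lines, it first joins continuation lines onto the
timestamped line before them.

```python
import re

# A kernel log line that starts with a "[  seconds.micros]" timestamp.
TIMESTAMP = re.compile(r"^\[\s*(\d+\.\d+)\]\s?(.*)$")
# The DRM object named in a flip_done timeout, e.g. "[CRTC:62:crtc-0]".
FLIP_DONE = re.compile(
    r"\*ERROR\* \[(CRTC|PLANE):(\d+):([^\]]+)\] flip_done timed out"
)


def unwrap(lines):
    """Join wrapped continuation lines onto the preceding timestamped line."""
    records = []
    for line in lines:
        match = TIMESTAMP.match(line)
        if match:
            records.append([float(match.group(1)), match.group(2)])
        elif records and line.strip():
            records[-1][1] += " " + line.strip()
    return [(stamp, text) for stamp, text in records]


def flip_done_timeouts(lines):
    """Return (timestamp, kind, object id, object name) for each timeout."""
    found = []
    for stamp, text in unwrap(lines):
        match = FLIP_DONE.search(text)
        if match:
            found.append(
                (stamp, match.group(1), int(match.group(2)), match.group(3))
            )
    return found
```

Feeding it the lines of a saved log (for example, the output of dmesg written
to a file and read with readlines()) yields one tuple per timeout, which makes
it easy to compare runs.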

The hang produces this in dmesg:

[  659.445501] [drm:drm_atomic_helper_wait_for_dependencies [drm_kms_helper]]
*ERROR* [CRTC:62:crtc-0] flip_done timed out
[  669.685601] [drm:drm_atomic_helper_wait_for_dependencies [drm_kms_helper]]
*ERROR* [PLANE:55:plane-5] flip_done timed out
[  669.685644] ------------[ cut here ]------------
[  669.685729] WARNING: CPU: 6 PID: 1018 at
drivers/gpu/drm/amd/amdgpu/../display/amdgpu_dm/amdgpu_dm.c:5851
amdgpu_dm_atomic_commit_tail+0x1c56/0x1d70 [amdgpu]
[  669.685730] Modules linked in: rfcomm(E) nf_tables(E) nfnetlink(E) bnep(E)
edac_mce_amd(E) kvm_amd(E) kvm(E) irqbypass(E) crct10dif_pclmul(E) btusb(E)
btrtl(E) snd_hda_codec_realtek(E) btbcm(E) crc32_pclmul(E) btintel(E) iwlmvm(E)
snd_hda_codec_generic(E) bluetooth(E) ghash_clmulni_intel(E) ledtrig_audio(E)
mac80211(E) libarc4(E) snd_hda_codec_hdmi(E) uvcvideo(E) snd_hda_intel(E)
videobuf2_vmalloc(E) snd_usb_audio(E) snd_intel_nhlt(E) videobuf2_memops(E)
drbg(E) snd_hda_codec(E) videobuf2_v4l2(E) snd_usbmidi_lib(E) iwlwifi(E)
nls_ascii(E) snd_hda_core(E) snd_rawmidi(E) videobuf2_common(E)
snd_seq_device(E) snd_hwdep(E) efi_pstore(E) nls_cp437(E) ansi_cprng(E)
snd_pcm(E) videodev(E) sp5100_tco(E) aesni_intel(E) cfg80211(E) vfat(E)
ecdh_generic(E) crypto_simd(E) ecc(E) snd_timer(E) fat(E) ccp(E) snd(E)
cryptd(E) mc(E) glue_helper(E) crc16(E) wmi_bmof(E) pcspkr(E) efivars(E)
k10temp(E) watchdog(E) sg(E) rfkill(E) soundcore(E) rng_core(E) evdev(E)
acpi_cpufreq(E) nct6775(E) hwmon_vid(E)
[  669.685753]  parport_pc(E) ppdev(E) lp(E) parport(E) efivarfs(E)
ip_tables(E) x_tables(E) autofs4(E) xfs(E) btrfs(E) xor(E) zstd_decompress(E)
zstd_compress(E) raid6_pq(E) libcrc32c(E) crc32c_generic(E) sd_mod(E)
hid_generic(E) usbhid(E) hid(E) amdgpu(E) gpu_sched(E) mxm_wmi(E) ahci(E)
ttm(E) libahci(E) drm_kms_helper(E) xhci_pci(E) crc32c_intel(E) xhci_hcd(E)
i2c_piix4(E) libata(E) drm(E) igb(E) dca(E) mfd_core(E) ptp(E) scsi_mod(E)
usbcore(E) pps_core(E) i2c_algo_bit(E) nvme(E) nvme_core(E) wmi(E) button(E)
[  669.685770] CPU: 6 PID: 1018 Comm: Xorg Tainted: G            E    
5.4.0-rc7 #31
[  669.685771] Hardware name: To Be Filled By O.E.M. To Be Filled By
O.E.M./X570 Taichi, BIOS P2.50 11/02/2019
[  669.685846] RIP: 0010:amdgpu_dm_atomic_commit_tail+0x1c56/0x1d70 [amdgpu]
[  669.685847] Code: 67 fb ff ff 41 8b 4c 24 60 48 c7 c2 60 d6 a2 c0 bf 02 00
00 00 48 c7 c6 80 f8 a9 c0 e8 e3 7d bb ff 49 8b 47 08 e9 31 e5 ff ff <0f> 0b e9
b4 ec ff ff 0f 0b 0f 0b e9 cb ec ff ff 48 8b 85 b0 fd ff
[  669.685848] RSP: 0018:ffffb80fc1a978d0 EFLAGS: 00010002
[  669.685849] RAX: 0000000000000002 RBX: ffff9454b5d54c00 RCX:
ffff9455ec2c6170
[  669.685850] RDX: 0000000000000001 RSI: 0000000000000206 RDI:
ffff9455eaba6158
[  669.685851] RBP: ffffb80fc1a97b80 R08: 0000000000000005 R09:
0000000000000000
[  669.685851] R10: ffffb80fc1a97838 R11: ffffb80fc1a9783c R12:
0000000000000206
[  669.685852] R13: ffff9455ec2c6000 R14: ffff94559d443800 R15:
ffff9455eda20000
[  669.685853] FS:  00007fc6a5a21f00(0000) GS:ffff9455fe980000(0000)
knlGS:0000000000000000
[  669.685854] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  669.685855] CR2: 00007fc6a5991678 CR3: 00000007f0390000 CR4:
0000000000340ee0
[  669.685856] Call Trace:
[  669.685864]  ? __irq_work_queue_local+0x50/0x60
[  669.685872]  ? commit_tail+0x94/0x110 [drm_kms_helper]
[  669.685878]  commit_tail+0x94/0x110 [drm_kms_helper]
[  669.685884]  drm_atomic_helper_commit+0xb8/0x130 [drm_kms_helper]
[  669.685889]  drm_atomic_helper_set_config+0x79/0x90 [drm_kms_helper]
[  669.685902]  drm_mode_setcrtc+0x194/0x6a0 [drm]
[  669.685956]  ? amdgpu_cs_wait_ioctl+0xeb/0x160 [amdgpu]
[  669.685966]  ? drm_mode_getcrtc+0x180/0x180 [drm]
[  669.685976]  drm_ioctl_kernel+0xaa/0xf0 [drm]
[  669.685986]  drm_ioctl+0x208/0x390 [drm]
[  669.685995]  ? drm_mode_getcrtc+0x180/0x180 [drm]
[  669.686044]  amdgpu_drm_ioctl+0x49/0x80 [amdgpu]
[  669.686048]  do_vfs_ioctl+0x40e/0x670
[  669.686050]  ksys_ioctl+0x5e/0x90
[  669.686052]  __x64_sys_ioctl+0x16/0x20
[  669.686055]  do_syscall_64+0x52/0x160
[  669.686058]  entry_SYSCALL_64_after_hwframe+0x44/0xa9
[  669.686060] RIP: 0033:0x7fc6a5f6a5b7
[  669.686061] Code: 00 00 90 48 8b 05 d9 78 0c 00 64 c7 00 26 00 00 00 48 c7
c0 ff ff ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 b8 10 00 00 00 0f 05 <48> 3d 01
f0 ff ff 73 01 c3 48 8b 0d a9 78 0c 00 f7 d8 64 89 01 48
[  669.686062] RSP: 002b:00007ffd36fb37a8 EFLAGS: 00003246 ORIG_RAX:
0000000000000010
[  669.686063] RAX: ffffffffffffffda RBX: 00007ffd36fb37e0 RCX:
00007fc6a5f6a5b7
[  669.686064] RDX: 00007ffd36fb37e0 RSI: 00000000c06864a2 RDI:
000000000000000d
[  669.686064] RBP: 00000000c06864a2 R08: 0000000000000000 R09:
000055c668ad0740
[  669.686065] R10: 0000000000000000 R11: 0000000000003246 R12:
0000000000000000
[  669.686065] R13: 000000000000000d R14: 000055c668a607d0 R15:
0000000000000000
[  669.686067] ---[ end trace 47feccd771299f6b ]---
[  669.686082] ------------[ cut here ]------------
[  669.686158] WARNING: CPU: 6 PID: 1018 at
drivers/gpu/drm/amd/amdgpu/../display/amdgpu_dm/amdgpu_dm.c:5458
amdgpu_dm_atomic_commit_tail+0x1c5f/0x1d70 [amdgpu]
[  669.686158] Modules linked in: rfcomm(E) nf_tables(E) nfnetlink(E) bnep(E)
edac_mce_amd(E) kvm_amd(E) kvm(E) irqbypass(E) crct10dif_pclmul(E) btusb(E)
btrtl(E) snd_hda_codec_realtek(E) btbcm(E) crc32_pclmul(E) btintel(E) iwlmvm(E)
snd_hda_codec_generic(E) bluetooth(E) ghash_clmulni_intel(E) ledtrig_audio(E)
mac80211(E) libarc4(E) snd_hda_codec_hdmi(E) uvcvideo(E) snd_hda_intel(E)
videobuf2_vmalloc(E) snd_usb_audio(E) snd_intel_nhlt(E) videobuf2_memops(E)
drbg(E) snd_hda_codec(E) videobuf2_v4l2(E) snd_usbmidi_lib(E) iwlwifi(E)
nls_ascii(E) snd_hda_core(E) snd_rawmidi(E) videobuf2_common(E)
snd_seq_device(E) snd_hwdep(E) efi_pstore(E) nls_cp437(E) ansi_cprng(E)
snd_pcm(E) videodev(E) sp5100_tco(E) aesni_intel(E) cfg80211(E) vfat(E)
ecdh_generic(E) crypto_simd(E) ecc(E) snd_timer(E) fat(E) ccp(E) snd(E)
cryptd(E) mc(E) glue_helper(E) crc16(E) wmi_bmof(E) pcspkr(E) efivars(E)
k10temp(E) watchdog(E) sg(E) rfkill(E) soundcore(E) rng_core(E) evdev(E)
acpi_cpufreq(E) nct6775(E) hwmon_vid(E)
[  669.686175]  parport_pc(E) ppdev(E) lp(E) parport(E) efivarfs(E)
ip_tables(E) x_tables(E) autofs4(E) xfs(E) btrfs(E) xor(E) zstd_decompress(E)
zstd_compress(E) raid6_pq(E) libcrc32c(E) crc32c_generic(E) sd_mod(E)
hid_generic(E) usbhid(E) hid(E) amdgpu(E) gpu_sched(E) mxm_wmi(E) ahci(E)
ttm(E) libahci(E) drm_kms_helper(E) xhci_pci(E) crc32c_intel(E) xhci_hcd(E)
i2c_piix4(E) libata(E) drm(E) igb(E) dca(E) mfd_core(E) ptp(E) scsi_mod(E)
usbcore(E) pps_core(E) i2c_algo_bit(E) nvme(E) nvme_core(E) wmi(E) button(E)
[  669.686187] CPU: 6 PID: 1018 Comm: Xorg Tainted: G        W   E    
5.4.0-rc7 #31
[  669.686187] Hardware name: To Be Filled By O.E.M. To Be Filled By
O.E.M./X570 Taichi, BIOS P2.50 11/02/2019
[  669.686258] RIP: 0010:amdgpu_dm_atomic_commit_tail+0x1c5f/0x1d70 [amdgpu]
[  669.686259] Code: 48 c7 c2 60 d6 a2 c0 bf 02 00 00 00 48 c7 c6 80 f8 a9 c0
e8 e3 7d bb ff 49 8b 47 08 e9 31 e5 ff ff 0f 0b e9 b4 ec ff ff 0f 0b <0f> 0b e9
cb ec ff ff 48 8b 85 b0 fd ff ff 48 8d 8d 18 fe ff ff 48
[  669.686259] RSP: 0018:ffffb80fc1a978d0 EFLAGS: 00010082
[  669.686260] RAX: 0000000000000002 RBX: ffff9454b5d54c00 RCX:
ffff9455ec2c6170
[  669.686261] RDX: 0000000000000001 RSI: 0000000000000206 RDI:
ffff9455eaba6158
[  669.686261] RBP: ffffb80fc1a97b80 R08: 0000000000000005 R09:
0000000000000000
[  669.686262] R10: ffffb80fc1a97838 R11: ffffb80fc1a9783c R12:
0000000000000206
[  669.686263] R13: ffff9455ec2c6000 R14: ffff94559d443800 R15:
ffff9455eda20000
[  669.686264] FS:  00007fc6a5a21f00(0000) GS:ffff9455fe980000(0000)
knlGS:0000000000000000
[  669.686264] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  669.686265] CR2: 00007fc6a5991678 CR3: 00000007f0390000 CR4:
0000000000340ee0
[  669.686266] Call Trace:
[  669.686270]  ? __irq_work_queue_local+0x50/0x60
[  669.686277]  ? commit_tail+0x94/0x110 [drm_kms_helper]
[  669.686282]  commit_tail+0x94/0x110 [drm_kms_helper]
[  669.686288]  drm_atomic_helper_commit+0xb8/0x130 [drm_kms_helper]
[  669.686293]  drm_atomic_helper_set_config+0x79/0x90 [drm_kms_helper]
[  669.686304]  drm_mode_setcrtc+0x194/0x6a0 [drm]
[  669.686357]  ? amdgpu_cs_wait_ioctl+0xeb/0x160 [amdgpu]
[  669.686367]  ? drm_mode_getcrtc+0x180/0x180 [drm]
[  669.686377]  drm_ioctl_kernel+0xaa/0xf0 [drm]
[  669.686386]  drm_ioctl+0x208/0x390 [drm]
[  669.686396]  ? drm_mode_getcrtc+0x180/0x180 [drm]
[  669.686445]  amdgpu_drm_ioctl+0x49/0x80 [amdgpu]
[  669.686447]  do_vfs_ioctl+0x40e/0x670
[  669.686449]  ksys_ioctl+0x5e/0x90
[  669.686451]  __x64_sys_ioctl+0x16/0x20
[  669.686453]  do_syscall_64+0x52/0x160
[  669.686454]  entry_SYSCALL_64_after_hwframe+0x44/0xa9
[  669.686455] RIP: 0033:0x7fc6a5f6a5b7
[  669.686457] Code: 00 00 90 48 8b 05 d9 78 0c 00 64 c7 00 26 00 00 00 48 c7
c0 ff ff ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 b8 10 00 00 00 0f 05 <48> 3d 01
f0 ff ff 73 01 c3 48 8b 0d a9 78 0c 00 f7 d8 64 89 01 48
[  669.686457] RSP: 002b:00007ffd36fb37a8 EFLAGS: 00003246 ORIG_RAX:
0000000000000010
[  669.686458] RAX: ffffffffffffffda RBX: 00007ffd36fb37e0 RCX:
00007fc6a5f6a5b7
[  669.686459] RDX: 00007ffd36fb37e0 RSI: 00000000c06864a2 RDI:
000000000000000d
[  669.686459] RBP: 00000000c06864a2 R08: 0000000000000000 R09:
000055c668ad0740
[  669.686460] R10: 0000000000000000 R11: 0000000000003246 R12:
0000000000000000
[  669.686461] R13: 000000000000000d R14: 000055c668a607d0 R15:
0000000000000000
[  669.686462] ---[ end trace 47feccd771299f6c ]---
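
As a cross-check on the trace, the userspace register dump can be decoded by
hand. ORIG_RAX is 0x10, which is the ioctl syscall on x86-64, and RSI (the
second syscall argument) holds the ioctl request number 0xc06864a2. The sketch
below splits that number into its fields, assuming the standard
asm-generic/ioctl.h bit layout used on x86-64; decode_ioctl is a hypothetical
helper name.

```python
# Bit layout of an ioctl request number (asm-generic/ioctl.h, as on x86-64).
NR_BITS, TYPE_BITS, SIZE_BITS, DIR_BITS = 8, 8, 14, 2
NR_SHIFT = 0
TYPE_SHIFT = NR_SHIFT + NR_BITS
SIZE_SHIFT = TYPE_SHIFT + TYPE_BITS
DIR_SHIFT = SIZE_SHIFT + SIZE_BITS

# Direction is expressed from userspace's point of view.
DIRECTIONS = {0: "none", 1: "write", 2: "read", 3: "read/write"}


def decode_ioctl(request):
    """Split an ioctl request number into direction, size, type and number."""
    def field(shift, bits):
        return (request >> shift) & ((1 << bits) - 1)

    return {
        "dir": DIRECTIONS[field(DIR_SHIFT, DIR_BITS)],
        "size": field(SIZE_SHIFT, SIZE_BITS),
        "type": chr(field(TYPE_SHIFT, TYPE_BITS)),
        "nr": field(NR_SHIFT, NR_BITS),
    }


# Value of RSI in the userspace frame of the trace above.
print(decode_ioctl(0xC06864A2))
```

The fields come out as a read/write ioctl of type 'd' (the DRM ioctl base),
number 0xa2, with a 104-byte argument. In the DRM uAPI, 0xa2 should be
DRM_IOCTL_MODE_SETCRTC, which is consistent with drm_mode_setcrtc appearing in
the call trace: Xorg was performing a legacy mode set when the commit timed
out.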

