From mboxrd@z Thu Jan 1 00:00:00 1970
From: bugzilla-daemon@freedesktop.org
Subject: [Bug 109692] deadlock occurs during GPU reset
Date: Sun, 12 May 2019 20:58:56 +0000
Message-ID:
References:
Mime-Version: 1.0
Content-Type: multipart/mixed; boundary="===============1495493921=="
Return-path:
Received: from culpepper.freedesktop.org (culpepper.freedesktop.org
[131.252.210.165])
by gabe.freedesktop.org (Postfix) with ESMTP id DC6C389308
for ; Sun, 12 May 2019 20:58:56 +0000 (UTC)
In-Reply-To:
List-Unsubscribe: ,
List-Archive:
List-Post:
List-Help:
List-Subscribe: ,
Errors-To: dri-devel-bounces@lists.freedesktop.org
Sender: "dri-devel"
To: dri-devel@lists.freedesktop.org
List-Id: dri-devel@lists.freedesktop.org
--===============1495493921==
Content-Type: multipart/alternative; boundary="15576947363.D263Ae.8334"
Content-Transfer-Encoding: 7bit
--15576947363.D263Ae.8334
Date: Sun, 12 May 2019 20:58:56 +0000
MIME-Version: 1.0
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
X-Bugzilla-URL: http://bugs.freedesktop.org/
Auto-Submitted: auto-generated
https://bugs.freedesktop.org/show_bug.cgi?id=3D109692
--- Comment #42 from mikhail.v.gavrilov@gmail.com ---
Andrey, could you look on another GPU reset issue?
[18735.255511] sony 0003:054C:09CC.0008: input,hidraw4: USB HID v81.11 Game=
pad
[Sony Interactive Entertainment Wireless Controller] on
usb-0000:0c:00.3-1.4.4/input3
[18735.340415] usbcore: registered new interface driver snd-usb-audio
[18742.241131] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring page0 timeou=
t,
signaled seq=3D201025, emitted seq=3D201027
[18742.241141] [drm:drm_atomic_helper_wait_for_flip_done [drm_kms_helper]]
*ERROR* [CRTC:47:crtc-0] flip_done timed out
[18742.241196] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process informati=
on:
process pid 0 thread pid 0
[18742.241200] amdgpu 0000:0b:00.0: GPU reset begin!
[18742.251116] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout,
signaled seq=3D2264508, emitted seq=3D2264510
[18742.251153] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process informati=
on:
process chrome pid 17912 thread chrome:cs0 pid 17915
[18742.251156] amdgpu 0000:0b:00.0: GPU reset begin!
[18742.754134] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring page1 timeou=
t,
signaled seq=3D15388, emitted seq=3D15388
[18742.754203] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process informati=
on:
process pid 0 thread pid 0
[18742.754207] amdgpu 0000:0b:00.0: GPU reset begin!
[18751.968977] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma1 timeou=
t,
signaled seq=3D346, emitted seq=3D346
[18751.969042] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process informati=
on:
process chrome pid 3587 thread chrome:cs0 pid 3604
[18751.969047] amdgpu 0000:0b:00.0: GPU reset begin!
[18753.504894] [drm:drm_atomic_helper_wait_for_dependencies [drm_kms_helper=
]]
*ERROR* [CRTC:47:crtc-0] flip_done timed out
[18763.744722] [drm:drm_atomic_helper_wait_for_dependencies [drm_kms_helper=
]]
*ERROR* [PLANE:45:plane-5] flip_done timed out
[18763.744758] amdgpu: [powerplay] Failed to send message 0x47, response
0xffffffff
[18763.744770] amdgpu: [powerplay] Failed to send message 0x28, response
0xffffffff
[18763.744772] amdgpu: [powerplay] [SetUclkToHightestDpmLevel] Set hard min
uclk failed!
[18763.744784] amdgpu: [powerplay] Failed to send message 0x28, response
0xffffffff
[18763.744785] amdgpu: [powerplay] Attempt to set Hard Min for DCEFCLK Fail=
ed!
[18763.744796] amdgpu: [powerplay] Failed to send message 0x28, response
0xffffffff
[18763.744798] amdgpu: [powerplay] [SetHardMinFreq] Set hard min uclk faile=
d!
[18763.744809] amdgpu: [powerplay] Failed to send message 0x26, response
0xffffffff
[18763.744810] amdgpu: [powerplay] Failed to set soft min gfxclk !
[18763.744811] amdgpu: [powerplay] Failed to upload DPM Bootup Levels!
[18763.779547] [drm] REG_WAIT timeout 10us * 3000 tries -
dce110_stream_encoder_dp_blank line:950
[18763.779651] WARNING: CPU: 4 PID: 25404 at
drivers/gpu/drm/amd/amdgpu/../display/dc/dc_helper.c:277
generic_reg_wait.cold+0x29/0x30 [amdgpu]
[18763.779652] Modules linked in: snd_usb_audio hid_sony ff_memless
snd_usbmidi_lib snd_rawmidi snd_seq_dummy uinput fuse rfcomm cmac
nf_conntrack_netbios_ns nf_conntrack_broadcast xt_CT ip6t_rpfilter ip6t_REJ=
ECT
nf_reject_ipv6 ipt_REJECT nf_reject_ipv4 xt_conntrack ebtable_nat ip6table_=
nat
ip6table_mangle ip6table_raw ip6table_security iptable_nat nf_nat
iptable_mangle iptable_raw iptable_security nf_conntrack nf_defrag_ipv6
nf_defrag_ipv4 ip_set nfnetlink ebtable_filter ebtables ip6table_filter
ip6_tables iptable_filter ip_tables bnep sunrpc vfat fat arc4 joydev r8822b=
e(C)
snd_hda_codec_realtek snd_hda_codec_generic edac_mce_amd ledtrig_audio
eeepc_wmi snd_hda_codec_hdmi asus_wmi kvm_amd sparse_keymap snd_hda_intel v=
ideo
wmi_bmof mac80211 snd_hda_codec kvm snd_hda_core snd_hwdep snd_seq btusb bt=
rtl
snd_seq_device btbcm irqbypass btintel bluetooth snd_pcm crct10dif_pclmul
cfg80211 crc32_pclmul snd_timer ghash_clmulni_intel snd sp5100_tco ecdh_gen=
eric
soundcore i2c_piix4 k10temp rfkill ccp
[18763.779676] gpio_amdpt gpio_generic pcc_cpufreq acpi_cpufreq xfs libcrc=
32c
amdgpu chash gpu_sched amd_iommu_v2 ttm drm_kms_helper igb drm nvme dca
i2c_algo_bit crc32c_intel nvme_core wmi pinctrl_amd
[18763.779687] CPU: 4 PID: 25404 Comm: kworker/4:0 Tainted: G C=20=
=20=20=20=20=20=20
5.1.0-1.fc31.x86_64 #1
[18763.779689] Hardware name: System manufacturer System Product Name/ROG S=
TRIX
X470-I GAMING, BIOS 2202 04/11/2019
[18763.779694] Workqueue: events drm_sched_job_timedout [gpu_sched]
[18763.779764] RIP: 0010:generic_reg_wait.cold+0x29/0x30 [amdgpu]
[18763.779766] Code: ff 44 8b 44 24 68 48 8b 4c 24 60 48 c7 c7 10 c0 8b c0 =
8b
54 24 58 8b 34 24 e8 b0 59 92 d0 41 83 7c 24 20 01 0f 84 9b 78 fe ff <0f> 0=
b e9
94 78 fe ff e8 9a 4d ed ff 48 c7 c7 00 b0 95 c0 e8 de 5f
[18763.779767] RSP: 0018:ffff9b5320bb7720 EFLAGS: 00010297
[18763.779769] RAX: 0000000000000052 RBX: 0000000000000bb9 RCX:
0000000000000000
[18763.779771] RDX: 0000000000000000 RSI: ffff8d40fd9168c8 RDI:
ffff8d40fd9168c8
[18763.779772] RBP: 00000000000050e2 R08: ffff8d40fd9168c8 R09:
000000000000053d
[18763.779773] R10: ffff8d411e39d7d0 R11: ffff9b5320bb75d5 R12:
ffff8d40f97ca580
[18763.779774] R13: 00000000000050e2 R14: 00000000ffffffff R15:
00000000ffffffff
[18763.779776] FS: 0000000000000000(0000) GS:ffff8d40fd900000(0000)
knlGS:0000000000000000
[18763.779778] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[18763.779779] CR2: 00000c0d6e33c000 CR3: 000000027ed00000 CR4:
00000000003406e0
[18763.779780] Call Trace:
[18763.779854] dce110_stream_encoder_dp_blank+0x159/0x2e0 [amdgpu]
[18763.779921] core_link_disable_stream+0x42/0x270 [amdgpu]
[18763.779987] dce110_reset_hw_ctx_wrap+0xca/0x1f0 [amdgpu]
[18763.780053] dce110_apply_ctx_to_hw+0x4a/0x490 [amdgpu]
[18763.780100] ? amdgpu_pm_compute_clocks+0xb9/0x5e0 [amdgpu]
[18763.780168] ? dm_pp_apply_display_requirements+0x19e/0x1b0 [amdgpu]
[18763.780232] dc_commit_state+0x26b/0x570 [amdgpu]
[18763.780236] ? vsnprintf+0x3aa/0x4f0
[18763.780304] amdgpu_dm_atomic_commit_tail+0x3e2/0x1980 [amdgpu]
[18763.780308] ? vt_console_print+0x74/0x3f0
[18763.780311] ? up+0x12/0x60
[18763.780314] ? irq_work_queue+0x91/0xa0
[18763.780316] ? wake_up_klogd+0x30/0x40
[18763.780318] ? vprintk_emit+0x1ef/0x250
[18763.780320] ? printk+0x58/0x6f
[18763.780328] ? drm_atomic_helper_wait_for_dependencies+0x1e4/0x1f0
[drm_kms_helper]
[18763.780341] ? drm_err+0x72/0x90 [drm]
[18763.780349] ? commit_tail+0x3c/0x70 [drm_kms_helper]
[18763.780356] commit_tail+0x3c/0x70 [drm_kms_helper]
[18763.780364] drm_atomic_helper_commit+0x108/0x110 [drm_kms_helper]
[18763.780371] drm_atomic_helper_disable_all+0x144/0x160 [drm_kms_helper]
[18763.780378] drm_atomic_helper_suspend+0x60/0xf0 [drm_kms_helper]
[18763.780445] dm_suspend+0x1c/0x60 [amdgpu]
[18763.780489] amdgpu_device_ip_suspend_phase1+0x91/0xc0 [amdgpu]
[18763.780532] amdgpu_device_ip_suspend+0x1c/0x60 [amdgpu]
[18763.780601] amdgpu_device_pre_asic_reset+0x1f4/0x209 [amdgpu]
[18763.780668] amdgpu_device_gpu_recover+0x77/0x731 [amdgpu]
[18763.780730] amdgpu_job_timedout+0xf7/0x120 [amdgpu]
[18763.780733] drm_sched_job_timedout+0x3a/0x70 [gpu_sched]
[18763.780737] process_one_work+0x19d/0x380
[18763.780739] worker_thread+0x50/0x3b0
[18763.780742] kthread+0xfb/0x130
[18763.780744] ? process_one_work+0x380/0x380
[18763.780745] ? kthread_park+0x90/0x90
[18763.780748] ret_from_fork+0x22/0x40
[18763.780751] ---[ end trace 1b9ec5613589027e ]---
--=20
You are receiving this mail because:
You are the assignee for the bug.=
--15576947363.D263Ae.8334
Date: Sun, 12 May 2019 20:58:56 +0000
MIME-Version: 1.0
Content-Type: text/html; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
X-Bugzilla-URL: http://bugs.freedesktop.org/
Auto-Submitted: auto-generated
Comme=
nt # 42
on bug 10969=
2
from mikhail.v.gavrilov@gmail.com
Andrey, could you look on another GPU reset issue?
[18735.255511] sony 0003:054C:09CC.0008: input,hidraw4: USB HID v81.11 Game=
pad
[Sony Interactive Entertainment Wireless Controller] on
usb-0000:0c:00.3-1.4.4/input3
[18735.340415] usbcore: registered new interface driver snd-usb-audio
[18742.241131] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring page0 timeou=
t,
signaled seq=3D201025, emitted seq=3D201027
[18742.241141] [drm:drm_atomic_helper_wait_for_flip_done [drm_kms_helper]]
*ERROR* [CRTC:47:crtc-0] flip_done timed out
[18742.241196] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process informati=
on:
process pid 0 thread pid 0
[18742.241200] amdgpu 0000:0b:00.0: GPU reset begin!
[18742.251116] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout,
signaled seq=3D2264508, emitted seq=3D2264510
[18742.251153] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process informati=
on:
process chrome pid 17912 thread chrome:cs0 pid 17915
[18742.251156] amdgpu 0000:0b:00.0: GPU reset begin!
[18742.754134] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring page1 timeou=
t,
signaled seq=3D15388, emitted seq=3D15388
[18742.754203] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process informati=
on:
process pid 0 thread pid 0
[18742.754207] amdgpu 0000:0b:00.0: GPU reset begin!
[18751.968977] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma1 timeou=
t,
signaled seq=3D346, emitted seq=3D346
[18751.969042] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process informati=
on:
process chrome pid 3587 thread chrome:cs0 pid 3604
[18751.969047] amdgpu 0000:0b:00.0: GPU reset begin!
[18753.504894] [drm:drm_atomic_helper_wait_for_dependencies [drm_kms_helper=
]]
*ERROR* [CRTC:47:crtc-0] flip_done timed out
[18763.744722] [drm:drm_atomic_helper_wait_for_dependencies [drm_kms_helper=
]]
*ERROR* [PLANE:45:plane-5] flip_done timed out
[18763.744758] amdgpu: [powerplay] Failed to send message 0x47, response
0xffffffff
[18763.744770] amdgpu: [powerplay] Failed to send message 0x28, response
0xffffffff
[18763.744772] amdgpu: [powerplay] [SetUclkToHightestDpmLevel] Set hard min
uclk failed!
[18763.744784] amdgpu: [powerplay] Failed to send message 0x28, response
0xffffffff
[18763.744785] amdgpu: [powerplay] Attempt to set Hard Min for DCEFCLK Fail=
ed!
[18763.744796] amdgpu: [powerplay] Failed to send message 0x28, response
0xffffffff
[18763.744798] amdgpu: [powerplay] [SetHardMinFreq] Set hard min uclk faile=
d!
[18763.744809] amdgpu: [powerplay] Failed to send message 0x26, response
0xffffffff
[18763.744810] amdgpu: [powerplay] Failed to set soft min gfxclk !
[18763.744811] amdgpu: [powerplay] Failed to upload DPM Bootup Levels!
[18763.779547] [drm] REG_WAIT timeout 10us * 3000 tries -
dce110_stream_encoder_dp_blank line:950
[18763.779651] WARNING: CPU: 4 PID: 25404 at
drivers/gpu/drm/amd/amdgpu/../display/dc/dc_helper.c:277
generic_reg_wait.cold+0x29/0x30 [amdgpu]
[18763.779652] Modules linked in: snd_usb_audio hid_sony ff_memless
snd_usbmidi_lib snd_rawmidi snd_seq_dummy uinput fuse rfcomm cmac
nf_conntrack_netbios_ns nf_conntrack_broadcast xt_CT ip6t_rpfilter ip6t_REJ=
ECT
nf_reject_ipv6 ipt_REJECT nf_reject_ipv4 xt_conntrack ebtable_nat ip6table_=
nat
ip6table_mangle ip6table_raw ip6table_security iptable_nat nf_nat
iptable_mangle iptable_raw iptable_security nf_conntrack nf_defrag_ipv6
nf_defrag_ipv4 ip_set nfnetlink ebtable_filter ebtables ip6table_filter
ip6_tables iptable_filter ip_tables bnep sunrpc vfat fat arc4 joydev r8822b=
e(C)
snd_hda_codec_realtek snd_hda_codec_generic edac_mce_amd ledtrig_audio
eeepc_wmi snd_hda_codec_hdmi asus_wmi kvm_amd sparse_keymap snd_hda_intel v=
ideo
wmi_bmof mac80211 snd_hda_codec kvm snd_hda_core snd_hwdep snd_seq btusb bt=
rtl
snd_seq_device btbcm irqbypass btintel bluetooth snd_pcm crct10dif_pclmul
cfg80211 crc32_pclmul snd_timer ghash_clmulni_intel snd sp5100_tco ecdh_gen=
eric
soundcore i2c_piix4 k10temp rfkill ccp
[18763.779676] gpio_amdpt gpio_generic pcc_cpufreq acpi_cpufreq xfs libcrc=
32c
amdgpu chash gpu_sched amd_iommu_v2 ttm drm_kms_helper igb drm nvme dca
i2c_algo_bit crc32c_intel nvme_core wmi pinctrl_amd
[18763.779687] CPU: 4 PID: 25404 Comm: kworker/4:0 Tainted: G C=20=
=20=20=20=20=20=20
5.1.0-1.fc31.x86_64 #1
[18763.779689] Hardware name: System manufacturer System Product Name/ROG S=
TRIX
X470-I GAMING, BIOS 2202 04/11/2019
[18763.779694] Workqueue: events drm_sched_job_timedout [gpu_sched]
[18763.779764] RIP: 0010:generic_reg_wait.cold+0x29/0x30 [amdgpu]
[18763.779766] Code: ff 44 8b 44 24 68 48 8b 4c 24 60 48 c7 c7 10 c0 8b c0 =
8b
54 24 58 8b 34 24 e8 b0 59 92 d0 41 83 7c 24 20 01 0f 84 9b 78 fe ff <0f=
> 0b e9
94 78 fe ff e8 9a 4d ed ff 48 c7 c7 00 b0 95 c0 e8 de 5f
[18763.779767] RSP: 0018:ffff9b5320bb7720 EFLAGS: 00010297
[18763.779769] RAX: 0000000000000052 RBX: 0000000000000bb9 RCX:
0000000000000000
[18763.779771] RDX: 0000000000000000 RSI: ffff8d40fd9168c8 RDI:
ffff8d40fd9168c8
[18763.779772] RBP: 00000000000050e2 R08: ffff8d40fd9168c8 R09:
000000000000053d
[18763.779773] R10: ffff8d411e39d7d0 R11: ffff9b5320bb75d5 R12:
ffff8d40f97ca580
[18763.779774] R13: 00000000000050e2 R14: 00000000ffffffff R15:
00000000ffffffff
[18763.779776] FS: 0000000000000000(0000) GS:ffff8d40fd900000(0000)
knlGS:0000000000000000
[18763.779778] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[18763.779779] CR2: 00000c0d6e33c000 CR3: 000000027ed00000 CR4:
00000000003406e0
[18763.779780] Call Trace:
[18763.779854] dce110_stream_encoder_dp_blank+0x159/0x2e0 [amdgpu]
[18763.779921] core_link_disable_stream+0x42/0x270 [amdgpu]
[18763.779987] dce110_reset_hw_ctx_wrap+0xca/0x1f0 [amdgpu]
[18763.780053] dce110_apply_ctx_to_hw+0x4a/0x490 [amdgpu]
[18763.780100] ? amdgpu_pm_compute_clocks+0xb9/0x5e0 [amdgpu]
[18763.780168] ? dm_pp_apply_display_requirements+0x19e/0x1b0 [amdgpu]
[18763.780232] dc_commit_state+0x26b/0x570 [amdgpu]
[18763.780236] ? vsnprintf+0x3aa/0x4f0
[18763.780304] amdgpu_dm_atomic_commit_tail+0x3e2/0x1980 [amdgpu]
[18763.780308] ? vt_console_print+0x74/0x3f0
[18763.780311] ? up+0x12/0x60
[18763.780314] ? irq_work_queue+0x91/0xa0
[18763.780316] ? wake_up_klogd+0x30/0x40
[18763.780318] ? vprintk_emit+0x1ef/0x250
[18763.780320] ? printk+0x58/0x6f
[18763.780328] ? drm_atomic_helper_wait_for_dependencies+0x1e4/0x1f0
[drm_kms_helper]
[18763.780341] ? drm_err+0x72/0x90 [drm]
[18763.780349] ? commit_tail+0x3c/0x70 [drm_kms_helper]
[18763.780356] commit_tail+0x3c/0x70 [drm_kms_helper]
[18763.780364] drm_atomic_helper_commit+0x108/0x110 [drm_kms_helper]
[18763.780371] drm_atomic_helper_disable_all+0x144/0x160 [drm_kms_helper]
[18763.780378] drm_atomic_helper_suspend+0x60/0xf0 [drm_kms_helper]
[18763.780445] dm_suspend+0x1c/0x60 [amdgpu]
[18763.780489] amdgpu_device_ip_suspend_phase1+0x91/0xc0 [amdgpu]
[18763.780532] amdgpu_device_ip_suspend+0x1c/0x60 [amdgpu]
[18763.780601] amdgpu_device_pre_asic_reset+0x1f4/0x209 [amdgpu]
[18763.780668] amdgpu_device_gpu_recover+0x77/0x731 [amdgpu]
[18763.780730] amdgpu_job_timedout+0xf7/0x120 [amdgpu]
[18763.780733] drm_sched_job_timedout+0x3a/0x70 [gpu_sched]
[18763.780737] process_one_work+0x19d/0x380
[18763.780739] worker_thread+0x50/0x3b0
[18763.780742] kthread+0xfb/0x130
[18763.780744] ? process_one_work+0x380/0x380
[18763.780745] ? kthread_park+0x90/0x90
[18763.780748] ret_from_fork+0x22/0x40
[18763.780751] ---[ end trace 1b9ec5613589027e ]---
You are receiving this mail because:
- You are the assignee for the bug.
=
--15576947363.D263Ae.8334--
--===============1495493921==
Content-Type: text/plain; charset="utf-8"
MIME-Version: 1.0
Content-Transfer-Encoding: base64
Content-Disposition: inline
X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs
IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz
dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVs
--===============1495493921==--