From: bugzilla-daemon@freedesktop.org
To: dri-devel@lists.freedesktop.org
Subject: [Bug 105251] [Vega10] GPU lockup on boot: VMC page fault
Date: Wed, 22 Aug 2018 20:21:43 +0000

https://bugs.freedesktop.org/show_bug.cgi?id=105251

Comment #23 on bug 105251 from Andrey Grodzovsky
(In reply to CheatCodesOfLife from comment #22)
> You're welcome.
>
> Not the exact same problem, no. I can get a hard-lock by trying to use
> amdvlk to play rpcs3, but it doesn't produce the same error and it's not as
> consistent (takes up to 15 minutes to crash)
>
> Not sure if it's worth noting but I went back and tried every Cemu version
> back to 1.5 and a lot of wine versions going back to 2.8. It happens every
> time as soon as the game loads.

Let's try to get some debug info for the VMC page fault then -

Clone and build our open source register analyzer from here -
https://cgit.freedesktop.org/amd/umr/
Install the trace-cmd utility.
Load the driver with the kernel command line parameter amdgpu.vm_fault_stop=2 from grub.
P.S. Best to use the latest kernel from here -
https://cgit.freedesktop.org/~agd5f/linux/log/?h=amd-staging-drm-next
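
Before reproducing, it may be worth confirming that the parameter actually reached the kernel. A minimal sketch, assuming the parameter was passed on the kernel command line (so it shows up in /proc/cmdline); the helper name fault_stop_value is made up for illustration:

```shell
#!/bin/sh
# Hypothetical helper: print the value of amdgpu.vm_fault_stop found in a
# kernel command-line string, or print nothing if the parameter is absent.
fault_stop_value() {
    printf '%s\n' "$1" | tr ' ' '\n' \
        | sed -n 's/^amdgpu\.vm_fault_stop=//p' | tail -n 1
}

# Check the running kernel; /proc/cmdline holds the boot command line.
if [ -r /proc/cmdline ]; then
    value=$(fault_stop_value "$(cat /proc/cmdline)")
    if [ "$value" = "2" ]; then
        echo "amdgpu.vm_fault_stop=2 is set"
    else
        echo "amdgpu.vm_fault_stop=2 not found on the kernel command line"
    fi
fi
```

If the second message appears, the grub change most likely did not take effect and the reproduction should wait until it does.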

After the desktop is loaded, type

sudo trace-cmd start -e dma_fence -e gpu_scheduler -e amdgpu -v -e "amdgpu:amdgpu_mm_rreg" -e "amdgpu:amdgpu_mm_wreg" -e "amdgpu:amdgpu_iv"

to enable the kernel event tracing log.

If the game can be launched from a shell, prepend the command with
GALLIUM_DDEBUG=always
to dump all the Mesa commands into files in ~/ddebug_dumps/
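
For instance, the launch could be wrapped as in the sketch below. run_with_ddebug is a made-up helper name, and ./game only stands in for whatever command starts the game.

```shell
#!/bin/sh
# Hypothetical wrapper: run any command with GALLIUM_DDEBUG=always set only
# for that command, leaving the rest of the shell session untouched.
run_with_ddebug() {
    env GALLIUM_DDEBUG=always "$@"
}

# Usage (placeholder command):
#   run_with_ddebug ./game
```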


Start the game. When the problem happens, do the following -

as root
cd /sys/kernel/debug/tracing && cat trace > event_dump

as a normal user or root
sudo umr -lb > umr_dump
sudo umr -O verbose,use_colour -R gfx[.] >> umr_dump
sudo umr -O halt_waves,use_colour -wa >> umr_dump
dmesg > dmesg_dump

Upload a tar/zip of all those files + all the files from ~/ddebug_dumps/
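
Collecting everything into one archive could look like the sketch below. It assumes the dump files carry the names used above; bundle_dumps and the archive name are placeholders, not part of any tool mentioned here.

```shell
#!/bin/sh
# Hypothetical helper: pack the given files/directories into a gzip-compressed
# tar archive, skipping any path that does not exist.
# Usage: bundle_dumps ARCHIVE PATH...
bundle_dumps() {
    archive=$1
    shift
    count=$#
    # Append every existing path to the argument list, then drop the originals.
    for p in "$@"; do
        if [ -e "$p" ]; then
            set -- "$@" "$p"
        fi
    done
    shift "$count"
    if [ $# -eq 0 ]; then
        echo "nothing to bundle" >&2
        return 1
    fi
    tar -czf "$archive" "$@"
}

# Usage (placeholder archive name):
#   bundle_dumps vmc_fault_dumps.tar.gz event_dump umr_dump dmesg_dump "$HOME/ddebug_dumps"
```

Note that event_dump was written under /sys/kernel/debug/tracing in the steps above, so it may need to be copied next to the other files first.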
        


You are receiving this mail because:
  • You are the assignee for the bug.