From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 105251] [Vega10] GPU lockup on boot: VMC page fault Date: Fri, 24 Aug 2018 14:11:55 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1581578657==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id E80476E6E6 for ; Fri, 24 Aug 2018 14:11:55 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============1581578657== Content-Type: multipart/alternative; boundary="15351199155.92AF6D4.10799" Content-Transfer-Encoding: 7bit --15351199155.92AF6D4.10799 Date: Fri, 24 Aug 2018 14:11:55 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D105251 --- Comment #24 from CheatCodesOfLife --- (In reply to Andrey Grodzovsky from comment #23) > (In reply to CheatCodesOfLife from comment #22) > > You're welcome. > >=20 > > Not the exact same problem, no. I can get a hard-lock by trying to use > > amdvlk to play rpcs3, but it doesn't produce the same error and it's no= t as > > consistent (takes up to 15 minutes to crash) > >=20 > > Not sure if it's worth noting but I went back and tried every Cemu vers= ion > > back to 1.5 and a lot of wine versions going back to 2.8. It happens ev= ery > > time as soon as the game loads. >=20 > Let's try to get some debug info for the VMC page fault then -=20=20 >=20 > Clone and build our open source register analyzer from here - > https://cgit.freedesktop.org/amd/umr/=20 > Install trace-cmd utility=20 > Load driver with cmd line parameter amdgpu.vm_fault_stop=3D2 from grub > P.S Best to use latest kernel from here - > https://cgit.freedesktop.org/~agd5f/linux/log/?h=3Damd-staging-drm-next >=20 > After desktop is loaded type=20 >=20 > sudo trace-cmd start -e dma_fence -e gpu_scheduler -e amdgpu -v -e > "amdgpu:amdgpu_mm_rreg" -e "amdgpu:amdgpu_mm_wreg" -e "amdgpu:amdgpu_iv" > to enable kernel event tracing log >=20 > If possible to launch the game from shell then prepend the command with=20 > GALLIUM_DDEBUG=3Dalways=20 > to dump all the MESA commands into files in ~/ddebug_dumps/ >=20 >=20 > Start the game. When the problem happens do the following -=20 >=20 > as root=20 > cd /sys/kernel/debug/tracing && cat trace > event_dump >=20 > as normal user or root > sudo umr -lb > umr_dump > sudo umr -O verbose,use_colour -R gfx[.] >> umr_dump > sudo umr -O halt_waves,use_colour -wa >> umr_dump > dmesg > dmesg_dump >=20 > Upload a tar/zip of all those files + all the files from ~/ddebug_dumps/ Thanks for the instructions. I think I've followed them correctly. I didn't build the amd-drm-next kernel as it'll be an overnight job (slow internet speeds) but I did add the grub parameters. I have attached the files. --=20 You are receiving this mail because: You are the assignee for the bug.= --15351199155.92AF6D4.10799 Date: Fri, 24 Aug 2018 14:11:55 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated

Comme= nt # 24 on bug 10525= 1 from CheatCodesOfLife
(In reply to Andrey Grodzovsky from comment #23)
> (In reply to CheatCodesOfLife from comment #22)
> > You're welcome.
> >=20
> > Not the exact same problem, no. I can get a hard-lock by trying t=
o use
> > amdvlk to play rpcs3, but it doesn't produce the same error and i=
t's not as
> > consistent (takes up to 15 minutes to crash)
> >=20
> > Not sure if it's worth noting but I went back and tried every Cem=
u version
> > back to 1.5 and a lot of wine versions going back to 2.8. It happ=
ens every
> > time as soon as the game loads.
>=20
> Let's try to get some debug info for the VMC page fault then -=20=20
>=20
> Clone and build our open source register analyzer from here -
> https://cgit.freedes=
ktop.org/amd/umr/=20
> Install trace-cmd utility=20
> Load driver with cmd line parameter amdgpu.vm_fault_stop=3D2 from grub
> P.S Best to use latest kernel from here -
> https://cgit.freedesktop.org/~agd5f/linux/log/?h=3Damd-stagin=
g-drm-next
>=20
> After desktop is loaded type=20
>=20
> sudo trace-cmd start -e dma_fence -e gpu_scheduler -e amdgpu -v -e
> "amdgpu:amdgpu_mm_rreg" -e "amdgpu:amdgpu_mm_wreg"=
 -e "amdgpu:amdgpu_iv"
> to enable kernel event tracing log
>=20
> If possible to launch the game from shell then prepend the command wit=
h=20
> GALLIUM_DDEBUG=3Dalways=20
> to dump all the MESA commands into files in ~/ddebug_dumps/
>=20
>=20
> Start the game. When the problem happens do the following -=20
>=20
> as root=20
> cd /sys/kernel/debug/tracing && cat trace > event_dump
>=20
> as normal user or root
> sudo umr -lb > umr_dump
> sudo umr -O verbose,use_colour -R gfx[.] >> umr_dump
> sudo umr -O halt_waves,use_colour -wa >> umr_dump
> dmesg > dmesg_dump
>=20
> Upload a tar/zip of all those files + all the files from ~/ddebug_dump=
s/

Thanks for the instructions. I think I've followed them correctly. I didn't
build the amd-drm-next kernel as it'll be an overnight job (slow internet
speeds) but I did add the grub parameters. I have attached the files.


You are receiving this mail because:
  • You are the assignee for the bug.
= --15351199155.92AF6D4.10799-- --===============1581578657== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg== --===============1581578657==--