From mboxrd@z Thu Jan 1 00:00:00 1970
From: bugzilla-daemon@freedesktop.org
Subject: [Bug 105251] [Vega10] GPU lockup on boot: VMC page fault
Date: Fri, 24 Aug 2018 14:11:55 +0000
Message-ID:
References:
Mime-Version: 1.0
Content-Type: multipart/mixed; boundary="===============1581578657=="
Return-path:
Received: from culpepper.freedesktop.org (culpepper.freedesktop.org
[131.252.210.165])
by gabe.freedesktop.org (Postfix) with ESMTP id E80476E6E6
for ; Fri, 24 Aug 2018 14:11:55 +0000 (UTC)
In-Reply-To:
List-Unsubscribe: ,
List-Archive:
List-Post:
List-Help:
List-Subscribe: ,
Errors-To: dri-devel-bounces@lists.freedesktop.org
Sender: "dri-devel"
To: dri-devel@lists.freedesktop.org
List-Id: dri-devel@lists.freedesktop.org
--===============1581578657==
Content-Type: multipart/alternative; boundary="15351199155.92AF6D4.10799"
Content-Transfer-Encoding: 7bit
--15351199155.92AF6D4.10799
Date: Fri, 24 Aug 2018 14:11:55 +0000
MIME-Version: 1.0
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
X-Bugzilla-URL: http://bugs.freedesktop.org/
Auto-Submitted: auto-generated
https://bugs.freedesktop.org/show_bug.cgi?id=3D105251
--- Comment #24 from CheatCodesOfLife ---
(In reply to Andrey Grodzovsky from comment #23)
> (In reply to CheatCodesOfLife from comment #22)
> > You're welcome.
> >=20
> > Not the exact same problem, no. I can get a hard-lock by trying to use
> > amdvlk to play rpcs3, but it doesn't produce the same error and it's no=
t as
> > consistent (takes up to 15 minutes to crash)
> >=20
> > Not sure if it's worth noting but I went back and tried every Cemu vers=
ion
> > back to 1.5 and a lot of wine versions going back to 2.8. It happens ev=
ery
> > time as soon as the game loads.
>=20
> Let's try to get some debug info for the VMC page fault then -=20=20
>=20
> Clone and build our open source register analyzer from here -
> https://cgit.freedesktop.org/amd/umr/=20
> Install trace-cmd utility=20
> Load driver with cmd line parameter amdgpu.vm_fault_stop=3D2 from grub
> P.S Best to use latest kernel from here -
> https://cgit.freedesktop.org/~agd5f/linux/log/?h=3Damd-staging-drm-next
>=20
> After desktop is loaded type=20
>=20
> sudo trace-cmd start -e dma_fence -e gpu_scheduler -e amdgpu -v -e
> "amdgpu:amdgpu_mm_rreg" -e "amdgpu:amdgpu_mm_wreg" -e "amdgpu:amdgpu_iv"
> to enable kernel event tracing log
>=20
> If possible to launch the game from shell then prepend the command with=20
> GALLIUM_DDEBUG=3Dalways=20
> to dump all the MESA commands into files in ~/ddebug_dumps/
>=20
>=20
> Start the game. When the problem happens do the following -=20
>=20
> as root=20
> cd /sys/kernel/debug/tracing && cat trace > event_dump
>=20
> as normal user or root
> sudo umr -lb > umr_dump
> sudo umr -O verbose,use_colour -R gfx[.] >> umr_dump
> sudo umr -O halt_waves,use_colour -wa >> umr_dump
> dmesg > dmesg_dump
>=20
> Upload a tar/zip of all those files + all the files from ~/ddebug_dumps/
Thanks for the instructions. I think I've followed them correctly. I didn't
build the amd-drm-next kernel as it'll be an overnight job (slow internet
speeds) but I did add the grub parameters. I have attached the files.
--=20
You are receiving this mail because:
You are the assignee for the bug.=
--15351199155.92AF6D4.10799
Date: Fri, 24 Aug 2018 14:11:55 +0000
MIME-Version: 1.0
Content-Type: text/html; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
X-Bugzilla-URL: http://bugs.freedesktop.org/
Auto-Submitted: auto-generated
Comme=
nt # 24
on bug 10525=
1
from CheatCodesOfLife
(In reply to Andrey Grodzovsky from comment #23)
> (In reply to CheatCodesOfLife from comment #22)
> > You're welcome.
> >=20
> > Not the exact same problem, no. I can get a hard-lock by trying t=
o use
> > amdvlk to play rpcs3, but it doesn't produce the same error and i=
t's not as
> > consistent (takes up to 15 minutes to crash)
> >=20
> > Not sure if it's worth noting but I went back and tried every Cem=
u version
> > back to 1.5 and a lot of wine versions going back to 2.8. It happ=
ens every
> > time as soon as the game loads.
>=20
> Let's try to get some debug info for the VMC page fault then -=20=20
>=20
> Clone and build our open source register analyzer from here -
> https://cgit.freedes=
ktop.org/amd/umr/=20
> Install trace-cmd utility=20
> Load driver with cmd line parameter amdgpu.vm_fault_stop=3D2 from grub
> P.S Best to use latest kernel from here -
> https://cgit.freedesktop.org/~agd5f/linux/log/?h=3Damd-stagin=
g-drm-next
>=20
> After desktop is loaded type=20
>=20
> sudo trace-cmd start -e dma_fence -e gpu_scheduler -e amdgpu -v -e
> "amdgpu:amdgpu_mm_rreg" -e "amdgpu:amdgpu_mm_wreg"=
-e "amdgpu:amdgpu_iv"
> to enable kernel event tracing log
>=20
> If possible to launch the game from shell then prepend the command wit=
h=20
> GALLIUM_DDEBUG=3Dalways=20
> to dump all the MESA commands into files in ~/ddebug_dumps/
>=20
>=20
> Start the game. When the problem happens do the following -=20
>=20
> as root=20
> cd /sys/kernel/debug/tracing && cat trace > event_dump
>=20
> as normal user or root
> sudo umr -lb > umr_dump
> sudo umr -O verbose,use_colour -R gfx[.] >> umr_dump
> sudo umr -O halt_waves,use_colour -wa >> umr_dump
> dmesg > dmesg_dump
>=20
> Upload a tar/zip of all those files + all the files from ~/ddebug_dump=
s/
Thanks for the instructions. I think I've followed them correctly. I didn't
build the amd-drm-next kernel as it'll be an overnight job (slow internet
speeds) but I did add the grub parameters. I have attached the files.
You are receiving this mail because:
- You are the assignee for the bug.
=
--15351199155.92AF6D4.10799--
--===============1581578657==
Content-Type: text/plain; charset="utf-8"
MIME-Version: 1.0
Content-Transfer-Encoding: base64
Content-Disposition: inline
X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs
IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz
dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg==
--===============1581578657==--