From mboxrd@z Thu Jan 1 00:00:00 1970
From: bugzilla-daemon@freedesktop.org
Subject: [Bug 105251] [Vega10] GPU lockup on boot: VMC page fault
Date: Wed, 22 Aug 2018 20:21:43 +0000
Message-ID:
References:
Mime-Version: 1.0
Content-Type: multipart/mixed; boundary="===============1046110377=="
Return-path:
Received: from culpepper.freedesktop.org (culpepper.freedesktop.org
[131.252.210.165])
by gabe.freedesktop.org (Postfix) with ESMTP id 7EDBF6E43D
for ; Wed, 22 Aug 2018 20:21:43 +0000 (UTC)
In-Reply-To:
List-Unsubscribe: ,
List-Archive:
List-Post:
List-Help:
List-Subscribe: ,
Errors-To: dri-devel-bounces@lists.freedesktop.org
Sender: "dri-devel"
To: dri-devel@lists.freedesktop.org
List-Id: dri-devel@lists.freedesktop.org
--===============1046110377==
Content-Type: multipart/alternative; boundary="15349693031.DBbc1e.20547"
Content-Transfer-Encoding: 7bit
--15349693031.DBbc1e.20547
Date: Wed, 22 Aug 2018 20:21:43 +0000
MIME-Version: 1.0
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
X-Bugzilla-URL: http://bugs.freedesktop.org/
Auto-Submitted: auto-generated
https://bugs.freedesktop.org/show_bug.cgi?id=3D105251
--- Comment #23 from Andrey Grodzovsky ---
(In reply to CheatCodesOfLife from comment #22)
> You're welcome.
>=20
> Not the exact same problem, no. I can get a hard-lock by trying to use
> amdvlk to play rpcs3, but it doesn't produce the same error and it's not =
as
> consistent (takes up to 15 minutes to crash)
>=20
> Not sure if it's worth noting but I went back and tried every Cemu version
> back to 1.5 and a lot of wine versions going back to 2.8. It happens every
> time as soon as the game loads.
Let's try to get some debug info for the VMC page fault then -=20=20
Clone and build our open source register analyzer from here -
https://cgit.freedesktop.org/amd/umr/=20
Install trace-cmd utility=20
Load driver with cmd line parameter amdgpu.vm_fault_stop=3D2 from grub
P.S Best to use latest kernel from here -
https://cgit.freedesktop.org/~agd5f/linux/log/?h=3Damd-staging-drm-next
After desktop is loaded type=20
sudo trace-cmd start -e dma_fence -e gpu_scheduler -e amdgpu -v -e
"amdgpu:amdgpu_mm_rreg" -e "amdgpu:amdgpu_mm_wreg" -e "amdgpu:amdgpu_iv"
to enable kernel event tracing log
If possible to launch the game from shell then prepend the command with=20
GALLIUM_DDEBUG=3Dalways=20
to dump all the MESA commands into files in ~/ddebug_dumps/
Start the game. When the problem happens do the following -=20
as root=20
cd /sys/kernel/debug/tracing && cat trace > event_dump
as normal user or root
sudo umr -lb > umr_dump
sudo umr -O verbose,use_colour -R gfx[.] >> umr_dump
sudo umr -O halt_waves,use_colour -wa >> umr_dump
dmesg > dmesg_dump
Upload a tar/zip of all those files + all the files from ~/ddebug_dumps/
--=20
You are receiving this mail because:
You are the assignee for the bug.=
--15349693031.DBbc1e.20547
Date: Wed, 22 Aug 2018 20:21:43 +0000
MIME-Version: 1.0
Content-Type: text/html; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
X-Bugzilla-URL: http://bugs.freedesktop.org/
Auto-Submitted: auto-generated
Comme=
nt # 23
on bug 10525=
1
from Andrey Grodzovsky
(In reply to CheatCodesOfLife from comment #22)
> You're welcome.
>=20
> Not the exact same problem, no. I can get a hard-lock by trying to use
> amdvlk to play rpcs3, but it doesn't produce the same error and it's n=
ot as
> consistent (takes up to 15 minutes to crash)
>=20
> Not sure if it's worth noting but I went back and tried every Cemu ver=
sion
> back to 1.5 and a lot of wine versions going back to 2.8. It happens e=
very
> time as soon as the game loads.
Let's try to get some debug info for the VMC page fault then -=20=20
Clone and build our open source register analyzer from here -
https://cgit.freedesktop.=
org/amd/umr/=20
Install trace-cmd utility=20
Load driver with cmd line parameter amdgpu.vm_fault_stop=3D2 from grub
P.S Best to use latest kernel from here -
https://cgit.freedesktop.org/~agd5f/linux/log/?h=3Damd-staging-drm=
-next
After desktop is loaded type=20
sudo trace-cmd start -e dma_fence -e gpu_scheduler -e amdgpu -v -e
"amdgpu:amdgpu_mm_rreg" -e "amdgpu:amdgpu_mm_wreg" -e &=
quot;amdgpu:amdgpu_iv"
to enable kernel event tracing log
If possible to launch the game from shell then prepend the command with=20
GALLIUM_DDEBUG=3Dalways=20
to dump all the MESA commands into files in ~/ddebug_dumps/
Start the game. When the problem happens do the following -=20
as root=20
cd /sys/kernel/debug/tracing && cat trace > event_dump
as normal user or root
sudo umr -lb > umr_dump
sudo umr -O verbose,use_colour -R gfx[.] >> umr_dump
sudo umr -O halt_waves,use_colour -wa >> umr_dump
dmesg > dmesg_dump
Upload a tar/zip of all those files + all the files from ~/ddebug_dumps/
You are receiving this mail because:
- You are the assignee for the bug.
=
--15349693031.DBbc1e.20547--
--===============1046110377==
Content-Type: text/plain; charset="utf-8"
MIME-Version: 1.0
Content-Transfer-Encoding: base64
Content-Disposition: inline
X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs
IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz
dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg==
--===============1046110377==--