From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 111481] AMD Navi GPU frequent freezes on both Manjaro/Ubuntu with kernel 5.3 and mesa 19.2 -git/llvm9 Date: Sat, 02 Nov 2019 23:11:39 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0664851383==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 7F49D6E030 for ; Sat, 2 Nov 2019 23:11:39 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0664851383== Content-Type: multipart/alternative; boundary="15727362992.BcdD.29816" Content-Transfer-Encoding: 7bit --15727362992.BcdD.29816 Date: Sat, 2 Nov 2019 23:11:39 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D111481 --- Comment #193 from wychuchol --- Perhaps needs another entry started but it's related (since it didn't happen before I tried RADV_PERFTEST=3Daco and AMD_DEBUG=3D"nongg,nodma") so I'll p= ost it in case someone has had same issues as me. After some time in Witcher 3 GOTY run with Lutris PC restarts on it's own. I thought something is overheating (I've noticed graphic card memory in PSens= or sometimes reaching 90 so I thought maybe that's what's happening) but I investigated kern.log and this always happened before that autonomous reset: Nov 2 22:01:53 pop-os kernel: [ 979.244964] pcieport 0000:00:01.1: AER: Corrected error received: 0000:01:00.0 Nov 2 22:01:53 pop-os kernel: [ 979.244967] nvme 0000:01:00.0: AER: PCIe = Bus Error: severity=3DCorrected, type=3DData Link Layer, (Transmitter ID) Nov 2 22:01:53 pop-os kernel: [ 979.244968] nvme 0000:01:00.0: AER: dev= ice [1987:5012] error status/mask=3D00001000/00006000 Nov 2 22:01:53 pop-os kernel: [ 979.244968] nvme 0000:01:00.0: AER: [1= 2] Timeout=20=20=20=20=20=20=20=20=20=20=20=20=20=20=20 Nov 2 22:01:53 pop-os kernel: [ 979.262629] Emergency Sync complete A solution I found is to add pci=3Dnommconf in /etc/default/grub to the lin= e=20 GRUB_CMDLINE_LINUX_DEFAULT=3D"quiet splash" (so it looks like this: GRUB_CMDLINE_LINUX_DEFAULT=3D"quiet splash pci=3Dnommconf"). --=20 You are receiving this mail because: You are the assignee for the bug.= --15727362992.BcdD.29816 Date: Sat, 2 Nov 2019 23:11:39 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated

Comm= ent # 193 on bug 11148= 1 from wychuchol
Perhaps needs another entry started but it's related (since it=
 didn't happen
before I tried RADV_PERFTEST=3Daco and AMD_DEBUG=3D"nongg,nodma")=
 so I'll post it
in case someone has had same issues as me.

After some time in Witcher 3 GOTY run with Lutris PC restarts on it's own. I
thought something is overheating (I've noticed graphic card memory in PSens=
or
sometimes reaching 90 so I thought maybe that's what's happening) but I
investigated kern.log and this always happened before that autonomous reset:

Nov  2 22:01:53 pop-os kernel: [  979.244964] pcieport 0000:00:01.1: AER:
Corrected error received: 0000:01:00.0
Nov  2 22:01:53 pop-os kernel: [  979.244967] nvme 0000:01:00.0: AER: PCIe =
Bus
Error: severity=3DCorrected, type=3DData Link Layer, (Transmitter ID)
Nov  2 22:01:53 pop-os kernel: [  979.244968] nvme 0000:01:00.0: AER:   dev=
ice
[1987:5012] error status/mask=3D00001000/00006000
Nov  2 22:01:53 pop-os kernel: [  979.244968] nvme 0000:01:00.0: AER:    [1=
2]
Timeout=20=20=20=20=20=20=20=20=20=20=20=20=20=20=20
Nov  2 22:01:53 pop-os kernel: [  979.262629] Emergency Sync complete

A solution I found is to add pci=3Dnommconf in /etc/default/grub to the lin=
e=20
GRUB_CMDLINE_LINUX_DEFAULT=3D"quiet splash" (so it looks like thi=
s:
GRUB_CMDLINE_LINUX_DEFAULT=3D"quiet splash pci=3Dnommconf").


You are receiving this mail because:
  • You are the assignee for the bug.
= --15727362992.BcdD.29816-- --===============0664851383== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVs --===============0664851383==--