From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 106671] Frequent lock ups for AMD RX 550 graphics card Date: Mon, 28 May 2018 08:47:31 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0930624184==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 83D616E1AF for ; Mon, 28 May 2018 08:47:32 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0930624184== Content-Type: multipart/alternative; boundary="15274972513.533a.22183" Content-Transfer-Encoding: 7bit --15274972513.533a.22183 Date: Mon, 28 May 2018 08:47:31 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D106671 Michel D=C3=A4nzer changed: What |Removed |Added ---------------------------------------------------------------------------- Component|Driver/AMDgpu |Drivers/Gallium/radeonsi Assignee|xorg-driver-ati@lists.x.org |dri-devel@lists.freedesktop | |.org Version|unspecified |18.0 QA Contact|xorg-team@lists.x.org |dri-devel@lists.freedesktop | |.org Product|xorg |Mesa --- Comment #1 from Michel D=C3=A4nzer --- Please attach the corresponding full Xorg log and dmesg output. This is most likely between Mesa and the kernel; xf86-video-amdgpu doesn't contain any GPU specific rendering code which could cause hangs. I'd recomm= end trying latest upstream versions of Mesa (18.1) and the kernel, and if it st= ill happens, also try getting the current microcode files from https://git.kernel.org/pub/scm/linux/kernel/git/firmware/linux-firmware.git= /tree/amdgpu . --=20 You are receiving this mail because: You are the assignee for the bug.= --15274972513.533a.22183 Date: Mon, 28 May 2018 08:47:31 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated = Michel D=C3=A4nzer changed bug 10667= 1
What Removed Added
Component Driver/AMDgpu Drivers/Gallium/radeonsi
Assignee xorg-driver-ati@lists.x.org dri-devel@lists.freedesktop.org
Version unspecified 18.0
QA Contact xorg-team@lists.x.org dri-devel@lists.freedesktop.org
Product xorg Mesa

Commen= t # 1 on bug 10667= 1 from Michel D=C3=A4nzer
Please attach the corresponding full Xorg log and dmesg output.

This is most likely between Mesa and the kernel; xf86-video-amdgpu doesn't
contain any GPU specific rendering code which could cause hangs. I'd recomm=
end
trying latest upstream versions of Mesa (18.1) and the kernel, and if it st=
ill
happens, also try getting the current microcode files from
https://git.kernel.org/pub/scm/linux/kernel/git/fi=
rmware/linux-firmware.git/tree/amdgpu
.


You are receiving this mail because:
  • You are the assignee for the bug.
= --15274972513.533a.22183-- --===============0930624184== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg== --===============0930624184==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 106671] Frequent lock ups for AMD RX 550 graphics card Date: Mon, 28 May 2018 18:07:30 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0547863745==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 91EEB6E21E for ; Mon, 28 May 2018 18:07:30 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0547863745== Content-Type: multipart/alternative; boundary="15275308500.dcCf.29984" Content-Transfer-Encoding: 7bit --15275308500.dcCf.29984 Date: Mon, 28 May 2018 18:07:30 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D106671 --- Comment #2 from Alan W. Irwin --- Created attachment 139816 --> https://bugs.freedesktop.org/attachment.cgi?id=3D139816&action=3Dedit X log file as requested --=20 You are receiving this mail because: You are the assignee for the bug.= --15275308500.dcCf.29984 Date: Mon, 28 May 2018 18:07:30 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated

Commen= t # 2 on bug 10667= 1 from Alan W. Irwin
Created attachment 13981=
6 [details]
X log file as requested


You are receiving this mail because:
  • You are the assignee for the bug.
= --15275308500.dcCf.29984-- --===============0547863745== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg== --===============0547863745==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 106671] Frequent lock ups for AMD RX 550 graphics card Date: Mon, 28 May 2018 18:08:14 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1469515796==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id D3AA46E295 for ; Mon, 28 May 2018 18:08:14 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============1469515796== Content-Type: multipart/alternative; boundary="15275308940.31BB76ba2.30460" Content-Transfer-Encoding: 7bit --15275308940.31BB76ba2.30460 Date: Mon, 28 May 2018 18:08:14 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D106671 --- Comment #3 from Alan W. Irwin --- Created attachment 139817 --> https://bugs.freedesktop.org/attachment.cgi?id=3D139817&action=3Dedit dmesg output as requested --=20 You are receiving this mail because: You are the assignee for the bug.= --15275308940.31BB76ba2.30460 Date: Mon, 28 May 2018 18:08:14 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated

Commen= t # 3 on bug 10667= 1 from Alan W. Irwin
Created attachment 139=
817 [details]
dmesg output as requested


You are receiving this mail because:
  • You are the assignee for the bug.
= --15275308940.31BB76ba2.30460-- --===============1469515796== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg== --===============1469515796==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 106671] Frequent lock ups for AMD RX 550 graphics card Date: Mon, 28 May 2018 18:32:20 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0399777697==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 22CFC6E259 for ; Mon, 28 May 2018 18:32:20 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0399777697== Content-Type: multipart/alternative; boundary="15275323400.bE1dF8EF3.7863" Content-Transfer-Encoding: 7bit --15275323400.bE1dF8EF3.7863 Date: Mon, 28 May 2018 18:32:20 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D106671 --- Comment #4 from Alan W. Irwin --- Hi Michel: I have added your requested attachments. And if there are other data you n= eed or other tests I can run, let me know. Meanwhile, all else seems well with this new computer (e.g., the lock ups a= re gone under my normal KDE desktop use since I bypassed using this card 3 days ago by displaying my desktop on an X server running on a different computer= .=20 But that is only a temporary workaround (another person needs to use that o= ther computer's display and keyboard/mouse). Therefore, I need the RX 550 to wo= rk reliably on my new computer which is why I will be following your recommendations with regard to trying the latest kernel, mesa, and (if all = else fails) firmware. But building kernel and mesa is going to take me consider= able time for the reasons I mentioned in my original post. --=20 You are receiving this mail because: You are the assignee for the bug.= --15275323400.bE1dF8EF3.7863 Date: Mon, 28 May 2018 18:32:20 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated

Commen= t # 4 on bug 10667= 1 from Alan W. Irwin
Hi Michel:

I have added your requested attachments.  And if there are other data you n=
eed
or other tests I can run, let me know.

Meanwhile, all else seems well with this new computer (e.g., the lock ups a=
re
gone under my normal KDE desktop use since I bypassed using this card 3 days
ago by displaying my desktop on an X server running on a different computer=
.=20
But that is only a temporary workaround (another person needs to use that o=
ther
computer's display and keyboard/mouse).  Therefore, I need the RX 550 to wo=
rk
reliably on my new computer which is why I will be following your
recommendations with regard to trying the latest kernel, mesa, and (if all =
else
fails) firmware.  But building kernel and mesa is going to take me consider=
able
time for the reasons I mentioned in my original post.


You are receiving this mail because:
  • You are the assignee for the bug.
= --15275323400.bE1dF8EF3.7863-- --===============0399777697== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg== --===============0399777697==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 106671] Frequent lock ups for AMD RX 550 graphics card Date: Sat, 02 Jun 2018 16:51:26 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1842821935==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [IPv6:2610:10:20:722:a800:ff:fe98:4b55]) by gabe.freedesktop.org (Postfix) with ESMTP id C6B8B6E0FF for ; Sat, 2 Jun 2018 16:51:27 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============1842821935== Content-Type: multipart/alternative; boundary="15279582870.269fEddEA.7373" Content-Transfer-Encoding: 7bit --15279582870.269fEddEA.7373 Date: Sat, 2 Jun 2018 16:51:27 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D106671 --- Comment #5 from Alan W. Irwin --- Hi Michel: Since the lock ups occurred during ordinary (KDE) desktop use when I wasn't running 3D games I ignored mesa upgrades and instead concentrated first on trying a new kernel version (from 4.16.5 to 4.16.12 because 4.16.12 had conveniently just been propagated from Debian Sid to Buster). And so far it appears that upgrade makes a large improvement for the RX 550. Previously = for 4.16.5 the uptimes before a lock up occurred ranged from 7 hours to 2 days,= but right now with heavy desktop use and a substantial number of runs of the 3D game, I haven't experienced a single lock up with 4.16.12 with current upti= me since I booted 4.16.12 approaching 3 days. You may well conclude "problem already solved", but I normally run my compu= ter 24/7 with reboots only when absolutely necessary. Therefore I would like to keep this bug report open for a while just to report the maximum uptimes (hopefully at least several months) I can achieve with this graphics card. --=20 You are receiving this mail because: You are the assignee for the bug.= --15279582870.269fEddEA.7373 Date: Sat, 2 Jun 2018 16:51:27 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated

Commen= t # 5 on bug 10667= 1 from Alan W. Irwin
Hi Michel:
Since the lock ups occurred during ordinary (KDE) desktop use when I wasn't
running 3D games I ignored mesa upgrades and instead concentrated first on
trying a new kernel version (from 4.16.5 to 4.16.12 because 4.16.12 had
conveniently just been propagated from Debian Sid to Buster).  And so far it
appears that upgrade makes a large improvement for the RX 550.  Previously =
for
4.16.5 the uptimes before a lock up occurred ranged from 7 hours to 2 days,=
 but
right now with heavy desktop use and a substantial number of runs of the 3D
game, I haven't experienced a single lock up with 4.16.12 with current upti=
me
since I booted 4.16.12 approaching 3 days.
You may well conclude "problem already solved", but I normally ru=
n my computer
24/7 with reboots only when absolutely necessary.  Therefore I would like to
keep this bug report open for a while just to report the maximum uptimes
(hopefully at least several months) I can achieve with this graphics card.<=
/pre>
        


You are receiving this mail because:
  • You are the assignee for the bug.
= --15279582870.269fEddEA.7373-- --===============1842821935== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg== --===============1842821935==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 106671] Frequent lock ups for AMD RX 550 graphics card Date: Sat, 02 Jun 2018 16:54:11 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1586929103==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [IPv6:2610:10:20:722:a800:ff:fe98:4b55]) by gabe.freedesktop.org (Postfix) with ESMTP id 8E1166E10B for ; Sat, 2 Jun 2018 16:54:11 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============1586929103== Content-Type: multipart/alternative; boundary="15279584510.Db9bD.7138" Content-Transfer-Encoding: 7bit --15279584510.Db9bD.7138 Date: Sat, 2 Jun 2018 16:54:11 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D106671 --- Comment #6 from Alan W. Irwin --- Edit of previous comment: of the 3D game, -> of the 3D game, foobillard, --=20 You are receiving this mail because: You are the assignee for the bug.= --15279584510.Db9bD.7138 Date: Sat, 2 Jun 2018 16:54:11 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated

Commen= t # 6 on bug 10667= 1 from Alan W. Irwin
Edit of previous comment:

of the 3D game, -> of the 3D game, foobillard,


You are receiving this mail because:
  • You are the assignee for the bug.
= --15279584510.Db9bD.7138-- --===============1586929103== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg== --===============1586929103==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 106671] Frequent lock ups for AMD RX 550 graphics card Date: Wed, 29 Aug 2018 23:25:07 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1774426923==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 2CEE86E660 for ; Wed, 29 Aug 2018 23:25:07 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============1774426923== Content-Type: multipart/alternative; boundary="15355851071.68A7aC8.6883" Content-Transfer-Encoding: 7bit --15355851071.68A7aC8.6883 Date: Wed, 29 Aug 2018 23:25:07 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D106671 --- Comment #7 from Alan W. Irwin --- Please remove the resolution of this bug as FIXED. The reason for this request is subsequent kernel-4.16.x use after the initi= al success I reported continued to show lock ups whenever this graphics card w= as used. Yesterday, I tried kernel-4.17.17-1 from Debian Buster (the first ti= me I had tried any kernel-4.17.x version) in great anticipation these kernel loc= kups would be fixed (since kernel-4.17.x apparently contains lots of AMD graphics fixes). But when I used this graphics card for ordinary direct desktop use= (as opposed to accessing my desktop on the new computer via an X-terminal which= is so far the only stable way I can use my new computer), I got a lockup withi= n a half hour or so followed by one roughly 8 hours later. For what it is wort= h, I have also installed mesa-8.1.6-1 and version 20180518-1 of the firmware-amd-graphics package from Debian Buster before performing this fai= ling experiment. So it appears the substantial number of AMD graphics fixes in kernel-4.17.x= and mesa-18.1.y and installation of the relatively recent (from May) Debian Bus= ter firmware-amd-graphics package are not sufficient to stabilize use of this A= MD RX 550 graphics card. That is a big disappointment since this card should = no longer be considered cutting-edge hardware (i.e. it was first offered for s= ale at least 16 months ago) and this delay in fixing it cannot be attributed to non-cooperation from AMD since they appear to have a good open-source recor= d. Because of these on-going issues with direct use of this card, I am going back to using the X-terminal method with this kernel which experience with kernel-4.16.x shows is much more stable since it avoids using this graphics card completely (except for the direct display of the Linux console login prompt). I plan to again try the experiment of attempting to use this card directly = when kernel-4.18.x is promoted to Buster. But meanwhile, if you have any other suggestions I could try, please let me know. --=20 You are receiving this mail because: You are the assignee for the bug.= --15355851071.68A7aC8.6883 Date: Wed, 29 Aug 2018 23:25:07 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated

Commen= t # 7 on bug 10667= 1 from Alan W. Irwin
Please remove the resolution of this bug as FIXED.

The reason for this request is subsequent kernel-4.16.x use after the initi=
al
success I reported continued to show lock ups whenever this graphics card w=
as
used.  Yesterday, I tried kernel-4.17.17-1 from Debian Buster (the first ti=
me I
had tried any kernel-4.17.x version) in great anticipation these kernel loc=
kups
would be fixed (since kernel-4.17.x apparently contains lots of AMD graphics
fixes).  But when I used this graphics card for ordinary direct desktop use=
 (as
opposed to accessing my desktop on the new computer via an X-terminal which=
 is
so far the only stable way I can use my new computer), I got a lockup withi=
n a
half hour or so followed by one roughly 8 hours later.  For what it is wort=
h, I
have also installed mesa-8.1.6-1 and version 20180518-1 of the
firmware-amd-graphics package from Debian Buster before performing this fai=
ling
experiment.

So it appears the substantial number of AMD graphics fixes in kernel-4.17.x=
 and
mesa-18.1.y and installation of the relatively recent (from May) Debian Bus=
ter
firmware-amd-graphics package are not sufficient to stabilize use of this A=
MD
RX 550 graphics card.  That is a big disappointment since this card should =
no
longer be considered cutting-edge hardware (i.e. it was first offered for s=
ale
at least 16 months ago) and this delay in fixing it cannot be attributed to
non-cooperation from AMD since they appear to have a good open-source recor=
d.

Because of these on-going issues with direct use of this card, I am
going back to using the X-terminal method with this kernel which
experience with kernel-4.16.x shows is much more stable since it
avoids using this graphics card completely (except for the direct
display of the Linux console login prompt).

I plan to again try the experiment of attempting to use this card directly =
when
kernel-4.18.x is promoted to Buster.  But meanwhile, if you have any other
suggestions I could try, please let me know.


You are receiving this mail because:
  • You are the assignee for the bug.
= --15355851071.68A7aC8.6883-- --===============1774426923== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg== --===============1774426923==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 106671] Frequent lock ups for AMD RX 550 graphics card Date: Tue, 04 Sep 2018 19:26:41 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0872216635==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 2BFFD6E236 for ; Tue, 4 Sep 2018 19:26:41 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0872216635== Content-Type: multipart/alternative; boundary="15360892010.50468fAf.27000" Content-Transfer-Encoding: 7bit --15360892010.50468fAf.27000 Date: Tue, 4 Sep 2018 19:26:41 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D106671 --- Comment #8 from Alan W. Irwin --- Created attachment 141451 --> https://bugs.freedesktop.org/attachment.cgi?id=3D141451&action=3Dedit tarball containing kern.log, syslog, and dmesg output --=20 You are receiving this mail because: You are the assignee for the bug.= --15360892010.50468fAf.27000 Date: Tue, 4 Sep 2018 19:26:41 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated

Commen= t # 8 on bug 10667= 1 from Alan W. Irwin
Created attachment 141451 [details]
tarball containing kern.log, syslog, and dmesg output


You are receiving this mail because:
  • You are the assignee for the bug.
= --15360892010.50468fAf.27000-- --===============0872216635== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg== --===============0872216635==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 106671] Frequent lock ups for AMD RX 550 graphics card Date: Tue, 04 Sep 2018 19:27:43 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1770632117==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [IPv6:2610:10:20:722:a800:ff:fe98:4b55]) by gabe.freedesktop.org (Postfix) with ESMTP id B7F8289C3B for ; Tue, 4 Sep 2018 19:27:42 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============1770632117== Content-Type: multipart/alternative; boundary="15360892620.DB47A2.27165" Content-Transfer-Encoding: 7bit --15360892620.DB47A2.27165 Date: Tue, 4 Sep 2018 19:27:42 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D106671 --- Comment #9 from Alan W. Irwin --- We (there are two of us using this machine) just got yet another kernel loc= kup (no remote access possible with ssh, direct keyboard not working), but this= is a case when we were remotely accessing this box with an X-terminal. In oth= er words, the only use of the RX 550 was to display the command-line login pro= mpt for the Linux console of the directly attached monitor until the lockup whe= re it displayed the following message (roughly 15 times in the half-hour befor= e I got out of the lockup by pushing the reset button.) watchdog: BUG: soft lockup - CPU#12 stuck for 22s! [firefox-esr:29266] (At the time we were both browsing different sites with firefox with one of those firefox instances running a couple of days, and as a security measure= we both restrict the use of javascript with the noscript extension to firefox.) I have attached a tarball containing log files (kern.log and syslog) that contain the lockup information (including the above message) as well as information about the fresh boot afterwards. (For what it is worth, that tarball also includes dmesg output which appears to contain information only about the fresh boot.) For this minimal use case for the RX 550, the Linux kernel lasted 6 days be= fore the lockup which is much better than the direct use case where the lockups = can occur as soon as a half hour after a fresh boot. So the current lockup cou= ld be due to an entirely different bug than in the lockups I have encountered = for the direct use case. But, of course, minimal use is not zero use so curren= tly I ascribe both the present remote-use lockup and the previous direct-use lockups to some incompatibility between the RX 550 and the Debian Testing graphics stack. That stack currently includes the following component versions: linux-image-4.17.0-3-amd64 4.17.17-1 firmware-amd-graphics 20180518-1 libdrm-amdgpu1:amd64 2.4.93-1 libglapi-mesa:amd64 18.1.6-1 xserver-xorg-video-amdgpu 18.0.1-1+b1 Please let me know if there are any other data you need or any experiments = you would like me to try. In any case I plan to continue with remote use of th= is box while reporting lockup incidents as they occur. But I also plan to try direct use again whenever one of the components of the above stack gets significantly upgraded for Debian Testing. --=20 You are receiving this mail because: You are the assignee for the bug.= --15360892620.DB47A2.27165 Date: Tue, 4 Sep 2018 19:27:42 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated

Commen= t # 9 on bug 10667= 1 from Alan W. Irwin
We (there are two of us using this machine) just got yet anoth=
er kernel lockup
(no remote access possible with ssh, direct keyboard not working), but this=
 is
a case when we were remotely accessing this box with an X-terminal.  In oth=
er
words, the only use of the RX 550 was to display the command-line login pro=
mpt
for the Linux console of the directly attached monitor until the lockup whe=
re
it displayed the following message (roughly 15 times in the half-hour befor=
e I
got out of the lockup by pushing the reset button.)

watchdog: BUG: soft lockup - CPU#12 stuck for 22s! [firefox-esr:29266]

(At the time we were both browsing different sites with firefox with one of
those firefox instances running a couple of days, and as a security measure=
 we
both restrict the use of javascript with the noscript extension to firefox.)

I have attached a tarball containing log files (kern.log and syslog) that
contain the lockup information (including the above message) as well as
information about the fresh boot afterwards.  (For what it is worth, that
tarball also includes dmesg output which appears to contain information only
about the fresh boot.)

For this minimal use case for the RX 550, the Linux kernel lasted 6 days be=
fore
the lockup which is much better than the direct use case where the lockups =
can
occur as soon as a half hour after a fresh boot.  So the current lockup cou=
ld
be due to an entirely different bug than in the lockups I have encountered =
for
the direct use case.  But, of course, minimal use is not zero use so curren=
tly
I ascribe both the present remote-use lockup and the previous direct-use
lockups to some incompatibility between the RX 550 and the Debian Testing
graphics stack.  That stack currently includes the following component
versions:

linux-image-4.17.0-3-amd64                    4.17.17-1
firmware-amd-graphics                         20180518-1
libdrm-amdgpu1:amd64                          2.4.93-1
libglapi-mesa:amd64                           18.1.6-1
xserver-xorg-video-amdgpu                     18.0.1-1+b1

Please let me know if there are any other data you need or any experiments =
you
would like me to try.  In any case I plan to continue with remote use of th=
is
box while reporting lockup incidents as they occur.  But I also plan to try
direct use again whenever one of the components of the above stack gets
significantly upgraded for Debian Testing.


You are receiving this mail because:
  • You are the assignee for the bug.
= --15360892620.DB47A2.27165-- --===============1770632117== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg== --===============1770632117==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 106671] Frequent lock ups for AMD RX 550 graphics card Date: Wed, 05 Sep 2018 07:37:30 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1517982511==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [IPv6:2610:10:20:722:a800:ff:fe98:4b55]) by gabe.freedesktop.org (Postfix) with ESMTP id 23EC7899B0 for ; Wed, 5 Sep 2018 07:37:30 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============1517982511== Content-Type: multipart/alternative; boundary="15361330500.042A1aF5.17256" Content-Transfer-Encoding: 7bit --15361330500.042A1aF5.17256 Date: Wed, 5 Sep 2018 07:37:30 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D106671 --- Comment #10 from Michel D=C3=A4nzer --- (In reply to Alan W. Irwin from comment #9) > So the current lockup could be due to an entirely different bug than in t= he > lockups I have encountered for the direct use case. Yeah, that looks like an RCU or other core kernel issue, not directly relat= ed to the graphics drivers (which as you say, aren't really being used in this case). Does idle=3Dnomwait on the kernel command line help for any of these issues= , by any chance? It's also worth making sure the motherboard BIOS is up to date. --=20 You are receiving this mail because: You are the assignee for the bug.= --15361330500.042A1aF5.17256 Date: Wed, 5 Sep 2018 07:37:30 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated

Comme= nt # 10 on bug 10667= 1 from Michel D=C3=A4nzer
(In reply to Alan W. Irwin from comment #9)
> So the current lockup could be due to an entirel=
y different bug than in the
> lockups I have encountered for the direct use case.

Yeah, that looks like an RCU or other core kernel issue, not directly relat=
ed
to the graphics drivers (which as you say, aren't really being used in this
case).

Does idle=3Dnomwait on the kernel command line help for any of these issues=
, by
any chance?

It's also worth making sure the motherboard BIOS is up to date.


You are receiving this mail because:
  • You are the assignee for the bug.
= --15361330500.042A1aF5.17256-- --===============1517982511== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg== --===============1517982511==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 106671] Frequent lock ups for AMD RX 550 graphics card Date: Thu, 06 Sep 2018 01:26:14 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0895758928==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 6F9F46E462 for ; Thu, 6 Sep 2018 01:26:14 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0895758928== Content-Type: multipart/alternative; boundary="15361971740.aCfbcabFf.24062" Content-Transfer-Encoding: 7bit --15361971740.aCfbcabFf.24062 Date: Thu, 6 Sep 2018 01:26:14 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D106671 --- Comment #11 from Alan W. Irwin --- Thanks for that idle=3Dnomwait suggestion which I have now just tried (veri= fied by irwin@merlin> cat /proc/cmdline BOOT_IMAGE=3D/boot/vmlinuz-4.17.0-3-amd64 root=3DUUID=3D1e45a1ee-a5d6-4327-9a7b-2663ffc0b157 ro rootwait quiet idle= =3Dnomwait ) and I now indeed have a stable result. However, that is currently just f= or the last 5 minutes in remote access mode. :-) So we will see how this goes for, say, the next two weeks, to see if I can beat my last 4.17.17 remote access uptime record of 6 days. With regard to your MB BIOS update suggestion, I am going to hold back on t= hat for a while since the techs from a local computer company that assembled my= box in May felt such updates were dangerous and therefore a last resort. And t= hat is also the consistent advice I have gotten for the other 3 Linux boxes I h= ave had assembled for me since I started using Linux in 1996. Of course, this = year may be a special case with all the Meltdown (although not for this AMD hardware) and many variants of SPECTRE out there so I do plan to update the BIOS within the next couple of months on the assumption that the SPECTRE BI= OS mitigations recommended by AMD to ASUS for this hardware (PRIME B350+ MB wi= th AMD Ryzen 7 1700 CPU, 64GB RAM, and ASUS RX 550 graphics card) will have matured by then.=20=20 But before I implement that planned BIOS update, I am hoping that the curre= nt cutting-edge Linux graphics stack (which according to a senior Phoronix pos= ter works well for the RX 560) will also give me stable direct-display results = for the RX 550 once that version of the graphics stack propagates to Debian Testing. I estimate that propagation time will be a couple of more months based on how quickly elements of the cutting-edge Linux graphics stack such= as the kernel has propagated in the past from upstream to Debian Testing. In sum, it is a waiting game now to see if your idle=3Dnomwait suggestion restores the complete Linux stability I was used to with my old box (for De= bian Oldstable =3D Jessie) for at least the remote display case, and if that sta= bility is obviously much better (i.e., at least a couple of weeks uptime with no lockups) then I will try the direct display case again with idle=3Dnomwait = to see if it makes that case stable as well. Thanks, Michel, for your on-going helpful suggestions for dealing with this troubling instability issue (these troubling instability issues?) for my new Linux box. Alan --=20 You are receiving this mail because: You are the assignee for the bug.= --15361971740.aCfbcabFf.24062 Date: Thu, 6 Sep 2018 01:26:14 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated

Comme= nt # 11 on bug 10667= 1 from Alan W. Irwin
Thanks for that idle=3Dnomwait suggestion which I have now jus=
t tried (verified
by

irwin@merlin> cat /proc/cmdline
BOOT_IMAGE=3D/boot/vmlinuz-4.17.0-3-amd64
root=3DUUID=3D1e45a1ee-a5d6-4327-9a7b-2663ffc0b157 ro rootwait quiet idle=
=3Dnomwait

) and I now indeed have a stable result.  However, that is currently just f=
or
the last 5 minutes in remote access mode.  :-)  So we will see how this goes
for, say, the next two weeks, to see if I can beat my last 4.17.17 remote
access uptime record of 6 days.

With regard to your MB BIOS update suggestion, I am going to hold back on t=
hat
for a while since the techs from a local computer company that assembled my=
 box
in May felt such updates were dangerous and therefore a last resort.  And t=
hat
is also the consistent advice I have gotten for the other 3 Linux boxes I h=
ave
had assembled for me since I started using Linux in 1996.  Of course, this =
year
may be a special case with all the Meltdown (although not for this AMD
hardware) and many variants of SPECTRE out there so I do plan to update the
BIOS within the next couple of months on the assumption that the SPECTRE BI=
OS
mitigations recommended by AMD to ASUS for this hardware (PRIME B350+ MB wi=
th
AMD Ryzen 7 1700 CPU, 64GB RAM, and ASUS RX 550 graphics card) will have
matured by then.=20=20

But before I implement that planned BIOS update, I am hoping that the curre=
nt
cutting-edge Linux graphics stack (which according to a senior Phoronix pos=
ter
works well for the RX 560) will also give me stable direct-display results =
for
the RX 550 once that version of the graphics stack propagates to Debian
Testing.  I estimate that propagation time will be a couple of more months
based on how quickly elements of the cutting-edge Linux graphics stack such=
 as
the kernel has propagated in the past from upstream to Debian Testing.

In sum, it is a waiting game now to see if your idle=3Dnomwait suggestion
restores the complete Linux stability I was used to with my old box (for De=
bian
Oldstable =3D Jessie) for at least the remote display case, and if that sta=
bility
is obviously much better (i.e., at least a couple of weeks uptime with no
lockups) then I will try the direct display case again with idle=3Dnomwait =
to see
if it makes that case stable as well.

Thanks, Michel, for your on-going helpful suggestions for dealing with this
troubling instability issue (these troubling instability issues?) for my new
Linux box.

Alan


You are receiving this mail because:
  • You are the assignee for the bug.
= --15361971740.aCfbcabFf.24062-- --===============0895758928== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg== --===============0895758928==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 106671] Frequent lock ups for AMD RX 550 graphics card Date: Thu, 06 Sep 2018 06:56:27 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0850945763==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 3D466891E3 for ; Thu, 6 Sep 2018 06:56:27 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0850945763== Content-Type: multipart/alternative; boundary="15362169870.31AD9c1.24795" Content-Transfer-Encoding: 7bit --15362169870.31AD9c1.24795 Date: Thu, 6 Sep 2018 06:56:27 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D106671 --- Comment #12 from Alan W. Irwin --- (In reply to Michel D=C3=A4nzer from comment #10) > [T]hat looks like an RCU or other core kernel issue, not directly > related to the graphics drivers. Hi Michel: If so, should I report that probable non-graphics kernel bug (with my crash-report tarball) elsewhere? Or do you suggest I just forget it until I see what are the remote graphics results of idle=3Dnomwait over the course = of the next couple of weeks AND (if that is a success) the direct graphics results= of idle=3Dnomwait for a couple of more weeks after that? --=20 You are receiving this mail because: You are the assignee for the bug.= --15362169870.31AD9c1.24795 Date: Thu, 6 Sep 2018 06:56:27 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated

Comme= nt # 12 on bug 10667= 1 from Alan W. Irwin
(In reply to Michel D=C3=A4nzer from comment #10)
> [T]hat looks like an RCU or other core kernel is=
sue, not directly
> related to the graphics drivers.

Hi Michel:
If so, should I report that probable non-graphics kernel bug (with my
crash-report tarball) elsewhere?  Or do you suggest I just forget it until I
see what are the remote graphics results of idle=3Dnomwait over the course =
of the
next couple of weeks AND (if that is a success) the direct graphics results=
 of
idle=3Dnomwait for a couple of more weeks after that?


You are receiving this mail because:
  • You are the assignee for the bug.
= --15362169870.31AD9c1.24795-- --===============0850945763== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg== --===============0850945763==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 106671] Frequent lock ups for AMD RX 550 graphics card Date: Fri, 07 Sep 2018 22:23:35 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============2023988361==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [IPv6:2610:10:20:722:a800:ff:fe98:4b55]) by gabe.freedesktop.org (Postfix) with ESMTP id 7AFCC88DF5 for ; Fri, 7 Sep 2018 22:23:35 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============2023988361== Content-Type: multipart/alternative; boundary="15363590150.bEc4AFAf.11484" Content-Transfer-Encoding: 7bit --15363590150.bEc4AFAf.11484 Date: Fri, 7 Sep 2018 22:23:35 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D106671 --- Comment #13 from Alan W. Irwin --- Well, after 1.5 (successful) days with the remote graphics experiment, I decided instead it made more sense to go after the quicker acting instabili= ty that I have previously experienced in direct graphics mode. So just now I = have started a direct graphics experiment after a Debian Testing upgrade which included = the following firmware and mesa changes: firmware-amd-graphics updated "(20180825+dfsg-1) over (20180518-1)" mesa updated "(18.1.7-1) over (18.1.6-1)" In addition for this experiment I installed the amd64-microcode package that contains "microcode patches for all AMD AMD64 processors". Also, as part of this experiment I have continued with the idle=3Dnomwait k= ernel parameter as verified by=20 irwin@merlin> cat /proc/cmdline BOOT_IMAGE=3D/boot/vmlinuz-4.17.0-3-amd64 root=3DUUID=3D1e45a1ee-a5d6-4327-9a7b-2663ffc0b157 ro rootwait quiet idle= =3Dnomwait N.B. note those kernel parameters do not include any amdgpu-related paramet= ers. Do you recommend any such parameters for the RX 550 such as amdgpu.dc=3D1 = which is sometimes recommended for older versions of AMD new-generation graphics hardware? --=20 You are receiving this mail because: You are the assignee for the bug.= --15363590150.bEc4AFAf.11484 Date: Fri, 7 Sep 2018 22:23:35 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated

Comme= nt # 13 on bug 10667= 1 from Alan W. Irwin
Well, after 1.5 (successful) days with the remote graphics exp=
eriment, I
decided instead it made more sense to go after the quicker acting instabili=
ty
that I have previously experienced in direct graphics mode.  So just now I =
have
started
a direct graphics experiment after a Debian Testing upgrade which included =
the
following firmware and mesa changes:

firmware-amd-graphics updated "(20180825+dfsg-1) over (20180518-1)&quo=
t;

mesa updated "(18.1.7-1) over (18.1.6-1)"

In addition for this experiment I installed the  amd64-microcode package
that contains "microcode patches for all AMD AMD64 processors".

Also, as part of this experiment I have continued with the idle=3Dnomwait k=
ernel
parameter as verified by=20

irwin@merlin> cat /proc/cmdline
BOOT_IMAGE=3D/boot/vmlinuz-4.17.0-3-amd64
root=3DUUID=3D1e45a1ee-a5d6-4327-9a7b-2663ffc0b157 ro rootwait quiet idle=
=3Dnomwait

N.B. note those kernel parameters do not include any amdgpu-related paramet=
ers.
 Do you recommend any such parameters for the RX 550 such as amdgpu.dc=3D1 =
which
is sometimes recommended for older versions of AMD new-generation graphics
hardware?


You are receiving this mail because:
  • You are the assignee for the bug.
= --15363590150.bEc4AFAf.11484-- --===============2023988361== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg== --===============2023988361==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 106671] Frequent lock ups for AMD RX 550 graphics card Date: Fri, 07 Sep 2018 22:36:01 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0776079868==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [IPv6:2610:10:20:722:a800:ff:fe98:4b55]) by gabe.freedesktop.org (Postfix) with ESMTP id 576376E992 for ; Fri, 7 Sep 2018 22:36:01 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0776079868== Content-Type: multipart/alternative; boundary="15363597611.8bfe6.14338" Content-Transfer-Encoding: 7bit --15363597611.8bfe6.14338 Date: Fri, 7 Sep 2018 22:36:01 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D106671 --- Comment #14 from Alan W. Irwin --- Created attachment 141479 --> https://bugs.freedesktop.org/attachment.cgi?id=3D141479&action=3Dedit compressed dmesg output from current direct graphics experiment --=20 You are receiving this mail because: You are the assignee for the bug.= --15363597611.8bfe6.14338 Date: Fri, 7 Sep 2018 22:36:01 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated

Comme= nt # 14 on bug 10667= 1 from Alan W. Irwin
Created attachment 141479 [details]
compressed dmesg output from current direct graphics experiment


You are receiving this mail because:
  • You are the assignee for the bug.
= --15363597611.8bfe6.14338-- --===============0776079868== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg== --===============0776079868==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 106671] Frequent lock ups for AMD RX 550 graphics card Date: Fri, 14 Sep 2018 23:44:52 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1739581259==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [IPv6:2610:10:20:722:a800:ff:fe98:4b55]) by gabe.freedesktop.org (Postfix) with ESMTP id 7E2BD6E048 for ; Fri, 14 Sep 2018 23:44:53 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============1739581259== Content-Type: multipart/alternative; boundary="15369686931.AbcFaaB.3861" Content-Transfer-Encoding: 7bit --15369686931.AbcFaaB.3861 Date: Fri, 14 Sep 2018 23:44:53 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D106671 --- Comment #15 from Alan W. Irwin --- Created attachment 141567 --> https://bugs.freedesktop.org/attachment.cgi?id=3D141567&action=3Dedit log files from latest logup --=20 You are receiving this mail because: You are the assignee for the bug.= --15369686931.AbcFaaB.3861 Date: Fri, 14 Sep 2018 23:44:53 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated

Comme= nt # 15 on bug 10667= 1 from Alan W. Irwin
Created attachment 1=
41567 [details]
log files from latest logup


You are receiving this mail because:
  • You are the assignee for the bug.
= --15369686931.AbcFaaB.3861-- --===============1739581259== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg== --===============1739581259==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 106671] Frequent lock ups for AMD RX 550 graphics card Date: Sat, 15 Sep 2018 00:04:12 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0083626760==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [IPv6:2610:10:20:722:a800:ff:fe98:4b55]) by gabe.freedesktop.org (Postfix) with ESMTP id 561906E0C6 for ; Sat, 15 Sep 2018 00:04:12 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0083626760== Content-Type: multipart/alternative; boundary="15369698520.4102.7679" Content-Transfer-Encoding: 7bit --15369698520.4102.7679 Date: Sat, 15 Sep 2018 00:04:12 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D106671 --- Comment #16 from Alan W. Irwin --- I was beginning to have some hope that the latest direct access experiment would prove to be stable. However, just now it locked up again after almos= t 7 days. So the stability is substantially improved compared to before, and my guess is that improvement is due to installation of the amd64-microcode pac= kage from Debian Buster for this latest experiment.=20=20 However, this is still disappointing stability because typically for truly stable systems I achieve up times of 30 days or longer with the only limit = on uptime being how often I have to reboot due to kernel upgrades. I have attached a crash report tarball containing dmesg output as well as various log files that captured all log activity before the lockup and the = boot afterward. I don't see anything concerning the crash in those log files, b= ut I may be missing something since I am no expert so I would appreciate it if y= ou took a look. I have restarted exactly the same direct graphics access test again (with s= ame versions of graphics stack packages and your recommended idle=3Dnomwait ker= nel parameter in hopes that the kernel will last longer this time before the lo= ckup and/or I catch more details of the lockup when it occurs. If you would pre= fer me to try a different variant of this test, please let me know. --=20 You are receiving this mail because: You are the assignee for the bug.= --15369698520.4102.7679 Date: Sat, 15 Sep 2018 00:04:12 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated

Comme= nt # 16 on bug 10667= 1 from Alan W. Irwin
I was beginning to have some hope that the latest direct acces=
s experiment
would prove to be stable.  However, just now it locked up again after almos=
t 7
days.  So the stability is substantially improved compared to before, and my
guess is that improvement is due to installation of the amd64-microcode pac=
kage
from Debian Buster for this latest experiment.=20=20
However, this is still disappointing stability because typically for truly
stable systems I achieve up times of 30 days or longer with the only limit =
on
uptime being how often I have to reboot due to kernel upgrades.
I have attached a crash report tarball containing dmesg output as well as
various log files that captured all log activity before the lockup and the =
boot
afterward.  I don't see anything concerning the crash in those log files, b=
ut I
may be missing something since I am no expert so I would appreciate it if y=
ou
took a look.
I have restarted exactly the same direct graphics access test again (with s=
ame
versions of graphics stack packages and your recommended idle=3Dnomwait ker=
nel
parameter in hopes that the kernel will last longer this time before the lo=
ckup
and/or I catch more details of the lockup when it occurs.  If you would pre=
fer
me to try a different variant of this test, please let me know.


You are receiving this mail because:
  • You are the assignee for the bug.
= --15369698520.4102.7679-- --===============0083626760== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg== --===============0083626760==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 106671] Frequent lock ups for AMD RX 550 graphics card Date: Sat, 15 Sep 2018 00:52:58 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0680609584==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [IPv6:2610:10:20:722:a800:ff:fe98:4b55]) by gabe.freedesktop.org (Postfix) with ESMTP id 551F36E0AC for ; Sat, 15 Sep 2018 00:52:58 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0680609584== Content-Type: multipart/alternative; boundary="15369727780.ad68.19756" Content-Transfer-Encoding: 7bit --15369727780.ad68.19756 Date: Sat, 15 Sep 2018 00:52:58 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D106671 --- Comment #17 from Alan W. Irwin --- I terminated the last test immediately because it turns out a new kernel (L= inux merlin 4.18.0-1-amd64 #1 SMP Debian 4.18.6-1 (2018-09-06) x86_64 GNU/Linux)= has propagated from Debian Unstable to Debian Testing =3D Buster so I will use = that kernel for my new test. On boot with this new kernel the usual blast of ra= ndom color on the Linux console displayed by the RX 550 that I am used to for all previous kernel versions is now gone. So that is a positive step in the ri= ght direction, and I hope that means the Debian Buster graphics stack is finally completely stable for the RX 550, but I will test that hypothesis with this latest test.=20=20 The latest Debian Buster graphics stack versions for this direct graphics kernel stability test for the RX 550 are as follows: linux-image-4.18.0-1-amd64 4.18.6-1 amd64-microcode 3.20180524.1 firmware-amd-graphics 20180825+dfsg-1 libdrm-amdgpu1:amd64 2.4.94-1 libglapi-mesa:amd64 18.1.7-1 xserver-xorg-video-amdgpu 18.0.1-1+b1 Here are my kernel parameters which includes the suggested idle=3Dnomwait: irwin@merlin> cat /proc/cmdline BOOT_IMAGE=3D/boot/vmlinuz-4.18.0-1-amd64 root=3DUUID=3D1e45a1ee-a5d6-4327-9a7b-2663ffc0b157 ro rootwait quiet idle= =3Dnomwait --=20 You are receiving this mail because: You are the assignee for the bug.= --15369727780.ad68.19756 Date: Sat, 15 Sep 2018 00:52:58 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated

Comme= nt # 17 on bug 10667= 1 from Alan W. Irwin
I terminated the last test immediately because it turns out a =
new kernel (Linux
merlin 4.18.0-1-amd64 #1 SMP Debian 4.18.6-1 (2018-09-06) x86_64 GNU/Linux)=
 has
propagated from Debian Unstable to Debian Testing =3D Buster so I will use =
that
kernel for my new test.  On boot with this new kernel the usual blast of ra=
ndom
color on the Linux console displayed by the RX 550 that I am used to for all
previous kernel versions is now gone.  So that is a positive step in the ri=
ght
direction, and I hope that means the Debian Buster graphics stack is finally
completely stable for the RX 550, but I will test that hypothesis with this
latest test.=20=20

The latest Debian Buster graphics stack versions for this direct graphics
kernel stability test for the RX 550 are as follows:

linux-image-4.18.0-1-amd64                    4.18.6-1
amd64-microcode                               3.20180524.1
firmware-amd-graphics                         20180825+dfsg-1
libdrm-amdgpu1:amd64                          2.4.94-1
libglapi-mesa:amd64                           18.1.7-1
xserver-xorg-video-amdgpu                     18.0.1-1+b1

Here are my kernel parameters which includes the suggested idle=3Dnomwait:
irwin@merlin> cat /proc/cmdline
BOOT_IMAGE=3D/boot/vmlinuz-4.18.0-1-amd64
root=3DUUID=3D1e45a1ee-a5d6-4327-9a7b-2663ffc0b157 ro rootwait quiet idle=
=3Dnomwait


You are receiving this mail because:
  • You are the assignee for the bug.
= --15369727780.ad68.19756-- --===============0680609584== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg== --===============0680609584==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 106671] Frequent lock ups for AMD RX 550 graphics card Date: Mon, 24 Sep 2018 04:44:53 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1411790540==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 743E46E1D2 for ; Mon, 24 Sep 2018 04:44:53 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============1411790540== Content-Type: multipart/alternative; boundary="15377642931.e5f8a4abb.12493" Content-Transfer-Encoding: 7bit --15377642931.e5f8a4abb.12493 Date: Mon, 24 Sep 2018 04:44:53 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D106671 --- Comment #18 from Alan W. Irwin --- Created attachment 141706 --> https://bugs.freedesktop.org/attachment.cgi?id=3D141706&action=3Dedit tarball containing log information concerning latest lockup --=20 You are receiving this mail because: You are the assignee for the bug.= --15377642931.e5f8a4abb.12493 Date: Mon, 24 Sep 2018 04:44:53 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated

Comme= nt # 18 on bug 10667= 1 from Alan W. Irwin
Created attachment 141706 [details]
tarball containing log information concerning latest lockup


You are receiving this mail because:
  • You are the assignee for the bug.
= --15377642931.e5f8a4abb.12493-- --===============1411790540== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg== --===============1411790540==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 106671] Frequent lock ups for AMD RX 550 graphics card Date: Mon, 24 Sep 2018 05:21:18 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1055106413==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [IPv6:2610:10:20:722:a800:ff:fe98:4b55]) by gabe.freedesktop.org (Postfix) with ESMTP id 88FBF6E1D2 for ; Mon, 24 Sep 2018 05:21:18 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============1055106413== Content-Type: multipart/alternative; boundary="15377664780.125Eb8.20555" Content-Transfer-Encoding: 7bit --15377664780.125Eb8.20555 Date: Mon, 24 Sep 2018 05:21:18 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D106671 --- Comment #19 from Alan W. Irwin --- Despite a new kernel, this instability issue has continued. Kernel 4.18.6 locked up after 8+ days of up time on our principal computer that has the RX 550 graphics card installed. (I will refer to this computer as the "new" computer, our other working Linux computer that is used to display X results from the new computer as the X-terminal, and our old principal computer (powered down permanently now) as the "old" computer.) The lockup of the new computer occurred some time in the early morning and (since two users use t= his machine at one time) with one inactive XFCE desktop being displayed on our X-terminal and one inactive XFCE desktop being displayed directly on the new computer. The only symptom of the lockup I could spot in the log files was= a burst of null bytes in each log file. For what it is worth that symptom is new. See the attached crash_report_20180923.tar.gz for the log file and dm= esg details. This result of 8+ days of up time for direct graphics desktop use of the new computer is slightly better than the almost 7 days of up time achieved for = the previous similar test for kernel 4.17.7. Although the present up time resu= lt at least encourages further testing with kernel 4.18.x, this is only one te= st, and the next test might give a substantially shorter or longer up time. In = any case this result is still far from ideal since such lockups never occurred = on the old computer that this new computer replaced and also do not currently occur for the X-terminal. That is, on the old principal box up times excee= ding 30 days have been common and similarly on the X-terminal, and the only reas= on I rebooted in those cases was power interruptions or the installation of a new kernel. For the present case of the new box, the lockups mean the only recovery possible is to hit the reset button with all that implies about journal recovery and potential file deletion for files that are in inconsis= tent shape due to the lockup. For what it is worth, the lockup symptoms this time were a bit different th= an before. The new computer had a frozen display (rather than blank before), = and frozen mouse and keyboard (as before). The X-terminal used to remotely acc= ess a desktop running on the new computer had a frozen display (rather than blanked) with working keyboard (and maybe mouse, but I didn't record that) = so I could exit the local X and get to the Linux console where ping to the new computer actually worked (as opposed to ping not working at all for the previous lockup). So because networking was working, ssh to the new comput= er didn't time out. However, it ran for 20+ minutes with no sign of a login so the net result was the same as for previous lockups; there was no way to lo= gin to the new computer from another computer to shut down the new computer normally so the only method of shutting it down was to hit the reset button. --=20 You are receiving this mail because: You are the assignee for the bug.= --15377664780.125Eb8.20555 Date: Mon, 24 Sep 2018 05:21:18 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated

Comme= nt # 19 on bug 10667= 1 from Alan W. Irwin
Despite a new kernel, this instability issue has continued.  K=
ernel 4.18.6
locked up after 8+ days of up time on our principal computer that has the RX
550 graphics card installed.  (I will refer to this computer as the "n=
ew"
computer, our other working Linux computer that is used to display X results
from the new computer as the X-terminal, and our old principal computer
(powered down permanently now) as the "old" computer.) The lockup=
 of the new
computer occurred some time in the early morning and (since two users use t=
his
machine at one time) with one inactive XFCE desktop being displayed on our
X-terminal and one inactive XFCE desktop being displayed directly on the new
computer.  The only symptom of the lockup I could spot in the log files was=
 a
burst of null bytes in each log file.  For what it is worth that symptom is
new.  See the attached crash_report_20180923.tar.gz for the log file and dm=
esg
details.

This result of 8+ days of up time for direct graphics desktop use of the new
computer is slightly better than the almost 7 days of up time achieved for =
the
previous similar test for kernel 4.17.7.  Although the present up time resu=
lt
at least encourages further testing with kernel 4.18.x, this is only one te=
st,
and the next test might give a substantially shorter or longer up time. In =
any
case this result is still far from ideal since such lockups never occurred =
on
the old computer that this new computer replaced and also do not currently
occur for the X-terminal.  That is, on the old principal box up times excee=
ding
30 days have been common and similarly on the X-terminal, and the only reas=
on I
rebooted in those cases was power interruptions or the installation of a new
kernel.  For the present case of the new box, the lockups mean the only
recovery possible is to hit the reset button with all that implies about
journal recovery and potential file deletion for files that are in inconsis=
tent
shape due to the lockup.

For what it is worth, the lockup symptoms this time were a bit different th=
an
before.  The new computer had a frozen display (rather than blank before), =
and
frozen mouse and keyboard (as before).  The X-terminal used to remotely acc=
ess
a desktop running on the new computer had a frozen display (rather than
blanked) with working keyboard (and maybe mouse, but I didn't record that) =
so I
could exit the local X and get to the Linux console where ping to the new
computer actually worked (as opposed to ping not working at all for the
previous lockup).  So because networking was working, ssh to the new comput=
er
didn't time out.  However, it ran for 20+ minutes with no sign of a login so
the net result was the same as for previous lockups; there was no way to lo=
gin
to the new computer from another computer to shut down the new computer
normally so the only method of shutting it down was to hit the reset button=
.


You are receiving this mail because:
  • You are the assignee for the bug.
= --15377664780.125Eb8.20555-- --===============1055106413== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg== --===============1055106413==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 106671] Frequent lock ups for AMD RX 550 graphics card Date: Mon, 24 Sep 2018 05:34:53 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0915718485==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [IPv6:2610:10:20:722:a800:ff:fe98:4b55]) by gabe.freedesktop.org (Postfix) with ESMTP id 067656E1D4 for ; Mon, 24 Sep 2018 05:34:53 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0915718485== Content-Type: multipart/alternative; boundary="15377672921.F30F4.23559" Content-Transfer-Encoding: 7bit --15377672921.F30F4.23559 Date: Mon, 24 Sep 2018 05:34:52 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D106671 --- Comment #20 from Alan W. Irwin --- I started a new stability test as of 2018-09-23 15:34:19 right after a Debi= an Buster dist-upgrade. The graphics stack versions for this test are as foll= ows: ii amd64-microcode 3.20180524.1=20=20=20=20= =20=20=20=20=20=20=20=20=20=20=20=20=20 amd64 Processor microcode firmware for AMD CPUs ii firmware-amd-graphics 20180825+dfsg-1=20=20=20= =20=20=20=20=20=20=20=20=20=20=20 all Binary firmware for AMD/ATI graphics chips ii libdrm-amdgpu1:amd64 2.4.94-1=20=20=20=20=20= =20=20=20=20=20=20=20=20=20=20=20=20=20=20=20=20 amd64 Userspace interface to amdgpu-specific kernel DRM services -- runtime ii libglapi-mesa:amd64 18.1.7-1=20=20=20=20=20= =20=20=20=20=20=20=20=20=20=20=20=20=20=20=20=20 amd64 free implementation of the GL API -- shared library ii linux-image-4.18.0-1-amd64 4.18.6-1=20=20=20=20=20= =20=20=20=20=20=20=20=20=20=20=20=20=20=20=20=20 amd64 Linux 4.18 for 64-bit PCs ii xserver-xorg-video-amdgpu 18.1.0-1=20=20=20=20=20= =20=20=20=20=20=20=20=20=20=20=20=20=20=20=20=20 amd64 X.Org X server -- AMDGPU display driver That is, these versions are identical to the previous test other than the (substantial) update of the AMDGPU display driver from version 18.0.1-1= +b1 to version 18.1.0-1. The kernel parameters were the same as the previous test, e.g., BOOT_IMAGE=3D/boot/vmlinuz-4.18.0-1-amd64 root=3DUUID=3D1e45a1ee-a5d6-4327-9a7b-2663ffc0b157 ro rootwait quiet idle= =3Dnomwait --=20 You are receiving this mail because: You are the assignee for the bug.= --15377672921.F30F4.23559 Date: Mon, 24 Sep 2018 05:34:52 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated

Comme= nt # 20 on bug 10667= 1 from Alan W. Irwin
I started a new stability test as of 2018-09-23 15:34:19 right=
 after a Debian
Buster dist-upgrade.  The graphics stack versions for this test are as foll=
ows:

ii  amd64-microcode                               3.20180524.1=20=20=20=20=
=20=20=20=20=20=20=20=20=20=20=20=20=20
 amd64        Processor microcode firmware for AMD CPUs
ii  firmware-amd-graphics                         20180825+dfsg-1=20=20=20=
=20=20=20=20=20=20=20=20=20=20=20
 all          Binary firmware for AMD/ATI graphics chips
ii  libdrm-amdgpu1:amd64                          2.4.94-1=20=20=20=20=20=
=20=20=20=20=20=20=20=20=20=20=20=20=20=20=20=20
 amd64        Userspace interface to amdgpu-specific kernel DRM services --
runtime
ii  libglapi-mesa:amd64                           18.1.7-1=20=20=20=20=20=
=20=20=20=20=20=20=20=20=20=20=20=20=20=20=20=20
 amd64        free implementation of the GL API -- shared library
ii  linux-image-4.18.0-1-amd64                    4.18.6-1=20=20=20=20=20=
=20=20=20=20=20=20=20=20=20=20=20=20=20=20=20=20
 amd64        Linux 4.18 for 64-bit PCs
ii  xserver-xorg-video-amdgpu                     18.1.0-1=20=20=20=20=20=
=20=20=20=20=20=20=20=20=20=20=20=20=20=20=20=20
 amd64        X.Org X server -- AMDGPU display driver

That is, these versions are identical to the previous test other than
the (substantial) update of the AMDGPU display driver from version 18.0.1-1=
+b1
to version 18.1.0-1.

The kernel parameters were the same as the previous test, e.g.,

BOOT_IMAGE=3D/boot/vmlinuz-4.18.0-1-amd64
root=3DUUID=3D1e45a1ee-a5d6-4327-9a7b-2663ffc0b157 ro rootwait quiet idle=
=3Dnomwait


You are receiving this mail because:
  • You are the assignee for the bug.
= --15377672921.F30F4.23559-- --===============0915718485== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg== --===============0915718485==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 106671] Frequent lock ups for AMD RX 550 graphics card Date: Mon, 24 Sep 2018 19:35:15 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0545474508==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 444B66E09C for ; Mon, 24 Sep 2018 19:35:15 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0545474508== Content-Type: multipart/alternative; boundary="15378177150.39c55f34f.19025" Content-Transfer-Encoding: 7bit --15378177150.39c55f34f.19025 Date: Mon, 24 Sep 2018 19:35:15 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D106671 --- Comment #21 from Alan W. Irwin --- Created attachment 141724 --> https://bugs.freedesktop.org/attachment.cgi?id=3D141724&action=3Dedit tarball containing kern.log, syslog, and dmesg output --=20 You are receiving this mail because: You are the assignee for the bug.= --15378177150.39c55f34f.19025 Date: Mon, 24 Sep 2018 19:35:15 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated

Comme= nt # 21 on bug 10667= 1 from Alan W. Irwin
Created attachment 141724 [details]
tarball containing kern.log, syslog, and dmesg output


You are receiving this mail because:
  • You are the assignee for the bug.
= --15378177150.39c55f34f.19025-- --===============0545474508== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg== --===============0545474508==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 106671] Frequent lock ups for AMD RX 550 graphics card Date: Mon, 24 Sep 2018 19:58:45 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0648965934==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id EEE4C6E341 for ; Mon, 24 Sep 2018 19:58:44 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0648965934== Content-Type: multipart/alternative; boundary="15378191240.6d6f9.23889" Content-Transfer-Encoding: 7bit --15378191240.6d6f9.23889 Date: Mon, 24 Sep 2018 19:58:44 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D106671 --- Comment #22 from Alan W. Irwin --- This last stability test lasted only 17.5 hours before the lockup. See the latest attached tarball for the relevant log files (which capture everything during this short up time) and dmesg output. As far as I can tell there is nothing in those log files relevant to the lockup, e.g., no burst of null a= scii characters like what occurred in the log files for the previous experiment. There are some segfaults associated with a cron task I have configured every morning starting at 4:32, but those always occur for that task (which is a complete build and test of CMake) so I don't think they are relevant. The actual lockup today happened with one inactive desktop running on the X-terminal and one active desktop running on the new box. (I was editing a file with Emacs.) Also, the symptoms of this lockup were more severe, i.e., ping did not work from the X-terminal to the new box. But as always there was no way to shut down the new box properly so I had to do that with the reset button. Since I bought the new box in May remote access from an X-terminal has only locked up twice (one of those detailed here), and after a relatively long period of time. So tests where the X-terminal use is the only way to access the new box seems in general much more stable than direct use (as in the present case with such a short time before the lockup). And I haven't tried sole use of the X-terminal for a while now, and that may be completely stab= le with the new kernel. So my conclusion remains that the problem is associat= ed with the Debian Buster graphics stack (and likely also the very latest grap= hics stack if someone will do some up time tests for modern AMD graphics cards f= or that stack) used to display and control the RX 550 card on the new box. I have now started a new test (as of 9:08:19 today) with all graphics stack versions and kernel parameters the same as for the previous test in hopes t= hat when the inevitable lockup comes the log files will be more informative.=20 Please let me know if you have some other experiment you would like me to t= ry. --=20 You are receiving this mail because: You are the assignee for the bug.= --15378191240.6d6f9.23889 Date: Mon, 24 Sep 2018 19:58:44 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated

Comme= nt # 22 on bug 10667= 1 from Alan W. Irwin
This last stability test lasted only 17.5 hours before the loc=
kup.  See the
latest attached tarball for the relevant log files (which capture everything
during this short up time) and dmesg output.  As far as I can tell there is
nothing in those log files relevant to the lockup, e.g., no burst of null a=
scii
characters like what occurred in the log files for the previous experiment.
There are some segfaults associated with a cron task I have configured every
morning starting at 4:32, but those always occur for that task (which is a
complete build and test of CMake) so I don't think they are relevant.

The actual lockup today happened with one inactive desktop running on the
X-terminal and one active desktop running on the new box.  (I was editing a
file with Emacs.) Also, the symptoms of this lockup were more severe, i.e.,
ping
did not work from the X-terminal to the new box.  But as always there was no
way to shut down the new box properly so I had to do that with the reset
button.

Since I bought the new box in May remote access from an X-terminal has only
locked up twice (one of those detailed here), and after a relatively long
period of time.  So tests where the X-terminal use is the only way to access
the new box seems in general much more stable than direct use (as in the
present case with such a short time before the lockup).  And I haven't tried
sole use of the X-terminal for a while now, and that may be completely stab=
le
with the new kernel.  So my conclusion remains that the problem is associat=
ed
with the Debian Buster graphics stack (and likely also the very latest grap=
hics
stack if someone will do some up time tests for modern AMD graphics cards f=
or
that stack) used to display and control the RX 550 card on the new box.

I have now started a new test (as of 9:08:19 today) with all graphics stack
versions and kernel parameters the same as for the previous test in hopes t=
hat
when the inevitable lockup comes the log files will be more informative.=20
Please let me know if you have some other experiment you would like me to t=
ry.


You are receiving this mail because:
  • You are the assignee for the bug.
= --15378191240.6d6f9.23889-- --===============0648965934== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg== --===============0648965934==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 106671] Frequent lock ups for AMD RX 550 graphics card Date: Thu, 04 Oct 2018 05:53:01 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1574360118==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [IPv6:2610:10:20:722:a800:ff:fe98:4b55]) by gabe.freedesktop.org (Postfix) with ESMTP id 459F66E1B4 for ; Thu, 4 Oct 2018 05:53:01 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============1574360118== Content-Type: multipart/alternative; boundary="15386323811.f6E735.23602" Content-Transfer-Encoding: 7bit --15386323811.f6E735.23602 Date: Thu, 4 Oct 2018 05:53:01 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D106671 --- Comment #23 from Alan W. Irwin --- Created attachment 141872 --> https://bugs.freedesktop.org/attachment.cgi?id=3D141872&action=3Dedit tarball containing daemon.log, messages, kern.log, syslog, and dmesg output The previously described uptime test lasted (until the lockup this morning)= for 9+ days, but the log files included nothing that seemed relevant. The next uptime test that started this morning for exactly the same graphics stack a= nd kernel parameters lasted only 7 hours until a lockup, and this time the (attached) log files caught substantial error messages before the crash.=20= =20 @Michel D=C3=A4nzer: Could you please take a look at this one to see wheth= er there is some clue in the kernel error messages concerning the source of this instability? --=20 You are receiving this mail because: You are the assignee for the bug.= --15386323811.f6E735.23602 Date: Thu, 4 Oct 2018 05:53:01 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated

Comme= nt # 23 on bug 10667= 1 from Alan W. Irwin
Created attachment 141872 [details]
tarball containing daemon.log, messages, kern.log, syslog, and dmesg output

The previously described uptime test lasted (until the lockup this morning)=
 for
9+ days, but the log files included nothing that seemed relevant.   The next
uptime test that started this morning for exactly the same graphics stack a=
nd
kernel parameters lasted only 7 hours until a lockup, and this time the
(attached) log files caught substantial error messages before the crash.=20=
=20

@Michel D=C3=A4nzer:  Could you please take a look at this one to see w=
hether there
is some clue in the kernel error messages concerning the source of this
instability?


You are receiving this mail because:
  • You are the assignee for the bug.
= --15386323811.f6E735.23602-- --===============1574360118== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg== --===============1574360118==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 106671] Frequent lock ups for AMD RX 550 graphics card Date: Fri, 05 Oct 2018 01:39:38 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0683993266==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [IPv6:2610:10:20:722:a800:ff:fe98:4b55]) by gabe.freedesktop.org (Postfix) with ESMTP id 6AFFE6E0B1 for ; Fri, 5 Oct 2018 01:39:38 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0683993266== Content-Type: multipart/alternative; boundary="15387035780.20bA0.13009" Content-Transfer-Encoding: 7bit --15387035780.20bA0.13009 Date: Fri, 5 Oct 2018 01:39:38 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D106671 --- Comment #24 from Alan W. Irwin --- Created attachment 141904 --> https://bugs.freedesktop.org/attachment.cgi?id=3D141904&action=3Dedit X log file showing segfault Just now the direct X server failed with a segfault (see the attached log f= ile for the details). I restarted direct X with my normal startx method, and my kernel stability test I started yesterday with two different desktops runni= ng (one with direct X and one for a different user who uses an X-terminal) is continuing. I also reviewed the previous tarball that contained the log fi= les for the last kernel lockup, and the messages there have a lot to say about = NMI so I am hoping if some expert here actually takes a look at those log files, and/or the attached X log file containing messages from X server segfault), they might be able to find a way to increase stability for the RX 550 or mi= ght recommend some variation on these stability experiments to get a better ide= a of the graphics stack bug(s) that are causing this (these) issue(s). --=20 You are receiving this mail because: You are the assignee for the bug.= --15387035780.20bA0.13009 Date: Fri, 5 Oct 2018 01:39:38 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated

Comme= nt # 24 on bug 10667= 1 from Alan W. Irwin
Created attachment 1=
41904 [details]
X log file showing segfault

Just now the direct X server failed with a segfault (see the attached log f=
ile
for the details).  I restarted direct X with my normal startx method, and my
kernel stability test I started yesterday with two different desktops runni=
ng
(one with direct X and one for a different user who uses an X-terminal) is
continuing.  I also reviewed the previous tarball that contained the log fi=
les
for the last kernel lockup, and the messages there have a lot to say about =
NMI
so I am hoping if some expert here actually takes a look at those log files,
and/or the attached X log file containing messages from X server segfault),
they might be able to find a way to increase stability for the RX 550 or mi=
ght
recommend some variation on these stability experiments to get a better ide=
a of
the graphics stack bug(s) that are causing this (these) issue(s).


You are receiving this mail because:
  • You are the assignee for the bug.
= --15387035780.20bA0.13009-- --===============0683993266== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg== --===============0683993266==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 106671] Frequent lock ups for AMD RX 550 graphics card Date: Fri, 05 Oct 2018 08:35:33 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1738842215==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [IPv6:2610:10:20:722:a800:ff:fe98:4b55]) by gabe.freedesktop.org (Postfix) with ESMTP id EFD0A6E744 for ; Fri, 5 Oct 2018 08:35:32 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============1738842215== Content-Type: multipart/alternative; boundary="15387285320.eB65D.6009" Content-Transfer-Encoding: 7bit --15387285320.eB65D.6009 Date: Fri, 5 Oct 2018 08:35:32 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D106671 Michel D=C3=A4nzer changed: What |Removed |Added ---------------------------------------------------------------------------- Attachment #141904|application/x-trash |text/plain mime type| | --=20 You are receiving this mail because: You are the assignee for the bug.= --15387285320.eB65D.6009 Date: Fri, 5 Oct 2018 08:35:32 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated = Michel D=C3=A4nzer changed bug 10667= 1
What Removed Added
Attachment #141904 mime type application/x-trash text/plain


You are receiving this mail because:
  • You are the assignee for the bug.
= --15387285320.eB65D.6009-- --===============1738842215== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg== --===============1738842215==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 106671] Frequent lock ups for AMD RX 550 graphics card Date: Fri, 05 Oct 2018 08:40:37 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1424881213==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [IPv6:2610:10:20:722:a800:ff:fe98:4b55]) by gabe.freedesktop.org (Postfix) with ESMTP id D8FB76E745 for ; Fri, 5 Oct 2018 08:40:36 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============1424881213== Content-Type: multipart/alternative; boundary="15387288361.4bC7.6623" Content-Transfer-Encoding: 7bit --15387288361.4bC7.6623 Date: Fri, 5 Oct 2018 08:40:36 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D106671 --- Comment #25 from Michel D=C3=A4nzer --- (In reply to Alan W. Irwin from comment #24) > Just now the direct X server failed with a segfault (see the attached log > file for the details). Looks like a Mesa bug. Please install the libgl1-mesa-dri-dbgsym package and attach another log file if it happens again. --=20 You are receiving this mail because: You are the assignee for the bug.= --15387288361.4bC7.6623 Date: Fri, 5 Oct 2018 08:40:36 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated

Comme= nt # 25 on bug 10667= 1 from Michel D=C3=A4nzer
(In reply to Alan W. Irwin from comment #24)
> Just now the direct X server failed with a segfa=
ult (see the attached log
> file for the details).

Looks like a Mesa bug. Please install the libgl1-mesa-dri-dbgsym package and
attach another log file if it happens again.


You are receiving this mail because:
  • You are the assignee for the bug.
= --15387288361.4bC7.6623-- --===============1424881213== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg== --===============1424881213==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 106671] Frequent lock ups for AMD RX 550 graphics card Date: Fri, 05 Oct 2018 09:32:29 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1757446265==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [IPv6:2610:10:20:722:a800:ff:fe98:4b55]) by gabe.freedesktop.org (Postfix) with ESMTP id DE5116E784 for ; Fri, 5 Oct 2018 09:32:28 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============1757446265== Content-Type: multipart/alternative; boundary="15387319480.023d.13993" Content-Transfer-Encoding: 7bit --15387319480.023d.13993 Date: Fri, 5 Oct 2018 09:32:28 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D106671 --- Comment #26 from Alan W. Irwin --- (In reply to Michel D=C3=A4nzer from comment #25) > (In reply to Alan W. Irwin from comment #24) > > Just now the direct X server failed with a segfault (see the attached l= og > > file for the details). >=20 > Looks like a Mesa bug. Please install the libgl1-mesa-dri-dbgsym package = and > attach another log file if it happens again. Good idea, but I cannot follow up on it. Debian Jessie =3D oldstable had debug packages for libgl1-mesa-dri, but I can find nothing equivalent for Debian Stretch (not relevant to my Debian Buster box but I looked for it nevertheless) or Debian Buster. Debian Sid = has such packages, but they are all for non-official hardware platforms and not= for my AMD64 hardware platform. Is there any further follow up you can recommend for the NMI-related error messages for the latest kernel lockup? --=20 You are receiving this mail because: You are the assignee for the bug.= --15387319480.023d.13993 Date: Fri, 5 Oct 2018 09:32:28 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated

Comme= nt # 26 on bug 10667= 1 from Alan W. Irwin
(In reply to Michel D=C3=A4nzer from comment #25)
> (In reply to Alan W. Irwin from comment #24)
> > Just now the direct X server failed with a segfault (see the atta=
ched log
> > file for the details).
>=20
> Looks like a Mesa bug. Please install the libgl1-mesa-dri-dbgsym packa=
ge and
> attach another log file if it happens again.

Good idea, but I cannot follow up on it.
Debian Jessie =3D oldstable had debug packages for libgl1-mesa-dri, but I
can find nothing equivalent for Debian Stretch (not relevant to my Debian
Buster box but I looked for it nevertheless) or Debian Buster.  Debian Sid =
has
such packages, but they are all for non-official hardware platforms and not=
 for
my AMD64 hardware platform.

Is there any further follow up you can recommend for the NMI-related error
messages for the latest kernel lockup?


You are receiving this mail because:
  • You are the assignee for the bug.
= --15387319480.023d.13993-- --===============1757446265== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg== --===============1757446265==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 106671] Frequent lock ups for AMD RX 550 graphics card Date: Fri, 05 Oct 2018 09:59:08 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0832669298==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 5D8D16E6C3 for ; Fri, 5 Oct 2018 09:59:08 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0832669298== Content-Type: multipart/alternative; boundary="15387335481.FE2dDF.17796" Content-Transfer-Encoding: 7bit --15387335481.FE2dDF.17796 Date: Fri, 5 Oct 2018 09:59:08 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D106671 --- Comment #27 from Michel D=C3=A4nzer --- (In reply to Alan W. Irwin from comment #26) > Debian Jessie =3D oldstable had debug packages for libgl1-mesa-dri, but I > can find nothing equivalent for Debian Stretch Debugging symbol packages are in a separate repository now, add this to /etc/apt/sources.list: deb https://deb.debian.org/debian-debug/ -debug main contrib non-free (Replace with the suite name you have for the main repository there) > Is there any further follow up you can recommend for the NMI-related error > messages for the latest kernel lockup? Looks e1000e network driver related. --=20 You are receiving this mail because: You are the assignee for the bug.= --15387335481.FE2dDF.17796 Date: Fri, 5 Oct 2018 09:59:08 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated

Comme= nt # 27 on bug 10667= 1 from Michel D=C3=A4nzer
(In reply to Alan W. Irwin from comment #26)
> Debian Jessie =3D oldstable had debug packages f=
or libgl1-mesa-dri, but I
> can find nothing equivalent for Debian Stretch

Debugging symbol packages are in a separate repository now, add this to
/etc/apt/sources.list:

deb     https://deb.debian=
.org/debian-debug/    <suite>-debug   main contrib
non-free

(Replace <suite> with the suite name you have for the main repository=
 there)


> Is there any further follow up you can recommend=
 for the NMI-related error
> messages for the latest kernel lockup?

Looks e1000e network driver related.


You are receiving this mail because:
  • You are the assignee for the bug.
= --15387335481.FE2dDF.17796-- --===============0832669298== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg== --===============0832669298==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 106671] Frequent lock ups for AMD RX 550 graphics card Date: Sat, 06 Oct 2018 00:32:05 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1363083160==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [IPv6:2610:10:20:722:a800:ff:fe98:4b55]) by gabe.freedesktop.org (Postfix) with ESMTP id 358766E946 for ; Sat, 6 Oct 2018 00:32:05 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============1363083160== Content-Type: multipart/alternative; boundary="15387859250.DcfbbaE.14462" Content-Transfer-Encoding: 7bit --15387859250.DcfbbaE.14462 Date: Sat, 6 Oct 2018 00:32:05 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D106671 --- Comment #28 from Alan W. Irwin --- (In reply to Michel D=C3=A4nzer from comment #27) > (In reply to Alan W. Irwin from comment #26) > > Debian Jessie =3D oldstable had debug packages for libgl1-mesa-dri, but= I > > can find nothing equivalent for Debian Stretch >=20 > Debugging symbol packages are in a separate repository now, add this to > /etc/apt/sources.list: >=20 > deb https://deb.debian.org/debian-debug/ -debug main contrib non-f= ree >=20 > (Replace with the suite name you have for the main repository the= re) Thanks for that Debian Buster help concerning debug symbols. As a result I= now have libgl1-mesa-dri-dbgsym installed just in case I run into this segfault again. > > Is there any further follow up you can recommend for the NMI-related er= ror > > messages for the latest kernel lockup? >=20 > Looks e1000e network driver related. So this appears to be a side issue from the much more frequent lockups I te= nd to get whenever I am using the RX 550 card. So it is off-topic for the cur= rent bug report, but thanks for helping me to determine that by your classificat= ion of this particular source of kernel-4.18.x lockups for my now 5 months old = and still not stable Linux box. Anyhow, I will continue the present stability experiment to see how far I g= et before the next lockup. --=20 You are receiving this mail because: You are the assignee for the bug.= --15387859250.DcfbbaE.14462 Date: Sat, 6 Oct 2018 00:32:05 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated

Comme= nt # 28 on bug 10667= 1 from Alan W. Irwin
(In reply to Michel D=C3=A4nzer from comment #27)
> (In reply to Alan W. Irwin from comment #26)
> > Debian Jessie =3D oldstable had debug packages for libgl1-mesa-dr=
i, but I
> > can find nothing equivalent for Debian Stretch
>=20
> Debugging symbol packages are in a separate repository now, add this to
> /etc/apt/sources.list:
>=20
> deb	https://deb.debia=
n.org/debian-debug/	<suite>-debug	main contrib non-free
>=20
> (Replace <suite> with the suite name you have for the main repos=
itory there)

Thanks for that Debian Buster help concerning debug symbols.  As a result I=
 now
have libgl1-mesa-dri-dbgsym installed just in case I run into this segfault
again.


> > Is there any further follow up you can reco=
mmend for the NMI-related error
> > messages for the latest kernel lockup?
>=20
> Looks e1000e network driver related.

So this appears to be a side issue from the much more frequent lockups I te=
nd
to get whenever I am using the RX 550 card.  So it is off-topic for the cur=
rent
bug report, but thanks for helping me to determine that by your classificat=
ion
of this particular source of kernel-4.18.x lockups for my now 5 months old =
and
still not stable Linux box.

Anyhow, I will continue the present stability experiment to see how far I g=
et
before the next lockup.


You are receiving this mail because:
  • You are the assignee for the bug.
= --15387859250.DcfbbaE.14462-- --===============1363083160== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg== --===============1363083160==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 106671] Frequent lock ups for AMD RX 550 graphics card Date: Wed, 17 Oct 2018 21:28:15 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1566666596==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [IPv6:2610:10:20:722:a800:ff:fe98:4b55]) by gabe.freedesktop.org (Postfix) with ESMTP id 7C2966E3FA for ; Wed, 17 Oct 2018 21:28:15 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============1566666596== Content-Type: multipart/alternative; boundary="15398116950.b464f.1595" Content-Transfer-Encoding: 7bit --15398116950.b464f.1595 Date: Wed, 17 Oct 2018 21:28:15 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D106671 --- Comment #29 from Alan W. Irwin --- Created attachment 142075 --> https://bugs.freedesktop.org/attachment.cgi?id=3D142075&action=3Dedit tarball containing daemon.log, messages, kern.log, syslog, and dmesg output --=20 You are receiving this mail because: You are the assignee for the bug.= --15398116950.b464f.1595 Date: Wed, 17 Oct 2018 21:28:15 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated

Comme= nt # 29 on bug 10667= 1 from Alan W. Irwin
Created attachment 142075 [details]
tarball containing daemon.log, messages, kern.log, syslog, and dmesg output=


You are receiving this mail because:
  • You are the assignee for the bug.
= --15398116950.b464f.1595-- --===============1566666596== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg== --===============1566666596==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 106671] Frequent lock ups for AMD RX 550 graphics card Date: Wed, 17 Oct 2018 21:54:37 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0881330772==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [IPv6:2610:10:20:722:a800:ff:fe98:4b55]) by gabe.freedesktop.org (Postfix) with ESMTP id B5C5D6E41A for ; Wed, 17 Oct 2018 21:54:37 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0881330772== Content-Type: multipart/alternative; boundary="15398132770.4f20be1Ca.8958" Content-Transfer-Encoding: 7bit --15398132770.4f20be1Ca.8958 Date: Wed, 17 Oct 2018 21:54:37 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D106671 --- Comment #30 from Alan W. Irwin --- This time the system lasted almost 14 days before the lockup. See the late= st attachment for the log details which contain NMI messages followed by a bur= st of ascii null characters (which in my experience can be due to different threads or processes trying to write to the same file, i.e., the NMI error messages themselves might have exposed another kernel bug). Unlike the last case of NMI mesages where an Intel network card was mentioned, the only hardware I can see mentioned in these messages is a particular cpu and my motherboard, e.g., Oct 17 13:25:02 merlin kernel: [1177237.021995] NMI watchdog: Watchdog dete= cted hard LOCKUP on cpu 13 [...] Oct 17 13:25:02 merlin kernel: [1177237.022042] Hardware name: System manufacturer System Product Name/PRIME B350-PLUS, BIOS 3803 01/22/2018 So this appears not to be hard evidence of a graphics stack bug since likely any linux system component bug could lock up a cpu, but I am still pretty s= ure this is a graphics stack issue with the RX 550 because of my prior evidence showing much better kernel stability if I do not use that RX550 card at all. I started a new up-time experiment using today's snapshot of Debian Buster which left most of the graphics stack the same other than libdrm-amdgpu1 wh= ich has been updated from 2.4.94-1 to 2.4.95-1 and the=20 linux kernel which has been updated from 4.18.6-1 to 4.18.10-2. --=20 You are receiving this mail because: You are the assignee for the bug.= --15398132770.4f20be1Ca.8958 Date: Wed, 17 Oct 2018 21:54:37 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated

Comme= nt # 30 on bug 10667= 1 from Alan W. Irwin
This time the system lasted almost 14 days before the lockup. =
 See the latest
attachment for the log details which contain NMI messages followed by a bur=
st
of ascii null characters (which in my experience can be due to different
threads or processes trying to write to the same file, i.e., the NMI error
messages themselves might have exposed another kernel bug).  Unlike the last
case of NMI mesages where an Intel network card was mentioned, the only
hardware I can see
mentioned in these messages is a particular cpu and my motherboard, e.g.,
Oct 17 13:25:02 merlin kernel: [1177237.021995] NMI watchdog: Watchdog dete=
cted
hard LOCKUP on cpu 13
[...]
Oct 17 13:25:02 merlin kernel: [1177237.022042] Hardware name: System
manufacturer System Product Name/PRIME B350-PLUS, BIOS 3803 01/22/2018

So this appears not to be hard evidence of a graphics stack bug since likely
any linux system component bug could lock up a cpu, but I am still pretty s=
ure
this is a graphics stack issue with the RX 550 because of my prior evidence
showing
much better kernel stability if I do not use that RX550 card at all.

I started a new up-time experiment using today's snapshot of Debian Buster
which left most of the graphics stack the same other than libdrm-amdgpu1 wh=
ich
has been updated from 2.4.94-1 to 2.4.95-1 and the=20
linux kernel which has been updated from 4.18.6-1 to 4.18.10-2.


You are receiving this mail because:
  • You are the assignee for the bug.
= --15398132770.4f20be1Ca.8958-- --===============0881330772== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg== --===============0881330772==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 106671] Frequent lock ups for AMD RX 550 graphics card Date: Sun, 21 Oct 2018 05:58:26 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0488549288==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id F05006E050 for ; Sun, 21 Oct 2018 05:58:25 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0488549288== Content-Type: multipart/alternative; boundary="15401015050.ea81ecb.31974" Content-Transfer-Encoding: 7bit --15401015050.ea81ecb.31974 Date: Sun, 21 Oct 2018 05:58:25 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D106671 --- Comment #31 from Alan W. Irwin --- Created attachment 142114 --> https://bugs.freedesktop.org/attachment.cgi?id=3D142114&action=3Dedit tarball containing log information concerning latest lockup --=20 You are receiving this mail because: You are the assignee for the bug.= --15401015050.ea81ecb.31974 Date: Sun, 21 Oct 2018 05:58:25 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated

Comme= nt # 31 on bug 10667= 1 from Alan W. Irwin
Created attachment 142114 [details]
tarball containing log information concerning latest lockup


You are receiving this mail because:
  • You are the assignee for the bug.
= --15401015050.ea81ecb.31974-- --===============0488549288== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg== --===============0488549288==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 106671] Frequent lock ups for AMD RX 550 graphics card Date: Sun, 21 Oct 2018 06:20:46 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1163297428==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 1748089D7C for ; Sun, 21 Oct 2018 06:20:46 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============1163297428== Content-Type: multipart/alternative; boundary="15401028460.4B9ceEe3.6005" Content-Transfer-Encoding: 7bit --15401028460.4B9ceEe3.6005 Date: Sun, 21 Oct 2018 06:20:45 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D106671 --- Comment #32 from Alan W. Irwin --- I had another lockup today after ~3 days of uptime. Please see the most re= cent attachment for the relevant log files and dmesg output corresponding to this lockup. These logs contain NMI messages and references to the e1000e kernel module so, although I am no expert, this lockup appears to be e1000e relate= d.=20=20 That kernel module is running the following Intel networking expansion card: 09:00.0 Ethernet controller: Intel Corporation 82574L Gigabit Network Connection That card is used to connect (via crossover cable) this Ryzen 7 box with an external X-terminal box. Since this is the second lockup recently attribu= ted to the e1000e module and other recent lockups I have encountered with no er= ror messages or ascii null error messages in the system logs could also be due = to this kernel module, the current stability test I just started was to turn o= ff the X terminal completely (to eliminate use of the 82574L other than its initial detection). The second user of the present system that previously accessed it with the X-terminal is now accessing locally it with "startx --= :1" while I am accessing it locally with "startx". So the two users are sharing one monitor, keyboard, and mouse and switching between their two xfce deskt= ops and associated local X servers using the appropriate ctrl-alt-FN keyboard shortcuts. So although this is a painful way to run our two desktops it obviously is a more stringent test and also a much cleaner test (without the e1000e troubles confusing graphics stack issues) of how stable the Debian Buster graphics stack is for my Ryzen 7 1700 system with 64GB, (idle) Intel 82574L networking card, and (extremely busy since it is switched between tw= o X servers several times per day) AMD RX 550. In sum, my hope is that all the other package upgrades and installations (e= .g., of the firmware packages) I have done have completely stabilized the Debian Buster graphics stack for my Ryzen 7 system with RX 550 so that I will get = an uptime (with this painful but useful experiment with two local X servers) o= f at least several months which would allow me to close this bug report as fixed. @Michel D=C3=A4nzer: Meanwhile, where is the best upstream place to report = the repeated lockups with the e1000e? I have already created a Debian Buster b= ug report at conc= erning the e1000e lockups, but I would like to repeat that for the relevant kernel= bug tracker in case there is no Debian response to that bug report. --=20 You are receiving this mail because: You are the assignee for the bug.= --15401028460.4B9ceEe3.6005 Date: Sun, 21 Oct 2018 06:20:46 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated

Comme= nt # 32 on bug 10667= 1 from Alan W. Irwin
I had another lockup today after ~3 days of uptime.  Please se=
e the most recent
attachment for the relevant log files and dmesg output corresponding to this
lockup.  These logs contain NMI messages and references to the e1000e kernel
module so, although I am no expert, this lockup appears to be e1000e relate=
d.=20=20

That kernel module is running the following Intel networking expansion card:

09:00.0 Ethernet controller: Intel Corporation 82574L Gigabit Network
Connection

That card is used to connect (via crossover cable) this Ryzen 7 box with an
external  X-terminal box.  Since this is the second lockup recently attribu=
ted
to the e1000e module and other recent lockups I have encountered with no er=
ror
messages or ascii null error messages in the system logs could also be due =
to
this kernel module, the current stability test I just started was to turn o=
ff
the X terminal completely (to eliminate use of the 82574L other than its
initial detection).  The second user of the present system that previously
accessed it with the X-terminal is now accessing locally it with "star=
tx -- :1"
while I am accessing it locally with "startx".  So the two users =
are sharing
one monitor, keyboard, and mouse and switching between their two xfce deskt=
ops
and associated local X servers using the appropriate ctrl-alt-FN keyboard
shortcuts.  So although this is a painful way to run our two desktops it
obviously is a more stringent test and also a much cleaner test (without the
e1000e troubles confusing graphics stack issues) of how stable the Debian
Buster graphics stack is for my Ryzen 7 1700 system with 64GB, (idle) Intel
82574L networking card, and (extremely busy since it is switched between tw=
o X
servers several times per day) AMD RX 550.

In sum, my hope is that all the other package upgrades and installations (e=
.g.,
of the firmware packages) I have done have completely stabilized the Debian
Buster graphics stack for my Ryzen 7 system with RX 550 so that I will get =
an
uptime (with this painful but useful experiment with two local X servers) o=
f at
least several months which would allow me to close this bug report as fixed.

@Michel D=C3=A4nzer: Meanwhile, where is the best upstream place to rep=
ort the
repeated lockups with the e1000e?  I have already created a Debian Buster b=
ug
report at <https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=3D911496&g=
t; concerning
the e1000e lockups, but I would like to repeat that for the relevant kernel=
 bug
tracker in case there is no Debian response to that bug report.


You are receiving this mail because:
  • You are the assignee for the bug.
= --15401028460.4B9ceEe3.6005-- --===============1163297428== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg== --===============1163297428==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 106671] Frequent lock ups for AMD RX 550 graphics card Date: Sun, 04 Nov 2018 05:56:09 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============2127495904==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id CA01C8808F for ; Sun, 4 Nov 2018 05:56:15 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============2127495904== Content-Type: multipart/alternative; boundary="15413109730.9efd.8852" Content-Transfer-Encoding: 7bit --15413109730.9efd.8852 Date: Sun, 4 Nov 2018 05:56:13 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D106671 --- Comment #33 from Alan W. Irwin --- Created attachment 142358 --> https://bugs.freedesktop.org/attachment.cgi?id=3D142358&action=3Dedit tarball containing log information concerning latest lockup --=20 You are receiving this mail because: You are the assignee for the bug.= --15413109730.9efd.8852 Date: Sun, 4 Nov 2018 05:56:13 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated

Comme= nt # 33 on bug 10667= 1 from Alan W. Irwin
Created attachment 142358 [details]
tarball containing log information concerning latest lockup


You are receiving this mail because:
  • You are the assignee for the bug.
= --15413109730.9efd.8852-- --===============2127495904== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg== --===============2127495904==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 106671] Frequent lock ups for AMD RX 550 graphics card Date: Sun, 04 Nov 2018 06:40:16 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0385195270==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 1F5506E0AE for ; Sun, 4 Nov 2018 06:40:16 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0385195270== Content-Type: multipart/alternative; boundary="15413136160.BFa9e7A.16764" Content-Transfer-Encoding: 7bit --15413136160.BFa9e7A.16764 Date: Sun, 4 Nov 2018 06:40:16 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D106671 --- Comment #34 from Alan W. Irwin --- I have discovered this box became significantly less stable when there were= two users displaying X directly on it (one with startx , one with startx -- :1)= and using ctrl-alt-F1 and ctrl-alt-F2 to switch between the two local X servers that were displaying two different XFCE desktops via the RX 550 graphics ca= rd. After moving to that mode of operation we got the following results for upt= imes before lockups: 1 day, 2 times 2 days, 1 time 3 days, 1 time For each of these 4 lockups I could not spot any relevant messages in the l= og files. But the substantially shorter uptimes for this method of using this= box does appear to confirm there are still issues with the graphics stack for t= he RX550. But the graphics content being displayed by the two users is roughly similar so I don't understand why this mode of operation is so much less st= able then if just one user is using the RX 550 while the other is using an X-terminal. (None of these lockups occurred anywhere near the times we switched between the two local X servers, but I suppose it is possible that switching sets up a condition that results in a lockup much later.) Anyhow, because of the increased instability I gave up on the two local X servers approach and went back to the one local X server and one X-terminal approach, and with that approach we got an uptime of a week before the syst= em locked up. That lockup occurred tonight, and I have attached a tarball containing log files that show many NMI error messages associated with that lockup (but with no reference to the e1000e module this time). @Michel D=C3=A4nzer: Could you please take a look at these log files and le= t me know if this is the best place to report the present lockup? --=20 You are receiving this mail because: You are the assignee for the bug.= --15413136160.BFa9e7A.16764 Date: Sun, 4 Nov 2018 06:40:16 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated

Comme= nt # 34 on bug 10667= 1 from Alan W. Irwin
I have discovered this box became significantly less stable wh=
en there were two
users displaying X directly on it (one with startx , one with startx -- :1)=
 and
using ctrl-alt-F1 and ctrl-alt-F2 to switch between the two local X servers
that were displaying two different XFCE desktops via the RX 550 graphics ca=
rd.
After moving to that mode of operation we got the following results for upt=
imes
before lockups:

1 day,  2 times
2 days, 1 time
3 days, 1 time

For each of these 4 lockups I could not spot any relevant messages in the l=
og
files.  But the substantially shorter uptimes for this method of using this=
 box
does appear to confirm there are still issues with the graphics stack for t=
he
RX550.  But the graphics content being displayed by the two users is roughly
similar so I don't understand why this mode of operation is so much less st=
able
then if just one user is using the RX 550 while the other is using an
X-terminal.  (None of these lockups occurred anywhere near the times we
switched between the two local X servers, but I suppose it is possible that
switching sets up a condition that results in a lockup much later.)

Anyhow, because of the increased instability I gave up on the two local X
servers approach and went back to the one local X server and one X-terminal
approach, and with that approach we got an uptime of a week before the syst=
em
locked up.  That lockup occurred tonight, and I have attached a tarball
containing log files that show many NMI error messages associated with that
lockup (but with no reference to the e1000e module this time).

@Michel D=C3=A4nzer: Could you please take a look at these log files an=
d let me know
if this is the best place to report the present lockup?


You are receiving this mail because:
  • You are the assignee for the bug.
= --15413136160.BFa9e7A.16764-- --===============0385195270== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg== --===============0385195270==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 106671] Frequent lock ups for AMD RX 550 graphics card Date: Sat, 10 Nov 2018 08:38:45 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0452478860==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [IPv6:2610:10:20:722:a800:ff:fe98:4b55]) by gabe.freedesktop.org (Postfix) with ESMTP id C64BC6E078 for ; Sat, 10 Nov 2018 08:38:44 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0452478860== Content-Type: multipart/alternative; boundary="15418391240.C23D.1827" Content-Transfer-Encoding: 7bit --15418391240.C23D.1827 Date: Sat, 10 Nov 2018 08:38:44 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D106671 --- Comment #35 from fin4478@hotmail.com --- To prevent random kernel lock ups with Ryzen, enable RCU_NOCB_CPU in the ke= rnel configuration and boot the kernel with the rcu_nocbs=3D0-X command line parameter. X is the cpu thread count -1. To fix this with bios, set to Typi= cal Current Idle in the bios Advanced/AMD CBS menu. --=20 You are receiving this mail because: You are the assignee for the bug.= --15418391240.C23D.1827 Date: Sat, 10 Nov 2018 08:38:44 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated

Comme= nt # 35 on bug 10667= 1 from fin4478@hotm= ail.com
To prevent random kernel lock ups with Ryzen, enable RCU_NOCB_=
CPU in the kernel
configuration  and boot the kernel with the rcu_nocbs=3D0-X command line
parameter. X is the cpu thread count -1. To fix this with bios, set to Typi=
cal
Current Idle  in the bios Advanced/AMD CBS menu.


You are receiving this mail because:
  • You are the assignee for the bug.
= --15418391240.C23D.1827-- --===============0452478860== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg== --===============0452478860==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 106671] Frequent lock ups for AMD RX 550 graphics card Date: Thu, 15 Nov 2018 02:51:29 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1843231419==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 5D78489EFF for ; Thu, 15 Nov 2018 02:51:29 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============1843231419== Content-Type: multipart/alternative; boundary="15422502890.8CC5cfe.28704" Content-Transfer-Encoding: 7bit --15422502890.8CC5cfe.28704 Date: Thu, 15 Nov 2018 02:51:29 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D106671 --- Comment #36 from Alan W. Irwin --- (In reply to fin4478 from comment #35) > To prevent random kernel lock ups with Ryzen, enable RCU_NOCB_CPU in the > kernel configuration and boot the kernel with the rcu_nocbs=3D0-X command > line parameter. X is the cpu thread count -1. To fix this with bios, set = to > Typical Current Idle in the bios Advanced/AMD CBS menu. I was quickly able to verify all you said at and . So it appears all = Linux Ryzen owners should be aware of this "idle" issue and take the necessary workarounds, but despite over many months publicizing my Linux Ryzen troubl= es in a number of different Linux forums (including this bug report) and many different google searches I remained clueless about this bad Linux Ryzen situation until now. So many thanks for being the first to clue me in! It took me a while to figure out how to rebuild the latest Debian Buster ke= rnel (4.18.10) with RCU_NOCB_CPU enabled, but I have done that now and just rebo= oted with that custom kernel using the rcu_nocbs=3D0-15 kernel parameter (my Ryz= en 7 1700 has 8 cores and 16 threads). So my hopes are high that this step will clean up the lockup issues I have = been experiencing when my system was idling at night. But I have also experienc= ed lockups when the system was being used so the rcu_nocbs=3D0-15 workaround m= ay not be the sole step I have to take to stabilize my Linux Ryzen system. --=20 You are receiving this mail because: You are the assignee for the bug.= --15422502890.8CC5cfe.28704 Date: Thu, 15 Nov 2018 02:51:29 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated

Comme= nt # 36 on bug 10667= 1 from Alan W. Irwin
(In reply to fin4478 from comment #35)
> To prevent random kernel lock ups with Ryzen, en=
able RCU_NOCB_CPU in the
> kernel configuration  and boot the kernel with the rcu_nocbs=3D0-X com=
mand
> line parameter. X is the cpu thread count -1. To fix this with bios, s=
et to
> Typical Current Idle  in the bios Advanced/AMD CBS menu.

I was quickly able to verify all you said at
<https://community.a=
md.com/thread/225795> and
<https:=
//bugzilla.kernel.org/show_bug.cgi?id=3D196683>.  So it appears all =
Linux
Ryzen owners should be aware of this "idle" issue and take the ne=
cessary
workarounds, but despite over many months publicizing my Linux Ryzen troubl=
es
in a number of different Linux forums (including this bug report) and many
different google searches I remained clueless about this bad Linux Ryzen
situation until now.  So many thanks for being the first to clue me in!

It took me a while to figure out how to rebuild the latest Debian Buster ke=
rnel
(4.18.10) with RCU_NOCB_CPU enabled, but I have done that now and just rebo=
oted
with that custom kernel using the rcu_nocbs=3D0-15 kernel parameter (my Ryz=
en 7
1700 has 8 cores and 16 threads).

So my hopes are high that this step will clean up the lockup issues I have =
been
experiencing when my system was idling at night.  But I have also experienc=
ed
lockups when the system was being used so the rcu_nocbs=3D0-15 workaround m=
ay not
be the sole step I have to take to stabilize my Linux Ryzen system.


You are receiving this mail because:
  • You are the assignee for the bug.
= --15422502890.8CC5cfe.28704-- --===============1843231419== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg== --===============1843231419==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 106671] Frequent lock ups for AMD RX 550 graphics card Date: Wed, 25 Sep 2019 18:03:57 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1729744502==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [IPv6:2610:10:20:722:a800:ff:fe98:4b55]) by gabe.freedesktop.org (Postfix) with ESMTP id 88AD26EC74 for ; Wed, 25 Sep 2019 18:03:57 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============1729744502== Content-Type: multipart/alternative; boundary="15694346372.8bfAd64b.2066" Content-Transfer-Encoding: 7bit --15694346372.8bfAd64b.2066 Date: Wed, 25 Sep 2019 18:03:57 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D106671 GitLab Migration User changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEW |RESOLVED Resolution|--- |MOVED --- Comment #37 from GitLab Migration User -= -- -- GitLab Migration Automatic Message -- This bug has been migrated to freedesktop.org's GitLab instance and has been closed from further activity. You can subscribe and participate further through the new bug through this = link to our GitLab instance: https://gitlab.freedesktop.org/mesa/mesa/issues/131= 4. --=20 You are receiving this mail because: You are the assignee for the bug.= --15694346372.8bfAd64b.2066 Date: Wed, 25 Sep 2019 18:03:57 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated GitLab Migration User changed bug 10667= 1
What Removed Added
Status NEW RESOLVED
Resolution --- MOVED

Comme= nt # 37 on bug 10667= 1 from GitLab Migration User
-- GitLab Migration Automatic Message --

This bug has been migrated to freedesktop.org's GitLab instance and has been
closed from further activity.

You can subscribe and participate further through the new bug through this =
link
to our GitLab instance: https://gitlab.freedesktop.org/mesa/mesa/issues/1314.
        


You are receiving this mail because:
  • You are the assignee for the bug.
= --15694346372.8bfAd64b.2066-- --===============1729744502== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVs --===============1729744502==--