From: bugzilla-daemon@kernel.org
To: dri-devel@lists.freedesktop.org
Subject: [Bug 201957] amdgpu: ring gfx timeout
Date: Mon, 13 Jun 2022 01:20:48 +0000
X-Bugzilla-Product: Drivers
X-Bugzilla-Component: Video(DRI - non Intel)
X-Bugzilla-Severity: blocking
X-Bugzilla-Who: panospolychronis@gmail.com
X-Bugzilla-Status: NEW
X-Bugzilla-Priority: P1
X-Bugzilla-Assigned-To: drivers_video-dri@kernel-bugs.osdl.org
List-Id: Direct Rendering Infrastructure - Development

https://bugzilla.kernel.org/show_bug.cgi?id=201957

--- Comment #71 from Panagiotis Polychronis (panospolychronis@gmail.com) ---
(In reply to Martin von Wittich from comment #70)
> My Ubuntu 20.04 desktop is crashing several times per day due to this bug
> since I've upgraded my computer from an old Intel Xeon to an AMD Ryzen 9
> 5900X on a B550 mainboard. I've had the same AMD RX Vega 56 graphics card in
> both computers, so I assume this is probably more related to the
> mainboard/CPU than to the graphics card.
>
> The crashes from today:
>
> ```
> martin@martin ~ % grep amdgpu /var/log/syslog | grep ERROR | grep -v 'Failed to initialize parser'
> Jun 11 03:15:33 martin kernel: [21494.642889] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, signaled seq=1750601, emitted seq=1750603
> Jun 11 03:15:33 martin kernel: [21494.643055] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process firefox pid 5037 thread firefox:cs0 pid 5123
> Jun 11 03:15:50 martin kernel: [21511.795007] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, signaled seq=1750605, emitted seq=1750608
> Jun 11 03:15:50 martin kernel: [21511.795174] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process firefox pid 5037 thread firefox:cs0 pid 5123
> Jun 11 15:56:07 martin kernel: [ 1477.069969] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, signaled seq=216293, emitted seq=216295
> Jun 11 15:56:07 martin kernel: [ 1477.070140] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process firefox pid 5237 thread firefox:cs0 pid 5302
> Jun 11 15:56:22 martin kernel: [ 1492.174077] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, signaled seq=216297, emitted seq=216300
> Jun 11 15:56:22 martin kernel: [ 1492.174248] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process pid 0 thread pid 0
> Jun 11 16:03:28 martin kernel: [ 1918.161101] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, signaled seq=264406, emitted seq=264408
> Jun 11 16:03:28 martin kernel: [ 1918.161271] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process firefox pid 10569 thread firefox:cs0 pid 10633
> Jun 11 16:03:49 martin kernel: [ 1938.385307] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, signaled seq=264410, emitted seq=264413
> Jun 11 16:03:49 martin kernel: [ 1938.385479] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process firefox pid 10569 thread firefox:cs0 pid 10633
> Jun 11 23:28:12 martin kernel: [25491.854294] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, signaled seq=2390985, emitted seq=2390987
> Jun 11 23:28:12 martin kernel: [25491.854460] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process firefox pid 4922 thread firefox:cs0 pid 4989
> Jun 11 23:28:28 martin kernel: [25507.982446] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, signaled seq=2390989, emitted seq=2390992
> Jun 11 23:28:28 martin kernel: [25507.982613] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process pid 0 thread pid 0
> Jun 11 23:29:51 martin kernel: [25591.333483] amdgpu 0000:2d:00.0: amdgpu:   WALKER_ERROR: 0x0
> Jun 11 23:29:51 martin kernel: [25591.333485] amdgpu 0000:2d:00.0: amdgpu:   MAPPING_ERROR: 0x0
> Jun 11 23:30:01 martin kernel: [25601.412838] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring uvd_0 timeout, signaled seq=308, emitted seq=310
> Jun 11 23:30:01 martin kernel: [25601.413009] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process mpv pid 44110 thread mpv:cs0 pid 44122
> Jun 11 23:30:16 martin kernel: [25616.014983] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, signaled seq=2409182, emitted seq=2409185
> Jun 11 23:30:16 martin kernel: [25616.015151] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process firefox pid 42941 thread firefox:cs0 pid 43005
> ```
>
> When I upgraded my computer at the end of 2021, I had to switch from the
> default Ubuntu 20.04 kernel `linux-image-generic` (5.4.0) to
> `linux-image-generic-hwe-20.04` (5.11.0) because of some hardware issues
> with the new computer (I don't remember what exactly didn't work, IIRC the
> network).
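
To compare reports like the one quoted above, it can help to count the timeouts per ring instead of reading the raw lines. This is only a rough sketch: the regular expression is my own guess at the line layout, based purely on the excerpt in this thread, and `summarize_timeouts` is a made-up helper name, not part of any tool.

```python
import re
from collections import Counter

# Assumed layout of the "ring <name> timeout" lines, taken from the excerpt
# quoted in this thread; other kernel versions may word the message differently.
TIMEOUT_RE = re.compile(
    r"\*ERROR\* ring (?P<ring>\S+) timeout, "
    r"signaled seq=(?P<signaled>\d+), emitted seq=(?P<emitted>\d+)"
)


def summarize_timeouts(lines):
    """Count timeouts per ring and collect the emitted-minus-signaled gaps."""
    per_ring = Counter()
    gaps = []
    for line in lines:
        match = TIMEOUT_RE.search(line)
        if match is None:
            continue  # e.g. the "Process information" lines
        per_ring[match["ring"]] += 1
        gaps.append(int(match["emitted"]) - int(match["signaled"]))
    return per_ring, gaps


# Three lines copied from the quoted log.
sample = [
    "Jun 11 23:28:28 martin kernel: [25507.982446] [drm:amdgpu_job_timedout [amdgpu]] "
    "*ERROR* ring gfx timeout, signaled seq=2390989, emitted seq=2390992",
    "Jun 11 23:30:01 martin kernel: [25601.412838] [drm:amdgpu_job_timedout [amdgpu]] "
    "*ERROR* ring uvd_0 timeout, signaled seq=308, emitted seq=310",
    "Jun 11 23:30:01 martin kernel: [25601.413009] [drm:amdgpu_job_timedout [amdgpu]] "
    "*ERROR* Process information: process mpv pid 44110 thread mpv:cs0 pid 44122",
]

per_ring, gaps = summarize_timeouts(sample)
print(dict(per_ring), gaps)
```

Feeding it the lines from `/var/log/syslog` (for example via `open("/var/log/syslog")`) should show whether the hangs are limited to the gfx ring or also hit others such as uvd_0.
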
>
> I'm not exactly sure when the crashes started, but I changed from
> `linux-image-generic-hwe-20.04` (5.14) to `linux-image-oem-20.04d` (5.14) on
> 2022-04-30 in the hopes that that might resolve the issue, but unfortunately
> it didn't help.
>
> I tried the `amdgpu.runpm=0` workaround today which also didn't help.
>
> I can also confirm that the attached video "5 second video clip that
> triggers a crash" successfully triggers the crash on my system.
>
> The main other thing that seems to trigger the crash is to open new tabs in
> Firefox (in that not every new tab I open causes the crash, but when it
> crashes, it's usually when I was trying to open a new tab).

Did you try the latest Linux kernel? I had a lot of GPU lockups like this.
Also try these kernel parameters:

"amdgpu.ppfeaturemask=0xffffbffb amdgpu.noretry=0 amdgpu.lockup_timeout=0 amdgpu.gpu_recovery=1 amdgpu.audio=0 amdgpu.deep_color=1 amd_iommu=on iommu=pt"

(You might also try amdgpu.ppfeaturemask=0xfffd7fff or
amdgpu.ppfeaturemask=0xffffffff.)
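
The three `amdgpu.ppfeaturemask` values suggested above differ only in which bits are cleared, so a small sketch that lists the cleared bit positions makes them easier to compare. It assumes the mask is a 32-bit field in which a cleared bit leaves the corresponding power feature disabled; `cleared_bits` is a hypothetical helper, and the meaning of each bit position has to be looked up in the `PP_FEATURE_MASK` enum of the kernel version in use.

```python
def cleared_bits(mask, width=32):
    """Return the bit positions that are 0 in `mask`, lowest bit first."""
    return [bit for bit in range(width) if not (mask >> bit) & 1]


# The three values mentioned in this comment.
for mask in (0xFFFFBFFB, 0xFFFD7FFF, 0xFFFFFFFF):
    print(f"{mask:#010x}: cleared bits {cleared_bits(mask)}")
```

So the first value clears bits 2 and 14, the second clears bits 15 and 17, and the third leaves every bit set.
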