From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 110674] Crashes / Resets From AMDGPU / Radeon VII Date: Fri, 16 Aug 2019 22:14:48 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0308665635==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 321326E9BC for ; Fri, 16 Aug 2019 22:14:48 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0308665635== Content-Type: multipart/alternative; boundary="15659936882.B30C7ab0.25908" Content-Transfer-Encoding: 7bit --15659936882.B30C7ab0.25908 Date: Fri, 16 Aug 2019 22:14:48 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D110674 --- Comment #109 from Tom B --- Created attachment 145080 --> https://bugs.freedesktop.org/attachment.cgi?id=3D145080&action=3Dedit dmesg with amdgpu.dpm=3D2 > Tom B., did you try booting with amdgpu.dpm=3D1 or amdgpu.dpm=3D2 (defaul= t is generally -1 for automatic)? Seems like one of those might enable the = new experimental SW SMU v11 feature on Vega20 . . . Now that is interesting.dpm=3D-1 is the same as default, and default is 1, enabled so dpm=3D1 is what we've been using all along. But dpm=3D2 and the = patch you linked to are interesting. I tried it, it didn't help the crashing issue and I was stuck at 30w. As so= on as I started sddm the system froze. I've attached my dmesg from amdgpu.dpm= =3D2 boot. It doesn't fix the issue but it does help answer a few questions I ha= d: 1. The functions in vega20_ppt.c are used with this new patch so that answe= rs my question from earlier, that's what this file is for and why it contains similar/identical functions. 2. It explains the difference I found in comment 97: This commit https://github.com/torvalds/linux/commit/94ed6d0cfdb867be9bf05f03d682980bce= 5d0036 has the new else block for smu_display_configuration_change which we now kn= ow is the software version of this function. More importantly, though, knowing that enabling DPM causes the crash, this tells us either: A) The bug is present in both versions of the vega20 code: vega20_hwmgr.c a= nd vega20_ppt.c or.. B) The card reaches an invalid state before DPM is initialised and the card= is fine until it receives a DPM change. Given that two different versions of the code produce the same result, my h= unch is that the problem is B. The card is not in a state where it's able to rec= eive power changes. --=20 You are receiving this mail because: You are the assignee for the bug.= --15659936882.B30C7ab0.25908 Date: Fri, 16 Aug 2019 22:14:48 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated

Comm= ent # 109 on bug 11067= 4 from Tom = B
Created attachment 145=
080 [details]
dmesg with amdgpu.dpm=3D2

> Tom B., did you try booting with amdgpu.dpm=3D1 =
or amdgpu.dpm=3D2 (default is generally -1 for automatic)? Seems like one o=
f those might enable the new experimental SW SMU v11 feature on Vega20 . . =
.

Now that is interesting.dpm=3D-1 is the same as default, and default is 1,
enabled so dpm=3D1 is what we've been using all along. But dpm=3D2 and the =
patch
you linked to are interesting.

I tried it, it didn't help the crashing issue and I was stuck at 30w. As so=
on
as I started sddm the system froze. I've attached my dmesg from amdgpu.dpm=
=3D2
boot. It doesn't fix the issue but it does help answer a few questions I ha=
d:


1. The functions in vega20_ppt.c are used with this new patch so that answe=
rs
my question from earlier, that's what this file is for and why it contains
similar/identical functions.

2. It explains the difference I found in comment 97: This commit
https://github.com/torvalds/linux/commit/94ed6d0cfdb867b=
e9bf05f03d682980bce5d0036
has the new else block for smu_display_configuration_change which we now kn=
ow
is the software version of this function.


More importantly, though, knowing that enabling DPM causes the crash, this
tells us either:

A) The bug is present in both versions of the vega20 code: vega20_hwmgr.c a=
nd
vega20_ppt.c or..

B) The card reaches an invalid state before DPM is initialised and the card=
 is
fine until it receives a DPM change.

Given that two different versions of the code produce the same result, my h=
unch
is that the problem is B. The card is not in a state where it's able to rec=
eive
power changes.


You are receiving this mail because:
  • You are the assignee for the bug.
= --15659936882.B30C7ab0.25908-- --===============0308665635== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVs --===============0308665635==--