From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 110674] Crashes / Resets From AMDGPU / Radeon VII Date: Tue, 21 May 2019 08:11:34 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0145221271==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [IPv6:2610:10:20:722:a800:ff:fe98:4b55]) by gabe.freedesktop.org (Postfix) with ESMTP id CC23C89254 for ; Tue, 21 May 2019 08:11:34 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0145221271== Content-Type: multipart/alternative; boundary="15584262943.1A9F9d1be.32260" Content-Transfer-Encoding: 7bit --15584262943.1A9F9d1be.32260 Date: Tue, 21 May 2019 08:11:34 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D110674 --- Comment #30 from Chris Hodapp --- Some interesting findings: First, I think I may have identified the problematic commit (or at least the most-problematic one): d1a3e239a6016f2bb42a91696056e223982e8538 (drm/amd/powerplay: drop the unnecessary uclk hard min setting). I eventual= ly gave up on doing a normal bisect since so many of the commits between 5.0 a= nd 5.1 were non-viable. Instead, I made a list of all the commits that touched vega20-related files. I then started repeatedly picking out the non-tested commit with the most related-sounding message, checking out the v5.1 tag, a= nd reverting the commit in order to test it as the culprit. When I revert that one, my system boots reliably. I still see 133.0 watts of power draw, thoug= h. This brings me to the second thing: When looking through the commits, I not= iced that there were multiple commits that claim to prevent or reduce crashing in high-resolution situations (one references 5k displays, another references = 3+ 4k displays). I want to note that we all seem to have relatively demanding display setups: Hameer has two 144hz 1440p displays, Tom B has two 60hz 4k displays, and I have two 120hz 4k displays. Putting these together I decide= d to try unplugging one of my displays. Imagine my surprise when things booted completely smoothly on a stock 5.1 kernel: glitch-free boot, *no powerplay errors in the kernel log*, and 25 watts of power draw when usage is low. So= I think it is safe to say that one "workaround" is to unplug a monitor if you= can stand to work that way. I actually have access to another Radeon VII so I may try running one per monitor tomorrow. --=20 You are receiving this mail because: You are the assignee for the bug.= --15584262943.1A9F9d1be.32260 Date: Tue, 21 May 2019 08:11:34 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated

Comme= nt # 30 on bug 11067= 4 from Chris Hodapp
Some interesting findings:

First, I think I may have identified the problematic commit (or at least the
most-problematic one): d1a3e239a6016f2bb42a91696056e223982e8538
(drm/amd/powerplay: drop the unnecessary uclk hard min setting). I eventual=
ly
gave up on doing a normal bisect since so many of the commits between 5.0 a=
nd
5.1 were non-viable. Instead, I made a list of all the commits that touched
vega20-related files. I then started repeatedly picking out the non-tested
commit with the most related-sounding message, checking out the v5.1 tag, a=
nd
reverting the commit in order to test it as the culprit. When I revert that
one, my system boots reliably. I still see 133.0 watts of power draw, thoug=
h.

This brings me to the second thing: When looking through the commits, I not=
iced
that there were multiple commits that claim to prevent or reduce crashing in
high-resolution situations (one references 5k displays, another references =
3+
4k displays). I want to note that we all seem to have relatively demanding
display setups: Hameer has two 144hz 1440p displays, Tom B has two 60hz 4k
displays, and I have two 120hz 4k displays. Putting these together I decide=
d to
try unplugging one of my displays. Imagine my surprise when things booted
completely smoothly on a stock 5.1 kernel: glitch-free boot, *no powerplay
errors in the kernel log*, and 25 watts of power draw when usage is low. So=
 I
think it is safe to say that one "workaround" is to unplug a moni=
tor if you can
stand to work that way.

I actually have access to another Radeon VII so I may try running one per
monitor tomorrow.


You are receiving this mail because:
  • You are the assignee for the bug.
= --15584262943.1A9F9d1be.32260-- --===============0145221271== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVs --===============0145221271==--