From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 107432] Periodic complete system lockup with Vega M and Kernel 4.18-rc6+ Date: Thu, 02 Aug 2018 05:47:50 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1918368754==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [IPv6:2610:10:20:722:a800:ff:fe98:4b55]) by gabe.freedesktop.org (Postfix) with ESMTP id A43ED6E0A3 for ; Thu, 2 Aug 2018 05:47:50 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============1918368754== Content-Type: multipart/alternative; boundary="15331888700.d59b.19577" Content-Transfer-Encoding: 7bit --15331888700.d59b.19577 Date: Thu, 2 Aug 2018 05:47:50 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D107432 --- Comment #3 from Robert Strube --- (In reply to Michel D=C3=A4nzer from comment #2) > Created attachment 140905 [details] [review] > Use kvmalloc in amdgpu_uvd_suspend >=20 > Does this patch help by any chance? >=20 > If not, can you bisect between 4.18-rc1 and -rc6? Note that from your > description, you'll need to test for at least one day before declaring a > commit good (if you hit a failure, you can immediately declare that commit > bad). Hi Michel, Thank you for the patch. I've rebuilt the kernel with the changes in your patch and am currently going to test it out over the next several days. I've noticed that the problem seems to occur when there is a large amount of memory pressure (e.g. I'm running a VM where I've allocated lots of memory), and almost always after I've just opened a new application windows. Perhap= s a web browser, text editor, etc. Today I had a scenario (running the vanilla 4.18-rc7) where I simply ran ou= t of memory *BUT* this occurred in the absence of opening up a new application window, and the system was able to recover gracefully. I do have 16GB of RAM in my system, but I can easily hit the limit by runni= ng a VM and opening several applications. Should I conduct tests with memory pressure applied to see if your patch addresses the issue? Are we trying to simulate the same scenario as before? I'll report back my results. P.S. I've attached another dmesg.log from the out of memory problems I ran = into today (again running on vanilla 4.18-rc7 and not using your patch) so you c= an compare the two scenarios. This scenario did not result in a complete syst= em lockup, so something different must have occurred. Thanks! --=20 You are receiving this mail because: You are the assignee for the bug.= --15331888700.d59b.19577 Date: Thu, 2 Aug 2018 05:47:50 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated

Commen= t # 3 on bug 10743= 2 from Robert Strube
(In reply to Michel D=C3=A4nzer from comment #2)
> Created attachment 140905 [details] [review] [review]
> Use kvmalloc in amdgpu_uvd_suspend
>=20
> Does this patch help by any chance?
>=20
> If not, can you bisect between 4.18-rc1 and -rc6? Note that from your
> description, you'll need to test for at least one day before declaring=
 a
> commit good (if you hit a failure, you can immediately declare that co=
mmit
> bad).

Hi Michel,

Thank you for the patch.  I've rebuilt the kernel with the changes in your
patch and am currently going to test it out over the next several days.

I've noticed that the problem seems to occur when there is a large amount of
memory pressure (e.g. I'm running a VM where I've allocated lots of memory),
and almost always after I've just opened a new application windows.  Perhap=
s a
web browser, text editor, etc.

Today I had a scenario (running the vanilla 4.18-rc7) where I simply ran ou=
t of
memory *BUT* this occurred in the absence of opening up a new application
window, and the system was able to recover gracefully.

I do have 16GB of RAM in my system, but I can easily hit the limit by runni=
ng a
VM and opening several applications.

Should I conduct tests with memory pressure applied to see if your patch
addresses the issue?  Are we trying to simulate the same scenario as before?

I'll report back my results.

P.S. I've attached another dmesg.log from the out of memory problems I ran =
into
today (again running on vanilla 4.18-rc7 and not using your patch) so you c=
an
compare the two scenarios.  This scenario did not result in a complete syst=
em
lockup, so something different must have occurred.

Thanks!


You are receiving this mail because:
  • You are the assignee for the bug.
= --15331888700.d59b.19577-- --===============1918368754== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg== --===============1918368754==--