dri-devel.lists.freedesktop.org archive mirror
* [Bug 208647] New: persistent amdgpu: [mmhub] page faults
From: bugzilla-daemon @ 2020-07-21 14:35 UTC (permalink / raw)
  To: dri-devel

https://bugzilla.kernel.org/show_bug.cgi?id=208647

            Bug ID: 208647
           Summary: persistent amdgpu: [mmhub] page faults
           Product: Drivers
           Version: 2.5
    Kernel Version: 5.4.0-42-generic
          Hardware: All
                OS: Linux
              Tree: Mainline
            Status: NEW
          Severity: normal
          Priority: P1
         Component: Video(DRI - non Intel)
          Assignee: drivers_video-dri@kernel-bugs.osdl.org
          Reporter: jay.foad@gmail.com
        Regression: No

Whenever X is running I get persistent page faults like this:

Jul 21 15:19:16 jay-X470-AORUS-ULTRA-GAMING kernel: amdgpu 0000:0c:00.0: amdgpu: [mmhub] page fault (src_id:0 ring:169 vmid:0 pasid:0, for process  pid 0 thread  pid 0)
Jul 21 15:19:16 jay-X470-AORUS-ULTRA-GAMING kernel: amdgpu 0000:0c:00.0: amdgpu:   in page starting at address 0x00000000fffb0000 from client 18
Jul 21 15:19:16 jay-X470-AORUS-ULTRA-GAMING kernel: amdgpu 0000:0c:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00041F52
Jul 21 15:19:16 jay-X470-AORUS-ULTRA-GAMING kernel: amdgpu 0000:0c:00.0: amdgpu:          Faulty UTCL2 client ID: 0xf
Jul 21 15:19:16 jay-X470-AORUS-ULTRA-GAMING kernel: amdgpu 0000:0c:00.0: amdgpu:          MORE_FAULTS: 0x0
Jul 21 15:19:16 jay-X470-AORUS-ULTRA-GAMING kernel: amdgpu 0000:0c:00.0: amdgpu:          WALKER_ERROR: 0x1
Jul 21 15:19:16 jay-X470-AORUS-ULTRA-GAMING kernel: amdgpu 0000:0c:00.0: amdgpu:          PERMISSION_FAULTS: 0x5
Jul 21 15:19:16 jay-X470-AORUS-ULTRA-GAMING kernel: amdgpu 0000:0c:00.0: amdgpu:          MAPPING_ERROR: 0x1
Jul 21 15:19:16 jay-X470-AORUS-ULTRA-GAMING kernel: amdgpu 0000:0c:00.0: amdgpu:          RW: 0x1
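
For reference, the individual fields in the log above can be recovered from the raw GCVM_L2_PROTECTION_FAULT_STATUS value. The sketch below is illustrative only: the bit positions and widths in FIELDS are an assumed layout (not stated anywhere in this report) chosen because it reproduces the logged values, and decode_fault_status is a hypothetical helper name.

```python
# Assumed bit layout (shift, width) for GCVM_L2_PROTECTION_FAULT_STATUS.
# This layout is an assumption; it is used here because it reproduces
# the field values printed in the log above.
FIELDS = {
    "MORE_FAULTS": (0, 1),
    "WALKER_ERROR": (1, 3),
    "PERMISSION_FAULTS": (4, 4),
    "MAPPING_ERROR": (8, 1),
    "CID": (9, 9),  # logged as "Faulty UTCL2 client ID"
    "RW": (18, 1),
}


def decode_fault_status(status: int) -> dict:
    """Extract each field by shifting and masking the raw register value."""
    return {
        name: (status >> shift) & ((1 << width) - 1)
        for name, (shift, width) in FIELDS.items()
    }


# Raw value taken from the log above.
decoded = decode_fault_status(0x00041F52)
for name, value in decoded.items():
    print(f"{name}: {value:#x}")
```

Under this assumed layout, the decoded fields agree with the values the kernel printed (client ID 0xf, WALKER_ERROR 0x1, PERMISSION_FAULTS 0x5, MAPPING_ERROR 0x1, RW 0x1).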

Sometimes I get several of these per second. Sometimes there are none for a few
minutes.
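
To put a number on "several per second", one option is to count fault headers per timestamp in saved kernel log text. This is a minimal sketch, assuming each log entry is on a single line with the syslog-style prefix shown above; faults_per_second is a hypothetical helper, and the sample lines are illustrative input modeled on the log format, not additional captured data.

```python
import re
from collections import Counter

# Captures the "Mon DD HH:MM:SS" prefix of lines that announce an mmhub fault.
FAULT_RE = re.compile(
    r"^(\w{3}\s+\d+\s+\d{2}:\d{2}:\d{2})\s.*\[mmhub\] page fault"
)


def faults_per_second(lines):
    """Count '[mmhub] page fault' header lines per one-second timestamp."""
    counts = Counter()
    for line in lines:
        match = FAULT_RE.match(line)
        if match:
            counts[match.group(1)] += 1
    return counts


# Illustrative input only (hostname shortened, entries made up for the example).
sample = [
    "Jul 21 15:19:16 host kernel: amdgpu 0000:0c:00.0: amdgpu: [mmhub] page fault (src_id:0 ring:169 vmid:0 pasid:0)",
    "Jul 21 15:19:16 host kernel: amdgpu 0000:0c:00.0: amdgpu:   in page starting at address 0x00000000fffb0000 from client 18",
    "Jul 21 15:19:16 host kernel: amdgpu 0000:0c:00.0: amdgpu: [mmhub] page fault (src_id:0 ring:169 vmid:0 pasid:0)",
    "Jul 21 15:19:17 host kernel: amdgpu 0000:0c:00.0: amdgpu: [mmhub] page fault (src_id:0 ring:169 vmid:0 pasid:0)",
]
counts = faults_per_second(sample)
for timestamp, n in sorted(counts.items()):
    print(timestamp, n)
```

Only the header line of each fault is counted, so the follow-up detail lines (address, status register, field dump) do not inflate the rate.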

If I boot into runlevel 3 (i.e. without starting X) I get one of these during
boot, but then there are no more after that.

I'm running Ubuntu 20.04 but I also saw this on 18.04.

Kernel version is 5.4.0-42-generic but I also saw this with 5.3.0-51-generic.

I'm using the amdgpu-pro drivers.

Graphics card is a Navi 10.

Motherboard is a Gigabyte X470 AORUS ULTRA GAMING.

CPU is an AMD Ryzen 9 3900X.

A very similar-sounding bug was reported here:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1888116

-- 
You are receiving this mail because:
You are watching the assignee of the bug.
_______________________________________________
dri-devel mailing list
dri-devel@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/dri-devel


* [Bug 208647] persistent amdgpu: [mmhub] page faults
From: bugzilla-daemon @ 2020-07-21 15:13 UTC (permalink / raw)
  To: dri-devel

https://bugzilla.kernel.org/show_bug.cgi?id=208647

Alex Deucher (alexdeucher@gmail.com) changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |alexdeucher@gmail.com

--- Comment #1 from Alex Deucher (alexdeucher@gmail.com) ---
This is most likely a userspace issue (e.g., mesa).  The kernel driver is just
the messenger.


* [Bug 208647] persistent amdgpu: [mmhub] page faults
From: bugzilla-daemon @ 2020-07-21 15:15 UTC (permalink / raw)
  To: dri-devel

https://bugzilla.kernel.org/show_bug.cgi?id=208647

--- Comment #2 from Jay Foad (jay.foad@gmail.com) ---
Wouldn't there normally be a useful pid in the first line if it came from
userspace?


* [Bug 208647] persistent amdgpu: [mmhub] page faults
From: bugzilla-daemon @ 2020-07-21 15:22 UTC (permalink / raw)
  To: dri-devel

https://bugzilla.kernel.org/show_bug.cgi?id=208647

Nicolai Hähnle (nhaehnle@gmail.com) changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |nhaehnle@gmail.com

--- Comment #3 from Nicolai Hähnle (nhaehnle@gmail.com) ---
Hi Alex, I asked Jay to report this because (1) the fact that there's a fault
during boot is suspicious and points in the direction of this being the
kernel's fault and (2) the fact that it's an *mmhub* fault is even more
suspicious.

Certainly this seems to happen without Mesa video encode/decode activity, so it
can't really be Mesa's (or any graphics driver's) fault.

Someone suggested that audio support also goes through mmhub and that it may be
related. I have no idea if that's true.


* [Bug 208647] persistent amdgpu: [mmhub] page faults
From: bugzilla-daemon @ 2020-07-21 15:26 UTC (permalink / raw)
  To: dri-devel

https://bugzilla.kernel.org/show_bug.cgi?id=208647

--- Comment #4 from Alex Deucher (alexdeucher@gmail.com) ---
Please attach your full dmesg output and xorg log (if using X).


* [Bug 208647] persistent amdgpu: [mmhub] page faults
From: bugzilla-daemon @ 2020-07-21 15:34 UTC (permalink / raw)
  To: dri-devel

https://bugzilla.kernel.org/show_bug.cgi?id=208647

--- Comment #5 from Jay Foad (jay.foad@gmail.com) ---
Created attachment 290439
  --> https://bugzilla.kernel.org/attachment.cgi?id=290439&action=edit
output of journalctl -b-5 -k


* [Bug 208647] persistent amdgpu: [mmhub] page faults
From: bugzilla-daemon @ 2020-09-28 15:17 UTC (permalink / raw)
  To: dri-devel

https://bugzilla.kernel.org/show_bug.cgi?id=208647

Stefan Winter (mail@stefan-winter.de) changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |mail@stefan-winter.de

--- Comment #6 from Stefan Winter (mail@stefan-winter.de) ---
FWIW, this still (or again) happens with 5.9.0-RC7 on a Navi 10. It did not
happen on 5.8.6 with a slightly different .config, though.

Attaching a full dmesg. Note that the page faults start happening very shortly
after the snd_hda_intel initialization, which activates amdgpu.


* [Bug 208647] persistent amdgpu: [mmhub] page faults
From: bugzilla-daemon @ 2020-09-28 15:18 UTC (permalink / raw)
  To: dri-devel

https://bugzilla.kernel.org/show_bug.cgi?id=208647

--- Comment #7 from Stefan Winter (mail@stefan-winter.de) ---
Created attachment 292697
  --> https://bugzilla.kernel.org/attachment.cgi?id=292697&action=edit
dmesg on 5.9.0-RC7

dmesg
