dri-devel.lists.freedesktop.org archive mirror
 help / color / mirror / Atom feed
* [Bug 215892] New: 6500XT [drm:amdgpu_dm_init.isra.0.cold [amdgpu]] *ERROR* Failed to register vline0 irq 30!
@ 2022-04-27  2:23 bugzilla-daemon
  2022-04-27  2:24 ` [Bug 215892] " bugzilla-daemon
                   ` (5 more replies)
  0 siblings, 6 replies; 7+ messages in thread
From: bugzilla-daemon @ 2022-04-27  2:23 UTC (permalink / raw)
  To: dri-devel

https://bugzilla.kernel.org/show_bug.cgi?id=215892

            Bug ID: 215892
           Summary: 6500XT [drm:amdgpu_dm_init.isra.0.cold [amdgpu]]
                    *ERROR* Failed to register vline0 irq 30!
           Product: Drivers
           Version: 2.5
    Kernel Version: 5.18-rc4
          Hardware: All
                OS: Linux
              Tree: Mainline
            Status: NEW
          Severity: normal
          Priority: P1
         Component: Video(DRI - non Intel)
          Assignee: drivers_video-dri@kernel-bugs.osdl.org
          Reporter: ulatec@gmail.com
        Regression: No

Created attachment 300811
  --> https://bugzilla.kernel.org/attachment.cgi?id=300811&action=edit
New PowerColor board with chip that produces kernel errors

Hello!

This is my first time submitted a bug here. I apologize if I make any mistakes
here, but I am going to do my best to describe the efforts that I have gone
through to attempt to resolve this issue on my own. As well, I hope not to
overload with information, but just wish to help with skipping over the basic
questions.

I have numerous PowerColor RX 6500XT graphics cards, and all of them with a
specific chip package (picture attached) have the same issue. Any PowerColor RX
6500XT with 2152 printed at the top of the package, and "TFTB43.00" at the
bottom of the package suffers the same kernel errors. Previously (up until a
few weeks ago) PowerColor was shipping 6500XT boards with chips that were
stamped with 2146 and "TFAW62.T5" at the top and bottom of the package
respectively. Boards with those chips have zero kernel errors and work
flawlessly. As well, I have tested various 6500XT and 6400 boards from
different AIB partners of AMD and have not had any issues other than this
specific variant from PowerColor.


To be honest, I am not sure if the root of the problem is in pcieport or in
amdgpu, but the amdgpu error throws first. 

I have attached the full dmesg output but to save some time here are some
highlighted lines of issue:

[    5.506718] [drm:amdgpu_dm_init.isra.0.cold [amdgpu]] *ERROR* Failed to
register vline0 irq 30!
[   14.368915] pcieport 0000:01:00.0: can't change power state from D0 to D3hot
(config space inaccessible)
[   15.270778] pcieport 0000:01:00.0: can't change power state from D3cold to
D0 (config space inaccessible)
[   15.270799] pcieport 0000:02:00.0: can't change power state from D3cold to
D0 (config space inaccessible)
[   25.478689] pcieport 0000:01:00.0: can't change power state from D3cold to
D0 (config space inaccessible)
[   25.478696] pcieport 0000:02:00.0: can't change power state from D3cold to
D0 (config space inaccessible)
[   25.722619] amdgpu 0000:03:00.0: can't change power state from D3cold to D0
(config space inaccessible)
[   35.833714] [drm:gmc_v10_0_flush_vm_hub.constprop.0 [amdgpu]] *ERROR*
Timeout waiting for VM flush hub: 0!
[   35.941450] [drm:gmc_v10_0_flush_vm_hub.constprop.0 [amdgpu]] *ERROR*
Timeout waiting for sem acquire in VM flush!
[   36.048999] [drm:gmc_v10_0_flush_vm_hub.constprop.0 [amdgpu]] *ERROR*
Timeout waiting for VM flush hub: 1!
[   36.156835] [drm:gmc_v10_0_flush_vm_hub.constprop.0 [amdgpu]] *ERROR*
Timeout waiting for sem acquire in VM flush!
[   36.264770] [drm:gmc_v10_0_flush_vm_hub.constprop.0 [amdgpu]] *ERROR*
Timeout waiting for VM flush hub: 1!
[   36.372616] [drm:gmc_v10_0_flush_vm_hub.constprop.0 [amdgpu]] *ERROR*
Timeout waiting for VM flush hub: 0!


What I have attempted so far:

Results were the same for the following kernels: 5.4.190, 5.10.111, 5.15.34,
5.17.4 and now 5.18-rc4.

Many different motherboards with varying chipsets (B250, H510, X370, B550).
Same result.

Enabling/Disabling clock gating, ASPM, extended synch control for PCIE. Same
result.

The problematic cards from PowerColor indeed do work in Windows without issue.
This leads me to believe that something may have changed with TUL's
implementation of the 6500XT from one production run to another. Hopefully
someone from the amdgpu team can help here.


To summarize, PowerColor's prior 6500XT production worked flawlessly with the
drivers in the mainline kernel. New production for some reason is no longer
usable. New cards work in Windows, but now throw the errors above. Not an
isolated issue of one card, as I have tested 12 identical ones with the same
chip and all have the same result regardless of motherboard, cpu, power,
kernel, OS, etc. Cards (6500XT and 6400s) from other partners have not had any
issues.

-- 
You may reply to this email to add a comment.

You are receiving this mail because:
You are watching the assignee of the bug.

^ permalink raw reply	[flat|nested] 7+ messages in thread

* [Bug 215892] 6500XT [drm:amdgpu_dm_init.isra.0.cold [amdgpu]] *ERROR* Failed to register vline0 irq 30!
  2022-04-27  2:23 [Bug 215892] New: 6500XT [drm:amdgpu_dm_init.isra.0.cold [amdgpu]] *ERROR* Failed to register vline0 irq 30! bugzilla-daemon
@ 2022-04-27  2:24 ` bugzilla-daemon
  2022-04-27  2:28 ` bugzilla-daemon
                   ` (4 subsequent siblings)
  5 siblings, 0 replies; 7+ messages in thread
From: bugzilla-daemon @ 2022-04-27  2:24 UTC (permalink / raw)
  To: dri-devel

https://bugzilla.kernel.org/show_bug.cgi?id=215892

--- Comment #1 from Mark Johnston (ulatec@gmail.com) ---
Created attachment 300812
  --> https://bugzilla.kernel.org/attachment.cgi?id=300812&action=edit
full dmesg

-- 
You may reply to this email to add a comment.

You are receiving this mail because:
You are watching the assignee of the bug.

^ permalink raw reply	[flat|nested] 7+ messages in thread

* [Bug 215892] 6500XT [drm:amdgpu_dm_init.isra.0.cold [amdgpu]] *ERROR* Failed to register vline0 irq 30!
  2022-04-27  2:23 [Bug 215892] New: 6500XT [drm:amdgpu_dm_init.isra.0.cold [amdgpu]] *ERROR* Failed to register vline0 irq 30! bugzilla-daemon
  2022-04-27  2:24 ` [Bug 215892] " bugzilla-daemon
@ 2022-04-27  2:28 ` bugzilla-daemon
  2022-04-27  2:33 ` bugzilla-daemon
                   ` (3 subsequent siblings)
  5 siblings, 0 replies; 7+ messages in thread
From: bugzilla-daemon @ 2022-04-27  2:28 UTC (permalink / raw)
  To: dri-devel

https://bugzilla.kernel.org/show_bug.cgi?id=215892

--- Comment #2 from Mark Johnston (ulatec@gmail.com) ---
Created attachment 300813
  --> https://bugzilla.kernel.org/attachment.cgi?id=300813&action=edit
Prior PowerColor (6500XT) board with chip that does not produce error

-- 
You may reply to this email to add a comment.

You are receiving this mail because:
You are watching the assignee of the bug.

^ permalink raw reply	[flat|nested] 7+ messages in thread

* [Bug 215892] 6500XT [drm:amdgpu_dm_init.isra.0.cold [amdgpu]] *ERROR* Failed to register vline0 irq 30!
  2022-04-27  2:23 [Bug 215892] New: 6500XT [drm:amdgpu_dm_init.isra.0.cold [amdgpu]] *ERROR* Failed to register vline0 irq 30! bugzilla-daemon
  2022-04-27  2:24 ` [Bug 215892] " bugzilla-daemon
  2022-04-27  2:28 ` bugzilla-daemon
@ 2022-04-27  2:33 ` bugzilla-daemon
  2022-04-27  2:52 ` bugzilla-daemon
                   ` (2 subsequent siblings)
  5 siblings, 0 replies; 7+ messages in thread
From: bugzilla-daemon @ 2022-04-27  2:33 UTC (permalink / raw)
  To: dri-devel

https://bugzilla.kernel.org/show_bug.cgi?id=215892

--- Comment #3 from Mark Johnston (ulatec@gmail.com) ---
Created attachment 300814
  --> https://bugzilla.kernel.org/attachment.cgi?id=300814&action=edit
lspci

The hardware configuration that was most recently tested.

-- 
You may reply to this email to add a comment.

You are receiving this mail because:
You are watching the assignee of the bug.

^ permalink raw reply	[flat|nested] 7+ messages in thread

* [Bug 215892] 6500XT [drm:amdgpu_dm_init.isra.0.cold [amdgpu]] *ERROR* Failed to register vline0 irq 30!
  2022-04-27  2:23 [Bug 215892] New: 6500XT [drm:amdgpu_dm_init.isra.0.cold [amdgpu]] *ERROR* Failed to register vline0 irq 30! bugzilla-daemon
                   ` (2 preceding siblings ...)
  2022-04-27  2:33 ` bugzilla-daemon
@ 2022-04-27  2:52 ` bugzilla-daemon
  2022-04-27  8:59 ` bugzilla-daemon
  2022-04-27 12:22 ` bugzilla-daemon
  5 siblings, 0 replies; 7+ messages in thread
From: bugzilla-daemon @ 2022-04-27  2:52 UTC (permalink / raw)
  To: dri-devel

https://bugzilla.kernel.org/show_bug.cgi?id=215892

--- Comment #4 from Mark Johnston (ulatec@gmail.com) ---
Created attachment 300815
  --> https://bugzilla.kernel.org/attachment.cgi?id=300815&action=edit
acpidump summary

-- 
You may reply to this email to add a comment.

You are receiving this mail because:
You are watching the assignee of the bug.

^ permalink raw reply	[flat|nested] 7+ messages in thread

* [Bug 215892] 6500XT [drm:amdgpu_dm_init.isra.0.cold [amdgpu]] *ERROR* Failed to register vline0 irq 30!
  2022-04-27  2:23 [Bug 215892] New: 6500XT [drm:amdgpu_dm_init.isra.0.cold [amdgpu]] *ERROR* Failed to register vline0 irq 30! bugzilla-daemon
                   ` (3 preceding siblings ...)
  2022-04-27  2:52 ` bugzilla-daemon
@ 2022-04-27  8:59 ` bugzilla-daemon
  2022-04-27 12:22 ` bugzilla-daemon
  5 siblings, 0 replies; 7+ messages in thread
From: bugzilla-daemon @ 2022-04-27  8:59 UTC (permalink / raw)
  To: dri-devel

https://bugzilla.kernel.org/show_bug.cgi?id=215892

Artem S. Tashkinov (aros@gmx.com) changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |RESOLVED
                 CC|                            |aros@gmx.com
         Resolution|---                         |ANSWERED

--- Comment #5 from Artem S. Tashkinov (aros@gmx.com) ---
Please search for dupes and refile if missing:

https://gitlab.freedesktop.org/drm/amd/-/issues

-- 
You may reply to this email to add a comment.

You are receiving this mail because:
You are watching the assignee of the bug.

^ permalink raw reply	[flat|nested] 7+ messages in thread

* [Bug 215892] 6500XT [drm:amdgpu_dm_init.isra.0.cold [amdgpu]] *ERROR* Failed to register vline0 irq 30!
  2022-04-27  2:23 [Bug 215892] New: 6500XT [drm:amdgpu_dm_init.isra.0.cold [amdgpu]] *ERROR* Failed to register vline0 irq 30! bugzilla-daemon
                   ` (4 preceding siblings ...)
  2022-04-27  8:59 ` bugzilla-daemon
@ 2022-04-27 12:22 ` bugzilla-daemon
  5 siblings, 0 replies; 7+ messages in thread
From: bugzilla-daemon @ 2022-04-27 12:22 UTC (permalink / raw)
  To: dri-devel

https://bugzilla.kernel.org/show_bug.cgi?id=215892

--- Comment #6 from Mark Johnston (ulatec@gmail.com) ---
There is one potentially similar report here:
https://gitlab.freedesktop.org/drm/amd/-/issues/1933

Though both of the users report having working desktop environments and nothing
about amdgpu not being able to come out of D3cold. In my case above the gpus
are non-responsive, as they are stuck in the d3cold power state. So the
amdgpu_dm_init.isra may be the same, but the results and impacts differ.

Not really sure what to do here. Should I add my findings (hardware tests) to
that report?

-- 
You may reply to this email to add a comment.

You are receiving this mail because:
You are watching the assignee of the bug.

^ permalink raw reply	[flat|nested] 7+ messages in thread

end of thread, other threads:[~2022-04-27 12:22 UTC | newest]

Thread overview: 7+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2022-04-27  2:23 [Bug 215892] New: 6500XT [drm:amdgpu_dm_init.isra.0.cold [amdgpu]] *ERROR* Failed to register vline0 irq 30! bugzilla-daemon
2022-04-27  2:24 ` [Bug 215892] " bugzilla-daemon
2022-04-27  2:28 ` bugzilla-daemon
2022-04-27  2:33 ` bugzilla-daemon
2022-04-27  2:52 ` bugzilla-daemon
2022-04-27  8:59 ` bugzilla-daemon
2022-04-27 12:22 ` bugzilla-daemon

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).