All of lore.kernel.org
 help / color / mirror / Atom feed
* 5.13-rc3 Renoir (ChipID = 0x1636): error GPU reset, fences timed out, failed to initialize parser -125
@ 2021-05-29 11:40 Julian Wollrath
  2021-05-29 12:16 ` Julian Wollrath
  2021-06-01 15:28 ` Alex Deucher
  0 siblings, 2 replies; 3+ messages in thread
From: Julian Wollrath @ 2021-05-29 11:40 UTC (permalink / raw)
  To: amd-gfx

Hello,

on 5.13-rc3 with a 4650U (Renoir, ChipID = 0x1636) I am able to
reliably get the GPU to reset under X11/Xorg with the amdgpu driver
(current from git) and having persistent problems afterwards. I
achieve this in the following way (using ROOT (root.cern.ch) to draw a
histogram):

$ root
# TH1 *t = new TH1I("test", "test", 2, -.5, 1.5)
# t->Fill(1)
# t->Draw()


this opens a window showing the histogram and results in

kernel: [  370.709485] [drm:amdgpu_dm_atomic_commit_tail [amdgpu]] *ERROR* Waiting for fences timed out!
kernel: [  375.820701] [drm:amdgpu_dm_atomic_commit_tail [amdgpu]] *ERROR* Waiting for fences timed out!
kernel: [  375.830689] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, signaled seq=3783, emitted seq=3785
kernel: [  375.831015] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process Xorg pid 1028 thread Xorg
:cs0 pid 1029
kernel: [  375.831174] amdgpu 0000:03:00.0: amdgpu: GPU reset begin!
kernel: [  375.924610] [drm] free PSP TMR buffer
kernel: [  375.951409] amdgpu 0000:03:00.0: amdgpu: MODE2 reset
kernel: [  375.951531] amdgpu 0000:03:00.0: amdgpu: GPU reset succeeded, trying to resume
kernel: [  375.951712] [drm] PCIE GART of 1024M enabled.
kernel: [  375.951715] [drm] PTB located at 0x000000F400900000
kernel: [  375.951957] [drm] PSP is resuming...
kernel: [  375.971811] [drm] reserve 0x400000 from 0xf41f800000 for PSP TMR
kernel: [  376.277979] amdgpu 0000:03:00.0: amdgpu: RAS: optional ras ta ucode is not available
kernel: [  376.288823] amdgpu 0000:03:00.0: amdgpu: RAP: optional rap ta ucode is not available
kernel: [  376.288830] amdgpu 0000:03:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not available
kernel: [  376.288836] amdgpu 0000:03:00.0: amdgpu: SMU is resuming...
kernel: [  376.290539] amdgpu 0000:03:00.0: amdgpu: SMU is resumed successfully!
kernel: [  376.648574] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
kernel: [  376.649562] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
kernel: [  376.721210] [drm] kiq ring mec 2 pipe 1 q 0
kernel: [  376.722457] [drm] DMUB hardware initialized: version=0x01020008
kernel: [  376.936133] [drm] VCN decode and encode initialized successfully(under DPG Mode).
kernel: [  376.936628] [drm] JPEG decode initialized successfully.
kernel: [  376.936635] amdgpu 0000:03:00.0: amdgpu: ring gfx uses VM inv eng 0 on hub 0
kernel: [  376.936639] amdgpu 0000:03:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 1 on hub 0
kernel: [  376.936641] amdgpu 0000:03:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 4 on hub 0
kernel: [  376.936643] amdgpu 0000:03:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 5 on hub 0
kernel: [  376.936645] amdgpu 0000:03:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 6 on hub 0
kernel: [  376.936646] amdgpu 0000:03:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 7 on hub 0
kernel: [  376.936648] amdgpu 0000:03:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 8 on hub 0
kernel: [  376.936649] amdgpu 0000:03:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 9 on hub 0
kernel: [  376.936650] amdgpu 0000:03:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 10 on hub 0
kernel: [  376.936652] amdgpu 0000:03:00.0: amdgpu: ring kiq_2.1.0 uses VM inv eng 11 on hub 0
kernel: [  376.936654] amdgpu 0000:03:00.0: amdgpu: ring sdma0 uses VM inv eng 0 on hub 1
kernel: [  376.936656] amdgpu 0000:03:00.0: amdgpu: ring vcn_dec uses VM inv eng 1 on hub 1
kernel: [  376.936657] amdgpu 0000:03:00.0: amdgpu: ring vcn_enc0 uses VM inv eng 4 on hub 1
kernel: [  376.936659] amdgpu 0000:03:00.0: amdgpu: ring vcn_enc1 uses VM inv eng 5 on hub 1
kernel: [  376.936660] amdgpu 0000:03:00.0: amdgpu: ring jpeg_dec uses VM inv eng 6 on hub 1
kernel: [  376.945068] amdgpu 0000:03:00.0: amdgpu: recover vram bo from shadow start
kernel: [  376.945075] amdgpu 0000:03:00.0: amdgpu: recover vram bo from shadow done
kernel: [  376.945081] [drm] Skip scheduling IBs!
kernel: [  376.945084] [drm] Skip scheduling IBs!
kernel: [  376.945119] amdgpu 0000:03:00.0: amdgpu: GPU reset(2) succeeded!
kernel: [  376.945161] [drm] Skip scheduling IBs!
kernel: [  376.945169] [drm] Skip scheduling IBs!
kernel: [  376.945172] [drm] Skip scheduling IBs!
kernel: [  376.945235] [drm] Skip scheduling IBs!
kernel: [  376.945243] [drm] Skip scheduling IBs!
kernel: [  376.945246] [drm] Skip scheduling IBs!
kernel: [  376.945248] [drm] Skip scheduling IBs!
kernel: [  376.945251] [drm] Skip scheduling IBs!
kernel: [  376.945253] [drm] Skip scheduling IBs!
kernel: [  376.948659] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
kernel: [  381.640053] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
kernel: [  381.641067] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
kernel: [  381.641510] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
kernel: [  382.795658] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
kernel: [  382.796462] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
kernel: [  383.157991] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
kernel: [  383.224317] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
kernel: [  383.225321] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
kernel: [  383.226164] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
kernel: [  383.226836] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
kernel: [  383.227294] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
kernel: [  383.227573] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
kernel: [  383.228525] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
kernel: [  391.623243] amdgpu_cs_ioctl: 13 callbacks suppressed
kernel: [  391.623250] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
kernel: [  391.623979] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
kernel: [  396.615491] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
kernel: [  396.616130] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
kernel: [  398.242898] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
[...]

I guess, that this might not be detailed enough information but I am
happy to provide more, if you tell me, what exactly is needed.


With best regards,
Julian Wollrath

--
 ()  ascii ribbon campaign - against html e-mail
 /\                        - against proprietary attachments
_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: 5.13-rc3 Renoir (ChipID = 0x1636): error GPU reset, fences timed out, failed to initialize parser -125
  2021-05-29 11:40 5.13-rc3 Renoir (ChipID = 0x1636): error GPU reset, fences timed out, failed to initialize parser -125 Julian Wollrath
@ 2021-05-29 12:16 ` Julian Wollrath
  2021-06-01 15:28 ` Alex Deucher
  1 sibling, 0 replies; 3+ messages in thread
From: Julian Wollrath @ 2021-05-29 12:16 UTC (permalink / raw)
  To: amd-gfx

Hello,

> on 5.13-rc3 with a 4650U (Renoir, ChipID = 0x1636) I am able to
> reliably get the GPU to reset under X11/Xorg with the amdgpu driver
> (current from git) and having persistent problems afterwards. I
> achieve this in the following way (using ROOT (root.cern.ch) to draw a
> histogram):

p.s.: this also happens with 5.10.40, so it is not a recent regression.


Best regards,
Julian Wollrath

--
 ()  ascii ribbon campaign - against html e-mail
 /\                        - against proprietary attachments
_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: 5.13-rc3 Renoir (ChipID = 0x1636): error GPU reset, fences timed out, failed to initialize parser -125
  2021-05-29 11:40 5.13-rc3 Renoir (ChipID = 0x1636): error GPU reset, fences timed out, failed to initialize parser -125 Julian Wollrath
  2021-05-29 12:16 ` Julian Wollrath
@ 2021-06-01 15:28 ` Alex Deucher
  1 sibling, 0 replies; 3+ messages in thread
From: Alex Deucher @ 2021-06-01 15:28 UTC (permalink / raw)
  To: Julian Wollrath; +Cc: amd-gfx list

On Mon, May 31, 2021 at 4:17 AM Julian Wollrath <jwollrath@web.de> wrote:
>
> Hello,
>
> on 5.13-rc3 with a 4650U (Renoir, ChipID = 0x1636) I am able to
> reliably get the GPU to reset under X11/Xorg with the amdgpu driver
> (current from git) and having persistent problems afterwards. I
> achieve this in the following way (using ROOT (root.cern.ch) to draw a
> histogram):
>
> $ root
> # TH1 *t = new TH1I("test", "test", 2, -.5, 1.5)
> # t->Fill(1)
> # t->Draw()
>
>
> this opens a window showing the histogram and results in
>
> kernel: [  370.709485] [drm:amdgpu_dm_atomic_commit_tail [amdgpu]] *ERROR* Waiting for fences timed out!
> kernel: [  375.820701] [drm:amdgpu_dm_atomic_commit_tail [amdgpu]] *ERROR* Waiting for fences timed out!
> kernel: [  375.830689] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, signaled seq=3783, emitted seq=3785
> kernel: [  375.831015] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process Xorg pid 1028 thread Xorg
> :cs0 pid 1029
> kernel: [  375.831174] amdgpu 0000:03:00.0: amdgpu: GPU reset begin!
> kernel: [  375.924610] [drm] free PSP TMR buffer
> kernel: [  375.951409] amdgpu 0000:03:00.0: amdgpu: MODE2 reset
> kernel: [  375.951531] amdgpu 0000:03:00.0: amdgpu: GPU reset succeeded, trying to resume
> kernel: [  375.951712] [drm] PCIE GART of 1024M enabled.
> kernel: [  375.951715] [drm] PTB located at 0x000000F400900000
> kernel: [  375.951957] [drm] PSP is resuming...
> kernel: [  375.971811] [drm] reserve 0x400000 from 0xf41f800000 for PSP TMR
> kernel: [  376.277979] amdgpu 0000:03:00.0: amdgpu: RAS: optional ras ta ucode is not available
> kernel: [  376.288823] amdgpu 0000:03:00.0: amdgpu: RAP: optional rap ta ucode is not available
> kernel: [  376.288830] amdgpu 0000:03:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not available
> kernel: [  376.288836] amdgpu 0000:03:00.0: amdgpu: SMU is resuming...
> kernel: [  376.290539] amdgpu 0000:03:00.0: amdgpu: SMU is resumed successfully!
> kernel: [  376.648574] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
> kernel: [  376.649562] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
> kernel: [  376.721210] [drm] kiq ring mec 2 pipe 1 q 0
> kernel: [  376.722457] [drm] DMUB hardware initialized: version=0x01020008
> kernel: [  376.936133] [drm] VCN decode and encode initialized successfully(under DPG Mode).
> kernel: [  376.936628] [drm] JPEG decode initialized successfully.
> kernel: [  376.936635] amdgpu 0000:03:00.0: amdgpu: ring gfx uses VM inv eng 0 on hub 0
> kernel: [  376.936639] amdgpu 0000:03:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 1 on hub 0
> kernel: [  376.936641] amdgpu 0000:03:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 4 on hub 0
> kernel: [  376.936643] amdgpu 0000:03:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 5 on hub 0
> kernel: [  376.936645] amdgpu 0000:03:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 6 on hub 0
> kernel: [  376.936646] amdgpu 0000:03:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 7 on hub 0
> kernel: [  376.936648] amdgpu 0000:03:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 8 on hub 0
> kernel: [  376.936649] amdgpu 0000:03:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 9 on hub 0
> kernel: [  376.936650] amdgpu 0000:03:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 10 on hub 0
> kernel: [  376.936652] amdgpu 0000:03:00.0: amdgpu: ring kiq_2.1.0 uses VM inv eng 11 on hub 0
> kernel: [  376.936654] amdgpu 0000:03:00.0: amdgpu: ring sdma0 uses VM inv eng 0 on hub 1
> kernel: [  376.936656] amdgpu 0000:03:00.0: amdgpu: ring vcn_dec uses VM inv eng 1 on hub 1
> kernel: [  376.936657] amdgpu 0000:03:00.0: amdgpu: ring vcn_enc0 uses VM inv eng 4 on hub 1
> kernel: [  376.936659] amdgpu 0000:03:00.0: amdgpu: ring vcn_enc1 uses VM inv eng 5 on hub 1
> kernel: [  376.936660] amdgpu 0000:03:00.0: amdgpu: ring jpeg_dec uses VM inv eng 6 on hub 1
> kernel: [  376.945068] amdgpu 0000:03:00.0: amdgpu: recover vram bo from shadow start
> kernel: [  376.945075] amdgpu 0000:03:00.0: amdgpu: recover vram bo from shadow done
> kernel: [  376.945081] [drm] Skip scheduling IBs!
> kernel: [  376.945084] [drm] Skip scheduling IBs!
> kernel: [  376.945119] amdgpu 0000:03:00.0: amdgpu: GPU reset(2) succeeded!

The GPU reset was successful.  You'll need to restart your GUI
environment.  Unfortunately no current desktop environments on Linux
properly handle lost contexts.

Alex


> kernel: [  376.945161] [drm] Skip scheduling IBs!
> kernel: [  376.945169] [drm] Skip scheduling IBs!
> kernel: [  376.945172] [drm] Skip scheduling IBs!
> kernel: [  376.945235] [drm] Skip scheduling IBs!
> kernel: [  376.945243] [drm] Skip scheduling IBs!
> kernel: [  376.945246] [drm] Skip scheduling IBs!
> kernel: [  376.945248] [drm] Skip scheduling IBs!
> kernel: [  376.945251] [drm] Skip scheduling IBs!
> kernel: [  376.945253] [drm] Skip scheduling IBs!
> kernel: [  376.948659] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
> kernel: [  381.640053] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
> kernel: [  381.641067] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
> kernel: [  381.641510] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
> kernel: [  382.795658] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
> kernel: [  382.796462] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
> kernel: [  383.157991] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
> kernel: [  383.224317] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
> kernel: [  383.225321] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
> kernel: [  383.226164] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
> kernel: [  383.226836] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
> kernel: [  383.227294] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
> kernel: [  383.227573] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
> kernel: [  383.228525] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
> kernel: [  391.623243] amdgpu_cs_ioctl: 13 callbacks suppressed
> kernel: [  391.623250] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
> kernel: [  391.623979] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
> kernel: [  396.615491] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
> kernel: [  396.616130] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
> kernel: [  398.242898] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
> [...]
>
> I guess, that this might not be detailed enough information but I am
> happy to provide more, if you tell me, what exactly is needed.
>
>
> With best regards,
> Julian Wollrath
>
> --
>  ()  ascii ribbon campaign - against html e-mail
>  /\                        - against proprietary attachments
> _______________________________________________
> amd-gfx mailing list
> amd-gfx@lists.freedesktop.org
> https://lists.freedesktop.org/mailman/listinfo/amd-gfx
_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2021-06-01 15:28 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2021-05-29 11:40 5.13-rc3 Renoir (ChipID = 0x1636): error GPU reset, fences timed out, failed to initialize parser -125 Julian Wollrath
2021-05-29 12:16 ` Julian Wollrath
2021-06-01 15:28 ` Alex Deucher

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.