All of lore.kernel.org
 help / color / mirror / Atom feed
* regression on raven (gmc9)
@ 2018-04-02 11:17 Tom St Denis
       [not found] ` <2bd09049-695a-ef03-3a45-53610d366b38-5C7GfCeVMHo@public.gmane.org>
  0 siblings, 1 reply; 7+ messages in thread
From: Tom St Denis @ 2018-04-02 11:17 UTC (permalink / raw)
  To: amd-gfx-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW
  Cc: Deucher, Alexander, Huang, Ray, Koenig, Christian

This commit

[root@raven linux]# git bisect bad
c704d7a15d292e6e222b7e990ed7e9f41617cdd8 is the first bad commit
commit c704d7a15d292e6e222b7e990ed7e9f41617cdd8
Author: Alex Deucher <alexander.deucher@amd.com>
Date:   Tue Mar 27 17:10:56 2018 -0500

     drm/amdgpu/gmc9: use amdgpu_ring_emit_reg_write_reg_wait in gpu tlb 
flush

     Use amdgpu_ring_emit_reg_write_reg_wait.  On engines that support it,
     it provides a write and wait in a single packet which avoids a missed
     ack if a world switch happens between the request and waiting for the
     ack.

     Reviewed-by: Huang Rui <ray.huang@amd.com>
     Reviewed-by: Christian König <christian.koenig@amd.com>
     Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

:040000 040000 a08a12a316d09ea231337136cc7aa076d4da44a8 
521c8a18b0c63e185e416a267619a92822d7f268 M      drivers

Causes my raven1 to lockup on starting X with a gfx ring stall.

Tom
_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: regression on raven (gmc9)
       [not found] ` <2bd09049-695a-ef03-3a45-53610d366b38-5C7GfCeVMHo@public.gmane.org>
@ 2018-04-02 15:00   ` Alex Deucher
       [not found]     ` <CADnq5_PDpDRCGisjyOTybqjSM3CcUjjnR-xZHJ8KnG70E3kohw-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
  0 siblings, 1 reply; 7+ messages in thread
From: Alex Deucher @ 2018-04-02 15:00 UTC (permalink / raw)
  To: Tom St Denis
  Cc: Deucher, Alexander, Huang, Ray, Koenig, Christian,
	amd-gfx-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW

On Mon, Apr 2, 2018 at 7:17 AM, Tom St Denis <tstdenis@amd.com> wrote:
> This commit
>
> [root@raven linux]# git bisect bad
> c704d7a15d292e6e222b7e990ed7e9f41617cdd8 is the first bad commit
> commit c704d7a15d292e6e222b7e990ed7e9f41617cdd8
> Author: Alex Deucher <alexander.deucher@amd.com>
> Date:   Tue Mar 27 17:10:56 2018 -0500
>
>     drm/amdgpu/gmc9: use amdgpu_ring_emit_reg_write_reg_wait in gpu tlb
> flush
>
>     Use amdgpu_ring_emit_reg_write_reg_wait.  On engines that support it,
>     it provides a write and wait in a single packet which avoids a missed
>     ack if a world switch happens between the request and waiting for the
>     ack.
>
>     Reviewed-by: Huang Rui <ray.huang@amd.com>
>     Reviewed-by: Christian König <christian.koenig@amd.com>
>     Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
>
> :040000 040000 a08a12a316d09ea231337136cc7aa076d4da44a8
> 521c8a18b0c63e185e416a267619a92822d7f268 M      drivers
>
> Causes my raven1 to lockup on starting X with a gfx ring stall.

I've reverted the patches.

Alex

>
> Tom
> _______________________________________________
> amd-gfx mailing list
> amd-gfx@lists.freedesktop.org
> https://lists.freedesktop.org/mailman/listinfo/amd-gfx
_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: regression on raven (gmc9)
       [not found]     ` <CADnq5_PDpDRCGisjyOTybqjSM3CcUjjnR-xZHJ8KnG70E3kohw-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
@ 2018-04-02 15:22       ` Christian König
       [not found]         ` <7d7bf762-7b6e-47a5-d5d7-860e93a0750d-5C7GfCeVMHo@public.gmane.org>
  0 siblings, 1 reply; 7+ messages in thread
From: Christian König @ 2018-04-02 15:22 UTC (permalink / raw)
  To: Alex Deucher, Tom St Denis
  Cc: Deucher, Alexander, Huang, Ray, amd-gfx-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW

Am 02.04.2018 um 17:00 schrieb Alex Deucher:
> On Mon, Apr 2, 2018 at 7:17 AM, Tom St Denis <tstdenis@amd.com> wrote:
>> This commit
>>
>> [root@raven linux]# git bisect bad
>> c704d7a15d292e6e222b7e990ed7e9f41617cdd8 is the first bad commit
>> commit c704d7a15d292e6e222b7e990ed7e9f41617cdd8
>> Author: Alex Deucher <alexander.deucher@amd.com>
>> Date:   Tue Mar 27 17:10:56 2018 -0500
>>
>>      drm/amdgpu/gmc9: use amdgpu_ring_emit_reg_write_reg_wait in gpu tlb
>> flush
>>
>>      Use amdgpu_ring_emit_reg_write_reg_wait.  On engines that support it,
>>      it provides a write and wait in a single packet which avoids a missed
>>      ack if a world switch happens between the request and waiting for the
>>      ack.
>>
>>      Reviewed-by: Huang Rui <ray.huang@amd.com>
>>      Reviewed-by: Christian König <christian.koenig@amd.com>
>>      Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
>>
>> :040000 040000 a08a12a316d09ea231337136cc7aa076d4da44a8
>> 521c8a18b0c63e185e416a267619a92822d7f268 M      drivers
>>
>> Causes my raven1 to lockup on starting X with a gfx ring stall.
> I've reverted the patches.

Yeah, you need a specialized firmware version for this which Raven 
probably doesn't have.

Anyway the root cause of the issue which should be fixed by this patch 
set was found in the meantime, so the whole patch set shouldn't be 
necessary any more.

Christian.

>
> Alex
>
>> Tom
>> _______________________________________________
>> amd-gfx mailing list
>> amd-gfx@lists.freedesktop.org
>> https://lists.freedesktop.org/mailman/listinfo/amd-gfx

_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply	[flat|nested] 7+ messages in thread

* RE: regression on raven (gmc9)
       [not found]         ` <7d7bf762-7b6e-47a5-d5d7-860e93a0750d-5C7GfCeVMHo@public.gmane.org>
@ 2018-04-03  1:54           ` Deng, Emily
       [not found]             ` <CY4PR12MB11257EABAFAD6E913FD3A21C8FA50-rpdhrqHFk07v2MZdTKcfDgdYzm3356FpvxpqHgZTriW3zl9H0oFU5g@public.gmane.org>
  0 siblings, 1 reply; 7+ messages in thread
From: Deng, Emily @ 2018-04-03  1:54 UTC (permalink / raw)
  To: Koenig, Christian, Alex Deucher, StDenis, Tom
  Cc: Deucher, Alexander, Huang, Ray, amd-gfx-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW

Hi Alex and Christian,
    Those patches are important for vega10 SRIOV, without this, vega10 run many games will hang on SRIOV, 
how about to add  "if (amdgpu_sriov_vf(adev) && adev->asic_type == CHIP_VEGA10)" to separate the 
bare metal and SRIOV?

Best Wishes,
Emily Deng




> -----Original Message-----
> From: amd-gfx [mailto:amd-gfx-bounces@lists.freedesktop.org] On Behalf
> Of Christian K?nig
> Sent: Monday, April 02, 2018 11:23 PM
> To: Alex Deucher <alexdeucher@gmail.com>; StDenis, Tom
> <Tom.StDenis@amd.com>
> Cc: Deucher, Alexander <Alexander.Deucher@amd.com>; Huang, Ray
> <Ray.Huang@amd.com>; amd-gfx@lists.freedesktop.org
> Subject: Re: regression on raven (gmc9)
> 
> Am 02.04.2018 um 17:00 schrieb Alex Deucher:
> > On Mon, Apr 2, 2018 at 7:17 AM, Tom St Denis <tstdenis@amd.com>
> wrote:
> >> This commit
> >>
> >> [root@raven linux]# git bisect bad
> >> c704d7a15d292e6e222b7e990ed7e9f41617cdd8 is the first bad commit
> >> commit c704d7a15d292e6e222b7e990ed7e9f41617cdd8
> >> Author: Alex Deucher <alexander.deucher@amd.com>
> >> Date:   Tue Mar 27 17:10:56 2018 -0500
> >>
> >>      drm/amdgpu/gmc9: use amdgpu_ring_emit_reg_write_reg_wait in gpu
> >> tlb flush
> >>
> >>      Use amdgpu_ring_emit_reg_write_reg_wait.  On engines that support
> it,
> >>      it provides a write and wait in a single packet which avoids a missed
> >>      ack if a world switch happens between the request and waiting for the
> >>      ack.
> >>
> >>      Reviewed-by: Huang Rui <ray.huang@amd.com>
> >>      Reviewed-by: Christian König <christian.koenig@amd.com>
> >>      Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
> >>
> >> :040000 040000 a08a12a316d09ea231337136cc7aa076d4da44a8
> >> 521c8a18b0c63e185e416a267619a92822d7f268 M      drivers
> >>
> >> Causes my raven1 to lockup on starting X with a gfx ring stall.
> > I've reverted the patches.
> 
> Yeah, you need a specialized firmware version for this which Raven probably
> doesn't have.
> 
> Anyway the root cause of the issue which should be fixed by this patch set
> was found in the meantime, so the whole patch set shouldn't be necessary
> any more.
> 
> Christian.
> 
> >
> > Alex
> >
> >> Tom
> >> _______________________________________________
> >> amd-gfx mailing list
> >> amd-gfx@lists.freedesktop.org
> >> https://lists.freedesktop.org/mailman/listinfo/amd-gfx
> 
> _______________________________________________
> amd-gfx mailing list
> amd-gfx@lists.freedesktop.org
> https://lists.freedesktop.org/mailman/listinfo/amd-gfx
_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: regression on raven (gmc9)
       [not found]             ` <CY4PR12MB11257EABAFAD6E913FD3A21C8FA50-rpdhrqHFk07v2MZdTKcfDgdYzm3356FpvxpqHgZTriW3zl9H0oFU5g@public.gmane.org>
@ 2018-04-03  9:22               ` Christian König
       [not found]                 ` <255f64f8-c826-4e65-8ec5-65f95a3fb149-5C7GfCeVMHo@public.gmane.org>
  0 siblings, 1 reply; 7+ messages in thread
From: Christian König @ 2018-04-03  9:22 UTC (permalink / raw)
  To: Deng, Emily, Alex Deucher, StDenis, Tom
  Cc: Deucher, Alexander, Huang, Ray, amd-gfx-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW

Hi Emily,

the root cause of the hang was found and as far as I know the plan is to 
fix this with new RLC firmware.

Please see the internal thread with the hardware guys.

Thanks,
Christian.

Am 03.04.2018 um 03:54 schrieb Deng, Emily:
> Hi Alex and Christian,
>      Those patches are important for vega10 SRIOV, without this, vega10 run many games will hang on SRIOV,
> how about to add  "if (amdgpu_sriov_vf(adev) && adev->asic_type == CHIP_VEGA10)" to separate the
> bare metal and SRIOV?
>
> Best Wishes,
> Emily Deng
>
>
>
>
>> -----Original Message-----
>> From: amd-gfx [mailto:amd-gfx-bounces@lists.freedesktop.org] On Behalf
>> Of Christian K?nig
>> Sent: Monday, April 02, 2018 11:23 PM
>> To: Alex Deucher <alexdeucher@gmail.com>; StDenis, Tom
>> <Tom.StDenis@amd.com>
>> Cc: Deucher, Alexander <Alexander.Deucher@amd.com>; Huang, Ray
>> <Ray.Huang@amd.com>; amd-gfx@lists.freedesktop.org
>> Subject: Re: regression on raven (gmc9)
>>
>> Am 02.04.2018 um 17:00 schrieb Alex Deucher:
>>> On Mon, Apr 2, 2018 at 7:17 AM, Tom St Denis <tstdenis@amd.com>
>> wrote:
>>>> This commit
>>>>
>>>> [root@raven linux]# git bisect bad
>>>> c704d7a15d292e6e222b7e990ed7e9f41617cdd8 is the first bad commit
>>>> commit c704d7a15d292e6e222b7e990ed7e9f41617cdd8
>>>> Author: Alex Deucher <alexander.deucher@amd.com>
>>>> Date:   Tue Mar 27 17:10:56 2018 -0500
>>>>
>>>>       drm/amdgpu/gmc9: use amdgpu_ring_emit_reg_write_reg_wait in gpu
>>>> tlb flush
>>>>
>>>>       Use amdgpu_ring_emit_reg_write_reg_wait.  On engines that support
>> it,
>>>>       it provides a write and wait in a single packet which avoids a missed
>>>>       ack if a world switch happens between the request and waiting for the
>>>>       ack.
>>>>
>>>>       Reviewed-by: Huang Rui <ray.huang@amd.com>
>>>>       Reviewed-by: Christian König <christian.koenig@amd.com>
>>>>       Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
>>>>
>>>> :040000 040000 a08a12a316d09ea231337136cc7aa076d4da44a8
>>>> 521c8a18b0c63e185e416a267619a92822d7f268 M      drivers
>>>>
>>>> Causes my raven1 to lockup on starting X with a gfx ring stall.
>>> I've reverted the patches.
>> Yeah, you need a specialized firmware version for this which Raven probably
>> doesn't have.
>>
>> Anyway the root cause of the issue which should be fixed by this patch set
>> was found in the meantime, so the whole patch set shouldn't be necessary
>> any more.
>>
>> Christian.
>>
>>> Alex
>>>
>>>> Tom
>>>> _______________________________________________
>>>> amd-gfx mailing list
>>>> amd-gfx@lists.freedesktop.org
>>>> https://lists.freedesktop.org/mailman/listinfo/amd-gfx
>> _______________________________________________
>> amd-gfx mailing list
>> amd-gfx@lists.freedesktop.org
>> https://lists.freedesktop.org/mailman/listinfo/amd-gfx

_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply	[flat|nested] 7+ messages in thread

* RE: regression on raven (gmc9)
       [not found]                 ` <255f64f8-c826-4e65-8ec5-65f95a3fb149-5C7GfCeVMHo@public.gmane.org>
@ 2018-04-03  9:55                   ` Deng, Emily
  0 siblings, 0 replies; 7+ messages in thread
From: Deng, Emily @ 2018-04-03  9:55 UTC (permalink / raw)
  To: Koenig, Christian, Alex Deucher, StDenis, Tom
  Cc: Deucher, Alexander, Huang, Ray, amd-gfx-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW

Hi Christian,
     I know this, but seems it will take long time to do this, and this issue has blocked vega10 sriov for long time. 
But we want the staging to be stable on vega10 sriov  as soon as possible.

Best Wishes,
Emily Deng

> -----Original Message-----
> From: Koenig, Christian
> Sent: Tuesday, April 03, 2018 5:22 PM
> To: Deng, Emily <Emily.Deng@amd.com>; Alex Deucher
> <alexdeucher@gmail.com>; StDenis, Tom <Tom.StDenis@amd.com>
> Cc: Deucher, Alexander <Alexander.Deucher@amd.com>; Huang, Ray
> <Ray.Huang@amd.com>; amd-gfx@lists.freedesktop.org
> Subject: Re: regression on raven (gmc9)
> 
> Hi Emily,
> 
> the root cause of the hang was found and as far as I know the plan is to fix
> this with new RLC firmware.
> 
> Please see the internal thread with the hardware guys.
> 
> Thanks,
> Christian.
> 
> Am 03.04.2018 um 03:54 schrieb Deng, Emily:
> > Hi Alex and Christian,
> >      Those patches are important for vega10 SRIOV, without this,
> > vega10 run many games will hang on SRIOV, how about to add  "if
> > (amdgpu_sriov_vf(adev) && adev->asic_type == CHIP_VEGA10)" to
> separate the bare metal and SRIOV?
> >
> > Best Wishes,
> > Emily Deng
> >
> >
> >
> >
> >> -----Original Message-----
> >> From: amd-gfx [mailto:amd-gfx-bounces@lists.freedesktop.org] On
> >> Behalf Of Christian K?nig
> >> Sent: Monday, April 02, 2018 11:23 PM
> >> To: Alex Deucher <alexdeucher@gmail.com>; StDenis, Tom
> >> <Tom.StDenis@amd.com>
> >> Cc: Deucher, Alexander <Alexander.Deucher@amd.com>; Huang, Ray
> >> <Ray.Huang@amd.com>; amd-gfx@lists.freedesktop.org
> >> Subject: Re: regression on raven (gmc9)
> >>
> >> Am 02.04.2018 um 17:00 schrieb Alex Deucher:
> >>> On Mon, Apr 2, 2018 at 7:17 AM, Tom St Denis <tstdenis@amd.com>
> >> wrote:
> >>>> This commit
> >>>>
> >>>> [root@raven linux]# git bisect bad
> >>>> c704d7a15d292e6e222b7e990ed7e9f41617cdd8 is the first bad commit
> >>>> commit c704d7a15d292e6e222b7e990ed7e9f41617cdd8
> >>>> Author: Alex Deucher <alexander.deucher@amd.com>
> >>>> Date:   Tue Mar 27 17:10:56 2018 -0500
> >>>>
> >>>>       drm/amdgpu/gmc9: use amdgpu_ring_emit_reg_write_reg_wait in
> >>>> gpu tlb flush
> >>>>
> >>>>       Use amdgpu_ring_emit_reg_write_reg_wait.  On engines that
> >>>> support
> >> it,
> >>>>       it provides a write and wait in a single packet which avoids a missed
> >>>>       ack if a world switch happens between the request and waiting for
> the
> >>>>       ack.
> >>>>
> >>>>       Reviewed-by: Huang Rui <ray.huang@amd.com>
> >>>>       Reviewed-by: Christian König <christian.koenig@amd.com>
> >>>>       Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
> >>>>
> >>>> :040000 040000 a08a12a316d09ea231337136cc7aa076d4da44a8
> >>>> 521c8a18b0c63e185e416a267619a92822d7f268 M      drivers
> >>>>
> >>>> Causes my raven1 to lockup on starting X with a gfx ring stall.
> >>> I've reverted the patches.
> >> Yeah, you need a specialized firmware version for this which Raven
> >> probably doesn't have.
> >>
> >> Anyway the root cause of the issue which should be fixed by this
> >> patch set was found in the meantime, so the whole patch set shouldn't
> >> be necessary any more.
> >>
> >> Christian.
> >>
> >>> Alex
> >>>
> >>>> Tom
> >>>> _______________________________________________
> >>>> amd-gfx mailing list
> >>>> amd-gfx@lists.freedesktop.org
> >>>> https://lists.freedesktop.org/mailman/listinfo/amd-gfx
> >> _______________________________________________
> >> amd-gfx mailing list
> >> amd-gfx@lists.freedesktop.org
> >> https://lists.freedesktop.org/mailman/listinfo/amd-gfx

_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: regression on raven (gmc9)
@ 2018-04-02 15:16 Dawson Dias
  0 siblings, 0 replies; 7+ messages in thread
From: Dawson Dias @ 2018-04-02 15:16 UTC (permalink / raw)
  To: amd-gfx-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW


[-- Attachment #1.1: Type: text/plain, Size: 56 bytes --]

Thanks. This was locking up my Raven too.

-Dawson Dias

[-- Attachment #1.2: Type: text/html, Size: 290 bytes --]

[-- Attachment #2: Type: text/plain, Size: 154 bytes --]

_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply	[flat|nested] 7+ messages in thread

end of thread, other threads:[~2018-04-03  9:55 UTC | newest]

Thread overview: 7+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2018-04-02 11:17 regression on raven (gmc9) Tom St Denis
     [not found] ` <2bd09049-695a-ef03-3a45-53610d366b38-5C7GfCeVMHo@public.gmane.org>
2018-04-02 15:00   ` Alex Deucher
     [not found]     ` <CADnq5_PDpDRCGisjyOTybqjSM3CcUjjnR-xZHJ8KnG70E3kohw-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2018-04-02 15:22       ` Christian König
     [not found]         ` <7d7bf762-7b6e-47a5-d5d7-860e93a0750d-5C7GfCeVMHo@public.gmane.org>
2018-04-03  1:54           ` Deng, Emily
     [not found]             ` <CY4PR12MB11257EABAFAD6E913FD3A21C8FA50-rpdhrqHFk07v2MZdTKcfDgdYzm3356FpvxpqHgZTriW3zl9H0oFU5g@public.gmane.org>
2018-04-03  9:22               ` Christian König
     [not found]                 ` <255f64f8-c826-4e65-8ec5-65f95a3fb149-5C7GfCeVMHo@public.gmane.org>
2018-04-03  9:55                   ` Deng, Emily
2018-04-02 15:16 Dawson Dias

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.