amd-gfx.lists.freedesktop.org archive mirror
 help / color / mirror / Atom feed
* RE: [PATCH] SWDEV-227226 [AWS][Linux]ReallyQuick test failed, guest dmesg and host dmesg have error
       [not found] <20200325083141.16919-1-Jiawei.Gu@amd.com>
@ 2020-03-26  5:57 ` Gu, JiaWei (Will)
  2020-03-26  6:10   ` Deng, Emily
  0 siblings, 1 reply; 2+ messages in thread
From: Gu, JiaWei (Will) @ 2020-03-26  5:57 UTC (permalink / raw)
  To: amd-gfx

[AMD Official Use Only - Internal Distribution Only]

Ping......

-----Original Message-----
From: Jiawei <Jiawei.Gu@amd.com> 
Sent: Wednesday, March 25, 2020 4:32 PM
To: amd-gfx@lists.freedesktop.org
Cc: Gu, JiaWei (Will) <JiaWei.Gu@amd.com>
Subject: [PATCH] SWDEV-227226 [AWS][Linux]ReallyQuick test failed, guest dmesg and host dmesg have error

root cause: compute job timeout for sriov/passthrough is 10000 ms, which is too short for some compute benchmark

solution: extend the default compute lockup timeout to 60000 ms

Signed-off-by: Jiawei <Jiawei.Gu@amd.com>
---
 drivers/gpu/drm/amd/amdgpu/amdgpu_device.c | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c
index 3607a63f48bb..88360b220a8f 100755
--- a/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c
+++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c
@@ -2680,12 +2680,12 @@ static int amdgpu_device_get_job_timeout_settings(struct amdgpu_device *adev)
 	 * By default timeout for non compute jobs is 10000.
 	 * And there is no timeout enforced on compute jobs.
 	 * In SR-IOV or passthrough mode, timeout for compute
-	 * jobs are 10000 by default.
+	 * jobs are 60000 by default.
 	 */
 	adev->gfx_timeout = msecs_to_jiffies(10000);
 	adev->sdma_timeout = adev->video_timeout = adev->gfx_timeout;
 	if (amdgpu_sriov_vf(adev) || amdgpu_passthrough(adev))
-		adev->compute_timeout = adev->gfx_timeout;
+		adev->compute_timeout =  msecs_to_jiffies(60000);
 	else
 		adev->compute_timeout = MAX_SCHEDULE_TIMEOUT;
 
--
2.20.1
_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply related	[flat|nested] 2+ messages in thread

* RE: [PATCH] SWDEV-227226 [AWS][Linux]ReallyQuick test failed, guest dmesg and host dmesg have error
  2020-03-26  5:57 ` [PATCH] SWDEV-227226 [AWS][Linux]ReallyQuick test failed, guest dmesg and host dmesg have error Gu, JiaWei (Will)
@ 2020-03-26  6:10   ` Deng, Emily
  0 siblings, 0 replies; 2+ messages in thread
From: Deng, Emily @ 2020-03-26  6:10 UTC (permalink / raw)
  To: Gu, JiaWei (Will), amd-gfx

[AMD Official Use Only - Internal Distribution Only]

Reviewed-by: Emily Deng <Emily.Deng@amd.com>

>-----Original Message-----
>From: amd-gfx <amd-gfx-bounces@lists.freedesktop.org> On Behalf Of Gu,
>JiaWei (Will)
>Sent: Thursday, March 26, 2020 1:58 PM
>To: amd-gfx@lists.freedesktop.org
>Subject: RE: [PATCH] SWDEV-227226 [AWS][Linux]ReallyQuick test failed, guest
>dmesg and host dmesg have error
>
>[AMD Official Use Only - Internal Distribution Only]
>
>Ping......
>
>-----Original Message-----
>From: Jiawei <Jiawei.Gu@amd.com>
>Sent: Wednesday, March 25, 2020 4:32 PM
>To: amd-gfx@lists.freedesktop.org
>Cc: Gu, JiaWei (Will) <JiaWei.Gu@amd.com>
>Subject: [PATCH] SWDEV-227226 [AWS][Linux]ReallyQuick test failed, guest
>dmesg and host dmesg have error
>
>root cause: compute job timeout for sriov/passthrough is 10000 ms, which is too
>short for some compute benchmark
>
>solution: extend the default compute lockup timeout to 60000 ms
>
>Signed-off-by: Jiawei <Jiawei.Gu@amd.com>
>---
> drivers/gpu/drm/amd/amdgpu/amdgpu_device.c | 4 ++--
> 1 file changed, 2 insertions(+), 2 deletions(-)
>
>diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c
>b/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c
>index 3607a63f48bb..88360b220a8f 100755
>--- a/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c
>+++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c
>@@ -2680,12 +2680,12 @@ static int
>amdgpu_device_get_job_timeout_settings(struct amdgpu_device *adev)
> 	 * By default timeout for non compute jobs is 10000.
> 	 * And there is no timeout enforced on compute jobs.
> 	 * In SR-IOV or passthrough mode, timeout for compute
>-	 * jobs are 10000 by default.
>+	 * jobs are 60000 by default.
> 	 */
> 	adev->gfx_timeout = msecs_to_jiffies(10000);
> 	adev->sdma_timeout = adev->video_timeout = adev->gfx_timeout;
> 	if (amdgpu_sriov_vf(adev) || amdgpu_passthrough(adev))
>-		adev->compute_timeout = adev->gfx_timeout;
>+		adev->compute_timeout =  msecs_to_jiffies(60000);
> 	else
> 		adev->compute_timeout = MAX_SCHEDULE_TIMEOUT;
>
>--
>2.20.1
>_______________________________________________
>amd-gfx mailing list
>amd-gfx@lists.freedesktop.org
>https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Flists.free
>desktop.org%2Fmailman%2Flistinfo%2Famd-
>gfx&amp;data=02%7C01%7CEmily.Deng%40amd.com%7Cef72ce0afbaf4ed9b1a
>708d7d14a9e8b%7C3dd8961fe4884e608e11a82d994e183d%7C0%7C0%7C637
>207990753889298&amp;sdata=febWyY9VUPq2hucVLit73FilxzacEmoIq1TBLFha
>%2FNM%3D&amp;reserved=0
_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply	[flat|nested] 2+ messages in thread

end of thread, other threads:[~2020-03-26  6:11 UTC | newest]

Thread overview: 2+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
     [not found] <20200325083141.16919-1-Jiawei.Gu@amd.com>
2020-03-26  5:57 ` [PATCH] SWDEV-227226 [AWS][Linux]ReallyQuick test failed, guest dmesg and host dmesg have error Gu, JiaWei (Will)
2020-03-26  6:10   ` Deng, Emily

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).