* Bug: amdgpu drm driver cause process into Disk sleep state
@ 2019-09-03  7:35 78666679
       [not found] ` <tencent_4DEABBEB3BB4C6A6D84CA9F0DB225FBF5809-9uewiaClKEY@public.gmane.org>
  0 siblings, 1 reply; 11+ messages in thread
From: 78666679 @ 2019-09-03  7:35 UTC (permalink / raw)
  To: amd-gfx
  Cc: alexander.deucher,
	Christian König

Hi, Sirs:
       I have a WX5100 amdgpu card. It randomly runs into failures; sometimes it leaves processes stuck in the uninterruptible wait (D) state.


cps-new-ondemand-0587:~ # ps aux|grep -w D
root      11268  0.0  0.0 260628  3516 ?        Ssl  Aug26   0:00 /usr/sbin/gssproxy -D
root     136482  0.0  0.0 212500   572 pts/0    S+   15:25   0:00 grep --color=auto -w D
root     370684  0.0  0.0  17972  7428 ?        Ss   Sep02   0:04 /usr/sbin/sshd -D
10066    432951  0.0  0.0      0     0 ?        D    Sep02   0:00 [FakeFinalizerDa]
root     496774  0.0  0.0      0     0 ?        D    Sep02   0:17 [kworker/8:1+eve]
cps-new-ondemand-0587:~ # cat /proc/496774/stack 
[<0>] __switch_to+0x94/0xe8
[<0>] drm_sched_entity_flush+0xf8/0x248 [gpu_sched]
[<0>] amdgpu_ctx_mgr_entity_flush+0xac/0x148 [amdgpu]
[<0>] amdgpu_flush+0x2c/0x50 [amdgpu]
[<0>] filp_close+0x40/0xa0
[<0>] put_files_struct+0x118/0x120
[<0>] put_files_struct+0x30/0x68 [binder_linux]
[<0>] binder_deferred_func+0x4d4/0x658 [binder_linux]
[<0>] process_one_work+0x1b4/0x3f8
[<0>] worker_thread+0x54/0x470
[<0>] kthread+0x134/0x138
[<0>] ret_from_fork+0x10/0x18
[<0>] 0xffffffffffffffff
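
For context, the kworker above is stuck in drm_sched_entity_flush(), which is called on close and waits for the GPU context's queued jobs to drain. A simplified sketch of that wait, paraphrased from the gpu_sched code as I understand it (not an exact copy of my kernel's source):

	/* Simplified paraphrase of drm_sched_entity_flush() (drm_sched_entity.c).
	 * Closing the amdgpu file descriptor ends up here; the task sleeps until
	 * drm_sched_entity_is_idle() becomes true, which is what shows up as the
	 * "D" (disk sleep) state above. */
	long drm_sched_entity_flush(struct drm_sched_entity *entity, long timeout)
	{
		struct drm_gpu_scheduler *sched = entity->rq->sched;
		long ret = timeout;

		if (current->flags & PF_EXITING) {
			/* the process is exiting: only wait up to the given timeout */
			if (timeout)
				ret = wait_event_timeout(sched->job_scheduled,
							 drm_sched_entity_is_idle(entity),
							 timeout);
		} else {
			/* normal close: sleep (killable) until the entity goes idle */
			wait_event_killable(sched->job_scheduled,
					    drm_sched_entity_is_idle(entity));
		}
		/* ... */
		return ret;
	}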



This issue has troubled me for a long time. I am eagerly looking forward to your help!


-----
Yanhua
_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: Bug: amdgpu drm driver cause process into Disk sleep state
       [not found] ` <tencent_4DEABBEB3BB4C6A6D84CA9F0DB225FBF5809-9uewiaClKEY@public.gmane.org>
@ 2019-09-03  8:21   ` Koenig, Christian
       [not found]     ` <f761fec0-c0cc-426c-6bcb-c3fd23808888-5C7GfCeVMHo@public.gmane.org>
  0 siblings, 1 reply; 11+ messages in thread
From: Koenig, Christian @ 2019-09-03  8:21 UTC (permalink / raw)
  To: 78666679, amd-gfx; +Cc: Deucher, Alexander

Hi Yanhua,

please update your kernel first, because that looks like a known issue
which was recently fixed by the patch "drm/scheduler: use job count instead
of peek".

Probably best to try the latest bleeding edge kernel and if that doesn't 
help please open up a bug report on https://bugs.freedesktop.org/.

Regards,
Christian.

Am 03.09.19 um 09:35 schrieb 78666679:
> Hi, Sirs:
>         I have a wx5100 amdgpu card, It randomly come into failure.  sometimes, it will cause processes into uninterruptible wait state.
>
>
> cps-new-ondemand-0587:~ # ps aux|grep -w D
> root      11268  0.0  0.0 260628  3516 ?        Ssl  Aug26   0:00 /usr/sbin/gssproxy -D
> root     136482  0.0  0.0 212500   572 pts/0    S+   15:25   0:00 grep --color=auto -w D
> root     370684  0.0  0.0  17972  7428 ?        Ss   Sep02   0:04 /usr/sbin/sshd -D
> 10066    432951  0.0  0.0      0     0 ?        D    Sep02   0:00 [FakeFinalizerDa]
> root     496774  0.0  0.0      0     0 ?        D    Sep02   0:17 [kworker/8:1+eve]
> cps-new-ondemand-0587:~ # cat /proc/496774/stack
> [<0>] __switch_to+0x94/0xe8
> [<0>] drm_sched_entity_flush+0xf8/0x248 [gpu_sched]
> [<0>] amdgpu_ctx_mgr_entity_flush+0xac/0x148 [amdgpu]
> [<0>] amdgpu_flush+0x2c/0x50 [amdgpu]
> [<0>] filp_close+0x40/0xa0
> [<0>] put_files_struct+0x118/0x120
> [<0>] put_files_struct+0x30/0x68 [binder_linux]
> [<0>] binder_deferred_func+0x4d4/0x658 [binder_linux]
> [<0>] process_one_work+0x1b4/0x3f8
> [<0>] worker_thread+0x54/0x470
> [<0>] kthread+0x134/0x138
> [<0>] ret_from_fork+0x10/0x18
> [<0>] 0xffffffffffffffff
>
>
>
> This issue troubled me a long time.  looking eagerly to get help from you!
>
>
> -----
> Yanhua

_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: Bug: amdgpu drm driver cause process into Disk sleep state
       [not found]     ` <f761fec0-c0cc-426c-6bcb-c3fd23808888-5C7GfCeVMHo@public.gmane.org>
@ 2019-09-03  8:27       ` 78666679
  2019-09-03 12:50       ` Re: Bug: amdgpu drm driver cause process into Disk sleep state 78666679
  1 sibling, 0 replies; 11+ messages in thread
From: 78666679 @ 2019-09-03  8:27 UTC (permalink / raw)
  To: Koenig, Christian, amd-gfx
  Cc: Deucher, Alexander


[-- Warning: decoded text below may be mangled, UTF-8 assumed --]
[-- Attachment #1.1: Type: text/plain; charset="gb18030", Size: 2272 bytes --]

Hi, Christian:
     Thanks very much for your quick reply. I will try that patch first. My kernel version is 4.19.36 (for some reasons, this version cannot be upgraded).





----
yanhua

------------------ Original Message ------------------
From: "Koenig, Christian"<Christian.Koenig@amd.com>;
Sent: Tuesday, September 3, 2019 4:21 PM
To: ""<78666679@qq.com>;"amd-gfx"<amd-gfx@lists.freedesktop.org>;
Cc: "Deucher, Alexander"<Alexander.Deucher@amd.com>;
Subject: Re: Bug: amdgpu drm driver cause process into Disk sleep state



Hi Yanhua,

please update your kernel first, cause that looks like a known issue 
which was recently fixed by patch "drm/scheduler: use job count instead 
of peek".

Probably best to try the latest bleeding edge kernel and if that doesn't 
help please open up a bug report on https://bugs.freedesktop.org/.

Regards,
Christian.

Am 03.09.19 um 09:35 schrieb 78666679:
> Hi, Sirs:
>         I have a wx5100 amdgpu card, It randomly come into failure.  sometimes, it will cause processes into uninterruptible wait state.
>
>
> cps-new-ondemand-0587:~ # ps aux|grep -w D
> root      11268  0.0  0.0 260628  3516 ?        Ssl  Aug26   0:00 /usr/sbin/gssproxy -D
> root     136482  0.0  0.0 212500   572 pts/0    S+   15:25   0:00 grep --color=auto -w D
> root     370684  0.0  0.0  17972  7428 ?        Ss   Sep02   0:04 /usr/sbin/sshd -D
> 10066    432951  0.0  0.0      0     0 ?        D    Sep02   0:00 [FakeFinalizerDa]
> root     496774  0.0  0.0      0     0 ?        D    Sep02   0:17 [kworker/8:1+eve]
> cps-new-ondemand-0587:~ # cat /proc/496774/stack
> [<0>] __switch_to+0x94/0xe8
> [<0>] drm_sched_entity_flush+0xf8/0x248 [gpu_sched]
> [<0>] amdgpu_ctx_mgr_entity_flush+0xac/0x148 [amdgpu]
> [<0>] amdgpu_flush+0x2c/0x50 [amdgpu]
> [<0>] filp_close+0x40/0xa0
> [<0>] put_files_struct+0x118/0x120
> [<0>] put_files_struct+0x30/0x68 [binder_linux]
> [<0>] binder_deferred_func+0x4d4/0x658 [binder_linux]
> [<0>] process_one_work+0x1b4/0x3f8
> [<0>] worker_thread+0x54/0x470
> [<0>] kthread+0x134/0x138
> [<0>] ret_from_fork+0x10/0x18
> [<0>] 0xffffffffffffffff
>
>
>
> This issue troubled me a long time.  looking eagerly to get help from you!
>
>
> -----
> Yanhua

[-- Attachment #1.2: Type: text/html, Size: 3520 bytes --]

[-- Attachment #2: Type: text/plain, Size: 153 bytes --]

_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: Bug: amdgpu drm driver cause process into Disk sleep state
       [not found]     ` <f761fec0-c0cc-426c-6bcb-c3fd23808888-5C7GfCeVMHo@public.gmane.org>
  2019-09-03  8:27       ` Re: Bug: amdgpu drm driver cause process into Disk sleep state 78666679
@ 2019-09-03 12:50       ` 78666679
       [not found]         ` <tencent_7DC9F5195A4D538FA626F85991875FC5F508-9uewiaClKEY@public.gmane.org>
  1 sibling, 1 reply; 11+ messages in thread
From: 78666679 @ 2019-09-03 12:50 UTC (permalink / raw)
  To: Koenig, Christian, amd-gfx
  Cc: Deucher, Alexander


[-- Warning: decoded text below may be mangled, UTF-8 assumed --]
[-- Attachment #1.1: Type: text/plain; charset="gb18030", Size: 3121 bytes --]

Hi Christian,
       Sometimes a thread gets blocked in disk sleep in a call to amdgpu_sa_bo_new. The following is the stack trace. It seems the SA BO memory is used up, so the caller blocks waiting for someone to free SA resources.



D 206833 227656 [surfaceflinger] <defunct> Binder:45_5
cat /proc/206833/task/227656/stack


[<0>] __switch_to+0x94/0xe8
[<0>] dma_fence_wait_any_timeout+0x234/0x2d0
[<0>] amdgpu_sa_bo_new+0x468/0x540 [amdgpu]
[<0>] amdgpu_ib_get+0x60/0xc8 [amdgpu]
[<0>] amdgpu_job_alloc_with_ib+0x70/0xb0 [amdgpu]
[<0>] amdgpu_vm_bo_update_mapping+0x2e0/0x3d8 [amdgpu]
[<0>] amdgpu_vm_bo_update+0x2a0/0x710 [amdgpu]
[<0>] amdgpu_gem_va_ioctl+0x46c/0x4c8 [amdgpu]
[<0>] drm_ioctl_kernel+0x94/0x118 [drm]
[<0>] drm_ioctl+0x1f0/0x438 [drm]
[<0>] amdgpu_drm_ioctl+0x58/0x90 [amdgpu]
[<0>] do_vfs_ioctl+0xc4/0x8c0
[<0>] ksys_ioctl+0x8c/0xa0
[<0>] __arm64_sys_ioctl+0x28/0x38
[<0>] el0_svc_common+0xa0/0x180
[<0>] el0_svc_handler+0x38/0x78
[<0>] el0_svc+0x8/0xc
[<0>] 0xffffffffffffffff
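
As far as I can tell from amdgpu_sa.c, the blocking path looks roughly like the sketch below (heavily condensed; collect_busy_fences() is just my shorthand for the fence-collection loop, not a real function). When no hole is free in the suballocator, the caller sleeps on the fences of the in-flight IBs, and if the ring never signals them the wait never ends:

	/* Condensed paraphrase of the allocation loop in amdgpu_sa_bo_new();
	 * not the exact kernel source, only the shape of the blocking path. */
	do {
		/* reclaim sub-allocations whose fences have already signaled */
		amdgpu_sa_bo_try_free(sa_manager);

		if (amdgpu_sa_bo_try_alloc(sa_manager, sa_bo, size, align))
			return 0;			/* found a free hole */

		/* suballocator is full of in-flight IBs: gather their fences ... */
		count = collect_busy_fences(sa_manager, fences);	/* shorthand */

		/* ... and sleep until any of them signals.  With a hung SDMA ring
		 * the fences never signal, so the ioctl stays in D state forever. */
		t = dma_fence_wait_any_timeout(fences, count, false,
					       MAX_SCHEDULE_TIMEOUT, NULL);
	} while (t > 0);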




--------------------
YanHua



------------------ Original Message ------------------
From: "Koenig, Christian"<Christian.Koenig@amd.com>;
Sent: Tuesday, September 3, 2019 4:21 PM
To: ""<78666679@qq.com>;"amd-gfx"<amd-gfx@lists.freedesktop.org>;
Cc: "Deucher, Alexander"<Alexander.Deucher@amd.com>;
Subject: Re: Bug: amdgpu drm driver cause process into Disk sleep state



Hi Yanhua,

please update your kernel first, cause that looks like a known issue 
which was recently fixed by patch "drm/scheduler: use job count instead 
of peek".

Probably best to try the latest bleeding edge kernel and if that doesn't 
help please open up a bug report on https://bugs.freedesktop.org/.

Regards,
Christian.

Am 03.09.19 um 09:35 schrieb 78666679:
> Hi, Sirs:
>         I have a wx5100 amdgpu card, It randomly come into failure.  sometimes, it will cause processes into uninterruptible wait state.
>
>
> cps-new-ondemand-0587:~ # ps aux|grep -w D
> root      11268  0.0  0.0 260628  3516 ?        Ssl  Aug26   0:00 /usr/sbin/gssproxy -D
> root     136482  0.0  0.0 212500   572 pts/0    S+   15:25   0:00 grep --color=auto -w D
> root     370684  0.0  0.0  17972  7428 ?        Ss   Sep02   0:04 /usr/sbin/sshd -D
> 10066    432951  0.0  0.0      0     0 ?        D    Sep02   0:00 [FakeFinalizerDa]
> root     496774  0.0  0.0      0     0 ?        D    Sep02   0:17 [kworker/8:1+eve]
> cps-new-ondemand-0587:~ # cat /proc/496774/stack
> [<0>] __switch_to+0x94/0xe8
> [<0>] drm_sched_entity_flush+0xf8/0x248 [gpu_sched]
> [<0>] amdgpu_ctx_mgr_entity_flush+0xac/0x148 [amdgpu]
> [<0>] amdgpu_flush+0x2c/0x50 [amdgpu]
> [<0>] filp_close+0x40/0xa0
> [<0>] put_files_struct+0x118/0x120
> [<0>] put_files_struct+0x30/0x68 [binder_linux]
> [<0>] binder_deferred_func+0x4d4/0x658 [binder_linux]
> [<0>] process_one_work+0x1b4/0x3f8
> [<0>] worker_thread+0x54/0x470
> [<0>] kthread+0x134/0x138
> [<0>] ret_from_fork+0x10/0x18
> [<0>] 0xffffffffffffffff
>
>
>
> This issue troubled me a long time.  looking eagerly to get help from you!
>
>
> -----
> Yanhua

[-- Attachment #1.2: Type: text/html, Size: 4595 bytes --]

[-- Attachment #2: Type: text/plain, Size: 153 bytes --]

_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: Re: Bug: amdgpu drm driver cause process into Disk sleep state
       [not found]         ` <tencent_7DC9F5195A4D538FA626F85991875FC5F508-9uewiaClKEY@public.gmane.org>
@ 2019-09-03 13:07           ` Koenig, Christian
       [not found]             ` <2162676e-dbfa-a67d-248c-98e9eb2099c2-5C7GfCeVMHo@public.gmane.org>
  0 siblings, 1 reply; 11+ messages in thread
From: Koenig, Christian @ 2019-09-03 13:07 UTC (permalink / raw)
  To: 78666679, amd-gfx; +Cc: Deucher, Alexander


[-- Warning: decoded text below may be mangled, UTF-8 assumed --]
[-- Attachment #1.1: Type: text/plain; charset="GB18030", Size: 3601 bytes --]

Well that looks like the hardware got stuck.

Do you get something in the logs about a timeout on the SDMA ring?

Regards,
Christian.

Am 03.09.19 um 14:50 schrieb 78666679:
Hi Christian,
       Sometimes the thread blocked  disk sleeping in call to amdgpu_sa_bo_new. following is the stack trace.  it seems the sa bo is used up ,  so  the caller blocked waiting someone to free sa resources.

D 206833 227656 [surfaceflinger] <defunct> Binder:45_5
cat /proc/206833/task/227656/stack

[<0>] __switch_to+0x94/0xe8
[<0>] dma_fence_wait_any_timeout+0x234/0x2d0
[<0>] amdgpu_sa_bo_new+0x468/0x540 [amdgpu]
[<0>] amdgpu_ib_get+0x60/0xc8 [amdgpu]
[<0>] amdgpu_job_alloc_with_ib+0x70/0xb0 [amdgpu]
[<0>] amdgpu_vm_bo_update_mapping+0x2e0/0x3d8 [amdgpu]
[<0>] amdgpu_vm_bo_update+0x2a0/0x710 [amdgpu]
[<0>] amdgpu_gem_va_ioctl+0x46c/0x4c8 [amdgpu]
[<0>] drm_ioctl_kernel+0x94/0x118 [drm]
[<0>] drm_ioctl+0x1f0/0x438 [drm]
[<0>] amdgpu_drm_ioctl+0x58/0x90 [amdgpu]
[<0>] do_vfs_ioctl+0xc4/0x8c0
[<0>] ksys_ioctl+0x8c/0xa0
[<0>] __arm64_sys_ioctl+0x28/0x38
[<0>] el0_svc_common+0xa0/0x180
[<0>] el0_svc_handler+0x38/0x78
[<0>] el0_svc+0x8/0xc
[<0>] 0xffffffffffffffff


--------------------
YanHua

------------------ Original Message ------------------
From: "Koenig, Christian"<Christian.Koenig-5C7GfCeVMHo@public.gmane.org><mailto:Christian.Koenig-5C7GfCeVMHo@public.gmane.org>;
Sent: Tuesday, September 3, 2019 4:21 PM
To: ""<78666679-9uewiaClKEY@public.gmane.org><mailto:78666679-9uewiaClKEY@public.gmane.org>;"amd-gfx"<amd-gfx-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW@public.gmane.org><mailto:amd-gfx-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW@public.gmane.org>;
Cc: "Deucher, Alexander"<Alexander.Deucher-5C7GfCeVMHo@public.gmane.org><mailto:Alexander.Deucher-5C7GfCeVMHo@public.gmane.org>;
Subject: Re: Bug: amdgpu drm driver cause process into Disk sleep state

Hi Yanhua,

please update your kernel first, cause that looks like a known issue
which was recently fixed by patch "drm/scheduler: use job count instead
of peek".

Probably best to try the latest bleeding edge kernel and if that doesn't
help please open up a bug report on https://bugs.freedesktop.org/.

Regards,
Christian.

Am 03.09.19 um 09:35 schrieb 78666679:
> Hi, Sirs:
>         I have a wx5100 amdgpu card, It randomly come into failure.  sometimes, it will cause processes into uninterruptible wait state.
>
>
> cps-new-ondemand-0587:~ # ps aux|grep -w D
> root      11268  0.0  0.0 260628  3516 ?        Ssl  Aug26   0:00 /usr/sbin/gssproxy -D
> root     136482  0.0  0.0 212500   572 pts/0    S+   15:25   0:00 grep --color=auto -w D
> root     370684  0.0  0.0  17972  7428 ?        Ss   Sep02   0:04 /usr/sbin/sshd -D
> 10066    432951  0.0  0.0      0     0 ?        D    Sep02   0:00 [FakeFinalizerDa]
> root     496774  0.0  0.0      0     0 ?        D    Sep02   0:17 [kworker/8:1+eve]
> cps-new-ondemand-0587:~ # cat /proc/496774/stack
> [<0>] __switch_to+0x94/0xe8
> [<0>] drm_sched_entity_flush+0xf8/0x248 [gpu_sched]
> [<0>] amdgpu_ctx_mgr_entity_flush+0xac/0x148 [amdgpu]
> [<0>] amdgpu_flush+0x2c/0x50 [amdgpu]
> [<0>] filp_close+0x40/0xa0
> [<0>] put_files_struct+0x118/0x120
> [<0>] put_files_struct+0x30/0x68 [binder_linux]
> [<0>] binder_deferred_func+0x4d4/0x658 [binder_linux]
> [<0>] process_one_work+0x1b4/0x3f8
> [<0>] worker_thread+0x54/0x470
> [<0>] kthread+0x134/0x138
> [<0>] ret_from_fork+0x10/0x18
> [<0>] 0xffffffffffffffff
>
>
>
> This issue troubled me a long time.  looking eagerly to get help from you!
>
>
> -----
> Yanhua



[-- Attachment #1.2: Type: text/html, Size: 6043 bytes --]

[-- Attachment #2: Type: text/plain, Size: 153 bytes --]

_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: Re: Bug: amdgpu drm driver cause process into Disk sleep state
       [not found]             ` <2162676e-dbfa-a67d-248c-98e9eb2099c2-5C7GfCeVMHo@public.gmane.org>
@ 2019-09-03 13:16               ` 78666679
       [not found]                 ` <tencent_DFCD5A0853FDA639F81F91375F8DF55AF508-9uewiaClKEY@public.gmane.org>
  0 siblings, 1 reply; 11+ messages in thread
From: 78666679 @ 2019-09-03 13:16 UTC (permalink / raw)
  To: Koenig, Christian, amd-gfx
  Cc: Deucher, Alexander


[-- Warning: decoded text below may be mangled, UTF-8 assumed --]
[-- Attachment #1.1: Type: text/plain; charset="gb18030", Size: 3946 bytes --]

Yes, with dmesg|grep drm, I get the following.


[348571.880718] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma1 timeout, signaled seq=24423862, emitted seq=24423865
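
If I read amdgpu_job_timedout() correctly, "emitted seq" is the last fence sequence number the driver wrote into the sdma1 ring and "signaled seq" is the last one the engine actually completed, so the gap (24423862 vs 24423865) means the engine stopped making progress on three submitted jobs. A rough paraphrase of the handler that printed the message (from memory, may not match the source exactly):

	/* amdgpu_job.c -- rough paraphrase of the timeout handler above;
	 * field names as I understand them, not guaranteed to be exact. */
	static void amdgpu_job_timedout(struct drm_sched_job *s_job)
	{
		struct amdgpu_ring *ring = to_amdgpu_ring(s_job->sched);

		DRM_ERROR("ring %s timeout, signaled seq=%u, emitted seq=%u\n",
			  s_job->sched->name,
			  atomic_read(&ring->fence_drv.last_seq),  /* completed by HW */
			  ring->fence_drv.sync_seq);               /* submitted by SW */

		/* ...normally followed by GPU recovery... */
	}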




------------------ Original Message ------------------
From: "Koenig, Christian"<Christian.Koenig@amd.com>;
Sent: Tuesday, September 3, 2019 9:07 PM
To: ""<78666679@qq.com>;"amd-gfx"<amd-gfx@lists.freedesktop.org>;
Cc: "Deucher, Alexander"<Alexander.Deucher@amd.com>;
Subject: Re: Re: Bug: amdgpu drm driver cause process into Disk sleep state



 Well that looks like the hardware got stuck.
 
 Do you get something in the locks about a timeout on the SDMA ring?
 
 Regards,
 Christian.
 
 Am 03.09.19 um 14:50 schrieb 78666679:
 
  Hi Christian,
        Sometimes the thread blocked  disk sleeping in call to amdgpu_sa_bo_new. following is the stack trace.  it seems the sa bo is used up ,  so  the caller blocked waiting someone to free sa resources. 
 
 
 
 D 206833 227656 [surfaceflinger] <defunct> Binder:45_5
 cat /proc/206833/task/227656/stack
 
 
 [<0>] __switch_to+0x94/0xe8
 [<0>] dma_fence_wait_any_timeout+0x234/0x2d0
 [<0>] amdgpu_sa_bo_new+0x468/0x540 [amdgpu]
 [<0>] amdgpu_ib_get+0x60/0xc8 [amdgpu]
 [<0>] amdgpu_job_alloc_with_ib+0x70/0xb0 [amdgpu]
 [<0>] amdgpu_vm_bo_update_mapping+0x2e0/0x3d8 [amdgpu]
 [<0>] amdgpu_vm_bo_update+0x2a0/0x710 [amdgpu]
 [<0>] amdgpu_gem_va_ioctl+0x46c/0x4c8 [amdgpu]
 [<0>] drm_ioctl_kernel+0x94/0x118 [drm]
 [<0>] drm_ioctl+0x1f0/0x438 [drm]
 [<0>] amdgpu_drm_ioctl+0x58/0x90 [amdgpu]
 [<0>] do_vfs_ioctl+0xc4/0x8c0
 [<0>] ksys_ioctl+0x8c/0xa0
 [<0>] __arm64_sys_ioctl+0x28/0x38
 [<0>] el0_svc_common+0xa0/0x180
 [<0>] el0_svc_handler+0x38/0x78
 [<0>] el0_svc+0x8/0xc
 [<0>] 0xffffffffffffffff
 
 
 
 
  --------------------
 YanHua
 
 
 
  ------------------ Original Message ------------------
  From: "Koenig, Christian"<Christian.Koenig@amd.com>;
 Sent: Tuesday, September 3, 2019 4:21 PM
 To: ""<78666679@qq.com>;"amd-gfx"<amd-gfx@lists.freedesktop.org>;
 Cc: "Deucher, Alexander"<Alexander.Deucher@amd.com>;
 Subject: Re: Bug: amdgpu drm driver cause process into Disk sleep state
 
 
 
 Hi Yanhua,
 
 please update your kernel first, cause that looks like a known issue 
 which was recently fixed by patch "drm/scheduler: use job count instead 
 of peek".
 
 Probably best to try the latest bleeding edge kernel and if that doesn't 
 help please open up a bug report on  https://bugs.freedesktop.org/.
 
 Regards,
 Christian.
 
 Am 03.09.19 um 09:35 schrieb 78666679:
 > Hi, Sirs:
 >         I have a wx5100 amdgpu card, It randomly come into failure.  sometimes, it will cause processes into uninterruptible wait state.
 >
 >
 > cps-new-ondemand-0587:~ # ps aux|grep -w D
 > root      11268  0.0  0.0 260628  3516 ?        Ssl  Aug26   0:00 /usr/sbin/gssproxy -D
 > root     136482  0.0  0.0 212500   572 pts/0    S+   15:25   0:00 grep --color=auto -w D
 > root     370684  0.0  0.0  17972  7428 ?        Ss   Sep02   0:04 /usr/sbin/sshd -D
 > 10066    432951  0.0  0.0      0     0 ?        D    Sep02   0:00 [FakeFinalizerDa]
 > root     496774  0.0  0.0      0     0 ?        D    Sep02   0:17 [kworker/8:1+eve]
 > cps-new-ondemand-0587:~ # cat /proc/496774/stack
 > [<0>] __switch_to+0x94/0xe8
 > [<0>] drm_sched_entity_flush+0xf8/0x248 [gpu_sched]
 > [<0>] amdgpu_ctx_mgr_entity_flush+0xac/0x148 [amdgpu]
 > [<0>] amdgpu_flush+0x2c/0x50 [amdgpu]
 > [<0>] filp_close+0x40/0xa0
 > [<0>] put_files_struct+0x118/0x120
 > [<0>] put_files_struct+0x30/0x68 [binder_linux]
 > [<0>] binder_deferred_func+0x4d4/0x658 [binder_linux]
 > [<0>] process_one_work+0x1b4/0x3f8
 > [<0>] worker_thread+0x54/0x470
 > [<0>] kthread+0x134/0x138
 > [<0>] ret_from_fork+0x10/0x18
 > [<0>] 0xffffffffffffffff
 >
 >
 >
 > This issue troubled me a long time.  looking eagerly to get help from you!
 >
 >
 > -----
 > Yanhua

[-- Attachment #1.2: Type: text/html, Size: 6198 bytes --]

[-- Attachment #2: Type: text/plain, Size: 153 bytes --]

_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: Re: Re: Bug: amdgpu drm driver cause process into Disk sleep state
       [not found]                 ` <tencent_DFCD5A0853FDA639F81F91375F8DF55AF508-9uewiaClKEY@public.gmane.org>
@ 2019-09-03 13:19                   ` Koenig, Christian
       [not found]                     ` <88a08dcc-2e95-9379-693f-2d3fd928aa11-5C7GfCeVMHo@public.gmane.org>
  0 siblings, 1 reply; 11+ messages in thread
From: Koenig, Christian @ 2019-09-03 13:19 UTC (permalink / raw)
  To: 78666679, amd-gfx; +Cc: Deucher, Alexander


[-- Warning: decoded text below may be mangled, UTF-8 assumed --]
[-- Attachment #1.1: Type: text/plain; charset="GB18030", Size: 4648 bytes --]

This is just a GPU lockup; please open a bug report on freedesktop.org and attach the full dmesg and the version of Mesa you are using.

Regards,
Christian.

Am 03.09.19 um 15:16 schrieb 78666679:
Yes, with dmesg|grep drm ,  I get following.

348571.880718] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma1 timeout, signaled seq=24423862, emitted seq=24423865


------------------ Original Message ------------------
From: "Koenig, Christian"<Christian.Koenig-5C7GfCeVMHo@public.gmane.org><mailto:Christian.Koenig-5C7GfCeVMHo@public.gmane.org>;
Sent: Tuesday, September 3, 2019 9:07 PM
To: ""<78666679-9uewiaClKEY@public.gmane.org><mailto:78666679-9uewiaClKEY@public.gmane.org>;"amd-gfx"<amd-gfx-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW@public.gmane.org><mailto:amd-gfx-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW@public.gmane.org>;
Cc: "Deucher, Alexander"<Alexander.Deucher-5C7GfCeVMHo@public.gmane.org><mailto:Alexander.Deucher-5C7GfCeVMHo@public.gmane.org>;
Subject: Re: Re: Bug: amdgpu drm driver cause process into Disk sleep state

Well that looks like the hardware got stuck.

Do you get something in the locks about a timeout on the SDMA ring?

Regards,
Christian.

Am 03.09.19 um 14:50 schrieb 78666679:
Hi Christian,
       Sometimes the thread blocked  disk sleeping in call to amdgpu_sa_bo_new. following is the stack trace.  it seems the sa bo is used up ,  so  the caller blocked waiting someone to free sa resources.

D 206833 227656 [surfaceflinger] <defunct> Binder:45_5
cat /proc/206833/task/227656/stack

[<0>] __switch_to+0x94/0xe8
[<0>] dma_fence_wait_any_timeout+0x234/0x2d0
[<0>] amdgpu_sa_bo_new+0x468/0x540 [amdgpu]
[<0>] amdgpu_ib_get+0x60/0xc8 [amdgpu]
[<0>] amdgpu_job_alloc_with_ib+0x70/0xb0 [amdgpu]
[<0>] amdgpu_vm_bo_update_mapping+0x2e0/0x3d8 [amdgpu]
[<0>] amdgpu_vm_bo_update+0x2a0/0x710 [amdgpu]
[<0>] amdgpu_gem_va_ioctl+0x46c/0x4c8 [amdgpu]
[<0>] drm_ioctl_kernel+0x94/0x118 [drm]
[<0>] drm_ioctl+0x1f0/0x438 [drm]
[<0>] amdgpu_drm_ioctl+0x58/0x90 [amdgpu]
[<0>] do_vfs_ioctl+0xc4/0x8c0
[<0>] ksys_ioctl+0x8c/0xa0
[<0>] __arm64_sys_ioctl+0x28/0x38
[<0>] el0_svc_common+0xa0/0x180
[<0>] el0_svc_handler+0x38/0x78
[<0>] el0_svc+0x8/0xc
[<0>] 0xffffffffffffffff


--------------------
YanHua

------------------ Original Message ------------------
From: "Koenig, Christian"<Christian.Koenig-5C7GfCeVMHo@public.gmane.org><mailto:Christian.Koenig-5C7GfCeVMHo@public.gmane.org>;
Sent: Tuesday, September 3, 2019 4:21 PM
To: ""<78666679-9uewiaClKEY@public.gmane.org><mailto:78666679-9uewiaClKEY@public.gmane.org>;"amd-gfx"<amd-gfx-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW@public.gmane.org><mailto:amd-gfx-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW@public.gmane.org>;
Cc: "Deucher, Alexander"<Alexander.Deucher-5C7GfCeVMHo@public.gmane.org><mailto:Alexander.Deucher-5C7GfCeVMHo@public.gmane.org>;
Subject: Re: Bug: amdgpu drm driver cause process into Disk sleep state

Hi Yanhua,

please update your kernel first, cause that looks like a known issue
which was recently fixed by patch "drm/scheduler: use job count instead
of peek".

Probably best to try the latest bleeding edge kernel and if that doesn't
help please open up a bug report on https://bugs.freedesktop.org/.

Regards,
Christian.

Am 03.09.19 um 09:35 schrieb 78666679:
> Hi, Sirs:
>         I have a wx5100 amdgpu card, It randomly come into failure.  sometimes, it will cause processes into uninterruptible wait state.
>
>
> cps-new-ondemand-0587:~ # ps aux|grep -w D
> root      11268  0.0  0.0 260628  3516 ?        Ssl  Aug26   0:00 /usr/sbin/gssproxy -D
> root     136482  0.0  0.0 212500   572 pts/0    S+   15:25   0:00 grep --color=auto -w D
> root     370684  0.0  0.0  17972  7428 ?        Ss   Sep02   0:04 /usr/sbin/sshd -D
> 10066    432951  0.0  0.0      0     0 ?        D    Sep02   0:00 [FakeFinalizerDa]
> root     496774  0.0  0.0      0     0 ?        D    Sep02   0:17 [kworker/8:1+eve]
> cps-new-ondemand-0587:~ # cat /proc/496774/stack
> [<0>] __switch_to+0x94/0xe8
> [<0>] drm_sched_entity_flush+0xf8/0x248 [gpu_sched]
> [<0>] amdgpu_ctx_mgr_entity_flush+0xac/0x148 [amdgpu]
> [<0>] amdgpu_flush+0x2c/0x50 [amdgpu]
> [<0>] filp_close+0x40/0xa0
> [<0>] put_files_struct+0x118/0x120
> [<0>] put_files_struct+0x30/0x68 [binder_linux]
> [<0>] binder_deferred_func+0x4d4/0x658 [binder_linux]
> [<0>] process_one_work+0x1b4/0x3f8
> [<0>] worker_thread+0x54/0x470
> [<0>] kthread+0x134/0x138
> [<0>] ret_from_fork+0x10/0x18
> [<0>] 0xffffffffffffffff
>
>
>
> This issue troubled me a long time.  looking eagerly to get help from you!
>
>
> -----
> Yanhua




[-- Attachment #1.2: Type: text/html, Size: 7907 bytes --]

[-- Attachment #2: Type: text/plain, Size: 153 bytes --]

_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: Re: Re: Bug: amdgpu drm driver cause process into Disk sleep state
       [not found]                     ` <88a08dcc-2e95-9379-693f-2d3fd928aa11-5C7GfCeVMHo@public.gmane.org>
@ 2019-09-03 13:44                       ` 78666679
  2019-09-05  1:36                       ` Re: Re: Re: Bug: amdgpu drm driver cause process into Disk sleep state yanhua
  1 sibling, 0 replies; 11+ messages in thread
From: 78666679 @ 2019-09-03 13:44 UTC (permalink / raw)
  To: Koenig, Christian, amd-gfx
  Cc: Deucher, Alexander


[-- Warning: decoded text below may be mangled, UTF-8 assumed --]
[-- Attachment #1.1: Type: text/plain; charset="gb18030", Size: 4626 bytes --]

The bug url is:


https://bugs.freedesktop.org/show_bug.cgi?id=111551




------------------ Original Message ------------------
From: "Koenig, Christian"<Christian.Koenig@amd.com>;
Sent: Tuesday, September 3, 2019 9:19 PM
To: ""<78666679@qq.com>;"amd-gfx"<amd-gfx@lists.freedesktop.org>;
Cc: "Deucher, Alexander"<Alexander.Deucher@amd.com>;
Subject: Re: Re: Re: Bug: amdgpu drm driver cause process into Disk sleep state



 This is just a GPU lock, please open up a bug report on freedesktop.org and attach the full dmesg and which version of Mesa you are using.
 
 Regards,
 Christian.
 
 Am 03.09.19 um 15:16 schrieb 78666679:
 
  Yes, with dmesg|grep drm ,  I get following.
 
 
 348571.880718] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma1 timeout, signaled seq=24423862, emitted seq=24423865
  
 
 
 
  ------------------ Original Message ------------------
  From: "Koenig, Christian"<Christian.Koenig@amd.com>;
 Sent: Tuesday, September 3, 2019 9:07 PM
 To: ""<78666679@qq.com>;"amd-gfx"<amd-gfx@lists.freedesktop.org>;
 Cc: "Deucher, Alexander"<Alexander.Deucher@amd.com>;
 Subject: Re: Re: Bug: amdgpu drm driver cause process into Disk sleep state
 
 
 
 Well that looks like the hardware got stuck.
 
 Do you get something in the locks about a timeout on the SDMA ring?
 
 Regards,
 Christian.
 
 Am 03.09.19 um 14:50 schrieb 78666679:
 
  Hi Christian,
        Sometimes the thread blocked  disk sleeping in call to amdgpu_sa_bo_new. following is the stack trace.  it seems the sa bo is used up ,  so  the caller blocked waiting someone to free sa resources. 
 
 
 
 D 206833 227656 [surfaceflinger] <defunct> Binder:45_5
 cat /proc/206833/task/227656/stack
 
 
 [<0>] __switch_to+0x94/0xe8
 [<0>] dma_fence_wait_any_timeout+0x234/0x2d0
 [<0>] amdgpu_sa_bo_new+0x468/0x540 [amdgpu]
 [<0>] amdgpu_ib_get+0x60/0xc8 [amdgpu]
 [<0>] amdgpu_job_alloc_with_ib+0x70/0xb0 [amdgpu]
 [<0>] amdgpu_vm_bo_update_mapping+0x2e0/0x3d8 [amdgpu]
 [<0>] amdgpu_vm_bo_update+0x2a0/0x710 [amdgpu]
 [<0>] amdgpu_gem_va_ioctl+0x46c/0x4c8 [amdgpu]
 [<0>] drm_ioctl_kernel+0x94/0x118 [drm]
 [<0>] drm_ioctl+0x1f0/0x438 [drm]
 [<0>] amdgpu_drm_ioctl+0x58/0x90 [amdgpu]
 [<0>] do_vfs_ioctl+0xc4/0x8c0
 [<0>] ksys_ioctl+0x8c/0xa0
 [<0>] __arm64_sys_ioctl+0x28/0x38
 [<0>] el0_svc_common+0xa0/0x180
 [<0>] el0_svc_handler+0x38/0x78
 [<0>] el0_svc+0x8/0xc
 [<0>] 0xffffffffffffffff
 
 
 
 
  --------------------
 YanHua
 
 
 
  ------------------ Original Message ------------------
  From: "Koenig, Christian"<Christian.Koenig@amd.com>;
 Sent: Tuesday, September 3, 2019 4:21 PM
 To: ""<78666679@qq.com>;"amd-gfx"<amd-gfx@lists.freedesktop.org>;
 Cc: "Deucher, Alexander"<Alexander.Deucher@amd.com>;
 Subject: Re: Bug: amdgpu drm driver cause process into Disk sleep state
 
 
 
 Hi Yanhua,
 
 please update your kernel first, cause that looks like a known issue 
 which was recently fixed by patch "drm/scheduler: use job count instead 
 of peek".
 
 Probably best to try the latest bleeding edge kernel and if that doesn't 
 help please open up a bug report on  https://bugs.freedesktop.org/.
 
 Regards,
 Christian.
 
 Am 03.09.19 um 09:35 schrieb 78666679:
 > Hi, Sirs:
 >         I have a wx5100 amdgpu card, It randomly come into failure.  sometimes, it will cause processes into uninterruptible wait state.
 >
 >
 > cps-new-ondemand-0587:~ # ps aux|grep -w D
 > root      11268  0.0  0.0 260628  3516 ?        Ssl  Aug26   0:00 /usr/sbin/gssproxy -D
 > root     136482  0.0  0.0 212500   572 pts/0    S+   15:25   0:00 grep --color=auto -w D
 > root     370684  0.0  0.0  17972  7428 ?        Ss   Sep02   0:04 /usr/sbin/sshd -D
 > 10066    432951  0.0  0.0      0     0 ?        D    Sep02   0:00 [FakeFinalizerDa]
 > root     496774  0.0  0.0      0     0 ?        D    Sep02   0:17 [kworker/8:1+eve]
 > cps-new-ondemand-0587:~ # cat /proc/496774/stack
 > [<0>] __switch_to+0x94/0xe8
 > [<0>] drm_sched_entity_flush+0xf8/0x248 [gpu_sched]
 > [<0>] amdgpu_ctx_mgr_entity_flush+0xac/0x148 [amdgpu]
 > [<0>] amdgpu_flush+0x2c/0x50 [amdgpu]
 > [<0>] filp_close+0x40/0xa0
 > [<0>] put_files_struct+0x118/0x120
 > [<0>] put_files_struct+0x30/0x68 [binder_linux]
 > [<0>] binder_deferred_func+0x4d4/0x658 [binder_linux]
 > [<0>] process_one_work+0x1b4/0x3f8
 > [<0>] worker_thread+0x54/0x470
 > [<0>] kthread+0x134/0x138
 > [<0>] ret_from_fork+0x10/0x18
 > [<0>] 0xffffffffffffffff
 >
 >
 >
 > This issue troubled me a long time.  looking eagerly to get help from you!
 >
 >
 > -----
 > Yanhua

[-- Attachment #1.2: Type: text/html, Size: 7747 bytes --]

[-- Attachment #2: Type: text/plain, Size: 153 bytes --]

_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: Re: Re: Bug: amdgpu drm driver cause process into Disk sleep state
       [not found]                     ` <88a08dcc-2e95-9379-693f-2d3fd928aa11-5C7GfCeVMHo@public.gmane.org>
  2019-09-03 13:44                       ` Re: Re: Re: Bug: amdgpu drm driver cause process into Disk sleep state 78666679
@ 2019-09-05  1:36                       ` yanhua
       [not found]                         ` <tencent_20683D4D4999B2E0A746EA7D01D677D6070A-9uewiaClKEY@public.gmane.org>
  1 sibling, 1 reply; 11+ messages in thread
From: yanhua @ 2019-09-05  1:36 UTC (permalink / raw)
  To: Koenig, Christian, amd-gfx
  Cc: Deucher, Alexander


[-- Warning: decoded text below may be mangled, UTF-8 assumed --]
[-- Attachment #1.1: Type: text/plain; charset="gb18030", Size: 6813 bytes --]

Hi, Christian,
        I noticed that you said 'amdgpu is known to not work on arm64 until very recently'. The CPU-related DRM commit I found is "drm: disable uncached DMA optimization for ARM and arm64".

@@ -47,6 +47,24 @@ static inline bool drm_arch_can_wc_memory(void)
        return false;
 #elif defined(CONFIG_MIPS) && defined(CONFIG_CPU_LOONGSON3)
        return false;
+#elif defined(CONFIG_ARM) || defined(CONFIG_ARM64)
+       /*
+        * The DRM driver stack is designed to work with cache coherent devices
+        * only, but permits an optimization to be enabled in some cases, where
+        * for some buffers, both the CPU and the GPU use uncached mappings,
+        * removing the need for DMA snooping and allocation in the CPU caches.
+        *
+        * The use of uncached GPU mappings relies on the correct implementation
+        * of the PCIe NoSnoop TLP attribute by the platform, otherwise the GPU
+        * will use cached mappings nonetheless. On x86 platforms, this does not
+        * seem to matter, as uncached CPU mappings will snoop the caches in any
+        * case. However, on ARM and arm64, enabling this optimization on a
+        * platform where NoSnoop is ignored results in loss of coherency, which
+        * breaks correct operation of the device. Since we have no way of
+        * detecting whether NoSnoop works or not, just disable this
+        * optimization entirely for ARM and arm64.
+        */
+       return false;
 #else
        return true;
 #endif


Its real effect is this check in amdgpu_object.c:



	if (!drm_arch_can_wc_memory())
		bo->flags &= ~AMDGPU_GEM_CREATE_CPU_GTT_USWC;



And we already have AMDGPU_GEM_CREATE_CPU_GTT_USWC turned off in our 4.19.36 kernel, so I think this is not the cause of my bug. Is there anything I have missed?


I had suggested that the machine supplier use a newer kernel such as 5.2.2, but they failed to do so after some tries. We also backported a series of patches from newer kernels, but we still get the bad ring timeout.


We have been digging into the amdgpu drm driver for a long time, but it is really difficult for me, especially the hardware-related ring timeout.


------------------
Yanhua



------------------ Original Message ------------------
From: "Koenig, Christian"<Christian.Koenig@amd.com>;
Sent: Tuesday, September 3, 2019 9:19 PM
To: "yanhua"<78666679@qq.com>;"amd-gfx"<amd-gfx@lists.freedesktop.org>;
Cc: "Deucher, Alexander"<Alexander.Deucher@amd.com>;
Subject: Re: Re: Re: Bug: amdgpu drm driver cause process into Disk sleep state



 This is just a GPU lock, please open up a bug report on freedesktop.org and attach the full dmesg and which version of Mesa you are using.
 
 Regards,
 Christian.
 
 Am 03.09.19 um 15:16 schrieb 78666679:
 
  Yes, with dmesg|grep drm ,  I get following.
 
 
 348571.880718] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma1 timeout, signaled seq=24423862, emitted seq=24423865
  
 
 
 
  ------------------ Original Message ------------------
  From: "Koenig, Christian"<Christian.Koenig@amd.com>;
 Sent: Tuesday, September 3, 2019 9:07 PM
 To: ""<78666679@qq.com>;"amd-gfx"<amd-gfx@lists.freedesktop.org>;
 Cc: "Deucher, Alexander"<Alexander.Deucher@amd.com>;
 Subject: Re: Re: Bug: amdgpu drm driver cause process into Disk sleep state
 
 
 
 Well that looks like the hardware got stuck.
 
 Do you get something in the locks about a timeout on the SDMA ring?
 
 Regards,
 Christian.
 
 Am 03.09.19 um 14:50 schrieb 78666679:
 
  Hi Christian,
        Sometimes the thread blocked  disk sleeping in call to amdgpu_sa_bo_new. following is the stack trace.  it seems the sa bo is used up ,  so  the caller blocked waiting someone to free sa resources. 
 
 
 
 D 206833 227656 [surfaceflinger] <defunct> Binder:45_5
 cat /proc/206833/task/227656/stack
 
 
 [<0>] __switch_to+0x94/0xe8
 [<0>] dma_fence_wait_any_timeout+0x234/0x2d0
 [<0>] amdgpu_sa_bo_new+0x468/0x540 [amdgpu]
 [<0>] amdgpu_ib_get+0x60/0xc8 [amdgpu]
 [<0>] amdgpu_job_alloc_with_ib+0x70/0xb0 [amdgpu]
 [<0>] amdgpu_vm_bo_update_mapping+0x2e0/0x3d8 [amdgpu]
 [<0>] amdgpu_vm_bo_update+0x2a0/0x710 [amdgpu]
 [<0>] amdgpu_gem_va_ioctl+0x46c/0x4c8 [amdgpu]
 [<0>] drm_ioctl_kernel+0x94/0x118 [drm]
 [<0>] drm_ioctl+0x1f0/0x438 [drm]
 [<0>] amdgpu_drm_ioctl+0x58/0x90 [amdgpu]
 [<0>] do_vfs_ioctl+0xc4/0x8c0
 [<0>] ksys_ioctl+0x8c/0xa0
 [<0>] __arm64_sys_ioctl+0x28/0x38
 [<0>] el0_svc_common+0xa0/0x180
 [<0>] el0_svc_handler+0x38/0x78
 [<0>] el0_svc+0x8/0xc
 [<0>] 0xffffffffffffffff
 
 
 
 
  --------------------
 YanHua
 
 
 
  ------------------ Original Message ------------------
  From: "Koenig, Christian"<Christian.Koenig@amd.com>;
 Sent: Tuesday, September 3, 2019 4:21 PM
 To: ""<78666679@qq.com>;"amd-gfx"<amd-gfx@lists.freedesktop.org>;
 Cc: "Deucher, Alexander"<Alexander.Deucher@amd.com>;
 Subject: Re: Bug: amdgpu drm driver cause process into Disk sleep state
 
 
 
 Hi Yanhua,
 
 please update your kernel first, cause that looks like a known issue 
 which was recently fixed by patch "drm/scheduler: use job count instead 
 of peek".
 
 Probably best to try the latest bleeding edge kernel and if that doesn't 
 help please open up a bug report on  https://bugs.freedesktop.org/.
 
 Regards,
 Christian.
 
 Am 03.09.19 um 09:35 schrieb 78666679:
 > Hi, Sirs:
 >         I have a wx5100 amdgpu card, It randomly come into failure.  sometimes, it will cause processes into uninterruptible wait state.
 >
 >
 > cps-new-ondemand-0587:~ # ps aux|grep -w D
 > root      11268  0.0  0.0 260628  3516 ?        Ssl  Aug26   0:00 /usr/sbin/gssproxy -D
 > root     136482  0.0  0.0 212500   572 pts/0    S+   15:25   0:00 grep --color=auto -w D
 > root     370684  0.0  0.0  17972  7428 ?        Ss   Sep02   0:04 /usr/sbin/sshd -D
 > 10066    432951  0.0  0.0      0     0 ?        D    Sep02   0:00 [FakeFinalizerDa]
 > root     496774  0.0  0.0      0     0 ?        D    Sep02   0:17 [kworker/8:1+eve]
 > cps-new-ondemand-0587:~ # cat /proc/496774/stack
 > [<0>] __switch_to+0x94/0xe8
 > [<0>] drm_sched_entity_flush+0xf8/0x248 [gpu_sched]
 > [<0>] amdgpu_ctx_mgr_entity_flush+0xac/0x148 [amdgpu]
 > [<0>] amdgpu_flush+0x2c/0x50 [amdgpu]
 > [<0>] filp_close+0x40/0xa0
 > [<0>] put_files_struct+0x118/0x120
 > [<0>] put_files_struct+0x30/0x68 [binder_linux]
 > [<0>] binder_deferred_func+0x4d4/0x658 [binder_linux]
 > [<0>] process_one_work+0x1b4/0x3f8
 > [<0>] worker_thread+0x54/0x470
 > [<0>] kthread+0x134/0x138
 > [<0>] ret_from_fork+0x10/0x18
 > [<0>] 0xffffffffffffffff
 >
 >
 >
 > This issue troubled me a long time.  looking eagerly to get help from you!
 >
 >
 > -----
 > Yanhua

[-- Attachment #1.2: Type: text/html, Size: 11026 bytes --]

[-- Attachment #2: Type: text/plain, Size: 153 bytes --]

_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: Re: Re: Re: Bug: amdgpu drm driver cause process into Disk sleep state
       [not found]                         ` <tencent_20683D4D4999B2E0A746EA7D01D677D6070A-9uewiaClKEY@public.gmane.org>
@ 2019-09-06 11:23                           ` Koenig, Christian
       [not found]                             ` <badd9ea1-6f78-abbc-bdbe-e11271188524-5C7GfCeVMHo@public.gmane.org>
  0 siblings, 1 reply; 11+ messages in thread
From: Koenig, Christian @ 2019-09-06 11:23 UTC (permalink / raw)
  To: yanhua, amd-gfx; +Cc: Deucher, Alexander


[-- Warning: decoded text below may be mangled, UTF-8 assumed --]
[-- Attachment #1.1: Type: text/plain; charset="GB18030", Size: 8503 bytes --]

Is there anything I have missed?

Yeah, unfortunately quite a bunch of things. The fact that arm64 doesn't support the PCIe NoSnoop TLP attribute is only the tip of the iceberg.

You need a full "recent" driver stack, e.g. not older than a few months to a year, for this to work. And not only the kernel, but also recent userspace components.

Maybe that's something you could try first, e.g. install a recent version of Mesa and/or tell Mesa not to use the SDMA at all. But since you are running into an SDMA lockup with a kernel-triggered page table update, I see little chance that this will work.

The only other alternative I can see is the DKMS package of the pro-driver. With that one you might be able to compile the recent driver for an older kernel version.

But I can't guarantee at all that this actually works on ARM64.

Sorry that I don't have better news for you,
Christian.

Am 05.09.19 um 03:36 schrieb yanhua:
Hi, Christian,
        I noticed that you said  'amdgpu is known to not work on arm64 until very recently'.    I found the CPU related commit with drm is "drm: disable uncached DMA optimization for ARM and arm64".
@@ -47,6 +47,24 @@ static inline bool drm_arch_can_wc_memory(void)
        return false;
 #elif defined(CONFIG_MIPS) && defined(CONFIG_CPU_LOONGSON3)
        return false;
+#elif defined(CONFIG_ARM) || defined(CONFIG_ARM64)
+       /*
+        * The DRM driver stack is designed to work with cache coherent devices
+        * only, but permits an optimization to be enabled in some cases, where
+        * for some buffers, both the CPU and the GPU use uncached mappings,
+        * removing the need for DMA snooping and allocation in the CPU caches.
+        *
+        * The use of uncached GPU mappings relies on the correct implementation
+        * of the PCIe NoSnoop TLP attribute by the platform, otherwise the GPU
+        * will use cached mappings nonetheless. On x86 platforms, this does not
+        * seem to matter, as uncached CPU mappings will snoop the caches in any
+        * case. However, on ARM and arm64, enabling this optimization on a
+        * platform where NoSnoop is ignored results in loss of coherency, which
+        * breaks correct operation of the device. Since we have no way of
+        * detecting whether NoSnoop works or not, just disable this
+        * optimization entirely for ARM and arm64.
+        */
+       return false;
 #else
        return true;
 #endif

The real effect is to  in amdgpu_object.c

   if (!drm_arch_can_wc_memory())
                bo->flags &= ~AMDGPU_GEM_CREATE_CPU_GTT_USWC;

And we have AMDGPU_GEM_CREATE_CPU_GTT_USWC turned off in our 4.19.36 kernel, So I think this is not  the cause of my bug.  Are there anything I have missed ?

I had suggest the machine supplier to use a more newer kernel such as 5.2.2, But they failed to do so after some try.  We also backport a series patches from newer kernel. But still we get the bad ring timeout.

We have dived into the amdgpu drm driver a long time, bu it is really difficult for me, especially the hardware related ring timeout.

------------------
Yanhua

------------------ Original Message ------------------
From: "Koenig, Christian"<Christian.Koenig-5C7GfCeVMHo@public.gmane.org><mailto:Christian.Koenig-5C7GfCeVMHo@public.gmane.org>;
Sent: Tuesday, September 3, 2019 9:19 PM
To: "yanhua"<78666679-9uewiaClKEY@public.gmane.org><mailto:78666679-9uewiaClKEY@public.gmane.org>;"amd-gfx"<amd-gfx-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW@public.gmane.org><mailto:amd-gfx-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW@public.gmane.org>;
Cc: "Deucher, Alexander"<Alexander.Deucher-5C7GfCeVMHo@public.gmane.org><mailto:Alexander.Deucher-5C7GfCeVMHo@public.gmane.org>;
Subject: Re: Re: Re: Bug: amdgpu drm driver cause process into Disk sleep state

This is just a GPU lock, please open up a bug report on freedesktop.org and attach the full dmesg and which version of Mesa you are using.

Regards,
Christian.

Am 03.09.19 um 15:16 schrieb 78666679:
Yes, with dmesg|grep drm ,  I get following.

348571.880718] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma1 timeout, signaled seq=24423862, emitted seq=24423865


------------------ Original Message ------------------
From: "Koenig, Christian"<Christian.Koenig-5C7GfCeVMHo@public.gmane.org><mailto:Christian.Koenig-5C7GfCeVMHo@public.gmane.org>;
Sent: Tuesday, September 3, 2019 9:07 PM
To: ""<78666679-9uewiaClKEY@public.gmane.org><mailto:78666679-9uewiaClKEY@public.gmane.org>;"amd-gfx"<amd-gfx-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW@public.gmane.org><mailto:amd-gfx-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW@public.gmane.org>;
Cc: "Deucher, Alexander"<Alexander.Deucher-5C7GfCeVMHo@public.gmane.org><mailto:Alexander.Deucher-5C7GfCeVMHo@public.gmane.org>;
Subject: Re: Re: Bug: amdgpu drm driver cause process into Disk sleep state

Well that looks like the hardware got stuck.

Do you get something in the locks about a timeout on the SDMA ring?

Regards,
Christian.

Am 03.09.19 um 14:50 schrieb 78666679:
Hi Christian,
       Sometimes the thread blocked  disk sleeping in call to amdgpu_sa_bo_new. following is the stack trace.  it seems the sa bo is used up ,  so  the caller blocked waiting someone to free sa resources.

D 206833 227656 [surfaceflinger] <defunct> Binder:45_5
cat /proc/206833/task/227656/stack

[<0>] __switch_to+0x94/0xe8
[<0>] dma_fence_wait_any_timeout+0x234/0x2d0
[<0>] amdgpu_sa_bo_new+0x468/0x540 [amdgpu]
[<0>] amdgpu_ib_get+0x60/0xc8 [amdgpu]
[<0>] amdgpu_job_alloc_with_ib+0x70/0xb0 [amdgpu]
[<0>] amdgpu_vm_bo_update_mapping+0x2e0/0x3d8 [amdgpu]
[<0>] amdgpu_vm_bo_update+0x2a0/0x710 [amdgpu]
[<0>] amdgpu_gem_va_ioctl+0x46c/0x4c8 [amdgpu]
[<0>] drm_ioctl_kernel+0x94/0x118 [drm]
[<0>] drm_ioctl+0x1f0/0x438 [drm]
[<0>] amdgpu_drm_ioctl+0x58/0x90 [amdgpu]
[<0>] do_vfs_ioctl+0xc4/0x8c0
[<0>] ksys_ioctl+0x8c/0xa0
[<0>] __arm64_sys_ioctl+0x28/0x38
[<0>] el0_svc_common+0xa0/0x180
[<0>] el0_svc_handler+0x38/0x78
[<0>] el0_svc+0x8/0xc
[<0>] 0xffffffffffffffff


--------------------
YanHua

------------------ Original Message ------------------
From: "Koenig, Christian"<Christian.Koenig-5C7GfCeVMHo@public.gmane.org><mailto:Christian.Koenig-5C7GfCeVMHo@public.gmane.org>;
Sent: Tuesday, September 3, 2019 4:21 PM
To: ""<78666679-9uewiaClKEY@public.gmane.org><mailto:78666679-9uewiaClKEY@public.gmane.org>;"amd-gfx"<amd-gfx-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW@public.gmane.org><mailto:amd-gfx-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW@public.gmane.org>;
Cc: "Deucher, Alexander"<Alexander.Deucher-5C7GfCeVMHo@public.gmane.org><mailto:Alexander.Deucher-5C7GfCeVMHo@public.gmane.org>;
Subject: Re: Bug: amdgpu drm driver cause process into Disk sleep state

Hi Yanhua,

please update your kernel first, cause that looks like a known issue
which was recently fixed by patch "drm/scheduler: use job count instead
of peek".

Probably best to try the latest bleeding edge kernel and if that doesn't
help please open up a bug report on https://bugs.freedesktop.org/.

Regards,
Christian.

Am 03.09.19 um 09:35 schrieb 78666679:
> Hi, Sirs:
>         I have a wx5100 amdgpu card, It randomly come into failure.  sometimes, it will cause processes into uninterruptible wait state.
>
>
> cps-new-ondemand-0587:~ # ps aux|grep -w D
> root      11268  0.0  0.0 260628  3516 ?        Ssl  Aug26   0:00 /usr/sbin/gssproxy -D
> root     136482  0.0  0.0 212500   572 pts/0    S+   15:25   0:00 grep --color=auto -w D
> root     370684  0.0  0.0  17972  7428 ?        Ss   Sep02   0:04 /usr/sbin/sshd -D
> 10066    432951  0.0  0.0      0     0 ?        D    Sep02   0:00 [FakeFinalizerDa]
> root     496774  0.0  0.0      0     0 ?        D    Sep02   0:17 [kworker/8:1+eve]
> cps-new-ondemand-0587:~ # cat /proc/496774/stack
> [<0>] __switch_to+0x94/0xe8
> [<0>] drm_sched_entity_flush+0xf8/0x248 [gpu_sched]
> [<0>] amdgpu_ctx_mgr_entity_flush+0xac/0x148 [amdgpu]
> [<0>] amdgpu_flush+0x2c/0x50 [amdgpu]
> [<0>] filp_close+0x40/0xa0
> [<0>] put_files_struct+0x118/0x120
> [<0>] put_files_struct+0x30/0x68 [binder_linux]
> [<0>] binder_deferred_func+0x4d4/0x658 [binder_linux]
> [<0>] process_one_work+0x1b4/0x3f8
> [<0>] worker_thread+0x54/0x470
> [<0>] kthread+0x134/0x138
> [<0>] ret_from_fork+0x10/0x18
> [<0>] 0xffffffffffffffff
>
>
>
> This issue troubled me a long time.  looking eagerly to get help from you!
>
>
> -----
> Yanhua





[-- Attachment #1.2: Type: text/html, Size: 14006 bytes --]

[-- Attachment #2: Type: text/plain, Size: 153 bytes --]

_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: Re: Re: Re: Bug: amdgpu drm driver cause process into Disk sleep state
       [not found]                             ` <badd9ea1-6f78-abbc-bdbe-e11271188524-5C7GfCeVMHo@public.gmane.org>
@ 2019-09-11  2:43                               ` yanhua
  0 siblings, 0 replies; 11+ messages in thread
From: yanhua @ 2019-09-11  2:43 UTC (permalink / raw)
  To: Koenig, Christian, amd-gfx
  Cc: Deucher, Alexander


[-- Warning: decoded text below may be mangled, UTF-8 assumed --]
[-- Attachment #1.1: Type: text/plain; charset="gb18030", Size: 8322 bytes --]

I will be away for some days. Thanks very much for your detailed answer.


Best Regards.
Yanhua





------------------ Original Message ------------------
From: "Koenig, Christian"<Christian.Koenig@amd.com>;
Sent: Friday, September 6, 2019 7:23 PM
To: "yanhua"<78666679@qq.com>;"amd-gfx"<amd-gfx@lists.freedesktop.org>;
Cc: "Deucher, Alexander"<Alexander.Deucher@amd.com>;
Subject: Re: Re: Re: Re: Bug: amdgpu drm driver cause process into Disk sleep state



  Are there anything I have missed ? 
 Yeah, unfortunately quite a bunch of things. The fact that arm64 doesn't support the PCIe NoSnoop TLP attribute is only the tip of the iceberg.
 
 You need a full "recent" driver stack, e.g. not older than a few month till a year, for this to work. And not only the kernel, but also recent userspace components.
 
 Maybe that's something you could first, e.g. install a recent version of Mesa and/or tell Mesa to not use the SDMA at all. But since you are running into an SDMA lockup with a kernel triggered page table update I see little chance that this work.
 
 The only other alternative I can see is the DKMS package of the pro-driver. With that one you might be able to compile the recent driver for an older kernel version.
 
 But I can't guarantee at all that this actually works on ARM64.
 
 Sorry that I don't have better news for you,
 Christian.
 
 Am 05.09.19 um 03:36 schrieb yanhua:
 
  Hi, Christian,
         I noticed that you said  'amdgpu is known to not work on arm64 until very recently'.    I found the CPU related commit with drm is "drm: disable uncached DMA optimization for ARM and arm64".  
 
 @@ -47,6 +47,24 @@ static inline bool drm_arch_can_wc_memory(void)
         return false;
  #elif defined(CONFIG_MIPS) && defined(CONFIG_CPU_LOONGSON3)
         return false;
 +#elif defined(CONFIG_ARM) || defined(CONFIG_ARM64)
 +       /*
 +        * The DRM driver stack is designed to work with cache coherent devices
 +        * only, but permits an optimization to be enabled in some cases, where
 +        * for some buffers, both the CPU and the GPU use uncached mappings,
 +        * removing the need for DMA snooping and allocation in the CPU caches.
 +        *
 +        * The use of uncached GPU mappings relies on the correct implementation
 +        * of the PCIe NoSnoop TLP attribute by the platform, otherwise the GPU
 +        * will use cached mappings nonetheless. On x86 platforms, this does not
 +        * seem to matter, as uncached CPU mappings will snoop the caches in any
 +        * case. However, on ARM and arm64, enabling this optimization on a
 +        * platform where NoSnoop is ignored results in loss of coherency, which
 +        * breaks correct operation of the device. Since we have no way of
 +        * detecting whether NoSnoop works or not, just disable this
 +        * optimization entirely for ARM and arm64.
 +        */
 +       return false;
  #else
         return true;
  #endif
 
 
 The real effect is to  in amdgpu_object.c
 
 
 
    if (!drm_arch_can_wc_memory())
                 bo->flags &= ~AMDGPU_GEM_CREATE_CPU_GTT_USWC;
 
 
 
 And we have AMDGPU_GEM_CREATE_CPU_GTT_USWC turned off in our 4.19.36 kernel, So I think this is not  the cause of my bug.  Are there anything I have missed ?
 
 
 I had suggest the machine supplier to use a more newer kernel such as 5.2.2, But they failed to do so after some try.  We also backport a series patches from newer kernel. But still we get the bad ring timeout.
 
 
 We have dived into the amdgpu drm driver a long time, bu it is really difficult for me, especially the hardware related ring timeout.
 
 
 ------------------
 Yanhua
 
  
 
  ------------------ Original Message ------------------
  From: "Koenig, Christian"<Christian.Koenig@amd.com>;
 Sent: Tuesday, September 3, 2019 9:19 PM
 To: "yanhua"<78666679@qq.com>;"amd-gfx"<amd-gfx@lists.freedesktop.org>;
 Cc: "Deucher, Alexander"<Alexander.Deucher@amd.com>;
 Subject: Re: Re: Re: Bug: amdgpu drm driver cause process into Disk sleep state
 
 
 
 This is just a GPU lock, please open up a bug report on freedesktop.org and attach the full dmesg and which version of Mesa you are using.
 
 Regards,
 Christian.
 
 Am 03.09.19 um 15:16 schrieb 78666679:
 
  Yes, with dmesg|grep drm ,  I get following.
 
 
 348571.880718] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma1 timeout, signaled seq=24423862, emitted seq=24423865
  
 
 
 
  ------------------ Original Message ------------------
  From: "Koenig, Christian"<Christian.Koenig@amd.com>;
 Sent: Tuesday, September 3, 2019 9:07 PM
 To: ""<78666679@qq.com>;"amd-gfx"<amd-gfx@lists.freedesktop.org>;
 Cc: "Deucher, Alexander"<Alexander.Deucher@amd.com>;
 Subject: Re: Re: Bug: amdgpu drm driver cause process into Disk sleep state
 
 
 
 Well that looks like the hardware got stuck.
 
 Do you get something in the locks about a timeout on the SDMA ring?
 
 Regards,
 Christian.
 
 Am 03.09.19 um 14:50 schrieb 78666679:
 
  Hi Christian,
        Sometimes the thread blocked  disk sleeping in call to amdgpu_sa_bo_new. following is the stack trace.  it seems the sa bo is used up ,  so  the caller blocked waiting someone to free sa resources. 
 
 
 
 D 206833 227656 [surfaceflinger] <defunct> Binder:45_5
 cat /proc/206833/task/227656/stack
 
 
 [<0>] __switch_to+0x94/0xe8
 [<0>] dma_fence_wait_any_timeout+0x234/0x2d0
 [<0>] amdgpu_sa_bo_new+0x468/0x540 [amdgpu]
 [<0>] amdgpu_ib_get+0x60/0xc8 [amdgpu]
 [<0>] amdgpu_job_alloc_with_ib+0x70/0xb0 [amdgpu]
 [<0>] amdgpu_vm_bo_update_mapping+0x2e0/0x3d8 [amdgpu]
 [<0>] amdgpu_vm_bo_update+0x2a0/0x710 [amdgpu]
 [<0>] amdgpu_gem_va_ioctl+0x46c/0x4c8 [amdgpu]
 [<0>] drm_ioctl_kernel+0x94/0x118 [drm]
 [<0>] drm_ioctl+0x1f0/0x438 [drm]
 [<0>] amdgpu_drm_ioctl+0x58/0x90 [amdgpu]
 [<0>] do_vfs_ioctl+0xc4/0x8c0
 [<0>] ksys_ioctl+0x8c/0xa0
 [<0>] __arm64_sys_ioctl+0x28/0x38
 [<0>] el0_svc_common+0xa0/0x180
 [<0>] el0_svc_handler+0x38/0x78
 [<0>] el0_svc+0x8/0xc
 [<0>] 0xffffffffffffffff
 
 
 
 
  --------------------
 YanHua
 
 
 
  ------------------ Original Message ------------------
  From: "Koenig, Christian"<Christian.Koenig@amd.com>;
 Sent: Tuesday, September 3, 2019 4:21 PM
 To: ""<78666679@qq.com>;"amd-gfx"<amd-gfx@lists.freedesktop.org>;
 Cc: "Deucher, Alexander"<Alexander.Deucher@amd.com>;
 Subject: Re: Bug: amdgpu drm driver cause process into Disk sleep state
 
 
 
 Hi Yanhua,
 
 please update your kernel first, cause that looks like a known issue 
 which was recently fixed by patch "drm/scheduler: use job count instead 
 of peek".
 
 Probably best to try the latest bleeding edge kernel and if that doesn't 
 help please open up a bug report on  https://bugs.freedesktop.org/.
 
 Regards,
 Christian.
 
 Am 03.09.19 um 09:35 schrieb 78666679:
 > Hi, Sirs:
 >         I have a wx5100 amdgpu card, It randomly come into failure.  sometimes, it will cause processes into uninterruptible wait state.
 >
 >
 > cps-new-ondemand-0587:~ # ps aux|grep -w D
 > root      11268  0.0  0.0 260628  3516 ?        Ssl  Aug26   0:00 /usr/sbin/gssproxy -D
 > root     136482  0.0  0.0 212500   572 pts/0    S+   15:25   0:00 grep --color=auto -w D
 > root     370684  0.0  0.0  17972  7428 ?        Ss   Sep02   0:04 /usr/sbin/sshd -D
 > 10066    432951  0.0  0.0      0     0 ?        D    Sep02   0:00 [FakeFinalizerDa]
 > root     496774  0.0  0.0      0     0 ?        D    Sep02   0:17 [kworker/8:1+eve]
 > cps-new-ondemand-0587:~ # cat /proc/496774/stack
 > [<0>] __switch_to+0x94/0xe8
 > [<0>] drm_sched_entity_flush+0xf8/0x248 [gpu_sched]
 > [<0>] amdgpu_ctx_mgr_entity_flush+0xac/0x148 [amdgpu]
 > [<0>] amdgpu_flush+0x2c/0x50 [amdgpu]
 > [<0>] filp_close+0x40/0xa0
 > [<0>] put_files_struct+0x118/0x120
 > [<0>] put_files_struct+0x30/0x68 [binder_linux]
 > [<0>] binder_deferred_func+0x4d4/0x658 [binder_linux]
 > [<0>] process_one_work+0x1b4/0x3f8
 > [<0>] worker_thread+0x54/0x470
 > [<0>] kthread+0x134/0x138
 > [<0>] ret_from_fork+0x10/0x18
 > [<0>] 0xffffffffffffffff
 >
 >
 >
 > This issue troubled me a long time.  looking eagerly to get help from you!
 >
 >
 > -----
 > Yanhua

[-- Attachment #1.2: Type: text/html, Size: 13389 bytes --]

[-- Attachment #2: Type: text/plain, Size: 153 bytes --]

_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply	[flat|nested] 11+ messages in thread

end of thread, other threads:[~2019-09-11  2:43 UTC | newest]

Thread overview: 11+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2019-09-03  7:35 Bug: amdgpu drm driver cause process into Disk sleep state 78666679
     [not found] ` <tencent_4DEABBEB3BB4C6A6D84CA9F0DB225FBF5809-9uewiaClKEY@public.gmane.org>
2019-09-03  8:21   ` Koenig, Christian
     [not found]     ` <f761fec0-c0cc-426c-6bcb-c3fd23808888-5C7GfCeVMHo@public.gmane.org>
2019-09-03  8:27       ` Re: Bug: amdgpu drm driver cause process into Disk sleep state 78666679
2019-09-03 12:50       ` Re: Bug: amdgpu drm driver cause process into Disk sleep state 78666679
     [not found]         ` <tencent_7DC9F5195A4D538FA626F85991875FC5F508-9uewiaClKEY@public.gmane.org>
2019-09-03 13:07           ` Re: Re: Bug: amdgpu drm driver cause process into Disk sleep state Koenig, Christian
     [not found]             ` <2162676e-dbfa-a67d-248c-98e9eb2099c2-5C7GfCeVMHo@public.gmane.org>
2019-09-03 13:16               ` Re: Re: Bug: amdgpu drm driver cause process into Disk sleep state 78666679
     [not found]                 ` <tencent_DFCD5A0853FDA639F81F91375F8DF55AF508-9uewiaClKEY@public.gmane.org>
2019-09-03 13:19                   ` Re: Re: Re: Bug: amdgpu drm driver cause process into Disk sleep state Koenig, Christian
     [not found]                     ` <88a08dcc-2e95-9379-693f-2d3fd928aa11-5C7GfCeVMHo@public.gmane.org>
2019-09-03 13:44                       ` Re: Re: Re: Bug: amdgpu drm driver cause process into Disk sleep state 78666679
2019-09-05  1:36                       ` Re: Re: Re: Bug: amdgpu drm driver cause process into Disk sleep state yanhua
     [not found]                         ` <tencent_20683D4D4999B2E0A746EA7D01D677D6070A-9uewiaClKEY@public.gmane.org>
2019-09-06 11:23                           ` Re: Re: Re: Re: Bug: amdgpu drm driver cause process into Disk sleep state Koenig, Christian
     [not found]                             ` <badd9ea1-6f78-abbc-bdbe-e11271188524-5C7GfCeVMHo@public.gmane.org>
2019-09-11  2:43                               ` Re: Re: Re: Re: Bug: amdgpu drm driver cause process into Disk sleep state yanhua
