From: 78666679 <78666679-9uewiaClKEY@public.gmane.org>
To: "Koenig, Christian" <Christian.Koenig-5C7GfCeVMHo@public.gmane.org>,
	amd-gfx <amd-gfx-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW@public.gmane.org>
Cc: "Deucher, Alexander" <Alexander.Deucher-5C7GfCeVMHo@public.gmane.org>
Subject: Re: Bug: amdgpu drm driver cause process into Disk sleep state
Date: Tue, 3 Sep 2019 20:50:21 +0800
Message-ID: <tencent_7DC9F5195A4D538FA626F85991875FC5F508@qq.com>
In-Reply-To: <f761fec0-c0cc-426c-6bcb-c3fd23808888-5C7GfCeVMHo@public.gmane.org>



Hi Christian,
       Sometimes a thread blocks in disk sleep (D state) inside a call to amdgpu_sa_bo_new. The stack trace follows. It seems the SA BO pool is used up, so the caller blocks waiting for someone to free SA resources.



D 206833 227656 [surfaceflinger] <defunct> Binder:45_5
cat /proc/206833/task/227656/stack


[<0>] __switch_to+0x94/0xe8
[<0>] dma_fence_wait_any_timeout+0x234/0x2d0
[<0>] amdgpu_sa_bo_new+0x468/0x540 [amdgpu]
[<0>] amdgpu_ib_get+0x60/0xc8 [amdgpu]
[<0>] amdgpu_job_alloc_with_ib+0x70/0xb0 [amdgpu]
[<0>] amdgpu_vm_bo_update_mapping+0x2e0/0x3d8 [amdgpu]
[<0>] amdgpu_vm_bo_update+0x2a0/0x710 [amdgpu]
[<0>] amdgpu_gem_va_ioctl+0x46c/0x4c8 [amdgpu]
[<0>] drm_ioctl_kernel+0x94/0x118 [drm]
[<0>] drm_ioctl+0x1f0/0x438 [drm]
[<0>] amdgpu_drm_ioctl+0x58/0x90 [amdgpu]
[<0>] do_vfs_ioctl+0xc4/0x8c0
[<0>] ksys_ioctl+0x8c/0xa0
[<0>] __arm64_sys_ioctl+0x28/0x38
[<0>] el0_svc_common+0xa0/0x180
[<0>] el0_svc_handler+0x38/0x78
[<0>] el0_svc+0x8/0xc
[<0>] 0xffffffffffffffff
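
The manual steps above (find a task in state D, then cat its /proc stack file) could be scripted for repeated triage. Below is a minimal Python sketch under stated assumptions: the function names are hypothetical, the frame format is the one shown in the trace above, and reading /proc/<pid>/task/<tid>/stack normally requires root.

```python
import re
from pathlib import Path

# One frame as printed in /proc/<pid>/stack, for example:
#   [<0>] amdgpu_sa_bo_new+0x468/0x540 [amdgpu]
FRAME_RE = re.compile(
    r"^\[<[0-9a-fx]+>\]\s+"             # address placeholder, "[<0>]" in this trace
    r"(?P<symbol>[^\s+]+)"              # function name
    r"\+(?P<offset>0x[0-9a-f]+)"        # offset into the function
    r"/(?P<size>0x[0-9a-f]+)"           # size of the function
    r"(?:\s+\[(?P<module>[^\]]+)\])?$"  # optional module name
)


def parse_stack(text):
    """Return (symbol, module) pairs, innermost frame first.

    module is None for frames without a "[module]" suffix. Lines that are
    not symbol+offset/size frames (such as the trailing
    0xffffffffffffffff marker) are skipped.
    """
    frames = []
    for line in text.splitlines():
        match = FRAME_RE.match(line.strip())
        if match:
            frames.append((match.group("symbol"), match.group("module")))
    return frames


def first_frame_in(frames, module):
    """Return the innermost symbol that belongs to the given module, or None."""
    for symbol, mod in frames:
        if mod == module:
            return symbol
    return None


def task_state(stat_line):
    """Extract the state letter from a /proc/<pid>/stat line.

    The comm field may itself contain spaces or parentheses, so split
    after the last ')'.
    """
    return stat_line.rsplit(")", 1)[1].split()[0]


def blocked_tasks(proc_root="/proc"):
    """Yield (pid, tid, frames) for every task currently in state D."""
    for stat in Path(proc_root).glob("[0-9]*/task/[0-9]*/stat"):
        try:
            if task_state(stat.read_text(errors="replace")) != "D":
                continue
            stack = (stat.parent / "stack").read_text(errors="replace")
        except OSError:
            continue  # task exited in the meantime, or no permission
        yield stat.parts[-4], stat.parts[-2], parse_stack(stack)
```

Calling first_frame_in(frames, "amdgpu") on each result of blocked_tasks() would give the innermost amdgpu function a stuck task is waiting in, which makes it easy to group hung tasks by wait site.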




--------------------
YanHua



------------------ Original Message ------------------
From: "Koenig, Christian"<Christian.Koenig@amd.com>;
Sent: Tuesday, 3 Sep 2019, 4:21 PM
To: ""<78666679@qq.com>;"amd-gfx"<amd-gfx@lists.freedesktop.org>;
Cc: "Deucher, Alexander"<Alexander.Deucher@amd.com>;
Subject: Re: Bug: amdgpu drm driver cause process into Disk sleep state



Hi Yanhua,

Please update your kernel first, because that looks like a known issue 
which was recently fixed by the patch "drm/scheduler: use job count instead 
of peek".

Probably best to try the latest bleeding edge kernel and if that doesn't 
help please open up a bug report on https://bugs.freedesktop.org/.

Regards,
Christian.

Am 03.09.19 um 09:35 schrieb 78666679:
> Hi Sirs,
>         I have a WX5100 amdgpu card that fails at random. Sometimes it leaves processes stuck in the uninterruptible wait (D) state.
>
>
> cps-new-ondemand-0587:~ # ps aux|grep -w D
> root      11268  0.0  0.0 260628  3516 ?        Ssl  Aug26   0:00 /usr/sbin/gssproxy -D
> root     136482  0.0  0.0 212500   572 pts/0    S+   15:25   0:00 grep --color=auto -w D
> root     370684  0.0  0.0  17972  7428 ?        Ss   Sep02   0:04 /usr/sbin/sshd -D
> 10066    432951  0.0  0.0      0     0 ?        D    Sep02   0:00 [FakeFinalizerDa]
> root     496774  0.0  0.0      0     0 ?        D    Sep02   0:17 [kworker/8:1+eve]
> cps-new-ondemand-0587:~ # cat /proc/496774/stack
> [<0>] __switch_to+0x94/0xe8
> [<0>] drm_sched_entity_flush+0xf8/0x248 [gpu_sched]
> [<0>] amdgpu_ctx_mgr_entity_flush+0xac/0x148 [amdgpu]
> [<0>] amdgpu_flush+0x2c/0x50 [amdgpu]
> [<0>] filp_close+0x40/0xa0
> [<0>] put_files_struct+0x118/0x120
> [<0>] put_files_struct+0x30/0x68 [binder_linux]
> [<0>] binder_deferred_func+0x4d4/0x658 [binder_linux]
> [<0>] process_one_work+0x1b4/0x3f8
> [<0>] worker_thread+0x54/0x470
> [<0>] kthread+0x134/0x138
> [<0>] ret_from_fork+0x10/0x18
> [<0>] 0xffffffffffffffff
>
>
>
> This issue has troubled me for a long time. I look forward to your help!
>
>
> -----
> Yanhua


_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx
