From: 78666679 <78666679-9uewiaClKEY@public.gmane.org>
To: "Koenig, Christian" <Christian.Koenig-5C7GfCeVMHo@public.gmane.org>,
amd-gfx <amd-gfx-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW@public.gmane.org>
Cc: "Deucher, Alexander" <Alexander.Deucher-5C7GfCeVMHo@public.gmane.org>
Subject: Re: Bug: amdgpu drm driver cause process into Disk sleep state
Date: Tue, 3 Sep 2019 20:50:21 +0800
Message-ID: <tencent_7DC9F5195A4D538FA626F85991875FC5F508@qq.com>
In-Reply-To: <f761fec0-c0cc-426c-6bcb-c3fd23808888-5C7GfCeVMHo@public.gmane.org>
Hi Christian,
Sometimes a thread blocks in disk sleep (D state) inside a call to amdgpu_sa_bo_new. The stack trace follows. It seems the SA BO pool is used up, so the caller blocks waiting for someone to free SA resources.
D 206833 227656 [surfaceflinger] <defunct> Binder:45_5
cat /proc/206833/task/227656/stack
[<0>] __switch_to+0x94/0xe8
[<0>] dma_fence_wait_any_timeout+0x234/0x2d0
[<0>] amdgpu_sa_bo_new+0x468/0x540 [amdgpu]
[<0>] amdgpu_ib_get+0x60/0xc8 [amdgpu]
[<0>] amdgpu_job_alloc_with_ib+0x70/0xb0 [amdgpu]
[<0>] amdgpu_vm_bo_update_mapping+0x2e0/0x3d8 [amdgpu]
[<0>] amdgpu_vm_bo_update+0x2a0/0x710 [amdgpu]
[<0>] amdgpu_gem_va_ioctl+0x46c/0x4c8 [amdgpu]
[<0>] drm_ioctl_kernel+0x94/0x118 [drm]
[<0>] drm_ioctl+0x1f0/0x438 [drm]
[<0>] amdgpu_drm_ioctl+0x58/0x90 [amdgpu]
[<0>] do_vfs_ioctl+0xc4/0x8c0
[<0>] ksys_ioctl+0x8c/0xa0
[<0>] __arm64_sys_ioctl+0x28/0x38
[<0>] el0_svc_common+0xa0/0x180
[<0>] el0_svc_handler+0x38/0x78
[<0>] el0_svc+0x8/0xc
[<0>] 0xffffffffffffffff
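The D-state check and stack dump above can be gathered in one pass. The following is a minimal sketch, assuming Python 3 and root access on the affected machine; `parse_stat` and `dump_blocked_tasks` are hypothetical helper names, not part of any amdgpu tooling.

```python
import glob


def parse_stat(stat_text):
    """Return (tid, comm, state) from the contents of a /proc/.../stat file.

    comm is wrapped in parentheses and may itself contain spaces or ')',
    so split on the LAST ')' rather than on whitespace.
    """
    head, _, tail = stat_text.rpartition(")")
    tid, _, comm = head.partition(" (")
    state = tail.split()[0]
    return int(tid), comm, state


def dump_blocked_tasks(proc_root="/proc"):
    """Print the kernel stack of every thread in uninterruptible sleep (D)."""
    for stat_path in glob.glob(f"{proc_root}/[0-9]*/task/[0-9]*/stat"):
        try:
            with open(stat_path) as f:
                tid, comm, state = parse_stat(f.read())
            if state != "D":
                continue
            # Reading .../stack normally requires root.
            with open(stat_path[: -len("stat")] + "stack") as f:
                stack = f.read()
        except (OSError, ValueError, IndexError):
            continue  # thread exited mid-scan, or access was denied
        print(f"== tid {tid} [{comm}] ==")
        print(stack)
```

Calling `dump_blocked_tasks()` as root prints one stack per blocked thread, which may make it easier to see whether several threads are stuck behind the same wait.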
--------------------
YanHua
------------------ Original Message ------------------
From: "Koenig, Christian"<Christian.Koenig@amd.com>;
Sent: Tuesday, 3 September 2019, 4:21 PM
To: ""<78666679@qq.com>;"amd-gfx"<amd-gfx@lists.freedesktop.org>;
Cc: "Deucher, Alexander"<Alexander.Deucher@amd.com>;
Subject: Re: Bug: amdgpu drm driver cause process into Disk sleep state
Hi Yanhua,
please update your kernel first, because that looks like a known issue
that was recently fixed by the patch "drm/scheduler: use job count instead
of peek".
Probably best to try the latest bleeding edge kernel and if that doesn't
help please open up a bug report on https://bugs.freedesktop.org/.
Regards,
Christian.
On 03.09.19 at 09:35, 78666679 wrote:
> Hi, Sirs:
> I have a WX5100 amdgpu card that fails randomly. Sometimes it leaves processes stuck in the uninterruptible wait state.
>
>
> cps-new-ondemand-0587:~ # ps aux|grep -w D
> root 11268 0.0 0.0 260628 3516 ? Ssl Aug26 0:00 /usr/sbin/gssproxy -D
> root 136482 0.0 0.0 212500 572 pts/0 S+ 15:25 0:00 grep --color=auto -w D
> root 370684 0.0 0.0 17972 7428 ? Ss Sep02 0:04 /usr/sbin/sshd -D
> 10066 432951 0.0 0.0 0 0 ? D Sep02 0:00 [FakeFinalizerDa]
> root 496774 0.0 0.0 0 0 ? D Sep02 0:17 [kworker/8:1+eve]
> cps-new-ondemand-0587:~ # cat /proc/496774/stack
> [<0>] __switch_to+0x94/0xe8
> [<0>] drm_sched_entity_flush+0xf8/0x248 [gpu_sched]
> [<0>] amdgpu_ctx_mgr_entity_flush+0xac/0x148 [amdgpu]
> [<0>] amdgpu_flush+0x2c/0x50 [amdgpu]
> [<0>] filp_close+0x40/0xa0
> [<0>] put_files_struct+0x118/0x120
> [<0>] put_files_struct+0x30/0x68 [binder_linux]
> [<0>] binder_deferred_func+0x4d4/0x658 [binder_linux]
> [<0>] process_one_work+0x1b4/0x3f8
> [<0>] worker_thread+0x54/0x470
> [<0>] kthread+0x134/0x138
> [<0>] ret_from_fork+0x10/0x18
> [<0>] 0xffffffffffffffff
>
>
>
> This issue has troubled me for a long time. I am looking forward to your help!
>
>
> -----
> Yanhua