All of lore.kernel.org
 help / color / mirror / Atom feed
* [PATCH 0/1] block,iocost: fix potential kernel NULL
@ 2022-05-13 14:59 Yahu Gao
  2022-05-13 14:59   ` Yahu Gao
  0 siblings, 1 reply; 4+ messages in thread
From: Yahu Gao @ 2022-05-13 14:59 UTC (permalink / raw)
  To: Tejun Heo, Jens Axboe; +Cc: cgroups, linux-block

There is a kernel NULL during ioc_pd_init. And the scene have analysed as
follow:
ioc_pd_init
    ...
    for (tblkg = blkg; tblkg; tblkg = tblkg->parent) {
        struct ioc_gq *tiocg = blkg_to_iocg(tblkg);
        // tiocg = pd_to_iocg(blkg_to_pd(blkg, &blkcg_policy_iocost));
                               ^ returns NULL after first iteration
        iocg->ancestors[tiocg->level] = tiocg;
    }


[ 5837.883640] BUG: unable to handle kernel NULL pointer dereference at 00000000000001d0
[ 5837.930538] PGD 0 P4D 0
[ 5837.945644] Oops: 0000 [#1] SMP NOPTI
[ 5837.967541] CPU: 57 PID: 66239 Comm: bash Kdump: loaded Not tainted  #1
[ 5838.010240] Hardware name:
 5838.051901] RIP: 0010:ioc_pd_init+0x12b/0x1a0
[ 5838.077940] Code: 48 8b 45 28 48 8b 00 8b 80 3c 01 00 00 41 89 84 24 d0 01 00 00 48 85 ed 74 28 48 63 0d 8e 3e f1 00 48 83 c1 4c 48 8b 44 cd 00 <48> 63 90 d0 01 00 00 49 89 84 d4 d8 01 00 00 48 8b 6d 38 48 85 ed
[ 5838.190361] RSP: 0018:ffffbf43629abce0 EFLAGS: 00010086
[ 5838.221618] RAX: 0000000000000000 RBX: ffff9c6d9f97f000 RCX: 000000000000004f
[ 5838.264315] RDX: 0000000000000003 RSI: 0000000000000001 RDI: ffff9c2dc1ddc738
[ 5838.307007] RBP: ffff9c6de4809000 R08: 0000000000000000 R09: ffffdf433f1c63a8
[ 5838.349701] R10: 00000000000000ec R11: 00000000000000ec R12: ffff9c2dc1ddc600
[ 5838.392392] R13: 0000000000000000 R14: 0000000000000000 R15: ffffffff8bf256c0
[ 5838.435085] FS:  00007fd943e90100(0000) GS:ffff9c2dffc40000(0000) knlGS:0000000000000000
[ 5838.483499] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033ize
[ 5838.517866] CR2: 00000000000001d0 CR3: 0000003f338b8001 CR4: 00000000007606e0
[ 5838.560562] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 5838.603253] DR3: 0000000000000000 DR6: 009a8000fffe0ff0 DR7: 0000000000000400
[ 5838.645946] PKRU: 55555554
[ 5838.662103] Call Trace:
[ 5838.676723]  blkcg_activate_policy+0x107/0x2b0
[ 5838.706937]  blk_iocost_init+0x1af/0x240
[ 5838.730387]  ioc_qos_write+0x311/0x410
[ 5838.752798]  ? do_filp_open+0xa7/0x100
[ 5838.775212]  cgroup_file_write+0x8a/0x150
[ 5838.799181]  ? __check_object_size+0xa8/0x16b
[ 5838.825239]  kernfs_fop_write+0x116/0x190
[ 5838.849210]  vfs_write+0xa5/0x1a0
[ 5838.869015]  ksys_write+0x4f/0xb0
[ 5838.888830]  do_syscall_64+0x5b/0x1a0
[ 5838.910724]  entry_SYSCALL_64_after_hwframe+0x65/0xca
[ 5838.940937] RIP: 0033:0x7fd94336eb28

^ permalink raw reply	[flat|nested] 4+ messages in thread

* [PATCH] block,iocost: fix potential kernel NULL
@ 2022-05-13 14:59   ` Yahu Gao
  0 siblings, 0 replies; 4+ messages in thread
From: Yahu Gao @ 2022-05-13 14:59 UTC (permalink / raw)
  To: Tejun Heo, Jens Axboe; +Cc: cgroups, linux-block, Yahu Gao, Kunhai Dai

From: Yahu Gao <yahugao@didiglobal.com>

Some inode pinned dying memory cgroup and its parent destroyed at first.
The parent's pd of iocost won't be allocated during function
blkcg_activate_policy.
Ignore the DYING CSS to avoid kernel NULL during iocost policy data init.

Signed-off-by: Yahu Gao <yahugao@didiglobal.com>
Signed-off-by: Kunhai Dai <daikunhai@didiglobal.com>

diff --git a/block/blk-cgroup.c b/block/blk-cgroup.c
index a91f8ae18b49..32472de2e61d 100644
--- a/block/blk-cgroup.c
+++ b/block/blk-cgroup.c
@@ -1418,6 +1418,9 @@ int blkcg_activate_policy(struct request_queue *q,
 	list_for_each_entry_reverse(blkg, &q->blkg_list, q_node) {
 		struct blkg_policy_data *pd;
 
+		if (blkg->blkcg->css.flags & CSS_DYING)
+			continue;
+
 		if (blkg->pd[pol->plid])
 			continue;
 
@@ -1459,8 +1462,11 @@ int blkcg_activate_policy(struct request_queue *q,
 
 	/* all allocated, init in the same order */
 	if (pol->pd_init_fn)
-		list_for_each_entry_reverse(blkg, &q->blkg_list, q_node)
+		list_for_each_entry_reverse(blkg, &q->blkg_list, q_node) {
+			if (blkg->blkcg->css.flags & CSS_DYING)
+				continue;
 			pol->pd_init_fn(blkg->pd[pol->plid]);
+		}
 
 	__set_bit(pol->plid, q->blkcg_pols);
 	ret = 0;
-- 
2.30.1

^ permalink raw reply related	[flat|nested] 4+ messages in thread

* [PATCH] block,iocost: fix potential kernel NULL
@ 2022-05-13 14:59   ` Yahu Gao
  0 siblings, 0 replies; 4+ messages in thread
From: Yahu Gao @ 2022-05-13 14:59 UTC (permalink / raw)
  To: Tejun Heo, Jens Axboe
  Cc: cgroups-u79uwXL29TY76Z2rM5mHXA,
	linux-block-u79uwXL29TY76Z2rM5mHXA, Yahu Gao, Kunhai Dai

From: Yahu Gao <yahugao-+mmu7dyatJ+Rq8AjE7tl8g@public.gmane.org>

Some inode pinned dying memory cgroup and its parent destroyed at first.
The parent's pd of iocost won't be allocated during function
blkcg_activate_policy.
Ignore the DYING CSS to avoid kernel NULL during iocost policy data init.

Signed-off-by: Yahu Gao <yahugao-+mmu7dyatJ+Rq8AjE7tl8g@public.gmane.org>
Signed-off-by: Kunhai Dai <daikunhai-+mmu7dyatJ+Rq8AjE7tl8g@public.gmane.org>

diff --git a/block/blk-cgroup.c b/block/blk-cgroup.c
index a91f8ae18b49..32472de2e61d 100644
--- a/block/blk-cgroup.c
+++ b/block/blk-cgroup.c
@@ -1418,6 +1418,9 @@ int blkcg_activate_policy(struct request_queue *q,
 	list_for_each_entry_reverse(blkg, &q->blkg_list, q_node) {
 		struct blkg_policy_data *pd;
 
+		if (blkg->blkcg->css.flags & CSS_DYING)
+			continue;
+
 		if (blkg->pd[pol->plid])
 			continue;
 
@@ -1459,8 +1462,11 @@ int blkcg_activate_policy(struct request_queue *q,
 
 	/* all allocated, init in the same order */
 	if (pol->pd_init_fn)
-		list_for_each_entry_reverse(blkg, &q->blkg_list, q_node)
+		list_for_each_entry_reverse(blkg, &q->blkg_list, q_node) {
+			if (blkg->blkcg->css.flags & CSS_DYING)
+				continue;
 			pol->pd_init_fn(blkg->pd[pol->plid]);
+		}
 
 	__set_bit(pol->plid, q->blkcg_pols);
 	ret = 0;
-- 
2.30.1

^ permalink raw reply related	[flat|nested] 4+ messages in thread

* Re: [PATCH] block,iocost: fix potential kernel NULL
  2022-05-13 14:59   ` Yahu Gao
  (?)
@ 2022-05-13 16:28   ` Tejun Heo
  -1 siblings, 0 replies; 4+ messages in thread
From: Tejun Heo @ 2022-05-13 16:28 UTC (permalink / raw)
  To: Yahu Gao; +Cc: Jens Axboe, cgroups, linux-block, Yahu Gao, Kunhai Dai

Hello,

On Fri, May 13, 2022 at 10:59:28PM +0800, Yahu Gao wrote:
> From: Yahu Gao <yahugao@didiglobal.com>
> 
> Some inode pinned dying memory cgroup and its parent destroyed at first.
> The parent's pd of iocost won't be allocated during function
> blkcg_activate_policy.
> Ignore the DYING CSS to avoid kernel NULL during iocost policy data init.

Thanks for the analysis and patch but I'm not quite sure the analysis is
correct. When a cgroup goes down, its blkgs are destroyed
blkcg_destroy_blkgs() which is invoked by blkcg_unpin_online() when its
online_pin reaches zero which is incremented and decremented recursively, so
an ancestor's blkgs should be destroyed before a descendant's if the code is
working as intended. Can you guys dig a bit deeper and why we're losing
ancestor blkgs before descendants?

Thanks.

-- 
tejun

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2022-05-13 16:28 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2022-05-13 14:59 [PATCH 0/1] block,iocost: fix potential kernel NULL Yahu Gao
2022-05-13 14:59 ` [PATCH] " Yahu Gao
2022-05-13 14:59   ` Yahu Gao
2022-05-13 16:28   ` Tejun Heo

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.