All of lore.kernel.org
 help / color / mirror / Atom feed
* [PATCH] mm/memcontrol: fix NULL pointer dereference when use_hierarchy is 0
@ 2015-02-17  5:24 ` Joonsoo Kim
  0 siblings, 0 replies; 12+ messages in thread
From: Joonsoo Kim @ 2015-02-17  5:24 UTC (permalink / raw)
  To: Andrew Morton
  Cc: Johannes Weiner, Michal Hocko, cgroups, linux-mm, linux-kernel,
	Joonsoo Kim

It can be possible to return NULL in parent_mem_cgroup()
if use_hierarchy is 0. So, we need to check NULL in the loop on
mem_cgroup_low(). Without it, following NULL pointer dereference happens.

[   33.607531] BUG: unable to handle kernel NULL pointer dereference at 00000000000000b0
[   33.608008] IP: [<ffffffff811dcf60>] mem_cgroup_low+0x40/0x90
[   33.608008] PGD 1d893067 PUD 1cf41067 PMD 0
[   33.608008] Oops: 0000 [#12] SMP
[   33.608008] Modules linked in:
[   33.608008] CPU: 1 PID: 3936 Comm: as Tainted: G      D         3.19.0-next-20150216 #156
[   33.608008] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS Bochs 01/01/2011
[   33.608008] task: ffff88001d9c8000 ti: ffff88000cb14000 task.ti: ffff88000cb14000
[   33.608008] RIP: 0010:[<ffffffff811dcf60>]  [<ffffffff811dcf60>] mem_cgroup_low+0x40/0x90
[   33.608008] RSP: 0000:ffff88000cb17a88  EFLAGS: 00010286
[   33.608008] RAX: 0000000000000000 RBX: ffff88000cb17bc0 RCX: 0000000000000000
[   33.608008] RDX: ffff88001f491400 RSI: 0000000000000000 RDI: 0000000000000000
[   33.608008] RBP: ffff88000cb17a88 R08: 0000000000000160 R09: 0000000000000000
[   33.608008] R10: 0000000000000000 R11: 0000000002b8c101 R12: 0000000000000000
[   33.608008] R13: 0000000000000000 R14: ffff88001fff9e08 R15: ffff88001da95800
[   33.608008] FS:  00002b7a12715380(0000) GS:ffff88001fa40000(0000) knlGS:0000000000000000
[   33.608008] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[   33.608008] CR2: 00000000000000b0 CR3: 000000000762f000 CR4: 00000000000007e0
[   33.608008] Stack:
[   33.608008]  ffff88000cb17b18 ffffffff811838ec ffff88000cb17cd8 0000000000000000
[   33.608008]  0000000000000000 0001000000000000 000280da00000000 ffff88001fff8780
[   33.608008]  ffff88000cb17af8 ffffffff810e1d7e ffff88001fff8780 000000030000000c
[   33.608008] Call Trace:
[   33.608008]  [<ffffffff811838ec>] shrink_zone+0xac/0x2d0
[   33.608008]  [<ffffffff810e1d7e>] ? ktime_get+0x3e/0xa0
[   33.608008]  [<ffffffff81183e94>] do_try_to_free_pages+0x174/0x440
[   33.608008]  [<ffffffff8117f1a8>] ? throttle_direct_reclaim+0x98/0x250
[   33.608008]  [<ffffffff8118421a>] try_to_free_pages+0xba/0x150
[   33.608008]  [<ffffffff81176d10>] __alloc_pages_nodemask+0x5a0/0x950
[   33.608008]  [<ffffffff811c09ff>] alloc_pages_vma+0xaf/0x200
[   33.608008]  [<ffffffff811a0717>] handle_mm_fault+0x1287/0x17e0
[   33.608008]  [<ffffffff81059e9e>] ? kvm_clock_read+0x1e/0x20
[   33.608008]  [<ffffffff81059e9e>] ? kvm_clock_read+0x1e/0x20
[   33.608008]  [<ffffffff8101e6a9>] ? sched_clock+0x9/0x10
[   33.608008]  [<ffffffff810605f1>] __do_page_fault+0x191/0x440
[   33.608008]  [<ffffffff81060955>] trace_do_page_fault+0x45/0x100
[   33.608008]  [<ffffffff8105968e>] do_async_page_fault+0x1e/0xd0
[   33.608008]  [<ffffffff8176f628>] async_page_fault+0x28/0x30
[   33.608008] Code: 48 8b 15 cc 21 b4 00 48 39 d6 74 53 48 8b 8e b0 00 00 00 48 39 8e 28 01 00 00 72 43 31 c9 48 39 fe 75 1d eb 35 66 0f 1f 44 00 00 <48> 8b 86 b0 00 00 00 48 39 86 28 01 00 00 72 30 48 39 f7 74 1a
[   33.608008] RIP  [<ffffffff811dcf60>] mem_cgroup_low+0x40/0x90
[   33.608008]  RSP <ffff88000cb17a88>
[   33.608008] CR2: 00000000000000b0
[   33.608008] BUG: unable to handle kernel [   33.653499] ---[ end trace e264a32717ffda51 ]---

Signed-off-by: Joonsoo Kim <iamjoonsoo.kim@lge.com>
---
 mm/memcontrol.c | 2 ++
 1 file changed, 2 insertions(+)

diff --git a/mm/memcontrol.c b/mm/memcontrol.c
index d18d3a6..507cfea 100644
--- a/mm/memcontrol.c
+++ b/mm/memcontrol.c
@@ -5431,6 +5431,8 @@ bool mem_cgroup_low(struct mem_cgroup *root, struct mem_cgroup *memcg)
 
 	while (memcg != root) {
 		memcg = parent_mem_cgroup(memcg);
+		if (!memcg)
+			break;
 
 		if (memcg == root_mem_cgroup)
 			break;
-- 
1.9.1


^ permalink raw reply related	[flat|nested] 12+ messages in thread

* [PATCH] mm/memcontrol: fix NULL pointer dereference when use_hierarchy is 0
@ 2015-02-17  5:24 ` Joonsoo Kim
  0 siblings, 0 replies; 12+ messages in thread
From: Joonsoo Kim @ 2015-02-17  5:24 UTC (permalink / raw)
  To: Andrew Morton
  Cc: Johannes Weiner, Michal Hocko, cgroups, linux-mm, linux-kernel,
	Joonsoo Kim

It can be possible to return NULL in parent_mem_cgroup()
if use_hierarchy is 0. So, we need to check NULL in the loop on
mem_cgroup_low(). Without it, following NULL pointer dereference happens.

[   33.607531] BUG: unable to handle kernel NULL pointer dereference at 00000000000000b0
[   33.608008] IP: [<ffffffff811dcf60>] mem_cgroup_low+0x40/0x90
[   33.608008] PGD 1d893067 PUD 1cf41067 PMD 0
[   33.608008] Oops: 0000 [#12] SMP
[   33.608008] Modules linked in:
[   33.608008] CPU: 1 PID: 3936 Comm: as Tainted: G      D         3.19.0-next-20150216 #156
[   33.608008] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS Bochs 01/01/2011
[   33.608008] task: ffff88001d9c8000 ti: ffff88000cb14000 task.ti: ffff88000cb14000
[   33.608008] RIP: 0010:[<ffffffff811dcf60>]  [<ffffffff811dcf60>] mem_cgroup_low+0x40/0x90
[   33.608008] RSP: 0000:ffff88000cb17a88  EFLAGS: 00010286
[   33.608008] RAX: 0000000000000000 RBX: ffff88000cb17bc0 RCX: 0000000000000000
[   33.608008] RDX: ffff88001f491400 RSI: 0000000000000000 RDI: 0000000000000000
[   33.608008] RBP: ffff88000cb17a88 R08: 0000000000000160 R09: 0000000000000000
[   33.608008] R10: 0000000000000000 R11: 0000000002b8c101 R12: 0000000000000000
[   33.608008] R13: 0000000000000000 R14: ffff88001fff9e08 R15: ffff88001da95800
[   33.608008] FS:  00002b7a12715380(0000) GS:ffff88001fa40000(0000) knlGS:0000000000000000
[   33.608008] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[   33.608008] CR2: 00000000000000b0 CR3: 000000000762f000 CR4: 00000000000007e0
[   33.608008] Stack:
[   33.608008]  ffff88000cb17b18 ffffffff811838ec ffff88000cb17cd8 0000000000000000
[   33.608008]  0000000000000000 0001000000000000 000280da00000000 ffff88001fff8780
[   33.608008]  ffff88000cb17af8 ffffffff810e1d7e ffff88001fff8780 000000030000000c
[   33.608008] Call Trace:
[   33.608008]  [<ffffffff811838ec>] shrink_zone+0xac/0x2d0
[   33.608008]  [<ffffffff810e1d7e>] ? ktime_get+0x3e/0xa0
[   33.608008]  [<ffffffff81183e94>] do_try_to_free_pages+0x174/0x440
[   33.608008]  [<ffffffff8117f1a8>] ? throttle_direct_reclaim+0x98/0x250
[   33.608008]  [<ffffffff8118421a>] try_to_free_pages+0xba/0x150
[   33.608008]  [<ffffffff81176d10>] __alloc_pages_nodemask+0x5a0/0x950
[   33.608008]  [<ffffffff811c09ff>] alloc_pages_vma+0xaf/0x200
[   33.608008]  [<ffffffff811a0717>] handle_mm_fault+0x1287/0x17e0
[   33.608008]  [<ffffffff81059e9e>] ? kvm_clock_read+0x1e/0x20
[   33.608008]  [<ffffffff81059e9e>] ? kvm_clock_read+0x1e/0x20
[   33.608008]  [<ffffffff8101e6a9>] ? sched_clock+0x9/0x10
[   33.608008]  [<ffffffff810605f1>] __do_page_fault+0x191/0x440
[   33.608008]  [<ffffffff81060955>] trace_do_page_fault+0x45/0x100
[   33.608008]  [<ffffffff8105968e>] do_async_page_fault+0x1e/0xd0
[   33.608008]  [<ffffffff8176f628>] async_page_fault+0x28/0x30
[   33.608008] Code: 48 8b 15 cc 21 b4 00 48 39 d6 74 53 48 8b 8e b0 00 00 00 48 39 8e 28 01 00 00 72 43 31 c9 48 39 fe 75 1d eb 35 66 0f 1f 44 00 00 <48> 8b 86 b0 00 00 00 48 39 86 28 01 00 00 72 30 48 39 f7 74 1a
[   33.608008] RIP  [<ffffffff811dcf60>] mem_cgroup_low+0x40/0x90
[   33.608008]  RSP <ffff88000cb17a88>
[   33.608008] CR2: 00000000000000b0
[   33.608008] BUG: unable to handle kernel [   33.653499] ---[ end trace e264a32717ffda51 ]---

Signed-off-by: Joonsoo Kim <iamjoonsoo.kim@lge.com>
---
 mm/memcontrol.c | 2 ++
 1 file changed, 2 insertions(+)

diff --git a/mm/memcontrol.c b/mm/memcontrol.c
index d18d3a6..507cfea 100644
--- a/mm/memcontrol.c
+++ b/mm/memcontrol.c
@@ -5431,6 +5431,8 @@ bool mem_cgroup_low(struct mem_cgroup *root, struct mem_cgroup *memcg)
 
 	while (memcg != root) {
 		memcg = parent_mem_cgroup(memcg);
+		if (!memcg)
+			break;
 
 		if (memcg == root_mem_cgroup)
 			break;
-- 
1.9.1

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

^ permalink raw reply related	[flat|nested] 12+ messages in thread

* [PATCH] mm/memcontrol: fix NULL pointer dereference when use_hierarchy is 0
@ 2015-02-17  5:24 ` Joonsoo Kim
  0 siblings, 0 replies; 12+ messages in thread
From: Joonsoo Kim @ 2015-02-17  5:24 UTC (permalink / raw)
  To: Andrew Morton
  Cc: Johannes Weiner, Michal Hocko, cgroups-u79uwXL29TY76Z2rM5mHXA,
	linux-mm-Bw31MaZKKs3YtjvyW6yDsg,
	linux-kernel-u79uwXL29TY76Z2rM5mHXA, Joonsoo Kim

It can be possible to return NULL in parent_mem_cgroup()
if use_hierarchy is 0. So, we need to check NULL in the loop on
mem_cgroup_low(). Without it, following NULL pointer dereference happens.

[   33.607531] BUG: unable to handle kernel NULL pointer dereference at 00000000000000b0
[   33.608008] IP: [<ffffffff811dcf60>] mem_cgroup_low+0x40/0x90
[   33.608008] PGD 1d893067 PUD 1cf41067 PMD 0
[   33.608008] Oops: 0000 [#12] SMP
[   33.608008] Modules linked in:
[   33.608008] CPU: 1 PID: 3936 Comm: as Tainted: G      D         3.19.0-next-20150216 #156
[   33.608008] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS Bochs 01/01/2011
[   33.608008] task: ffff88001d9c8000 ti: ffff88000cb14000 task.ti: ffff88000cb14000
[   33.608008] RIP: 0010:[<ffffffff811dcf60>]  [<ffffffff811dcf60>] mem_cgroup_low+0x40/0x90
[   33.608008] RSP: 0000:ffff88000cb17a88  EFLAGS: 00010286
[   33.608008] RAX: 0000000000000000 RBX: ffff88000cb17bc0 RCX: 0000000000000000
[   33.608008] RDX: ffff88001f491400 RSI: 0000000000000000 RDI: 0000000000000000
[   33.608008] RBP: ffff88000cb17a88 R08: 0000000000000160 R09: 0000000000000000
[   33.608008] R10: 0000000000000000 R11: 0000000002b8c101 R12: 0000000000000000
[   33.608008] R13: 0000000000000000 R14: ffff88001fff9e08 R15: ffff88001da95800
[   33.608008] FS:  00002b7a12715380(0000) GS:ffff88001fa40000(0000) knlGS:0000000000000000
[   33.608008] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[   33.608008] CR2: 00000000000000b0 CR3: 000000000762f000 CR4: 00000000000007e0
[   33.608008] Stack:
[   33.608008]  ffff88000cb17b18 ffffffff811838ec ffff88000cb17cd8 0000000000000000
[   33.608008]  0000000000000000 0001000000000000 000280da00000000 ffff88001fff8780
[   33.608008]  ffff88000cb17af8 ffffffff810e1d7e ffff88001fff8780 000000030000000c
[   33.608008] Call Trace:
[   33.608008]  [<ffffffff811838ec>] shrink_zone+0xac/0x2d0
[   33.608008]  [<ffffffff810e1d7e>] ? ktime_get+0x3e/0xa0
[   33.608008]  [<ffffffff81183e94>] do_try_to_free_pages+0x174/0x440
[   33.608008]  [<ffffffff8117f1a8>] ? throttle_direct_reclaim+0x98/0x250
[   33.608008]  [<ffffffff8118421a>] try_to_free_pages+0xba/0x150
[   33.608008]  [<ffffffff81176d10>] __alloc_pages_nodemask+0x5a0/0x950
[   33.608008]  [<ffffffff811c09ff>] alloc_pages_vma+0xaf/0x200
[   33.608008]  [<ffffffff811a0717>] handle_mm_fault+0x1287/0x17e0
[   33.608008]  [<ffffffff81059e9e>] ? kvm_clock_read+0x1e/0x20
[   33.608008]  [<ffffffff81059e9e>] ? kvm_clock_read+0x1e/0x20
[   33.608008]  [<ffffffff8101e6a9>] ? sched_clock+0x9/0x10
[   33.608008]  [<ffffffff810605f1>] __do_page_fault+0x191/0x440
[   33.608008]  [<ffffffff81060955>] trace_do_page_fault+0x45/0x100
[   33.608008]  [<ffffffff8105968e>] do_async_page_fault+0x1e/0xd0
[   33.608008]  [<ffffffff8176f628>] async_page_fault+0x28/0x30
[   33.608008] Code: 48 8b 15 cc 21 b4 00 48 39 d6 74 53 48 8b 8e b0 00 00 00 48 39 8e 28 01 00 00 72 43 31 c9 48 39 fe 75 1d eb 35 66 0f 1f 44 00 00 <48> 8b 86 b0 00 00 00 48 39 86 28 01 00 00 72 30 48 39 f7 74 1a
[   33.608008] RIP  [<ffffffff811dcf60>] mem_cgroup_low+0x40/0x90
[   33.608008]  RSP <ffff88000cb17a88>
[   33.608008] CR2: 00000000000000b0
[   33.608008] BUG: unable to handle kernel [   33.653499] ---[ end trace e264a32717ffda51 ]---

Signed-off-by: Joonsoo Kim <iamjoonsoo.kim-Hm3cg6mZ9cc@public.gmane.org>
---
 mm/memcontrol.c | 2 ++
 1 file changed, 2 insertions(+)

diff --git a/mm/memcontrol.c b/mm/memcontrol.c
index d18d3a6..507cfea 100644
--- a/mm/memcontrol.c
+++ b/mm/memcontrol.c
@@ -5431,6 +5431,8 @@ bool mem_cgroup_low(struct mem_cgroup *root, struct mem_cgroup *memcg)
 
 	while (memcg != root) {
 		memcg = parent_mem_cgroup(memcg);
+		if (!memcg)
+			break;
 
 		if (memcg == root_mem_cgroup)
 			break;
-- 
1.9.1

^ permalink raw reply related	[flat|nested] 12+ messages in thread

* Re: [PATCH] mm/memcontrol: fix NULL pointer dereference when use_hierarchy is 0
  2015-02-17  5:24 ` Joonsoo Kim
  (?)
@ 2015-02-17  8:33   ` Michal Hocko
  -1 siblings, 0 replies; 12+ messages in thread
From: Michal Hocko @ 2015-02-17  8:33 UTC (permalink / raw)
  To: Joonsoo Kim
  Cc: Andrew Morton, Johannes Weiner, cgroups, linux-mm, linux-kernel,
	Joonsoo Kim

On Tue 17-02-15 14:24:59, Joonsoo Kim wrote:
> It can be possible to return NULL in parent_mem_cgroup()
> if use_hierarchy is 0.

This alone is not sufficient because the low limit is present only in
the unified hierarchy API and there is no use_hierarchy there. The
primary issue here is that the memcg has 0 usage so the previous
check for usage will not stop us. And that is bug IMO.

I think that the following patch would be more correct from semantic
POV:
---
>From f5d74671d30e44c50b45b4464c92f536f1dbdff6 Mon Sep 17 00:00:00 2001
From: Michal Hocko <mhocko@suse.cz>
Date: Tue, 17 Feb 2015 08:02:12 +0100
Subject: [PATCH] memcg: fix low limit calculation

A memcg is considered low limited even when the current usage is equal
to the low limit. This leads to interesting side effects e.g.
groups/hierarchies with no memory accounted are considered protected and
so the reclaim will emit MEMCG_LOW event when encountering them.

Another and much bigger issue was reported by Joonsoo Kim. He has hit a
NULL ptr dereference with the legacy cgroup API which even doesn't have
low limit exposed. The limit is 0 by default but the initial check fails
for memcg with 0 consumption and parent_mem_cgroup() would return NULL
if use_hierarchy is 0 and so page_counter_read would try to dereference
NULL.

I suppose that the current implementation is just an overlook because
the documentation in Documentation/cgroups/unified-hierarchy.txt says:
"
The memory.low boundary on the other hand is a top-down allocated
reserve.  A cgroup enjoys reclaim protection when it and all its
ancestors are below their low boundaries
"

Fix the usage and the low limit comparision in mem_cgroup_low accordingly.

Fixes: 241994ed8649 (mm: memcontrol: default hierarchy interface for memory)
Reported-by: Joonsoo Kim <js1304@gmail.com>
Signed-off-by: Michal Hocko <mhocko@suse.cz>
---
 mm/memcontrol.c | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/mm/memcontrol.c b/mm/memcontrol.c
index 0b436bc02ba4..079b5c02e245 100644
--- a/mm/memcontrol.c
+++ b/mm/memcontrol.c
@@ -5426,7 +5426,7 @@ bool mem_cgroup_low(struct mem_cgroup *root, struct mem_cgroup *memcg)
 	if (memcg == root_mem_cgroup)
 		return false;
 
-	if (page_counter_read(&memcg->memory) > memcg->low)
+	if (page_counter_read(&memcg->memory) >= memcg->low)
 		return false;
 
 	while (memcg != root) {
@@ -5435,7 +5435,7 @@ bool mem_cgroup_low(struct mem_cgroup *root, struct mem_cgroup *memcg)
 		if (memcg == root_mem_cgroup)
 			break;
 
-		if (page_counter_read(&memcg->memory) > memcg->low)
+		if (page_counter_read(&memcg->memory) >= memcg->low)
 			return false;
 	}
 	return true;
-- 
2.1.4

-- 
Michal Hocko
SUSE Labs

^ permalink raw reply related	[flat|nested] 12+ messages in thread

* Re: [PATCH] mm/memcontrol: fix NULL pointer dereference when use_hierarchy is 0
@ 2015-02-17  8:33   ` Michal Hocko
  0 siblings, 0 replies; 12+ messages in thread
From: Michal Hocko @ 2015-02-17  8:33 UTC (permalink / raw)
  To: Joonsoo Kim
  Cc: Andrew Morton, Johannes Weiner, cgroups, linux-mm, linux-kernel,
	Joonsoo Kim

On Tue 17-02-15 14:24:59, Joonsoo Kim wrote:
> It can be possible to return NULL in parent_mem_cgroup()
> if use_hierarchy is 0.

This alone is not sufficient because the low limit is present only in
the unified hierarchy API and there is no use_hierarchy there. The
primary issue here is that the memcg has 0 usage so the previous
check for usage will not stop us. And that is bug IMO.

I think that the following patch would be more correct from semantic
POV:
---

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: [PATCH] mm/memcontrol: fix NULL pointer dereference when use_hierarchy is 0
@ 2015-02-17  8:33   ` Michal Hocko
  0 siblings, 0 replies; 12+ messages in thread
From: Michal Hocko @ 2015-02-17  8:33 UTC (permalink / raw)
  To: Joonsoo Kim
  Cc: Andrew Morton, Johannes Weiner, cgroups-u79uwXL29TY76Z2rM5mHXA,
	linux-mm-Bw31MaZKKs3YtjvyW6yDsg,
	linux-kernel-u79uwXL29TY76Z2rM5mHXA, Joonsoo Kim

On Tue 17-02-15 14:24:59, Joonsoo Kim wrote:
> It can be possible to return NULL in parent_mem_cgroup()
> if use_hierarchy is 0.

This alone is not sufficient because the low limit is present only in
the unified hierarchy API and there is no use_hierarchy there. The
primary issue here is that the memcg has 0 usage so the previous
check for usage will not stop us. And that is bug IMO.

I think that the following patch would be more correct from semantic
POV:
---
From f5d74671d30e44c50b45b4464c92f536f1dbdff6 Mon Sep 17 00:00:00 2001
From: Michal Hocko <mhocko-AlSwsSmVLrQ@public.gmane.org>
Date: Tue, 17 Feb 2015 08:02:12 +0100
Subject: [PATCH] memcg: fix low limit calculation

A memcg is considered low limited even when the current usage is equal
to the low limit. This leads to interesting side effects e.g.
groups/hierarchies with no memory accounted are considered protected and
so the reclaim will emit MEMCG_LOW event when encountering them.

Another and much bigger issue was reported by Joonsoo Kim. He has hit a
NULL ptr dereference with the legacy cgroup API which even doesn't have
low limit exposed. The limit is 0 by default but the initial check fails
for memcg with 0 consumption and parent_mem_cgroup() would return NULL
if use_hierarchy is 0 and so page_counter_read would try to dereference
NULL.

I suppose that the current implementation is just an overlook because
the documentation in Documentation/cgroups/unified-hierarchy.txt says:
"
The memory.low boundary on the other hand is a top-down allocated
reserve.  A cgroup enjoys reclaim protection when it and all its
ancestors are below their low boundaries
"

Fix the usage and the low limit comparision in mem_cgroup_low accordingly.

Fixes: 241994ed8649 (mm: memcontrol: default hierarchy interface for memory)
Reported-by: Joonsoo Kim <js1304-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
Signed-off-by: Michal Hocko <mhocko-AlSwsSmVLrQ@public.gmane.org>
---
 mm/memcontrol.c | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/mm/memcontrol.c b/mm/memcontrol.c
index 0b436bc02ba4..079b5c02e245 100644
--- a/mm/memcontrol.c
+++ b/mm/memcontrol.c
@@ -5426,7 +5426,7 @@ bool mem_cgroup_low(struct mem_cgroup *root, struct mem_cgroup *memcg)
 	if (memcg == root_mem_cgroup)
 		return false;
 
-	if (page_counter_read(&memcg->memory) > memcg->low)
+	if (page_counter_read(&memcg->memory) >= memcg->low)
 		return false;
 
 	while (memcg != root) {
@@ -5435,7 +5435,7 @@ bool mem_cgroup_low(struct mem_cgroup *root, struct mem_cgroup *memcg)
 		if (memcg == root_mem_cgroup)
 			break;
 
-		if (page_counter_read(&memcg->memory) > memcg->low)
+		if (page_counter_read(&memcg->memory) >= memcg->low)
 			return false;
 	}
 	return true;
-- 
2.1.4

-- 
Michal Hocko
SUSE Labs

^ permalink raw reply related	[flat|nested] 12+ messages in thread

* Re: [PATCH] mm/memcontrol: fix NULL pointer dereference when use_hierarchy is 0
  2015-02-17  8:33   ` Michal Hocko
  (?)
@ 2015-02-17 12:28     ` Johannes Weiner
  -1 siblings, 0 replies; 12+ messages in thread
From: Johannes Weiner @ 2015-02-17 12:28 UTC (permalink / raw)
  To: Michal Hocko
  Cc: Joonsoo Kim, Andrew Morton, cgroups, linux-mm, linux-kernel, Joonsoo Kim

On Tue, Feb 17, 2015 at 09:33:27AM +0100, Michal Hocko wrote:
> On Tue 17-02-15 14:24:59, Joonsoo Kim wrote:
> > It can be possible to return NULL in parent_mem_cgroup()
> > if use_hierarchy is 0.
> 
> This alone is not sufficient because the low limit is present only in
> the unified hierarchy API and there is no use_hierarchy there. The
> primary issue here is that the memcg has 0 usage so the previous
> check for usage will not stop us. And that is bug IMO.

Yes, empty groups shouldn't be considered low.

> From f5d74671d30e44c50b45b4464c92f536f1dbdff6 Mon Sep 17 00:00:00 2001
> From: Michal Hocko <mhocko@suse.cz>
> Date: Tue, 17 Feb 2015 08:02:12 +0100
> Subject: [PATCH] memcg: fix low limit calculation
> 
> A memcg is considered low limited even when the current usage is equal
> to the low limit. This leads to interesting side effects e.g.
> groups/hierarchies with no memory accounted are considered protected and
> so the reclaim will emit MEMCG_LOW event when encountering them.
> 
> Another and much bigger issue was reported by Joonsoo Kim. He has hit a
> NULL ptr dereference with the legacy cgroup API which even doesn't have
> low limit exposed. The limit is 0 by default but the initial check fails
> for memcg with 0 consumption and parent_mem_cgroup() would return NULL
> if use_hierarchy is 0 and so page_counter_read would try to dereference
> NULL.
> 
> I suppose that the current implementation is just an overlook because
> the documentation in Documentation/cgroups/unified-hierarchy.txt says:
> "
> The memory.low boundary on the other hand is a top-down allocated
> reserve.  A cgroup enjoys reclaim protection when it and all its
> ancestors are below their low boundaries
> "
> 
> Fix the usage and the low limit comparision in mem_cgroup_low accordingly.
> 
> Fixes: 241994ed8649 (mm: memcontrol: default hierarchy interface for memory)
> Reported-by: Joonsoo Kim <js1304@gmail.com>
> Signed-off-by: Michal Hocko <mhocko@suse.cz>

Acked-by: Johannes Weiner <hannes@cmpxchg.org>

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: [PATCH] mm/memcontrol: fix NULL pointer dereference when use_hierarchy is 0
@ 2015-02-17 12:28     ` Johannes Weiner
  0 siblings, 0 replies; 12+ messages in thread
From: Johannes Weiner @ 2015-02-17 12:28 UTC (permalink / raw)
  To: Michal Hocko
  Cc: Joonsoo Kim, Andrew Morton, cgroups, linux-mm, linux-kernel, Joonsoo Kim

On Tue, Feb 17, 2015 at 09:33:27AM +0100, Michal Hocko wrote:
> On Tue 17-02-15 14:24:59, Joonsoo Kim wrote:
> > It can be possible to return NULL in parent_mem_cgroup()
> > if use_hierarchy is 0.
> 
> This alone is not sufficient because the low limit is present only in
> the unified hierarchy API and there is no use_hierarchy there. The
> primary issue here is that the memcg has 0 usage so the previous
> check for usage will not stop us. And that is bug IMO.

Yes, empty groups shouldn't be considered low.

> From f5d74671d30e44c50b45b4464c92f536f1dbdff6 Mon Sep 17 00:00:00 2001
> From: Michal Hocko <mhocko@suse.cz>
> Date: Tue, 17 Feb 2015 08:02:12 +0100
> Subject: [PATCH] memcg: fix low limit calculation
> 
> A memcg is considered low limited even when the current usage is equal
> to the low limit. This leads to interesting side effects e.g.
> groups/hierarchies with no memory accounted are considered protected and
> so the reclaim will emit MEMCG_LOW event when encountering them.
> 
> Another and much bigger issue was reported by Joonsoo Kim. He has hit a
> NULL ptr dereference with the legacy cgroup API which even doesn't have
> low limit exposed. The limit is 0 by default but the initial check fails
> for memcg with 0 consumption and parent_mem_cgroup() would return NULL
> if use_hierarchy is 0 and so page_counter_read would try to dereference
> NULL.
> 
> I suppose that the current implementation is just an overlook because
> the documentation in Documentation/cgroups/unified-hierarchy.txt says:
> "
> The memory.low boundary on the other hand is a top-down allocated
> reserve.  A cgroup enjoys reclaim protection when it and all its
> ancestors are below their low boundaries
> "
> 
> Fix the usage and the low limit comparision in mem_cgroup_low accordingly.
> 
> Fixes: 241994ed8649 (mm: memcontrol: default hierarchy interface for memory)
> Reported-by: Joonsoo Kim <js1304@gmail.com>
> Signed-off-by: Michal Hocko <mhocko@suse.cz>

Acked-by: Johannes Weiner <hannes@cmpxchg.org>

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: [PATCH] mm/memcontrol: fix NULL pointer dereference when use_hierarchy is 0
@ 2015-02-17 12:28     ` Johannes Weiner
  0 siblings, 0 replies; 12+ messages in thread
From: Johannes Weiner @ 2015-02-17 12:28 UTC (permalink / raw)
  To: Michal Hocko
  Cc: Joonsoo Kim, Andrew Morton, cgroups-u79uwXL29TY76Z2rM5mHXA,
	linux-mm-Bw31MaZKKs3YtjvyW6yDsg,
	linux-kernel-u79uwXL29TY76Z2rM5mHXA, Joonsoo Kim

On Tue, Feb 17, 2015 at 09:33:27AM +0100, Michal Hocko wrote:
> On Tue 17-02-15 14:24:59, Joonsoo Kim wrote:
> > It can be possible to return NULL in parent_mem_cgroup()
> > if use_hierarchy is 0.
> 
> This alone is not sufficient because the low limit is present only in
> the unified hierarchy API and there is no use_hierarchy there. The
> primary issue here is that the memcg has 0 usage so the previous
> check for usage will not stop us. And that is bug IMO.

Yes, empty groups shouldn't be considered low.

> From f5d74671d30e44c50b45b4464c92f536f1dbdff6 Mon Sep 17 00:00:00 2001
> From: Michal Hocko <mhocko-AlSwsSmVLrQ@public.gmane.org>
> Date: Tue, 17 Feb 2015 08:02:12 +0100
> Subject: [PATCH] memcg: fix low limit calculation
> 
> A memcg is considered low limited even when the current usage is equal
> to the low limit. This leads to interesting side effects e.g.
> groups/hierarchies with no memory accounted are considered protected and
> so the reclaim will emit MEMCG_LOW event when encountering them.
> 
> Another and much bigger issue was reported by Joonsoo Kim. He has hit a
> NULL ptr dereference with the legacy cgroup API which even doesn't have
> low limit exposed. The limit is 0 by default but the initial check fails
> for memcg with 0 consumption and parent_mem_cgroup() would return NULL
> if use_hierarchy is 0 and so page_counter_read would try to dereference
> NULL.
> 
> I suppose that the current implementation is just an overlook because
> the documentation in Documentation/cgroups/unified-hierarchy.txt says:
> "
> The memory.low boundary on the other hand is a top-down allocated
> reserve.  A cgroup enjoys reclaim protection when it and all its
> ancestors are below their low boundaries
> "
> 
> Fix the usage and the low limit comparision in mem_cgroup_low accordingly.
> 
> Fixes: 241994ed8649 (mm: memcontrol: default hierarchy interface for memory)
> Reported-by: Joonsoo Kim <js1304-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
> Signed-off-by: Michal Hocko <mhocko-AlSwsSmVLrQ@public.gmane.org>

Acked-by: Johannes Weiner <hannes-druUgvl0LCNAfugRpC6u6w@public.gmane.org>

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: [PATCH] mm/memcontrol: fix NULL pointer dereference when use_hierarchy is 0
  2015-02-17  8:33   ` Michal Hocko
  (?)
@ 2015-02-25  1:20     ` Joonsoo Kim
  -1 siblings, 0 replies; 12+ messages in thread
From: Joonsoo Kim @ 2015-02-25  1:20 UTC (permalink / raw)
  To: Michal Hocko
  Cc: Andrew Morton, Johannes Weiner, cgroups, linux-mm, linux-kernel

On Tue, Feb 17, 2015 at 09:33:27AM +0100, Michal Hocko wrote:
> On Tue 17-02-15 14:24:59, Joonsoo Kim wrote:
> > It can be possible to return NULL in parent_mem_cgroup()
> > if use_hierarchy is 0.
> 
> This alone is not sufficient because the low limit is present only in
> the unified hierarchy API and there is no use_hierarchy there. The
> primary issue here is that the memcg has 0 usage so the previous
> check for usage will not stop us. And that is bug IMO.
> 
> I think that the following patch would be more correct from semantic
> POV:
> ---
> >From f5d74671d30e44c50b45b4464c92f536f1dbdff6 Mon Sep 17 00:00:00 2001
> From: Michal Hocko <mhocko@suse.cz>
> Date: Tue, 17 Feb 2015 08:02:12 +0100
> Subject: [PATCH] memcg: fix low limit calculation
> 
> A memcg is considered low limited even when the current usage is equal
> to the low limit. This leads to interesting side effects e.g.
> groups/hierarchies with no memory accounted are considered protected and
> so the reclaim will emit MEMCG_LOW event when encountering them.
> 
> Another and much bigger issue was reported by Joonsoo Kim. He has hit a
> NULL ptr dereference with the legacy cgroup API which even doesn't have
> low limit exposed. The limit is 0 by default but the initial check fails
> for memcg with 0 consumption and parent_mem_cgroup() would return NULL
> if use_hierarchy is 0 and so page_counter_read would try to dereference
> NULL.
> 
> I suppose that the current implementation is just an overlook because
> the documentation in Documentation/cgroups/unified-hierarchy.txt says:
> "
> The memory.low boundary on the other hand is a top-down allocated
> reserve.  A cgroup enjoys reclaim protection when it and all its
> ancestors are below their low boundaries
> "
> 
> Fix the usage and the low limit comparision in mem_cgroup_low accordingly.
> 
> Fixes: 241994ed8649 (mm: memcontrol: default hierarchy interface for memory)
> Reported-by: Joonsoo Kim <js1304@gmail.com>
> Signed-off-by: Michal Hocko <mhocko@suse.cz>

Good! This fixes my issue.

Thanks.

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: [PATCH] mm/memcontrol: fix NULL pointer dereference when use_hierarchy is 0
@ 2015-02-25  1:20     ` Joonsoo Kim
  0 siblings, 0 replies; 12+ messages in thread
From: Joonsoo Kim @ 2015-02-25  1:20 UTC (permalink / raw)
  To: Michal Hocko
  Cc: Andrew Morton, Johannes Weiner, cgroups, linux-mm, linux-kernel

On Tue, Feb 17, 2015 at 09:33:27AM +0100, Michal Hocko wrote:
> On Tue 17-02-15 14:24:59, Joonsoo Kim wrote:
> > It can be possible to return NULL in parent_mem_cgroup()
> > if use_hierarchy is 0.
> 
> This alone is not sufficient because the low limit is present only in
> the unified hierarchy API and there is no use_hierarchy there. The
> primary issue here is that the memcg has 0 usage so the previous
> check for usage will not stop us. And that is bug IMO.
> 
> I think that the following patch would be more correct from semantic
> POV:
> ---
> >From f5d74671d30e44c50b45b4464c92f536f1dbdff6 Mon Sep 17 00:00:00 2001
> From: Michal Hocko <mhocko@suse.cz>
> Date: Tue, 17 Feb 2015 08:02:12 +0100
> Subject: [PATCH] memcg: fix low limit calculation
> 
> A memcg is considered low limited even when the current usage is equal
> to the low limit. This leads to interesting side effects e.g.
> groups/hierarchies with no memory accounted are considered protected and
> so the reclaim will emit MEMCG_LOW event when encountering them.
> 
> Another and much bigger issue was reported by Joonsoo Kim. He has hit a
> NULL ptr dereference with the legacy cgroup API which even doesn't have
> low limit exposed. The limit is 0 by default but the initial check fails
> for memcg with 0 consumption and parent_mem_cgroup() would return NULL
> if use_hierarchy is 0 and so page_counter_read would try to dereference
> NULL.
> 
> I suppose that the current implementation is just an overlook because
> the documentation in Documentation/cgroups/unified-hierarchy.txt says:
> "
> The memory.low boundary on the other hand is a top-down allocated
> reserve.  A cgroup enjoys reclaim protection when it and all its
> ancestors are below their low boundaries
> "
> 
> Fix the usage and the low limit comparision in mem_cgroup_low accordingly.
> 
> Fixes: 241994ed8649 (mm: memcontrol: default hierarchy interface for memory)
> Reported-by: Joonsoo Kim <js1304@gmail.com>
> Signed-off-by: Michal Hocko <mhocko@suse.cz>

Good! This fixes my issue.

Thanks.

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: [PATCH] mm/memcontrol: fix NULL pointer dereference when use_hierarchy is 0
@ 2015-02-25  1:20     ` Joonsoo Kim
  0 siblings, 0 replies; 12+ messages in thread
From: Joonsoo Kim @ 2015-02-25  1:20 UTC (permalink / raw)
  To: Michal Hocko
  Cc: Andrew Morton, Johannes Weiner, cgroups-u79uwXL29TY76Z2rM5mHXA,
	linux-mm-Bw31MaZKKs3YtjvyW6yDsg,
	linux-kernel-u79uwXL29TY76Z2rM5mHXA

On Tue, Feb 17, 2015 at 09:33:27AM +0100, Michal Hocko wrote:
> On Tue 17-02-15 14:24:59, Joonsoo Kim wrote:
> > It can be possible to return NULL in parent_mem_cgroup()
> > if use_hierarchy is 0.
> 
> This alone is not sufficient because the low limit is present only in
> the unified hierarchy API and there is no use_hierarchy there. The
> primary issue here is that the memcg has 0 usage so the previous
> check for usage will not stop us. And that is bug IMO.
> 
> I think that the following patch would be more correct from semantic
> POV:
> ---
> >From f5d74671d30e44c50b45b4464c92f536f1dbdff6 Mon Sep 17 00:00:00 2001
> From: Michal Hocko <mhocko-AlSwsSmVLrQ@public.gmane.org>
> Date: Tue, 17 Feb 2015 08:02:12 +0100
> Subject: [PATCH] memcg: fix low limit calculation
> 
> A memcg is considered low limited even when the current usage is equal
> to the low limit. This leads to interesting side effects e.g.
> groups/hierarchies with no memory accounted are considered protected and
> so the reclaim will emit MEMCG_LOW event when encountering them.
> 
> Another and much bigger issue was reported by Joonsoo Kim. He has hit a
> NULL ptr dereference with the legacy cgroup API which even doesn't have
> low limit exposed. The limit is 0 by default but the initial check fails
> for memcg with 0 consumption and parent_mem_cgroup() would return NULL
> if use_hierarchy is 0 and so page_counter_read would try to dereference
> NULL.
> 
> I suppose that the current implementation is just an overlook because
> the documentation in Documentation/cgroups/unified-hierarchy.txt says:
> "
> The memory.low boundary on the other hand is a top-down allocated
> reserve.  A cgroup enjoys reclaim protection when it and all its
> ancestors are below their low boundaries
> "
> 
> Fix the usage and the low limit comparision in mem_cgroup_low accordingly.
> 
> Fixes: 241994ed8649 (mm: memcontrol: default hierarchy interface for memory)
> Reported-by: Joonsoo Kim <js1304-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
> Signed-off-by: Michal Hocko <mhocko-AlSwsSmVLrQ@public.gmane.org>

Good! This fixes my issue.

Thanks.

^ permalink raw reply	[flat|nested] 12+ messages in thread

end of thread, other threads:[~2015-02-25  1:20 UTC | newest]

Thread overview: 12+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2015-02-17  5:24 [PATCH] mm/memcontrol: fix NULL pointer dereference when use_hierarchy is 0 Joonsoo Kim
2015-02-17  5:24 ` Joonsoo Kim
2015-02-17  5:24 ` Joonsoo Kim
2015-02-17  8:33 ` Michal Hocko
2015-02-17  8:33   ` Michal Hocko
2015-02-17  8:33   ` Michal Hocko
2015-02-17 12:28   ` Johannes Weiner
2015-02-17 12:28     ` Johannes Weiner
2015-02-17 12:28     ` Johannes Weiner
2015-02-25  1:20   ` Joonsoo Kim
2015-02-25  1:20     ` Joonsoo Kim
2015-02-25  1:20     ` Joonsoo Kim

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.