linux-mm.kvack.org archive mirror
 help / color / mirror / Atom feed
* [PATCH for-5.2-fixes] memcg: Don't loop on css_tryget_online() failure
@ 2019-05-29 21:06 Tejun Heo
  2019-06-05 12:55 ` Michal Hocko
  0 siblings, 1 reply; 3+ messages in thread
From: Tejun Heo @ 2019-05-29 21:06 UTC (permalink / raw)
  To: Johannes Weiner, Michal Hocko, Vladimir Davydov
  Cc: cgroups, linux-mm, kernel-team

A PF_EXITING task may stay associated with an offline css.
get_mem_cgroup_from_mm() may deadlock if mm->owner is in such state.
All similar logics in memcg are falling back to root memcg on
tryget_online failure and get_mem_cgroup_from_mm() can do the same.

A similar failure existed for task_get_css() and could be triggered
through BSD process accounting racing against memcg offlining.  See
18fa84a2db0e ("cgroup: Use css_tryget() instead of css_tryget_online()
in task_get_css()") for details.

Signed-off-by: Tejun Heo <tj@kernel.org>
---
 mm/memcontrol.c |   24 ++++++++++--------------
 1 file changed, 10 insertions(+), 14 deletions(-)

diff --git a/mm/memcontrol.c b/mm/memcontrol.c
index e50a2db5b4ff..be1fa89db198 100644
--- a/mm/memcontrol.c
+++ b/mm/memcontrol.c
@@ -918,23 +918,19 @@ struct mem_cgroup *get_mem_cgroup_from_mm(struct mm_struct *mm)
 
 	if (mem_cgroup_disabled())
 		return NULL;
+	/*
+	 * Page cache insertions can happen without an actual mm context,
+	 * e.g. during disk probing on boot, loopback IO, acct() writes.
+	 */
+	if (unlikely(!mm))
+		return root_mem_cgroup;
 
 	rcu_read_lock();
-	do {
-		/*
-		 * Page cache insertions can happen withou an
-		 * actual mm context, e.g. during disk probing
-		 * on boot, loopback IO, acct() writes etc.
-		 */
-		if (unlikely(!mm))
-			memcg = root_mem_cgroup;
-		else {
-			memcg = mem_cgroup_from_task(rcu_dereference(mm->owner));
-			if (unlikely(!memcg))
-				memcg = root_mem_cgroup;
-		}
-	} while (!css_tryget_online(&memcg->css));
+	memcg = mem_cgroup_from_task(rcu_dereference(mm->owner));
+	if (!css_tryget_online(&memcg->css))
+		memcg = root_mem_cgroup;
 	rcu_read_unlock();
+
 	return memcg;
 }
 EXPORT_SYMBOL(get_mem_cgroup_from_mm);


^ permalink raw reply related	[flat|nested] 3+ messages in thread

* Re: [PATCH for-5.2-fixes] memcg: Don't loop on css_tryget_online() failure
  2019-05-29 21:06 [PATCH for-5.2-fixes] memcg: Don't loop on css_tryget_online() failure Tejun Heo
@ 2019-06-05 12:55 ` Michal Hocko
  2019-06-05 13:26   ` Tejun Heo
  0 siblings, 1 reply; 3+ messages in thread
From: Michal Hocko @ 2019-06-05 12:55 UTC (permalink / raw)
  To: Tejun Heo
  Cc: Johannes Weiner, Vladimir Davydov, cgroups, linux-mm, kernel-team

On Wed 29-05-19 14:06:17, Tejun Heo wrote:
> A PF_EXITING task may stay associated with an offline css.
> get_mem_cgroup_from_mm() may deadlock if mm->owner is in such state.
> All similar logics in memcg are falling back to root memcg on
> tryget_online failure and get_mem_cgroup_from_mm() can do the same.
>
> A similar failure existed for task_get_css() and could be triggered
> through BSD process accounting racing against memcg offlining.  See
> 18fa84a2db0e ("cgroup: Use css_tryget() instead of css_tryget_online()
> in task_get_css()") for details.
> 
> Signed-off-by: Tejun Heo <tj@kernel.org>

Do we need to mark this patch for stable or this is too unlikely to
happen?

Acked-by: Michal Hocko <mhocko@suse.com>

Thanks!
> ---
>  mm/memcontrol.c |   24 ++++++++++--------------
>  1 file changed, 10 insertions(+), 14 deletions(-)
> 
> diff --git a/mm/memcontrol.c b/mm/memcontrol.c
> index e50a2db5b4ff..be1fa89db198 100644
> --- a/mm/memcontrol.c
> +++ b/mm/memcontrol.c
> @@ -918,23 +918,19 @@ struct mem_cgroup *get_mem_cgroup_from_mm(struct mm_struct *mm)
>  
>  	if (mem_cgroup_disabled())
>  		return NULL;
> +	/*
> +	 * Page cache insertions can happen without an actual mm context,
> +	 * e.g. during disk probing on boot, loopback IO, acct() writes.
> +	 */
> +	if (unlikely(!mm))
> +		return root_mem_cgroup;
>  
>  	rcu_read_lock();
> -	do {
> -		/*
> -		 * Page cache insertions can happen withou an
> -		 * actual mm context, e.g. during disk probing
> -		 * on boot, loopback IO, acct() writes etc.
> -		 */
> -		if (unlikely(!mm))
> -			memcg = root_mem_cgroup;
> -		else {
> -			memcg = mem_cgroup_from_task(rcu_dereference(mm->owner));
> -			if (unlikely(!memcg))
> -				memcg = root_mem_cgroup;
> -		}
> -	} while (!css_tryget_online(&memcg->css));
> +	memcg = mem_cgroup_from_task(rcu_dereference(mm->owner));
> +	if (!css_tryget_online(&memcg->css))
> +		memcg = root_mem_cgroup;
>  	rcu_read_unlock();
> +
>  	return memcg;
>  }
>  EXPORT_SYMBOL(get_mem_cgroup_from_mm);

-- 
Michal Hocko
SUSE Labs


^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: [PATCH for-5.2-fixes] memcg: Don't loop on css_tryget_online() failure
  2019-06-05 12:55 ` Michal Hocko
@ 2019-06-05 13:26   ` Tejun Heo
  0 siblings, 0 replies; 3+ messages in thread
From: Tejun Heo @ 2019-06-05 13:26 UTC (permalink / raw)
  To: Michal Hocko
  Cc: Johannes Weiner, Vladimir Davydov, cgroups, linux-mm, kernel-team

On Wed, Jun 05, 2019 at 02:55:20PM +0200, Michal Hocko wrote:
> On Wed 29-05-19 14:06:17, Tejun Heo wrote:
> > A PF_EXITING task may stay associated with an offline css.
> > get_mem_cgroup_from_mm() may deadlock if mm->owner is in such state.
> > All similar logics in memcg are falling back to root memcg on
> > tryget_online failure and get_mem_cgroup_from_mm() can do the same.
> >
> > A similar failure existed for task_get_css() and could be triggered
> > through BSD process accounting racing against memcg offlining.  See
> > 18fa84a2db0e ("cgroup: Use css_tryget() instead of css_tryget_online()
> > in task_get_css()") for details.
> > 
> > Signed-off-by: Tejun Heo <tj@kernel.org>
> 
> Do we need to mark this patch for stable or this is too unlikely to
> happen?

This one's a lot less likely than the one in task_get_css() which
already is pretty low frequency.  I don't think it warrants -stable
tagging.

Thanks.

-- 
tejun


^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2019-06-05 13:26 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2019-05-29 21:06 [PATCH for-5.2-fixes] memcg: Don't loop on css_tryget_online() failure Tejun Heo
2019-06-05 12:55 ` Michal Hocko
2019-06-05 13:26   ` Tejun Heo

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).