From: Minchan Kim <minchan.kim@gmail.com>
To: David Rientjes <rientjes@google.com>
Cc: Rik van Riel <riel@redhat.com>,
	Andrew Morton <akpm@linux-foundation.org>,
	KAMEZAWA Hiroyuki <kamezawa.hiroyu@jp.fujitsu.com>,
	Nick Piggin <npiggin@suse.de>,
	Andrea Arcangeli <aarcange@redhat.com>,
	Balbir Singh <balbir@linux.vnet.ibm.com>,
	Lubos Lunak <l.lunak@suse.cz>,
	linux-kernel@vger.kernel.org, linux-mm@kvack.org
Subject: Re: [patch 4/7 -mm] oom: badness heuristic rewrite
Date: Tue, 16 Feb 2010 22:14:46 +0900
Message-ID: <1266326086.1709.50.camel@barrios-desktop>
In-Reply-To: <alpine.DEB.2.00.1002151347470.26927@chino.kir.corp.google.com>

On Mon, 2010-02-15 at 13:54 -0800, David Rientjes wrote:
> We're not enforcing a global, system-wide forkbomb policy in the oom 
> killer, but we do need to identify tasks that fork a very large number of 
> tasks to break ties with other tasks: in other words, it would not be 
> helpful to kill an application that has been running for weeks because 
> another application with the same or less memory usage has forked 1000 
> children and has caused an oom condition.  That unfairly penalizes the 
> former application that is actually doing work.
> 
> Again, I'd encourage you to look at this as only a slight penalization 
> rather than a policy that strictly needs to be enforced.  If it were 
> strictly enforced, it would be a prerequisite for selection if such a task 
> were to exist; in my implementation, it is part of the heuristic.

Okay. I can think of it as a slight penalization in this patch.
But in the current OOM logic, we try to kill a child instead of the
forkbomb itself. That was my concern.
Of course, that behavior is not introduced by your patch [2/7], which is good.
It has been there for a long time. I hope we can solve it on
this occasion. Please look at my example below.

> 
> > > That doesn't work with Rik's example of a webserver that forks a large
> > > number of threads to handle client connections.  It is _always_ better to
> > > kill a child instead of making the entire webserver unresponsive.
> > 
> > In such case, admin have to handle it by oom_forkbom_thres.
> > Isn't it your goal?
> > 
> 
> oom_forkbomb_thres has a default value, which is 1000, so it should be 
> enabled by default.
> 
> > My suggestion is how handle buggy forkbomb processes which make
> > system almost hang by user's mistake. :)
> > 
> 
> I don't think you've given a clear description (or, even better, a patch) 
> of your suggestion.

Let me write down my suggestion again.
My concern is as follows.


1. Forkbomb task A creates 2000 children in a second.
2. The 2000 children have almost the same memory usage. I know other
factors affect oom_score, but here I assume all of the children have
almost the same badness score.
3. Your heuristic penalizes task A, so it is detected as a forkbomb.
4. So the OOM killer selects task A as the bad task.
5. oom_kill_process kills the child with the highest badness, _NOT_ task A
itself. Unfortunately, that child does not use much more memory than its
siblings. It means sooner or later we would need the OOM killer again.
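
To make step 5 concrete, here is a minimal userspace sketch in C. It is
not kernel code: count_ooms() and its parameters (capacity, child_mem,
forks) are hypothetical, and the model assumes that every child uses the
same amount of memory and that each OOM invocation kills exactly one
child while the parent keeps forking.

```c
/*
 * Toy model, not kernel code: count how often the OOM killer has to run
 * while a parent forks `forks` equal-sized children into `capacity`
 * units of memory. Each OOM invocation sacrifices exactly one child.
 */
static unsigned long count_ooms(unsigned long capacity,
				unsigned long child_mem,
				unsigned long forks)
{
	unsigned long used = 0, ooms = 0, i;

	/* Degenerate inputs: a child that uses nothing, or one that never fits. */
	if (child_mem == 0 || child_mem > capacity)
		return 0;

	for (i = 0; i < forks; i++) {
		if (used + child_mem > capacity) {
			ooms++;			/* OOM killer runs again */
			used -= child_mem;	/* one child is killed */
		}
		used += child_mem;		/* the new child is created */
	}
	return ooms;
}
```

With room for 1000 equal-sized children and 2000 forks,
count_ooms(1000, 1, 2000) reports 1000 invocations: once memory is full,
every further fork needs another OOM kill.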


My point was 5.

1. oom_kill_process has to take a long time to scan the tasklist to
select just one high-badness task. Okay, that's right, since an OOM
system hang is very bad and it would be better to just kill the first
task (i.e., a random one) in the tasklist.

2. But in the above scenario, the siblings have almost the same memory
usage. So we would need the OOM killer again sooner or later, and the
OOM logic could repeat the above scenario again and again.

Yes. Our system is already unresponsive since the time slices are spread
across many child tasks. In that case, is it really better to kill a dumb
child instead of the BUGGY forkbomb task A? How long do we have to wait
for the system to become responsive again?

I said _BUGGY_ forkbomb task. That's because Rik's example isn't a buggy
task. The administrator already knows apache can create many tasks in a
second. So he can handle it with your oom_forkbomb_thres knob. That is
the goal of your knob.

So my suggestion is as follows.

I assume normal forkbomb tasks are handled well by an admin who uses your
oom_forkbomb_thres. The remaining problem is just the BUGGY forkbomb
process. So if your logic selects the same victim task as a forkbomb by
your heuristic, and it is the 5th consecutive time within 10 seconds,
let's kill the forkbomb instead of a child.

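tsk = select_victim_task(&cause);
if (tsk == last_victim_tsk && cause == BUGGY_FORKBOMB) {
	/* the same forkbomb was selected again */
	if (++count == 5 && time_since_first_detect_forkbomb <= 10*HZ) {
		kill(tsk);	/* kill the forkbomb itself */
		return;
	}
} else {
	/* different victim: remember it and restart the count */
	last_victim_tsk = tsk; count = 1; time_since... = 0;
}
kill(tsk's child);	/* otherwise sacrifice a child as before */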

It's just an example of my concern. It may well not be a good solution.
What I mean is just whether we have to care about this.
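
For anyone who wants to play with the idea outside the kernel, the check
in the example above could be sketched in plain C roughly as below. This
is only an illustration under my own assumptions: struct repeat_state,
should_kill_parent(), REPEAT_LIMIT and WINDOW_TICKS are hypothetical
names, none of them exist in the kernel or in the patch series, and the
time is a plain tick counter that stands in for jiffies.

```c
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical limits from the example: 5 selections within 10 seconds. */
#define REPEAT_LIMIT	5
#define WINDOW_TICKS	10	/* stands in for 10*HZ; one tick = one second */

struct repeat_state {
	const void *last_victim;	/* parent selected on the previous OOM */
	unsigned int count;		/* consecutive selections of last_victim */
	unsigned long first_seen;	/* tick of the first selection in this run */
};

/*
 * Returns true when the parent itself should be killed, false when one
 * of its children should be sacrificed as usual.
 */
static bool should_kill_parent(struct repeat_state *st, const void *victim,
			       bool is_forkbomb, unsigned long now)
{
	if (!is_forkbomb) {
		/* not a forkbomb selection: forget any run in progress */
		st->last_victim = NULL;
		st->count = 0;
		return false;
	}
	if (victim != st->last_victim ||
	    now - st->first_seen > WINDOW_TICKS) {
		/* new victim or stale run: start counting again */
		st->last_victim = victim;
		st->count = 1;
		st->first_seen = now;
		return false;
	}
	if (++st->count < REPEAT_LIMIT)
		return false;

	/* limit reached inside the window: reset and kill the parent */
	st->last_victim = NULL;
	st->count = 0;
	return true;
}
```

The first four selections of the same forkbomb still sacrifice a child;
only the fifth one inside the window returns true. A different victim,
or a gap longer than the window, starts the count over.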



-- 
Kind regards,
Minchan Kim



