From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mail191.messagelabs.com (mail191.messagelabs.com [216.82.242.19]) by kanga.kvack.org (Postfix) with ESMTP id 91F926B01E1 for ; Tue, 8 Jun 2010 21:07:10 -0400 (EDT) Received: from wpaz21.hot.corp.google.com (wpaz21.hot.corp.google.com [172.24.198.85]) by smtp-out.google.com with ESMTP id o59176QI021326 for ; Tue, 8 Jun 2010 18:07:07 -0700 Received: from pvg7 (pvg7.prod.google.com [10.241.210.135]) by wpaz21.hot.corp.google.com with ESMTP id o59175U7003550 for ; Tue, 8 Jun 2010 18:07:05 -0700 Received: by pvg7 with SMTP id 7so2244140pvg.39 for ; Tue, 08 Jun 2010 18:07:04 -0700 (PDT) Date: Tue, 8 Jun 2010 18:07:03 -0700 (PDT) From: David Rientjes Subject: Re: [patch -mm 01/18] oom: filter tasks not sharing the same cpuset In-Reply-To: <20100608170630.80753ed1.akpm@linux-foundation.org> Message-ID: References: <20100607084024.873B.A69D9226@jp.fujitsu.com> <20100608162513.c633439e.akpm@linux-foundation.org> <20100608170630.80753ed1.akpm@linux-foundation.org> MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII Sender: owner-linux-mm@kvack.org To: Andrew Morton Cc: KOSAKI Motohiro , Rik van Riel , Nick Piggin , Oleg Nesterov , KAMEZAWA Hiroyuki , Balbir Singh , linux-mm@kvack.org List-ID: On Tue, 8 Jun 2010, Andrew Morton wrote: > > Tasks that do not share the same set of allowed nodes with the task that > > triggered the oom should not be considered as candidates for oom kill. > > > > Tasks in other cpusets with a disjoint set of mems would be unfairly > > penalized otherwise because of oom conditions elsewhere; an extreme > > example could unfairly kill all other applications on the system if a > > single task in a user's cpuset sets itself to OOM_DISABLE and then uses > > more memory than allowed. > > OK, so Nick's change didn't anticipate things being set to OOM_DISABLE? > I wrote out a more elaborate rebuttal to this in your reply to my latest patchset, but not strictly eliminating these tasks from consideration unfairly penalizes tasks in other cpusets simply because their big, there's no way to understand the scale of other cpusets compared to current's with a single divide in the heuristic (in this case, divide by 8), and there's no guarantee that killing such a task would free any memory which would have two results: (i) we need to reinvoke the oom killer to kill yet another task, and (ii) we've now unnecessarily killed a task simply because it was large and probably lost a substantial amount of work. > OOM_DISABLE seems pretty dangerous really - allows malicious > unprivileged users to go homicidal? > OOM_DISABLE doesn't get set without CAP_SYS_RESOURCE, you need that capability to decrease an oom_adj value. So my changelog could probably benefit from s/user/job/. -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@kvack.org. For more info on Linux MM, see: http://www.linux-mm.org/ . Don't email: email@kvack.org