From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S932400AbdJZVDs (ORCPT ); Thu, 26 Oct 2017 17:03:48 -0400 Received: from mail-io0-f169.google.com ([209.85.223.169]:51312 "EHLO mail-io0-f169.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751553AbdJZVDo (ORCPT ); Thu, 26 Oct 2017 17:03:44 -0400 X-Google-Smtp-Source: ABhQp+RSYTYOPr8ET3vP4zEsvr/bYu5C3kiFnBCj/iWX9w+ORVqTqcPKvdZOUHGCdRrbQ4f+Sec2cA== Date: Thu, 26 Oct 2017 14:03:41 -0700 (PDT) From: David Rientjes X-X-Sender: rientjes@chino.kir.corp.google.com To: Johannes Weiner cc: Andrew Morton , linux-mm@kvack.org, Michal Hocko , Vladimir Davydov , Tetsuo Handa , Tejun Heo , kernel-team@fb.com, cgroups@vger.kernel.org, linux-doc@vger.kernel.org, linux-kernel@vger.kernel.org, Roman Gushchin Subject: Re: [RESEND v12 0/6] cgroup-aware OOM killer In-Reply-To: <20171026142445.GA21147@cmpxchg.org> Message-ID: References: <20171019185218.12663-1-guro@fb.com> <20171019194534.GA5502@cmpxchg.org> <20171026142445.GA21147@cmpxchg.org> User-Agent: Alpine 2.10 (DEB 1266 2009-07-14) MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, 26 Oct 2017, Johannes Weiner wrote: > > The nack is for three reasons: > > > > (1) unfair comparison of root mem cgroup usage to bias against that mem > > cgroup from oom kill in system oom conditions, > > > > (2) the ability of users to completely evade the oom killer by attaching > > all processes to child cgroups either purposefully or unpurposefully, > > and > > > > (3) the inability of userspace to effectively control oom victim > > selection. > > My apologies if my summary was too reductionist. > > That being said, the arguments you repeat here have come up in > previous threads and been responded to. This doesn't change my > conclusion that your NAK is bogus. > They actually haven't been responded to, Roman was working through v11 and made a change on how the root mem cgroup usage was calculated that was better than previous iterations but still not an apples to apples comparison with other cgroups. The problem is that it the calculation for leaf cgroups includes additional memory classes, so it biases against processes that are moved to non-root mem cgroups. Simply creating mem cgroups and attaching processes should not independently cause them to become more preferred: it should be a fair comparison between the root mem cgroup and the set of leaf mem cgroups as implemented. That is very trivial to do with hierarchical oom cgroup scoring. Since the ability of userspace to control oom victim selection is not addressed whatsoever by this patchset, and the suggested method cannot be implemented on top of this patchset as you have argued because it requires a change to the heuristic itself, the patchset needs to become complete before being mergeable.