From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1761005AbdEVSFz (ORCPT ); Mon, 22 May 2017 14:05:55 -0400 Received: from mail-qk0-f175.google.com ([209.85.220.175]:35544 "EHLO mail-qk0-f175.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1758262AbdEVSFw (ORCPT ); Mon, 22 May 2017 14:05:52 -0400 MIME-Version: 1.0 In-Reply-To: References: <149520375057.74196.2843113275800730971.stgit@buzz> From: Roman Guschin Date: Mon, 22 May 2017 19:05:51 +0100 Message-ID: Subject: Re: [PATCH] mm/oom_kill: count global and memory cgroup oom kills To: Konstantin Khlebnikov Cc: linux-mm@kvack.org, Andrew Morton , Tejun Heo , cgroups@vger.kernel.org, linux-kernel@vger.kernel.org, Vlastimil Babka , Michal Hocko , Johannes Weiner Content-Type: text/plain; charset="UTF-8" Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org 2017-05-22 10:11 GMT+01:00 Konstantin Khlebnikov : > > > On 19.05.2017 19:34, Roman Guschin wrote: >> >> 2017-05-19 15:22 GMT+01:00 Konstantin Khlebnikov >> : >> From a user's point of view the difference between "oom" and "max" >> becomes really vague here, >> assuming that "max" is described almost in the same words: >> >> "The number of times the cgroup's memory usage was >> about to go over the max boundary. If direct reclaim >> fails to bring it down, the OOM killer is invoked." >> >> I wonder, if it's better to fix the existing "oom" value to show what >> it has to show, according to docs, >> rather than to introduce a new one? >> > > Nope, they are different. I think we should rephase documentation somehow > > low - count of reclaims below low level > high - count of post-allocation reclaims above high level > max - count of direct reclaims > oom - count of failed direct reclaims > oom_kill - count of oom killer invocations and killed processes Definitely worth it. Also, I would prefer to reserve "oom" for number of oom victims, and introduce something like "reclaim_failed". It will be consistent with existing vmstat. Thanks!