Date: Thu, 22 Aug 2019 09:09:19 +0200
From: Michal Hocko
To: Edward Chron
Cc: David Rientjes, Andrew Morton, Roman Gushchin, Johannes Weiner,
 Tetsuo Handa, Shakeel Butt, linux-mm@kvack.org,
 linux-kernel@vger.kernel.org, Ivan Delalande
Subject: Re: [PATCH] mm/oom: Add oom_score_adj value to oom Killed process message
Message-ID:
 <20190822070919.GB12785@dhcp22.suse.cz>
References: <20190821001445.32114-1-echron@arista.com>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
In-Reply-To:
User-Agent: Mutt/1.10.1 (2018-07-13)
Sender: linux-kernel-owner@vger.kernel.org
Precedence: bulk
List-ID:
X-Mailing-List: linux-kernel@vger.kernel.org

On Wed 21-08-19 15:25:13, Edward Chron wrote:
> On Tue, Aug 20, 2019 at 8:25 PM David Rientjes wrote:
> >
> > On Tue, 20 Aug 2019, Edward Chron wrote:
> >
> > > For an OOM event: print oom_score_adj value for the OOM Killed process to
> > > document what the oom score adjust value was at the time the process was
> > > OOM Killed. The adjustment value can be set by user code and it affects
> > > the resulting oom_score so it is used to influence kill process selection.
> > >
> > > When eligible tasks are not printed (sysctl oom_dump_tasks = 0) printing
> > > this value is the only documentation of the value for the process being
> > > killed. Having this value on the Killed process message documents if a
> > > misconfiguration occurred or it can confirm that the oom_score_adj
> > > value applies as expected.
> > >
> > > An example which illustrates both misconfiguration and validation that
> > > the oom_score_adj was applied as expected is:
> > >
> > > Aug 14 23:00:02 testserver kernel: Out of memory: Killed process 2692
> > > (systemd-udevd) total-vm:1056800kB, anon-rss:1052760kB, file-rss:4kB,
> > > shmem-rss:0kB oom_score_adj:1000
> > >
> > > The systemd-udevd is a critical system application that should have an
> > > oom_score_adj of -1000. Here it was misconfigured to have an adjustment
> > > of 1000 making it a highly favored OOM kill target process. The output
> > > documents both the misconfiguration and the fact that the process
> > > was correctly targeted by OOM due to the misconfiguration. Having
> > > the oom_score_adj on the Killed message ensures that it is documented.
> > >
> > > Signed-off-by: Edward Chron
> > > Acked-by: Michal Hocko
> >
> > Acked-by: David Rientjes
> >
> > vm.oom_dump_tasks is pretty useful, however, so it's curious why you
> > haven't left it enabled :/
> >
> > > diff --git a/mm/oom_kill.c b/mm/oom_kill.c
> > > index eda2e2a0bdc6..c781f73b6cd6 100644
> > > --- a/mm/oom_kill.c
> > > +++ b/mm/oom_kill.c
> > > @@ -884,12 +884,13 @@ static void __oom_kill_process(struct task_struct *victim, const char *message)
> > >  	 */
> > >  	do_send_sig_info(SIGKILL, SEND_SIG_PRIV, victim, PIDTYPE_TGID);
> > >  	mark_oom_victim(victim);
> > > -	pr_err("%s: Killed process %d (%s) total-vm:%lukB, anon-rss:%lukB, file-rss:%lukB, shmem-rss:%lukB\n",
> > > +	pr_err("%s: Killed process %d (%s) total-vm:%lukB, anon-rss:%lukB, file-rss:%lukB, shmem-rss:%lukB oom_score_adj:%ld\n",
> > >  		message, task_pid_nr(victim), victim->comm,
> > >  		K(victim->mm->total_vm),
> > >  		K(get_mm_counter(victim->mm, MM_ANONPAGES)),
> > >  		K(get_mm_counter(victim->mm, MM_FILEPAGES)),
> > > -		K(get_mm_counter(victim->mm, MM_SHMEMPAGES)));
> > > +		K(get_mm_counter(victim->mm, MM_SHMEMPAGES)),
> > > +		(long)victim->signal->oom_score_adj);
> > >  	task_unlock(victim);
> > >
> > >  	/*
> >
> > Nit: why not just use %hd and avoid the cast to long?
>
> Sorry I may have accidentally top posted my response to this. Here is
> where my response should go:
> -----------------------------------------------------------------------
>
> Good point, I can post this with your correction.
>
> I will add your Acked-by: David Rientjes
>
> I am adding your Acked-by to the revised patch as this is what Michal
> asked me to do (so I assume that is what I should do).
>
> Should I post as a separate fix again or simply post here?

Andrew usually folds these small fixups automagically. If that doesn't
happen here for some reason then just repost with acks and the fixup.

Thanks!
-- 
Michal Hocko
SUSE Labs
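
For readers following the %hd nit quoted above, here is a minimal userspace sketch of the two formatting variants. It assumes oom_score_adj is stored as a short, which is what the %hd suggestion implies. The struct fake_signal type and the helper names are hypothetical stand-ins for illustration, not kernel code, and snprintf stands in for pr_err.

```c
#include <assert.h>
#include <stdio.h>
#include <string.h>

/* Hypothetical stand-in for the kernel structure; only the field discussed
 * in the thread is modeled, and it is assumed to be a short. */
struct fake_signal {
	short oom_score_adj;
};

/* Variant from the posted patch: widen to long and print with %ld. */
int format_with_cast(char *buf, size_t len, const struct fake_signal *sig)
{
	return snprintf(buf, len, "oom_score_adj:%ld", (long)sig->oom_score_adj);
}

/* Variant suggested in review: print the short directly with %hd. */
int format_with_hd(char *buf, size_t len, const struct fake_signal *sig)
{
	return snprintf(buf, len, "oom_score_adj:%hd", sig->oom_score_adj);
}

/* Returns 1 when both variants render identical text for the given value. */
int formats_agree(short adj)
{
	struct fake_signal sig = { .oom_score_adj = adj };
	char a[32], b[32];

	format_with_cast(a, sizeof(a), &sig);
	format_with_hd(b, sizeof(b), &sig);
	return strcmp(a, b) == 0;
}
```

Calling formats_agree() with values across the documented -1000..1000 range should show that the two variants produce the same text, so under the stated assumption the choice between them is about avoiding the cast rather than changing the log output.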