From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mail-pf0-f199.google.com (mail-pf0-f199.google.com [209.85.192.199]) by kanga.kvack.org (Postfix) with ESMTP id 6EF746B0038 for ; Mon, 25 Sep 2017 16:32:42 -0400 (EDT) Received: by mail-pf0-f199.google.com with SMTP id y29so14944474pff.6 for ; Mon, 25 Sep 2017 13:32:42 -0700 (PDT) Received: from mx1.suse.de (mx2.suse.de. [195.135.220.15]) by mx.google.com with ESMTPS id 127si3311924pfe.378.2017.09.25.13.32.41 for (version=TLS1 cipher=AES128-SHA bits=128/128); Mon, 25 Sep 2017 13:32:41 -0700 (PDT) Date: Mon, 25 Sep 2017 22:32:35 +0200 From: Michal Hocko Subject: Re: [PATCH 0/2 v4] oom: capture unreclaimable slab info in oom message when kernel panic Message-ID: <20170925203235.vhhiqxp72v67n76l@dhcp22.suse.cz> References: <1505947132-4363-1-git-send-email-yang.s@alibaba-inc.com> <20170925142352.havlx6ikheanqyhj@dhcp22.suse.cz> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: Sender: owner-linux-mm@kvack.org List-ID: To: Yang Shi Cc: cl@linux.com, penberg@kernel.org, rientjes@google.com, iamjoonsoo.kim@lge.com, akpm@linux-foundation.org, linux-mm@kvack.org, linux-kernel@vger.kernel.org On Mon 25-09-17 23:55:19, Yang Shi wrote: > > > On 9/25/17 7:23 AM, Michal Hocko wrote: > > On Thu 21-09-17 06:38:50, Yang Shi wrote: > > > Recently we ran into a oom issue, kernel panic due to no killable process. > > > The dmesg shows huge unreclaimable slabs used almost 100% memory, but kdump doesn't capture vmcore due to some reason. > > > > > > So, it may sound better to capture unreclaimable slab info in oom message when kernel panic to aid trouble shooting and cover the corner case. > > > Since kernel already panic, so capturing more information sounds worthy and doesn't bother normal oom killer. > > > > > > With the patchset, tools/vm/slabinfo has a new option, "-U", to show unreclaimable slab only. > > > > > > And, oom will print all non zero (num_objs * size != 0) unreclaimable slabs in oom killer message. > > > > Well, I do undestand that this _might_ be useful but it also might > > generates a _lot_ of output. The oom report can be quite verbose already > > so is this something we want to have enabled by default? > > The uneclaimable slub message will be just printed out when kernel panic (no > killable process or panic_on_oom is set). So, it will not bother normal oom. > Since kernel is already panic, so it might be preferred to have more > information reported. Well, this certainly depends. If you have a limited console output (e.g. no serial console) then the additional information can easily scroll the potentially much more useful information from the early oom report. We already do have a control to enable/disable tasks dumping which can be very long as well. > We definitely can add a proc knob to control it if we want to disable the > message even if when kernel panic. Well, I do not have a strong opinion on this. I can see cases where this kind of information would be useful but most OOM reports I have seen were simply user space pinned memory. Slab memory leaks are seen very seldom. Do you think a pr_dbg and slab stats for all ooms would be still useful? -- Michal Hocko SUSE Labs -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@kvack.org. For more info on Linux MM, see: http://www.linux-mm.org/ . Don't email: email@kvack.org