From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-2.5 required=3.0 tests=MAILING_LIST_MULTI,SPF_PASS, USER_AGENT_MUTT autolearn=ham autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id 6587BC67863 for ; Tue, 23 Oct 2018 08:54:50 +0000 (UTC) Received: from vger.kernel.org (vger.kernel.org [209.132.180.67]) by mail.kernel.org (Postfix) with ESMTP id 3BCBB20671 for ; Tue, 23 Oct 2018 08:54:50 +0000 (UTC) DMARC-Filter: OpenDMARC Filter v1.3.2 mail.kernel.org 3BCBB20671 Authentication-Results: mail.kernel.org; dmarc=fail (p=none dis=none) header.from=kernel.org Authentication-Results: mail.kernel.org; spf=none smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728327AbeJWRRO (ORCPT ); Tue, 23 Oct 2018 13:17:14 -0400 Received: from mx2.suse.de ([195.135.220.15]:33734 "EHLO mx1.suse.de" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1727632AbeJWRRO (ORCPT ); Tue, 23 Oct 2018 13:17:14 -0400 X-Virus-Scanned: by amavisd-new at test-mx.suse.de Received: from relay1.suse.de (unknown [195.135.220.254]) by mx1.suse.de (Postfix) with ESMTP id EE6E6ACCC; Tue, 23 Oct 2018 08:54:46 +0000 (UTC) Date: Tue, 23 Oct 2018 10:54:45 +0200 From: Michal Hocko To: Petr Mladek Cc: Tetsuo Handa , Sergey Senozhatsky , Johannes Weiner , linux-mm@kvack.org, syzkaller-bugs@googlegroups.com, guro@fb.com, kirill.shutemov@linux.intel.com, linux-kernel@vger.kernel.org, rientjes@google.com, yang.s@alibaba-inc.com, Andrew Morton , Sergey Senozhatsky , Steven Rostedt , syzbot Subject: Re: [PATCH v3] mm: memcontrol: Don't flood OOM messages with no eligible task. Message-ID: <20181023085445.GQ18839@dhcp22.suse.cz> References: <201810180246.w9I2koi3011358@www262.sakura.ne.jp> <20181018042739.GA650@jagdpanzerIV> <201810180526.w9I5QvVn032670@www262.sakura.ne.jp> <20181018061018.GB650@jagdpanzerIV> <20181018075611.GY18839@dhcp22.suse.cz> <20181018081352.GA438@jagdpanzerIV> <2c2b2820-e6f8-76c8-c431-18f60845b3ab@i-love.sakura.ne.jp> <20181018235427.GA877@jagdpanzerIV> <5d472476-7852-f97b-9412-63536dffaa0e@i-love.sakura.ne.jp> <20181023083738.o4wo3jxw3xkp3rwx@pathway.suse.cz> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20181023083738.o4wo3jxw3xkp3rwx@pathway.suse.cz> User-Agent: Mutt/1.10.1 (2018-07-13) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org [I strongly suspect this whole email thread went way out of scope of the issue really deserves] I didn't want to participate any further but let me clarify one thing because I can see how the discussion could generate some confusion. On Tue 23-10-18 10:37:38, Petr Mladek wrote: [...] > My understanding is that this situation happens when the system is > misconfigured and unusable without manual intervention. If > the user is able to see what the problem is then we are good. Not really. The flood of _memcg_ oom report about no eligible tasks should indeed happen only when the memcg is misconfigured. The system is and should be still usable at this stage. Ratelimit is aimed to reduce pointless message which do not help to debug the issue itself much. There is a race condition as explained by Tetsuo that could lead to this situation even without a misconfiguration and that is clearly a bug and something to deal with and patches have been posted in that regards [1] The rest of the discussion is about how to handle printk rate-limiting properly and whether ad-hoc solution is more appropriate than a real API we have in place and whether the later needs some enhancements. That is completely orthogonal on the issue at hands and as such it should be really discussed separately. [1] http://lkml.kernel.org/r/20181022071323.9550-1-mhocko@kernel.org -- Michal Hocko SUSE Labs