From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-9.8 required=3.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,HEADER_FROM_DIFFERENT_DOMAINS,INCLUDES_PATCH, MAILING_LIST_MULTI,SIGNED_OFF_BY,SPF_HELO_NONE,SPF_PASS,URIBL_BLOCKED autolearn=ham autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id AC1EAC55ABD for ; Thu, 12 Nov 2020 11:22:48 +0000 (UTC) Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by mail.kernel.org (Postfix) with ESMTP id EB92F2220B for ; Thu, 12 Nov 2020 11:22:47 +0000 (UTC) Authentication-Results: mail.kernel.org; dkim=pass (1024-bit key) header.d=suse.com header.i=@suse.com header.b="LWFNYjaA" DMARC-Filter: OpenDMARC Filter v1.3.2 mail.kernel.org EB92F2220B Authentication-Results: mail.kernel.org; dmarc=fail (p=none dis=none) header.from=suse.com Authentication-Results: mail.kernel.org; spf=pass smtp.mailfrom=owner-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix) id 07A286B0071; Thu, 12 Nov 2020 06:22:47 -0500 (EST) Received: by kanga.kvack.org (Postfix, from userid 40) id 02BD26B0072; Thu, 12 Nov 2020 06:22:46 -0500 (EST) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id E5C066B0073; Thu, 12 Nov 2020 06:22:46 -0500 (EST) X-Delivered-To: linux-mm@kvack.org Received: from forelay.hostedemail.com (smtprelay0028.hostedemail.com [216.40.44.28]) by kanga.kvack.org (Postfix) with ESMTP id B8B226B0071 for ; Thu, 12 Nov 2020 06:22:46 -0500 (EST) Received: from smtpin19.hostedemail.com (10.5.19.251.rfc1918.com [10.5.19.251]) by forelay02.hostedemail.com (Postfix) with ESMTP id 602393625 for ; Thu, 12 Nov 2020 11:22:46 +0000 (UTC) X-FDA: 77475528732.19.touch29_4c110bd27306 Received: from filter.hostedemail.com (10.5.16.251.rfc1918.com [10.5.16.251]) by smtpin19.hostedemail.com (Postfix) with ESMTP id 41BAF1AD1B1 for ; Thu, 12 Nov 2020 11:22:46 +0000 (UTC) X-HE-Tag: touch29_4c110bd27306 X-Filterd-Recvd-Size: 4491 Received: from mx2.suse.de (mx2.suse.de [195.135.220.15]) by imf46.hostedemail.com (Postfix) with ESMTP for ; Thu, 12 Nov 2020 11:22:45 +0000 (UTC) X-Virus-Scanned: by amavisd-new at test-mx.suse.de DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=suse.com; s=susede1; t=1605180164; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: in-reply-to:in-reply-to:references:references; bh=X95cPMSStTZzNZ4CNsxGYNYPK8NnGZvfuwbtqitjKVA=; b=LWFNYjaAVRlaazrlO8jJtyv5stNspdTNsojXR33Q+m3Fi0Eymuku5jyFut52blH6j5Wlc7 Recui6/EBM5gsMJg2SfkCv5aTVGs9Pj34HQoSk85GyDj76EyDRczkC0oKQSNd1ZtQqZM3B LrCYofJl/oI1vyYM23bK8tLGIOubqPc= Received: from relay2.suse.de (unknown [195.135.221.27]) by mx2.suse.de (Postfix) with ESMTP id 5E245AF16; Thu, 12 Nov 2020 11:22:44 +0000 (UTC) Date: Thu, 12 Nov 2020 12:22:42 +0100 From: Michal Hocko To: Rik van Riel Cc: hughd@google.com, xuyu@linux.alibaba.com, akpm@linux-foundation.org, mgorman@suse.de, aarcange@redhat.com, willy@infradead.org, linux-kernel@vger.kernel.org, kernel-team@fb.com, linux-mm@kvack.org, vbabka@suse.cz, Andrey Grodzovsky , Chris Wilson Subject: Re: [PATCH 2/2] mm,thp,shm: limit gfp mask to no more than specified Message-ID: <20201112112242.GA12240@dhcp22.suse.cz> References: <20201105191508.1961686-1-riel@surriel.com> <20201105191508.1961686-3-riel@surriel.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20201105191508.1961686-3-riel@surriel.com> X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: [Cc Chris for i915 and Andray] On Thu 05-11-20 14:15:08, Rik van Riel wrote: > Matthew Wilcox pointed out that the i915 driver opportunistically > allocates tmpfs memory, but will happily reclaim some of its > pool if no memory is available. It would be good to explicitly mention the requested gfp flags for those allocations. i915 uses __GFP_NORETRY | __GFP_NOWARN, or GFP_KERNEL. Is __shmem_rw really meant to not allocate from highmeme/movable zones? Can it be ever backed by THPs? ttm might want __GFP_RETRY_MAYFAIL while shmem_read_mapping_page use the mapping gfp mask which can be NOFS or something else. This is quite messy already and I suspect that they are more targeting regular order-0 requests. E.g. have a look at cb5f1a52caf23. I am worried that this games with gfp flags will lead to unmaintainable code later on. There is a clear disconnect betwen the core THP allocation strategy and what drivers are asking for and those requirements might be really conflicting. Not to mention that flags might be different between regular and THP pages. > Make sure the gfp mask used to opportunistically allocate a THP > is always at least as restrictive as the original gfp mask. > > Signed-off-by: Rik van Riel > Suggested-by: Matthew Wilcox > --- > mm/shmem.c | 21 +++++++++++++++++++++ > 1 file changed, 21 insertions(+) > > diff --git a/mm/shmem.c b/mm/shmem.c > index 6c3cb192a88d..ee3cea10c2a4 100644 > --- a/mm/shmem.c > +++ b/mm/shmem.c > @@ -1531,6 +1531,26 @@ static struct page *shmem_swapin(swp_entry_t swap, gfp_t gfp, > return page; > } > > +/* > + * Make sure huge_gfp is always more limited than limit_gfp. > + * Some of the flags set permissions, while others set limitations. > + */ > +static gfp_t limit_gfp_mask(gfp_t huge_gfp, gfp_t limit_gfp) > +{ > + gfp_t allowflags = __GFP_IO | __GFP_FS | __GFP_RECLAIM; > + gfp_t denyflags = __GFP_NOWARN | __GFP_NORETRY; > + gfp_t result = huge_gfp & ~allowflags; > + > + /* > + * Minimize the result gfp by taking the union with the deny flags, > + * and the intersection of the allow flags. > + */ > + result |= (limit_gfp & denyflags); > + result |= (huge_gfp & limit_gfp) & allowflags; > + > + return result; > +} > + > static struct page *shmem_alloc_hugepage(gfp_t gfp, > struct shmem_inode_info *info, pgoff_t index) > { > @@ -1889,6 +1909,7 @@ static int shmem_getpage_gfp(struct inode *inode, pgoff_t index, > > alloc_huge: > huge_gfp = vma_thp_gfp_mask(vma); > + huge_gfp = limit_gfp_mask(huge_gfp, gfp); > page = shmem_alloc_and_acct_page(huge_gfp, inode, index, true); > if (IS_ERR(page)) { > alloc_nohuge: > -- > 2.25.4 -- Michal Hocko SUSE Labs