From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-5.5 required=3.0 tests=INCLUDES_PATCH, MAILING_LIST_MULTI,SPF_PASS,USER_AGENT_MUTT autolearn=ham autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id 93820C43382 for ; Wed, 26 Sep 2018 14:22:33 +0000 (UTC) Received: from vger.kernel.org (vger.kernel.org [209.132.180.67]) by mail.kernel.org (Postfix) with ESMTP id 55F3220676 for ; Wed, 26 Sep 2018 14:22:33 +0000 (UTC) DMARC-Filter: OpenDMARC Filter v1.3.2 mail.kernel.org 55F3220676 Authentication-Results: mail.kernel.org; dmarc=fail (p=none dis=none) header.from=kernel.org Authentication-Results: mail.kernel.org; spf=none smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727527AbeIZUfm (ORCPT ); Wed, 26 Sep 2018 16:35:42 -0400 Received: from mx2.suse.de ([195.135.220.15]:52454 "EHLO mx1.suse.de" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1726768AbeIZUfm (ORCPT ); Wed, 26 Sep 2018 16:35:42 -0400 X-Virus-Scanned: by amavisd-new at test-mx.suse.de Received: from relay2.suse.de (unknown [195.135.220.254]) by mx1.suse.de (Postfix) with ESMTP id 7A37CAC38; Wed, 26 Sep 2018 14:22:29 +0000 (UTC) Date: Wed, 26 Sep 2018 16:22:27 +0200 From: Michal Hocko To: "Kirill A. Shutemov" Cc: Andrew Morton , Mel Gorman , Vlastimil Babka , David Rientjes , Andrea Argangeli , Zi Yan , Stefan Priebe - Profihost AG , linux-mm@kvack.org, LKML Subject: Re: [PATCH 2/2] mm, thp: consolidate THP gfp handling into alloc_hugepage_direct_gfpmask Message-ID: <20180926142227.GZ6278@dhcp22.suse.cz> References: <20180925120326.24392-1-mhocko@kernel.org> <20180925120326.24392-3-mhocko@kernel.org> <20180926133039.y7o5x4nafovxzh2s@kshutemo-mobl1> <20180926141708.GX6278@dhcp22.suse.cz> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20180926141708.GX6278@dhcp22.suse.cz> User-Agent: Mutt/1.10.1 (2018-07-13) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed 26-09-18 16:17:08, Michal Hocko wrote: > On Wed 26-09-18 16:30:39, Kirill A. Shutemov wrote: > > On Tue, Sep 25, 2018 at 02:03:26PM +0200, Michal Hocko wrote: > > > diff --git a/mm/huge_memory.c b/mm/huge_memory.c > > > index c3bc7e9c9a2a..c0bcede31930 100644 > > > --- a/mm/huge_memory.c > > > +++ b/mm/huge_memory.c > > > @@ -629,21 +629,40 @@ static vm_fault_t __do_huge_pmd_anonymous_page(struct vm_fault *vmf, > > > * available > > > * never: never stall for any thp allocation > > > */ > > > -static inline gfp_t alloc_hugepage_direct_gfpmask(struct vm_area_struct *vma) > > > +static inline gfp_t alloc_hugepage_direct_gfpmask(struct vm_area_struct *vma, unsigned long addr) > > > { > > > const bool vma_madvised = !!(vma->vm_flags & VM_HUGEPAGE); > > > + gfp_t this_node = 0; > > > + > > > +#ifdef CONFIG_NUMA > > > + struct mempolicy *pol; > > > + /* > > > + * __GFP_THISNODE is used only when __GFP_DIRECT_RECLAIM is not > > > + * specified, to express a general desire to stay on the current > > > + * node for optimistic allocation attempts. If the defrag mode > > > + * and/or madvise hint requires the direct reclaim then we prefer > > > + * to fallback to other node rather than node reclaim because that > > > + * can lead to excessive reclaim even though there is free memory > > > + * on other nodes. We expect that NUMA preferences are specified > > > + * by memory policies. > > > + */ > > > + pol = get_vma_policy(vma, addr); > > > + if (pol->mode != MPOL_BIND) > > > + this_node = __GFP_THISNODE; > > > + mpol_cond_put(pol); > > > +#endif > > > > I'm not very good with NUMA policies. Could you explain in more details how > > the code above is equivalent to the code below? > > MPOL_PREFERRED is handled by policy_node() before we call __alloc_pages_nodemask. > __GFP_THISNODE is applied only when we are not using > __GFP_DIRECT_RECLAIM which is handled in alloc_hugepage_direct_gfpmask > now. > Lastly MPOL_BIND wasn't handled explicitly but in the end the removed > late check would remove __GFP_THISNODE for it as well. So in the end we > are doing the same thing unless I miss something Forgot to add. One notable exception would be that the previous code would allow to hit WARN_ON_ONCE(policy->mode == MPOL_BIND && (gfp & __GFP_THISNODE)); in policy_node if the requested node (e.g. cpu local one) was outside of the mbind nodemask. This is not possible now. We haven't heard about any such warning yet so it is unlikely that it happens though. -- Michal Hocko SUSE Labs