From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-2.3 required=3.0 tests=HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,SPF_HELO_NONE,SPF_PASS,USER_AGENT_SANE_1 autolearn=no autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id 76D71C433FF for ; Thu, 8 Aug 2019 17:03:09 +0000 (UTC) Received: from vger.kernel.org (vger.kernel.org [209.132.180.67]) by mail.kernel.org (Postfix) with ESMTP id 573212173C for ; Thu, 8 Aug 2019 17:03:09 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2404177AbfHHRDI (ORCPT ); Thu, 8 Aug 2019 13:03:08 -0400 Received: from mga18.intel.com ([134.134.136.126]:41450 "EHLO mga18.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1732444AbfHHRDI (ORCPT ); Thu, 8 Aug 2019 13:03:08 -0400 X-Amp-Result: SKIPPED(no attachment in message) X-Amp-File-Uploaded: False Received: from orsmga008.jf.intel.com ([10.7.209.65]) by orsmga106.jf.intel.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 08 Aug 2019 10:03:06 -0700 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.64,362,1559545200"; d="scan'208";a="169048728" Received: from jmath3-mobl1.ger.corp.intel.com (HELO [10.252.5.86]) ([10.252.5.86]) by orsmga008.jf.intel.com with ESMTP; 08 Aug 2019 10:03:02 -0700 Subject: Re: [PATCHv2 2/3] i915: convert to new mount API To: Chris Wilson , Al Viro , Hugh Dickins Cc: Christoph Hellwig , Sergey Senozhatsky , Jani Nikula , Joonas Lahtinen , Rodrigo Vivi , David Howells , David Airlie , Daniel Vetter , intel-gfx@lists.freedesktop.org, dri-devel@lists.freedesktop.org, linux-kernel@vger.kernel.org, linux-fsdevel@vger.kernel.org References: <20190805160307.5418-1-sergey.senozhatsky@gmail.com> <20190805160307.5418-3-sergey.senozhatsky@gmail.com> <20190805181255.GH1131@ZenIV.linux.org.uk> <20190805182834.GI1131@ZenIV.linux.org.uk> <20190807063002.GG6627@lst.de> <20190808012314.GK1131@ZenIV.linux.org.uk> <156528143789.22627.18099397585070419297@skylake-alporthouse-com> From: Matthew Auld Message-ID: <8a9a9aee-a266-5ab2-85b3-e39e821b2d03@intel.com> Date: Thu, 8 Aug 2019 18:03:02 +0100 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101 Thunderbird/60.7.0 MIME-Version: 1.0 In-Reply-To: <156528143789.22627.18099397585070419297@skylake-alporthouse-com> Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: en-US Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 08/08/2019 17:23, Chris Wilson wrote: > Quoting Hugh Dickins (2019-08-08 16:54:16) >> On Thu, 8 Aug 2019, Al Viro wrote: >>> On Wed, Aug 07, 2019 at 08:30:02AM +0200, Christoph Hellwig wrote: >>>> On Tue, Aug 06, 2019 at 12:50:10AM -0700, Hugh Dickins wrote: >>>>> Though personally I'm averse to managing "f"objects through >>>>> "m"interfaces, which can get ridiculous (notably, MADV_HUGEPAGE works >>>>> on the virtual address of a mapping, but the huge-or-not alignment of >>>>> that mapping must have been decided previously). In Google we do use >>>>> fcntls F_HUGEPAGE and F_NOHUGEPAGE to override on a per-file basis - >>>>> one day I'll get to upstreaming those. >>>> >>>> Such an interface seems very useful, although the two fcntls seem a bit >>>> odd. >>>> >>>> But I think the point here is that the i915 has its own somewhat odd >>>> instance of tmpfs. If we could pass the equivalent of the huge=* >>>> options to shmem_file_setup all that garbage (including the >>>> shmem_file_setup_with_mnt function) could go away. >>> >>> ... or follow shmem_file_super() with whatever that fcntl maps to >>> internally. I would really love to get rid of that i915 kludge. >> >> As to the immediate problem of i915_gemfs using remount_fs on linux-next, >> IIUC, all that is necessary at the moment is the deletions patch below >> (but I'd prefer that to come from the i915 folks). Since gemfs has no >> need to change the huge option from its default to its default. >> >> As to the future of when they get back to wanting huge pages in gemfs, >> yes, that can probably best be arranged by using the internals of an >> fcntl F_HUGEPAGE on those objects that would benefit from it. >> >> Though my intention there was that the "huge=never" default ought >> to continue to refuse to give huge pages, even when asked by fcntl. >> So a little hackery may still be required, to allow the i915_gemfs >> internal mount to get huge pages when a user mount would not. >> >> As to whether shmem_file_setup_with_mnt() needs to live: I've given >> that no thought, but accept that shm_mnt is such a ragbag of different >> usages, that i915 is right to prefer their own separate gemfs mount. >> >> Hugh >> >> --- mmotm/drivers/gpu/drm/i915/gem/i915_gemfs.c 2019-07-21 19:40:16.573703780 -0700 >> +++ linux/drivers/gpu/drm/i915/gem/i915_gemfs.c 2019-08-08 07:19:23.967689058 -0700 >> @@ -24,28 +24,6 @@ int i915_gemfs_init(struct drm_i915_priv >> if (IS_ERR(gemfs)) >> return PTR_ERR(gemfs); >> >> - /* >> - * Enable huge-pages for objects that are at least HPAGE_PMD_SIZE, most >> - * likely 2M. Note that within_size may overallocate huge-pages, if say >> - * we allocate an object of size 2M + 4K, we may get 2M + 2M, but under >> - * memory pressure shmem should split any huge-pages which can be >> - * shrunk. >> - */ >> - >> - if (has_transparent_hugepage()) { >> - struct super_block *sb = gemfs->mnt_sb; >> - /* FIXME: Disabled until we get W/A for read BW issue. */ >> - char options[] = "huge=never"; >> - int flags = 0; >> - int err; >> - >> - err = sb->s_op->remount_fs(sb, &flags, options); >> - if (err) { >> - kern_unmount(gemfs); >> - return err; >> - } >> - } > > That's perfectly fine; we should probably leave a hint as to why gemfs > exists and include the suggestion of looking at per-file hugepage > controls. > > Matthew, how does this affect your current plans? If at all? Fine with me. > -Chris >