Date: Wed, 1 Jun 2022 22:08:35 -0700
From: Niranjana Vishwanathapura
To: Daniel Vetter
Cc: "Zanoni, Paulo R", intel-gfx@lists.freedesktop.org,
 dri-devel@lists.freedesktop.org, "Hellstrom, Thomas",
 "Wilson, Chris P", "Vetter, Daniel", christian.koenig@amd.com
Subject: Re: [Intel-gfx] [RFC v3 3/3] drm/doc/rfc: VM_BIND uapi definition
Message-ID: <20220602050833.GP4461@nvishwa1-DESK>
References: <20220517183212.20274-1-niranjana.vishwanathapura@intel.com>
 <20220517183212.20274-4-niranjana.vishwanathapura@intel.com>
 <20220523191943.GH4461@nvishwa1-DESK>

On Wed, Jun 01, 2022 at 11:27:17AM +0200, Daniel Vetter wrote:
>On Wed, 1 Jun 2022 at 11:03, Dave Airlie wrote:
>>
>> On Tue, 24 May 2022 at 05:20, Niranjana Vishwanathapura
>> wrote:
>> >
>> > On Thu, May 19, 2022 at 04:07:30PM -0700, Zanoni, Paulo R wrote:
>> > >On Tue, 2022-05-17 at 11:32 -0700, Niranjana Vishwanathapura wrote:
>> > >> VM_BIND and related uapi definitions
>> > >>
>> > >> v2: Ensure proper kernel-doc formatting with cross references.
>> > >>     Also add new uapi and documentation as per review comments
>> > >>     from Daniel.
>> > >>
>> > >> Signed-off-by: Niranjana Vishwanathapura
>> > >> ---
>> > >>  Documentation/gpu/rfc/i915_vm_bind.h | 399 +++++++++++++++++++++++++++
>> > >>  1 file changed, 399 insertions(+)
>> > >>  create mode 100644 Documentation/gpu/rfc/i915_vm_bind.h
>> > >>
>> > >> diff --git a/Documentation/gpu/rfc/i915_vm_bind.h b/Documentation/gpu/rfc/i915_vm_bind.h
>> > >> new file mode 100644
>> > >> index 000000000000..589c0a009107
>> > >> --- /dev/null
>> > >> +++ b/Documentation/gpu/rfc/i915_vm_bind.h
>> > >> @@ -0,0 +1,399 @@
>> > >> +/* SPDX-License-Identifier: MIT */
>> > >> +/*
>> > >> + * Copyright © 2022 Intel Corporation
>> > >> + */
>> > >> +
>> > >> +/**
>> > >> + * DOC: I915_PARAM_HAS_VM_BIND
>> > >> + *
>> > >> + * VM_BIND feature availability.
>> > >> + * See typedef drm_i915_getparam_t param.
>> > >> + */
>> > >> +#define I915_PARAM_HAS_VM_BIND 57
>> > >> +
>> > >> +/**
>> > >> + * DOC: I915_VM_CREATE_FLAGS_USE_VM_BIND
>> > >> + *
>> > >> + * Flag to opt-in for VM_BIND mode of binding during VM creation.
>> > >> + * See struct drm_i915_gem_vm_control flags.
>> > >> + *
>> > >> + * A VM in VM_BIND mode will not support the older execbuff mode of binding.
>> > >> + * In VM_BIND mode, execbuff ioctl will not accept any execlist (ie., the
>> > >> + * &drm_i915_gem_execbuffer2.buffer_count must be 0).
>> > >> + * Also, &drm_i915_gem_execbuffer2.batch_start_offset and
>> > >> + * &drm_i915_gem_execbuffer2.batch_len must be 0.
>> > >> + * DRM_I915_GEM_EXECBUFFER_EXT_BATCH_ADDRESSES extension must be provided
>> > >> + * to pass in the batch buffer addresses.
>> > >> + *
>> > >> + * Additionally, I915_EXEC_NO_RELOC, I915_EXEC_HANDLE_LUT and
>> > >> + * I915_EXEC_BATCH_FIRST of &drm_i915_gem_execbuffer2.flags must be 0
>> > >> + * (not used) in VM_BIND mode. I915_EXEC_USE_EXTENSIONS flag must always be
>> > >> + * set (See struct drm_i915_gem_execbuffer_ext_batch_addresses).
>> > >> + * The buffers_ptr, buffer_count, batch_start_offset and batch_len fields
>> > >> + * of struct drm_i915_gem_execbuffer2 are also not used and must be 0.
>> > >> + */
>> > >
>> > >From that description, it seems we have:
>> > >
>> > >struct drm_i915_gem_execbuffer2 {
>> > >        __u64 buffers_ptr;         -> must be 0 (new)
>> > >        __u32 buffer_count;        -> must be 0 (new)
>> > >        __u32 batch_start_offset;  -> must be 0 (new)
>> > >        __u32 batch_len;           -> must be 0 (new)
>> > >        __u32 DR1;                 -> must be 0 (old)
>> > >        __u32 DR4;                 -> must be 0 (old)
>> > >        __u32 num_cliprects; (fences)  -> must be 0 since using extensions
>> > >        __u64 cliprects_ptr; (fences, extensions)  -> contains an actual pointer!
>> > >        __u64 flags;               -> some flags must be 0 (new)
>> > >        __u64 rsvd1; (context info)  -> repurposed field (old)
>> > >        __u64 rsvd2;               -> unused
>> > >};
>> > >
>> > >Based on that, why can't we just get drm_i915_gem_execbuffer3 instead
>> > >of adding even more complexity to an already abused interface? While
>> > >the Vulkan-like extension thing is really nice, I don't think what
>> > >we're doing here is extending the ioctl usage, we're completely
>> > >changing how the base struct should be interpreted based on how the VM
>> > >was created (which is an entirely different ioctl).
>> > >
>> > >From Rusty Russell's API Design grading, drm_i915_gem_execbuffer2 is
>> > >already at -6 without these changes. I think after vm_bind we'll need
>> > >to create a -11 entry just to deal with this ioctl.
>> > >
>> >
>> > The only change here is removing the execlist support for VM_BIND
>> > mode (other than natural extensions).
>> > Adding a new execbuffer3 was considered, but I think we need to be careful
>> > with that as it goes beyond the VM_BIND support, including any future
>> > requirements (as we don't want an execbuffer4 after VM_BIND).
>>
>> Why not? It's not like adding extensions here is really that different
>> from adding new ioctls.
>>
>> I definitely think this deserves an execbuffer3 without even
>> considering future requirements. Just to burn down the old
>> requirements and pointless fields.
>>
>> Make execbuffer3 be vm bind only, no relocs, no legacy bits, leave the
>> older sw on execbuf2 forever.
>
>I guess another point in favour of execbuf3 would be that it's less
>midlayer. If we share the entry point then there's quite a few vfuncs
>needed to cleanly split out the vm_bind paths from the legacy
>reloc/softpin paths.
>
>If we invert this and do execbuf3, then there's the existing ioctl
>vfunc, and then we share code (where it even makes sense, probably
>request setup/submit need to be shared, anything else is probably
>cleaner to just copypaste) with the usual helper approach.
>
>Also that would guarantee that really none of the old concepts like
>i915_active on the vma or vma open counts and all that stuff leaks
>into the new vm_bind execbuf.
>
>Finally I also think that copypasting would make backporting easier,
>or at least more flexible, since it should make it easier to have the
>upstream vm_bind co-exist with all the other things we have. Without
>huge amounts of conflicts (or at least much less) that pushing a pile
>of vfuncs into the existing code would cause.
>
>So maybe we should do this?

Thanks Dave, Daniel.

There are a few things that will be common between execbuf2 and execbuf3,
like request setup/submit (as you said), fence handling (timeline fences,
fence array, composite fences), engine selection, etc. Many of the 'flags'
will also exist in execbuf3, though their bit positions will differ. But
these should be fine, as the suggestion here is to copy the execbuf code
and share helpers where it makes sense. Besides, we can drop support for
some older features in execbuf3 (like the fence array, in favor of the
newer timeline fences), which will further reduce the common code.

Ok, I will update this series by adding execbuf3 and send out soon.
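To make that concrete, here is a very rough sketch of the kind of
slimmed-down struct I have in mind for execbuf3. This is illustrative
only, not the actual proposal; the field names, sizes and layout are
all placeholders until the next revision of the series:

/* Illustrative sketch only, not final uapi. __u32/__u64 as in
 * <linux/types.h>; field names are placeholders for discussion.
 */
struct drm_i915_gem_execbuffer3 {
	__u32 ctx_id;		/* context id, no longer buried in rsvd1 */
	__u32 engine_idx;	/* engine map index to submit to */

	__u64 batch_address;	/* GPU VA of the batch; there is no BO
				 * list, the VA comes from earlier
				 * VM_BIND calls */

	__u64 flags;		/* fresh flag space, all unused bits MBZ */

	__u64 extensions;	/* chained extensions (e.g. timeline
				 * fences); everything else goes here */
};

The point being: no buffers_ptr/buffer_count, no DR1/DR4, no cliprects,
and a flags field that starts from zero with strict MBZ checking.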
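And for completeness, on the userspace side the feature check stays the
usual getparam dance. A minimal sketch, assuming I915_PARAM_HAS_VM_BIND
lands with the value proposed above (only the existing libdrm
drmIoctl()/DRM_IOCTL_I915_GETPARAM interface is used):

#include <stdbool.h>
#include <xf86drm.h>      /* drmIoctl() */
#include <i915_drm.h>     /* struct drm_i915_getparam */

static bool has_vm_bind(int fd)
{
	int value = 0;
	struct drm_i915_getparam gp = {
		.param = I915_PARAM_HAS_VM_BIND,
		.value = &value,
	};

	/* Older kernels reject unknown params, so treat any
	 * error as "not supported".
	 */
	if (drmIoctl(fd, DRM_IOCTL_I915_GETPARAM, &gp))
		return false;

	return value != 0;
}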
Niranjana

>-Daniel
>--
>Daniel Vetter
>Software Engineer, Intel Corporation
>http://blog.ffwll.ch