From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-10.5 required=3.0 tests=BAYES_00, DKIM_ADSP_CUSTOM_MED,DKIM_INVALID,DKIM_SIGNED,FREEMAIL_FORGED_FROMDOMAIN, FREEMAIL_FROM,HEADER_FROM_DIFFERENT_DOMAINS,INCLUDES_CR_TRAILER, INCLUDES_PATCH,MAILING_LIST_MULTI,SPF_HELO_NONE,SPF_PASS,URIBL_BLOCKED autolearn=unavailable autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id 6D301C43461 for ; Thu, 15 Apr 2021 09:24:12 +0000 (UTC) Received: from gabe.freedesktop.org (gabe.freedesktop.org [131.252.210.177]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPS id 1301C61222 for ; Thu, 15 Apr 2021 09:24:12 +0000 (UTC) DMARC-Filter: OpenDMARC Filter v1.3.2 mail.kernel.org 1301C61222 Authentication-Results: mail.kernel.org; dmarc=fail (p=none dis=none) header.from=gmail.com Authentication-Results: mail.kernel.org; spf=none smtp.mailfrom=dri-devel-bounces@lists.freedesktop.org Received: from gabe.freedesktop.org (localhost [127.0.0.1]) by gabe.freedesktop.org (Postfix) with ESMTP id 4EBE56EA17; Thu, 15 Apr 2021 09:24:10 +0000 (UTC) Received: from mail-qv1-xf2c.google.com (mail-qv1-xf2c.google.com [IPv6:2607:f8b0:4864:20::f2c]) by gabe.freedesktop.org (Postfix) with ESMTPS id 1F09E89D4F; Thu, 15 Apr 2021 09:24:09 +0000 (UTC) Received: by mail-qv1-xf2c.google.com with SMTP id bs7so10844284qvb.12; Thu, 15 Apr 2021 02:24:09 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=nAGKDL6rzqEPFvWqbGn5kStdWJRIwFVxc31VEcY3Eas=; b=Ic1mWy+uwX0xuWIMJhwfd1Yn1wH3u362zhUxAjEAniGHPOCmRQP2z4ePtYhVkj2Kdx oZ4dsfAYMPeow/bOiL3+TxugtIn1TTfO6Qoh41/Dk6aXfqGcd7o4iSwd3d1oDkSF9vNN DvAgT1nqg/hTTTmvbZnUMrViLOvxi5Fgdin3NILWmjcvmK8W/9eH+OODvTe6wvFVyyDD +PK1mJPba0K1EcT65w141poua19ODTmrB6wrUB46ry43KOrnnRkDfmE4bR9wTtkFq166 Qh0f48UX+Lsh50h6B6aMwS1aTW3HGq8WeHC+QTkZPVPqYRJSqDnbkIxPOqk5JPjEiR/8 ZgkA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=nAGKDL6rzqEPFvWqbGn5kStdWJRIwFVxc31VEcY3Eas=; b=tDr89nArpSdl16It5bmc6xi/Tmh1Gf19QBm/gCX80UHjlEY/MQwJxIgeOvGf04TII4 8zUFjUhoHpI23u+PzOTqrJTt6MNivFNfa/buhLkcmGHkxDP+Rxw9kjniy/lrPo+QktYQ ZIHF3v8INEhfQXmMipp533D8DDUKjT5Za6Sn5Ch+jKTnMvIeRZ3J6s9eF0P3nv7vzqb8 l4wArM/T7sUsjHmBaHyk7kzvbSD09CafHbj4zk3TnToQ0Lh533GpCbK5MT7tUUk2X8dd 9PKpG7uvmIUkcDXgOdgx0E+t8t+/DuPcWN7yktnhe+CwtQDwSjQEnXFF8sYPY4aLPdGt ZExw== X-Gm-Message-State: AOAM533trDxede5LEp+GZ5cI/9GETxcOkzR+Ircd8KU1eYOJpsBkXnjQ cL412Ub566tjCkWK0E03bglrhNP4WDO2BHvDiRM= X-Google-Smtp-Source: ABdhPJyi6iAfk+fR1xkHIUh8/ki9WVhF6I1zHe0SVViMU+Ei9rSItLYYC+TeONZmEr7DspNvcyVa1XEA0Uwa2y+GJj4= X-Received: by 2002:a05:6214:1c45:: with SMTP id if5mr2263111qvb.48.1618478648088; Thu, 15 Apr 2021 02:24:08 -0700 (PDT) MIME-Version: 1.0 References: <20210412090526.30547-1-matthew.auld@intel.com> <20210412090526.30547-12-matthew.auld@intel.com> <7bc2a663-fbc0-6486-341a-7dca980c2440@linux.intel.com> <9aca524c-3710-dfaa-bfec-aef76aee7b26@linux.intel.com> In-Reply-To: <9aca524c-3710-dfaa-bfec-aef76aee7b26@linux.intel.com> From: Matthew Auld Date: Thu, 15 Apr 2021 10:23:41 +0100 Message-ID: Subject: Re: [Intel-gfx] [PATCH 11/19] drm/i915: Update the helper to set correct mapping To: Tvrtko Ursulin X-BeenThere: dri-devel@lists.freedesktop.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Direct Rendering Infrastructure - Development List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Cc: Intel Graphics Development , Matthew Auld , ML dri-devel Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" On Thu, 15 Apr 2021 at 09:21, Tvrtko Ursulin wrote: > > > On 14/04/2021 17:20, Matthew Auld wrote: > > On Wed, 14 Apr 2021 at 16:22, Tvrtko Ursulin > > wrote: > >> > >> > >> On 12/04/2021 10:05, Matthew Auld wrote: > >>> From: Venkata Sandeep Dhanalakota > >>> > >>> Determine the possible coherent map type based on object location, > >>> and if target has llc or if user requires an always coherent > >>> mapping. > >>> > >>> Cc: Matthew Auld > >>> Cc: CQ Tang > >>> Suggested-by: Michal Wajdeczko > >>> Signed-off-by: Venkata Sandeep Dhanalakota > >>> --- > >>> drivers/gpu/drm/i915/gt/intel_engine_cs.c | 3 ++- > >>> drivers/gpu/drm/i915/gt/intel_engine_pm.c | 2 +- > >>> drivers/gpu/drm/i915/gt/intel_lrc.c | 4 +++- > >>> drivers/gpu/drm/i915/gt/intel_ring.c | 9 ++++++--- > >>> drivers/gpu/drm/i915/gt/selftest_context.c | 3 ++- > >>> drivers/gpu/drm/i915/gt/selftest_hangcheck.c | 4 ++-- > >>> drivers/gpu/drm/i915/gt/selftest_lrc.c | 4 +++- > >>> drivers/gpu/drm/i915/gt/uc/intel_guc.c | 4 +++- > >>> drivers/gpu/drm/i915/gt/uc/intel_huc.c | 4 +++- > >>> drivers/gpu/drm/i915/i915_drv.h | 11 +++++++++-- > >>> drivers/gpu/drm/i915/selftests/igt_spinner.c | 4 ++-- > >>> 11 files changed, 36 insertions(+), 16 deletions(-) > >>> > >>> diff --git a/drivers/gpu/drm/i915/gt/intel_engine_cs.c b/drivers/gpu/drm/i915/gt/intel_engine_cs.c > >>> index efe935f80c1a..b79568d370f5 100644 > >>> --- a/drivers/gpu/drm/i915/gt/intel_engine_cs.c > >>> +++ b/drivers/gpu/drm/i915/gt/intel_engine_cs.c > >>> @@ -664,7 +664,8 @@ static int init_status_page(struct intel_engine_cs *engine) > >>> if (ret) > >>> goto err; > >>> > >>> - vaddr = i915_gem_object_pin_map(obj, I915_MAP_WB); > >>> + vaddr = i915_gem_object_pin_map(obj, > >>> + i915_coherent_map_type(engine->i915, obj, true)); > >>> if (IS_ERR(vaddr)) { > >>> ret = PTR_ERR(vaddr); > >>> goto err_unpin; > >>> diff --git a/drivers/gpu/drm/i915/gt/intel_engine_pm.c b/drivers/gpu/drm/i915/gt/intel_engine_pm.c > >>> index 7c9af86fdb1e..47f4397095e5 100644 > >>> --- a/drivers/gpu/drm/i915/gt/intel_engine_pm.c > >>> +++ b/drivers/gpu/drm/i915/gt/intel_engine_pm.c > >>> @@ -23,7 +23,7 @@ static void dbg_poison_ce(struct intel_context *ce) > >>> > >>> if (ce->state) { > >>> struct drm_i915_gem_object *obj = ce->state->obj; > >>> - int type = i915_coherent_map_type(ce->engine->i915); > >>> + int type = i915_coherent_map_type(ce->engine->i915, obj, true); > >>> void *map; > >>> > >>> if (!i915_gem_object_trylock(obj)) > >>> diff --git a/drivers/gpu/drm/i915/gt/intel_lrc.c b/drivers/gpu/drm/i915/gt/intel_lrc.c > >>> index e86897cde984..aafe2a4df496 100644 > >>> --- a/drivers/gpu/drm/i915/gt/intel_lrc.c > >>> +++ b/drivers/gpu/drm/i915/gt/intel_lrc.c > >>> @@ -903,7 +903,9 @@ lrc_pre_pin(struct intel_context *ce, > >>> GEM_BUG_ON(!i915_vma_is_pinned(ce->state)); > >>> > >>> *vaddr = i915_gem_object_pin_map(ce->state->obj, > >>> - i915_coherent_map_type(ce->engine->i915) | > >>> + i915_coherent_map_type(ce->engine->i915, > >>> + ce->state->obj, > >>> + false) | > >>> I915_MAP_OVERRIDE); > >>> > >>> return PTR_ERR_OR_ZERO(*vaddr); > >>> diff --git a/drivers/gpu/drm/i915/gt/intel_ring.c b/drivers/gpu/drm/i915/gt/intel_ring.c > >>> index aee0a77c77e0..3cf6c7e68108 100644 > >>> --- a/drivers/gpu/drm/i915/gt/intel_ring.c > >>> +++ b/drivers/gpu/drm/i915/gt/intel_ring.c > >>> @@ -53,9 +53,12 @@ int intel_ring_pin(struct intel_ring *ring, struct i915_gem_ww_ctx *ww) > >>> > >>> if (i915_vma_is_map_and_fenceable(vma)) > >>> addr = (void __force *)i915_vma_pin_iomap(vma); > >>> - else > >>> - addr = i915_gem_object_pin_map(vma->obj, > >>> - i915_coherent_map_type(vma->vm->i915)); > >>> + else { > >>> + int type = i915_coherent_map_type(vma->vm->i915, vma->obj, false); > >>> + > >>> + addr = i915_gem_object_pin_map(vma->obj, type); > >>> + } > >>> + > >>> if (IS_ERR(addr)) { > >>> ret = PTR_ERR(addr); > >>> goto err_ring; > >>> diff --git a/drivers/gpu/drm/i915/gt/selftest_context.c b/drivers/gpu/drm/i915/gt/selftest_context.c > >>> index b9bdd1d23243..26685b927169 100644 > >>> --- a/drivers/gpu/drm/i915/gt/selftest_context.c > >>> +++ b/drivers/gpu/drm/i915/gt/selftest_context.c > >>> @@ -88,7 +88,8 @@ static int __live_context_size(struct intel_engine_cs *engine) > >>> goto err; > >>> > >>> vaddr = i915_gem_object_pin_map_unlocked(ce->state->obj, > >>> - i915_coherent_map_type(engine->i915)); > >>> + i915_coherent_map_type(engine->i915, > >>> + ce->state->obj, false)); > >>> if (IS_ERR(vaddr)) { > >>> err = PTR_ERR(vaddr); > >>> intel_context_unpin(ce); > >>> diff --git a/drivers/gpu/drm/i915/gt/selftest_hangcheck.c b/drivers/gpu/drm/i915/gt/selftest_hangcheck.c > >>> index 746985971c3a..5b63d4df8c93 100644 > >>> --- a/drivers/gpu/drm/i915/gt/selftest_hangcheck.c > >>> +++ b/drivers/gpu/drm/i915/gt/selftest_hangcheck.c > >>> @@ -69,7 +69,7 @@ static int hang_init(struct hang *h, struct intel_gt *gt) > >>> h->seqno = memset(vaddr, 0xff, PAGE_SIZE); > >>> > >>> vaddr = i915_gem_object_pin_map_unlocked(h->obj, > >>> - i915_coherent_map_type(gt->i915)); > >>> + i915_coherent_map_type(gt->i915, h->obj, false)); > >>> if (IS_ERR(vaddr)) { > >>> err = PTR_ERR(vaddr); > >>> goto err_unpin_hws; > >>> @@ -130,7 +130,7 @@ hang_create_request(struct hang *h, struct intel_engine_cs *engine) > >>> return ERR_CAST(obj); > >>> } > >>> > >>> - vaddr = i915_gem_object_pin_map_unlocked(obj, i915_coherent_map_type(gt->i915)); > >>> + vaddr = i915_gem_object_pin_map_unlocked(obj, i915_coherent_map_type(gt->i915, obj, false)); > >>> if (IS_ERR(vaddr)) { > >>> i915_gem_object_put(obj); > >>> i915_vm_put(vm); > >>> diff --git a/drivers/gpu/drm/i915/gt/selftest_lrc.c b/drivers/gpu/drm/i915/gt/selftest_lrc.c > >>> index 85e7df6a5123..d8f6623524e8 100644 > >>> --- a/drivers/gpu/drm/i915/gt/selftest_lrc.c > >>> +++ b/drivers/gpu/drm/i915/gt/selftest_lrc.c > >>> @@ -1221,7 +1221,9 @@ static int compare_isolation(struct intel_engine_cs *engine, > >>> } > >>> > >>> lrc = i915_gem_object_pin_map_unlocked(ce->state->obj, > >>> - i915_coherent_map_type(engine->i915)); > >>> + i915_coherent_map_type(engine->i915, > >>> + ce->state->obj, > >>> + false)); > >>> if (IS_ERR(lrc)) { > >>> err = PTR_ERR(lrc); > >>> goto err_B1; > >>> diff --git a/drivers/gpu/drm/i915/gt/uc/intel_guc.c b/drivers/gpu/drm/i915/gt/uc/intel_guc.c > >>> index 78305b2ec89d..adae04c47aab 100644 > >>> --- a/drivers/gpu/drm/i915/gt/uc/intel_guc.c > >>> +++ b/drivers/gpu/drm/i915/gt/uc/intel_guc.c > >>> @@ -682,7 +682,9 @@ int intel_guc_allocate_and_map_vma(struct intel_guc *guc, u32 size, > >>> if (IS_ERR(vma)) > >>> return PTR_ERR(vma); > >>> > >>> - vaddr = i915_gem_object_pin_map_unlocked(vma->obj, I915_MAP_WB); > >>> + vaddr = i915_gem_object_pin_map_unlocked(vma->obj, > >>> + i915_coherent_map_type(guc_to_gt(guc)->i915, > >>> + vma->obj, true)); > >>> if (IS_ERR(vaddr)) { > >>> i915_vma_unpin_and_release(&vma, 0); > >>> return PTR_ERR(vaddr); > >>> diff --git a/drivers/gpu/drm/i915/gt/uc/intel_huc.c b/drivers/gpu/drm/i915/gt/uc/intel_huc.c > >>> index 2126dd81ac38..56d2144dc6a0 100644 > >>> --- a/drivers/gpu/drm/i915/gt/uc/intel_huc.c > >>> +++ b/drivers/gpu/drm/i915/gt/uc/intel_huc.c > >>> @@ -82,7 +82,9 @@ static int intel_huc_rsa_data_create(struct intel_huc *huc) > >>> if (IS_ERR(vma)) > >>> return PTR_ERR(vma); > >>> > >>> - vaddr = i915_gem_object_pin_map_unlocked(vma->obj, I915_MAP_WB); > >>> + vaddr = i915_gem_object_pin_map_unlocked(vma->obj, > >>> + i915_coherent_map_type(gt->i915, > >>> + vma->obj, true)); > >>> if (IS_ERR(vaddr)) { > >>> i915_vma_unpin_and_release(&vma, 0); > >>> return PTR_ERR(vaddr); > >>> diff --git a/drivers/gpu/drm/i915/i915_drv.h b/drivers/gpu/drm/i915/i915_drv.h > >>> index 69e43bf91a15..2abbc06712a4 100644 > >>> --- a/drivers/gpu/drm/i915/i915_drv.h > >>> +++ b/drivers/gpu/drm/i915/i915_drv.h > >>> @@ -78,6 +78,7 @@ > >>> #include "gem/i915_gem_context_types.h" > >>> #include "gem/i915_gem_shrinker.h" > >>> #include "gem/i915_gem_stolen.h" > >>> +#include "gem/i915_gem_lmem.h" > >>> > >>> #include "gt/intel_engine.h" > >>> #include "gt/intel_gt_types.h" > >>> @@ -1921,9 +1922,15 @@ static inline int intel_hws_csb_write_index(struct drm_i915_private *i915) > >>> } > >>> > >>> static inline enum i915_map_type > >>> -i915_coherent_map_type(struct drm_i915_private *i915) > >>> +i915_coherent_map_type(struct drm_i915_private *i915, > >>> + struct drm_i915_gem_object *obj, bool always_coherent) > >>> { > >>> - return HAS_LLC(i915) ? I915_MAP_WB : I915_MAP_WC; > >>> + if (i915_gem_object_is_lmem(obj)) > >>> + return I915_MAP_WC; > >>> + if (HAS_LLC(i915) || always_coherent) > >>> + return I915_MAP_WB; > >>> + else > >>> + return I915_MAP_WC; > >> > >> Seems this patch is doing two things. > >> > >> First it is adding lmem support to this helper by always returning WC > >> for lmem objects. > >> > >> Secondly it is introducing an idea of "always coherent" in a helper > >> called i915_coherent_map_type. Could someone explain what is coherent vs > >> always coherent? > >> > >> And also, why is always coherent happy with WB? Sounds counter intuitive > >> to me. > > > > All this does is try to keep the existing behaviour intact, whilst > > also ensuring that all lmem objects are mapped using only WC, no > > matter what. The always_coherent=true thing is for the existing places > > where we sometimes map the object using WB, without first considering > > whether the device has the fast shared LLC vs snooping. Yes, it's > > slightly ugly :) > > Not fully following - if we had to write kerneldoc for always_coherent > input argument - what it would say? @always_coherent - If true we should always try to map the object using WB. If false we should only map as WB if the device supports the fast shared LLC, in the case of snooped devices we will map use WC. Note that If the resource is lmem then we will always map as WC, regardless of the value of always_coherent, since that's all we currently support. Maybe the naming is poor? > > Regards, > > Tvrtko _______________________________________________ dri-devel mailing list dri-devel@lists.freedesktop.org https://lists.freedesktop.org/mailman/listinfo/dri-devel From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-10.5 required=3.0 tests=BAYES_00, DKIM_ADSP_CUSTOM_MED,DKIM_INVALID,DKIM_SIGNED,FREEMAIL_FORGED_FROMDOMAIN, FREEMAIL_FROM,HEADER_FROM_DIFFERENT_DOMAINS,INCLUDES_CR_TRAILER, INCLUDES_PATCH,MAILING_LIST_MULTI,SPF_HELO_NONE,SPF_PASS,URIBL_BLOCKED autolearn=ham autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id 35980C433ED for ; Thu, 15 Apr 2021 09:24:11 +0000 (UTC) Received: from gabe.freedesktop.org (gabe.freedesktop.org [131.252.210.177]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPS id C6DFB61222 for ; Thu, 15 Apr 2021 09:24:10 +0000 (UTC) DMARC-Filter: OpenDMARC Filter v1.3.2 mail.kernel.org C6DFB61222 Authentication-Results: mail.kernel.org; dmarc=fail (p=none dis=none) header.from=gmail.com Authentication-Results: mail.kernel.org; spf=none smtp.mailfrom=intel-gfx-bounces@lists.freedesktop.org Received: from gabe.freedesktop.org (localhost [127.0.0.1]) by gabe.freedesktop.org (Postfix) with ESMTP id F2DA689D4F; Thu, 15 Apr 2021 09:24:09 +0000 (UTC) Received: from mail-qv1-xf2c.google.com (mail-qv1-xf2c.google.com [IPv6:2607:f8b0:4864:20::f2c]) by gabe.freedesktop.org (Postfix) with ESMTPS id 1F09E89D4F; Thu, 15 Apr 2021 09:24:09 +0000 (UTC) Received: by mail-qv1-xf2c.google.com with SMTP id bs7so10844284qvb.12; Thu, 15 Apr 2021 02:24:09 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=nAGKDL6rzqEPFvWqbGn5kStdWJRIwFVxc31VEcY3Eas=; b=Ic1mWy+uwX0xuWIMJhwfd1Yn1wH3u362zhUxAjEAniGHPOCmRQP2z4ePtYhVkj2Kdx oZ4dsfAYMPeow/bOiL3+TxugtIn1TTfO6Qoh41/Dk6aXfqGcd7o4iSwd3d1oDkSF9vNN DvAgT1nqg/hTTTmvbZnUMrViLOvxi5Fgdin3NILWmjcvmK8W/9eH+OODvTe6wvFVyyDD +PK1mJPba0K1EcT65w141poua19ODTmrB6wrUB46ry43KOrnnRkDfmE4bR9wTtkFq166 Qh0f48UX+Lsh50h6B6aMwS1aTW3HGq8WeHC+QTkZPVPqYRJSqDnbkIxPOqk5JPjEiR/8 ZgkA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=nAGKDL6rzqEPFvWqbGn5kStdWJRIwFVxc31VEcY3Eas=; b=tDr89nArpSdl16It5bmc6xi/Tmh1Gf19QBm/gCX80UHjlEY/MQwJxIgeOvGf04TII4 8zUFjUhoHpI23u+PzOTqrJTt6MNivFNfa/buhLkcmGHkxDP+Rxw9kjniy/lrPo+QktYQ ZIHF3v8INEhfQXmMipp533D8DDUKjT5Za6Sn5Ch+jKTnMvIeRZ3J6s9eF0P3nv7vzqb8 l4wArM/T7sUsjHmBaHyk7kzvbSD09CafHbj4zk3TnToQ0Lh533GpCbK5MT7tUUk2X8dd 9PKpG7uvmIUkcDXgOdgx0E+t8t+/DuPcWN7yktnhe+CwtQDwSjQEnXFF8sYPY4aLPdGt ZExw== X-Gm-Message-State: AOAM533trDxede5LEp+GZ5cI/9GETxcOkzR+Ircd8KU1eYOJpsBkXnjQ cL412Ub566tjCkWK0E03bglrhNP4WDO2BHvDiRM= X-Google-Smtp-Source: ABdhPJyi6iAfk+fR1xkHIUh8/ki9WVhF6I1zHe0SVViMU+Ei9rSItLYYC+TeONZmEr7DspNvcyVa1XEA0Uwa2y+GJj4= X-Received: by 2002:a05:6214:1c45:: with SMTP id if5mr2263111qvb.48.1618478648088; Thu, 15 Apr 2021 02:24:08 -0700 (PDT) MIME-Version: 1.0 References: <20210412090526.30547-1-matthew.auld@intel.com> <20210412090526.30547-12-matthew.auld@intel.com> <7bc2a663-fbc0-6486-341a-7dca980c2440@linux.intel.com> <9aca524c-3710-dfaa-bfec-aef76aee7b26@linux.intel.com> In-Reply-To: <9aca524c-3710-dfaa-bfec-aef76aee7b26@linux.intel.com> From: Matthew Auld Date: Thu, 15 Apr 2021 10:23:41 +0100 Message-ID: To: Tvrtko Ursulin Subject: Re: [Intel-gfx] [PATCH 11/19] drm/i915: Update the helper to set correct mapping X-BeenThere: intel-gfx@lists.freedesktop.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Intel graphics driver community testing & development List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Cc: Intel Graphics Development , Matthew Auld , ML dri-devel Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Errors-To: intel-gfx-bounces@lists.freedesktop.org Sender: "Intel-gfx" On Thu, 15 Apr 2021 at 09:21, Tvrtko Ursulin wrote: > > > On 14/04/2021 17:20, Matthew Auld wrote: > > On Wed, 14 Apr 2021 at 16:22, Tvrtko Ursulin > > wrote: > >> > >> > >> On 12/04/2021 10:05, Matthew Auld wrote: > >>> From: Venkata Sandeep Dhanalakota > >>> > >>> Determine the possible coherent map type based on object location, > >>> and if target has llc or if user requires an always coherent > >>> mapping. > >>> > >>> Cc: Matthew Auld > >>> Cc: CQ Tang > >>> Suggested-by: Michal Wajdeczko > >>> Signed-off-by: Venkata Sandeep Dhanalakota > >>> --- > >>> drivers/gpu/drm/i915/gt/intel_engine_cs.c | 3 ++- > >>> drivers/gpu/drm/i915/gt/intel_engine_pm.c | 2 +- > >>> drivers/gpu/drm/i915/gt/intel_lrc.c | 4 +++- > >>> drivers/gpu/drm/i915/gt/intel_ring.c | 9 ++++++--- > >>> drivers/gpu/drm/i915/gt/selftest_context.c | 3 ++- > >>> drivers/gpu/drm/i915/gt/selftest_hangcheck.c | 4 ++-- > >>> drivers/gpu/drm/i915/gt/selftest_lrc.c | 4 +++- > >>> drivers/gpu/drm/i915/gt/uc/intel_guc.c | 4 +++- > >>> drivers/gpu/drm/i915/gt/uc/intel_huc.c | 4 +++- > >>> drivers/gpu/drm/i915/i915_drv.h | 11 +++++++++-- > >>> drivers/gpu/drm/i915/selftests/igt_spinner.c | 4 ++-- > >>> 11 files changed, 36 insertions(+), 16 deletions(-) > >>> > >>> diff --git a/drivers/gpu/drm/i915/gt/intel_engine_cs.c b/drivers/gpu/drm/i915/gt/intel_engine_cs.c > >>> index efe935f80c1a..b79568d370f5 100644 > >>> --- a/drivers/gpu/drm/i915/gt/intel_engine_cs.c > >>> +++ b/drivers/gpu/drm/i915/gt/intel_engine_cs.c > >>> @@ -664,7 +664,8 @@ static int init_status_page(struct intel_engine_cs *engine) > >>> if (ret) > >>> goto err; > >>> > >>> - vaddr = i915_gem_object_pin_map(obj, I915_MAP_WB); > >>> + vaddr = i915_gem_object_pin_map(obj, > >>> + i915_coherent_map_type(engine->i915, obj, true)); > >>> if (IS_ERR(vaddr)) { > >>> ret = PTR_ERR(vaddr); > >>> goto err_unpin; > >>> diff --git a/drivers/gpu/drm/i915/gt/intel_engine_pm.c b/drivers/gpu/drm/i915/gt/intel_engine_pm.c > >>> index 7c9af86fdb1e..47f4397095e5 100644 > >>> --- a/drivers/gpu/drm/i915/gt/intel_engine_pm.c > >>> +++ b/drivers/gpu/drm/i915/gt/intel_engine_pm.c > >>> @@ -23,7 +23,7 @@ static void dbg_poison_ce(struct intel_context *ce) > >>> > >>> if (ce->state) { > >>> struct drm_i915_gem_object *obj = ce->state->obj; > >>> - int type = i915_coherent_map_type(ce->engine->i915); > >>> + int type = i915_coherent_map_type(ce->engine->i915, obj, true); > >>> void *map; > >>> > >>> if (!i915_gem_object_trylock(obj)) > >>> diff --git a/drivers/gpu/drm/i915/gt/intel_lrc.c b/drivers/gpu/drm/i915/gt/intel_lrc.c > >>> index e86897cde984..aafe2a4df496 100644 > >>> --- a/drivers/gpu/drm/i915/gt/intel_lrc.c > >>> +++ b/drivers/gpu/drm/i915/gt/intel_lrc.c > >>> @@ -903,7 +903,9 @@ lrc_pre_pin(struct intel_context *ce, > >>> GEM_BUG_ON(!i915_vma_is_pinned(ce->state)); > >>> > >>> *vaddr = i915_gem_object_pin_map(ce->state->obj, > >>> - i915_coherent_map_type(ce->engine->i915) | > >>> + i915_coherent_map_type(ce->engine->i915, > >>> + ce->state->obj, > >>> + false) | > >>> I915_MAP_OVERRIDE); > >>> > >>> return PTR_ERR_OR_ZERO(*vaddr); > >>> diff --git a/drivers/gpu/drm/i915/gt/intel_ring.c b/drivers/gpu/drm/i915/gt/intel_ring.c > >>> index aee0a77c77e0..3cf6c7e68108 100644 > >>> --- a/drivers/gpu/drm/i915/gt/intel_ring.c > >>> +++ b/drivers/gpu/drm/i915/gt/intel_ring.c > >>> @@ -53,9 +53,12 @@ int intel_ring_pin(struct intel_ring *ring, struct i915_gem_ww_ctx *ww) > >>> > >>> if (i915_vma_is_map_and_fenceable(vma)) > >>> addr = (void __force *)i915_vma_pin_iomap(vma); > >>> - else > >>> - addr = i915_gem_object_pin_map(vma->obj, > >>> - i915_coherent_map_type(vma->vm->i915)); > >>> + else { > >>> + int type = i915_coherent_map_type(vma->vm->i915, vma->obj, false); > >>> + > >>> + addr = i915_gem_object_pin_map(vma->obj, type); > >>> + } > >>> + > >>> if (IS_ERR(addr)) { > >>> ret = PTR_ERR(addr); > >>> goto err_ring; > >>> diff --git a/drivers/gpu/drm/i915/gt/selftest_context.c b/drivers/gpu/drm/i915/gt/selftest_context.c > >>> index b9bdd1d23243..26685b927169 100644 > >>> --- a/drivers/gpu/drm/i915/gt/selftest_context.c > >>> +++ b/drivers/gpu/drm/i915/gt/selftest_context.c > >>> @@ -88,7 +88,8 @@ static int __live_context_size(struct intel_engine_cs *engine) > >>> goto err; > >>> > >>> vaddr = i915_gem_object_pin_map_unlocked(ce->state->obj, > >>> - i915_coherent_map_type(engine->i915)); > >>> + i915_coherent_map_type(engine->i915, > >>> + ce->state->obj, false)); > >>> if (IS_ERR(vaddr)) { > >>> err = PTR_ERR(vaddr); > >>> intel_context_unpin(ce); > >>> diff --git a/drivers/gpu/drm/i915/gt/selftest_hangcheck.c b/drivers/gpu/drm/i915/gt/selftest_hangcheck.c > >>> index 746985971c3a..5b63d4df8c93 100644 > >>> --- a/drivers/gpu/drm/i915/gt/selftest_hangcheck.c > >>> +++ b/drivers/gpu/drm/i915/gt/selftest_hangcheck.c > >>> @@ -69,7 +69,7 @@ static int hang_init(struct hang *h, struct intel_gt *gt) > >>> h->seqno = memset(vaddr, 0xff, PAGE_SIZE); > >>> > >>> vaddr = i915_gem_object_pin_map_unlocked(h->obj, > >>> - i915_coherent_map_type(gt->i915)); > >>> + i915_coherent_map_type(gt->i915, h->obj, false)); > >>> if (IS_ERR(vaddr)) { > >>> err = PTR_ERR(vaddr); > >>> goto err_unpin_hws; > >>> @@ -130,7 +130,7 @@ hang_create_request(struct hang *h, struct intel_engine_cs *engine) > >>> return ERR_CAST(obj); > >>> } > >>> > >>> - vaddr = i915_gem_object_pin_map_unlocked(obj, i915_coherent_map_type(gt->i915)); > >>> + vaddr = i915_gem_object_pin_map_unlocked(obj, i915_coherent_map_type(gt->i915, obj, false)); > >>> if (IS_ERR(vaddr)) { > >>> i915_gem_object_put(obj); > >>> i915_vm_put(vm); > >>> diff --git a/drivers/gpu/drm/i915/gt/selftest_lrc.c b/drivers/gpu/drm/i915/gt/selftest_lrc.c > >>> index 85e7df6a5123..d8f6623524e8 100644 > >>> --- a/drivers/gpu/drm/i915/gt/selftest_lrc.c > >>> +++ b/drivers/gpu/drm/i915/gt/selftest_lrc.c > >>> @@ -1221,7 +1221,9 @@ static int compare_isolation(struct intel_engine_cs *engine, > >>> } > >>> > >>> lrc = i915_gem_object_pin_map_unlocked(ce->state->obj, > >>> - i915_coherent_map_type(engine->i915)); > >>> + i915_coherent_map_type(engine->i915, > >>> + ce->state->obj, > >>> + false)); > >>> if (IS_ERR(lrc)) { > >>> err = PTR_ERR(lrc); > >>> goto err_B1; > >>> diff --git a/drivers/gpu/drm/i915/gt/uc/intel_guc.c b/drivers/gpu/drm/i915/gt/uc/intel_guc.c > >>> index 78305b2ec89d..adae04c47aab 100644 > >>> --- a/drivers/gpu/drm/i915/gt/uc/intel_guc.c > >>> +++ b/drivers/gpu/drm/i915/gt/uc/intel_guc.c > >>> @@ -682,7 +682,9 @@ int intel_guc_allocate_and_map_vma(struct intel_guc *guc, u32 size, > >>> if (IS_ERR(vma)) > >>> return PTR_ERR(vma); > >>> > >>> - vaddr = i915_gem_object_pin_map_unlocked(vma->obj, I915_MAP_WB); > >>> + vaddr = i915_gem_object_pin_map_unlocked(vma->obj, > >>> + i915_coherent_map_type(guc_to_gt(guc)->i915, > >>> + vma->obj, true)); > >>> if (IS_ERR(vaddr)) { > >>> i915_vma_unpin_and_release(&vma, 0); > >>> return PTR_ERR(vaddr); > >>> diff --git a/drivers/gpu/drm/i915/gt/uc/intel_huc.c b/drivers/gpu/drm/i915/gt/uc/intel_huc.c > >>> index 2126dd81ac38..56d2144dc6a0 100644 > >>> --- a/drivers/gpu/drm/i915/gt/uc/intel_huc.c > >>> +++ b/drivers/gpu/drm/i915/gt/uc/intel_huc.c > >>> @@ -82,7 +82,9 @@ static int intel_huc_rsa_data_create(struct intel_huc *huc) > >>> if (IS_ERR(vma)) > >>> return PTR_ERR(vma); > >>> > >>> - vaddr = i915_gem_object_pin_map_unlocked(vma->obj, I915_MAP_WB); > >>> + vaddr = i915_gem_object_pin_map_unlocked(vma->obj, > >>> + i915_coherent_map_type(gt->i915, > >>> + vma->obj, true)); > >>> if (IS_ERR(vaddr)) { > >>> i915_vma_unpin_and_release(&vma, 0); > >>> return PTR_ERR(vaddr); > >>> diff --git a/drivers/gpu/drm/i915/i915_drv.h b/drivers/gpu/drm/i915/i915_drv.h > >>> index 69e43bf91a15..2abbc06712a4 100644 > >>> --- a/drivers/gpu/drm/i915/i915_drv.h > >>> +++ b/drivers/gpu/drm/i915/i915_drv.h > >>> @@ -78,6 +78,7 @@ > >>> #include "gem/i915_gem_context_types.h" > >>> #include "gem/i915_gem_shrinker.h" > >>> #include "gem/i915_gem_stolen.h" > >>> +#include "gem/i915_gem_lmem.h" > >>> > >>> #include "gt/intel_engine.h" > >>> #include "gt/intel_gt_types.h" > >>> @@ -1921,9 +1922,15 @@ static inline int intel_hws_csb_write_index(struct drm_i915_private *i915) > >>> } > >>> > >>> static inline enum i915_map_type > >>> -i915_coherent_map_type(struct drm_i915_private *i915) > >>> +i915_coherent_map_type(struct drm_i915_private *i915, > >>> + struct drm_i915_gem_object *obj, bool always_coherent) > >>> { > >>> - return HAS_LLC(i915) ? I915_MAP_WB : I915_MAP_WC; > >>> + if (i915_gem_object_is_lmem(obj)) > >>> + return I915_MAP_WC; > >>> + if (HAS_LLC(i915) || always_coherent) > >>> + return I915_MAP_WB; > >>> + else > >>> + return I915_MAP_WC; > >> > >> Seems this patch is doing two things. > >> > >> First it is adding lmem support to this helper by always returning WC > >> for lmem objects. > >> > >> Secondly it is introducing an idea of "always coherent" in a helper > >> called i915_coherent_map_type. Could someone explain what is coherent vs > >> always coherent? > >> > >> And also, why is always coherent happy with WB? Sounds counter intuitive > >> to me. > > > > All this does is try to keep the existing behaviour intact, whilst > > also ensuring that all lmem objects are mapped using only WC, no > > matter what. The always_coherent=true thing is for the existing places > > where we sometimes map the object using WB, without first considering > > whether the device has the fast shared LLC vs snooping. Yes, it's > > slightly ugly :) > > Not fully following - if we had to write kerneldoc for always_coherent > input argument - what it would say? @always_coherent - If true we should always try to map the object using WB. If false we should only map as WB if the device supports the fast shared LLC, in the case of snooped devices we will map use WC. Note that If the resource is lmem then we will always map as WC, regardless of the value of always_coherent, since that's all we currently support. Maybe the naming is poor? > > Regards, > > Tvrtko _______________________________________________ Intel-gfx mailing list Intel-gfx@lists.freedesktop.org https://lists.freedesktop.org/mailman/listinfo/intel-gfx