From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-6.8 required=3.0 tests=HEADER_FROM_DIFFERENT_DOMAINS, INCLUDES_PATCH,MAILING_LIST_MULTI,SIGNED_OFF_BY,SPF_HELO_NONE,SPF_PASS, URIBL_BLOCKED autolearn=ham autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id 2802EC83000 for ; Tue, 28 Apr 2020 12:45:21 +0000 (UTC) Received: from gabe.freedesktop.org (gabe.freedesktop.org [131.252.210.177]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPS id 037B3206F0 for ; Tue, 28 Apr 2020 12:45:20 +0000 (UTC) DMARC-Filter: OpenDMARC Filter v1.3.2 mail.kernel.org 037B3206F0 Authentication-Results: mail.kernel.org; dmarc=fail (p=none dis=none) header.from=linux.intel.com Authentication-Results: mail.kernel.org; spf=none smtp.mailfrom=intel-gfx-bounces@lists.freedesktop.org Received: from gabe.freedesktop.org (localhost [127.0.0.1]) by gabe.freedesktop.org (Postfix) with ESMTP id 86AC76E339; Tue, 28 Apr 2020 12:45:20 +0000 (UTC) Received: from mga11.intel.com (mga11.intel.com [192.55.52.93]) by gabe.freedesktop.org (Postfix) with ESMTPS id BCD4B6E354 for ; Tue, 28 Apr 2020 12:45:18 +0000 (UTC) IronPort-SDR: OPmPNEF9uy71u1UupyCcQJSWc2FX1rfr4OHKZF7dh5V1XjCDyGm4dFjlP6RzmUWx1SeJBBQpU/ iGH71q0KNFHw== X-Amp-Result: SKIPPED(no attachment in message) X-Amp-File-Uploaded: False Received: from fmsmga005.fm.intel.com ([10.253.24.32]) by fmsmga102.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 28 Apr 2020 05:45:18 -0700 IronPort-SDR: sGRB2PKFQjRfCnly3HoqU0NjhtDwL1S+IXrEm1hryuF5ZNlT0b5rMiTXxVxIonL0NcIPyojEVy XDa9V/fmQLxA== X-IronPort-AV: E=Sophos;i="5.73,327,1583222400"; d="scan'208";a="458779591" Received: from jkrzyszt-desk.igk.intel.com ([172.22.244.18]) by fmsmga005-auth.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 28 Apr 2020 05:45:17 -0700 Message-ID: <64c6797e1031f3f9da8b29cce8553b059b788a31.camel@linux.intel.com> From: Janusz Krzysztofik To: Chris Wilson , intel-gfx@lists.freedesktop.org Date: Tue, 28 Apr 2020 14:45:13 +0200 In-Reply-To: <20200424152412.5928-1-chris@chris-wilson.co.uk> References: <20200424152412.5928-1-chris@chris-wilson.co.uk> Organization: Intel Technology Poland sp. z o.o. - ul. Slowackiego 173, 80-298 Gdansk - KRS 101882 - NIP 957-07-52-316 User-Agent: Evolution 3.32.5 (3.32.5-1.fc30) MIME-Version: 1.0 Subject: Re: [Intel-gfx] [PATCH] drm/i915/gem: Lazily acquire the device wakeref for freeing objects X-BeenThere: intel-gfx@lists.freedesktop.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Intel graphics driver community testing & development List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Errors-To: intel-gfx-bounces@lists.freedesktop.org Sender: "Intel-gfx" Hi Chris, On Fri, 2020-04-24 at 16:24 +0100, Chris Wilson wrote: > We only need the device wakeref on freeing the objects if we have to > unbind the object from the global GTT, or otherwise update device > information. If the objects are clean, we never need the wakeref, so > avoid taking until required. > > For this to be effective in preventing us from waking the device after > it is unbind, we also need to mark the GGTT as closed on device removal. > The GGTT will be rebuilt from scratch the next time we need to open it > (on binding a new device). > > Signed-off-by: Chris Wilson > Cc: Janusz Krzysztofik > --- > drivers/gpu/drm/i915/gem/i915_gem_mman.c | 3 +++ > drivers/gpu/drm/i915/gem/i915_gem_object.c | 3 --- > drivers/gpu/drm/i915/gt/intel_ggtt.c | 9 +++++++-- > drivers/gpu/drm/i915/gt/intel_gtt.h | 1 + > drivers/gpu/drm/i915/i915_drv.c | 1 + > drivers/gpu/drm/i915/i915_vma.c | 3 ++- > 6 files changed, 14 insertions(+), 6 deletions(-) > > diff --git a/drivers/gpu/drm/i915/gem/i915_gem_mman.c b/drivers/gpu/drm/i915/gem/i915_gem_mman.c > index b39c24dae64e..c6cead6f2b3e 100644 > --- a/drivers/gpu/drm/i915/gem/i915_gem_mman.c > +++ b/drivers/gpu/drm/i915/gem/i915_gem_mman.c > @@ -421,6 +421,9 @@ static void i915_gem_object_release_mmap_gtt(struct drm_i915_gem_object *obj) > struct drm_i915_private *i915 = to_i915(obj->base.dev); > intel_wakeref_t wakeref; > > + if (!atomic_read(&i915->ggtt.vm.open)) > + return; > + > /* > * Serialisation between user GTT access and our code depends upon > * revoking the CPU's PTE whilst the mutex is held. The next user > diff --git a/drivers/gpu/drm/i915/gem/i915_gem_object.c b/drivers/gpu/drm/i915/gem/i915_gem_object.c > index 9d1d0131f7c2..99356c00c19e 100644 > --- a/drivers/gpu/drm/i915/gem/i915_gem_object.c > +++ b/drivers/gpu/drm/i915/gem/i915_gem_object.c > @@ -162,9 +162,7 @@ static void __i915_gem_free_objects(struct drm_i915_private *i915, > struct llist_node *freed) > { > struct drm_i915_gem_object *obj, *on; > - intel_wakeref_t wakeref; > > - wakeref = intel_runtime_pm_get(&i915->runtime_pm); > llist_for_each_entry_safe(obj, on, freed, freed) { > struct i915_mmap_offset *mmo, *mn; > > @@ -224,7 +222,6 @@ static void __i915_gem_free_objects(struct drm_i915_private *i915, > call_rcu(&obj->rcu, __i915_gem_free_object_rcu); > cond_resched(); > } > - intel_runtime_pm_put(&i915->runtime_pm, wakeref); > } > > void i915_gem_flush_free_objects(struct drm_i915_private *i915) > diff --git a/drivers/gpu/drm/i915/gt/intel_ggtt.c b/drivers/gpu/drm/i915/gt/intel_ggtt.c > index 66165b10256e..b65545182ef5 100644 > --- a/drivers/gpu/drm/i915/gt/intel_ggtt.c > +++ b/drivers/gpu/drm/i915/gt/intel_ggtt.c > @@ -681,8 +681,6 @@ static void ggtt_cleanup_hw(struct i915_ggtt *ggtt) > { > struct i915_vma *vma, *vn; > > - atomic_set(&ggtt->vm.open, 0); > - > rcu_barrier(); /* flush the RCU'ed__i915_vm_release */ > flush_workqueue(ggtt->vm.i915->wq); > > @@ -709,6 +707,13 @@ static void ggtt_cleanup_hw(struct i915_ggtt *ggtt) > io_mapping_fini(&ggtt->iomap); > } > > +void i915_ggtt_driver_remove(struct drm_i915_private *i915) > +{ > + struct i915_ggtt *ggtt = &i915->ggtt; > + > + atomic_set(&ggtt->vm.open, 0); > +} > + > /** > * i915_ggtt_driver_release - Clean up GGTT hardware initialization > * @i915: i915 device > diff --git a/drivers/gpu/drm/i915/gt/intel_gtt.h b/drivers/gpu/drm/i915/gt/intel_gtt.h > index d93ebdf3fa0e..f140ce5c171a 100644 > --- a/drivers/gpu/drm/i915/gt/intel_gtt.h > +++ b/drivers/gpu/drm/i915/gt/intel_gtt.h > @@ -501,6 +501,7 @@ int i915_ggtt_enable_hw(struct drm_i915_private *i915); > void i915_ggtt_enable_guc(struct i915_ggtt *ggtt); > void i915_ggtt_disable_guc(struct i915_ggtt *ggtt); > int i915_init_ggtt(struct drm_i915_private *i915); > +void i915_ggtt_driver_remove(struct drm_i915_private *i915); > void i915_ggtt_driver_release(struct drm_i915_private *i915); > > static inline bool i915_ggtt_has_aperture(const struct i915_ggtt *ggtt) > diff --git a/drivers/gpu/drm/i915/i915_drv.c b/drivers/gpu/drm/i915/i915_drv.c > index 2d62efd9316f..bdf97a1cb7cc 100644 > --- a/drivers/gpu/drm/i915/i915_drv.c > +++ b/drivers/gpu/drm/i915/i915_drv.c > @@ -768,6 +768,7 @@ static void i915_driver_hw_remove(struct drm_i915_private *dev_priv) > > i915_perf_fini(dev_priv); > > + i915_ggtt_driver_remove(dev_priv); > if (pdev->msi_enabled) > pci_disable_msi(pdev); > > diff --git a/drivers/gpu/drm/i915/i915_vma.c b/drivers/gpu/drm/i915/i915_vma.c > index fc14ebf9a0b7..6fe56ad2a542 100644 > --- a/drivers/gpu/drm/i915/i915_vma.c > +++ b/drivers/gpu/drm/i915/i915_vma.c > @@ -1319,7 +1319,8 @@ int i915_vma_unbind(struct i915_vma *vma) > return -EAGAIN; > } > > - if (i915_vma_is_bound(vma, I915_VMA_GLOBAL_BIND)) > + if (i915_vma_is_bound(vma, I915_VMA_GLOBAL_BIND) && > + atomic_read(&vma->vm->open)) > /* XXX not always required: nop_clear_range */ > wakeref = intel_runtime_pm_get(&vm->i915->runtime_pm); > Can you please explain why you think it is OK to call mutex_lock_interruptible_nested(&closed_GGTT->mutex, PPGTT_subclass)? Can you please also explain why you think it is safe to call __i915_vma_unbind(vma_bound_to_GGTT) even if wakeref is not taken? Thanks, Janusz _______________________________________________ Intel-gfx mailing list Intel-gfx@lists.freedesktop.org https://lists.freedesktop.org/mailman/listinfo/intel-gfx