From mboxrd@z Thu Jan 1 00:00:00 1970
From: Rodrigo Vivi
Subject: Re: [PATCH v4] drm/i915: get a runtime PM ref for the deferred GPU reset work
Date: Thu, 24 Apr 2014 18:02:42 -0300
References: <1398187713-3104-1-git-send-email-imre.deak@intel.com> <1398204544-11084-1-git-send-email-imre.deak@intel.com>
In-Reply-To: <1398204544-11084-1-git-send-email-imre.deak@intel.com>
MIME-Version: 1.0
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: 7bit
To: Imre Deak
Cc: intel-gfx
List-Id: intel-gfx@lists.freedesktop.org

Makes sense to me, so

Reviewed-by: Rodrigo Vivi

On Tue, Apr 22, 2014 at 7:09 PM, Imre Deak wrote:
> Atm we can end up in the GPU reset deferred work in D3 state if the last
> runtime PM reference is dropped between detecting a hang/scheduling the
> work and executing the work. At least one such case I could trigger is
> the simulated reset via the i915_wedged debugfs entry. Fix this by
> getting an RPM reference around accessing the HW in the reset work.
>
> v2:
> - Instead of getting/putting the RPM reference in the reset work itself,
>   get it already before scheduling the work. By this we also prevent
>   going to D3 before the work gets to run, in addition to making sure
>   that we run the work itself in D0. (Ville, Daniel)
> v3:
> - fix inverted logic fail when putting the RPM ref on behalf of a
>   cancelled GPU reset work (Ville)
> v4:
> - Taking the RPM ref in the interrupt handler isn't really needed b/c
>   it's already guaranteed that we hold an RPM ref until the end of the
>   reset work in all cases we care about. So take the ref in the reset
>   work (for cases like i915_wedged_set). (Daniel)
>
> Signed-off-by: Imre Deak
> ---
>  drivers/gpu/drm/i915/i915_irq.c | 10 ++++++++++
>  1 file changed, 10 insertions(+)
>
> diff --git a/drivers/gpu/drm/i915/i915_irq.c b/drivers/gpu/drm/i915/i915_irq.c
> index a651d0d..0e47111 100644
> --- a/drivers/gpu/drm/i915/i915_irq.c
> +++ b/drivers/gpu/drm/i915/i915_irq.c
> @@ -2175,6 +2175,14 @@ static void i915_error_work_func(struct work_struct *work)
>  					reset_event);
>
>  	/*
> +	 * In most cases it's guaranteed that we get here with an RPM
> +	 * reference held, for example because there is a pending GPU
> +	 * request that won't finish until the reset is done. This
> +	 * isn't the case at least when we get here by doing a
> +	 * simulated reset via debugfs, so get an RPM reference.
> +	 */
> +	intel_runtime_pm_get(dev_priv);
> +	/*
>  	 * All state reset _must_ be completed before we update the
>  	 * reset counter, for otherwise waiters might miss the reset
>  	 * pending state and not properly drop locks, resulting in
> @@ -2184,6 +2192,8 @@ static void i915_error_work_func(struct work_struct *work)
>
>  	intel_display_handle_reset(dev);
>
> +	intel_runtime_pm_put(dev_priv);
> +
>  	if (ret == 0) {
>  		/*
>  		 * After all the gem state is reset, increment the reset
> --
> 1.8.4
>
> _______________________________________________
> Intel-gfx mailing list
> Intel-gfx@lists.freedesktop.org
> http://lists.freedesktop.org/mailman/listinfo/intel-gfx

--
Rodrigo Vivi
Blog: http://blog.vivi.eng.br