From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751388AbcLCJxy (ORCPT ); Sat, 3 Dec 2016 04:53:54 -0500 Received: from mga04.intel.com ([192.55.52.120]:25170 "EHLO mga04.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1750828AbcLCJxw (ORCPT ); Sat, 3 Dec 2016 04:53:52 -0500 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.33,291,1477983600"; d="scan'208";a="793759668" From: Jani Nikula To: Matt Turner , intel-gfx@lists.freedesktop.org Cc: Daniel Vetter , Mika Kuoppala , Kenneth Graunke , Mark Janes , linux-kernel@vger.kernel.org, Matt Turner , "Argotti\, Yann" , Chris Wilson Subject: Re: [PATCH] drm/i915: Remove instructions to file a bug report. In-Reply-To: <1480726985-12762-1-git-send-email-mattst88@gmail.com> Organization: Intel Finland Oy - BIC 0357606-4 - Westendinkatu 7, 02160 Espoo References: <1480726985-12762-1-git-send-email-mattst88@gmail.com> Date: Sat, 03 Dec 2016 11:52:49 +0200 Message-ID: <87inr1qqz2.fsf@intel.com> MIME-Version: 1.0 Content-Type: text/plain Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Sat, 03 Dec 2016, Matt Turner wrote: > From these instructions, users assume that /sys/class/drm/card0/error > contains all the information a developer needs to diagnose and fix a GPU > hang. > > In fact it doesn't, and we have no tools for solving them (other than > stabbing in the dark). Most of the time the error state itself isn't > even useful because it just shows a hang on a PIPE_CONTROL or similar. > > Until a time when the error state contains enough information to > actually solve a hang, stop telling users to file unsolvable bugs, and > instead rely on users who know where and how to file a good bug report > to find their own way there. > > Signed-off-by: Matt Turner > --- > Maybe now's a good time to discuss what *would* be useful to put in the > error state for debugging hangs. The currently executing shader program > would be a great place to start. I'm wondering why we're getting this patch now, and my guess is that it's because we have been reassigning the related bugs to Mesa more actively lately. Is that the case? IIUC the bug reports are useful for us when it's a kernel bug, but less useful for you when it's a Mesa bug. And you'd rather have fewer incoming bugs that you think are unsolvable with the information at hand. Sounds like a bug workflow issue between drm/i915 and Mesa to be ironed out. BR, Jani. > > drivers/gpu/drm/i915/i915_gpu_error.c | 11 ----------- > 1 file changed, 11 deletions(-) > > diff --git a/drivers/gpu/drm/i915/i915_gpu_error.c b/drivers/gpu/drm/i915/i915_gpu_error.c > index 334f15d..8ddca7b 100644 > --- a/drivers/gpu/drm/i915/i915_gpu_error.c > +++ b/drivers/gpu/drm/i915/i915_gpu_error.c > @@ -1431,7 +1431,6 @@ void i915_capture_error_state(struct drm_i915_private *dev_priv, > u32 engine_mask, > const char *error_msg) > { > - static bool warned; > struct drm_i915_error_state *error; > unsigned long flags; > > @@ -1475,16 +1474,6 @@ void i915_capture_error_state(struct drm_i915_private *dev_priv, > i915_error_state_free(&error->ref); > return; > } > - > - if (!warned) { > - DRM_INFO("GPU hangs can indicate a bug anywhere in the entire gfx stack, including userspace.\n"); > - DRM_INFO("Please file a _new_ bug report on bugs.freedesktop.org against DRI -> DRM/Intel\n"); > - DRM_INFO("drm/i915 developers can then reassign to the right component if it's not a kernel issue.\n"); > - DRM_INFO("The gpu crash dump is required to analyze gpu hangs, so please always attach it.\n"); > - DRM_INFO("GPU crash dump saved to /sys/class/drm/card%d/error\n", > - dev_priv->drm.primary->index); > - warned = true; > - } > } > > void i915_error_state_get(struct drm_device *dev, -- Jani Nikula, Intel Open Source Technology Center