From mboxrd@z Thu Jan 1 00:00:00 1970 From: Ben Widawsky Subject: Re: [PATCH] drm/i915: Trigger hangcheck if we detect more a repeating missed IRQ Date: Tue, 10 Apr 2012 16:59:11 -0700 Message-ID: <20120410165911.7a0dd7b5@bwidawsk.net> References: <1334073641-22717-1-git-send-email-chris@chris-wilson.co.uk> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Return-path: Received: from cloud01.chad-versace.us (184-106-247-128.static.cloud-ips.com [184.106.247.128]) by gabe.freedesktop.org (Postfix) with ESMTP id 0749C9E93F for ; Tue, 10 Apr 2012 16:59:20 -0700 (PDT) In-Reply-To: <1334073641-22717-1-git-send-email-chris@chris-wilson.co.uk> List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: intel-gfx-bounces+gcfxdi-intel-gfx=m.gmane.org@lists.freedesktop.org Errors-To: intel-gfx-bounces+gcfxdi-intel-gfx=m.gmane.org@lists.freedesktop.org To: Chris Wilson Cc: intel-gfx@lists.freedesktop.org List-Id: intel-gfx@lists.freedesktop.org On Tue, 10 Apr 2012 17:00:41 +0100 Chris Wilson wrote: > On the first instance we just wish to kick the waiters and see if that > terminates the wait conditions. If it does not, then we do not want to > keep retrying without ever making any forward progress and becoming > stuck in a hangcheck loop. > > Reported-and-tested-by: Lukas Hejtmanek > Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=48209 > Signed-off-by: Chris Wilson I'm still confused about the problem we are purportedly fixing. This should happen if we've missed an irq (or the watchdog fired too soon), and then fires again before the thread has actually woken up to realize that is missed the first IRQ? As for extract the kick_ring bit of code for core hangcheck_elapsed, that looks fine. I just don't quite understand the exact problem this solves, and can't envision how we hit this case it seems the patch will fix. Ben