From: Daniel J Blueman <daniel.blueman@gmail.com>
To: Chris Wilson <chris@chris-wilson.co.uk>
Cc: intel-gfx@lists.freedesktop.org, Keith Packard <keith.packard@intel.com>
Subject: Re: Sandy Bridge GPU hang reproducer...
Date: Fri, 20 May 2011 11:10:24 +0100
Message-ID: <BANLkTimQR05SgKCb_ThHmOB=L0YhfHqgkw@mail.gmail.com>
In-Reply-To: <e0d58a$fki7@orsmga002.jf.intel.com>

On 18 May 2011 19:04, Chris Wilson <chris@chris-wilson.co.uk> wrote:
> On Wed, 18 May 2011 12:38:44 +0100, Daniel J Blueman <daniel.blueman@gmail.com> wrote:
>> Hi Chris et al,
>>
>> On my Sandy Bridge GPU (8086:0126 rev 09) laptop, I often see hangs
>> that are correctly recovered and sometimes ones which aren't (causing
>> X lockup or kernel hard lock), hurting usability.
>>
>> I'm able to reproduce GPU hangs often with the composite tests in
>> rendercheck (may need to restart a few times):
>>
>> $ ./rendercheck -t composite,cacomposite
>> Begin composite mask test on a8
>> <command hang, maybe GPU hang too>
>
> Reproducing use rendercheck is unusual, as that is something that we do
> run frequently.
>
> Looking through the i915_error_state made me wince (lots of 1x1 copies
> over the same pixel...) but it does remind me of
>
>  https://bugzilla.kernel.org/show_bug.cgi?id=27892
>
> for which the workaround is to flush the caches after every op (Option
> "DebugFlushCaches" "True").

After considerable time with 'DebugFlushCaches' enabled, I still hit a
crash. With just 'DebugFlushBatches', it's solid so far (>20 sigma).
Sometimes, I still see the hangcheck timer get hit [1] but nothing
more.
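
For anyone wanting to try the same setup, the xorg.conf fragment would
look roughly like the sketch below. It assumes the same Option syntax
Chris quoted above for "DebugFlushCaches"; the Identifier string is a
placeholder and not taken from my actual configuration.

```
Section "Device"
    # "Intel Graphics" is a placeholder identifier (assumption)
    Identifier "Intel Graphics"
    Driver     "intel"
    # Option name as used above; syntax follows Chris's example
    Option     "DebugFlushBatches" "True"
EndSection
```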

This is quite a heavy-handed workaround, yet it is instrumental to
stability, so it would be really good to find a more specific
workaround that can be enabled by default.

Thanks,
  Daniel

--- [1]

[   43.015860] [drm:i915_hangcheck_ring_idle] *ERROR* Hangcheck timer elapsed... blt ring idle [waiting on 8471, at 8471], missed IRQ?
[30413.917749] [drm:i915_hangcheck_ring_idle] *ERROR* Hangcheck timer elapsed... blt ring idle [waiting on 1390703, at 1390703], missed IRQ?
[33033.624549] [drm:i915_hangcheck_ring_idle] *ERROR* Hangcheck timer elapsed... blt ring idle [waiting on 4186964, at 4186964], missed IRQ?
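
In case it helps with tallying these from a longer dmesg capture, here
is a minimal Python sketch that pulls the timestamp, ring name and the
two sequence numbers out of messages in the format shown above. The
regular expression and the function name parse_hangchecks are my own
illustration, written against only these three messages; other kernel
versions may word the message differently.

```python
import re

# Pattern written against the message format shown in [1] only (assumption:
# other kernels may format this differently). \s+ also matches the line
# breaks that a mail client may insert when wrapping the message.
HANGCHECK_RE = re.compile(
    r"\[\s*(?P<time>\d+\.\d+)\]\s+\[drm:i915_hangcheck_ring_idle\]\s+"
    r"\*ERROR\*\s+Hangcheck timer\s+elapsed\.\.\.\s+"
    r"(?P<ring>\w+) ring idle\s+"
    r"\[waiting on (?P<waiting>\d+), at (?P<at>\d+)\]"
)


def parse_hangchecks(log_text):
    """Return one dict per hangcheck message found in log_text."""
    events = []
    for match in HANGCHECK_RE.finditer(log_text):
        events.append({
            "time": float(match.group("time")),      # seconds since boot
            "ring": match.group("ring"),             # e.g. "blt"
            "waiting": int(match.group("waiting")),  # seqno being waited on
            "at": int(match.group("at")),            # seqno the ring reported
        })
    return events


if __name__ == "__main__":
    sample = (
        "[   43.015860] [drm:i915_hangcheck_ring_idle] *ERROR* Hangcheck timer\n"
        "elapsed... blt ring idle [waiting on 8471, at 8471], missed IRQ?\n"
    )
    for event in parse_hangchecks(sample):
        # In all three messages above the two sequence numbers are equal.
        print(event["time"], event["ring"], event["waiting"] == event["at"])
```

Feeding it the output of dmesg gives a list that can be counted or
grouped by ring; the check on "waiting" versus "at" just makes the
pattern in the messages above easy to spot.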
-- 
Daniel J Blueman
