All of lore.kernel.org
 help / color / mirror / Atom feed
From: Takashi Iwai <tiwai@suse.de>
To: Daniel Vetter <daniel@ffwll.ch>
Cc: intel-gfx@lists.freedesktop.org
Subject: Re: Possible i915 regression with 4.4-rc
Date: Fri, 04 Dec 2015 17:12:26 +0100	[thread overview]
Message-ID: <s5h1tb2yyz9.wl-tiwai@suse.de> (raw)
In-Reply-To: <20151204160252.GD10243@phenom.ffwll.local>

On Fri, 04 Dec 2015 17:02:52 +0100,
Daniel Vetter wrote:
> 
> On Fri, Dec 04, 2015 at 11:40:59AM +0200, Ville Syrjälä wrote:
> > On Fri, Dec 04, 2015 at 10:49:48AM +0200, Jani Nikula wrote:
> > > On Thu, 03 Dec 2015, Ville Syrjälä <ville.syrjala@linux.intel.com> wrote:
> > > > On Thu, Dec 03, 2015 at 09:00:55PM +0100, Takashi Iwai wrote:
> > > >> Hi,
> > > >> 
> > > >> I've experienced a few graphics issues recently, and I tend to believe
> > > >> that it has happened since 4.4-rc.  Namely, after some long time usage
> > > >> on my HSW laptop (two or three days), the mouse cursor vanished
> > > >> suddenly.  It kept pointing but just became invisible.  Also, after
> > > >> some S3 cycles, some glyphs on a console or on Firefox became
> > > >> invisible, too.  The windows and graphics were shown well, and X core
> > > >> fonts were still shown properly, too.  Switching to VT1 and back
> > > >> didn't change the situation.
> > > >
> > > > I think I have a fix for this *very* annoying problem. I'v been cursing
> > > > on irc for weeks about it, until I finally got off my arse and debugged
> > > > it.
> > > >
> > > > I pushed out my my cursor branch:
> > > > git://github.com/vsyrjala/linux.git disappearing_cursor_fix
> > > >
> > > > It has lots of other junk too, but it should be just there two that fix it:
> > > > 59f65fa270fb ("drm/i915: Kill intel_crtc->cursor_bo")
> > > > 25651a198d17 ("drm/i915: Drop the broken curcor base==0 special casing")
> > > >
> > > > Unfortunatleey I've managed to keep myself busy on other stuff, so didn't
> > > > send them out yet. Maybe tomorrow...
> > > 
> > > So I've hit this too, albeit very rarely, on a Haswell running Debian
> > > stable with the stock v3.16 kernel. Haven't seen it on any other
> > > machine. It's really too rare to even debug or verify a fix. Is it
> > > possible we just happened to make an old bug occur more frequently now?
> > 
> > The potential for it has definitely been there for a long time.
> 
> Oh dear, let's have fun and look at some awful history.
> 
> commit e568af1c626031925465a5caaab7cca1303d55c7
> Author: Daniel Vetter <daniel.vetter@ffwll.ch>
> Date:   Wed Mar 26 20:08:20 2014 +0100
> 
>     drm/i915: Undo gtt scratch pte unmapping again
> 
> Which essentially reverted
> 
> commit 828c79087cec61eaf4c76bb32c222fbe35ac3930
> Author: Ben Widawsky <benjamin.widawsky@intel.com>
> Date:   Wed Oct 16 09:21:30 2013 -0700
> 
>     drm/i915: Disable GGTT PTEs on GEN6+ suspend
>     
>     Once the machine gets to a certain point in the suspend process, we
>     expect the GPU to be idle. If it is not, we might corrupt memory.
>     Empirically (with an early version of this patch) we have seen this is
>     not the case. We cannot currently explain why the latent GPU writes
>     occur.
>     
>     In the technical sense, this patch is a workaround in that we have an
>     issue we can't explain, and the patch indirectly solves the issue.
>     However, it's really better than a workaround because we understand why
>     it works, and it really should be a safe thing to do in all cases.
>     
>     The noticeable effect other than the debug messages would be an increase
>     in the suspend time. I have not measure how expensive it actually is.
>     
>     I think it would be good to spend further time to root cause why we're
>     seeing these latent writes, but it shouldn't preclude preventing the
>     fallout.
>     
>     NOTE: It should be safe (and makes some sense IMO) to also keep the
>     VALID bit unset on resume when we clear_range(). I've opted not to do
>     this as properly clearing those bits at some later point would be extra
>     work.
>     
>     v2: Fix bugzilla link
>     
>     Bugzilla: http://bugs.freedesktop.org/show_bug.cgi?id=65496
>     Bugzilla: https://bugzilla.kernel.org/show_bug.cgi?id=59321
>     Tested-by: Takashi Iwai <tiwai@suse.de>
>     Tested-by: Paulo Zanoni <paulo.r.zanoni@intel.com>
>     Signed-off-by: Ben Widawsky <ben@bwidawsk.net>
>     Tested-By: Todd Previte <tprevite@gmail.com>
>     Cc: stable@vger.kernel.org
>     Signed-off-by: Daniel Vetter <daniel.vetter@ffwll.ch>
> 
> This was a regression in a regression right before I ragequit the entire
> bug handling deal because no one cared any more and management was all
> "why is this important".
> 
> Would be interesting if these issues magically disapper when changing that
> back again. Doesn't mean that we're any closer to figuring out what's
> corrupting what exactly here, but at least we'd have a reason to digg out
> this old sob story of mine.

Hm, but this revert was also fairly ago, and I don't remember of the
similar breakage until 4.4-rc.  Might be just a (bad) luck, though.

(And no surprise, I was already in the party above!  Everyone must
 have smoked badly there.)


thanks,

Takashi
_______________________________________________
Intel-gfx mailing list
Intel-gfx@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/intel-gfx

      reply	other threads:[~2015-12-04 16:12 UTC|newest]

Thread overview: 17+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2015-12-03 20:00 Possible i915 regression with 4.4-rc Takashi Iwai
2015-12-03 20:33 ` Ville Syrjälä
2015-12-03 21:08   ` Takashi Iwai
2015-12-03 21:25     ` Ville Syrjälä
2015-12-03 21:35       ` Chris Wilson
2015-12-04  8:44         ` Jani Nikula
2015-12-04 15:57           ` Daniel Vetter
2015-12-04 12:00         ` Dave Gordon
2015-12-04 12:06           ` Chris Wilson
2015-12-04 12:16             ` Chris Wilson
2015-12-04 12:45               ` Chris Wilson
2015-12-03 21:38       ` Lukas Wunner
2015-12-08  7:03       ` Takashi Iwai
2015-12-04  8:49   ` Jani Nikula
2015-12-04  9:40     ` Ville Syrjälä
2015-12-04 16:02       ` Daniel Vetter
2015-12-04 16:12         ` Takashi Iwai [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=s5h1tb2yyz9.wl-tiwai@suse.de \
    --to=tiwai@suse.de \
    --cc=daniel@ffwll.ch \
    --cc=intel-gfx@lists.freedesktop.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.