nouveau.lists.freedesktop.org archive mirror
From: Jamie Heilman <jamie@audible.transient.net>
To: Karol Herbst <kherbst@redhat.com>,
	Ben Skeggs <bskeggs@redhat.com>,
	LKML <linux-kernel@vger.kernel.org>,
	nouveau <nouveau@lists.freedesktop.org>
Subject: Re: [Nouveau] nouveau regression post v5.8, still present in v5.10
Date: Tue, 2 Feb 2021 20:26:48 +0000
Message-ID: <YBm1iJXeDvf/Z7B4@audible.transient.net>
In-Reply-To: <YAtxgP6YJJwcotuA@audible.transient.net>

Jamie Heilman wrote:
> Jamie Heilman wrote:
> > Karol Herbst wrote:
> > > fyi, there is a patch which solves a maybe related issue on your GPU,
> > > mind giving it a try before we dig further?
> > > https://gitlab.freedesktop.org/drm/nouveau/-/issues/14#note_767791
> > 
> > So, I tried that.  Turns out, I can still trigger a problem.  Is it
> > the same problem?  Maybe?  I also tried applying the patch from 
> > 
> > ca386aa7155a ("drm/nouveau/kms/nv50-gp1xx: add WAR for EVO push buffer HW bug")
> > > to 5.8.0-rc6-01516-g0a96099691c8 and, very interestingly, it changed
> > > the mode of failure to the same thing I saw with 5.10.9 patched with
> > > the patch from that bug report.  In both cases I get this in the log:
> > 
> > kern.err: nouveau 0000:01:00.0: Xorg[2243]: nv50cal_space: -16
> > kern.err: nouveau 0000:01:00.0: Xorg[2243]: nv50cal_space: -16
> > kern.err: nouveau 0000:01:00.0: Xorg[2243]: nv50cal_space: -16
> > kern.err: nouveau 0000:01:00.0: Xorg[2243]: nv50cal_space: -16
> > ...
> > and so on
> > 
> > > In one incident my monitor wouldn't even wake up anymore after this.
> > 
> > 
> > > I'm trying to repro it on an unpatched 5.8.0-rc6-01515-gae09163ac27c
> > > right now, as running glxgears does seem to help reproduce problems
> > > faster, which is nice.  I'm just not entirely sure it's the same set
> > > of problems; hopefully that version is free from issues, but we'll
> > > see...
> 
> Ugh, well I can crash 5.8.0-rc6-01515-gae09163ac27c and 5.8.18 in
> basically the same way by running glxgears and an xset dpms force off
> loop.  So I'm starting to think it's not the same thing, and that this
> problem has been latent since before I started having periodic issues.
> 
> I should note that my exact testing technique for the above was to run
> 4 copies of glxgears and the xset dpms force off loop at the same
> time.  That looks more like it may be triggering a resource starvation
> issue.  The crash is also worse, particularly if I don't do anything
> about it right away as my workstation eventually falls off the network
> and I'm forced to power cycle it; the crashes I was chasing after
> wouldn't do quite that much violence, normally I could still log in,
> rebuild a kernel, and shut things down cleanly.
> 
> More than one bug here I suspect.

OK, I went back and bisected again, patching known issues along the way
to get a better idea of what was causing the problem I've been having,
and I'm confident it was the bug which Bastian Beranek's patch (now in
mainline) addressed.  My original bisection got confused by the
EVO push buffer HW bug which was fixed in ca386aa7155a54.  Once I
bisected with the patch from ca386aa7155a54 applied, my bisection
landed on f844eb485eb05 and Bastian Beranek's patch fixed that right
up.
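
For anyone retracing this, carrying a known fix through a bisection can
be sketched roughly as below.  This is only an illustration, not the
exact procedure used: the helper names apply_fix and drop_fix are made
up, and it assumes the fix cherry-picks cleanly onto every commit the
bisection visits.

```shell
#!/bin/sh
# Sketch: keep a known fix applied at every bisection step so the bug it
# fixes cannot confuse the good/bad verdicts.
# FIX is the commit named in this thread; the helper names are hypothetical.
FIX=${FIX:-ca386aa7155a}

# Apply the fix to the working tree without committing, so the commit
# that git bisect checked out stays the one being judged.
apply_fix() {
    if git merge-base --is-ancestor "$FIX" HEAD; then
        return 0    # fix is already part of this commit's history
    fi
    git cherry-pick --no-commit "$FIX"
}

# Discard the temporarily applied fix before reporting the verdict.
drop_fix() {
    git reset --hard --quiet
}

# Typical manual round (kernel testing needs a reboot, so no "bisect run"):
#   git bisect start <bad> <good>
#   apply_fix         # then build, install, boot, test
#   drop_fix
#   git bisect good   # or: git bisect bad
```

Using --no-commit keeps HEAD on the commit under test, so the verdict
given to git bisect still refers to the right revision.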

'course I remain mildly concerned I can crash the kernel with little
more than glxgears and xset ... but the original stability problem I
reported I can safely say has been fixed.  If I can figure out the
nature of what I suspect is unrecoverable resource starvation, I'll
start a new thread for that.
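
The load in question, as described earlier in the thread (four copies of
glxgears plus an xset dpms force off loop), might be scripted roughly
like this.  It is a sketch under assumptions: the helper names, the
GEARS_CMD and DPMS_CMD variables, the iteration count, and the 5 second
pause are all made up for illustration and are not details from the
original test.

```shell
#!/bin/sh
# Sketch of the load described in the thread: several glxgears instances
# plus a loop that keeps forcing the display off via DPMS.
# GEARS_CMD, DPMS_CMD, the pause length and both helper names are
# assumptions for illustration only.
GEARS_CMD=${GEARS_CMD:-glxgears}
DPMS_CMD=${DPMS_CMD:-xset dpms force off}

# Start N background copies of the GL load and print one PID per line.
start_gears() {
    n=$1
    while [ "$n" -gt 0 ]; do
        $GEARS_CMD >/dev/null 2>&1 &
        echo $!
        n=$((n - 1))
    done
}

# Force the display off COUNT times, sleeping PAUSE seconds after each.
dpms_loop() {
    count=$1
    pause=${2:-5}
    while [ "$count" -gt 0 ]; do
        $DPMS_CMD || return 1
        sleep "$pause"
        count=$((count - 1))
    done
}

# Example (needs a running X session):
#   pids=$(start_gears 4)
#   dpms_loop 100
#   kill $pids
```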


-- 
Jamie Heilman                     http://audible.transient.net/~jamie/
