dri-devel.lists.freedesktop.org archive mirror
 help / color / mirror / Atom feed
From: bugzilla-daemon@bugzilla.kernel.org
To: dri-devel@lists.freedesktop.org
Subject: [Bug 215315] New: [REGRESSION BISECTED] amdgpu crashes system suspend - NUC8i7HVKVA
Date: Sun, 12 Dec 2021 23:08:28 +0000	[thread overview]
Message-ID: <bug-215315-2300@https.bugzilla.kernel.org/> (raw)

https://bugzilla.kernel.org/show_bug.cgi?id=215315

            Bug ID: 215315
           Summary: [REGRESSION BISECTED] amdgpu crashes system suspend -
                    NUC8i7HVKVA
           Product: Drivers
           Version: 2.5
    Kernel Version: 5.15-rc1, 5.15, 5.16-rc4
          Hardware: x86-64
                OS: Linux
              Tree: Mainline
            Status: NEW
          Severity: normal
          Priority: P1
         Component: Video(DRI - non Intel)
          Assignee: drivers_video-dri@kernel-bugs.osdl.org
          Reporter: lenb@kernel.org
        Regression: No

My Intel NUC8i7HVKVA has an AMD GPU.

Until 5.15-rc1, this machine was rock solid in suspend stress testing -- never
crashing after hundreds of hours of back-to-back suspend cycles.

Until this patch went upstream:

commit f7d6779df642720e22bffd449e683bb8690bd3bf (refs/bisect/bad)
Author: Guchun Chen <guchun.chen@amd.com>
Date:   Fri Aug 27 18:31:41 2021 +0800

    drm/amdgpu: stop scheduler when calling hw_fini (v2)

    This gurantees no more work on the ring can be submitted
    to hardware in suspend/resume case, otherwise a potential
    race will occur and the ring will get no chance to stay
    empty before suspend.

    v2: Call drm_sched_resubmit_job before drm_sched_start to
    restart jobs from the pending list.

    Suggested-by: Andrey Grodzovsky <andrey.grodzovsky@amd.com>
    Suggested-by: Christian König <christian.koenig@amd.com>
    Signed-off-by: Guchun Chen <guchun.chen@amd.com>
    Reviewed-by: Christian König <christian.koenig@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    Cc: stable@vger.kernel.org

I bisected that the patch before this one was integrated can handle over 1,000
back-to-back "freeze" system suspend cycles.  Yet, when this patch is present,
the system may crash before it completes only 100 cycles, and at most lasts a
few hundred cycles.

This crash is present in all following upstream rc's, including 5.15-rc4.

When I revert this patch from 5.15-rc4, stability returns.

Usually, the crash is manifest by a black screen, and a system that does not
respond to ping, and will only respond to a long AC power button press to
remove power; and a subsequent cold reboot.

I have witnessed the crash occur, and the "ubuntu color themed" screen enters
some sort of reverse video mode.  In this weird color mode, I've seen a text
window oscillate between scrolling and un-scrolling for a line -- sort of like
it is going back in time, but then changes its mind.  There is no response to
keyboard, mouse, or network input.

-- 
You may reply to this email to add a comment.

You are receiving this mail because:
You are watching the assignee of the bug.

             reply	other threads:[~2021-12-12 23:08 UTC|newest]

Thread overview: 3+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2021-12-12 23:08 bugzilla-daemon [this message]
2021-12-12 23:11 ` [Bug 215315] " bugzilla-daemon
2021-12-13  6:04 ` [Bug 215315] New: " Thorsten Leemhuis

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=bug-215315-2300@https.bugzilla.kernel.org/ \
    --to=bugzilla-daemon@bugzilla.kernel.org \
    --cc=dri-devel@lists.freedesktop.org \
    --subject='Re: [Bug 215315] New: [REGRESSION BISECTED] amdgpu crashes system suspend - NUC8i7HVKVA' \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).