dri-devel.lists.freedesktop.org archive mirror
 help / color / mirror / Atom feed
From: bugzilla-daemon@bugzilla.kernel.org
To: dri-devel@lists.freedesktop.org
Subject: [Bug 215315] New: [REGRESSION BISECTED] amdgpu crashes system suspend - NUC8i7HVKVA
Date: Sun, 12 Dec 2021 23:08:28 +0000	[thread overview]
Message-ID: <bug-215315-2300@https.bugzilla.kernel.org/> (raw)


            Bug ID: 215315
           Summary: [REGRESSION BISECTED] amdgpu crashes system suspend -
           Product: Drivers
           Version: 2.5
    Kernel Version: 5.15-rc1, 5.15, 5.16-rc4
          Hardware: x86-64
                OS: Linux
              Tree: Mainline
            Status: NEW
          Severity: normal
          Priority: P1
         Component: Video(DRI - non Intel)
          Assignee: drivers_video-dri@kernel-bugs.osdl.org
          Reporter: lenb@kernel.org
        Regression: No

My Intel NUC8i7HVKVA has an AMD GPU.

Until 5.15-rc1, this machine was rock solid in suspend stress testing -- never
crashing after hundreds of hours of back-to-back suspend cycles.

Until this patch went upstream:

commit f7d6779df642720e22bffd449e683bb8690bd3bf (refs/bisect/bad)
Author: Guchun Chen <guchun.chen@amd.com>
Date:   Fri Aug 27 18:31:41 2021 +0800

    drm/amdgpu: stop scheduler when calling hw_fini (v2)

    This gurantees no more work on the ring can be submitted
    to hardware in suspend/resume case, otherwise a potential
    race will occur and the ring will get no chance to stay
    empty before suspend.

    v2: Call drm_sched_resubmit_job before drm_sched_start to
    restart jobs from the pending list.

    Suggested-by: Andrey Grodzovsky <andrey.grodzovsky@amd.com>
    Suggested-by: Christian König <christian.koenig@amd.com>
    Signed-off-by: Guchun Chen <guchun.chen@amd.com>
    Reviewed-by: Christian König <christian.koenig@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    Cc: stable@vger.kernel.org

I bisected that the patch before this one was integrated can handle over 1,000
back-to-back "freeze" system suspend cycles.  Yet, when this patch is present,
the system may crash before it completes only 100 cycles, and at most lasts a
few hundred cycles.

This crash is present in all following upstream rc's, including 5.15-rc4.

When I revert this patch from 5.15-rc4, stability returns.

Usually, the crash is manifest by a black screen, and a system that does not
respond to ping, and will only respond to a long AC power button press to
remove power; and a subsequent cold reboot.

I have witnessed the crash occur, and the "ubuntu color themed" screen enters
some sort of reverse video mode.  In this weird color mode, I've seen a text
window oscillate between scrolling and un-scrolling for a line -- sort of like
it is going back in time, but then changes its mind.  There is no response to
keyboard, mouse, or network input.

You may reply to this email to add a comment.

You are receiving this mail because:
You are watching the assignee of the bug.

             reply	other threads:[~2021-12-12 23:08 UTC|newest]

Thread overview: 3+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2021-12-12 23:08 bugzilla-daemon [this message]
2021-12-12 23:11 ` [Bug 215315] [REGRESSION BISECTED] amdgpu crashes system suspend - NUC8i7HVKVA bugzilla-daemon
2021-12-13  6:04 ` [Bug 215315] New: " Thorsten Leemhuis

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=bug-215315-2300@https.bugzilla.kernel.org/ \
    --to=bugzilla-daemon@bugzilla.kernel.org \
    --cc=dri-devel@lists.freedesktop.org \


* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).