regressions.lists.linux.dev archive mirror
 help / color / mirror / Atom feed
From: "Limonciello, Mario" <Mario.Limonciello@amd.com>
To: Thorsten Leemhuis <regressions@leemhuis.info>,
	Kai-Heng Feng <kai.heng.feng@canonical.com>
Cc: "casteyde.christian@free.fr" <casteyde.christian@free.fr>,
	"stable@vger.kernel.org" <stable@vger.kernel.org>,
	"regressions@lists.linux.dev" <regressions@lists.linux.dev>,
	"Deucher, Alexander" <Alexander.Deucher@amd.com>,
	"gregkh@linuxfoundation.org" <gregkh@linuxfoundation.org>
Subject: RE: [REGRESSION] Laptop with Ryzen 4600H fails to resume video since 5.17.4 (works 5.17.3)
Date: Wed, 18 May 2022 20:15:16 +0000	[thread overview]
Message-ID: <MN0PR12MB6101666E2F0F4206FEDDF334E2D19@MN0PR12MB6101.namprd12.prod.outlook.com> (raw)
In-Reply-To: <2ce8f87e-785a-25b2-159a-cca45243b75b@leemhuis.info>

[AMD Official Use Only - General]



> -----Original Message-----
> From: Thorsten Leemhuis <regressions@leemhuis.info>
> Sent: Wednesday, May 18, 2022 01:37
> To: Kai-Heng Feng <kai.heng.feng@canonical.com>
> Cc: casteyde.christian@free.fr; stable@vger.kernel.org;
> regressions@lists.linux.dev; Deucher, Alexander
> <Alexander.Deucher@amd.com>; gregkh@linuxfoundation.org; Limonciello,
> Mario <Mario.Limonciello@amd.com>
> Subject: Re: [REGRESSION] Laptop with Ryzen 4600H fails to resume video
> since 5.17.4 (works 5.17.3)
> 
> On 18.05.22 07:54, Kai-Heng Feng wrote:
> > On Wed, May 18, 2022 at 1:52 PM Thorsten Leemhuis
> > <regressions@leemhuis.info> wrote:
> >>
> >> On 17.05.22 19:37, casteyde.christian@free.fr wrote:
> >>
> >>> I've tryied to revert the offending commit on 5.18-rc7 (887f75cfd0da
> >>> ("drm/amdgpu: Ensure HDA function is suspended before ASIC reset"),
> and
> >>> the problem disappears so it's really this commit that breaks.
> >>
> >> In that case I'll update the regzbot status to make sure it's visible as
> >> regression introduced in the 5.18 cycle:
> >>
> >> #regzbot introduced: 887f75cfd0da
> >>
> >> BTW: obviously would be nice to get this fixed before 5.18 is released
> >> (which might already happen on Sunday), especially as the culprit
> >> apparently was already backported to stable, but I guess that won't be
> >> easy...
> >>
> >> Which made me wondering: is reverting the culprit temporarily in
> >> mainline (and reapplying it later with a fix) a option here?
> >
> > It's too soon to call it's the culprit.
> 
> Well, sure, the root-cause might be somewhere else. But from the point
> of kernel regressions (and tracking them) it's the culprit, as that's
> the change that triggers the misbehavior. And that's how Linus
> approaches these things as well when it comes to reverting to fix
> regressions -- and he even might...
> 
> > The suspend on the system
> > doesn't work properly at the first place.
> 
> ...ignore things like this, as long as a revert is unlikely to cause
> more damage than good.

I think the right way to focus on this is to fix the original suspend issue.
The fact that the first suspend is failing with s3 should be a red flag that
the system is in a pretty bad state.

Maybe can we get /sys/power/pm_debug_messages turned on as well
as  /sys/power/pm_print_times.  Then we should have a better idea on
what is going on that triggers that first failure.

Again, it would be much better to put all this in a bug report somewhere.
It's really hard to associate dmesgs in a threaded email with what's going
on.  Kernel Bugzilla, AMD's Gitlab, it doesn't matter where really.  Anywhere
is better than email threads IMO.

In this case the revert would causes problems for the resume of any dGPU.
So it's a tradeoff of many dGPU resume failures vs one APU resume failure
in s2idle after a failed suspend in s3.

BTW - I'm not really sure why the system is picking s2idle for "the second try".
Is that the kernel doing this, or is this userspace causing it?  We really shouldn't
be seeing different suspend modes across attempts without a user consciously
selecting one.

> 
> Ciao. Thorsten
> 
> 
> >> Ciao, Thorsten (wearing his 'the Linux kernel's regression tracker' hat)
> >>
> >> P.S.: As the Linux kernel's regression tracker I deal with a lot of
> >> reports and sometimes miss something important when writing mails like
> >> this. If that's the case here, don't hesitate to tell me in a public
> >> reply, it's in everyone's interest to set the public record straight.
> >

  reply	other threads:[~2022-05-18 20:15 UTC|newest]

Thread overview: 30+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2022-05-14 14:41 [REGRESSION] Laptop with Ryzen 4600H fails to resume video since 5.17.4 (works 5.17.3) Christian Casteyde
2022-05-14 15:12 ` Thorsten Leemhuis
2022-05-14 17:34   ` Christian Casteyde
2022-05-16  2:47     ` Kai-Heng Feng
2022-05-16  3:06       ` Mario Limonciello
2022-05-16 17:23       ` Christian Casteyde
2022-05-17  2:03         ` Kai-Heng Feng
2022-05-17  6:36           ` Christian Casteyde
2022-05-17  6:58             ` Kai-Heng Feng
2022-05-17 17:37               ` casteyde.christian
2022-05-18  5:52                 ` Thorsten Leemhuis
2022-05-18  5:54                   ` Kai-Heng Feng
2022-05-18  6:37                     ` Thorsten Leemhuis
2022-05-18 20:15                       ` Limonciello, Mario [this message]
2022-05-23 13:02                       ` Christian Casteyde
2022-05-23 14:00                         ` Limonciello, Mario
2022-05-23 17:03                         ` Christian Casteyde
2022-05-23 22:01                           ` Limonciello, Mario
2022-05-24 20:54                           ` Christian Casteyde
2022-05-25  7:29                             ` Thorsten Leemhuis
2022-05-17 17:38               ` casteyde.christian
2022-05-17 18:13                 ` Limonciello, Mario
2022-05-18 11:02                   ` casteyde.christian
2022-05-19 17:31                     ` Limonciello, Mario
2022-05-19 18:07                       ` Limonciello, Mario
2022-05-18  2:08                 ` Kai-Heng Feng
2022-05-18  2:19                   ` Mario Limonciello
2022-05-18  7:15                   ` Christian Casteyde
2022-05-18 10:53                     ` casteyde.christian
2022-05-27  9:51   ` [REGRESSION] Laptop with Ryzen 4600H fails to resume video since 5.17.4 (works 5.17.3) #forregzbot Thorsten Leemhuis

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=MN0PR12MB6101666E2F0F4206FEDDF334E2D19@MN0PR12MB6101.namprd12.prod.outlook.com \
    --to=mario.limonciello@amd.com \
    --cc=Alexander.Deucher@amd.com \
    --cc=casteyde.christian@free.fr \
    --cc=gregkh@linuxfoundation.org \
    --cc=kai.heng.feng@canonical.com \
    --cc=regressions@leemhuis.info \
    --cc=regressions@lists.linux.dev \
    --cc=stable@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).