From: Steve Wahl <steve.wahl@hpe.com>
To: Eric Hagberg <ehagberg@gmail.com>
Cc: Steve Wahl <steve.wahl@hpe.com>,
	me@pavinjoseph.com, dave.hansen@linux.intel.com,
	regressions@lists.linux.dev, stable@vger.kernel.org
Subject: Re: [REGRESSION] kexec does firmware reboot in kernel v6.7.6
Date: Tue, 12 Mar 2024 10:13:39 -0500
Message-ID: <ZfBxIykq3LwPq34M@swahl-home.5wahls.com>
In-Reply-To: <CAJbxNHfPHpbzRwfuFw6j7SxR1OsgBH2VJFPnchBHTtRueJna4A@mail.gmail.com>

On Tue, Mar 12, 2024 at 07:04:10AM -0400, Eric Hagberg wrote:
> On Thu, Mar 7, 2024 at 11:33 AM Steve Wahl <steve.wahl@hpe.com> wrote:
> > What Linux Distribution are you running on that machine?  My guess
> > would be that this is not distro related; if you are running something
> > quite different from Pavin that would confirm this.
> 
> Distro in use is Rocky 8, so it’s pretty clear not to be distro-specific.
> 
> > I found an AMD based system to try to reproduce this on.
> 
> yeah, it probably requires either a specific cpu or set of devices plus cpu
> to trigger… found that it also affects Dell R7625 servers in addition to
> the R6615s

I agree that it's likely the CPU or a particular set of surrounding
devices that triggers the problem.

I have not succeeded in reproducing the problem yet. I tried an AMD
based system lent to me, but it's probably the wrong generation (AMD
EPYC 7251) and I didn't see the problem.  I have a line on a system
that's more in line with the systems the bug was reported on that I
should be able to try tomorrow.

I would love to have some direction from the community at large on
this.  The fact that nogbpages on the command line causes the same
problem without my patch suggests it's not bad code directly in my
patch, but something in the way kexec reacts to the resulting identity
map.  One quick solution would be a kernel command line parameter to
select between the previous identity map creation behavior and the new
behavior.  E.g. in addition to "nogbpages", we could have
"somegbpages" and "allgbpages" -- or gbpages=[all, some, none] with
nogbpages a synonym for backwards compatibility.
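
To make the proposed option concrete, here is a minimal userspace sketch
of how a gbpages=[all, some, none] selector with "nogbpages" as a synonym
for "none" could be parsed.  It is not kernel code: the enum, the function
names, and the "last option wins" rule are all assumptions made for
illustration, and it deliberately leaves open which mode would correspond
to the previous identity map behavior and which to the new one.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical mode names; nothing here exists in the kernel today. */
enum gbpages_mode { GBPAGES_ALL, GBPAGES_SOME, GBPAGES_NONE };

/* Return nonzero when the token [tok, tok + len) equals word exactly. */
int token_is(const char *tok, size_t len, const char *word)
{
	return strlen(word) == len && memcmp(tok, word, len) == 0;
}

/*
 * Scan a command line whose options are separated by spaces and return
 * the selected mode.  Assumed rules: the last matching option wins, and
 * unknown values such as "gbpages=bogus" leave the current mode alone.
 */
enum gbpages_mode parse_gbpages(const char *cmdline, enum gbpages_mode def)
{
	enum gbpages_mode mode = def;
	const char *p = cmdline;

	assert(cmdline != NULL);

	while (*p) {
		size_t len;

		while (*p == ' ')		/* skip separators */
			p++;
		len = strcspn(p, " ");		/* length of this token */
		if (len == 0)
			break;			/* reached end of string */

		if (token_is(p, len, "nogbpages") ||
		    token_is(p, len, "gbpages=none"))
			mode = GBPAGES_NONE;	/* legacy spelling kept as synonym */
		else if (token_is(p, len, "gbpages=some"))
			mode = GBPAGES_SOME;
		else if (token_is(p, len, "gbpages=all"))
			mode = GBPAGES_ALL;

		p += len;
	}
	return mode;
}
```

With these assumed rules, parse_gbpages("ro quiet nogbpages", GBPAGES_ALL)
yields GBPAGES_NONE, and a command line with no gbpages option returns
whatever default the caller passed in.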

But I don't want to introduce a new command line parameter if the
actual problem can be understood and fixed.  The question is how much
time do I have to pursue a direct fix before some other action needs
to be taken?

Thanks,

--> Steve Wahl

-- 
Steve Wahl, Hewlett Packard Enterprise
