linuxppc-dev.lists.ozlabs.org archive mirror
 help / color / mirror / Atom feed
* [Bug 203837] New: Booting kernel under KVM immediately freezes host
@ 2019-06-06 22:59 bugzilla-daemon
  2019-06-07  5:42 ` [Bug 203837] " bugzilla-daemon
                   ` (3 more replies)
  0 siblings, 4 replies; 6+ messages in thread
From: bugzilla-daemon @ 2019-06-06 22:59 UTC (permalink / raw)
  To: linuxppc-dev

https://bugzilla.kernel.org/show_bug.cgi?id=203837

            Bug ID: 203837
           Summary: Booting kernel under KVM immediately freezes host
           Product: Platform Specific/Hardware
           Version: 2.5
    Kernel Version: v5.2-rc2
          Hardware: PPC-64
                OS: Linux
              Tree: Mainline
            Status: NEW
          Severity: blocking
          Priority: P1
         Component: PPC-64
          Assignee: platform_ppc-64@kernel-bugs.osdl.org
          Reporter: shawn@anastas.io
        Regression: No

Created attachment 283133
  --> https://bugzilla.kernel.org/attachment.cgi?id=283133&action=edit
Guest kernel config

When booting kernel v5.2-rc2 (and confirmed up to 156c05917) in a VM on a
POWER9 host running kernel 5.1.7, the host immediately locks up and
becomes unresponsive to the point of requiring a hard reset.

The last guest kernel message printed to the screen before the
host locks up is:

[    0.013940] smp: Bringing up secondary CPUs ...

Due to the nature of the bug, it is very difficult to bisect, since a manual
host reset is required each time the bug is encountered. Also, my only
POWER machine is my primary workstation.

The bug has also been confirmed on other host kernel versions (down to 5.0.x).
When downgrading the guest kernel to 5.1.0, the issue is not present.

The guest kernel .config is attached.

-- 
You are receiving this mail because:
You are watching the assignee of the bug.

^ permalink raw reply	[flat|nested] 6+ messages in thread

* [Bug 203837] Booting kernel under KVM immediately freezes host
  2019-06-06 22:59 [Bug 203837] New: Booting kernel under KVM immediately freezes host bugzilla-daemon
@ 2019-06-07  5:42 ` bugzilla-daemon
  2019-06-07  6:29 ` bugzilla-daemon
                   ` (2 subsequent siblings)
  3 siblings, 0 replies; 6+ messages in thread
From: bugzilla-daemon @ 2019-06-07  5:42 UTC (permalink / raw)
  To: linuxppc-dev

https://bugzilla.kernel.org/show_bug.cgi?id=203837

Paul Mackerras (paulus@ozlabs.org) changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |paulus@ozlabs.org

--- Comment #1 from Paul Mackerras (paulus@ozlabs.org) ---
I have tried but not succeeded in replicating this problem.

I have tried 5.2-rc3 in the host with the config I usually use, plus 5.2-rc3 in
the guest with that same config. That boots just fine.

With 5.2-rc3 in the host and my usual config, and 5.2-rc3 in the guest compiled
with the config attached to this bug, the guest gets a kernel panic due to
being unable to mount root. It looks like it never manages to load virtio-blk
for some reason.

With the config attached to this bug, I did once see the guest stop outputting
messages after the message about bringing up CPUs. The host was still running
just fine, and top in the host showed the qemu-system-ppc64 process using 100%
of a CPU, consistent with the guest being in an infinite loop.

I think we need more details about the machine where the crash is occurring -
host kernel config, details of VM config (qemu command line or libvirt xml),
etc.

-- 
You are receiving this mail because:
You are watching the assignee of the bug.

^ permalink raw reply	[flat|nested] 6+ messages in thread

* [Bug 203837] Booting kernel under KVM immediately freezes host
  2019-06-06 22:59 [Bug 203837] New: Booting kernel under KVM immediately freezes host bugzilla-daemon
  2019-06-07  5:42 ` [Bug 203837] " bugzilla-daemon
@ 2019-06-07  6:29 ` bugzilla-daemon
  2019-06-10  6:27   ` Nicholas Piggin
  2019-06-10  6:30 ` bugzilla-daemon
  2019-06-11 19:42 ` bugzilla-daemon
  3 siblings, 1 reply; 6+ messages in thread
From: bugzilla-daemon @ 2019-06-07  6:29 UTC (permalink / raw)
  To: linuxppc-dev

https://bugzilla.kernel.org/show_bug.cgi?id=203837

--- Comment #2 from Paul Mackerras (paulus@ozlabs.org) ---
Just tried 5.1.7 in the host and got the guest locking up during boot. In xmon
I see one cpu in pmdp_invalidate and another in handle_mm_fault. It seems very
possible this is the bug that Nick Piggin's recent patch series fixes
("powerpc/64s: Fix THP PMD collapse serialisation"):

http://patchwork.ozlabs.org/project/linuxppc-dev/list/?series=112348

-- 
You are receiving this mail because:
You are watching the assignee of the bug.

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: [Bug 203837] Booting kernel under KVM immediately freezes host
  2019-06-07  6:29 ` bugzilla-daemon
@ 2019-06-10  6:27   ` Nicholas Piggin
  0 siblings, 0 replies; 6+ messages in thread
From: Nicholas Piggin @ 2019-06-10  6:27 UTC (permalink / raw)
  To: bugzilla-daemon, linuxppc-dev

bugzilla-daemon@bugzilla.kernel.org's on June 7, 2019 4:29 pm:
> https://bugzilla.kernel.org/show_bug.cgi?id=203837
> 
> --- Comment #2 from Paul Mackerras (paulus@ozlabs.org) ---
> Just tried 5.1.7 in the host and got the guest locking up during boot. In xmon
> I see one cpu in pmdp_invalidate and another in handle_mm_fault. It seems very
> possible this is the bug that Nick Piggin's recent patch series fixes
> ("powerpc/64s: Fix THP PMD collapse serialisation"):
> 
> http://patchwork.ozlabs.org/project/linuxppc-dev/list/?series=112348

It's worth a try, although the bug was introduced around 4.20 and
I wasn't able to trigger it on radix, but other timing changes
could cause it to trigger I suppose.

pdbg (https://github.com/open-power/pdbg) is a useful tool for your
BMC that can often get the CPU registers out even for bad crashes,
this might help to narrow down the problem without bisecting.

Thanks,
Nick


^ permalink raw reply	[flat|nested] 6+ messages in thread

* [Bug 203837] Booting kernel under KVM immediately freezes host
  2019-06-06 22:59 [Bug 203837] New: Booting kernel under KVM immediately freezes host bugzilla-daemon
  2019-06-07  5:42 ` [Bug 203837] " bugzilla-daemon
  2019-06-07  6:29 ` bugzilla-daemon
@ 2019-06-10  6:30 ` bugzilla-daemon
  2019-06-11 19:42 ` bugzilla-daemon
  3 siblings, 0 replies; 6+ messages in thread
From: bugzilla-daemon @ 2019-06-10  6:30 UTC (permalink / raw)
  To: linuxppc-dev

https://bugzilla.kernel.org/show_bug.cgi?id=203837

--- Comment #3 from npiggin@gmail.com ---
bugzilla-daemon@bugzilla.kernel.org's on June 7, 2019 4:29 pm:
> https://bugzilla.kernel.org/show_bug.cgi?id=203837
> 
> --- Comment #2 from Paul Mackerras (paulus@ozlabs.org) ---
> Just tried 5.1.7 in the host and got the guest locking up during boot. In
> xmon
> I see one cpu in pmdp_invalidate and another in handle_mm_fault. It seems
> very
> possible this is the bug that Nick Piggin's recent patch series fixes
> ("powerpc/64s: Fix THP PMD collapse serialisation"):
> 
> http://patchwork.ozlabs.org/project/linuxppc-dev/list/?series=112348

It's worth a try, although the bug was introduced around 4.20 and
I wasn't able to trigger it on radix, but other timing changes
could cause it to trigger I suppose.

pdbg (https://github.com/open-power/pdbg) is a useful tool for your
BMC that can often get the CPU registers out even for bad crashes,
this might help to narrow down the problem without bisecting.

Thanks,
Nick

-- 
You are receiving this mail because:
You are watching the assignee of the bug.

^ permalink raw reply	[flat|nested] 6+ messages in thread

* [Bug 203837] Booting kernel under KVM immediately freezes host
  2019-06-06 22:59 [Bug 203837] New: Booting kernel under KVM immediately freezes host bugzilla-daemon
                   ` (2 preceding siblings ...)
  2019-06-10  6:30 ` bugzilla-daemon
@ 2019-06-11 19:42 ` bugzilla-daemon
  3 siblings, 0 replies; 6+ messages in thread
From: bugzilla-daemon @ 2019-06-11 19:42 UTC (permalink / raw)
  To: linuxppc-dev

https://bugzilla.kernel.org/show_bug.cgi?id=203837

--- Comment #4 from Shawn Anastasio (shawn@anastas.io) ---
I have applied Nick's patchset to 5.1.7 but the issue still occurs.

As for using pdbg, I'm aware of the tool's existence but I'm not sure how
I would effectively use it to diagnose this issue. If anybody has some
pointers, it'd be appreciated.

-- 
You are receiving this mail because:
You are watching the assignee of the bug.

^ permalink raw reply	[flat|nested] 6+ messages in thread

end of thread, other threads:[~2019-06-11 19:44 UTC | newest]

Thread overview: 6+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2019-06-06 22:59 [Bug 203837] New: Booting kernel under KVM immediately freezes host bugzilla-daemon
2019-06-07  5:42 ` [Bug 203837] " bugzilla-daemon
2019-06-07  6:29 ` bugzilla-daemon
2019-06-10  6:27   ` Nicholas Piggin
2019-06-10  6:30 ` bugzilla-daemon
2019-06-11 19:42 ` bugzilla-daemon

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).