All of lore.kernel.org
 help / color / mirror / Atom feed
* linux 5.12 - fails to boot - soft lockup - CPU#0 stuck for 23s! - RIP smp_call_function_single
@ 2021-05-17  8:13 James Feeney
  2021-05-17  8:32 ` Borislav Petkov
  0 siblings, 1 reply; 28+ messages in thread
From: James Feeney @ 2021-05-17  8:13 UTC (permalink / raw)
  To: linux-smp; +Cc: Borislav Petkov, Jens Axboe

I re-ran my git bisect, this time with a full power-down and cold boot, and more thorough testing, running a web browser.  My second bisect went from good to bad.

So now, instead, git bisect ended here:

4f432e8bb15b352da72525144da025a46695968f is the first bad commit
commit 4f432e8bb15b352da72525144da025a46695968f
Author: Borislav Petkov <bp@suse.de>
Date:   Thu Jan 7 13:23:34 2021 +0100

    x86/mce: Get rid of mcheck_intel_therm_init()

    Move the APIC_LVTTHMR read which needs to happen on the BSP, to
    intel_init_thermal(). One less boot dependency.

    No functional changes.

    Signed-off-by: Borislav Petkov <bp@suse.de>
    Tested-by: Srinivas Pandruvada <srinivas.pandruvada@linux.intel.com>
    Link: https://lkml.kernel.org/r/20210201142704.12495-2-bp@alien8.de

 arch/x86/include/asm/mce.h            |  6 ------
 arch/x86/kernel/cpu/mce/core.c        |  1 -
 arch/x86/kernel/cpu/mce/therm_throt.c | 15 ++++-----------
 3 files changed, 4 insertions(+), 18 deletions(-)


Please let me know if that makes more sense.

Again:

Arch Linux
linux 5.12.arch1-1

Intel Core2 T7200
Mobile Intel 945PM Express Chipset
ICH7-M
Mobility Radeon X1600

Generally, on failure, the system will not boot past "Loading initial ramdisk...", or, when it does, the boot process will hang, and the console will eventually show:

watchdog: BUG: soft lockup - CPU#0 stuck for 23s! [systemd-udevd: 241]
...
RIP: 0010:smp_call_function_single+0xf7/0x140

The top of the call trace variously shows either "__flush_tlb_all" or "tlbflush_read_file", with the "soft lockup" repeating indefinitely.

If this is some race/timing issue on boot, I have to go back and re-test every "good" bisect, re-booting many times to see if there is *ever* a failure - and that is supposing that there is no interaction between whatever is causing the problem and all the other patches being added.  Any insight would be appreciated.


James

^ permalink raw reply	[flat|nested] 28+ messages in thread

end of thread, other threads:[~2021-05-31 21:46 UTC | newest]

Thread overview: 28+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2021-05-17  8:13 linux 5.12 - fails to boot - soft lockup - CPU#0 stuck for 23s! - RIP smp_call_function_single James Feeney
2021-05-17  8:32 ` Borislav Petkov
2021-05-19  3:58   ` James Feeney
2021-05-19 11:12     ` Borislav Petkov
2021-05-19 20:03       ` James Feeney
2021-05-19 21:18         ` Borislav Petkov
2021-05-20  3:12           ` James Feeney
2021-05-20  9:21             ` Borislav Petkov
2021-05-21 22:11               ` James Feeney
2021-05-22  9:06                 ` Borislav Petkov
2021-05-22 23:28                   ` James Feeney
2021-05-22 23:28                     ` James Feeney
2021-05-23 17:05                     ` Borislav Petkov
2021-05-23 23:02                       ` James Feeney
2021-05-24  7:51                         ` Borislav Petkov
2021-05-25  4:02                           ` James Feeney
2021-05-27 10:31                             ` [PATCH] x86/thermal: Fix LVT thermal setup for SMI delivery mode Borislav Petkov
2021-05-27 11:49                               ` Thomas Gleixner
2021-05-27 11:56                                 ` Borislav Petkov
2021-05-27 18:54                                 ` Borislav Petkov
2021-05-28  8:23                                   ` Thomas Gleixner
2021-05-28 11:19                                     ` Borislav Petkov
2021-05-31 18:26                                       ` James Feeney
2021-05-27 18:09                               ` Srinivas Pandruvada
2021-05-27 19:01                                 ` Borislav Petkov
2021-05-27 20:28                                   ` Srinivas Pandruvada
2021-05-28  7:05                               ` James Feeney
2021-05-31 21:46   ` [tip: x86/urgent] " tip-bot2 for Borislav Petkov

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.