All of lore.kernel.org
 help / color / mirror / Atom feed
From: Jan Beulich <jbeulich@suse.com>
To: Claudemir Todo Bom <claudemir@todobom.com>
Cc: "xen-devel@lists.xenproject.org" <xen-devel@lists.xenproject.org>,
	Juergen Gross <jgross@suse.com>,
	Boris Ostrovsky <boris.ostrovsky@oracle.com>
Subject: Re: Problems with APIC on versions 4.9 and later (4.8 works)
Date: Wed, 20 Jan 2021 09:50:37 +0100	[thread overview]
Message-ID: <ff799cd4-ba42-e120-107c-5011dc803b5a@suse.com> (raw)
In-Reply-To: <CANyqHYcifnCgd5C5vbYoi4CTtoMX5+jzGqHfs6JZ+e=d2Y_dmg@mail.gmail.com>

On 19.01.2021 20:36, Claudemir Todo Bom wrote:
> I do not have serial output on this setup, so I recorded a video with
> boot_delay=50 in order to be able to get all the kernel messages:
> https://youtu.be/y95h6vqoF7Y

This doesn't show any badness afaics.

> This is running 4.14 from debian bullseye (testing).
> 
> I'm also attaching the dmesg output when booting xen 4.8 with  the same
> kernel version and same parameters.
> 
> I visually compared all the messages, and the only thing I noticed was that
> 4.14 used tsc as clocksource and 4.8 used xen. I tried to boot the kernel
> with "clocksource=xen" and the problem is happening with that also.

There's some confusion here I suppose: The clock source you talk
about is the kernel's, not Xen's. I didn't think this would
change for the same kernel version with different Xen underneath,
but the Linux maintainers of the Xen code there may know better.
Cc-ing them.

> The "start" of the problem is that when the kernel gets to the "Freeing
> unused kernel image (initmem) memory: 2380K" it hangs and stays there for a
> while. After a few minutes it shows that a process (swapper) is blocked for
> sometime (image attached)

Now that's pretty unusual - the call trace seen in the screen
shot you had attached indicates the kernel didn't even make it
past its own initialization just yet. Just to have explored that
possibility - could you enable Xen's NMI watchdog (simply
"watchdog" on the Xen command line)? Among the boot messages
there ought to be one indicating whether it actually works on
your system. Without a serial console you wouldn't see anything
if it triggers, but the system would then never make it to the
kernel side issue.

As far as making sure we at least see all kernel messages -
are you having "ignore_loglevel" in place? I don't think I've
been able to spot the kernel command line anywhere in the video.

I'm afraid there's no real way around seeing the full Xen
messages, i.e. including possible ones while Dom0 already boots
(and allowing some debug keys to be issued, as the rcu_barrier
on the stack may suggest there's an issue with one of the
secondary CPUs). You could try whether "vag=keep" on the Xen
command line allows you to see more, but this option may have
quite severe an effect on the timing of Dom0's booting, which
may make an already bad situation worse.

Alternatively the kernel may need instrumenting to figure what
exactly it is that prevent forward progress.

There's one other wild guess you may want to try: "cpuidle=no"
on the Xen command line.

Jan


  parent reply	other threads:[~2021-01-20  8:50 UTC|newest]

Thread overview: 26+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2021-01-18 20:15 Problems with APIC on versions 4.9 and later (4.8 works) Claudemir Todo Bom
2021-01-19  9:07 ` Jan Beulich
2021-01-19 19:47   ` Claudemir Todo Bom
     [not found]   ` <CANyqHYcifnCgd5C5vbYoi4CTtoMX5+jzGqHfs6JZ+e=d2Y_dmg@mail.gmail.com>
2021-01-20  8:50     ` Jan Beulich [this message]
2021-01-20 15:13       ` Jürgen Groß
2021-01-20 20:13         ` Claudemir Todo Bom
2021-01-21  7:59           ` Jan Beulich
2021-01-22 12:55         ` Claudemir Todo Bom
2021-01-22 23:36         ` Claudemir Todo Bom
2021-01-25  9:38           ` Jan Beulich
2021-01-25 19:37             ` Claudemir Todo Bom
2021-01-26 11:48               ` Jan Beulich
2021-01-27 12:25                 ` Claudemir Todo Bom
     [not found]                 ` <CANyqHYeDR_NUKzPtbfLiUzxAUzerKepbU4B-_6=U-7Y6uy8gpQ@mail.gmail.com>
2021-01-28  9:47                   ` Jan Beulich
2021-01-28  9:49                     ` Jan Beulich
2021-01-28 13:08                       ` Claudemir Todo Bom
2021-01-29 14:21                         ` Jan Beulich
2021-01-29 15:07                           ` Claudemir Todo Bom
2021-01-29 16:24                         ` Jan Beulich
2021-01-29 19:31                           ` Claudemir Todo Bom
2021-02-01  7:51                             ` Jan Beulich
2021-02-01 12:47                             ` Jan Beulich
2021-02-01 14:46                               ` Claudemir Todo Bom
2021-02-01 15:09                                 ` Jan Beulich
2021-02-01 15:26                                   ` Claudemir Todo Bom
2021-02-01 16:48                                     ` Jan Beulich

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=ff799cd4-ba42-e120-107c-5011dc803b5a@suse.com \
    --to=jbeulich@suse.com \
    --cc=boris.ostrovsky@oracle.com \
    --cc=claudemir@todobom.com \
    --cc=jgross@suse.com \
    --cc=xen-devel@lists.xenproject.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.