From: Ian Jackson <iwj@xenproject.org>
To: Dario Faggioli <dfaggioli@suse.com>
Cc: committers@xenproject.org, xen-devel@lists.xenproject.org,
	Andrew Cooper <andrew.cooper3@citrix.com>,
	Jan Beulich  <jbeulich@suse.com>, Julien Grall <julien@xen.org>,
	community.manager@xenproject.org
Subject: Re: [ANNOUNCE] Xen 4.15 - call for notification/status of significant bugs
Date: Thu, 4 Feb 2021 15:12:43 +0000
Message-ID: <24604.3819.903469.786536@mariner.uk.xensource.com>
In-Reply-To: <6d0d7181bad79259aff28351621d2ac1eeaca113.camel@suse.com>

Dario Faggioli writes ("Re: [ANNOUNCE] Xen 4.15 - call for notification/status of significant bugs"):
> On Thu, 2021-02-04 at 12:12 +0000, Ian Jackson wrote:
> > I reviewed a thread about this and it is not clear to me where we are
> > with this.
>
> Ok, let me try to summarize the current status.

Thanks.

> - BUG: credit=sched2 machine hang when using DRAKVUF
> 
>   https://lists.xen.org/archives/html/xen-devel/2020-05/msg01985.html
>   https://lists.xenproject.org/archives/html/xen-devel/2020-10/msg01561.html
>   https://bugzilla.opensuse.org/show_bug.cgi?id=1179246
> 
>   99% sure that it's a Credit2 scheduler issue.
>   I'm actively working on it.
>   "Seems a tricky one; I'm still in the analysis phase"
> 
>   Manifests only with certain combinations of hardware and workload.
>   I cannot reproduce it myself, but there are multiple reports of it
>   (see above). I'm investigating and trying to come up with, at least,
>   debug patches that one of the reporters should be able and willing
>   to test.

I think this is a clear blocker for 4.15.  I will call it "F".

> - Null scheduler and vwfi native problem
> 
>   https://lists.xenproject.org/archives/html/xen-devel/2021-01/msg01634.html
> 
>   An RCU issue, but it manifests due to scheduler behavior (especially
>   with the NULL scheduler, especially on ARM).
>   I'm actively working on it.
> 
>   Patches that should solve the issue for ARM have been posted already.
>   They will need to be slightly adjusted to cover x86 as well. Waiting a
>   couple of days more for confirmation from the reporter that the
>   patches do help, at least on ARM.

I'm not sure whether this is a blocker but it looks like it is going
to be fixed so I will keep it on my list.  I will call it "G".


> - Xen crash after S3 suspend - Xen 4.13
> 
>   https://lists.xen.org/archives/html/xen-devel/2020-03/msg01251.html
>   https://lists.xen.org/archives/html/xen-devel/2021-01/msg02620.html
> 
>   S3 suspend issue, but root cause seems to be in the scheduler.
> 
>   Marek is, as usual, providing good info and feedback. It comes
>   third on my list (below the two above, basically), but I will look
>   into it.

This is not a blocker so I won't track it explicitly but I would
very much welcome a fix if it is simple or comes quickly.


> - Ryzen 4000 (Mobile) Softlocks/Micro-stutters
> 
>   https://lists.xenproject.org/archives/html/xen-devel/2020-10/msg00966.html
> 
>   Seems like it could be scheduling, but the amount of info is limited.
> 
>   What we know is that with `dom0_max_vcpus=1 dom0_vcpus_pin`, all 
>   schedulers seem to work fine. Without those params, Credit2 is the 
>   "least bad", although not satisfactory. Other schedulers don't even 
>   boot.
>   The fact is, it is reported to occur on QubesOS, which has its own
>   downstream patches, plus there are no logs.
>   There's a feeling that this (together with others) hints at SMT off
>   having issues on AMD (Ryzen?), but again, it's not crystal clear to
>   me whether this is the issue (or an issue at all) and, if yes, in
>   what subsystem the problem lies.
>   I can try to have a look, mostly to try to understand whether or
>   not it is really the case that some AMDs have issues with SMT=off.
>   But that will probably be after I'm done with the other issues
>   I've mentioned above.

I'm not sure whether you are saying (a) our current code is not
useable on this hardware because of this issue, or on the other hand
(b) you think the issue is specific to downstream patches ?

Do you think I should consider this a blocker for 4.15 ?


> - Recent upgrade of 4.13 -> 4.14 issue
> 
>   https://lists.xenproject.org/archives/html/xen-devel/2020-10/msg01800.html 
> 
>   In my judgment, it's not at all clear whether or not this is a
>   scheduler issue. And at least with the amount of info that we have
>   so far, I'd lean toward "no, it's not". I'm happy to help with it
>   anyway, of course, but it comes after the others.

Again, I think this is not a regression so not a blocker for 4.15.


> So, Ian, was this of any help?

Yes, very much so, thank you.

Ian.

