xen-devel.lists.xenproject.org archive mirror
* [ANNOUNCE] Xen 4.15 - hard codefreeze slip by one week
@ 2021-02-10 15:58 Ian Jackson
  2021-02-19 16:47 ` [ANNOUNCE] Xen 4.15 - hard codefreeze today Ian Jackson
  0 siblings, 1 reply; 4+ messages in thread
From: Ian Jackson @ 2021-02-10 15:58 UTC
  To: committers; +Cc: xen-devel, community.manager

Hello.  Unfortunately we are having difficulty with osstest due to a
combination of an ill-timed Debian update and Linux kernel regressions
which got into the upstream stable trees and thence into Debian.  I
have been working to try to resolve this situation.  That has taken
time I should have been spending on release management and caused a
delay to pushes.  There have been a few other bugs, some in recently
introduced patches.  And we still have some bugs being investigated.

So I have decided to slip by one week.  Accordingly, here is the new
freeze status and remaining schedule:

  We are in feature freeze.  No new features should be committed to
  xen.git#staging.

  You may continue to commit straightforward bugfixes, docs changes, and
  new tests, without a release-ack.  Anything involving reorganisation
  or refactoring should get a release ack.  If in doubt please ask me
  and I will grant (or withhold) permission.

* Hard codefreeze (after which all patches will need a release manager
* ack) will occur on the 19th of February.
*
* Friday 19th February   Code freeze

       Bugfixes only, all changes to be approved by the Release Manager.

* Week of 19th March **tentative**    Release
       (probably Tuesday or Wednesday)

  Any patches containing substantial refactoring are to be treated as
  new features, even if their intent is to fix bugs.

  Freeze exceptions will not be routine, but may be granted in
  exceptional cases for small changes on the basis of risk assessment.
  Large series will not get exceptions.  Contributors *must not* rely on
  getting, or expect, a freeze exception.

  New or improved tests (supposing they do not involve refactoring,
  even build system reorganisation), and documentation improvements,
  will generally be treated as bugfixes.

  The release date is provisional and will be adjusted in the light
  of apparent code quality etc.

Thanks,
Ian.



* [ANNOUNCE] Xen 4.15 - hard codefreeze today
  2021-02-10 15:58 [ANNOUNCE] Xen 4.15 - hard codefreeze slip by one week Ian Jackson
@ 2021-02-19 16:47 ` Ian Jackson
  2021-02-19 16:54   ` Jan Beulich
  2021-02-20 14:07   ` Dario Faggioli
  0 siblings, 2 replies; 4+ messages in thread
From: Ian Jackson @ 2021-02-19 16:47 UTC
  To: committers; +Cc: xen-devel, community.manager

Today is the last day for committing anything to 4.15 without an
explicit release-ack.

Today, still:

    You may continue to commit straightforward bugfixes, docs changes, and
    new tests, without a release-ack.  Anything involving reorganisation
    or refactoring should get a release ack.

From the end of today, *all* changes must have a Release-Ack.

I intend to grant release-acks for bugfixes.  By and large I will try
to adopt a risk/benefit analysis.  As the freeze goes on I will grow
stricter.  I will be much more liberal with docs and tests.

So we are now here in the release schedule:

> * Friday 19th February   Code freeze
> 
>        Bugfixes only, all changes to be approved by the Release Manager.
> 
> * Week of 19th March **tentative**    Release
>        (probably Tuesday or Wednesday)
...
>   The release date is provisional and will be adjusted in the light
>   of apparent code quality etc.

My current list of issues I am tracking for the release is below.  If
you know about any of these issues please do let me know.

I'm slightly concerned that there are some issues on that list that
I'm not aware of any progress occurring on.  But maybe I haven't been
CC'd on all the mails, in which case I'd appreciate a summary update.

Also, please try to make sure that any patches targeted for 4.15
contain `4.15` in the Subject line.  Typically, something like this:
   [PATCH for-4.15 v2] re-invert the neutron polarisation
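
As an illustration of the Subject convention above, here is a minimal
sketch of a check that a patch subject names the release.  The helper
name `mentions_release` and its matching rule are assumptions made for
this example; it is not part of any Xen tooling, and it deliberately
treats `4.15.1` as a different version.

```python
import re


def mentions_release(subject: str, release: str = "4.15") -> bool:
    """Return True if `subject` contains `release` as a standalone version.

    Hypothetical helper for illustration only.
    """
    # The lookbehind rejects a longer version ending in the release
    # string (e.g. "14.15"); the lookahead rejects one that continues
    # with more digits or another version component (e.g. "4.150",
    # "4.15.1").  A sentence-ending full stop is still accepted.
    pattern = rf"(?<![\d.]){re.escape(release)}(?!\d|\.\d)"
    return re.search(pattern, subject) is not None


if __name__ == "__main__":
    for s in (
        "[PATCH for-4.15 v2] re-invert the neutron polarisation",
        "[PATCH v2] re-invert the neutron polarisation",
    ):
        print(mentions_release(s), s)
```

With git, passing `-v2 --subject-prefix="PATCH for-4.15"` to
`git format-patch` should yield a prefix of the form shown above.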

Thanks,
Ian.


OPEN ISSUES AND BLOCKERS
------------------------

C. Fallout from MSR handling behavioral change.

Information from
  Jan Beulich <jbeulich@suse.com>
  Andrew Cooper <andrew.cooper3@citrix.com>

Andrew writes:
| Bugs are "VMs which boot on earlier releases don't boot on
| 4.15 at the moment".
| 
| Still WIP and on my TODO list.

I think this
  [PATCH v2 4/4] tools/libs: Apply MSR policy to a guest
is probably part of the answer but it hasn't been committed yet.


D. Use-after-free in the IOMMU code

Information from
  Julien Grall <julien@xen.org>
References
 [PATCH for-4.15 0/4] xen/iommu: Collection of bug fixes for IOMMU teadorwn
 <20201222154338.9459-1-julien@xen.org>

Quoting the 0/:
| This series is a collection of bug fixes for the IOMMU teardown code.
| All of them are candidate for 4.15 as they can either leak memory or
| lead to host crash/host corruption.

These patches are still being discussed.  One went in, so now we are
talking about
  [PATCH v3 0/3] xen/iommu: Collection of bug fixes for IOMMU teadorwn


F. BUG: credit=sched2 machine hang when using DRAKVUF

Information from
  Dario Faggioli <dfaggioli@suse.com>
References
  https://lists.xen.org/archives/html/xen-devel/2020-05/msg01985.html
  https://lists.xenproject.org/archives/html/xen-devel/2020-10/msg01561.html
  https://bugzilla.opensuse.org/show_bug.cgi?id=1179246

Quoting Dario:
| Manifests only with certain combinations of hardware and workload.
| I'm not reproducing it, but there are multiple reports of it (see
| above). I'm investigating and trying to come up at least with
| debug patches that one of the reporters should be able and willing to
| test.

Dario is working on this.  Last update 29.1.21 ?


G. Null scheduler and vwfi native problem

Information from
  Dario Faggioli <dfaggioli@suse.com>

References
  https://lists.xenproject.org/archives/html/xen-devel/2021-01/msg01634.html

Quoting Dario:
| RCU issues, but manifests due to scheduler behavior (especially   
| NULL scheduler, especially on ARM).
|
| Patches that should solve the issue for ARM posted already. They
| will need to be slightly adjusted to cover x86 as well.

As of last update from Dario 29.1.21:
waiting for test report from submitter.


H. Ryzen 4000 (Mobile) Softlocks/Micro-stutters

Information from
  Dario Faggioli <dfaggioli@suse.com>

As of last update from Dario 29.1.21:
Discussions currently ongoing about the severity of this issue.


I. "x86/PV: avoid speculation abuse through guest accessors"

Information from
  Jan Beulich <jbeulich@suse.com>

| F. The almost-XSA "x86/PV: avoid speculation abuse through guest
| accessors" - the first 4 patches are needed to address the actual
| issue. The next 3 patches are needed to get the tree into
| consistent state again, identifier-wise. The remaining patches
| can probably wait.

The primary fixes for this have reviews and R-A and will be going in
shortly.  There is some followup work which needs to be reviewed and
acked.


J. x86/time: calibration rendezvous adjustments

Information from
  Jan Beulich <jbeulich@suse.com>

Not entirely a regression.  But 3 out of the 4 patches have reviews
and R-A and should be going in shortly.

Patch 4/ is RFC and it's not clear to me whether it's targeted at 4.15.


K. Problems with xl save / cancel

Information from Jürgen Groß:
  xl daemon won't kill the domain after it has gone through a
  suspend-cancel cycle.

Investigation is ongoing.  Not clear at this stage how big a blocker
this is.


L. ABI stability checking

   [PATCH for-4.15 00/10] tools: Support to use abi-dumper on libraries
   [PATCH v2 for-4.15] tools/libxl: Work around unintialised variable libxl__domain_get_device_model_uid()

This is testing/build work and will enable ABI checking of future
changes to 4.15 after its release.  I don't think it's a blocker but
it would be nice to have.  It has R-A and I think most acks now.


NEWLY CLOSED ISSUES
===================


A. HPET/PIT issue on newer Intel systems

Information from
  Andrew Cooper <andrew.cooper3@citrix.com>

| This has had literally tens of reports across the devel and users
| mailing lists, and prevents Xen from booting at all on the past two
| generations of Intel laptop.  I've finally got a repro and posted a
| fix to the list, but still in progress.

Fixed.  c/s e1de4c196a.


B. "scheduler broken" bugs.

Information from
  Andrew Cooper <andrew.cooper3@citrix.com>
  Dario Faggioli <dfaggioli@suse.com>

Quoting Andrew Cooper
| We've had 4 or 5 reports of Xen not working, and very little
| investigation on what's going on.  Suspicion is that there might be
| two bugs, one with smt=0 on recent AMD hardware, and one more
| general "some workloads cause negative credit" and might or might
| not be specific to credit2 (debugging feedback differs - also might
| be 3 underlying issues).

Dario has expanded on this and I am closing this one out in favour of
F, G, H.


PREVIOUS CLOSED ISSUES
======================

E. zstd support



* Re: [ANNOUNCE] Xen 4.15 - hard codefreeze today
  2021-02-19 16:47 ` [ANNOUNCE] Xen 4.15 - hard codefreeze today Ian Jackson
@ 2021-02-19 16:54   ` Jan Beulich
  2021-02-20 14:07   ` Dario Faggioli
  1 sibling, 0 replies; 4+ messages in thread
From: Jan Beulich @ 2021-02-19 16:54 UTC
  To: Ian Jackson; +Cc: xen-devel, community.manager, committers

On 19.02.2021 17:47, Ian Jackson wrote:
> J. x86/time: calibration rendezvous adjustments
> 
> Information from
>   Jan Beulich <jbeulich@suse.com>
> 
> Not entirely a regression.  But 3 out of the 4 patches have reviews
> and R-A and should be going in shortly.
> 
> Patch 4/ is RFC and it's not clear to me whether it's targeted at 4.15.

No, that's an intended optimization which can wait. I'm still
trying to work out how I could demonstrate that it actually
makes a difference.

Jan



* Re: [ANNOUNCE] Xen 4.15 - hard codefreeze today
  2021-02-19 16:47 ` [ANNOUNCE] Xen 4.15 - hard codefreeze today Ian Jackson
  2021-02-19 16:54   ` Jan Beulich
@ 2021-02-20 14:07   ` Dario Faggioli
  1 sibling, 0 replies; 4+ messages in thread
From: Dario Faggioli @ 2021-02-20 14:07 UTC
  To: Ian Jackson, committers; +Cc: xen-devel, community.manager


On Fri, 2021-02-19 at 16:47 +0000, Ian Jackson wrote:
> 
> OPEN ISSUES AND BLOCKERS
> ------------------------
> 
> F. BUG: credit=sched2 machine hang when using DRAKVUF
> 
> Information from
>   Dario Faggioli <dfaggioli@suse.com>
> References
>   https://lists.xen.org/archives/html/xen-devel/2020-05/msg01985.html
>   https://lists.xenproject.org/archives/html/xen-devel/2020-10/msg01561.html
>   https://bugzilla.opensuse.org/show_bug.cgi?id=1179246
> 
> Quoting Dario:
> > Manifests only with certain combinations of hardware and workload.
> > I'm not reproducing it, but there are multiple reports of it (see
> > above). I'm investigating and trying to come up at least with
> > debug patches that one of the reporters should be able and willing
> > to test.
> 
> Dario is working on this.  Last update 29.1.21 ?
> 
Yep. I have a few more insights about it, but I'm still not sure about a
few things. I'll try to give a more detailed update on Mon or Tue.

> G. Null scheduler and vwfi native problem
> 
> Information from
>   Dario Faggioli <dfaggioli@suse.com>
> 
> References
>   https://lists.xenproject.org/archives/html/xen-devel/2021-01/msg01634.html
> 
> Quoting Dario:
> > RCU issues, but manifests due to scheduler behavior (especially   
> > NULL scheduler, especially on ARM).
> > 
> > Patches that should solve the issue for ARM posted already. They
> > will need to be slightly adjusted to cover x86 as well.
> 
> As of last update from Dario 29.1.21:
> waiting for test report from submitter.
> 
Report recently arrived and was positive. The issue, on ARM, is solved
by the patches sent to him. I've done the x86 bits of those patches,
but am still debugging an error I have with them applied.

It probably makes sense for me to properly submit the two patches that
fix the problem on ARM right away (the x86 part would be in its own
patch anyway).

Thanks and Regards
-- 
Dario Faggioli, Ph.D
http://about.me/dario.faggioli
Virtualization Software Engineer
SUSE Labs, SUSE https://www.suse.com/
-------------------------------------------------------------------
<<This happens because _I_ choose it to happen!>> (Raistlin Majere)
