From mboxrd@z Thu Jan 1 00:00:00 1970 From: Dario Faggioli Subject: Re: [xen-unstable test] 88047: regressions - FAIL Date: Fri, 1 Apr 2016 16:40:00 +0200 Message-ID: <1459521600.5082.280.camel@citrix.com> References: <56FE5ED802000078000E1F8D@prv-mh.provo.novell.com> <22270.31070.725580.163612@mariner.uk.xensource.com> <56FE974302000078000E213B@prv-mh.provo.novell.com> Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============7896290127234563698==" Return-path: Received: from mail6.bemta3.messagelabs.com ([195.245.230.39]) by lists.xenproject.org with esmtp (Exim 4.84_2) (envelope-from ) id 1am0FM-0002AP-Jv for xen-devel@lists.xenproject.org; Fri, 01 Apr 2016 14:40:24 +0000 In-Reply-To: <56FE974302000078000E213B@prv-mh.provo.novell.com> List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Errors-To: xen-devel-bounces@lists.xen.org Sender: "Xen-devel" To: Jan Beulich , Ian Jackson Cc: xen-devel , osstest-admin@xenproject.org List-Id: xen-devel@lists.xenproject.org --===============7896290127234563698== Content-Type: multipart/signed; micalg=pgp-sha1; protocol="application/pgp-signature"; boundary="=-1UJJcAtOFEVQqyaAjaZ8" --=-1UJJcAtOFEVQqyaAjaZ8 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable On Fri, 2016-04-01 at 07:44 -0600, Jan Beulich wrote: > > On 01.04.16 at 15:36, wrote: > > That message is from libvirt.=C2=A0=C2=A0I don't know the exact situati= on. > >=20 > > But we have seen other timeouts with merlot[01].=C2=A0=C2=A0They seem t= o > > sometimes have weird stalls when running under Xen. > Might it be worth trying whether that's a deep C-state issue (by > temporarily limiting their use of deep C states)? >=20 IIRC, it's the box(es) with the weird NUMA setup... something about nodes without any memory: Apr=C2=A0=C2=A01 01:03:31.095559 (XEN) SRAT: Node 0 PXM 0 0-a0000 Apr=C2=A0=C2=A01 01:03:31.103546 (XEN) SRAT: Node 0 PXM 0 100000-c0000000 Apr=C2=A0=C2=A01 01:03:31.103565 (XEN) SRAT: Node 0 PXM 0 100000000-2400000= 00 Apr=C2=A0=C2=A01 01:03:31.111600 (XEN) SRAT: Node 2 PXM 2 240000000-43f0000= 00 Apr=C2=A0=C2=A01 01:03:31.111619 (XEN) NUMA: Allocated memnodemap from 43ee= 16000 - 43ee1b000 Apr=C2=A0=C2=A01 01:03:31.119615 (XEN) NUMA: Using 8 for the hash shift. Apr=C2=A0=C2=A01 01:03:31.127613 (XEN) SRAT: Node 1 has no memory. BIOS Bug= or mis-configured hardware? Apr=C2=A0=C2=A01 01:03:31.135548 (XEN) SRAT: Node 3 has no memory. BIOS Bug= or mis-configured hardware? Not that I see how this could cause the long latencies / timeouts, though..= . :-/ In any case, I think Jan's C states suggestion is worth a shot. Regards, Dario --=20 <> (Raistlin Majere) ----------------------------------------------------------------- Dario Faggioli, Ph.D, http://about.me/dario.faggioli Senior Software Engineer, Citrix Systems R&D Ltd., Cambridge (UK) --=-1UJJcAtOFEVQqyaAjaZ8 Content-Type: application/pgp-signature; name="signature.asc" Content-Description: This is a digitally signed message part Content-Transfer-Encoding: 7bit -----BEGIN PGP SIGNATURE----- Version: GnuPG v2 iEYEABECAAYFAlb+iEEACgkQk4XaBE3IOsTLbgCbB7YfzhmgKGjUVlQE22x6Y/3c AlkAn1OlhEXbXCSEB0eqjNtM38ucmFE0 =WZrQ -----END PGP SIGNATURE----- --=-1UJJcAtOFEVQqyaAjaZ8-- --===============7896290127234563698== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KWGVuLWRldmVs IG1haWxpbmcgbGlzdApYZW4tZGV2ZWxAbGlzdHMueGVuLm9yZwpodHRwOi8vbGlzdHMueGVuLm9y Zy94ZW4tZGV2ZWwK --===============7896290127234563698==--