From mboxrd@z Thu Jan  1 00:00:00 1970
Return-Path: <xen-devel-bounces@lists.xenproject.org>
X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on
	aws-us-west-2-korg-lkml-1.web.codeaurora.org
Received: from lists.xenproject.org (lists.xenproject.org [192.237.175.120])
	(using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits))
	(No client certificate requested)
	by smtp.lore.kernel.org (Postfix) with ESMTPS id 33E78C6FA82
	for <xen-devel@archiver.kernel.org>; Tue, 20 Sep 2022 18:28:14 +0000 (UTC)
Received: from list by lists.xenproject.org with outflank-mailman.409556.652531 (Exim 4.92)
	(envelope-from <xen-devel-bounces@lists.xenproject.org>)
	id 1oahyG-0006af-F4; Tue, 20 Sep 2022 18:27:48 +0000
X-Outflank-Mailman: Message body and most headers restored to incoming version
Received: by outflank-mailman (output) from mailman id 409556.652531; Tue, 20 Sep 2022 18:27:48 +0000
Received: from localhost ([127.0.0.1] helo=lists.xenproject.org)
	by lists.xenproject.org with esmtp (Exim 4.92)
	(envelope-from <xen-devel-bounces@lists.xenproject.org>)
	id 1oahyG-0006aY-AW; Tue, 20 Sep 2022 18:27:48 +0000
Received: by outflank-mailman (input) for mailman id 409556;
 Tue, 20 Sep 2022 18:27:46 +0000
Received: from se1-gles-flk1-in.inumbo.com ([94.247.172.50]
 helo=se1-gles-flk1.inumbo.com)
 by lists.xenproject.org with esmtp (Exim 4.92) (envelope-from
 <SRS0=3LiI=ZX=gmail.com=tamas.k.lengyel@srs-se1.protection.inumbo.net>)
 id 1oahyE-0006aO-Ig
 for xen-devel@lists.xenproject.org; Tue, 20 Sep 2022 18:27:46 +0000
Received: from mail-yw1-x112b.google.com (mail-yw1-x112b.google.com
 [2607:f8b0:4864:20::112b])
 by se1-gles-flk1.inumbo.com (Halon) with ESMTPS
 id eb83ac58-3911-11ed-bad8-01ff208a15ba;
 Tue, 20 Sep 2022 20:27:45 +0200 (CEST)
Received: by mail-yw1-x112b.google.com with SMTP id
 00721157ae682-333a4a5d495so37113977b3.10
 for <xen-devel@lists.xenproject.org>; Tue, 20 Sep 2022 11:27:45 -0700 (PDT)
X-BeenThere: xen-devel@lists.xenproject.org
List-Id: Xen developer discussion <xen-devel.lists.xenproject.org>
List-Unsubscribe: <https://lists.xenproject.org/mailman/options/xen-devel>,
 <mailto:xen-devel-request@lists.xenproject.org?subject=unsubscribe>
List-Post: <mailto:xen-devel@lists.xenproject.org>
List-Help: <mailto:xen-devel-request@lists.xenproject.org?subject=help>
List-Subscribe: <https://lists.xenproject.org/mailman/listinfo/xen-devel>,
 <mailto:xen-devel-request@lists.xenproject.org?subject=subscribe>
Errors-To: xen-devel-bounces@lists.xenproject.org
Precedence: list
Sender: "Xen-devel" <xen-devel-bounces@lists.xenproject.org>
X-Inumbo-ID: eb83ac58-3911-11ed-bad8-01ff208a15ba
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;
        d=gmail.com; s=20210112;
        h=cc:to:subject:message-id:date:from:in-reply-to:references
         :mime-version:from:to:cc:subject:date;
        bh=dZ/p0pP7pAlxBAk0vf02AC0UOh0FnkUvXme5D2aPjwQ=;
        b=mCvPc5yJyjhMEluZQEERiVd6MYmKGQM6ACH7X3j1YvR+tGoqtYRg9HAyFH/PAmcodq
         Z9NXJBjXcr+7PyLm8NHT0t2xx1bmOHfx+oSBg9C1UB0h4cK3Mf8QvA/2WNCze0j4Qk8b
         SWV5zdwta5W3XKSFJ5/I4MUqq3BSHYd2LHNkTCfx38rXynZcnvoP0v+GhLY73jhSzjrX
         N6QC54oeuYzFlwJXopOVeCv+z8QZszpXYPl9eBhC5rg3p6aVrR3k64eJnLosVRvEsEKt
         kq8XIpI1VJ5iMwKQw6ThK59csUOFuPz7s09ldmEF8+iYemzHgsodtPtY2cTFBPCTfmce
         hSxQ==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;
        d=1e100.net; s=20210112;
        h=cc:to:subject:message-id:date:from:in-reply-to:references
         :mime-version:x-gm-message-state:from:to:cc:subject:date;
        bh=dZ/p0pP7pAlxBAk0vf02AC0UOh0FnkUvXme5D2aPjwQ=;
        b=dB+Ph2jTEK3BKCO3of7Cf0UC5uDw3HroNWTf3jqp7d14UXvio7P5vsZfidrtd2ejtx
         3M56ocRRb9YlGKlCpfgaPIn8W8yomrCDuoOLk7MQqX2R+f7A0zrvKE+f69RiDA9HU1JV
         I+YlS9QYmoBkSjCQigeqoe2SIqEDm4+qtBtfwHVeOrJA/SHAnajd2wQXXEmhJA3SZ7Yf
         4iCqoAoULarEdYEWFIugYBrn2YlcC2Ic9iy9g+9qJxhzyXfiOFfBEaQKnCpHa3b7YdsH
         T9dg7OuDLkHFhWfj0OWFYP4kRdyZCl3Vr3D+8OzaASyrKZN6e4/z2doq+qXwkimI0tck
         ohHw==
X-Gm-Message-State: ACrzQf1RZUeLTmtIWByio3HFHBK7ggsg0MeG5ZT/w9UiOJzBu+vwNNDS
	tXm/0cI318GLpjxHncXD4ehnQkyTpDung7Lc9iY=
X-Google-Smtp-Source: AMsMyM5GuZCsEut4gu0cJ947mnyj4cHdrO+GZFZGiFnSNXE0RG/0xYgPxS/DzUnrBtT1dmSns1DyyFJqXgWhenuvvmE=
X-Received: by 2002:a81:6ed4:0:b0:345:2c35:a203 with SMTP id
 j203-20020a816ed4000000b003452c35a203mr21537293ywc.262.1663698464240; Tue, 20
 Sep 2022 11:27:44 -0700 (PDT)
MIME-Version: 1.0
References: <8294476a707d7f20799a40479cc0bf9a1cf07673.1663249988.git.tamas.lengyel@intel.com>
 <9fa4a72f-8b38-fbf0-40c7-8fcd6b34cf10@suse.com> <9e79155f-0836-1b18-b648-b6e5422e89c8@oracle.com>
 <737386d2-ceeb-ad97-39a1-42585b913b9d@suse.com> <CABfawhk+TByRwVvGjv-e6-2UFeO7g6xBz3-CR_QOtYM2_37U=Q@mail.gmail.com>
 <29d29f64-b799-d56c-1292-661fb5127728@suse.com> <CABfawhnRUhQAc0cRybz8sLLkxjuZCO6JVA5QYHBERG7gf0zpZQ@mail.gmail.com>
 <406b7f6e-d092-fb6a-d0dd-60a9743027f6@suse.com> <CABfawhmrnL1HGOWS1fkEv5X4CwfkrBj-+APJ=hM1GCzzgjW4zA@mail.gmail.com>
 <5d1b06f0-fc20-585e-9da0-fb24c5931ad3@suse.com> <ffc59d24-7862-b7fb-e11e-b5f773129b0c@oracle.com>
 <8c0c9e20-f3d5-86fc-647f-ee89d63f2118@suse.com> <4d317c1e-3481-6d9e-c5ab-dfd9c559d89d@oracle.com>
 <a895f8ef-1135-7a44-07db-3c2f3d685a1a@suse.com> <2c0e506a-e494-68d5-f4d0-1b54cca2b970@oracle.com>
In-Reply-To: <2c0e506a-e494-68d5-f4d0-1b54cca2b970@oracle.com>
From: Tamas K Lengyel <tamas.k.lengyel@gmail.com>
Date: Tue, 20 Sep 2022 14:27:08 -0400
Message-ID: <CABfawh=2tgzwjYw52fWdZQLmpFAUVOWJ=KMTb4hfVno2UCSaDg@mail.gmail.com>
Subject: Re: [PATCH] x86/vpmu: fix race-condition in vpmu_load
To: Boris Ostrovsky <boris.ostrovsky@oracle.com>
Cc: Jan Beulich <jbeulich@suse.com>, Andrew Cooper <andrew.cooper3@citrix.com>, 
	=?UTF-8?Q?Roger_Pau_Monn=C3=A9?= <roger.pau@citrix.com>, Wei Liu <wl@xen.org>, 
	xen-devel@lists.xenproject.org, Tamas K Lengyel <tamas.lengyel@intel.com>
Content-Type: multipart/alternative; boundary="000000000000f97bec05e91ffb61"

--000000000000f97bec05e91ffb61
Content-Type: text/plain; charset="UTF-8"

On Tue, Sep 20, 2022 at 2:12 PM Boris Ostrovsky <boris.ostrovsky@oracle.com>
wrote:

>
> On 9/20/22 10:54 AM, Jan Beulich wrote:
> > On 20.09.2022 16:26, Boris Ostrovsky wrote:
> >> On 9/20/22 4:01 AM, Jan Beulich wrote:
> >>> On 20.09.2022 00:42, Boris Ostrovsky wrote:
> >>>> It is saving vpmu data from current pcpu's MSRs for a remote vcpu so
> @v
> >>>> in vmx_find_msr() is not @current:
> >>>>
> >>>>         vpmu_load()
> >>>>             ...
> >>>>             prev = per_cpu(last_vcpu, pcpu);
> >>>>             vpmu_save_force(prev)
> >>>>                 core2_vpmu_save()
> >>>>                     __core2_vpmu_save()
> >>>>                         vmx_read_guest_msr()
> >>>>                             vmx_find_msr()
> >>>>
> >>>>
> >>>> The call to vmx_find_msr() was introduced by 755087eb9b10c. I wonder
> though whether
> >>>> this call is needed when code path above is executed (i.e. when we
> are saving
> >>>> remove vcpu)
> >>> How could it not be needed? We need to obtain the guest value. The
> >>> thing I don't understand is why this forced saving is necessary,
> >>> when context_switch() unconditionally calls vpmu_switch_from().
> >>
> >> IIRC the logic is:
> >>
> >> 1. vcpuA runs on pcpu0
> >> 2. vcpuA is de-scheduled and is selected to run on pcpu1. It has not
> yet called vpmu_load() from pcpu1
> > The calling of vpmu_load() shouldn't matter here. What does matter is
> > that vpmu_save() was necessarily called already.
>
>
> I thought we don't always save MSRs on context switch because we optimize
> for case when the same vcpu gets scheduled again. But I am not sure I see
> this now that I am looking at the code.
>

I see context_switch calling vpmu_save_from before __context_switch, but if
that did actually save the vPMU state it would have reset
VPMU_CONTEXT_LOADED, so by the time vpmu_load calls vpmu_save_force it
would have just bailed before hitting the ASSERT failure in vmx_find_msrs.
The context must still be loaded at that point (I'm trying to verify right
now by adding some printks).

Tamas

--000000000000f97bec05e91ffb61
Content-Type: text/html; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr"><div dir=3D"ltr"><br></div><br><div class=3D"gmail_quote">=
<div dir=3D"ltr" class=3D"gmail_attr">On Tue, Sep 20, 2022 at 2:12 PM Boris=
 Ostrovsky &lt;<a href=3D"mailto:boris.ostrovsky@oracle.com">boris.ostrovsk=
y@oracle.com</a>&gt; wrote:<br></div><blockquote class=3D"gmail_quote" styl=
e=3D"margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);paddin=
g-left:1ex"><br>
On 9/20/22 10:54 AM, Jan Beulich wrote:<br>
&gt; On 20.09.2022 16:26, Boris Ostrovsky wrote:<br>
&gt;&gt; On 9/20/22 4:01 AM, Jan Beulich wrote:<br>
&gt;&gt;&gt; On 20.09.2022 00:42, Boris Ostrovsky wrote:<br>
&gt;&gt;&gt;&gt; It is saving vpmu data from current pcpu&#39;s MSRs for a =
remote vcpu so @v<br>
&gt;&gt;&gt;&gt; in vmx_find_msr() is not @current:<br>
&gt;&gt;&gt;&gt;<br>
&gt;&gt;&gt;&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0vpmu_load()<br>
&gt;&gt;&gt;&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0...<br>
&gt;&gt;&gt;&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0prev =3D pe=
r_cpu(last_vcpu, pcpu);<br>
&gt;&gt;&gt;&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0vpmu_save_f=
orce(prev)<br>
&gt;&gt;&gt;&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=
=A0core2_vpmu_save()<br>
&gt;&gt;&gt;&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=
=A0 =C2=A0 =C2=A0__core2_vpmu_save()<br>
&gt;&gt;&gt;&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=
=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0vmx_read_guest_msr()<br>
&gt;&gt;&gt;&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=
=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0vmx_find_msr()<br>
&gt;&gt;&gt;&gt;<br>
&gt;&gt;&gt;&gt;<br>
&gt;&gt;&gt;&gt; The call to vmx_find_msr() was introduced by 755087eb9b10c=
. I wonder though whether<br>
&gt;&gt;&gt;&gt; this call is needed when code path above is executed (i.e.=
 when we are saving<br>
&gt;&gt;&gt;&gt; remove vcpu)<br>
&gt;&gt;&gt; How could it not be needed? We need to obtain the guest value.=
 The<br>
&gt;&gt;&gt; thing I don&#39;t understand is why this forced saving is nece=
ssary,<br>
&gt;&gt;&gt; when context_switch() unconditionally calls vpmu_switch_from()=
.<br>
&gt;&gt;<br>
&gt;&gt; IIRC the logic is:<br>
&gt;&gt;<br>
&gt;&gt; 1. vcpuA runs on pcpu0<br>
&gt;&gt; 2. vcpuA is de-scheduled and is selected to run on pcpu1. It has n=
ot yet called vpmu_load() from pcpu1<br>
&gt; The calling of vpmu_load() shouldn&#39;t matter here. What does matter=
 is<br>
&gt; that vpmu_save() was necessarily called already.<br>
<br>
<br>
I thought we don&#39;t always save MSRs on context switch because we optimi=
ze for case when the same vcpu gets scheduled again. But I am not sure I se=
e this now that I am looking at the code.<br></blockquote><div><br></div><d=
iv>I see context_switch calling vpmu_save_from before __context_switch, but=
 if that did actually save the vPMU state it would have reset VPMU_CONTEXT_=
LOADED, so by the time vpmu_load calls vpmu_save_force it would have just b=
ailed before hitting the ASSERT failure in vmx_find_msrs. The context must =
still be loaded at that point (I&#39;m trying to verify right now by adding=
 some printks).</div><div><br></div><div>Tamas<br></div></div></div>

--000000000000f97bec05e91ffb61--