From mboxrd@z Thu Jan  1 00:00:00 1970
Return-Path: <linux-kernel-owner@vger.kernel.org>
Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand
        id S1753187AbcLBUle (ORCPT <rfc822;w@1wt.eu>);
        Fri, 2 Dec 2016 15:41:34 -0500
Received: from mail-ua0-f174.google.com ([209.85.217.174]:34994 "EHLO
        mail-ua0-f174.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org
        with ESMTP id S1751075AbcLBUlc (ORCPT
        <rfc822;linux-kernel@vger.kernel.org>);
        Fri, 2 Dec 2016 15:41:32 -0500
MIME-Version: 1.0
In-Reply-To: <CA+55aFwwa_A3RfodonHVDz+2xZ0+x+=MRPCVDqTkaG96SV5ZoQ@mail.gmail.com>
References: <cover.1480638597.git.luto@kernel.org> <cover.1480536936.git.luto@kernel.org>
 <0a21157c2233ba7d0781bbf07866b3f2d7e7c25d.1480638597.git.luto@kernel.org>
 <CA+55aFw3P++jGbRhkrM3MyqwZZ3mxmxQTC1BfYS2rhw6P3LKNA@mail.gmail.com>
 <CALCETrXCLLW9pd=h7dZ0u2oBSiRBJOBkxvEq2iz=Z8aTVtrZ=w@mail.gmail.com>
 <20161202180343.gehqor7lgtmzwqq3@pd.tnic> <CA+55aFyHptQ3=gjXA_LGGkvOgsRB9FBXvCT56fG4UqcFV4wFsA@mail.gmail.com>
 <20161202185008.tdziqrzi4a3axord@pd.tnic> <CA+55aFxuhzE0woFyRjZ8=Ji1EPR1+MohrbDm=2AiQH50dsptjg@mail.gmail.com>
 <20161202192050.l5l3rcwems6hptub@pd.tnic> <CA+55aFydg4RDv8a-Wpf=TyzPN59kONcUdCkk1KcSNMW_2Uix_g@mail.gmail.com>
 <CALCETrVtEg256EPMbp1j8RkbaMJNtNge6-h0EoZ3HmRo6DZCLQ@mail.gmail.com> <CA+55aFwwa_A3RfodonHVDz+2xZ0+x+=MRPCVDqTkaG96SV5ZoQ@mail.gmail.com>
From: Andy Lutomirski <luto@amacapital.net>
Date: Fri, 2 Dec 2016 12:41:11 -0800
Message-ID: <CALCETrXo1zLXcRX-tF-MQ5qa7sLrKoAYdn5ox-F-kp-3JhPaXg@mail.gmail.com>
Subject: Re: [PATCH v2 5/6] x86/xen: Add a Xen-specific sync_core() implementation
To: Linus Torvalds <torvalds@linux-foundation.org>
Cc: Borislav Petkov <bp@alien8.de>, Borislav Petkov <bp@kernel.org>,
        Andy Lutomirski <luto@kernel.org>, Peter Anvin <hpa@zytor.com>,
        "the arch/x86 maintainers" <x86@kernel.org>,
        One Thousand Gnomes <gnomes@lxorguk.ukuu.org.uk>,
        "linux-kernel@vger.kernel.org" <linux-kernel@vger.kernel.org>,
        Brian Gerst <brgerst@gmail.com>,
        Matthew Whitehead <tedheadster@gmail.com>,
        Henrique de Moraes Holschuh <hmh@hmh.eng.br>,
        Peter Zijlstra <peterz@infradead.org>,
        Andrew Cooper <andrew.cooper3@citrix.com>
Content-Type: text/plain; charset=UTF-8
Sender: linux-kernel-owner@vger.kernel.org
List-ID: <linux-kernel.vger.kernel.org>
X-Mailing-List: linux-kernel@vger.kernel.org

On Fri, Dec 2, 2016 at 11:35 AM, Linus Torvalds
<torvalds@linux-foundation.org> wrote:
> On Fri, Dec 2, 2016 at 11:30 AM, Andy Lutomirski <luto@amacapital.net> wrote:
>>
>> How's this?
>
> Looks ok. I do think that
>
>> I suppose it could be an unconditional IRET-to-self, but that's a good
>> deal slower and not a whole lot simpler.  Although if we start doing
>> it right, performance won't really matter here.
>
> Considering you already got the iret-to-self wrong in the first
> version, I really like the "one single unconditional version" so that
> everybody tests that _one_ thing and there isn't anything subtle going
> on.
>
> Hmm?

Okay, sold.  It makes the patchset much much shorter, too.

>
> And yes, if it turns out that performance matters, we almost certainly
> are doing something really wrong, and we shouldn't be using that
> sync_core() thing in that place anyway.

To play devil's advocate (and definitely out of scope for this
particular patchset), is user code permitted to do:

1. Map a page RX at one address and RW somewhere else (for nice ASLR).
2. Write to the RW mapping.
3. CPUID or IRET-to-self.
4. Jump to the RX mapping.

Because, if so, we should maybe serialize whenever we migrate a
process to a different CPU.  (We *definitely* need to flush the store
buffer when migrating, because the x86 architecture makes some memory
ordering promises that get broken if a store from a thread stays in
the store buffer of a different CPU when the thread gets migrated.)
And if we're going to start serializing when migrating a thread, then
we actually care about performance, in which case we should optimize
the crap out of this thing, which probably means using MFENCE on AMD
CPUs (AMD promises that MFENCE flushes the pipeline.  Intel seems to
be confused as to exactly what effect MFENCE has, or at least I'm
confused as to what Intel thinks MFENCE does.)  And we should make
sure that we only do the extra flush when we don't switch mms.

--Andy