linux-arch.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Andy Lutomirski <luto@kernel.org>
To: "Yu, Yu-cheng" <yu-cheng.yu@intel.com>
Cc: Andy Lutomirski <luto@kernel.org>,
	"H.J. Lu" <hjl.tools@gmail.com>, X86 ML <x86@kernel.org>,
	"H. Peter Anvin" <hpa@zytor.com>,
	Thomas Gleixner <tglx@linutronix.de>,
	Ingo Molnar <mingo@redhat.com>,
	LKML <linux-kernel@vger.kernel.org>,
	"open list:DOCUMENTATION" <linux-doc@vger.kernel.org>,
	Linux-MM <linux-mm@kvack.org>,
	linux-arch <linux-arch@vger.kernel.org>,
	Linux API <linux-api@vger.kernel.org>,
	Arnd Bergmann <arnd@arndb.de>,
	Balbir Singh <bsingharora@gmail.com>,
	Borislav Petkov <bp@alien8.de>,
	Cyrill Gorcunov <gorcunov@gmail.com>,
	Dave Hansen <dave.hansen@linux.intel.com>,
	Eugene Syromiatnikov <esyr@redhat.com>,
	Florian Weimer <fweimer@redhat.com>, Jann Horn <jannh@google.com>,
	Jonathan Corbet <corbet@lwn.net>,
	Kees Cook <keescook@chromium.org>,
	Mike Kravetz <mike.kravetz@oracle.com>,
	Nadav Amit <nadav.amit@gmail.com>,
	Oleg Nesterov <oleg@redhat.com>, Pavel Machek <pavel@ucw.cz>,
	Peter Zijlstra <peterz@infradead.org>,
	Randy Dunlap <rdunlap@infradead.org>,
	"Ravi V. Shankar" <ravi.v.shankar@intel.com>,
	Vedvyas Shanbhogue <vedvyas.shanbhogue@intel.com>,
	Dave Martin <Dave.Martin@arm.com>,
	Weijiang Yang <weijiang.yang@intel.com>,
	Pengfei Xu <pengfei.xu@intel.com>
Subject: Re: [PATCH v13 8/8] x86/vsyscall/64: Fixup Shadow Stack and Indirect Branch Tracking for vsyscall emulation
Date: Thu, 1 Oct 2020 10:26:27 -0700	[thread overview]
Message-ID: <CALCETrU-pjSFBGBROukA8dtSUmft9E1j86oS16Lw0Oz1yzv8Gw@mail.gmail.com> (raw)
In-Reply-To: <79d1e67d-2394-1ce6-3bad-cce24ba792bd@intel.com>

On Thu, Oct 1, 2020 at 9:51 AM Yu, Yu-cheng <yu-cheng.yu@intel.com> wrote:
>
> On 9/30/2020 6:10 PM, Andy Lutomirski wrote:
> > On Wed, Sep 30, 2020 at 6:01 PM H.J. Lu <hjl.tools@gmail.com> wrote:
> >>
> >> On Wed, Sep 30, 2020 at 4:44 PM Andy Lutomirski <luto@kernel.org> wrote:
>
> [...]
>
> >>>>>>>    From 09803e66dca38d7784e32687d0693550948199ed Mon Sep 17 00:00:00 2001
> >>>>>>> From: Yu-cheng Yu <yu-cheng.yu@intel.com>
> >>>>>>> Date: Thu, 29 Nov 2018 14:15:38 -0800
> >>>>>>> Subject: [PATCH v13 8/8] x86/vsyscall/64: Fixup Shadow Stack and
> >>>>>>> Indirect Branch
> >>>>>>>     Tracking for vsyscall emulation
> >>>>>>>
> >>>>>>> Vsyscall entry points are effectively branch targets.  Mark them with
> >>>>>>> ENDBR64 opcodes.  When emulating the RET instruction, unwind shadow stack
> >>>>>>> and reset IBT state machine.
> >>>>>>>
> >>>>>>> Signed-off-by: Yu-cheng Yu <yu-cheng.yu@intel.com>
> >>>>>>> ---
> >>>>>>> v13:
> >>>>>>> - Check shadow stack address is canonical.
> >>>>>>> - Change from writing to MSRs to writing to CET xstate.
> >>>>>>>
> >>>>>>>     arch/x86/entry/vsyscall/vsyscall_64.c     | 34 +++++++++++++++++++++++
> >>>>>>>     arch/x86/entry/vsyscall/vsyscall_emu_64.S |  9 ++++++
> >>>>>>>     arch/x86/entry/vsyscall/vsyscall_trace.h  |  1 +
> >>>>>>>     3 files changed, 44 insertions(+)
> >>>>>>>
> >>>>>>> diff --git a/arch/x86/entry/vsyscall/vsyscall_64.c
> >>>>>>> b/arch/x86/entry/vsyscall/vsyscall_64.c
> >>>>>>> index 44c33103a955..30b166091d46 100644
> >>>>>>> --- a/arch/x86/entry/vsyscall/vsyscall_64.c
> >>>>>>> +++ b/arch/x86/entry/vsyscall/vsyscall_64.c
> >>>>>>> @@ -38,6 +38,9 @@
> >>>>>>>     #include <asm/fixmap.h>
> >>>>>>>     #include <asm/traps.h>
> >>>>>>>     #include <asm/paravirt.h>
> >>>>>>> +#include <asm/fpu/xstate.h>
> >>>>>>> +#include <asm/fpu/types.h>
> >>>>>>> +#include <asm/fpu/internal.h>
> >>>>>>>
> >>>>>>>     #define CREATE_TRACE_POINTS
> >>>>>>>     #include "vsyscall_trace.h"
> >>>>>>> @@ -286,6 +289,44 @@ bool emulate_vsyscall(unsigned long error_code,
> >>>>>>>           /* Emulate a ret instruction. */
> >>>>>>>           regs->ip = caller;
> >>>>>>>           regs->sp += 8;
> >>>>>>> +
> >>>>>>> +#ifdef CONFIG_X86_CET
> >>>>>>> +       if (tsk->thread.cet.shstk_size || tsk->thread.cet.ibt_enabled) {
> >>>>>>> +               struct cet_user_state *cet;
> >>>>>>> +               struct fpu *fpu;
> >>>>>>> +
> >>>>>>> +               fpu = &tsk->thread.fpu;
> >>>>>>> +               fpregs_lock();
> >>>>>>> +
> >>>>>>> +               if (!test_thread_flag(TIF_NEED_FPU_LOAD)) {
> >>>>>>> +                       copy_fpregs_to_fpstate(fpu);
> >>>>>>> +                       set_thread_flag(TIF_NEED_FPU_LOAD);
> >>>>>>> +               }
> >>>>>>> +
> >>>>>>> +               cet = get_xsave_addr(&fpu->state.xsave, XFEATURE_CET_USER);
> >>>>>>> +               if (!cet) {
> >>>>>>> +                       /*
> >>>>>>> +                        * This should not happen.  The task is
> >>>>>>> +                        * CET-enabled, but CET xstate is in INIT.
> >>>>>>> +                        */
> >>>>>>
> [...]
> >>>>>>
> >>>>>
> >>>>> For what it's worth, I think there is an alternative.  If you all
> >>>>> (userspace people, etc) can come up with a credible way for a user
> >>>>> program to statically declare that it doesn't need vsyscalls, then we
> >>>>> could make SHSTK depend on *that*, and we could avoid this mess.  This
> >>>>> breaks orthogonality, but it's probably a decent outcome.
> >>>>>
> >>>>
> >>>> Would an arch_prctl(DISABLE_VSYSCALL) work?  The kernel then sets a
> >>>> thread flag, and in emulate_vsyscall(), checks the flag.
> >>>>
> >>>> When CET is enabled, ld-linux will do DISABLE_VSYSCALL.
> >>>>
> >>>> How is that?
> >>>
> >>> Backwards, no?  Presumably vsyscall needs to be disabled before or
> >>> concurrently with CET being enabled, not after.
> >>>
> >>> I think the solution of making vsyscall emulation work correctly with
> >>> CET is going to be better and possibly more straightforward.
> >>>
> >>
> >> We can do
> >>
> >> 1. Add ARCH_X86_DISABLE_VSYSCALL to disable the vsyscall page.
> >> 2. If CPU supports CET and the program is CET enabled:
> >>      a. Disable the vsyscall page.
> >>      b. Pass control to user.
> >>      c. Enable the vsyscall page when ARCH_X86_CET_DISABLE is called.
> >>
> >> So when control is passed from kernel to user, the vsyscall page is
> >> disabled if the program
> >> is CET enabled.
> >
> > Let me say this one more time:
> >
> > If we have a per-process vsyscall disable control and a per-process
> > CET control, we are going to keep those settings orthogonal.  I'm
> > willing to entertain an option in which enabling SHSTK without also
> > disabling vsyscalls is disallowed, We are *not* going to have any CET
> > flags magically disable vsyscalls, though, and we are not going to
> > have a situation where disabling vsyscalls on process startup requires
> > enabling SHSTK.
> >
> > Any possible static vsyscall controls (and CET controls, for that
> > matter) also need to come with some explanation of whether they are
> > properties set on the ELF loader, the ELF program being loaded, or
> > both.  And this explanation needs to cover what happens when old
> > binaries link against new libc versions and vice versa.  A new
> > CET-enabled binary linked against old libc running on a new kernel
> > that is expected to work on a non-CET CPU MUST work on a CET CPU, too.
> >
> > Right now, literally the only thing preventing vsyscall emulation from
> > coexisting with SHSTK is that the implementation eeds work.
> >
> > So your proposal is rejected.  Sorry.
> >
> I think, even with shadow stack/ibt enabled, we can still allow XONLY
> without too much mess.
>
> What about this?
>
> Thanks,
> Yu-cheng
>
> ======
>
> diff --git a/arch/x86/entry/vsyscall/vsyscall_64.c
> b/arch/x86/entry/vsyscall/vsyscall_64.c
> index 8b0b32ac7791..d39da0a15521 100644
> --- a/arch/x86/entry/vsyscall/vsyscall_64.c
> +++ b/arch/x86/entry/vsyscall/vsyscall_64.c
> @@ -48,16 +48,16 @@
>   static enum { EMULATE, XONLY, NONE } vsyscall_mode __ro_after_init =
>   #ifdef CONFIG_LEGACY_VSYSCALL_NONE
>          NONE;
> -#elif defined(CONFIG_LEGACY_VSYSCALL_XONLY)
> +#elif defined(CONFIG_LEGACY_VSYSCALL_XONLY) || defined(CONFIG_X86_CET)
>          XONLY;
> -#else
> +#else
>          EMULATE;
>   #endif

I don't get it.

First, you can't do any of this based on config -- it must be runtime.

Second, and more importantly, I don't see how XONLY helps at all.  The
(non-executable) text that's exposed to user code in EMULATE mode is
trivial to get right with CET -- your code already handles it.  It's
the emulation code (that runs identically in EMULATE and XONLY mode)
that's tricky.

  reply	other threads:[~2020-10-01 17:26 UTC|newest]

Thread overview: 29+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2020-09-25 14:57 [PATCH v13 0/8] Control-flow Enforcement: Indirect Branch Tracking Yu-cheng Yu
2020-09-25 14:57 ` [PATCH v13 1/8] x86/cet/ibt: Add Kconfig option for user-mode " Yu-cheng Yu
2020-09-25 14:57 ` [PATCH v13 2/8] x86/cet/ibt: User-mode Indirect Branch Tracking support Yu-cheng Yu
2020-09-25 14:57 ` [PATCH v13 3/8] x86/cet/ibt: Handle signals for Indirect Branch Tracking Yu-cheng Yu
2020-09-25 14:58 ` [PATCH v13 4/8] x86/cet/ibt: ELF header parsing " Yu-cheng Yu
2020-09-25 14:58 ` [PATCH v13 5/8] x86/cet/ibt: Update arch_prctl functions " Yu-cheng Yu
2020-09-25 14:58 ` [PATCH v13 6/8] x86/vdso/32: Add ENDBR32 to __kernel_vsyscall entry point Yu-cheng Yu
2020-09-25 14:58 ` [PATCH v13 7/8] x86/vdso: Insert endbr32/endbr64 to vDSO Yu-cheng Yu
2020-09-25 16:18   ` Andy Lutomirski
2020-09-25 16:24     ` Yu, Yu-cheng
2020-09-25 14:58 ` [PATCH v13 8/8] x86/vsyscall/64: Fixup Shadow Stack and Indirect Branch Tracking for vsyscall emulation Yu-cheng Yu
2020-09-25 16:31   ` Andy Lutomirski
2020-09-25 16:47     ` Yu, Yu-cheng
2020-09-25 16:51       ` Andy Lutomirski
2020-09-28 16:59         ` Yu-cheng Yu
2020-09-28 17:37           ` Andy Lutomirski
2020-09-28 19:04             ` Yu, Yu-cheng
2020-09-29 18:37             ` Yu, Yu-cheng
2020-09-29 19:57               ` Andy Lutomirski
2020-09-29 20:00                 ` Andy Lutomirski
2020-09-30 22:33                   ` Yu, Yu-cheng
2020-09-30 23:44                     ` Andy Lutomirski
2020-10-01  1:00                       ` H.J. Lu
2020-10-01  1:10                         ` Andy Lutomirski
2020-10-01  1:21                           ` H.J. Lu
2020-10-01 16:51                           ` Yu, Yu-cheng
2020-10-01 17:26                             ` Andy Lutomirski [this message]
2020-10-06 19:09                               ` Yu, Yu-cheng
2020-10-09 17:42                                 ` Andy Lutomirski

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=CALCETrU-pjSFBGBROukA8dtSUmft9E1j86oS16Lw0Oz1yzv8Gw@mail.gmail.com \
    --to=luto@kernel.org \
    --cc=Dave.Martin@arm.com \
    --cc=arnd@arndb.de \
    --cc=bp@alien8.de \
    --cc=bsingharora@gmail.com \
    --cc=corbet@lwn.net \
    --cc=dave.hansen@linux.intel.com \
    --cc=esyr@redhat.com \
    --cc=fweimer@redhat.com \
    --cc=gorcunov@gmail.com \
    --cc=hjl.tools@gmail.com \
    --cc=hpa@zytor.com \
    --cc=jannh@google.com \
    --cc=keescook@chromium.org \
    --cc=linux-api@vger.kernel.org \
    --cc=linux-arch@vger.kernel.org \
    --cc=linux-doc@vger.kernel.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-mm@kvack.org \
    --cc=mike.kravetz@oracle.com \
    --cc=mingo@redhat.com \
    --cc=nadav.amit@gmail.com \
    --cc=oleg@redhat.com \
    --cc=pavel@ucw.cz \
    --cc=pengfei.xu@intel.com \
    --cc=peterz@infradead.org \
    --cc=ravi.v.shankar@intel.com \
    --cc=rdunlap@infradead.org \
    --cc=tglx@linutronix.de \
    --cc=vedvyas.shanbhogue@intel.com \
    --cc=weijiang.yang@intel.com \
    --cc=x86@kernel.org \
    --cc=yu-cheng.yu@intel.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).