From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-0.8 required=3.0 tests=HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI autolearn=unavailable autolearn_force=no version=3.4.0 Received: from mail.kernel.org (pdx-korg-mail-1.web.codeaurora.org [172.30.200.123]) by aws-us-west-2-korg-lkml-1.web.codeaurora.org (Postfix) with ESMTP id 9748AC433EF for ; Tue, 12 Jun 2018 20:18:51 +0000 (UTC) Received: from vger.kernel.org (vger.kernel.org [209.132.180.67]) by mail.kernel.org (Postfix) with ESMTP id 4E28020891 for ; Tue, 12 Jun 2018 20:18:51 +0000 (UTC) DMARC-Filter: OpenDMARC Filter v1.3.2 mail.kernel.org 4E28020891 Authentication-Results: mail.kernel.org; dmarc=fail (p=none dis=none) header.from=intel.com Authentication-Results: mail.kernel.org; spf=none smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S933630AbeFLUSt (ORCPT ); Tue, 12 Jun 2018 16:18:49 -0400 Received: from mga18.intel.com ([134.134.136.126]:40916 "EHLO mga18.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S932823AbeFLUSr (ORCPT ); Tue, 12 Jun 2018 16:18:47 -0400 X-Amp-Result: SKIPPED(no attachment in message) X-Amp-File-Uploaded: False Received: from fmsmga005.fm.intel.com ([10.253.24.32]) by orsmga106.jf.intel.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 12 Jun 2018 13:18:46 -0700 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.51,216,1526367600"; d="scan'208";a="236601496" Received: from 2b52.sc.intel.com (HELO [143.183.136.147]) ([143.183.136.147]) by fmsmga005.fm.intel.com with ESMTP; 12 Jun 2018 13:18:46 -0700 Message-ID: <1528834538.9849.13.camel@2b52.sc.intel.com> Subject: Re: [PATCH 00/10] Control Flow Enforcement - Part (3) From: Yu-cheng Yu To: Andy Lutomirski Cc: bsingharora@gmail.com, LKML , linux-doc@vger.kernel.org, Linux-MM , linux-arch , X86 ML , "H. Peter Anvin" , Thomas Gleixner , Ingo Molnar , "H. J. Lu" , "Shanbhogue, Vedvyas" , "Ravi V. Shankar" , Dave Hansen , Jonathan Corbet , Oleg Nesterov , Arnd Bergmann , mike.kravetz@oracle.com Date: Tue, 12 Jun 2018 13:15:38 -0700 In-Reply-To: <1528824280.9447.30.camel@2b52.sc.intel.com> References: <20180607143807.3611-1-yu-cheng.yu@intel.com> <1528815820.8271.16.camel@2b52.sc.intel.com> <1528820489.9324.14.camel@2b52.sc.intel.com> <1528824280.9447.30.camel@2b52.sc.intel.com> Content-Type: text/plain; charset="UTF-8" X-Mailer: Evolution 3.10.4-0ubuntu2 Mime-Version: 1.0 Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, 2018-06-12 at 10:24 -0700, Yu-cheng Yu wrote: > On Tue, 2018-06-12 at 09:31 -0700, Andy Lutomirski wrote: > > On Tue, Jun 12, 2018 at 9:24 AM Yu-cheng Yu wrote: > > > > > > On Tue, 2018-06-12 at 09:00 -0700, Andy Lutomirski wrote: > > > > On Tue, Jun 12, 2018 at 8:06 AM Yu-cheng Yu wrote: > > > > > > > > > > On Tue, 2018-06-12 at 20:56 +1000, Balbir Singh wrote: > > > > > > > > > > > > On 08/06/18 00:37, Yu-cheng Yu wrote: > > > > > > > This series introduces CET - Shadow stack > > > > > > > > > > > > > > At the high level, shadow stack is: > > > > > > > > > > > > > > Allocated from a task's address space with vm_flags VM_SHSTK; > > > > > > > Its PTEs must be read-only and dirty; > > > > > > > Fixed sized, but the default size can be changed by sys admin. > > > > > > > > > > > > > > For a forked child, the shadow stack is duplicated when the next > > > > > > > shadow stack access takes place. > > > > > > > > > > > > > > For a pthread child, a new shadow stack is allocated. > > > > > > > > > > > > > > The signal handler uses the same shadow stack as the main program. > > > > > > > > > > > > > > > > > > > Even with sigaltstack()? > > > > > > > > > > > > > > > > > > Balbir Singh. > > > > > > > > > > Yes. > > > > > > > > > > > > > I think we're going to need some provision to add an alternate signal > > > > stack to handle the case where the shadow stack overflows. > > > > > > The shadow stack stores only return addresses; its consumption will not > > > exceed a percentage of (program stack size + sigaltstack size) before > > > those overflow. When that happens, there is usually very little we can > > > do. So we set a default shadow stack size that supports certain nested > > > calls and allow sys admin to adjust it. > > > > > > > Of course there's something you can do: add a sigaltstack-like stack > > switching mechanism. Have a reserve shadow stack and, when a signal > > is delivered (possibly guarded by other conditions like "did the > > shadow stack overflow"), switch to a new shadow stack and maybe write > > a special token to the new shadow stack that says "signal delivery > > jumped here and will restore to the previous shadow stack and > > such-and-such address on return". > > If (shstk size == (stack size + sigaltstack size)), then shstk will not > overflow before program stack overflows and sigaltstack also overflows. > > Let me think about this. The reserve shadow stack will help only when the shstk overflows but signal stack/sigaltstack still has room and we can deliver a signal. If the shstk is large enough to cover any nested calls that will overflow both the program stack and sigaltstack then we don't need a reserve shstk. We can estimate how big the shstk needs to be; in the worst case it should not be greater than (program stack size + sigaltstack size). The default shstk size we choose pass all signal tests in GLIBC. In case there is a need to increase it for a very large RLIMIT_STACK or very large sigaltstack, the sys admin can increase the default shstk size. Yu-cheng