Date: Tue, 16 Mar 2021 12:49:33 +1300
From: Kai Huang
To: Jarkko Sakkinen
Cc: Sean Christopherson, kvm@vger.kernel.org, linux-sgx@vger.kernel.org,
    x86@kernel.org, linux-kernel@vger.kernel.org, luto@kernel.org,
    dave.hansen@intel.com, rick.p.edgecombe@intel.com, haitao.huang@intel.com,
    pbonzini@redhat.com, bp@alien8.de, tglx@linutronix.de, mingo@redhat.com,
    hpa@zytor.com
Subject: Re: [PATCH v2 07/25] x86/sgx: Initialize virtual EPC driver even when SGX driver is disabled
Message-Id: <20210316124933.3921b390ae153af598b145df@intel.com>
References: <20210315161317.9c72479dfcde4e22078abcd2@intel.com> <20210316094859.7b5947b743a81dff7434615c@intel.com>
X-Mailing-List: linux-kernel@vger.kernel.org

On Tue, 16 Mar 2021 01:08:44 +0200 Jarkko Sakkinen wrote:

> On Tue, Mar 16, 2021 at 01:05:05AM +0200, Jarkko Sakkinen wrote:
> > On Tue, Mar 16, 2021 at 09:48:59AM +1300, Kai Huang wrote:
> > > On Mon, 15 Mar 2021 15:51:17 +0200 Jarkko Sakkinen wrote:
> > > > On Mon, Mar 15, 2021 at 03:04:59PM +0200, Jarkko Sakkinen wrote:
> > > > > On Mon, Mar 15, 2021 at 04:13:17PM +1300, Kai Huang wrote:
> > > > > > On Sun, 14 Mar 2021 17:27:18 +0200 Jarkko Sakkinen wrote:
> > > > > > > On Sun, Mar 14, 2021 at 05:25:26PM +0200, Jarkko Sakkinen wrote:
> > > > > > > > On Sat, Mar 13, 2021 at 09:07:36PM +0200, Jarkko Sakkinen wrote:
> > > > > > > > > On Sat, Mar 13, 2021 at 09:05:36PM +0200, Jarkko Sakkinen wrote:
> > > > > > > > > > On Fri, Mar 12, 2021 at 01:44:58PM -0800, Sean Christopherson wrote:
> > > > > > > > > > > On Tue, Mar 09, 2021, Kai Huang wrote:
> > > > > > > > > > > > Modify sgx_init() to always try to initialize the virtual EPC driver,
> > > > > > > > > > > > even if the SGX driver is disabled. The SGX driver might be disabled
> > > > > > > > > > > > if SGX Launch Control is in locked mode, or not supported in the
> > > > > > > > > > > > hardware at all.
> > > > > > > > > > > > This allows (non-Linux) guests that support non-LC
> > > > > > > > > > > > configurations to use SGX.
> > > > > > > > > > > >
> > > > > > > > > > > > Acked-by: Dave Hansen
> > > > > > > > > > > > Signed-off-by: Kai Huang
> > > > > > > > > > > > ---
> > > > > > > > > > > >  arch/x86/kernel/cpu/sgx/main.c | 10 +++++++++-
> > > > > > > > > > > >  1 file changed, 9 insertions(+), 1 deletion(-)
> > > > > > > > > > > >
> > > > > > > > > > > > diff --git a/arch/x86/kernel/cpu/sgx/main.c b/arch/x86/kernel/cpu/sgx/main.c
> > > > > > > > > > > > index 44fe91a5bfb3..8c922e68274d 100644
> > > > > > > > > > > > --- a/arch/x86/kernel/cpu/sgx/main.c
> > > > > > > > > > > > +++ b/arch/x86/kernel/cpu/sgx/main.c
> > > > > > > > > > > > @@ -712,7 +712,15 @@ static int __init sgx_init(void)
> > > > > > > > > > > >  		goto err_page_cache;
> > > > > > > > > > > >  	}
> > > > > > > > > > > >
> > > > > > > > > > > > -	ret = sgx_drv_init();
> > > > > > > > > > > > +	/*
> > > > > > > > > > > > +	 * Always try to initialize the native *and* KVM drivers.
> > > > > > > > > > > > +	 * The KVM driver is less picky than the native one and
> > > > > > > > > > > > +	 * can function if the native one is not supported on the
> > > > > > > > > > > > +	 * current system or fails to initialize.
> > > > > > > > > > > > +	 *
> > > > > > > > > > > > +	 * Error out only if both fail to initialize.
> > > > > > > > > > > > +	 */
> > > > > > > > > > > > +	ret = !!sgx_drv_init() & !!sgx_vepc_init();
> > > > > > > > > > >
> > > > > > > > > > > I love this code.
> > > > > > > > > > >
> > > > > > > > > > > Reviewed-by: Sean Christopherson
> > > > > > > > > >
> > > > > > > > > > I'm still wondering why this code lets it go through when sgx_drv_init()
> > > > > > > > > > succeeds and sgx_vepc_init() fails.
> > > > > > > > > >
> > > > > > > > > > The inline comment explains only the mirrored case (which does make
> > > > > > > > > > sense).
> > > > > > > > >
> > > > > > > > > I.e.
> > > > > > > > > if sgx_drv_init() succeeds, I'd expect that sgx_vepc_init() must
> > > > > > > > > succeed. Why legitimately expect anything else?
> > > > > > > >
> > > > > > > > Apologies for coming with these ideas at this point, but here is what this
> > > > > > > > led me to.
> > > > > > > >
> > > > > > > > I think that all this complexity comes from a bad code structure.
> > > > > > > >
> > > > > > > > So, what is essentially happening here:
> > > > > > > >
> > > > > > > > - We essentially want to make EPC always work.
> > > > > > > > - Driver optionally.
> > > > > > > >
> > > > > > > > So what this sums to is something like:
> > > > > > > >
> > > > > > > > 	ret = sgx_epc_init();
> > > > > > > > 	if (ret) {
> > > > > > > > 		pr_err("EPC initialization failed.\n");
> > > > > > > > 		return ret;
> > > > > > > > 	}
> > > > > > > >
> > > > > > > > 	ret = sgx_drv_init();
> > > > > > > > 	if (ret)
> > > > > > > > 		pr_info("Driver could not be initialized.\n");
> > > > > > > >
> > > > > > > > 	/* continue */
> > > > > > > >
> > > > > > > > I.e. I think there should be a single EPC init, which does both EPC
> > > > > > > > bootstrapping and vepc, and driver initialization comes after that.
> > > > > > >
> > > > > > > In other words, from the SGX point of view, the thing that KVM needs is
> > > > > > > to cut out the EPC and driver parts into different islands. How this is now
> > > > > > > implemented in the current patch set is half-way there but not yet what
> > > > > > > it should be.
> > > > > >
> > > > > > Well, conceptually, SGX virtualization and the SGX driver are two independent
> > > > > > functionalities that can be enabled separately, although they both require some
> > > > > > common functionality, such as /dev/sgx_provision, which we have moved to
> > > > > > sgx/main.c exactly for this purpose.
> > > > > > Therefore, conceptually, it is bad to make the
> > > > > > assumption that, if SGX virtualization initialization succeeded, the SGX driver
> > > > > > must succeed -- we can potentially add more stuff to SGX virtualization in the
> > > > > > future.
> > > > > >
> > > > > > If the name sgx_vepc_init() confuses you, I can rename it to sgx_virt_init().
> > > > >
> > > > > I don't understand what would be the bad thing here. Can you open that
> > > > > up please? I'm neither capable of predicting the future...
> > >
> > > Conceptually they are two different functionalities, and don't depend on each
> > > other. Why call SGX driver initialization only when SGX virtualization
> > > succeeded?
> > >
> > > We might want to add reclaiming of EPC pages (VMM EPC oversubscription) from KVM
> > > guests in the future, which may bring more initialization stuff into
> > > sgx_vepc_init(), and that new stuff should not impact the SGX driver.
> > >
> > > I don't see that your approach is any better, in either concept or flexibility.
> > >
> > > Like I said, we can rename it to sgx_virt_init() to be more generic, but I
> > > strongly disagree with your approach.
> > >
> > > >
> > > > Right, so since vepc_init() only does device file initialization, the
> > > > current function structure is fine. I totally forgot that sgx_drv_init()
> > > > does not call EPC initialization when I wrote the above :-) We refactored
> > > > the driver so many times during the initial cycle that I sometimes mix
> > > > things up, sorry about that.
> > > >
> > > > To meld this into code:
> > > >
> > > > 	ret = sgx_vepc_init();
> > > > 	if (ret != -ENODEV) {
> > > > 		pr_err("vEPC initialization failed with %d.\n", ret);
> > > > 		return ret;
> > > > 	}
> > > >
> > > > 	ret = sgx_drv_init();
> > > > 	if (ret != -ENODEV)
> > > > 		pr_info("Driver initialization failed %d.\n", ret);
> > >
> > > Hmm.. Let's say an extreme case: misc_register() failed in sgx_vepc_init(), due
> > > to -ENOMEM. Then the OOM killer gets involved and kills a bunch of apps.
> > > And then in this case, theoretically, misc_register() in sgx_drv_init()
> > > doesn't need to fail.
> > >
> > > The point is really that SGX driver and SGX virt are two independent
> > > functionalities, so don't manually create a dependency between them. Plus I
> > > don't see any benefit of your approach, but only cons.
> >
> > The way I've understood it is that given that KVM can support SGX
> > without FLC, vEPC should be available even if driver cannot be
> > enabled.
> >
> > This is also exactly what the short summary states.
> >
> > "Initialize virtual EPC driver even when SGX driver is disabled"
> >
> > It *does not* state:
> >
> > "Initialize SGX driver even when vEPC driver is disabled"

OK. The patch title can be improved. How about:

"Initialize SGX driver and virtual EPC driver independently"?

> >
> > Also, this is how I interpret the inline comment.
> >
> > All this considered, the other direction is undocumented functionality.

OK. How about the below?

	/*
	 * Always try to initialize the native *and* KVM drivers. They are independent
	 * functionalities and one can be initialized even when the other is not
	 * supported or fails to initialize.
	 */

The explicit wording "not supported or fails to initialize" was requested by
you -- you wanted to distinguish -ENODEV from other error codes.

>
> Also:
>
> 1. There is *zero* good practical reasons to support the "2nd direction".
>    For KVM getting init'd with SGX, on the other hand, we have good
>    practical reasons.

Why are there *zero* good practical reasons?

With independent initialization, people don't need to worry about the
*internals* of sgx_vepc_init() and sgx_drv_init(); they just need to keep in
mind that these are two independent functionalities. Being able to initialize
them independently is much clearer and easier to understand. And like I said,
this way it is more flexible to extend -- for instance, we may add more stuff
to support VMM EPC oversubscription.
So why are there *zero* good practical reasons?

Btw, there are customers that want to use just KVM SGX, but not the SGX driver
in the host, for which people may want to add a separate config option, say
CONFIG_X86_SGX_DRIVER, to be able to disable/enable the SGX driver code, just
like CONFIG_X86_SGX_KVM. Making them logically independent just makes things
clearer.

> 2. We can get something practically useful with simpler and more verbose
>    code, i.e. better logging.

I can add an error message in sgx_vepc_init() for the case where
misc_register() fails, if you want.

>
> /Jarkko
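
For readers following the thread, the difference between the two schemes can be sketched in user space. This is only an illustration under stated assumptions: the stub_* variables, the stubbed sgx_drv_init()/sgx_vepc_init() bodies, and the init_independent()/init_chained() wrappers are hypothetical and are not the kernel code. The chained variant is one reading of the quoted proposal, with an explicit success check added so that a zero return does not take the error path.

```c
#include <errno.h>
#include <stdio.h>

/* Hypothetical user-space stand-ins; each stub returns 0 or a negative errno. */
static int stub_drv_ret;	/* value the fake sgx_drv_init() returns */
static int stub_vepc_ret;	/* value the fake sgx_vepc_init() returns */

static int sgx_drv_init(void)  { return stub_drv_ret; }
static int sgx_vepc_init(void) { return stub_vepc_ret; }

/*
 * Expression from the quoted patch: both initializers always run, and the
 * result is nonzero only when both of them fail.
 */
static int init_independent(int drv_ret, int vepc_ret)
{
	stub_drv_ret = drv_ret;
	stub_vepc_ret = vepc_ret;

	return !!sgx_drv_init() & !!sgx_vepc_init();
}

/*
 * Chained variant (an assumed interpretation, not the posted code): a vEPC
 * error other than -ENODEV aborts before the driver is even tried, while a
 * driver error is only logged.
 */
static int init_chained(int drv_ret, int vepc_ret)
{
	int ret;

	stub_drv_ret = drv_ret;
	stub_vepc_ret = vepc_ret;

	ret = sgx_vepc_init();
	if (ret && ret != -ENODEV)
		return ret;

	ret = sgx_drv_init();
	if (ret && ret != -ENODEV)
		fprintf(stderr, "Driver initialization failed %d.\n", ret);

	return 0;
}
```

With these stubs, a vEPC failure of -ENOMEM combined with a working driver returns 0 from init_independent() but -ENOMEM from init_chained(), which is the asymmetry being argued about above. Note that the patch expression yields 0 or 1 rather than an errno value.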