From: Andy Lutomirski
Date: Mon, 20 Feb 2017 08:46:14 -0800
Subject: Re: RFC: Getting rid of LTR in VMX
To: Paolo Bonzini
Cc: kvm list, "linux-kernel@vger.kernel.org", X86 ML, "H. Peter Anvin", Borislav Petkov
In-Reply-To: <38d613dc-5b46-1f4f-04d1-53c01932e6d6@redhat.com>

On Mon, Feb 20, 2017 at 3:05 AM, Paolo Bonzini wrote:
>
>
> On 18/02/2017 04:29, Andy Lutomirski wrote:
>> There's no code here because the patch is trivial, but I want to run
>> the idea by you all first to see if there are any issues.
>>
>> VMX is silly and forces the TSS limit to the minimum on VM exits.  KVM
>> wastes lots of cycles bumping it back up to accommodate the io bitmap.
>
> Actually looked at the code now...
>
> reload_tss is only invoked for userspace exits, so it is a nice-to-have
> but it wouldn't show on most workloads.  Still it does save 150-200
> clock cycles to remove it (I just commented out reload_tss() from
> __vmx_load_host_state to test).

That's for anything involving userspace or preemption, right?

>
> Another 100-150 could be saved if we could just use rdgsbase/wrgsbase,
> instead of rdmsr/wrmsr, to read and write the kernel GS.  Super hacky
> patch after sig.

I have a Real Patch Series (tm) to do that, but it has a couple of
unresolved corner cases so far.

That being said, vmx_save_host_state() is, um, poorly optimized.  I'll
try to find some time to fix the obvious things.  Meanwhile, I'll send
real patches for TR.

> +       cr4_set_bits(X86_CR4_FSGSBASE);

Nice root hole :-p

--Andy
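
For anyone following the TSS-limit point quoted above, here is a minimal
userspace sketch of why the limit matters for the io bitmap.  It models
only the bounds check: the CPU reads two bytes of the I/O permission
bitmap for a port access, and both must lie at or below the TR limit.
The sketch assumes a 64-bit TSS whose hardware-defined part is 104
bytes, the 0x67 limit that VMX loads into TR on VM exit, and a bitmap
placed directly after the hardware-defined part.  The names
io_bitmap_reachable, IO_MAP_BASE and FULL_TSS_LIMIT are made up for
illustration and are not kernel identifiers.

```c
#include <stdbool.h>
#include <stdint.h>

/* Size of the hardware-defined part of a 64-bit TSS, in bytes. */
#define TSS_HW_SIZE        104u
/* TR limit after a VM exit: covers the hardware part and nothing else. */
#define VMX_EXIT_TR_LIMIT  0x67u
/* One permission bit per port, 65536 ports. */
#define IO_BITMAP_BYTES    (65536u / 8u)

/* Assumption: the bitmap starts right after the hardware-defined part. */
#define IO_MAP_BASE        TSS_HW_SIZE
/*
 * Assumption: a limit that covers the whole bitmap plus one trailing
 * byte.  The limit is the offset of the last valid byte.
 */
#define FULL_TSS_LIMIT     (IO_MAP_BASE + IO_BITMAP_BYTES)

/*
 * Return true if the two bitmap bytes the CPU would read for `port`
 * are both within the TSS limit.  If they are not, a port access that
 * depends on the bitmap faults regardless of the bitmap's contents.
 */
bool io_bitmap_reachable(uint32_t tr_limit, uint32_t io_map_base,
                         uint16_t port)
{
    uint32_t first_byte = io_map_base + (uint32_t)port / 8u;

    return first_byte + 1u <= tr_limit;
}
```

With the post-exit limit of 0x67, no port's bitmap bytes are reachable,
which is why the limit has to be raised again before anything that
relies on the io bitmap can run.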