From: Paolo Bonzini <pbonzini@redhat.com>
To: Liran Alon <liran.alon@oracle.com>,
rkrcmar@redhat.com, kvm@vger.kernel.org
Cc: Singh Brijesh <brijesh.singh@amd.com>,
Sean Christopherson <sean.j.christopherson@intel.com>,
Boris Ostrovsky <boris.ostrovsky@oracle.com>
Subject: Re: [PATCH v2] KVM: SVM: Fix detection of AMD Errata 1096
Date: Thu, 18 Jul 2019 12:50:23 +0200 [thread overview]
Message-ID: <f011a43e-6bd3-0a90-fc19-a5b075994aa3@redhat.com> (raw)
In-Reply-To: <20190716235658.18185-1-liran.alon@oracle.com>
On 17/07/19 01:56, Liran Alon wrote:
> AMD Errata 1096:
> When CPU raise #NPF on guest data access and vCPU CR4.SMAP=1, it is
> possible that CPU microcode implementing DecodeAssist will fail
> to read bytes of instruction which caused #NPF. In this case,
> GuestIntrBytes field of the VMCB on a VMEXIT will incorrectly
> return 0 instead of the correct guest instruction bytes.
> This happens because CPU microcode reading instruction bytes
> uses a special opcode which attempts to read data using CPL=0
> priviledges. The microcode reads CS:RIP and if it hits a SMAP
> fault, it gives up and returns no instruction bytes.
>
> Current KVM code which attemps to detect and workaround this errata have
> multiple issues:
>
> 1) Code mistakenly checks if vCPU CR4.SMAP=0 instead of vCPU CR4.SMAP=1 which
> is required for encountering a SMAP fault.
>
> 2) Code assumes SMAP fault can only occur when vCPU CPL==3.
> However, the condition for a SMAP fault is a data access with CPL<3
> to a page mapped in page-tables as user-accessible (i.e. PTE with U/S
> bit set to 1).
> Therefore, in case vCPU CR4.SMEP=0, guest can execute an instruction
> which reside in a user-accessible page with CPL<3 priviledge. If this
> instruction raise a #NPF on it's data access, then CPU DecodeAssist
> microcode will still encounter a SMAP violation.
> Even though no sane OS will do so (as it's an obvious priviledge
> escalation vulnerability), we still need to handle this semanticly
> correct in KVM side.
>
> As CR4.SMAP=1 is an easy triggerable condition, attempt to avoid
> false-positive of detecting errata by taking note that in case vCPU
> CR4.SMEP=1, errata could only be encountered in case CPL==3 (As
> otherwise, CPU would raise SMEP fault to guest instead of #NPF).
> This can be a useful condition to avoid false-positive because guests
> usually enable SMAP if they have also enabled SMEP.
>
> In addition, to avoid future confusion and improve code readbility,
> comment errata details in code and not just in commit message.
>
> Fixes: 05d5a4863525 ("KVM: SVM: Workaround errata#1096 (insn_len maybe zero on SMAP violation)")
> Cc: Singh Brijesh <brijesh.singh@amd.com>
> Cc: Sean Christopherson <sean.j.christopherson@intel.com>
> Cc: Paolo Bonzini <pbonzini@redhat.com>
> Reviewed-by: Boris Ostrovsky <boris.ostrovsky@oracle.com>
> Signed-off-by: Liran Alon <liran.alon@oracle.com>
> ---
> arch/x86/kvm/svm.c | 42 +++++++++++++++++++++++++++++++++++-------
> 1 file changed, 35 insertions(+), 7 deletions(-)
>
> diff --git a/arch/x86/kvm/svm.c b/arch/x86/kvm/svm.c
> index 735b8c01895e..7d6410539dd7 100644
> --- a/arch/x86/kvm/svm.c
> +++ b/arch/x86/kvm/svm.c
> @@ -7120,13 +7120,41 @@ static int nested_enable_evmcs(struct kvm_vcpu *vcpu,
>
> static bool svm_need_emulation_on_page_fault(struct kvm_vcpu *vcpu)
> {
> - bool is_user, smap;
> -
> - is_user = svm_get_cpl(vcpu) == 3;
> - smap = !kvm_read_cr4_bits(vcpu, X86_CR4_SMAP);
> + unsigned long cr4 = kvm_read_cr4(vcpu);
> + bool smep = cr4 & X86_CR4_SMEP;
> + bool smap = cr4 & X86_CR4_SMAP;
> + bool is_user = svm_get_cpl(vcpu) == 3;
>
> /*
> - * Detect and workaround Errata 1096 Fam_17h_00_0Fh
> + * Detect and workaround Errata 1096 Fam_17h_00_0Fh.
> + *
> + * Errata:
> + * When CPU raise #NPF on guest data access and vCPU CR4.SMAP=1, it is
> + * possible that CPU microcode implementing DecodeAssist will fail
> + * to read bytes of instruction which caused #NPF. In this case,
> + * GuestIntrBytes field of the VMCB on a VMEXIT will incorrectly
> + * return 0 instead of the correct guest instruction bytes.
> + *
> + * This happens because CPU microcode reading instruction bytes
> + * uses a special opcode which attempts to read data using CPL=0
> + * priviledges. The microcode reads CS:RIP and if it hits a SMAP
> + * fault, it gives up and returns no instruction bytes.
> + *
> + * Detection:
> + * We reach here in case CPU supports DecodeAssist, raised #NPF and
> + * returned 0 in GuestIntrBytes field of the VMCB.
> + * First, errata can only be triggered in case vCPU CR4.SMAP=1.
> + * Second, if vCPU CR4.SMEP=1, errata could only be triggered
> + * in case vCPU CPL==3 (Because otherwise guest would have triggered
> + * a SMEP fault instead of #NPF).
> + * Otherwise, vCPU CR4.SMEP=0, errata could be triggered by any vCPU CPL.
> + * As most guests enable SMAP if they have also enabled SMEP, use above
> + * logic in order to attempt minimize false-positive of detecting errata
> + * while still preserving all cases semantic correctness.
> + *
> + * Workaround:
> + * To determine what instruction the guest was executing, the hypervisor
> + * will have to decode the instruction at the instruction pointer.
> *
> * In non SEV guest, hypervisor will be able to read the guest
> * memory to decode the instruction pointer when insn_len is zero
> @@ -7137,11 +7165,11 @@ static bool svm_need_emulation_on_page_fault(struct kvm_vcpu *vcpu)
> * instruction pointer so we will not able to workaround it. Lets
> * print the error and request to kill the guest.
> */
> - if (is_user && smap) {
> + if (smap && (!smep || is_user)) {
> if (!sev_guest(vcpu->kvm))
> return true;
>
> - pr_err_ratelimited("KVM: Guest triggered AMD Erratum 1096\n");
> + pr_err_ratelimited("KVM: SEV Guest triggered AMD Erratum 1096\n");
> kvm_make_request(KVM_REQ_TRIPLE_FAULT, vcpu);
> }
>
>
Queued, subject to Brijesh's answer on guest CR4.SMAP vs host CR4.SMAP.
Paolo
prev parent reply other threads:[~2019-07-18 10:50 UTC|newest]
Thread overview: 5+ messages / expand[flat|nested] mbox.gz Atom feed top
2019-07-16 23:56 [PATCH v2] KVM: SVM: Fix detection of AMD Errata 1096 Liran Alon
2019-07-17 20:44 ` Singh, Brijesh
2019-07-18 10:49 ` Paolo Bonzini
2019-07-18 15:25 ` Singh, Brijesh
2019-07-18 10:50 ` Paolo Bonzini [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=f011a43e-6bd3-0a90-fc19-a5b075994aa3@redhat.com \
--to=pbonzini@redhat.com \
--cc=boris.ostrovsky@oracle.com \
--cc=brijesh.singh@amd.com \
--cc=kvm@vger.kernel.org \
--cc=liran.alon@oracle.com \
--cc=rkrcmar@redhat.com \
--cc=sean.j.christopherson@intel.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).