From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-12.8 required=3.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,FREEMAIL_FORGED_FROMDOMAIN,FREEMAIL_FROM, HEADER_FROM_DIFFERENT_DOMAINS,INCLUDES_CR_TRAILER,INCLUDES_PATCH, MAILING_LIST_MULTI,SPF_HELO_NONE,SPF_PASS,URIBL_BLOCKED autolearn=ham autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id C7DF7C433ED for ; Thu, 13 May 2021 17:13:42 +0000 (UTC) Received: from lists.gnu.org (lists.gnu.org [209.51.188.17]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPS id 3C1486142C for ; Thu, 13 May 2021 17:13:41 +0000 (UTC) DMARC-Filter: OpenDMARC Filter v1.3.2 mail.kernel.org 3C1486142C Authentication-Results: mail.kernel.org; dmarc=fail (p=none dis=none) header.from=gmail.com Authentication-Results: mail.kernel.org; spf=pass smtp.mailfrom=qemu-devel-bounces+qemu-devel=archiver.kernel.org@nongnu.org Received: from localhost ([::1]:38810 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1lhEu0-000608-KS for qemu-devel@archiver.kernel.org; Thu, 13 May 2021 13:13:38 -0400 Received: from eggs.gnu.org ([2001:470:142:3::10]:43888) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1lhES4-0007wq-OY for qemu-devel@nongnu.org; Thu, 13 May 2021 12:44:44 -0400 Received: from mail-yb1-xb34.google.com ([2607:f8b0:4864:20::b34]:45629) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.90_1) (envelope-from ) id 1lhES2-0003j3-IY for qemu-devel@nongnu.org; Thu, 13 May 2021 12:44:44 -0400 Received: by mail-yb1-xb34.google.com with SMTP id g38so35500082ybi.12 for ; Thu, 13 May 2021 09:44:41 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :cc; bh=2XNd3eUzf9XREb+pgO4It2usQ416pB1hlzRUsw5RxRI=; b=qYnoA/1bTmK9kGkgkor9ZIDwbfy3c6zhnjEhzQiAPfGvPe0vreN1O3P52OXnrMnNy1 PS4PTV3fIt9DTjlSg0N4+Kol/ByT5ke3lZNg2cFt6ShVMriv0Vmg/XoOdr1AdlaKS59F m38N3ILXJ59wiEiuXqibuGzxnxdR6cJslhxLwNj8FnKhcS1Mjr1kRFnYWVN2zXUF53AP 1orlekCNLOdjSI+zJdC0OqrDSHodmdwVKCjdgTJAYhd72nvneeCxPjADGQcgKbyF1KFv 4oCYROJ4jKJzcObKF0K5CuW+N7oSnrUP2pb6HK+JHOgQpFZ1Q12/UxGNtXKgGAaQWCzN UkqA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to:cc; bh=2XNd3eUzf9XREb+pgO4It2usQ416pB1hlzRUsw5RxRI=; b=q2lJdSO11J8ctqgE3OZ0LalEZw1DiV6y1olEKRIlCoUELt4xuOXdXyyKGpOGO+C54x oEJiMOv+QCZcnM4yUaiIsWNv4vs20htGSQ/Xzz4Bud155UDc6p0RuIiKWo3+Hjd1mYJB K8hWY9tIBRz5osLK+XREAEh+otvmt7Ab/TrnXK6FhRsepw/KmhmqXtF5H2yj/G9XN8gt sZc4s9f2gahkZ6rybgPLUEWMPAdt+NZTXEQB9R+FfqazLQfYxa1CnL6NiVSM6PzNGWvE 1mi7D9DUbGFOMTjLDN2ile7y5eTPv5iMVobzNYegbM9L/YNiP4nKEhVxAENnOKojSEFb bb3g== X-Gm-Message-State: AOAM530efMjbTNnBdUOd01jfOJazbSvcf063ZWlQtHBS+QohBEn0TTEs Rd0Qs5IipJdyNIaqRJqx6BxqGckR/XhFV3QC1qsXZQLc16g= X-Google-Smtp-Source: ABdhPJyUWN9QwVCZUP7pnmfSTgQdi3FHpH7oSCP/RR7SoUpxUEq5BYLxZqHI/puR5xbAD5bXzi+fAI/2O/8G1UJ4wao= X-Received: by 2002:a25:234d:: with SMTP id j74mr55538267ybj.301.1620924280634; Thu, 13 May 2021 09:44:40 -0700 (PDT) MIME-Version: 1.0 Received: by 2002:a05:7110:1b0:b029:b7:a582:d972 with HTTP; Thu, 13 May 2021 09:44:40 -0700 (PDT) In-Reply-To: <20210507080055.258563-2-ziqiaokong@gmail.com> References: <20210507080055.258563-1-ziqiaokong@gmail.com> <20210507080055.258563-2-ziqiaokong@gmail.com> From: Ziqiao Kong Date: Fri, 14 May 2021 00:44:40 +0800 Message-ID: Subject: Re: [PATCH v4 2/2] target/i386: Correct implementation for FCS, FIP, FDS and FDP To: qemu-devel@nongnu.org Content-Type: text/plain; charset="UTF-8" Received-SPF: pass client-ip=2607:f8b0:4864:20::b34; envelope-from=ziqiaokong@gmail.com; helo=mail-yb1-xb34.google.com X-Spam_score_int: -20 X-Spam_score: -2.1 X-Spam_bar: -- X-Spam_report: (-2.1 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_FROM=0.001, RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001 autolearn=ham autolearn_force=no X-Spam_action: no action X-BeenThere: qemu-devel@nongnu.org X-Mailman-Version: 2.1.23 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Cc: pbonzini@redhat.com, richard.henderson@linaro.org, ehabkost@redhat.com, Ziqiao Kong Errors-To: qemu-devel-bounces+qemu-devel=archiver.kernel.org@nongnu.org Sender: "Qemu-devel" Ping. On 5/7/21, Ziqiao Kong wrote: > Changes since v3: > - Split the long patches to series to make review easier. > - Fix the coding style problems in v3. > > Changes since v2: > - Change the sequence of fpcs, fpds, fpip and fpdp in CPUX86State. > - Use stl instead of stw in do_fstenv. > - Move variables to floats instruction case block. > - Move last accessed memory operand to a temp variable to avoid another > load. > - Move segment selectors instead of segment base to fpcs and fpds. > - Fix some code stype problems for the original code in floats case block. > > Signed-off-by: Ziqiao Kong > --- > target/i386/cpu.h | 4 +++ > target/i386/tcg/fpu_helper.c | 48 ++++++++++++++++++++++-------------- > target/i386/tcg/translate.c | 45 ++++++++++++++++++++++++++++++++- > 3 files changed, 77 insertions(+), 20 deletions(-) > > diff --git a/target/i386/cpu.h b/target/i386/cpu.h > index 570f916878..241945320b 100644 > --- a/target/i386/cpu.h > +++ b/target/i386/cpu.h > @@ -705,6 +705,8 @@ typedef uint64_t FeatureWordArray[FEATURE_WORDS]; > #define CPUID_7_0_EBX_INVPCID (1U << 10) > /* Restricted Transactional Memory */ > #define CPUID_7_0_EBX_RTM (1U << 11) > +/* Deprecates FPU CS and FPU DS values */ > +#define CPUID_7_0_EBX_FCS_FDS (1U << 13) > /* Memory Protection Extension */ > #define CPUID_7_0_EBX_MPX (1U << 14) > /* AVX-512 Foundation */ > @@ -1440,6 +1442,8 @@ typedef struct CPUX86State { > FPReg fpregs[8]; > /* KVM-only so far */ > uint16_t fpop; > + uint16_t fpcs; > + uint16_t fpds; > uint64_t fpip; > uint64_t fpdp; > > diff --git a/target/i386/tcg/fpu_helper.c b/target/i386/tcg/fpu_helper.c > index 60ed93520a..f1a8717ed8 100644 > --- a/target/i386/tcg/fpu_helper.c > +++ b/target/i386/tcg/fpu_helper.c > @@ -766,6 +766,10 @@ void helper_fninit(CPUX86State *env) > { > env->fpus = 0; > env->fpstt = 0; > + env->fpcs = 0; > + env->fpip = 0; > + env->fpds = 0; > + env->fpdp = 0; > cpu_set_fpuc(env, 0x37f); > env->fptags[0] = 1; > env->fptags[1] = 1; > @@ -2368,6 +2372,7 @@ static void do_fstenv(CPUX86State *env, target_ulong > ptr, int data32, > { > int fpus, fptag, exp, i; > uint64_t mant; > + uint16_t fpcs, fpds; > CPU_LDoubleU tmp; > > fpus = (env->fpus & ~0x3800) | (env->fpstt & 0x7) << 11; > @@ -2390,24 +2395,39 @@ static void do_fstenv(CPUX86State *env, target_ulong > ptr, int data32, > } > } > } > + > + /* > + * If CR0.PE = 1, each instruction saves FCS and FDS into memory. If > + * CPUID.(EAX=07H,ECX=0H):EBX[bit 13] = 1, the processor deprecates > + * FCS and FDS; it saves each as 0000H. > + */ > + if (!(env->features[FEAT_7_0_EBX] & CPUID_7_0_EBX_FCS_FDS) > + && (env->cr[0] & CR0_PE_MASK)) { > + fpcs = env->fpcs; > + fpds = env->fpds; > + } else { > + fpcs = 0; > + fpds = 0; > + } > + > if (data32) { > /* 32 bit */ > cpu_stl_data_ra(env, ptr, env->fpuc, retaddr); > cpu_stl_data_ra(env, ptr + 4, fpus, retaddr); > cpu_stl_data_ra(env, ptr + 8, fptag, retaddr); > - cpu_stl_data_ra(env, ptr + 12, 0, retaddr); /* fpip */ > - cpu_stl_data_ra(env, ptr + 16, 0, retaddr); /* fpcs */ > - cpu_stl_data_ra(env, ptr + 20, 0, retaddr); /* fpoo */ > - cpu_stl_data_ra(env, ptr + 24, 0, retaddr); /* fpos */ > + cpu_stl_data_ra(env, ptr + 12, env->fpip, retaddr); /* fpip */ > + cpu_stl_data_ra(env, ptr + 16, fpcs, retaddr); /* fpcs */ > + cpu_stl_data_ra(env, ptr + 20, env->fpdp, retaddr); /* fpdp */ > + cpu_stl_data_ra(env, ptr + 24, fpds, retaddr); /* fpds */ > } else { > /* 16 bit */ > cpu_stw_data_ra(env, ptr, env->fpuc, retaddr); > cpu_stw_data_ra(env, ptr + 2, fpus, retaddr); > cpu_stw_data_ra(env, ptr + 4, fptag, retaddr); > - cpu_stw_data_ra(env, ptr + 6, 0, retaddr); > - cpu_stw_data_ra(env, ptr + 8, 0, retaddr); > - cpu_stw_data_ra(env, ptr + 10, 0, retaddr); > - cpu_stw_data_ra(env, ptr + 12, 0, retaddr); > + cpu_stw_data_ra(env, ptr + 6, env->fpip, retaddr); > + cpu_stw_data_ra(env, ptr + 8, fpcs, retaddr); > + cpu_stw_data_ra(env, ptr + 10, env->fpdp, retaddr); > + cpu_stw_data_ra(env, ptr + 12, fpds, retaddr); > } > } > > @@ -2473,17 +2493,7 @@ void helper_fsave(CPUX86State *env, target_ulong ptr, > int data32) > } > > /* fninit */ > - env->fpus = 0; > - env->fpstt = 0; > - cpu_set_fpuc(env, 0x37f); > - env->fptags[0] = 1; > - env->fptags[1] = 1; > - env->fptags[2] = 1; > - env->fptags[3] = 1; > - env->fptags[4] = 1; > - env->fptags[5] = 1; > - env->fptags[6] = 1; > - env->fptags[7] = 1; > + helper_fninit(env); > } > > void helper_frstor(CPUX86State *env, target_ulong ptr, int data32) > diff --git a/target/i386/tcg/translate.c b/target/i386/tcg/translate.c > index 52e94fe106..59647ea5b7 100644 > --- a/target/i386/tcg/translate.c > +++ b/target/i386/tcg/translate.c > @@ -5839,6 +5839,11 @@ static target_ulong disas_insn(DisasContext *s, > CPUState *cpu) > /* floats */ > case 0xd8 ... 0xdf: > { > + TCGv last_addr = tcg_temp_new(); > + int last_seg; > + bool update_fdp = false; > + bool update_fip = true; > + > if (s->flags & (HF_EM_MASK | HF_TS_MASK)) { > /* if CR0.EM or CR0.TS are set, generate an FPU exception > */ > /* XXX: what to do if illegal op ? */ > @@ -5851,7 +5856,14 @@ static target_ulong disas_insn(DisasContext *s, > CPUState *cpu) > op = ((b & 7) << 3) | ((modrm >> 3) & 7); > if (mod != 3) { > /* memory op */ > - gen_lea_modrm(env, s, modrm); > + AddressParts a = gen_lea_modrm_0(env, s, modrm); > + TCGv ea = gen_lea_modrm_1(s, a); > + > + update_fdp = true; > + last_seg = a.def_seg; > + tcg_gen_mov_tl(last_addr, ea); > + gen_lea_v_seg(s, s->aflag, ea, a.def_seg, s->override); > + > switch (op) { > case 0x00 ... 0x07: /* fxxxs */ > case 0x10 ... 0x17: /* fixxxl */ > @@ -5978,19 +5990,23 @@ static target_ulong disas_insn(DisasContext *s, > CPUState *cpu) > break; > case 0x0c: /* fldenv mem */ > gen_helper_fldenv(cpu_env, s->A0, tcg_const_i32(dflag - > 1)); > + update_fip = update_fdp = false; > break; > case 0x0d: /* fldcw mem */ > tcg_gen_qemu_ld_i32(s->tmp2_i32, s->A0, > s->mem_index, MO_LEUW); > gen_helper_fldcw(cpu_env, s->tmp2_i32); > + update_fip = update_fdp = false; > break; > case 0x0e: /* fnstenv mem */ > gen_helper_fstenv(cpu_env, s->A0, tcg_const_i32(dflag - > 1)); > + update_fip = update_fdp = false; > break; > case 0x0f: /* fnstcw mem */ > gen_helper_fnstcw(s->tmp2_i32, cpu_env); > tcg_gen_qemu_st_i32(s->tmp2_i32, s->A0, > s->mem_index, MO_LEUW); > + update_fip = update_fdp = false; > break; > case 0x1d: /* fldt mem */ > gen_helper_fldt_ST0(cpu_env, s->A0); > @@ -6001,14 +6017,17 @@ static target_ulong disas_insn(DisasContext *s, > CPUState *cpu) > break; > case 0x2c: /* frstor mem */ > gen_helper_frstor(cpu_env, s->A0, tcg_const_i32(dflag - > 1)); > + update_fip = update_fdp = false; > break; > case 0x2e: /* fnsave mem */ > gen_helper_fsave(cpu_env, s->A0, tcg_const_i32(dflag - > 1)); > + update_fip = update_fdp = false; > break; > case 0x2f: /* fnstsw mem */ > gen_helper_fnstsw(s->tmp2_i32, cpu_env); > tcg_gen_qemu_st_i32(s->tmp2_i32, s->A0, > s->mem_index, MO_LEUW); > + update_fip = update_fdp = false; > break; > case 0x3c: /* fbld */ > gen_helper_fbld_ST0(cpu_env, s->A0); > @@ -6051,6 +6070,7 @@ static target_ulong disas_insn(DisasContext *s, > CPUState *cpu) > case 0: /* fnop */ > /* check exceptions (FreeBSD FPU probe) */ > gen_helper_fwait(cpu_env); > + update_fip = update_fdp = false; > break; > default: > goto unknown_op; > @@ -6220,9 +6240,11 @@ static target_ulong disas_insn(DisasContext *s, > CPUState *cpu) > break; > case 2: /* fclex */ > gen_helper_fclex(cpu_env); > + update_fip = update_fdp = false; > break; > case 3: /* fninit */ > gen_helper_fninit(cpu_env); > + update_fip = update_fdp = false; > break; > case 4: /* fsetpm (287 only, just do nop here) */ > break; > @@ -6343,6 +6365,27 @@ static target_ulong disas_insn(DisasContext *s, > CPUState *cpu) > goto unknown_op; > } > } > + > + if (update_fip) { > + tcg_gen_ld32u_tl(s->T0, cpu_env, > + offsetof(CPUX86State, > segs[R_CS].selector)); > + tcg_gen_st16_tl(s->T0, cpu_env, offsetof(CPUX86State, > fpcs)); > + > + tcg_gen_movi_tl(s->T0, pc_start - s->cs_base); > + tcg_gen_st_tl(s->T0, cpu_env, offsetof(CPUX86State, > fpip)); > + } > + > + if (update_fdp) { > + if (s->override >= 0) { > + last_seg = s->override; > + } > + tcg_gen_ld32u_tl(s->T0, cpu_env, > + offsetof(CPUX86State, > + segs[last_seg].selector)); > + tcg_gen_st16_tl(s->T0, cpu_env, offsetof(CPUX86State, > fpds)); > + > + tcg_gen_st_tl(last_addr, cpu_env, offsetof(CPUX86State, > fpdp)); > + } > } > break; > /************************/ > -- > 2.25.1 > >