From: "Zhu, Lingshan" <lingshan.zhu@intel.com>
To: Liuxiangdong <liuxiangdong5@huawei.com>,
peterz@infradead.org, pbonzini@redhat.com
Cc: bp@alien8.de, seanjc@google.com, vkuznets@redhat.com,
wanpengli@tencent.com, jmattson@google.com, joro@8bytes.org,
weijiang.yang@intel.com, kan.liang@linux.intel.com,
ak@linux.intel.com, wei.w.wang@intel.com, eranian@google.com,
linux-kernel@vger.kernel.org, x86@kernel.org,
kvm@vger.kernel.org, like.xu.linux@gmail.com,
"Fangyi (Eric)" <eric.fangyi@huawei.com>,
Xiexiangyou <xiexiangyou@huawei.com>
Subject: Re: [PATCH V7 00/18] KVM: x86/pmu: Add *basic* support to enable guest PEBS via DS
Date: Fri, 25 Jun 2021 17:46:12 +0800 [thread overview]
Message-ID: <37832cc0-788d-91b9-dc95-147eca133842@intel.com> (raw)
In-Reply-To: <60D5A487.8020507@huawei.com>
On 6/25/2021 5:40 PM, Liuxiangdong wrote:
>
>
> On 2021/6/22 17:42, Zhu Lingshan wrote:
>> The guest Precise Event Based Sampling (PEBS) feature can provide an
>> architectural state of the instruction executed after the guest
>> instruction that exactly caused the event. It needs new hardware
>> facility only available on Intel Ice Lake Server platforms. This
>> patch set enables the basic PEBS feature for KVM guests on ICX.
>>
>> We can use PEBS feature on the Linux guest like native:
>>
>> # echo 0 > /proc/sys/kernel/watchdog (on the host)
>
> Only on the host?
> I cannot use pebs unless try with "echo 0 > /proc/sys/kernel/watchdog"
> both on the host and guest on ICX.
Hi Xiangdong
I guess you may run into the "cross-map" case(slow path below), so I
think you can disable them both in host and guest to make PEBS work.
Thanks
>
>> # perf record -e instructions:ppp ./br_instr a
>> # perf record -c 100000 -e instructions:pp ./br_instr a
>>
>> To emulate guest PEBS facility for the above perf usages, we need to
>> implement 2 code paths:
>>
>> 1) Fast path
>>
>> This is when the host assigned physical PMC has an identical index as
>> the virtual PMC (e.g. using physical PMC0 to emulate virtual PMC0).
>> This path is used in most common use cases.
>>
>> 2) Slow path
>>
>> This is when the host assigned physical PMC has a different index
>> from the virtual PMC (e.g. using physical PMC1 to emulate virtual
>> PMC0) In this case, KVM needs to rewrite the PEBS records to change
>> the applicable counter indexes to the virtual PMC indexes, which
>> would otherwise contain the physical counter index written by PEBS
>> facility, and switch the counter reset values to the offset
>> corresponding to the physical counter indexes in the DS data structure.
>>
>> The previous version [0] enables both fast path and slow path, which
>> seems a bit more complex as the first step. In this patchset, we want
>> to start with the fast path to get the basic guest PEBS enabled while
>> keeping the slow path disabled. More focused discussion on the slow
>> path [1] is planned to be put to another patchset in the next step.
>>
>> Compared to later versions in subsequent steps, the functionality to
>> support host-guest PEBS both enabled and the functionality to emulate
>> guest PEBS when the counter is cross-mapped are missing in this patch
>> set (neither of these are typical scenarios).
>>
>> With the basic support, the guest can retrieve the correct PEBS
>> information from its own PEBS records on the Ice Lake servers.
>> And we expect it should work when migrating to another Ice Lake and
>> no regression about host perf is expected.
>>
>> Here are the results of pebs test from guest/host for same workload:
>>
>> perf report on guest:
>> # Samples: 2K of event 'instructions:ppp', # Event count (approx.):
>> 1473377250 # Overhead Command Shared Object Symbol
>> 57.74% br_instr br_instr [.] lfsr_cond
>> 41.40% br_instr br_instr [.] cmp_end
>> 0.21% br_instr [kernel.kallsyms] [k] __lock_acquire
>>
>> perf report on host:
>> # Samples: 2K of event 'instructions:ppp', # Event count (approx.):
>> 1462721386 # Overhead Command Shared Object Symbol
>> 57.90% br_instr br_instr [.] lfsr_cond
>> 41.95% br_instr br_instr [.] cmp_end
>> 0.05% br_instr [kernel.vmlinux] [k] lock_acquire
>> Conclusion: the profiling results on the guest are similar
>> tothat on the host.
>>
>> A minimum guest kernel version may be v5.4 or a backport version
>> support Icelake server PEBS.
>>
>> Please check more details in each commit and feel free to comment.
>>
>> Previous:
>> https://lore.kernel.org/kvm/20210511024214.280733-1-like.xu@linux.intel.com/
>>
>>
>> [0]
>> https://lore.kernel.org/kvm/20210104131542.495413-1-like.xu@linux.intel.com/
>>
>> [1]
>> https://lore.kernel.org/kvm/20210115191113.nktlnmivc3edstiv@two.firstfloor.org/
>>
>>
>> V6 -> V7 Changelog:
>> - Fix conditions order and call x86_pmu_handle_guest_pebs()
>> unconditionally; (PeterZ)
>> - Add a new patch to make all that perf_guest_cbs stuff suck less;
>> (PeterZ)
>> - Document IA32_MISC_ENABLE[7] that that behavior matches bare metal;
>> (Sean & Venkatesh)
>> - Update commit message for fixed counter mask refactoring;(PeterZ)
>> - Clarifying comments about {.host and .guest} for
>> intel_guest_get_msrs(); (PeterZ)
>> - Add pebs_capable to store valid PEBS_COUNTER_MASK value; (PeterZ)
>> - Add more comments for perf's precise_ip field; (Andi & PeterZ)
>> - Refactor perf_overflow_handler_t and make it more legible; (PeterZ)
>> - Use "(unsigned long)cpuc->ds" instead of
>> __this_cpu_read(cpu_hw_events.ds); (PeterZ)
>> - Keep using "(struct kvm_pmu *)data" to follow K&R; (Andi)
>>
>> Like Xu (17):
>> perf/core: Use static_call to optimize perf_guest_info_callbacks
>> perf/x86/intel: Add EPT-Friendly PEBS for Ice Lake Server
>> perf/x86/intel: Handle guest PEBS overflow PMI for KVM guest
>> perf/x86/core: Pass "struct kvm_pmu *" to determine the guest values
>> KVM: x86/pmu: Set MSR_IA32_MISC_ENABLE_EMON bit when vPMU is enabled
>> KVM: x86/pmu: Introduce the ctrl_mask value for fixed counter
>> KVM: x86/pmu: Add IA32_PEBS_ENABLE MSR emulation for extended PEBS
>> KVM: x86/pmu: Reprogram PEBS event to emulate guest PEBS counter
>> KVM: x86/pmu: Adjust precise_ip to emulate Ice Lake guest PDIR
>> counter
>> KVM: x86/pmu: Add IA32_DS_AREA MSR emulation to support guest DS
>> KVM: x86/pmu: Add PEBS_DATA_CFG MSR emulation to support adaptive
>> PEBS
>> KVM: x86: Set PEBS_UNAVAIL in IA32_MISC_ENABLE when PEBS is enabled
>> KVM: x86/pmu: Move pmc_speculative_in_use() to arch/x86/kvm/pmu.h
>> KVM: x86/pmu: Disable guest PEBS temporarily in two rare situations
>> KVM: x86/pmu: Add kvm_pmu_cap to optimize perf_get_x86_pmu_capability
>> KVM: x86/cpuid: Refactor host/guest CPU model consistency check
>> KVM: x86/pmu: Expose CPUIDs feature bits PDCM, DS, DTES64
>>
>> Peter Zijlstra (Intel) (1):
>> x86/perf/core: Add pebs_capable to store valid PEBS_COUNTER_MASK
>> value
>>
>> arch/arm/kernel/perf_callchain.c | 16 +--
>> arch/arm64/kernel/perf_callchain.c | 29 +++--
>> arch/arm64/kvm/perf.c | 22 ++--
>> arch/csky/kernel/perf_callchain.c | 4 +-
>> arch/nds32/kernel/perf_event_cpu.c | 16 +--
>> arch/riscv/kernel/perf_callchain.c | 4 +-
>> arch/x86/events/core.c | 43 ++++++--
>> arch/x86/events/intel/core.c | 165 +++++++++++++++++++++++------
>> arch/x86/events/perf_event.h | 6 +-
>> arch/x86/include/asm/kvm_host.h | 18 +++-
>> arch/x86/include/asm/msr-index.h | 6 ++
>> arch/x86/include/asm/perf_event.h | 5 +-
>> arch/x86/kvm/cpuid.c | 24 ++---
>> arch/x86/kvm/cpuid.h | 5 +
>> arch/x86/kvm/pmu.c | 60 ++++++++---
>> arch/x86/kvm/pmu.h | 38 +++++++
>> arch/x86/kvm/vmx/capabilities.h | 26 +++--
>> arch/x86/kvm/vmx/pmu_intel.c | 115 ++++++++++++++++----
>> arch/x86/kvm/vmx/vmx.c | 24 ++++-
>> arch/x86/kvm/vmx/vmx.h | 2 +-
>> arch/x86/kvm/x86.c | 51 +++++----
>> arch/x86/xen/pmu.c | 33 +++---
>> include/linux/perf_event.h | 12 ++-
>> kernel/events/core.c | 9 ++
>> 24 files changed, 544 insertions(+), 189 deletions(-)
>>
>
next prev parent reply other threads:[~2021-06-25 9:46 UTC|newest]
Thread overview: 39+ messages / expand[flat|nested] mbox.gz Atom feed top
2021-06-22 9:42 [PATCH V7 00/18] KVM: x86/pmu: Add *basic* support to enable guest PEBS via DS Zhu Lingshan
2021-06-22 9:42 ` [PATCH V7 01/18] perf/core: Use static_call to optimize perf_guest_info_callbacks Zhu Lingshan
2021-07-02 11:22 ` Peter Zijlstra
2021-07-02 16:00 ` Joe Perches
2021-07-02 16:19 ` Peter Zijlstra
2021-07-02 16:42 ` Joe Perches
2021-07-02 16:38 ` Mark Rutland
2021-07-02 16:56 ` Joe Perches
2021-07-08 8:53 ` Zhu Lingshan
2021-06-22 9:42 ` [PATCH V7 02/18] perf/x86/intel: Add EPT-Friendly PEBS for Ice Lake Server Zhu Lingshan
2021-06-22 9:42 ` [PATCH V7 03/18] perf/x86/intel: Handle guest PEBS overflow PMI for KVM guest Zhu Lingshan
2021-07-02 11:28 ` Peter Zijlstra
2021-07-08 13:39 ` Zhu, Lingshan
2021-06-22 9:42 ` [PATCH V7 04/18] perf/x86/core: Pass "struct kvm_pmu *" to determine the guest values Zhu Lingshan
2021-06-22 9:42 ` [PATCH V7 05/18] KVM: x86/pmu: Set MSR_IA32_MISC_ENABLE_EMON bit when vPMU is enabled Zhu Lingshan
2021-06-22 9:42 ` [PATCH V7 06/18] KVM: x86/pmu: Introduce the ctrl_mask value for fixed counter Zhu Lingshan
2021-06-22 9:42 ` [PATCH V7 07/18] x86/perf/core: Add pebs_capable to store valid PEBS_COUNTER_MASK value Zhu Lingshan
2021-06-22 9:42 ` [PATCH V7 08/18] KVM: x86/pmu: Add IA32_PEBS_ENABLE MSR emulation for extended PEBS Zhu Lingshan
2021-06-22 9:42 ` [PATCH V7 09/18] KVM: x86/pmu: Reprogram PEBS event to emulate guest PEBS counter Zhu Lingshan
2021-06-22 9:42 ` [PATCH V7 10/18] KVM: x86/pmu: Adjust precise_ip to emulate Ice Lake guest PDIR counter Zhu Lingshan
2021-06-22 9:42 ` [PATCH V7 11/18] KVM: x86/pmu: Add IA32_DS_AREA MSR emulation to support guest DS Zhu Lingshan
2021-07-02 11:52 ` Peter Zijlstra
2021-07-08 8:52 ` Zhu, Lingshan
2021-06-22 9:43 ` [PATCH V7 12/18] KVM: x86/pmu: Add PEBS_DATA_CFG MSR emulation to support adaptive PEBS Zhu Lingshan
2021-06-22 9:43 ` [PATCH V7 13/18] KVM: x86: Set PEBS_UNAVAIL in IA32_MISC_ENABLE when PEBS is enabled Zhu Lingshan
2021-06-22 9:43 ` [PATCH V7 14/18] KVM: x86/pmu: Move pmc_speculative_in_use() to arch/x86/kvm/pmu.h Zhu Lingshan
2021-06-22 9:43 ` [PATCH V7 15/18] KVM: x86/pmu: Disable guest PEBS temporarily in two rare situations Zhu Lingshan
2021-07-02 12:46 ` Peter Zijlstra
2021-07-08 8:52 ` Zhu, Lingshan
2021-06-22 9:43 ` [PATCH V7 16/18] KVM: x86/pmu: Add kvm_pmu_cap to optimize perf_get_x86_pmu_capability Zhu Lingshan
2021-06-22 9:43 ` [PATCH V7 17/18] KVM: x86/cpuid: Refactor host/guest CPU model consistency check Zhu Lingshan
2021-06-22 9:43 ` [PATCH V7 18/18] KVM: x86/pmu: Expose CPUIDs feature bits PDCM, DS, DTES64 Zhu Lingshan
2021-06-25 9:40 ` [PATCH V7 00/18] KVM: x86/pmu: Add *basic* support to enable guest PEBS via DS Liuxiangdong
2021-06-25 9:46 ` Zhu, Lingshan [this message]
2021-06-28 7:49 ` Wang, Wei W
2021-06-28 7:53 ` Zhu, Lingshan
2021-07-02 12:49 ` Peter Zijlstra
2021-07-12 1:37 ` Liuxiangdong
2021-07-12 10:34 ` Like Xu
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=37832cc0-788d-91b9-dc95-147eca133842@intel.com \
--to=lingshan.zhu@intel.com \
--cc=ak@linux.intel.com \
--cc=bp@alien8.de \
--cc=eranian@google.com \
--cc=eric.fangyi@huawei.com \
--cc=jmattson@google.com \
--cc=joro@8bytes.org \
--cc=kan.liang@linux.intel.com \
--cc=kvm@vger.kernel.org \
--cc=like.xu.linux@gmail.com \
--cc=linux-kernel@vger.kernel.org \
--cc=liuxiangdong5@huawei.com \
--cc=pbonzini@redhat.com \
--cc=peterz@infradead.org \
--cc=seanjc@google.com \
--cc=vkuznets@redhat.com \
--cc=wanpengli@tencent.com \
--cc=wei.w.wang@intel.com \
--cc=weijiang.yang@intel.com \
--cc=x86@kernel.org \
--cc=xiexiangyou@huawei.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).