From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-12.2 required=3.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,HEADER_FROM_DIFFERENT_DOMAINS,INCLUDES_PATCH, MAILING_LIST_MULTI,NICE_REPLY_A,SIGNED_OFF_BY,SPF_HELO_NONE,SPF_PASS, URIBL_BLOCKED,USER_AGENT_SANE_1 autolearn=unavailable autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id 18543C433E0 for ; Fri, 31 Jul 2020 13:58:45 +0000 (UTC) Received: from merlin.infradead.org (merlin.infradead.org [205.233.59.134]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPS id D55A822B40 for ; Fri, 31 Jul 2020 13:58:44 +0000 (UTC) Authentication-Results: mail.kernel.org; dkim=pass (2048-bit key) header.d=lists.infradead.org header.i=@lists.infradead.org header.b="JLNPyxwV" DMARC-Filter: OpenDMARC Filter v1.3.2 mail.kernel.org D55A822B40 Authentication-Results: mail.kernel.org; dmarc=none (p=none dis=none) header.from=arm.com Authentication-Results: mail.kernel.org; spf=none smtp.mailfrom=linux-arm-kernel-bounces+linux-arm-kernel=archiver.kernel.org@lists.infradead.org DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=lists.infradead.org; s=merlin.20170209; h=Sender:Content-Type: Content-Transfer-Encoding:Cc:List-Subscribe:List-Help:List-Post:List-Archive: List-Unsubscribe:List-Id:In-Reply-To:MIME-Version:Date:Message-ID:From: References:To:Subject:Reply-To:Content-ID:Content-Description:Resent-Date: Resent-From:Resent-Sender:Resent-To:Resent-Cc:Resent-Message-ID:List-Owner; bh=I+1MFrFVbQj5JcoZ7tc8QRqnEu5bfKVBAmP/M/iP4Os=; b=JLNPyxwVFeKuLBW8k+46lZPs5 5wzxBZcoZXjxi8WhfaMT1djnRfJK57ZZBh6DqNCCTWHUiJAmfC2sV4UFGbhAkmDogZ6nfEUV7deeI NRFzvg+7TwBjpAFQGdTzaiP40z6dOAf3jHo+STsfnLBu6gtiIDiPYZn1TnjicMvNkL+1WxZwWGgsz gABFh1WgX1ZFIAJ3k32Qu/0170TuDVny78s4INuSeE7F/HrLhyLaLD4HKwAc8IdlIkSfk/7gGOYAj +yEjaeCDCqdFBwfT2lzQ7rK6LsZ2MACqlAUmmkgLT4ULWSjkVNpkrE+KCNP2c6ZVg4GfPlhP47l9d +W2bht4mQ==; Received: from localhost ([::1] helo=merlin.infradead.org) by merlin.infradead.org with esmtp (Exim 4.92.3 #3 (Red Hat Linux)) id 1k1VWf-0004SD-WD; Fri, 31 Jul 2020 13:56:46 +0000 Received: from foss.arm.com ([217.140.110.172]) by merlin.infradead.org with esmtp (Exim 4.92.3 #3 (Red Hat Linux)) id 1k1VWd-0004RT-28 for linux-arm-kernel@lists.infradead.org; Fri, 31 Jul 2020 13:56:44 +0000 Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.121.207.14]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id 1A4CD31B; Fri, 31 Jul 2020 06:56:35 -0700 (PDT) Received: from [10.37.12.83] (unknown [10.37.12.83]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id 313E53F66E; Fri, 31 Jul 2020 06:56:32 -0700 (PDT) Subject: Re: [PATCH 1/4] drivers/perf: Add support for ARMv8.3-SPE To: liwei391@huawei.com, leo.yan@linaro.org, will@kernel.org References: <20200724091607.41903-1-liwei391@huawei.com> <20200724091607.41903-2-liwei391@huawei.com> <20200730081458.GA23324@leoy-ThinkPad-X240s> <92273648-8638-a56f-dbd1-9aaae089d1cc@huawei.com> From: Suzuki K Poulose Message-ID: <6e4af717-e808-6cee-26b8-8d4475aa2945@arm.com> Date: Fri, 31 Jul 2020 15:01:19 +0100 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:52.0) Gecko/20100101 Thunderbird/52.7.0 MIME-Version: 1.0 In-Reply-To: <92273648-8638-a56f-dbd1-9aaae089d1cc@huawei.com> Content-Language: en-US X-CRM114-Version: 20100106-BlameMichelson ( TRE 0.8.0 (BSD) ) MR-646709E3 X-CRM114-CacheID: sfid-20200731_095643_234620_7C826BCC X-CRM114-Status: GOOD ( 35.19 ) X-BeenThere: linux-arm-kernel@lists.infradead.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Cc: mark.rutland@arm.com, alexander.shishkin@linux.intel.com, catalin.marinas@arm.com, adrian.hunter@intel.com, acme@kernel.org, linux-kernel@vger.kernel.org, zhangshaokun@hisilicon.com, peterz@infradead.org, mingo@redhat.com, James.Clark@arm.com, guohanjun@huawei.com, namhyung@kernel.org, jolsa@redhat.com, linux-arm-kernel@lists.infradead.org Content-Transfer-Encoding: 7bit Content-Type: text/plain; charset="us-ascii"; Format="flowed" Sender: "linux-arm-kernel" Errors-To: linux-arm-kernel-bounces+linux-arm-kernel=archiver.kernel.org@lists.infradead.org On 07/31/2020 01:18 PM, liwei (GF) wrote: > > > On 2020/7/30 16:14, Leo Yan wrote: >> Hi Suzuki, >> >> On Wed, Jul 29, 2020 at 10:12:50AM +0100, Suzuki Kuruppassery Poulose wrote: >>> On 07/24/2020 10:16 AM, Wei Li wrote: >>>> Armv8.3 extends the SPE by adding: >>>> - Alignment field in the Events packet, and filtering on this event >>>> using PMSEVFR_EL1. >>>> - Support for the Scalable Vector Extension (SVE). >>>> >>>> The main additions for SVE are: >>>> - Recording the vector length for SVE operations in the Operation Type >>>> packet. It is not possible to filter on vector length. >>>> - Incomplete predicate and empty predicate fields in the Events packet, >>>> and filtering on these events using PMSEVFR_EL1. >>>> >>>> Update the check of pmsevfr for empty/partial predicated SVE and >>>> alignment event in kernel driver. >>>> >>>> Signed-off-by: Wei Li >>>> --- >>>> arch/arm64/include/asm/sysreg.h | 4 +++- >>>> drivers/perf/arm_spe_pmu.c | 18 ++++++++++++++---- >>>> 2 files changed, 17 insertions(+), 5 deletions(-) >>>> >>>> diff --git a/arch/arm64/include/asm/sysreg.h b/arch/arm64/include/asm/sysreg.h >>>> index 463175f80341..be4c44ccdb56 100644 >>>> --- a/arch/arm64/include/asm/sysreg.h >>>> +++ b/arch/arm64/include/asm/sysreg.h >>>> @@ -281,7 +281,6 @@ >>>> #define SYS_PMSFCR_EL1_ST_SHIFT 18 >>>> #define SYS_PMSEVFR_EL1 sys_reg(3, 0, 9, 9, 5) >>>> -#define SYS_PMSEVFR_EL1_RES0 0x0000ffff00ff0f55UL >>>> #define SYS_PMSLATFR_EL1 sys_reg(3, 0, 9, 9, 6) >>>> #define SYS_PMSLATFR_EL1_MINLAT_SHIFT 0 >>>> @@ -769,6 +768,9 @@ >>>> #define ID_AA64DFR0_PMUVER_8_5 0x6 >>>> #define ID_AA64DFR0_PMUVER_IMP_DEF 0xf >>>> +#define ID_AA64DFR0_PMSVER_8_2 0x1 >>>> +#define ID_AA64DFR0_PMSVER_8_3 0x2 >>>> + >>>> #define ID_DFR0_PERFMON_SHIFT 24 >>>> #define ID_DFR0_PERFMON_8_1 0x4 >>>> diff --git a/drivers/perf/arm_spe_pmu.c b/drivers/perf/arm_spe_pmu.c >>>> index e51ddb6d63ed..5ec7ee0c8fa1 100644 >>>> --- a/drivers/perf/arm_spe_pmu.c >>>> +++ b/drivers/perf/arm_spe_pmu.c >>>> @@ -54,7 +54,7 @@ struct arm_spe_pmu { >>>> struct hlist_node hotplug_node; >>>> int irq; /* PPI */ >>>> - >>>> + int pmuver; >>>> u16 min_period; >>>> u16 counter_sz; >>>> @@ -80,6 +80,15 @@ struct arm_spe_pmu { >>>> /* Keep track of our dynamic hotplug state */ >>>> static enum cpuhp_state arm_spe_pmu_online; >>>> +static u64 sys_pmsevfr_el1_mask[] = { >>>> + [ID_AA64DFR0_PMSVER_8_2] = GENMASK_ULL(63, 48) | GENMASK_ULL(31, 24) | >>>> + GENMASK_ULL(15, 12) | BIT_ULL(7) | BIT_ULL(5) | BIT_ULL(3) | >>>> + BIT_ULL(1), >>>> + [ID_AA64DFR0_PMSVER_8_3] = GENMASK_ULL(63, 48) | GENMASK_ULL(31, 24) | >>>> + GENMASK_ULL(18, 17) | GENMASK_ULL(15, 11) | BIT_ULL(7) | >>>> + BIT_ULL(5) | BIT_ULL(3) | BIT_ULL(1), >>>> +}; >>>> + >>>> enum arm_spe_pmu_buf_fault_action { >>>> SPE_PMU_BUF_FAULT_ACT_SPURIOUS, >>>> SPE_PMU_BUF_FAULT_ACT_FATAL, >>>> @@ -670,7 +679,7 @@ static int arm_spe_pmu_event_init(struct perf_event *event) >>>> !cpumask_test_cpu(event->cpu, &spe_pmu->supported_cpus)) >>>> return -ENOENT; >>>> - if (arm_spe_event_to_pmsevfr(event) & SYS_PMSEVFR_EL1_RES0) >>>> + if (arm_spe_event_to_pmsevfr(event) & ~sys_pmsevfr_el1_mask[spe_pmu->pmuver]) >>>> return -EOPNOTSUPP; >>>> if (attr->exclude_idle) >>>> @@ -937,6 +946,7 @@ static void __arm_spe_pmu_dev_probe(void *info) >>>> fld, smp_processor_id()); >>>> return; >>>> } >>>> + spe_pmu->pmuver = fld; >>> >>> How do we deal with cases where we have big.LITTLE system with differing >>> SPE versions ? >> >> Good point. >> >> The first question we need to answer is: how to define SPE version? >> From my understanding, if SPE uses the same sample specification and >> the same packet format, then we should consider the SPE is the same >> version cross CPUs. So even some CPUs are ARMv8.2 and other CPUs are >> ARMv8.3 variants, we still should take the SPE as the same version. >> >> And when read the SPE driver in the file drivers/perf/arm_spe_pmu.c and >> I concluded that so far the SPE perf driver is to only support SPE-v1 >> with single instance, it cannot support a complex usage case like >> below: >> >> CPU0-3: ARMv8.2 architecture with SPE >> CPU4-7: ARMv8.3 architecture with SPE >> >> For this case, if we take SPE as two different versions, let's say >> SPE-8.2 and SPE-8.3, then should the SPE driver need to create multi >> perf PMU events? For example, we should create a perf PMU event >> 'arm_spe_8.2' and another PMU event 'arm_spe_8.3'. > > As we have supported SPE v2 (ARMv8.3-SPE) now, if we add the new > of_device_id: "arm,statistical-profiling-extension-v2" and the new > platform_device_id: "arm,spe-v2", we may really support two instance now. > Even two different versions of SPE pmus which work on different range of > cores with the same PPI. Their functional scopes are the same as the PPI partitions. > >> Another option is we always take this as SPE-v1 and only create single >> PMU event, just keep what's we are doing with the perf event >> 'arm_spe_0', but the driver needs to dynamically detect SPE PMU version >> number in the function arm_spe_pmu_event_init(), and then based on >> version number to select corresponding mask for PMSEVFR. > > Thus, the driver will service two devices, and will also register two PMUs 'arm_spe_0' and > 'arm_spe_1', so i think there is no conflict here. Or the other option is to strictly support only one version - the lowest version supported by the system, which is compatible with all the others. This could be achieved by using the sanitised feature value of the SPE version. Will, What do you prefer ? > > Back to Suzuki's question, refer to the ACPI parsing code for SPE device, function > arm_spe_acpi_register_device(), there is a check of hetero_id for all cores. > It seems that we only support homogeneous ACPI/SPE machines, but i can't find the similar > check in OF/SPE parsing code. I think the driver assumes that the system supports uniform version of the SPE (which was valid when the driver was written). Cheers Suzuki _______________________________________________ linux-arm-kernel mailing list linux-arm-kernel@lists.infradead.org http://lists.infradead.org/mailman/listinfo/linux-arm-kernel