From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by smtp.lore.kernel.org (Postfix) with ESMTP id C44A8C4332F for ; Wed, 15 Dec 2021 16:33:51 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S230350AbhLOQdu (ORCPT ); Wed, 15 Dec 2021 11:33:50 -0500 Received: from foss.arm.com ([217.140.110.172]:57148 "EHLO foss.arm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S244841AbhLOQds (ORCPT ); Wed, 15 Dec 2021 11:33:48 -0500 Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.121.207.14]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id 6EBF26D; Wed, 15 Dec 2021 08:33:48 -0800 (PST) Received: from FVFF77S0Q05N (unknown [10.57.67.176]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id 7279A3F5A1; Wed, 15 Dec 2021 08:33:46 -0800 (PST) Date: Wed, 15 Dec 2021 16:33:43 +0000 From: Mark Rutland To: German Gomez Cc: linux-kernel@vger.kernel.org, linux-perf-users@vger.kernel.org, acme@kernel.org, Alexandre Truong , John Garry , Will Deacon , Mathieu Poirier , Leo Yan , Alexander Shishkin , Jiri Olsa , Namhyung Kim , linux-arm-kernel@lists.infradead.org Subject: Re: [PATCH v4 6/6] perf tools: determine if LR is the return address Message-ID: References: <20211215151139.40854-1-german.gomez@arm.com> <20211215151139.40854-7-german.gomez@arm.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20211215151139.40854-7-german.gomez@arm.com> Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi, On Wed, Dec 15, 2021 at 03:11:38PM +0000, German Gomez wrote: > From: Alexandre Truong > > On arm64 and frame pointer mode (e.g: perf record --callgraph fp), > use dwarf unwind info to check if the link register is the return > address in order to inject it to the frame pointer stack. This series looks good overall, but as a general note the commit messages are a bit hard to read because they jump into implementation details of the patch (i.e. the change the patch makes) before explaining the problem (i.e. what the patch is trying to solve). It would be nice to have a short introduction, e.g. When unwinding using frame pointers on arm64, the return address of the current leaf function may be missed. The return address of a leaf function may live in the LR and/or a frame record (and the location can change within a function), so it is necessary to use DWARF to identify where to look for the return address at any given point during a function. For example: unsigned long foo(unsigned long i) { i += 2; i += 5; } ... could be compiled as: foo: // return addr in LR add x0, x0, #2 // return addr in LR stp x29, x30, [SP, #-16]! // return addr in LR mov x29, sp // return addr in LR *and* frame record add x0, x0, #5 // return addr in LR *and* frame record ldp x29, x30, [sp], #16 // return addr in LR ret > Write the following application: > > int a = 10; > > void f2(void) > { > for (int i = 0; i < 1000000; i++) > a *= a; > } > > void f1() > { > for (int i = 0; i < 10; i++) > f2(); > } > > int main(void) > { > f1(); > return 0; > } > > with the following compilation flags: > gcc -fno-omit-frame-pointer -fno-inline -O2 > > The compiler omits the frame pointer for f2 on arm. This is a problem > with any leaf call, for example an application with many different > calls to malloc() would always omit the calling frame, even if it > can be determined. I think the wording here is slightly misleading. For f2, the compiler *doesn't create a frame record*, but the frame pointer (to the caller's frame record) remains and is not omitted. Also, I think it's woth noting (as per the example I gave above) this applies to *any* function which is the current leaf function, regardless of whether that function creates a frame record at some point. For example, if `f1` is interrupted before it creates its own frame record (or after it destroys the frame record), the FP will point at the record created by `main` (containing the caller of main), and `main` itself will be missing from the unwind as it will only exist in the LR. > ./perf record --call-graph fp ./a.out > ./perf report > > currently gives the following stack: > > 0xffffea52f361 > _start > __libc_start_main > main > f2 > > After this change, perf report correctly shows f1() calling f2(), > even though it was missing from the frame pointer unwind: > > ./perf report > > 0xffffea52f361 > _start > __libc_start_main > main > f1 > f2 > > Signed-off-by: Alexandre Truong > Signed-off-by: German Gomez > --- > tools/perf/util/Build | 1 + > .../util/arm64-frame-pointer-unwind-support.c | 63 +++++++++++++++++++ > .../util/arm64-frame-pointer-unwind-support.h | 10 +++ > tools/perf/util/machine.c | 19 ++++-- > tools/perf/util/machine.h | 1 + > 5 files changed, 89 insertions(+), 5 deletions(-) > create mode 100644 tools/perf/util/arm64-frame-pointer-unwind-support.c > create mode 100644 tools/perf/util/arm64-frame-pointer-unwind-support.h > > diff --git a/tools/perf/util/Build b/tools/perf/util/Build > index 2e5bfbb69960..03d4c647bd86 100644 > --- a/tools/perf/util/Build > +++ b/tools/perf/util/Build > @@ -1,3 +1,4 @@ > +perf-y += arm64-frame-pointer-unwind-support.o > perf-y += annotate.o > perf-y += block-info.o > perf-y += block-range.o > diff --git a/tools/perf/util/arm64-frame-pointer-unwind-support.c b/tools/perf/util/arm64-frame-pointer-unwind-support.c > new file mode 100644 > index 000000000000..4f5ecf51ed38 > --- /dev/null > +++ b/tools/perf/util/arm64-frame-pointer-unwind-support.c > @@ -0,0 +1,63 @@ > +// SPDX-License-Identifier: GPL-2.0 > +#include "arm64-frame-pointer-unwind-support.h" > +#include "callchain.h" > +#include "event.h" > +#include "perf_regs.h" // SMPL_REG_MASK > +#include "unwind.h" > + > +#define perf_event_arm_regs perf_event_arm64_regs > +#include "../arch/arm64/include/uapi/asm/perf_regs.h" > +#undef perf_event_arm_regs > + > +struct entries { > + u64 stack[2]; > + size_t length; > +}; > + > +static bool get_leaf_frame_caller_enabled(struct perf_sample *sample) > +{ > + return callchain_param.record_mode == CALLCHAIN_FP && sample->user_regs.regs > + && sample->user_regs.mask & SMPL_REG_MASK(PERF_REG_ARM64_LR); > +} > + > +static int add_entry(struct unwind_entry *entry, void *arg) > +{ > + struct entries *entries = arg; > + > + entries->stack[entries->length++] = entry->ip; > + return 0; > +} > + > +u64 get_leaf_frame_caller_aarch64(struct perf_sample *sample, struct thread *thread, int usr_idx) > +{ > + int ret; > + struct entries entries = {}; > + struct regs_dump old_regs = sample->user_regs; > + > + if (!get_leaf_frame_caller_enabled(sample)) > + return 0; > + > + /* > + * If PC and SP are not recorded, get the value of PC from the stack > + * and set its mask. SP is not used when doing the unwinding but it > + * still needs to be set to prevent failures. > + */ To prevent failures where? Is this something libunwind requires? Thanks, Mark. > + if (!(sample->user_regs.mask & SMPL_REG_MASK(PERF_REG_ARM64_PC))) { > + sample->user_regs.cache_mask |= SMPL_REG_MASK(PERF_REG_ARM64_PC); > + sample->user_regs.cache_regs[PERF_REG_ARM64_PC] = sample->callchain->ips[usr_idx+1]; > + } > + > + if (!(sample->user_regs.mask & SMPL_REG_MASK(PERF_REG_ARM64_SP))) { > + sample->user_regs.cache_mask |= SMPL_REG_MASK(PERF_REG_ARM64_SP); > + sample->user_regs.cache_regs[PERF_REG_ARM64_SP] = 0; > + } > + > + ret = unwind__get_entries(add_entry, &entries, thread, sample, 2); > + sample->user_regs = old_regs; > + > + if (ret || entries.length != 2) > + return ret; > + > + return callchain_param.order == ORDER_CALLER ? entries.stack[0] : entries.stack[1]; > +} > diff --git a/tools/perf/util/arm64-frame-pointer-unwind-support.h b/tools/perf/util/arm64-frame-pointer-unwind-support.h > new file mode 100644 > index 000000000000..32af9ce94398 > --- /dev/null > +++ b/tools/perf/util/arm64-frame-pointer-unwind-support.h > @@ -0,0 +1,10 @@ > +/* SPDX-License-Identifier: GPL-2.0 */ > +#ifndef __PERF_ARM_FRAME_POINTER_UNWIND_SUPPORT_H > +#define __PERF_ARM_FRAME_POINTER_UNWIND_SUPPORT_H > + > +#include "event.h" > +#include "thread.h" > + > +u64 get_leaf_frame_caller_aarch64(struct perf_sample *sample, struct thread *thread, int user_idx); > + > +#endif /* __PERF_ARM_FRAME_POINTER_UNWIND_SUPPORT_H */ > diff --git a/tools/perf/util/machine.c b/tools/perf/util/machine.c > index 3eddad009f78..a00fd6796b35 100644 > --- a/tools/perf/util/machine.c > +++ b/tools/perf/util/machine.c > @@ -34,6 +34,7 @@ > #include "bpf-event.h" > #include // page_size > #include "cgroup.h" > +#include "arm64-frame-pointer-unwind-support.h" > > #include > #include > @@ -2710,10 +2711,13 @@ static int find_prev_cpumode(struct ip_callchain *chain, struct thread *thread, > return err; > } > > -static u64 get_leaf_frame_caller(struct perf_sample *sample __maybe_unused, > - struct thread *thread __maybe_unused, int usr_idx __maybe_unused) > +static u64 get_leaf_frame_caller(struct perf_sample *sample, > + struct thread *thread, int usr_idx) > { > - return 0; > + if (machine__normalize_is(thread->maps->machine, "arm64")) > + return get_leaf_frame_caller_aarch64(sample, thread, usr_idx); > + else > + return 0; > } > > static int thread__resolve_callchain_sample(struct thread *thread, > @@ -3114,14 +3118,19 @@ int machine__set_current_tid(struct machine *machine, int cpu, pid_t pid, > } > > /* > - * Compares the raw arch string. N.B. see instead perf_env__arch() if a > - * normalized arch is needed. > + * Compares the raw arch string. N.B. see instead perf_env__arch() or > + * machine__normalize_is() if a normalized arch is needed. > */ > bool machine__is(struct machine *machine, const char *arch) > { > return machine && !strcmp(perf_env__raw_arch(machine->env), arch); > } > > +bool machine__normalize_is(struct machine *machine, const char *arch) > +{ > + return machine && !strcmp(perf_env__arch(machine->env), arch); > +} > + > int machine__nr_cpus_avail(struct machine *machine) > { > return machine ? perf_env__nr_cpus_avail(machine->env) : 0; > diff --git a/tools/perf/util/machine.h b/tools/perf/util/machine.h > index a143087eeb47..665535153411 100644 > --- a/tools/perf/util/machine.h > +++ b/tools/perf/util/machine.h > @@ -208,6 +208,7 @@ static inline bool machine__is_host(struct machine *machine) > } > > bool machine__is(struct machine *machine, const char *arch); > +bool machine__normalize_is(struct machine *machine, const char *arch); > int machine__nr_cpus_avail(struct machine *machine); > > struct thread *__machine__findnew_thread(struct machine *machine, pid_t pid, pid_t tid); > -- > 2.25.1 > From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from bombadil.infradead.org (bombadil.infradead.org [198.137.202.133]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.lore.kernel.org (Postfix) with ESMTPS id 86E9AC433FE for ; Wed, 15 Dec 2021 16:56:35 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=lists.infradead.org; s=bombadil.20210309; h=Sender: Content-Transfer-Encoding:Content-Type:List-Subscribe:List-Help:List-Post: List-Archive:List-Unsubscribe:List-Id:In-Reply-To:MIME-Version:References: Message-ID:Subject:Cc:To:From:Date:Reply-To:Content-ID:Content-Description: Resent-Date:Resent-From:Resent-Sender:Resent-To:Resent-Cc:Resent-Message-ID: List-Owner; bh=S7QOP186P8m4/0lX9sA+bTkqAwbDaokG5ljvHORz7sk=; b=Iva5RGMSFi1jBH 8fquUEwUipmwpkO1WJpDGDNIEGV0zHS4nWy+nilsBfc1XYT9magMFjjRttd6POKIshXIzb0SbDrcs Z8FtiLVFoGsuWQe/8eoFdS9L6EOmbSqrWGyKJsvVDJc+niOZnQM66RcESFJ56+zywEm47oSYz2Trp 1jU0dDHpKgYnByrrky0Y6WIjDTePZi7OpSECZK5+W4KRTsBMNcXY26Tb/SBj9fszQkKhd7Mtfdkgt t3JIgP3r+fhu6r+aOWltzT18FX+BDsIr+iceaJl/d+8pDEooYhJr6i7tvMMtqRF+MLHX+gVTyQPb9 gK4Ax6SF6zZPIpOCuweQ==; Received: from localhost ([::1] helo=bombadil.infradead.org) by bombadil.infradead.org with esmtp (Exim 4.94.2 #2 (Red Hat Linux)) id 1mxXYP-001sxn-Ae; Wed, 15 Dec 2021 16:54:57 +0000 Received: from foss.arm.com ([217.140.110.172]) by bombadil.infradead.org with esmtp (Exim 4.94.2 #2 (Red Hat Linux)) id 1mxXDx-001kXm-LG for linux-arm-kernel@lists.infradead.org; Wed, 15 Dec 2021 16:33:51 +0000 Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.121.207.14]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id 6EBF26D; Wed, 15 Dec 2021 08:33:48 -0800 (PST) Received: from FVFF77S0Q05N (unknown [10.57.67.176]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id 7279A3F5A1; Wed, 15 Dec 2021 08:33:46 -0800 (PST) Date: Wed, 15 Dec 2021 16:33:43 +0000 From: Mark Rutland To: German Gomez Cc: linux-kernel@vger.kernel.org, linux-perf-users@vger.kernel.org, acme@kernel.org, Alexandre Truong , John Garry , Will Deacon , Mathieu Poirier , Leo Yan , Alexander Shishkin , Jiri Olsa , Namhyung Kim , linux-arm-kernel@lists.infradead.org Subject: Re: [PATCH v4 6/6] perf tools: determine if LR is the return address Message-ID: References: <20211215151139.40854-1-german.gomez@arm.com> <20211215151139.40854-7-german.gomez@arm.com> MIME-Version: 1.0 Content-Disposition: inline In-Reply-To: <20211215151139.40854-7-german.gomez@arm.com> X-CRM114-Version: 20100106-BlameMichelson ( TRE 0.8.0 (BSD) ) MR-646709E3 X-CRM114-CacheID: sfid-20211215_083349_867757_B3A9111E X-CRM114-Status: GOOD ( 45.28 ) X-BeenThere: linux-arm-kernel@lists.infradead.org X-Mailman-Version: 2.1.34 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Sender: "linux-arm-kernel" Errors-To: linux-arm-kernel-bounces+linux-arm-kernel=archiver.kernel.org@lists.infradead.org Hi, On Wed, Dec 15, 2021 at 03:11:38PM +0000, German Gomez wrote: > From: Alexandre Truong > > On arm64 and frame pointer mode (e.g: perf record --callgraph fp), > use dwarf unwind info to check if the link register is the return > address in order to inject it to the frame pointer stack. This series looks good overall, but as a general note the commit messages are a bit hard to read because they jump into implementation details of the patch (i.e. the change the patch makes) before explaining the problem (i.e. what the patch is trying to solve). It would be nice to have a short introduction, e.g. When unwinding using frame pointers on arm64, the return address of the current leaf function may be missed. The return address of a leaf function may live in the LR and/or a frame record (and the location can change within a function), so it is necessary to use DWARF to identify where to look for the return address at any given point during a function. For example: unsigned long foo(unsigned long i) { i += 2; i += 5; } ... could be compiled as: foo: // return addr in LR add x0, x0, #2 // return addr in LR stp x29, x30, [SP, #-16]! // return addr in LR mov x29, sp // return addr in LR *and* frame record add x0, x0, #5 // return addr in LR *and* frame record ldp x29, x30, [sp], #16 // return addr in LR ret > Write the following application: > > int a = 10; > > void f2(void) > { > for (int i = 0; i < 1000000; i++) > a *= a; > } > > void f1() > { > for (int i = 0; i < 10; i++) > f2(); > } > > int main(void) > { > f1(); > return 0; > } > > with the following compilation flags: > gcc -fno-omit-frame-pointer -fno-inline -O2 > > The compiler omits the frame pointer for f2 on arm. This is a problem > with any leaf call, for example an application with many different > calls to malloc() would always omit the calling frame, even if it > can be determined. I think the wording here is slightly misleading. For f2, the compiler *doesn't create a frame record*, but the frame pointer (to the caller's frame record) remains and is not omitted. Also, I think it's woth noting (as per the example I gave above) this applies to *any* function which is the current leaf function, regardless of whether that function creates a frame record at some point. For example, if `f1` is interrupted before it creates its own frame record (or after it destroys the frame record), the FP will point at the record created by `main` (containing the caller of main), and `main` itself will be missing from the unwind as it will only exist in the LR. > ./perf record --call-graph fp ./a.out > ./perf report > > currently gives the following stack: > > 0xffffea52f361 > _start > __libc_start_main > main > f2 > > After this change, perf report correctly shows f1() calling f2(), > even though it was missing from the frame pointer unwind: > > ./perf report > > 0xffffea52f361 > _start > __libc_start_main > main > f1 > f2 > > Signed-off-by: Alexandre Truong > Signed-off-by: German Gomez > --- > tools/perf/util/Build | 1 + > .../util/arm64-frame-pointer-unwind-support.c | 63 +++++++++++++++++++ > .../util/arm64-frame-pointer-unwind-support.h | 10 +++ > tools/perf/util/machine.c | 19 ++++-- > tools/perf/util/machine.h | 1 + > 5 files changed, 89 insertions(+), 5 deletions(-) > create mode 100644 tools/perf/util/arm64-frame-pointer-unwind-support.c > create mode 100644 tools/perf/util/arm64-frame-pointer-unwind-support.h > > diff --git a/tools/perf/util/Build b/tools/perf/util/Build > index 2e5bfbb69960..03d4c647bd86 100644 > --- a/tools/perf/util/Build > +++ b/tools/perf/util/Build > @@ -1,3 +1,4 @@ > +perf-y += arm64-frame-pointer-unwind-support.o > perf-y += annotate.o > perf-y += block-info.o > perf-y += block-range.o > diff --git a/tools/perf/util/arm64-frame-pointer-unwind-support.c b/tools/perf/util/arm64-frame-pointer-unwind-support.c > new file mode 100644 > index 000000000000..4f5ecf51ed38 > --- /dev/null > +++ b/tools/perf/util/arm64-frame-pointer-unwind-support.c > @@ -0,0 +1,63 @@ > +// SPDX-License-Identifier: GPL-2.0 > +#include "arm64-frame-pointer-unwind-support.h" > +#include "callchain.h" > +#include "event.h" > +#include "perf_regs.h" // SMPL_REG_MASK > +#include "unwind.h" > + > +#define perf_event_arm_regs perf_event_arm64_regs > +#include "../arch/arm64/include/uapi/asm/perf_regs.h" > +#undef perf_event_arm_regs > + > +struct entries { > + u64 stack[2]; > + size_t length; > +}; > + > +static bool get_leaf_frame_caller_enabled(struct perf_sample *sample) > +{ > + return callchain_param.record_mode == CALLCHAIN_FP && sample->user_regs.regs > + && sample->user_regs.mask & SMPL_REG_MASK(PERF_REG_ARM64_LR); > +} > + > +static int add_entry(struct unwind_entry *entry, void *arg) > +{ > + struct entries *entries = arg; > + > + entries->stack[entries->length++] = entry->ip; > + return 0; > +} > + > +u64 get_leaf_frame_caller_aarch64(struct perf_sample *sample, struct thread *thread, int usr_idx) > +{ > + int ret; > + struct entries entries = {}; > + struct regs_dump old_regs = sample->user_regs; > + > + if (!get_leaf_frame_caller_enabled(sample)) > + return 0; > + > + /* > + * If PC and SP are not recorded, get the value of PC from the stack > + * and set its mask. SP is not used when doing the unwinding but it > + * still needs to be set to prevent failures. > + */ To prevent failures where? Is this something libunwind requires? Thanks, Mark. > + if (!(sample->user_regs.mask & SMPL_REG_MASK(PERF_REG_ARM64_PC))) { > + sample->user_regs.cache_mask |= SMPL_REG_MASK(PERF_REG_ARM64_PC); > + sample->user_regs.cache_regs[PERF_REG_ARM64_PC] = sample->callchain->ips[usr_idx+1]; > + } > + > + if (!(sample->user_regs.mask & SMPL_REG_MASK(PERF_REG_ARM64_SP))) { > + sample->user_regs.cache_mask |= SMPL_REG_MASK(PERF_REG_ARM64_SP); > + sample->user_regs.cache_regs[PERF_REG_ARM64_SP] = 0; > + } > + > + ret = unwind__get_entries(add_entry, &entries, thread, sample, 2); > + sample->user_regs = old_regs; > + > + if (ret || entries.length != 2) > + return ret; > + > + return callchain_param.order == ORDER_CALLER ? entries.stack[0] : entries.stack[1]; > +} > diff --git a/tools/perf/util/arm64-frame-pointer-unwind-support.h b/tools/perf/util/arm64-frame-pointer-unwind-support.h > new file mode 100644 > index 000000000000..32af9ce94398 > --- /dev/null > +++ b/tools/perf/util/arm64-frame-pointer-unwind-support.h > @@ -0,0 +1,10 @@ > +/* SPDX-License-Identifier: GPL-2.0 */ > +#ifndef __PERF_ARM_FRAME_POINTER_UNWIND_SUPPORT_H > +#define __PERF_ARM_FRAME_POINTER_UNWIND_SUPPORT_H > + > +#include "event.h" > +#include "thread.h" > + > +u64 get_leaf_frame_caller_aarch64(struct perf_sample *sample, struct thread *thread, int user_idx); > + > +#endif /* __PERF_ARM_FRAME_POINTER_UNWIND_SUPPORT_H */ > diff --git a/tools/perf/util/machine.c b/tools/perf/util/machine.c > index 3eddad009f78..a00fd6796b35 100644 > --- a/tools/perf/util/machine.c > +++ b/tools/perf/util/machine.c > @@ -34,6 +34,7 @@ > #include "bpf-event.h" > #include // page_size > #include "cgroup.h" > +#include "arm64-frame-pointer-unwind-support.h" > > #include > #include > @@ -2710,10 +2711,13 @@ static int find_prev_cpumode(struct ip_callchain *chain, struct thread *thread, > return err; > } > > -static u64 get_leaf_frame_caller(struct perf_sample *sample __maybe_unused, > - struct thread *thread __maybe_unused, int usr_idx __maybe_unused) > +static u64 get_leaf_frame_caller(struct perf_sample *sample, > + struct thread *thread, int usr_idx) > { > - return 0; > + if (machine__normalize_is(thread->maps->machine, "arm64")) > + return get_leaf_frame_caller_aarch64(sample, thread, usr_idx); > + else > + return 0; > } > > static int thread__resolve_callchain_sample(struct thread *thread, > @@ -3114,14 +3118,19 @@ int machine__set_current_tid(struct machine *machine, int cpu, pid_t pid, > } > > /* > - * Compares the raw arch string. N.B. see instead perf_env__arch() if a > - * normalized arch is needed. > + * Compares the raw arch string. N.B. see instead perf_env__arch() or > + * machine__normalize_is() if a normalized arch is needed. > */ > bool machine__is(struct machine *machine, const char *arch) > { > return machine && !strcmp(perf_env__raw_arch(machine->env), arch); > } > > +bool machine__normalize_is(struct machine *machine, const char *arch) > +{ > + return machine && !strcmp(perf_env__arch(machine->env), arch); > +} > + > int machine__nr_cpus_avail(struct machine *machine) > { > return machine ? perf_env__nr_cpus_avail(machine->env) : 0; > diff --git a/tools/perf/util/machine.h b/tools/perf/util/machine.h > index a143087eeb47..665535153411 100644 > --- a/tools/perf/util/machine.h > +++ b/tools/perf/util/machine.h > @@ -208,6 +208,7 @@ static inline bool machine__is_host(struct machine *machine) > } > > bool machine__is(struct machine *machine, const char *arch); > +bool machine__normalize_is(struct machine *machine, const char *arch); > int machine__nr_cpus_avail(struct machine *machine); > > struct thread *__machine__findnew_thread(struct machine *machine, pid_t pid, pid_t tid); > -- > 2.25.1 > _______________________________________________ linux-arm-kernel mailing list linux-arm-kernel@lists.infradead.org http://lists.infradead.org/mailman/listinfo/linux-arm-kernel