From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-3.8 required=3.0 tests=HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,SPF_HELO_NONE,SPF_PASS,USER_AGENT_GIT autolearn=no autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id 924E7C2D0C3 for ; Wed, 18 Dec 2019 07:08:59 +0000 (UTC) Received: from vger.kernel.org (vger.kernel.org [209.132.180.67]) by mail.kernel.org (Postfix) with ESMTP id 74E292465E for ; Wed, 18 Dec 2019 07:08:59 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726744AbfLRHI6 (ORCPT ); Wed, 18 Dec 2019 02:08:58 -0500 Received: from szxga05-in.huawei.com ([45.249.212.191]:7707 "EHLO huawei.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1726682AbfLRHI4 (ORCPT ); Wed, 18 Dec 2019 02:08:56 -0500 Received: from DGGEMS410-HUB.china.huawei.com (unknown [172.30.72.58]) by Forcepoint Email with ESMTP id 30DAD92510CB087CAE9B; Wed, 18 Dec 2019 15:08:51 +0800 (CST) Received: from huawei.com (10.175.102.38) by DGGEMS410-HUB.china.huawei.com (10.3.19.210) with Microsoft SMTP Server id 14.3.439.0; Wed, 18 Dec 2019 15:08:44 +0800 From: Tan Xiaojun To: , , , , , , , , , , , , , , , CC: , , , , , , , Subject: [PATCH 0/6] perf tools: Add support for some spe events and precise ip Date: Wed, 18 Dec 2019 15:54:49 +0800 Message-ID: <20191218075455.5106-1-tanxiaojun@huawei.com> X-Mailer: git-send-email 2.17.1 MIME-Version: 1.0 Content-Type: text/plain X-Originating-IP: [10.175.102.38] X-CFilter-Loop: Reflected Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org After the commit ffd3d18c20b8 ("perf tools: Add ARM Statistical Profiling Extensions (SPE) support") is merged, "perf record" and "perf report --dump-raw-trace" have been supported. However, the raw data that is dumped cannot be used without parsing. This patchset is to improve the "perf report" support for spe, and further process the data. Currently, support for the three events of llc-miss, tlb-miss, branch-miss and remote-access is added. The macro definition was modified under Jeremy's suggestion, and the "event:pp" approach was used under James' suggestion to achieve the precise ip of some events on arm64. Currently, only branch-misses are implemented, and other event support will be added later. In addition, we also found that when recording large multi-threaded programs, ctrl + c could not end recording, so it was fixed and two patches were added. Tan Xiaojun (4): perf tools: Move arm-spe-pkt-decoder.h/c to the new dir perf tools: Add support for "report" for some spe events perf report: Add --spe options for arm-spe perf tools: Support "branch-misses:pp" on arm64 Wei Li (2): perf tools: add perf_evlist__terminate() for terminate perf tools: arm-spe: fix record hang after being terminated tools/perf/Documentation/perf-report.txt | 10 + tools/perf/arch/arm64/util/arm-spe.c | 10 +- tools/perf/builtin-record.c | 1 + tools/perf/builtin-report.c | 5 + tools/perf/util/Build | 2 +- tools/perf/util/arm-spe-decoder/Build | 1 + .../util/arm-spe-decoder/arm-spe-decoder.c | 225 +++++ .../util/arm-spe-decoder/arm-spe-decoder.h | 66 ++ .../arm-spe-pkt-decoder.c | 0 .../arm-spe-pkt-decoder.h | 2 + tools/perf/util/arm-spe.c | 788 +++++++++++++++++- tools/perf/util/arm-spe.h | 3 + tools/perf/util/auxtrace.c | 49 ++ tools/perf/util/auxtrace.h | 29 + tools/perf/util/evlist.c | 16 + tools/perf/util/evlist.h | 1 + tools/perf/util/evsel.h | 1 + tools/perf/util/session.h | 2 + 18 files changed, 1169 insertions(+), 42 deletions(-) create mode 100644 tools/perf/util/arm-spe-decoder/Build create mode 100644 tools/perf/util/arm-spe-decoder/arm-spe-decoder.c create mode 100644 tools/perf/util/arm-spe-decoder/arm-spe-decoder.h rename tools/perf/util/{ => arm-spe-decoder}/arm-spe-pkt-decoder.c (100%) rename tools/perf/util/{ => arm-spe-decoder}/arm-spe-pkt-decoder.h (96%) -- 2.17.1