From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1760079Ab3BLOKe (ORCPT ); Tue, 12 Feb 2013 09:10:34 -0500 Received: from mail-we0-f182.google.com ([74.125.82.182]:59283 "EHLO mail-we0-f182.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1759917Ab3BLOKc (ORCPT ); Tue, 12 Feb 2013 09:10:32 -0500 From: Stephane Eranian To: linux-kernel@vger.kernel.org Cc: peterz@infradead.org, mingo@elte.hu, ak@linux.intel.com, acme@redhat.com, jolsa@redhat.com, namhyung.kim@lge.com Subject: [PATCH 0/2] perf stat: add per-core count aggregation Date: Tue, 12 Feb 2013 15:09:26 +0100 Message-Id: <1360678168-6974-1-git-send-email-eranian@google.com> X-Mailer: git-send-email 1.7.9.5 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org This patch series contains improvement to the aggregation support in perf stat. First, the aggregation code is refactored and a aggr_mode enum is defined. There is also an important bug fix for the existing per-socket aggregation. Second, the patch adds a new --aggr-core option to perf stat. It aggregates counts per physical core and becomes useful on systems with hyper-threading. The cores are presented per socket: S0-C1, means socket 0 core 1. Note that the core number represents its physical core id. As such, numbers may not always be contiguous. All of this is based on topology information available in sysfs. Per-core aggregation can be combined with interval printing: # perf stat -a --aggr-core -I 1000 -e cycles sleep 100 # time core cpus counts events 1.000101160 S0-C0 2 6,051,254,899 cycles 1.000101160 S0-C1 2 6,379,230,776 cycles 1.000101160 S0-C2 2 6,480,268,471 cycles 1.000101160 S0-C3 2 6,110,514,321 cycles 2.000663750 S0-C0 2 6,572,533,016 cycles 2.000663750 S0-C1 2 6,378,623,674 cycles 2.000663750 S0-C2 2 6,264,127,589 cycles 2.000663750 S0-C3 2 6,305,346,613 cycles For instance here on this SNB machine, we can see that the load is evenly balanced across all 4 physical core (HT is on). Signed-off-by: Stephane Eranian - Stephane Eranian (2): perf stat: refactor aggregation code perf stat: add per-core aggregation tools/perf/Documentation/perf-stat.txt | 6 + tools/perf/builtin-stat.c | 237 ++++++++++++++++++++------------ tools/perf/util/cpumap.c | 86 ++++++++++-- tools/perf/util/cpumap.h | 12 ++ 4 files changed, 239 insertions(+), 102 deletions(-) -- 1.7.9.5