From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752255AbbFNIT5 (ORCPT ); Sun, 14 Jun 2015 04:19:57 -0400 Received: from mx1.redhat.com ([209.132.183.28]:34613 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751714AbbFNITx (ORCPT ); Sun, 14 Jun 2015 04:19:53 -0400 From: Jiri Olsa To: Arnaldo Carvalho de Melo Cc: lkml , Adrian Hunter , Andi Kleen , David Ahern , Ingo Molnar , Namhyung Kim , Peter Zijlstra , Stephane Eranian Subject: [PATCHv2 00/30] perf stat: Introduce --per-task option Date: Sun, 14 Jun 2015 10:19:15 +0200 Message-Id: <1434269985-521-1-git-send-email-jolsa@kernel.org> Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org hi, adding the possibility to display stat data per thread. Allowing following commands and output: $ perf stat -e cycles,instructions --per-task -p 25388,25442 -a ^C Performance counter stats for process id '25388,25442': cat-25388 100,122 cycles vim-25442 4,167,876 cycles cat-25388 20,080 instructions vim-25442 3,232,735 instructions 6.057130572 seconds time elapsed Also works under interval mode: $ perf stat -e cycles,instructions --per-task -p 25388,25442 -a -I 1000 # time task counts unit events 1.000190002 cat-25388 54,196 cycles 1.000190002 vim-25442 0 cycles 1.000190002 cat-25388 11,660 instructions 1.000190002 vim-25442 0 instructions 2.000742579 cat-25388 55,247 cycles 2.000742579 vim-25442 1,752,951 cycles 2.000742579 cat-25388 11,872 instructions 2.000742579 vim-25442 1,238,512 instructions 3.001057871 cat-25388 0 cycles 3.001057871 vim-25442 0 cycles 3.001057871 cat-25388 0 instructions 3.001057871 vim-25442 0 instructions ^C 3.242519732 cat-25388 0 cycles 3.242519732 vim-25442 0 cycles 3.242519732 cat-25388 0 instructions 3.242519732 vim-25442 0 instructions v2 changes: - renamed xyarray__zero to xyarray__reset - using atomic_t for cpu_map/thread_map refcnt - moving all stat functions into stat object - using init_stats to initialize 'struct stats*' - change TASK-$pid output into $comm-$pid - updated doc with --per-task option thanks for comments, jirka --- Jiri Olsa (30): perf tools: Introduce xyarray__reset function perf tools: Add thread_map__(alloc|realloc) helpers perf tools: Change thread_map::map into struct perf tools: Add comm string into struct thread_map perf tests: Add thread_map object tests perf tools: Add reference counting for cpu_map object perf tools: Add reference counting for thread_map object perf tools: Propagate cpu maps through the evlist perf tools: Propagate thread maps through the evlist perf tools: Make perf_evsel__(nr_)cpus generic perf tools: Move perf_evsel__(alloc|free|reset)_counts into stat object perf stat: Introduce perf_counts__(alloc|free|reset) functions perf stat: Introduce perf_counts function perf stat: Use xyarray for cpu evsel counts perf stat: Make stats work over the thread dimension perf stat: Rename struct perf_counts::cpu member to values perf stat: Move perf_evsel__(alloc|free|reset)_stat_priv into stat object perf stat: Move perf_evsel__(alloc|free)_prev_raw_counts into stat object perf stat: Move perf_evlist__(alloc|free)_stats into evlist object perf stat: Introduce perf_evsel__alloc_stats function perf stat: Introduce perf_evsel__read function perf stat: Introduce read_counters function perf stat: Separate counters reading and processing perf stat: Move zero_per_pkg into counter process code perf stat: Move perf_stat initialization counter process code perf stat: Remove perf_evsel__read_cb function perf stat: Rename print_interval to process_interval perf stat: Using init_stats instead of memset perf stat: Introduce print_counters function perf stat: Introduce --per-task option tools/perf/Documentation/perf-stat.txt | 3 + tools/perf/builtin-stat.c | 398 ++++++++++++++-------------- tools/perf/builtin-trace.c | 4 +- tools/perf/tests/Build | 1 + tools/perf/tests/builtin-test.c | 4 + tools/perf/tests/code-reading.c | 4 +- tools/perf/tests/keep-tracking.c | 4 +- tools/perf/tests/mmap-basic.c | 4 +- tools/perf/tests/mmap-thread-lookup.c | 2 +- tools/perf/tests/openat-syscall-all-cpus.c | 9 +- tools/perf/tests/openat-syscall-tp-fields.c | 2 +- tools/perf/tests/openat-syscall.c | 6 +- tools/perf/tests/switch-tracking.c | 4 +- tools/perf/tests/tests.h | 1 + tools/perf/tests/thread-map.c | 28 ++ tools/perf/util/auxtrace.c | 4 +- tools/perf/util/cpumap.c | 25 +- tools/perf/util/cpumap.h | 6 +- tools/perf/util/event.c | 6 +- tools/perf/util/evlist.c | 38 ++- tools/perf/util/evlist.h | 1 - tools/perf/util/evsel.c | 47 ++-- tools/perf/util/evsel.h | 43 ++- tools/perf/util/parse-events.c | 5 +- tools/perf/util/python-ext-sources | 2 + tools/perf/util/python.c | 4 +- tools/perf/util/record.c | 4 +- tools/perf/util/session.c | 2 +- tools/perf/util/stat.c | 134 ++++++++++ tools/perf/util/stat.h | 46 ++++ tools/perf/util/svghelper.c | 2 +- tools/perf/util/thread_map.c | 131 +++++++-- tools/perf/util/thread_map.h | 17 +- tools/perf/util/xyarray.c | 8 + tools/perf/util/xyarray.h | 2 + 35 files changed, 672 insertions(+), 329 deletions(-) create mode 100644 tools/perf/tests/thread-map.c