* [GIT PULL 00/16] perf/core improvements and fixes @ 2014-03-14 21:29 Arnaldo Carvalho de Melo 2014-03-14 21:29 ` [PATCH 01/16] perf bench: Add futex-hash microbenchmark Arnaldo Carvalho de Melo ` (16 more replies) 0 siblings, 17 replies; 56+ messages in thread From: Arnaldo Carvalho de Melo @ 2014-03-14 21:29 UTC (permalink / raw) To: Ingo Molnar Cc: linux-kernel, Arnaldo Carvalho de Melo, Adrian Hunter, Andi Kleen, Aswin Chandramouleeswaran, Corey Ashford, Darren Hart, David Ahern, Davidlohr Bueso, Don Zickus, Frederic Weisbecker, Jason Low, Jiri Olsa, Joe Mario, Masami Hiramatsu, Mike Galbraith, Namhyung Kim, Paul Mackerras, Pekka Enberg, Peter Zijlstra, Ramkumar Ramachandra, Scott J Norton, Stephane Eranian, Thomas Gleixner, Waiman Long, Arnaldo Carvalho de Melo From: Arnaldo Carvalho de Melo <acme@ghostprotocols.net> Hi Ingo, Please consider pulling, - Arnaldo The following changes since commit 81827ed8d85e892311965dc9ec4120b2b2e745bd: perf/x86/uncore: Fix missing end markers for SNB/IVB/HSW IMC PMU (2014-03-14 09:25:25 +0100) are available in the git repository at: git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux tags/perf-core-for-mingo for you to fetch changes up to d75e6097ef1f7669deb500fbbdf53cfe524f1b53: perf machine: Factor machine__find_thread to take tid argument (2014-03-14 18:08:42 -0300) ---------------------------------------------------------------- perf/core improvements and fixes: User visible: . Add several futex 'perf bench' microbenchmarks (Davidlohr Bueso) . Speed up thread map generation (Don Zickus) . Fix synthesizing mmaps for threads (Don Zickus) . Fix invalid output on event group stdio report (Namhyung Kim) . Introduce 'perf kvm --list-cmds' command line option for use by scripts (Ramkumar Ramachandra) Documentation: . Clarify load-latency information in the 'perf mem' docs (Andi Kleen) . Clarify x86 register naming in 'perf probe' docs (Andi Kleen) Refactorigns: . hists browser refactorings to reuse code accross UIs (Namhyung Kim) Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com> ---------------------------------------------------------------- Andi Kleen (2): perf mem: Clarify load-latency in documentation perf probe: Clarify x86 register naming for perf probe Arnaldo Carvalho de Melo (1): perf symbols: Introduce thread__find_cpumode_addr_location Davidlohr Bueso (3): perf bench: Add futex-hash microbenchmark perf bench: Add futex-wake microbenchmark perf bench: Add futex-requeue microbenchmark Don Zickus (3): perf tools: Fix synthesizing mmaps for threads perf session: Change header.misc dump from decimal to hex perf tools: Speed up thread map generation Jiri Olsa (1): perf machine: Factor machine__find_thread to take tid argument Namhyung Kim (5): perf ui/stdio: Fix invalid output on event group report perf ui/gtk: Reuse generic __hpp__fmt() code perf ui/hists: Pass struct hpp to print functions perf ui/tui: Reuse generic __hpp__fmt() code perf ui hists: Pass evsel to hpp->header/width functions explicitly Ramkumar Ramachandra (1): perf kvm: introduce --list-cmds for use by scripts tools/perf/Documentation/perf-mem.txt | 4 + tools/perf/Documentation/perf-probe.txt | 2 + tools/perf/Makefile.perf | 3 + tools/perf/bench/bench.h | 3 + tools/perf/bench/futex-hash.c | 212 ++++++++++++++++++++++++++++++++ tools/perf/bench/futex-requeue.c | 211 +++++++++++++++++++++++++++++++ tools/perf/bench/futex-wake.c | 201 ++++++++++++++++++++++++++++++ tools/perf/bench/futex.h | 71 +++++++++++ tools/perf/builtin-bench.c | 12 +- tools/perf/builtin-diff.c | 7 +- tools/perf/builtin-kvm.c | 12 +- tools/perf/perf-completion.sh | 2 +- tools/perf/perf.h | 6 + tools/perf/tests/dwarf-unwind.c | 2 +- tools/perf/ui/browsers/hists.c | 122 ++++++------------ tools/perf/ui/gtk/hists.c | 78 ++---------- tools/perf/ui/hist.c | 138 +++++++++++++++------ tools/perf/ui/stdio/hist.c | 11 +- tools/perf/util/event.c | 156 ++++++++++++++--------- tools/perf/util/hist.h | 20 ++- tools/perf/util/machine.c | 46 +++---- tools/perf/util/machine.h | 3 +- tools/perf/util/parse-options.c | 37 +++++- tools/perf/util/parse-options.h | 8 +- tools/perf/util/session.c | 2 +- tools/perf/util/thread.c | 21 ++++ tools/perf/util/thread.h | 5 + 27 files changed, 1094 insertions(+), 301 deletions(-) create mode 100644 tools/perf/bench/futex-hash.c create mode 100644 tools/perf/bench/futex-requeue.c create mode 100644 tools/perf/bench/futex-wake.c create mode 100644 tools/perf/bench/futex.h ^ permalink raw reply [flat|nested] 56+ messages in thread
* [PATCH 01/16] perf bench: Add futex-hash microbenchmark 2014-03-14 21:29 [GIT PULL 00/16] perf/core improvements and fixes Arnaldo Carvalho de Melo @ 2014-03-14 21:29 ` Arnaldo Carvalho de Melo 2014-03-14 21:29 ` [PATCH 02/16] perf bench: Add futex-wake microbenchmark Arnaldo Carvalho de Melo ` (15 subsequent siblings) 16 siblings, 0 replies; 56+ messages in thread From: Arnaldo Carvalho de Melo @ 2014-03-14 21:29 UTC (permalink / raw) To: Ingo Molnar Cc: linux-kernel, Davidlohr Bueso, Aswin Chandramouleeswaran, Darren Hart, Ingo Molnar, Jason Low, Peter Zijlstra, Scott J Norton, Thomas Gleixner, Waiman Long, Arnaldo Carvalho de Melo From: Davidlohr Bueso <davidlohr@hp.com> Introduce futexes to perf-bench and add a program that stresses and measures the kernel's implementation of the hash table. This is a multi-threaded program that simply measures the amount of failed futex wait calls - we only want to deal with the hashing overhead, so a negative return of futex_wait_setup() is enough to do the trick. An example run: $ perf bench futex hash -t 32 Run summary [PID 10989]: 32 threads, each operating on 1024 [private] futexes for 10 secs. [thread 0] futexes: 0x19d9b10 ... 0x19dab0c [ 418713 ops/sec ] [thread 1] futexes: 0x19daca0 ... 0x19dbc9c [ 469913 ops/sec ] [thread 2] futexes: 0x19dbe30 ... 0x19dce2c [ 479744 ops/sec ] ... [thread 31] futexes: 0x19fbb80 ... 0x19fcb7c [ 464179 ops/sec ] Averaged 454310 operations/sec (+- 0.84%), total secs = 10 Signed-off-by: Davidlohr Bueso <davidlohr@hp.com> Acked-by: Darren Hart <dvhart@linux.intel.com> Cc: Aswin Chandramouleeswaran <aswin@hp.com> Cc: Darren Hart <dvhart@linux.intel.com> Cc: Ingo Molnar <mingo@redhat.com> Cc: Jason Low <jason.low2@hp.com> Cc: Peter Zijlstra <peterz@infradead.org> Cc: Scott J Norton <scott.norton@hp.com> Cc: Thomas Gleixner <tglx@linutronix.de> Cc: Waiman Long <Waiman.Long@hp.com> Link: http://lkml.kernel.org/r/1387081917-9102-2-git-send-email-davidlohr@hp.com Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com> --- tools/perf/Makefile.perf | 1 + tools/perf/bench/bench.h | 1 + tools/perf/bench/futex-hash.c | 212 ++++++++++++++++++++++++++++++++++++++++++ tools/perf/bench/futex.h | 48 ++++++++++ tools/perf/builtin-bench.c | 10 +- tools/perf/perf.h | 6 ++ 6 files changed, 277 insertions(+), 1 deletion(-) create mode 100644 tools/perf/bench/futex-hash.c create mode 100644 tools/perf/bench/futex.h diff --git a/tools/perf/Makefile.perf b/tools/perf/Makefile.perf index 1f7ec48ac959..54ae547b4bc4 100644 --- a/tools/perf/Makefile.perf +++ b/tools/perf/Makefile.perf @@ -426,6 +426,7 @@ BUILTIN_OBJS += $(OUTPUT)bench/mem-memset-x86-64-asm.o endif BUILTIN_OBJS += $(OUTPUT)bench/mem-memcpy.o BUILTIN_OBJS += $(OUTPUT)bench/mem-memset.o +BUILTIN_OBJS += $(OUTPUT)bench/futex-hash.o BUILTIN_OBJS += $(OUTPUT)builtin-diff.o BUILTIN_OBJS += $(OUTPUT)builtin-evlist.o diff --git a/tools/perf/bench/bench.h b/tools/perf/bench/bench.h index 0fdc85269c4d..34edb5c34db3 100644 --- a/tools/perf/bench/bench.h +++ b/tools/perf/bench/bench.h @@ -31,6 +31,7 @@ extern int bench_sched_pipe(int argc, const char **argv, const char *prefix); extern int bench_mem_memcpy(int argc, const char **argv, const char *prefix __maybe_unused); extern int bench_mem_memset(int argc, const char **argv, const char *prefix); +extern int bench_futex_hash(int argc, const char **argv, const char *prefix); #define BENCH_FORMAT_DEFAULT_STR "default" #define BENCH_FORMAT_DEFAULT 0 diff --git a/tools/perf/bench/futex-hash.c b/tools/perf/bench/futex-hash.c new file mode 100644 index 000000000000..a84206e9c4aa --- /dev/null +++ b/tools/perf/bench/futex-hash.c @@ -0,0 +1,212 @@ +/* + * Copyright (C) 2013 Davidlohr Bueso <davidlohr@hp.com> + * + * futex-hash: Stress the hell out of the Linux kernel futex uaddr hashing. + * + * This program is particularly useful for measuring the kernel's futex hash + * table/function implementation. In order for it to make sense, use with as + * many threads and futexes as possible. + */ + +#include "../perf.h" +#include "../util/util.h" +#include "../util/stat.h" +#include "../util/parse-options.h" +#include "../util/header.h" +#include "bench.h" +#include "futex.h" + +#include <err.h> +#include <stdlib.h> +#include <sys/time.h> +#include <pthread.h> + +static unsigned int nthreads = 0; +static unsigned int nsecs = 10; +/* amount of futexes per thread */ +static unsigned int nfutexes = 1024; +static bool fshared = false, done = false, silent = false; + +struct timeval start, end, runtime; +static pthread_mutex_t thread_lock; +static unsigned int threads_starting; +static struct stats throughput_stats; +static pthread_cond_t thread_parent, thread_worker; + +struct worker { + int tid; + u_int32_t *futex; + pthread_t thread; + unsigned long ops; +}; + +static const struct option options[] = { + OPT_UINTEGER('t', "threads", &nthreads, "Specify amount of threads"), + OPT_UINTEGER('r', "runtime", &nsecs, "Specify runtime (in seconds)"), + OPT_UINTEGER('f', "futexes", &nfutexes, "Specify amount of futexes per threads"), + OPT_BOOLEAN( 's', "silent", &silent, "Silent mode: do not display data/details"), + OPT_BOOLEAN( 'S', "shared", &fshared, "Use shared futexes instead of private ones"), + OPT_END() +}; + +static const char * const bench_futex_hash_usage[] = { + "perf bench futex hash <options>", + NULL +}; + +static void *workerfn(void *arg) +{ + int ret; + unsigned int i; + struct worker *w = (struct worker *) arg; + + pthread_mutex_lock(&thread_lock); + threads_starting--; + if (!threads_starting) + pthread_cond_signal(&thread_parent); + pthread_cond_wait(&thread_worker, &thread_lock); + pthread_mutex_unlock(&thread_lock); + + do { + for (i = 0; i < nfutexes; i++, w->ops++) { + /* + * We want the futex calls to fail in order to stress + * the hashing of uaddr and not measure other steps, + * such as internal waitqueue handling, thus enlarging + * the critical region protected by hb->lock. + */ + ret = futex_wait(&w->futex[i], 1234, NULL, + fshared ? 0 : FUTEX_PRIVATE_FLAG); + if (!silent && + (!ret || errno != EAGAIN || errno != EWOULDBLOCK)) + warn("Non-expected futex return call"); + } + } while (!done); + + return NULL; +} + +static void toggle_done(int sig __maybe_unused, + siginfo_t *info __maybe_unused, + void *uc __maybe_unused) +{ + /* inform all threads that we're done for the day */ + done = true; + gettimeofday(&end, NULL); + timersub(&end, &start, &runtime); +} + +static void print_summary(void) +{ + unsigned long avg = avg_stats(&throughput_stats); + double stddev = stddev_stats(&throughput_stats); + + printf("%sAveraged %ld operations/sec (+- %.2f%%), total secs = %d\n", + !silent ? "\n" : "", avg, rel_stddev_stats(stddev, avg), + (int) runtime.tv_sec); +} + +int bench_futex_hash(int argc, const char **argv, + const char *prefix __maybe_unused) +{ + int ret = 0; + cpu_set_t cpu; + struct sigaction act; + unsigned int i, ncpus; + pthread_attr_t thread_attr; + struct worker *worker = NULL; + + argc = parse_options(argc, argv, options, bench_futex_hash_usage, 0); + if (argc) { + usage_with_options(bench_futex_hash_usage, options); + exit(EXIT_FAILURE); + } + + ncpus = sysconf(_SC_NPROCESSORS_ONLN); + + sigfillset(&act.sa_mask); + act.sa_sigaction = toggle_done; + sigaction(SIGINT, &act, NULL); + + if (!nthreads) /* default to the number of CPUs */ + nthreads = ncpus; + + worker = calloc(nthreads, sizeof(*worker)); + if (!worker) + goto errmem; + + printf("Run summary [PID %d]: %d threads, each operating on %d [%s] futexes for %d secs.\n\n", + getpid(), nthreads, nfutexes, fshared ? "shared":"private", nsecs); + + init_stats(&throughput_stats); + pthread_mutex_init(&thread_lock, NULL); + pthread_cond_init(&thread_parent, NULL); + pthread_cond_init(&thread_worker, NULL); + + threads_starting = nthreads; + pthread_attr_init(&thread_attr); + gettimeofday(&start, NULL); + for (i = 0; i < nthreads; i++) { + worker[i].tid = i; + worker[i].futex = calloc(nfutexes, sizeof(*worker[i].futex)); + if (!worker[i].futex) + goto errmem; + + CPU_ZERO(&cpu); + CPU_SET(i % ncpus, &cpu); + + ret = pthread_attr_setaffinity_np(&thread_attr, sizeof(cpu_set_t), &cpu); + if (ret) + err(EXIT_FAILURE, "pthread_attr_setaffinity_np"); + + ret = pthread_create(&worker[i].thread, &thread_attr, workerfn, + (void *)(struct worker *) &worker[i]); + if (ret) + err(EXIT_FAILURE, "pthread_create"); + + } + pthread_attr_destroy(&thread_attr); + + pthread_mutex_lock(&thread_lock); + while (threads_starting) + pthread_cond_wait(&thread_parent, &thread_lock); + pthread_cond_broadcast(&thread_worker); + pthread_mutex_unlock(&thread_lock); + + sleep(nsecs); + toggle_done(0, NULL, NULL); + + for (i = 0; i < nthreads; i++) { + ret = pthread_join(worker[i].thread, NULL); + if (ret) + err(EXIT_FAILURE, "pthread_join"); + } + + /* cleanup & report results */ + pthread_cond_destroy(&thread_parent); + pthread_cond_destroy(&thread_worker); + pthread_mutex_destroy(&thread_lock); + + for (i = 0; i < nthreads; i++) { + unsigned long t = worker[i].ops/runtime.tv_sec; + update_stats(&throughput_stats, t); + if (!silent) { + if (nfutexes == 1) + printf("[thread %2d] futex: %p [ %ld ops/sec ]\n", + worker[i].tid, &worker[i].futex[0], t); + else + printf("[thread %2d] futexes: %p ... %p [ %ld ops/sec ]\n", + worker[i].tid, &worker[i].futex[0], + &worker[i].futex[nfutexes-1], t); + } + + free(worker[i].futex); + } + + print_summary(); + + free(worker); + return ret; +errmem: + err(EXIT_FAILURE, "calloc"); +} diff --git a/tools/perf/bench/futex.h b/tools/perf/bench/futex.h new file mode 100644 index 000000000000..7d0bda543e3d --- /dev/null +++ b/tools/perf/bench/futex.h @@ -0,0 +1,48 @@ +/* + * Glibc independent futex library for testing kernel functionality. + * Shamelessly stolen from Darren Hart <dvhltc@us.ibm.com> + * http://git.kernel.org/cgit/linux/kernel/git/dvhart/futextest.git/ + */ + +#ifndef _FUTEX_H +#define _FUTEX_H + +#include <unistd.h> +#include <sys/syscall.h> +#include <sys/types.h> +#include <linux/futex.h> + +/** + * futex() - SYS_futex syscall wrapper + * @uaddr: address of first futex + * @op: futex op code + * @val: typically expected value of uaddr, but varies by op + * @timeout: typically an absolute struct timespec (except where noted + * otherwise). Overloaded by some ops + * @uaddr2: address of second futex for some ops\ + * @val3: varies by op + * @opflags: flags to be bitwise OR'd with op, such as FUTEX_PRIVATE_FLAG + * + * futex() is used by all the following futex op wrappers. It can also be + * used for misuse and abuse testing. Generally, the specific op wrappers + * should be used instead. It is a macro instead of an static inline function as + * some of the types over overloaded (timeout is used for nr_requeue for + * example). + * + * These argument descriptions are the defaults for all + * like-named arguments in the following wrappers except where noted below. + */ +#define futex(uaddr, op, val, timeout, uaddr2, val3, opflags) \ + syscall(SYS_futex, uaddr, op | opflags, val, timeout, uaddr2, val3) + +/** + * futex_wait() - block on uaddr with optional timeout + * @timeout: relative timeout + */ +static inline int +futex_wait(u_int32_t *uaddr, u_int32_t val, struct timespec *timeout, int opflags) +{ + return futex(uaddr, FUTEX_WAIT, val, timeout, NULL, 0, opflags); +} + +#endif /* _FUTEX_H */ diff --git a/tools/perf/builtin-bench.c b/tools/perf/builtin-bench.c index e47f90cc7b98..a8b0138fc6a0 100644 --- a/tools/perf/builtin-bench.c +++ b/tools/perf/builtin-bench.c @@ -12,6 +12,7 @@ * sched ... scheduler and IPC performance * mem ... memory access performance * numa ... NUMA scheduling and MM performance + * futex ... Futex performance */ #include "perf.h" #include "util/util.h" @@ -54,6 +55,12 @@ static struct bench mem_benchmarks[] = { { NULL, NULL, NULL } }; +static struct bench futex_benchmarks[] = { + { "hash", "Benchmark for futex hash table", bench_futex_hash }, + { "all", "Test all futex benchmarks", NULL }, + { NULL, NULL, NULL } +}; + struct collection { const char *name; const char *summary; @@ -61,11 +68,12 @@ struct collection { }; static struct collection collections[] = { - { "sched", "Scheduler and IPC benchmarks", sched_benchmarks }, + { "sched", "Scheduler and IPC benchmarks", sched_benchmarks }, { "mem", "Memory access benchmarks", mem_benchmarks }, #ifdef HAVE_LIBNUMA_SUPPORT { "numa", "NUMA scheduling and MM benchmarks", numa_benchmarks }, #endif + {"futex", "Futex stressing benchmarks", futex_benchmarks }, { "all", "All benchmarks", NULL }, { NULL, NULL, NULL } }; diff --git a/tools/perf/perf.h b/tools/perf/perf.h index 6898ad0e199f..e18a8b5e6953 100644 --- a/tools/perf/perf.h +++ b/tools/perf/perf.h @@ -12,6 +12,9 @@ #ifndef __NR_perf_event_open # define __NR_perf_event_open 336 #endif +#ifndef __NR_futex +# define __NR_futex 240 +#endif #endif #if defined(__x86_64__) @@ -23,6 +26,9 @@ #ifndef __NR_perf_event_open # define __NR_perf_event_open 298 #endif +#ifndef __NR_futex +# define __NR_futex 202 +#endif #endif #ifdef __powerpc__ -- 1.8.1.4 ^ permalink raw reply related [flat|nested] 56+ messages in thread
* [PATCH 02/16] perf bench: Add futex-wake microbenchmark 2014-03-14 21:29 [GIT PULL 00/16] perf/core improvements and fixes Arnaldo Carvalho de Melo 2014-03-14 21:29 ` [PATCH 01/16] perf bench: Add futex-hash microbenchmark Arnaldo Carvalho de Melo @ 2014-03-14 21:29 ` Arnaldo Carvalho de Melo 2014-03-14 21:29 ` [PATCH 03/16] perf bench: Add futex-requeue microbenchmark Arnaldo Carvalho de Melo ` (14 subsequent siblings) 16 siblings, 0 replies; 56+ messages in thread From: Arnaldo Carvalho de Melo @ 2014-03-14 21:29 UTC (permalink / raw) To: Ingo Molnar Cc: linux-kernel, Davidlohr Bueso, Aswin Chandramouleeswaran, Darren Hart, Ingo Molnar, Jason Low, Peter Zijlstra, Scott J Norton, Thomas Gleixner, Waiman Long, Arnaldo Carvalho de Melo From: Davidlohr Bueso <davidlohr@hp.com> Block a bunch of threads on a futex and wake them up, N at a time. This program is particularly useful to measure the latency of nthread wakeups in non-error situations: all waiters are queued and all wake calls wakeup one or more tasks. An example run: $ perf bench futex wake -t 512 -r 100 Run summary [PID 27823]: blocking on 512 threads (at futex 0x7e10d4), waking up 1 at a time. [Run 1]: Wokeup 512 of 512 threads in 6.0080 ms [Run 2]: Wokeup 512 of 512 threads in 5.2280 ms [Run 3]: Wokeup 512 of 512 threads in 4.8300 ms ... [Run 100]: Wokeup 512 of 512 threads in 5.0100 ms Wokeup 512 of 512 threads in 5.0109 ms (+-2.25%) Signed-off-by: Davidlohr Bueso <davidlohr@hp.com> Acked-by: Darren Hart <dvhart@linux.intel.com> Cc: Aswin Chandramouleeswaran <aswin@hp.com> Cc: Darren Hart <dvhart@linux.intel.com> Cc: Ingo Molnar <mingo@redhat.com> Cc: Jason Low <jason.low2@hp.com> Cc: Peter Zijlstra <peterz@infradead.org> Cc: Scott J Norton <scott.norton@hp.com> Cc: Thomas Gleixner <tglx@linutronix.de> Cc: Waiman Long <Waiman.Long@hp.com> Link: http://lkml.kernel.org/r/1387081917-9102-3-git-send-email-davidlohr@hp.com Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com> --- tools/perf/Makefile.perf | 1 + tools/perf/bench/bench.h | 1 + tools/perf/bench/futex-wake.c | 201 ++++++++++++++++++++++++++++++++++++++++++ tools/perf/bench/futex.h | 10 +++ tools/perf/builtin-bench.c | 1 + 5 files changed, 214 insertions(+) create mode 100644 tools/perf/bench/futex-wake.c diff --git a/tools/perf/Makefile.perf b/tools/perf/Makefile.perf index 54ae547b4bc4..6fa5d8b74635 100644 --- a/tools/perf/Makefile.perf +++ b/tools/perf/Makefile.perf @@ -427,6 +427,7 @@ endif BUILTIN_OBJS += $(OUTPUT)bench/mem-memcpy.o BUILTIN_OBJS += $(OUTPUT)bench/mem-memset.o BUILTIN_OBJS += $(OUTPUT)bench/futex-hash.o +BUILTIN_OBJS += $(OUTPUT)bench/futex-wake.o BUILTIN_OBJS += $(OUTPUT)builtin-diff.o BUILTIN_OBJS += $(OUTPUT)builtin-evlist.o diff --git a/tools/perf/bench/bench.h b/tools/perf/bench/bench.h index 34edb5c34db3..6ac3f1d083cc 100644 --- a/tools/perf/bench/bench.h +++ b/tools/perf/bench/bench.h @@ -32,6 +32,7 @@ extern int bench_mem_memcpy(int argc, const char **argv, const char *prefix __maybe_unused); extern int bench_mem_memset(int argc, const char **argv, const char *prefix); extern int bench_futex_hash(int argc, const char **argv, const char *prefix); +extern int bench_futex_wake(int argc, const char **argv, const char *prefix); #define BENCH_FORMAT_DEFAULT_STR "default" #define BENCH_FORMAT_DEFAULT 0 diff --git a/tools/perf/bench/futex-wake.c b/tools/perf/bench/futex-wake.c new file mode 100644 index 000000000000..d096169b161e --- /dev/null +++ b/tools/perf/bench/futex-wake.c @@ -0,0 +1,201 @@ +/* + * Copyright (C) 2013 Davidlohr Bueso <davidlohr@hp.com> + * + * futex-wake: Block a bunch of threads on a futex and wake'em up, N at a time. + * + * This program is particularly useful to measure the latency of nthread wakeups + * in non-error situations: all waiters are queued and all wake calls wakeup + * one or more tasks, and thus the waitqueue is never empty. + */ + +#include "../perf.h" +#include "../util/util.h" +#include "../util/stat.h" +#include "../util/parse-options.h" +#include "../util/header.h" +#include "bench.h" +#include "futex.h" + +#include <err.h> +#include <stdlib.h> +#include <sys/time.h> +#include <pthread.h> + +/* all threads will block on the same futex */ +static u_int32_t futex1 = 0; + +/* + * How many wakeups to do at a time. + * Default to 1 in order to make the kernel work more. + */ +static unsigned int nwakes = 1; + +/* + * There can be significant variance from run to run, + * the more repeats, the more exact the overall avg and + * the better idea of the futex latency. + */ +static unsigned int repeat = 10; + +pthread_t *worker; +static bool done = 0, silent = 0; +static pthread_mutex_t thread_lock; +static pthread_cond_t thread_parent, thread_worker; +static struct stats waketime_stats, wakeup_stats; +static unsigned int ncpus, threads_starting, nthreads = 0; + +static const struct option options[] = { + OPT_UINTEGER('t', "threads", &nthreads, "Specify amount of threads"), + OPT_UINTEGER('w', "nwakes", &nwakes, "Specify amount of threads to wake at once"), + OPT_UINTEGER('r', "repeat", &repeat, "Specify amount of times to repeat the run"), + OPT_BOOLEAN( 's', "silent", &silent, "Silent mode: do not display data/details"), + OPT_END() +}; + +static const char * const bench_futex_wake_usage[] = { + "perf bench futex wake <options>", + NULL +}; + +static void *workerfn(void *arg __maybe_unused) +{ + pthread_mutex_lock(&thread_lock); + threads_starting--; + if (!threads_starting) + pthread_cond_signal(&thread_parent); + pthread_cond_wait(&thread_worker, &thread_lock); + pthread_mutex_unlock(&thread_lock); + + futex_wait(&futex1, 0, NULL, FUTEX_PRIVATE_FLAG); + return NULL; +} + +static void print_summary(void) +{ + double waketime_avg = avg_stats(&waketime_stats); + double waketime_stddev = stddev_stats(&waketime_stats); + unsigned int wakeup_avg = avg_stats(&wakeup_stats); + + printf("Wokeup %d of %d threads in %.4f ms (+-%.2f%%)\n", + wakeup_avg, + nthreads, + waketime_avg/1e3, + rel_stddev_stats(waketime_stddev, waketime_avg)); +} + +static void block_threads(pthread_t *w, + pthread_attr_t thread_attr) +{ + cpu_set_t cpu; + unsigned int i; + + threads_starting = nthreads; + + /* create and block all threads */ + for (i = 0; i < nthreads; i++) { + CPU_ZERO(&cpu); + CPU_SET(i % ncpus, &cpu); + + if (pthread_attr_setaffinity_np(&thread_attr, sizeof(cpu_set_t), &cpu)) + err(EXIT_FAILURE, "pthread_attr_setaffinity_np"); + + if (pthread_create(&w[i], &thread_attr, workerfn, NULL)) + err(EXIT_FAILURE, "pthread_create"); + } +} + +static void toggle_done(int sig __maybe_unused, + siginfo_t *info __maybe_unused, + void *uc __maybe_unused) +{ + done = true; +} + +int bench_futex_wake(int argc, const char **argv, + const char *prefix __maybe_unused) +{ + int ret = 0; + unsigned int i, j; + struct sigaction act; + pthread_attr_t thread_attr; + + argc = parse_options(argc, argv, options, bench_futex_wake_usage, 0); + if (argc) { + usage_with_options(bench_futex_wake_usage, options); + exit(EXIT_FAILURE); + } + + ncpus = sysconf(_SC_NPROCESSORS_ONLN); + + sigfillset(&act.sa_mask); + act.sa_sigaction = toggle_done; + sigaction(SIGINT, &act, NULL); + + if (!nthreads) + nthreads = ncpus; + + worker = calloc(nthreads, sizeof(*worker)); + if (!worker) + err(EXIT_FAILURE, "calloc"); + + printf("Run summary [PID %d]: blocking on %d threads (at futex %p), " + "waking up %d at a time.\n\n", + getpid(), nthreads, &futex1, nwakes); + + init_stats(&wakeup_stats); + init_stats(&waketime_stats); + pthread_attr_init(&thread_attr); + pthread_mutex_init(&thread_lock, NULL); + pthread_cond_init(&thread_parent, NULL); + pthread_cond_init(&thread_worker, NULL); + + for (j = 0; j < repeat && !done; j++) { + unsigned int nwoken = 0; + struct timeval start, end, runtime; + + /* create, launch & block all threads */ + block_threads(worker, thread_attr); + + /* make sure all threads are already blocked */ + pthread_mutex_lock(&thread_lock); + while (threads_starting) + pthread_cond_wait(&thread_parent, &thread_lock); + pthread_cond_broadcast(&thread_worker); + pthread_mutex_unlock(&thread_lock); + + usleep(100000); + + /* Ok, all threads are patiently blocked, start waking folks up */ + gettimeofday(&start, NULL); + while (nwoken != nthreads) + nwoken += futex_wake(&futex1, nwakes, FUTEX_PRIVATE_FLAG); + gettimeofday(&end, NULL); + timersub(&end, &start, &runtime); + + update_stats(&wakeup_stats, nwoken); + update_stats(&waketime_stats, runtime.tv_usec); + + if (!silent) { + printf("[Run %d]: Wokeup %d of %d threads in %.4f ms\n", + j + 1, nwoken, nthreads, runtime.tv_usec/1e3); + } + + for (i = 0; i < nthreads; i++) { + ret = pthread_join(worker[i], NULL); + if (ret) + err(EXIT_FAILURE, "pthread_join"); + } + + } + + /* cleanup & report results */ + pthread_cond_destroy(&thread_parent); + pthread_cond_destroy(&thread_worker); + pthread_mutex_destroy(&thread_lock); + pthread_attr_destroy(&thread_attr); + + print_summary(); + + free(worker); + return ret; +} diff --git a/tools/perf/bench/futex.h b/tools/perf/bench/futex.h index 7d0bda543e3d..6ac45093fac4 100644 --- a/tools/perf/bench/futex.h +++ b/tools/perf/bench/futex.h @@ -45,4 +45,14 @@ futex_wait(u_int32_t *uaddr, u_int32_t val, struct timespec *timeout, int opflag return futex(uaddr, FUTEX_WAIT, val, timeout, NULL, 0, opflags); } +/** + * futex_wake() - wake one or more tasks blocked on uaddr + * @nr_wake: wake up to this many tasks + */ +static inline int +futex_wake(u_int32_t *uaddr, int nr_wake, int opflags) +{ + return futex(uaddr, FUTEX_WAKE, nr_wake, NULL, NULL, 0, opflags); +} + #endif /* _FUTEX_H */ diff --git a/tools/perf/builtin-bench.c b/tools/perf/builtin-bench.c index a8b0138fc6a0..743a30a8baa0 100644 --- a/tools/perf/builtin-bench.c +++ b/tools/perf/builtin-bench.c @@ -57,6 +57,7 @@ static struct bench mem_benchmarks[] = { static struct bench futex_benchmarks[] = { { "hash", "Benchmark for futex hash table", bench_futex_hash }, + { "wake", "Benchmark for futex wake calls", bench_futex_wake }, { "all", "Test all futex benchmarks", NULL }, { NULL, NULL, NULL } }; -- 1.8.1.4 ^ permalink raw reply related [flat|nested] 56+ messages in thread
* [PATCH 03/16] perf bench: Add futex-requeue microbenchmark 2014-03-14 21:29 [GIT PULL 00/16] perf/core improvements and fixes Arnaldo Carvalho de Melo 2014-03-14 21:29 ` [PATCH 01/16] perf bench: Add futex-hash microbenchmark Arnaldo Carvalho de Melo 2014-03-14 21:29 ` [PATCH 02/16] perf bench: Add futex-wake microbenchmark Arnaldo Carvalho de Melo @ 2014-03-14 21:29 ` Arnaldo Carvalho de Melo 2014-03-14 21:29 ` [PATCH 04/16] perf mem: Clarify load-latency in documentation Arnaldo Carvalho de Melo ` (13 subsequent siblings) 16 siblings, 0 replies; 56+ messages in thread From: Arnaldo Carvalho de Melo @ 2014-03-14 21:29 UTC (permalink / raw) To: Ingo Molnar Cc: linux-kernel, Davidlohr Bueso, Aswin Chandramouleeswaran, Darren Hart, Ingo Molnar, Jason Low, Peter Zijlstra, Scott J Norton, Thomas Gleixner, Waiman Long, Arnaldo Carvalho de Melo From: Davidlohr Bueso <davidlohr@hp.com> Block a bunch of threads on a futex and requeue them on another, N at a time. This program is particularly useful to measure the latency of nthread requeues without waking up any tasks -- thus mimicking a regular futex_wait. An example run: $ perf bench futex requeue -r 100 -t 64 Run summary [PID 151011]: Requeuing 64 threads (from 0x7d15c4 to 0x7d15c8), 1 at a time. [Run 1]: Requeued 64 of 64 threads in 0.0400 ms [Run 2]: Requeued 64 of 64 threads in 0.0390 ms [Run 3]: Requeued 64 of 64 threads in 0.0400 ms ... [Run 100]: Requeued 64 of 64 threads in 0.0390 ms Requeued 64 of 64 threads in 0.0399 ms (+-0.37%) Signed-off-by: Davidlohr Bueso <davidlohr@hp.com> Acked-by: Darren Hart <dvhart@linux.intel.com> Cc: Aswin Chandramouleeswaran <aswin@hp.com> Cc: Darren Hart <dvhart@linux.intel.com> Cc: Ingo Molnar <mingo@redhat.com> Cc: Jason Low <jason.low2@hp.com> Cc: Peter Zijlstra <peterz@infradead.org> Cc: Scott J Norton <scott.norton@hp.com> Cc: Thomas Gleixner <tglx@linutronix.de> Cc: Waiman Long <Waiman.Long@hp.com> Link: http://lkml.kernel.org/r/1387081917-9102-4-git-send-email-davidlohr@hp.com Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com> --- tools/perf/Makefile.perf | 1 + tools/perf/bench/bench.h | 1 + tools/perf/bench/futex-requeue.c | 211 +++++++++++++++++++++++++++++++++++++++ tools/perf/bench/futex.h | 13 +++ tools/perf/builtin-bench.c | 1 + 5 files changed, 227 insertions(+) create mode 100644 tools/perf/bench/futex-requeue.c diff --git a/tools/perf/Makefile.perf b/tools/perf/Makefile.perf index 6fa5d8b74635..50d875d970c4 100644 --- a/tools/perf/Makefile.perf +++ b/tools/perf/Makefile.perf @@ -428,6 +428,7 @@ BUILTIN_OBJS += $(OUTPUT)bench/mem-memcpy.o BUILTIN_OBJS += $(OUTPUT)bench/mem-memset.o BUILTIN_OBJS += $(OUTPUT)bench/futex-hash.o BUILTIN_OBJS += $(OUTPUT)bench/futex-wake.o +BUILTIN_OBJS += $(OUTPUT)bench/futex-requeue.o BUILTIN_OBJS += $(OUTPUT)builtin-diff.o BUILTIN_OBJS += $(OUTPUT)builtin-evlist.o diff --git a/tools/perf/bench/bench.h b/tools/perf/bench/bench.h index 6ac3f1d083cc..eba46709b279 100644 --- a/tools/perf/bench/bench.h +++ b/tools/perf/bench/bench.h @@ -33,6 +33,7 @@ extern int bench_mem_memcpy(int argc, const char **argv, extern int bench_mem_memset(int argc, const char **argv, const char *prefix); extern int bench_futex_hash(int argc, const char **argv, const char *prefix); extern int bench_futex_wake(int argc, const char **argv, const char *prefix); +extern int bench_futex_requeue(int argc, const char **argv, const char *prefix); #define BENCH_FORMAT_DEFAULT_STR "default" #define BENCH_FORMAT_DEFAULT 0 diff --git a/tools/perf/bench/futex-requeue.c b/tools/perf/bench/futex-requeue.c new file mode 100644 index 000000000000..a16255876f1d --- /dev/null +++ b/tools/perf/bench/futex-requeue.c @@ -0,0 +1,211 @@ +/* + * Copyright (C) 2013 Davidlohr Bueso <davidlohr@hp.com> + * + * futex-requeue: Block a bunch of threads on futex1 and requeue them + * on futex2, N at a time. + * + * This program is particularly useful to measure the latency of nthread + * requeues without waking up any tasks -- thus mimicking a regular futex_wait. + */ + +#include "../perf.h" +#include "../util/util.h" +#include "../util/stat.h" +#include "../util/parse-options.h" +#include "../util/header.h" +#include "bench.h" +#include "futex.h" + +#include <err.h> +#include <stdlib.h> +#include <sys/time.h> +#include <pthread.h> + +static u_int32_t futex1 = 0, futex2 = 0; + +/* + * How many tasks to requeue at a time. + * Default to 1 in order to make the kernel work more. + */ +static unsigned int nrequeue = 1; + +/* + * There can be significant variance from run to run, + * the more repeats, the more exact the overall avg and + * the better idea of the futex latency. + */ +static unsigned int repeat = 10; + +static pthread_t *worker; +static bool done = 0, silent = 0; +static pthread_mutex_t thread_lock; +static pthread_cond_t thread_parent, thread_worker; +static struct stats requeuetime_stats, requeued_stats; +static unsigned int ncpus, threads_starting, nthreads = 0; + +static const struct option options[] = { + OPT_UINTEGER('t', "threads", &nthreads, "Specify amount of threads"), + OPT_UINTEGER('q', "nrequeue", &nrequeue, "Specify amount of threads to requeue at once"), + OPT_UINTEGER('r', "repeat", &repeat, "Specify amount of times to repeat the run"), + OPT_BOOLEAN( 's', "silent", &silent, "Silent mode: do not display data/details"), + OPT_END() +}; + +static const char * const bench_futex_requeue_usage[] = { + "perf bench futex requeue <options>", + NULL +}; + +static void print_summary(void) +{ + double requeuetime_avg = avg_stats(&requeuetime_stats); + double requeuetime_stddev = stddev_stats(&requeuetime_stats); + unsigned int requeued_avg = avg_stats(&requeued_stats); + + printf("Requeued %d of %d threads in %.4f ms (+-%.2f%%)\n", + requeued_avg, + nthreads, + requeuetime_avg/1e3, + rel_stddev_stats(requeuetime_stddev, requeuetime_avg)); +} + +static void *workerfn(void *arg __maybe_unused) +{ + pthread_mutex_lock(&thread_lock); + threads_starting--; + if (!threads_starting) + pthread_cond_signal(&thread_parent); + pthread_cond_wait(&thread_worker, &thread_lock); + pthread_mutex_unlock(&thread_lock); + + futex_wait(&futex1, 0, NULL, FUTEX_PRIVATE_FLAG); + return NULL; +} + +static void block_threads(pthread_t *w, + pthread_attr_t thread_attr) +{ + cpu_set_t cpu; + unsigned int i; + + threads_starting = nthreads; + + /* create and block all threads */ + for (i = 0; i < nthreads; i++) { + CPU_ZERO(&cpu); + CPU_SET(i % ncpus, &cpu); + + if (pthread_attr_setaffinity_np(&thread_attr, sizeof(cpu_set_t), &cpu)) + err(EXIT_FAILURE, "pthread_attr_setaffinity_np"); + + if (pthread_create(&w[i], &thread_attr, workerfn, NULL)) + err(EXIT_FAILURE, "pthread_create"); + } +} + +static void toggle_done(int sig __maybe_unused, + siginfo_t *info __maybe_unused, + void *uc __maybe_unused) +{ + done = true; +} + +int bench_futex_requeue(int argc, const char **argv, + const char *prefix __maybe_unused) +{ + int ret = 0; + unsigned int i, j; + struct sigaction act; + pthread_attr_t thread_attr; + + argc = parse_options(argc, argv, options, bench_futex_requeue_usage, 0); + if (argc) + goto err; + + ncpus = sysconf(_SC_NPROCESSORS_ONLN); + + sigfillset(&act.sa_mask); + act.sa_sigaction = toggle_done; + sigaction(SIGINT, &act, NULL); + + if (!nthreads) + nthreads = ncpus; + + worker = calloc(nthreads, sizeof(*worker)); + if (!worker) + err(EXIT_FAILURE, "calloc"); + + printf("Run summary [PID %d]: Requeuing %d threads (from %p to %p), " + "%d at a time.\n\n", + getpid(), nthreads, &futex1, &futex2, nrequeue); + + init_stats(&requeued_stats); + init_stats(&requeuetime_stats); + pthread_attr_init(&thread_attr); + pthread_mutex_init(&thread_lock, NULL); + pthread_cond_init(&thread_parent, NULL); + pthread_cond_init(&thread_worker, NULL); + + for (j = 0; j < repeat && !done; j++) { + unsigned int nrequeued = 0; + struct timeval start, end, runtime; + + /* create, launch & block all threads */ + block_threads(worker, thread_attr); + + /* make sure all threads are already blocked */ + pthread_mutex_lock(&thread_lock); + while (threads_starting) + pthread_cond_wait(&thread_parent, &thread_lock); + pthread_cond_broadcast(&thread_worker); + pthread_mutex_unlock(&thread_lock); + + usleep(100000); + + /* Ok, all threads are patiently blocked, start requeueing */ + gettimeofday(&start, NULL); + for (nrequeued = 0; nrequeued < nthreads; nrequeued += nrequeue) + /* + * Do not wakeup any tasks blocked on futex1, allowing + * us to really measure futex_wait functionality. + */ + futex_cmp_requeue(&futex1, 0, &futex2, 0, nrequeue, + FUTEX_PRIVATE_FLAG); + gettimeofday(&end, NULL); + timersub(&end, &start, &runtime); + + update_stats(&requeued_stats, nrequeued); + update_stats(&requeuetime_stats, runtime.tv_usec); + + if (!silent) { + printf("[Run %d]: Requeued %d of %d threads in %.4f ms\n", + j + 1, nrequeued, nthreads, runtime.tv_usec/1e3); + } + + /* everybody should be blocked on futex2, wake'em up */ + nrequeued = futex_wake(&futex2, nthreads, FUTEX_PRIVATE_FLAG); + if (nthreads != nrequeued) + warnx("couldn't wakeup all tasks (%d/%d)", nrequeued, nthreads); + + for (i = 0; i < nthreads; i++) { + ret = pthread_join(worker[i], NULL); + if (ret) + err(EXIT_FAILURE, "pthread_join"); + } + + } + + /* cleanup & report results */ + pthread_cond_destroy(&thread_parent); + pthread_cond_destroy(&thread_worker); + pthread_mutex_destroy(&thread_lock); + pthread_attr_destroy(&thread_attr); + + print_summary(); + + free(worker); + return ret; +err: + usage_with_options(bench_futex_requeue_usage, options); + exit(EXIT_FAILURE); +} diff --git a/tools/perf/bench/futex.h b/tools/perf/bench/futex.h index 6ac45093fac4..71f2844cf97f 100644 --- a/tools/perf/bench/futex.h +++ b/tools/perf/bench/futex.h @@ -55,4 +55,17 @@ futex_wake(u_int32_t *uaddr, int nr_wake, int opflags) return futex(uaddr, FUTEX_WAKE, nr_wake, NULL, NULL, 0, opflags); } +/** +* futex_cmp_requeue() - requeue tasks from uaddr to uaddr2 +* @nr_wake: wake up to this many tasks +* @nr_requeue: requeue up to this many tasks +*/ +static inline int +futex_cmp_requeue(u_int32_t *uaddr, u_int32_t val, u_int32_t *uaddr2, int nr_wake, + int nr_requeue, int opflags) +{ + return futex(uaddr, FUTEX_CMP_REQUEUE, nr_wake, nr_requeue, uaddr2, + val, opflags); +} + #endif /* _FUTEX_H */ diff --git a/tools/perf/builtin-bench.c b/tools/perf/builtin-bench.c index 743a30a8baa0..f600b74216da 100644 --- a/tools/perf/builtin-bench.c +++ b/tools/perf/builtin-bench.c @@ -58,6 +58,7 @@ static struct bench mem_benchmarks[] = { static struct bench futex_benchmarks[] = { { "hash", "Benchmark for futex hash table", bench_futex_hash }, { "wake", "Benchmark for futex wake calls", bench_futex_wake }, + { "requeue", "Benchmark for futex requeue calls", bench_futex_requeue }, { "all", "Test all futex benchmarks", NULL }, { NULL, NULL, NULL } }; -- 1.8.1.4 ^ permalink raw reply related [flat|nested] 56+ messages in thread
* [PATCH 04/16] perf mem: Clarify load-latency in documentation 2014-03-14 21:29 [GIT PULL 00/16] perf/core improvements and fixes Arnaldo Carvalho de Melo ` (2 preceding siblings ...) 2014-03-14 21:29 ` [PATCH 03/16] perf bench: Add futex-requeue microbenchmark Arnaldo Carvalho de Melo @ 2014-03-14 21:29 ` Arnaldo Carvalho de Melo 2014-03-14 21:29 ` [PATCH 05/16] perf probe: Clarify x86 register naming for perf probe Arnaldo Carvalho de Melo ` (12 subsequent siblings) 16 siblings, 0 replies; 56+ messages in thread From: Arnaldo Carvalho de Melo @ 2014-03-14 21:29 UTC (permalink / raw) To: Ingo Molnar Cc: linux-kernel, Andi Kleen, Stephane Eranian, Arnaldo Carvalho de Melo From: Andi Kleen <ak@linux.intel.com> Clarify in the documentation that 'perf mem report' reports use-latency, not load/store-latency on Intel systems. This often causes confusion with users. Signed-off-by: Andi Kleen <ak@linux.intel.com> Cc: Stephane Eranian <eranian@google.com> Link: http://lkml.kernel.org/r/1393596135-4227-2-git-send-email-andi@firstfloor.org Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com> --- tools/perf/Documentation/perf-mem.txt | 4 ++++ 1 file changed, 4 insertions(+) diff --git a/tools/perf/Documentation/perf-mem.txt b/tools/perf/Documentation/perf-mem.txt index 888d51137fbe..1d78a4064da4 100644 --- a/tools/perf/Documentation/perf-mem.txt +++ b/tools/perf/Documentation/perf-mem.txt @@ -18,6 +18,10 @@ from it, into perf.data. Perf record options are accepted and are passed through "perf mem -t <TYPE> report" displays the result. It invokes perf report with the right set of options to display a memory access profile. +Note that on Intel systems the memory latency reported is the use-latency, +not the pure load (or store latency). Use latency includes any pipeline +queueing delays in addition to the memory subsystem latency. + OPTIONS ------- <command>...:: -- 1.8.1.4 ^ permalink raw reply related [flat|nested] 56+ messages in thread
* [PATCH 05/16] perf probe: Clarify x86 register naming for perf probe 2014-03-14 21:29 [GIT PULL 00/16] perf/core improvements and fixes Arnaldo Carvalho de Melo ` (3 preceding siblings ...) 2014-03-14 21:29 ` [PATCH 04/16] perf mem: Clarify load-latency in documentation Arnaldo Carvalho de Melo @ 2014-03-14 21:29 ` Arnaldo Carvalho de Melo 2014-03-14 21:30 ` [PATCH 06/16] perf tools: Fix synthesizing mmaps for threads Arnaldo Carvalho de Melo ` (11 subsequent siblings) 16 siblings, 0 replies; 56+ messages in thread From: Arnaldo Carvalho de Melo @ 2014-03-14 21:29 UTC (permalink / raw) To: Ingo Molnar Cc: linux-kernel, Andi Kleen, Masami Hiramatsu, Arnaldo Carvalho de Melo From: Andi Kleen <ak@linux.intel.com> Clarify how to specify x86 registers in perf probe. I recently ran into this problem and had to figure it out from the source. Signed-off-by: Andi Kleen <ak@linux.intel.com> Acked-by: Masami Hiramatsu <masami.hiramatsu.pt@hitachi.com> Cc: Masami Hiramatsu <masami.hiramatsu.pt@hitachi.com> Link: http://lkml.kernel.org/r/1393596135-4227-3-git-send-email-andi@firstfloor.org Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com> --- tools/perf/Documentation/perf-probe.txt | 2 ++ 1 file changed, 2 insertions(+) diff --git a/tools/perf/Documentation/perf-probe.txt b/tools/perf/Documentation/perf-probe.txt index b715cb71592b..1513935c399b 100644 --- a/tools/perf/Documentation/perf-probe.txt +++ b/tools/perf/Documentation/perf-probe.txt @@ -136,6 +136,8 @@ Each probe argument follows below syntax. 'NAME' specifies the name of this argument (optional). You can use the name of local variable, local data structure member (e.g. var->field, var.field2), local array with fixed index (e.g. array[1], var->array[0], var->pointer[2]), or kprobe-tracer argument format (e.g. $retval, %ax, etc). Note that the name of this argument will be set as the last member name if you specify a local data structure member (e.g. field2 for 'var->field1.field2'.) 'TYPE' casts the type of this argument (optional). If omitted, perf probe automatically set the type based on debuginfo. You can specify 'string' type only for the local variable or structure member which is an array of or a pointer to 'char' or 'unsigned char' type. +On x86 systems %REG is always the short form of the register: for example %AX. %RAX or %EAX is not valid. + LINE SYNTAX ----------- Line range is described by following syntax. -- 1.8.1.4 ^ permalink raw reply related [flat|nested] 56+ messages in thread
* [PATCH 06/16] perf tools: Fix synthesizing mmaps for threads 2014-03-14 21:29 [GIT PULL 00/16] perf/core improvements and fixes Arnaldo Carvalho de Melo ` (4 preceding siblings ...) 2014-03-14 21:29 ` [PATCH 05/16] perf probe: Clarify x86 register naming for perf probe Arnaldo Carvalho de Melo @ 2014-03-14 21:30 ` Arnaldo Carvalho de Melo 2014-03-14 21:30 ` [PATCH 07/16] perf ui/stdio: Fix invalid output on event group report Arnaldo Carvalho de Melo ` (10 subsequent siblings) 16 siblings, 0 replies; 56+ messages in thread From: Arnaldo Carvalho de Melo @ 2014-03-14 21:30 UTC (permalink / raw) To: Ingo Molnar Cc: linux-kernel, Don Zickus, Jiri Olsa, Stephane Eranian, Arnaldo Carvalho de Melo From: Don Zickus <dzickus@redhat.com> Currently if a process creates a bunch of threads using pthread_create and then perf is run in system_wide mode, the mmaps for those threads are not captured with a synthesized mmap event. The reason is those threads are not visible when walking the /proc/ directory looking for /proc/<pid>/maps files. Instead they are discovered using the /proc/<pid>/tasks file (which the synthesized comm event uses). This causes problems when a program is trying to map a data address to a tid. Because the tid has no maps, the event is dropped. Changing the program to look up using the pid instead of the tid, finds the correct maps but creates ugly hacks in the program to carry the correct tid around. Fix this by moving the walking of the /proc/<pid>/tasks up a level (out of the comm function) based on Arnaldo's suggestion. Tweaked things a bit to special case the 'full' bit and 'guest' check. Signed-off-by: Don Zickus <dzickus@redhat.com> Cc: Jiri Olsa <jolsa@redhat.com> Cc: Stephane Eranian <eranian@google.com> Link: http://lkml.kernel.org/r/1393429527-167840-2-git-send-email-dzickus@redhat.com Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com> --- tools/perf/util/event.c | 111 ++++++++++++++++++++++++------------------------ 1 file changed, 56 insertions(+), 55 deletions(-) diff --git a/tools/perf/util/event.c b/tools/perf/util/event.c index b0f3ca850e9e..55eebe936513 100644 --- a/tools/perf/util/event.c +++ b/tools/perf/util/event.c @@ -94,14 +94,10 @@ static pid_t perf_event__get_comm_tgid(pid_t pid, char *comm, size_t len) static pid_t perf_event__synthesize_comm(struct perf_tool *tool, union perf_event *event, pid_t pid, - int full, perf_event__handler_t process, struct machine *machine) { - char filename[PATH_MAX]; size_t size; - DIR *tasks; - struct dirent dirent, *next; pid_t tgid; memset(&event->comm, 0, sizeof(event->comm)); @@ -124,53 +120,11 @@ static pid_t perf_event__synthesize_comm(struct perf_tool *tool, event->comm.header.size = (sizeof(event->comm) - (sizeof(event->comm.comm) - size) + machine->id_hdr_size); - if (!full) { - event->comm.tid = pid; - - if (process(tool, event, &synth_sample, machine) != 0) - return -1; + event->comm.tid = pid; - goto out; - } - - if (machine__is_default_guest(machine)) - return 0; - - snprintf(filename, sizeof(filename), "%s/proc/%d/task", - machine->root_dir, pid); - - tasks = opendir(filename); - if (tasks == NULL) { - pr_debug("couldn't open %s\n", filename); - return 0; - } - - while (!readdir_r(tasks, &dirent, &next) && next) { - char *end; - pid = strtol(dirent.d_name, &end, 10); - if (*end) - continue; - - /* already have tgid; jut want to update the comm */ - (void) perf_event__get_comm_tgid(pid, event->comm.comm, - sizeof(event->comm.comm)); - - size = strlen(event->comm.comm) + 1; - size = PERF_ALIGN(size, sizeof(u64)); - memset(event->comm.comm + size, 0, machine->id_hdr_size); - event->comm.header.size = (sizeof(event->comm) - - (sizeof(event->comm.comm) - size) + - machine->id_hdr_size); - - event->comm.tid = pid; - - if (process(tool, event, &synth_sample, machine) != 0) { - tgid = -1; - break; - } - } + if (process(tool, event, &synth_sample, machine) != 0) + return -1; - closedir(tasks); out: return tgid; } @@ -329,12 +283,59 @@ static int __event__synthesize_thread(union perf_event *comm_event, struct perf_tool *tool, struct machine *machine, bool mmap_data) { - pid_t tgid = perf_event__synthesize_comm(tool, comm_event, pid, full, - process, machine); - if (tgid == -1) - return -1; - return perf_event__synthesize_mmap_events(tool, mmap_event, pid, tgid, - process, machine, mmap_data); + char filename[PATH_MAX]; + DIR *tasks; + struct dirent dirent, *next; + pid_t tgid; + + /* special case: only send one comm event using passed in pid */ + if (!full) { + tgid = perf_event__synthesize_comm(tool, comm_event, pid, + process, machine); + + if (tgid == -1) + return -1; + + return perf_event__synthesize_mmap_events(tool, mmap_event, pid, tgid, + process, machine, mmap_data); + } + + if (machine__is_default_guest(machine)) + return 0; + + snprintf(filename, sizeof(filename), "%s/proc/%d/task", + machine->root_dir, pid); + + tasks = opendir(filename); + if (tasks == NULL) { + pr_debug("couldn't open %s\n", filename); + return 0; + } + + while (!readdir_r(tasks, &dirent, &next) && next) { + char *end; + int rc = 0; + pid_t _pid; + + _pid = strtol(dirent.d_name, &end, 10); + if (*end) + continue; + + tgid = perf_event__synthesize_comm(tool, comm_event, _pid, + process, machine); + if (tgid == -1) + return -1; + + /* process the thread's maps too */ + rc = perf_event__synthesize_mmap_events(tool, mmap_event, _pid, tgid, + process, machine, mmap_data); + + if (rc) + return rc; + } + + closedir(tasks); + return 0; } int perf_event__synthesize_thread_map(struct perf_tool *tool, -- 1.8.1.4 ^ permalink raw reply related [flat|nested] 56+ messages in thread
* [PATCH 07/16] perf ui/stdio: Fix invalid output on event group report 2014-03-14 21:29 [GIT PULL 00/16] perf/core improvements and fixes Arnaldo Carvalho de Melo ` (5 preceding siblings ...) 2014-03-14 21:30 ` [PATCH 06/16] perf tools: Fix synthesizing mmaps for threads Arnaldo Carvalho de Melo @ 2014-03-14 21:30 ` Arnaldo Carvalho de Melo 2014-03-14 21:30 ` [PATCH 08/16] perf ui/gtk: Reuse generic __hpp__fmt() code Arnaldo Carvalho de Melo ` (9 subsequent siblings) 16 siblings, 0 replies; 56+ messages in thread From: Arnaldo Carvalho de Melo @ 2014-03-14 21:30 UTC (permalink / raw) To: Ingo Molnar Cc: linux-kernel, Namhyung Kim, Andi Kleen, David Ahern, Jiri Olsa, Namhyung Kim, Paul Mackerras, Peter Zijlstra, Arnaldo Carvalho de Melo From: Namhyung Kim <namhyung@kernel.org> When some of group member has 0 overhead, it printed previous percentage instead of 0.00%. It's because passing integer 0 as a percent rather than double 0.0 so the remaining bits came from garbage. The TUI and GTK don't have this problem since they pass 0.0. Before: # Samples: 845 of event 'anon group { cycles, cache-references, cache-misses }' # Event count (approx.): 174775051 # # Overhead Samples # ........................ .................................... # 20.32% 8.58% 73.51% 45 30 138 6.87% 6.87% 6.87% 21 0 0 5.29% 0.31% 0.31% 10 1 0 1.89% 1.89% 1.89% 6 0 0 1.76% 1.76% 1.76% 2 0 0 After: # Overhead Samples # ........................ .................................... # 20.32% 8.58% 73.51% 45 30 138 6.87% 0.00% 0.00% 21 0 0 5.29% 0.31% 0.00% 10 1 0 1.89% 0.00% 0.00% 6 0 0 1.76% 0.00% 0.00% 2 0 0 Signed-off-by: Namhyung Kim <namhyung@kernel.org> Cc: Andi Kleen <andi@firstfloor.org> Cc: David Ahern <dsahern@gmail.com> Cc: Ingo Molnar <mingo@kernel.org> Cc: Jiri Olsa <jolsa@redhat.com> Cc: Namhyung Kim <namhyung.kim@lge.com> Cc: Paul Mackerras <paulus@samba.org> Cc: Peter Zijlstra <a.p.zijlstra@chello.nl> Link: http://lkml.kernel.org/r/1393809254-4480-2-git-send-email-namhyung@kernel.org Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com> --- tools/perf/ui/hist.c | 20 ++++++++++++++++---- 1 file changed, 16 insertions(+), 4 deletions(-) diff --git a/tools/perf/ui/hist.c b/tools/perf/ui/hist.c index 78f4c92e9b73..6094562c9523 100644 --- a/tools/perf/ui/hist.c +++ b/tools/perf/ui/hist.c @@ -52,8 +52,15 @@ static int __hpp__fmt(struct perf_hpp *hpp, struct hist_entry *he, * zero-fill group members in the middle which * have no sample */ - ret += print_fn(hpp->buf + ret, hpp->size - ret, - fmt, 0); + if (fmt_percent) { + ret += print_fn(hpp->buf + ret, + hpp->size - ret, + fmt, 0.0); + } else { + ret += print_fn(hpp->buf + ret, + hpp->size - ret, + fmt, 0ULL); + } } if (fmt_percent) @@ -72,8 +79,13 @@ static int __hpp__fmt(struct perf_hpp *hpp, struct hist_entry *he, /* * zero-fill group members at last which have no sample */ - ret += print_fn(hpp->buf + ret, hpp->size - ret, - fmt, 0); + if (fmt_percent) { + ret += print_fn(hpp->buf + ret, hpp->size - ret, + fmt, 0.0); + } else { + ret += print_fn(hpp->buf + ret, hpp->size - ret, + fmt, 0ULL); + } } } return ret; -- 1.8.1.4 ^ permalink raw reply related [flat|nested] 56+ messages in thread
* [PATCH 08/16] perf ui/gtk: Reuse generic __hpp__fmt() code 2014-03-14 21:29 [GIT PULL 00/16] perf/core improvements and fixes Arnaldo Carvalho de Melo ` (6 preceding siblings ...) 2014-03-14 21:30 ` [PATCH 07/16] perf ui/stdio: Fix invalid output on event group report Arnaldo Carvalho de Melo @ 2014-03-14 21:30 ` Arnaldo Carvalho de Melo 2014-03-14 21:30 ` [PATCH 09/16] perf ui/hists: Pass struct hpp to print functions Arnaldo Carvalho de Melo ` (8 subsequent siblings) 16 siblings, 0 replies; 56+ messages in thread From: Arnaldo Carvalho de Melo @ 2014-03-14 21:30 UTC (permalink / raw) To: Ingo Molnar Cc: linux-kernel, Namhyung Kim, Andi Kleen, David Ahern, Jiri Olsa, Namhyung Kim, Paul Mackerras, Pekka Enberg, Peter Zijlstra, Arnaldo Carvalho de Melo From: Namhyung Kim <namhyung@kernel.org> The __hpp__color_fmt used in the gtk code can be replace by the generic code with small change in print_fn callback. This is a preparation to upcoming changes and no functional changes intended. Signed-off-by: Namhyung Kim <namhyung@kernel.org> Acked-by: Pekka Enberg <penberg@kernel.org> Cc: Andi Kleen <andi@firstfloor.org> Cc: David Ahern <dsahern@gmail.com> Cc: Ingo Molnar <mingo@kernel.org> Cc: Jiri Olsa <jolsa@redhat.com> Cc: Namhyung Kim <namhyung.kim@lge.com> Cc: Paul Mackerras <paulus@samba.org> Cc: Pekka Enberg <penberg@kernel.org> Cc: Peter Zijlstra <a.p.zijlstra@chello.nl> Link: http://lkml.kernel.org/r/1393809254-4480-3-git-send-email-namhyung@kernel.org Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com> --- tools/perf/ui/gtk/hists.c | 73 +++++++---------------------------------------- tools/perf/ui/hist.c | 9 ++---- tools/perf/util/hist.h | 6 ++++ 3 files changed, 19 insertions(+), 69 deletions(-) diff --git a/tools/perf/ui/gtk/hists.c b/tools/perf/ui/gtk/hists.c index 5b95c44f3435..3dab00e748f2 100644 --- a/tools/perf/ui/gtk/hists.c +++ b/tools/perf/ui/gtk/hists.c @@ -8,16 +8,22 @@ #define MAX_COLUMNS 32 -static int __percent_color_snprintf(char *buf, size_t size, double percent) +static int __percent_color_snprintf(char *buf, size_t size, const char *fmt, ...) { int ret = 0; + va_list args; + double percent; const char *markup; + va_start(args, fmt); + percent = va_arg(args, double); + va_end(args); + markup = perf_gtk__get_percent_color(percent); if (markup) ret += scnprintf(buf, size, markup); - ret += scnprintf(buf + ret, size - ret, " %6.2f%%", percent); + ret += scnprintf(buf + ret, size - ret, fmt, percent); if (markup) ret += scnprintf(buf + ret, size - ret, "</span>"); @@ -25,66 +31,6 @@ static int __percent_color_snprintf(char *buf, size_t size, double percent) return ret; } - -static int __hpp__color_fmt(struct perf_hpp *hpp, struct hist_entry *he, - u64 (*get_field)(struct hist_entry *)) -{ - int ret; - double percent = 0.0; - struct hists *hists = he->hists; - struct perf_evsel *evsel = hists_to_evsel(hists); - - if (hists->stats.total_period) - percent = 100.0 * get_field(he) / hists->stats.total_period; - - ret = __percent_color_snprintf(hpp->buf, hpp->size, percent); - - if (perf_evsel__is_group_event(evsel)) { - int prev_idx, idx_delta; - struct hist_entry *pair; - int nr_members = evsel->nr_members; - - prev_idx = perf_evsel__group_idx(evsel); - - list_for_each_entry(pair, &he->pairs.head, pairs.node) { - u64 period = get_field(pair); - u64 total = pair->hists->stats.total_period; - - evsel = hists_to_evsel(pair->hists); - idx_delta = perf_evsel__group_idx(evsel) - prev_idx - 1; - - while (idx_delta--) { - /* - * zero-fill group members in the middle which - * have no sample - */ - ret += __percent_color_snprintf(hpp->buf + ret, - hpp->size - ret, - 0.0); - } - - percent = 100.0 * period / total; - ret += __percent_color_snprintf(hpp->buf + ret, - hpp->size - ret, - percent); - - prev_idx = perf_evsel__group_idx(evsel); - } - - idx_delta = nr_members - prev_idx - 1; - - while (idx_delta--) { - /* - * zero-fill group members at last which have no sample - */ - ret += __percent_color_snprintf(hpp->buf + ret, - hpp->size - ret, - 0.0); - } - } - return ret; -} - #define __HPP_COLOR_PERCENT_FN(_type, _field) \ static u64 he_get_##_field(struct hist_entry *he) \ { \ @@ -95,7 +41,8 @@ static int perf_gtk__hpp_color_##_type(struct perf_hpp_fmt *fmt __maybe_unused, struct perf_hpp *hpp, \ struct hist_entry *he) \ { \ - return __hpp__color_fmt(hpp, he, he_get_##_field); \ + return __hpp__fmt(hpp, he, he_get_##_field, " %6.2f%%", \ + __percent_color_snprintf, true); \ } __HPP_COLOR_PERCENT_FN(overhead, period) diff --git a/tools/perf/ui/hist.c b/tools/perf/ui/hist.c index 6094562c9523..085353435f07 100644 --- a/tools/perf/ui/hist.c +++ b/tools/perf/ui/hist.c @@ -8,12 +8,9 @@ /* hist period print (hpp) functions */ -typedef int (*hpp_snprint_fn)(char *buf, size_t size, const char *fmt, ...); - -static int __hpp__fmt(struct perf_hpp *hpp, struct hist_entry *he, - u64 (*get_field)(struct hist_entry *), - const char *fmt, hpp_snprint_fn print_fn, - bool fmt_percent) +int __hpp__fmt(struct perf_hpp *hpp, struct hist_entry *he, + u64 (*get_field)(struct hist_entry *), + const char *fmt, hpp_snprint_fn print_fn, bool fmt_percent) { int ret; struct hists *hists = he->hists; diff --git a/tools/perf/util/hist.h b/tools/perf/util/hist.h index a59743fa3ef7..97f924ec6af9 100644 --- a/tools/perf/util/hist.h +++ b/tools/perf/util/hist.h @@ -166,6 +166,12 @@ void perf_hpp__init(void); void perf_hpp__column_register(struct perf_hpp_fmt *format); void perf_hpp__column_enable(unsigned col); +typedef int (*hpp_snprint_fn)(char *buf, size_t size, const char *fmt, ...); + +int __hpp__fmt(struct perf_hpp *hpp, struct hist_entry *he, + u64 (*get_field)(struct hist_entry *), + const char *fmt, hpp_snprint_fn print_fn, bool fmt_percent); + static inline size_t perf_hpp__use_color(void) { return !symbol_conf.field_sep; -- 1.8.1.4 ^ permalink raw reply related [flat|nested] 56+ messages in thread
* [PATCH 09/16] perf ui/hists: Pass struct hpp to print functions 2014-03-14 21:29 [GIT PULL 00/16] perf/core improvements and fixes Arnaldo Carvalho de Melo ` (7 preceding siblings ...) 2014-03-14 21:30 ` [PATCH 08/16] perf ui/gtk: Reuse generic __hpp__fmt() code Arnaldo Carvalho de Melo @ 2014-03-14 21:30 ` Arnaldo Carvalho de Melo 2014-03-14 21:30 ` [PATCH 10/16] perf ui/tui: Reuse generic __hpp__fmt() code Arnaldo Carvalho de Melo ` (7 subsequent siblings) 16 siblings, 0 replies; 56+ messages in thread From: Arnaldo Carvalho de Melo @ 2014-03-14 21:30 UTC (permalink / raw) To: Ingo Molnar Cc: linux-kernel, Namhyung Kim, Andi Kleen, David Ahern, Jiri Olsa, Namhyung Kim, Paul Mackerras, Peter Zijlstra, Arnaldo Carvalho de Melo From: Namhyung Kim <namhyung@kernel.org> Instead of the pointer to buffer and its size so that it can also get private argument passed along with hpp. This is a preparation of further change. Signed-off-by: Namhyung Kim <namhyung@kernel.org> Cc: Andi Kleen <andi@firstfloor.org> Cc: David Ahern <dsahern@gmail.com> Cc: Ingo Molnar <mingo@kernel.org> Cc: Jiri Olsa <jolsa@redhat.com> Cc: Namhyung Kim <namhyung.kim@lge.com> Cc: Paul Mackerras <paulus@samba.org> Cc: Peter Zijlstra <a.p.zijlstra@chello.nl> Link: http://lkml.kernel.org/r/1393809254-4480-4-git-send-email-namhyung@kernel.org Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com> --- tools/perf/ui/gtk/hists.c | 4 ++- tools/perf/ui/hist.c | 87 +++++++++++++++++++++++++++++++++++----------- tools/perf/ui/stdio/hist.c | 6 ---- tools/perf/util/hist.h | 8 ++++- 4 files changed, 76 insertions(+), 29 deletions(-) diff --git a/tools/perf/ui/gtk/hists.c b/tools/perf/ui/gtk/hists.c index 3dab00e748f2..430fd555b161 100644 --- a/tools/perf/ui/gtk/hists.c +++ b/tools/perf/ui/gtk/hists.c @@ -8,12 +8,14 @@ #define MAX_COLUMNS 32 -static int __percent_color_snprintf(char *buf, size_t size, const char *fmt, ...) +static int __percent_color_snprintf(struct perf_hpp *hpp, const char *fmt, ...) { int ret = 0; va_list args; double percent; const char *markup; + char *buf = hpp->buf; + size_t size = hpp->size; va_start(args, fmt); percent = va_arg(args, double); diff --git a/tools/perf/ui/hist.c b/tools/perf/ui/hist.c index 085353435f07..0c427e5d37c7 100644 --- a/tools/perf/ui/hist.c +++ b/tools/perf/ui/hist.c @@ -8,6 +8,13 @@ /* hist period print (hpp) functions */ +#define hpp__call_print_fn(hpp, fn, fmt, ...) \ +({ \ + int __ret = fn(hpp, fmt, ##__VA_ARGS__); \ + advance_hpp(hpp, __ret); \ + __ret; \ +}) + int __hpp__fmt(struct perf_hpp *hpp, struct hist_entry *he, u64 (*get_field)(struct hist_entry *), const char *fmt, hpp_snprint_fn print_fn, bool fmt_percent) @@ -15,6 +22,8 @@ int __hpp__fmt(struct perf_hpp *hpp, struct hist_entry *he, int ret; struct hists *hists = he->hists; struct perf_evsel *evsel = hists_to_evsel(hists); + char *buf = hpp->buf; + size_t size = hpp->size; if (fmt_percent) { double percent = 0.0; @@ -23,9 +32,9 @@ int __hpp__fmt(struct perf_hpp *hpp, struct hist_entry *he, percent = 100.0 * get_field(he) / hists->stats.total_period; - ret = print_fn(hpp->buf, hpp->size, fmt, percent); + ret = hpp__call_print_fn(hpp, print_fn, fmt, percent); } else - ret = print_fn(hpp->buf, hpp->size, fmt, get_field(he)); + ret = hpp__call_print_fn(hpp, print_fn, fmt, get_field(he)); if (perf_evsel__is_group_event(evsel)) { int prev_idx, idx_delta; @@ -50,22 +59,21 @@ int __hpp__fmt(struct perf_hpp *hpp, struct hist_entry *he, * have no sample */ if (fmt_percent) { - ret += print_fn(hpp->buf + ret, - hpp->size - ret, - fmt, 0.0); + ret += hpp__call_print_fn(hpp, print_fn, + fmt, 0.0); } else { - ret += print_fn(hpp->buf + ret, - hpp->size - ret, - fmt, 0ULL); + ret += hpp__call_print_fn(hpp, print_fn, + fmt, 0ULL); } } - if (fmt_percent) - ret += print_fn(hpp->buf + ret, hpp->size - ret, - fmt, 100.0 * period / total); - else - ret += print_fn(hpp->buf + ret, hpp->size - ret, - fmt, period); + if (fmt_percent) { + ret += hpp__call_print_fn(hpp, print_fn, fmt, + 100.0 * period / total); + } else { + ret += hpp__call_print_fn(hpp, print_fn, fmt, + period); + } prev_idx = perf_evsel__group_idx(evsel); } @@ -77,14 +85,22 @@ int __hpp__fmt(struct perf_hpp *hpp, struct hist_entry *he, * zero-fill group members at last which have no sample */ if (fmt_percent) { - ret += print_fn(hpp->buf + ret, hpp->size - ret, - fmt, 0.0); + ret += hpp__call_print_fn(hpp, print_fn, + fmt, 0.0); } else { - ret += print_fn(hpp->buf + ret, hpp->size - ret, - fmt, 0ULL); + ret += hpp__call_print_fn(hpp, print_fn, + fmt, 0ULL); } } } + + /* + * Restore original buf and size as it's where caller expects + * the result will be saved. + */ + hpp->buf = buf; + hpp->size = size; + return ret; } @@ -116,6 +132,34 @@ static int hpp__width_##_type(struct perf_hpp_fmt *fmt __maybe_unused, \ return len; \ } +static int hpp_color_scnprintf(struct perf_hpp *hpp, const char *fmt, ...) +{ + va_list args; + ssize_t ssize = hpp->size; + double percent; + int ret; + + va_start(args, fmt); + percent = va_arg(args, double); + ret = value_color_snprintf(hpp->buf, hpp->size, fmt, percent); + va_end(args); + + return (ret >= ssize) ? (ssize - 1) : ret; +} + +static int hpp_entry_scnprintf(struct perf_hpp *hpp, const char *fmt, ...) +{ + va_list args; + ssize_t ssize = hpp->size; + int ret; + + va_start(args, fmt); + ret = vsnprintf(hpp->buf, hpp->size, fmt, args); + va_end(args); + + return (ret >= ssize) ? (ssize - 1) : ret; +} + #define __HPP_COLOR_PERCENT_FN(_type, _field) \ static u64 he_get_##_field(struct hist_entry *he) \ { \ @@ -126,7 +170,7 @@ static int hpp__color_##_type(struct perf_hpp_fmt *fmt __maybe_unused, \ struct perf_hpp *hpp, struct hist_entry *he) \ { \ return __hpp__fmt(hpp, he, he_get_##_field, " %6.2f%%", \ - percent_color_snprintf, true); \ + hpp_color_scnprintf, true); \ } #define __HPP_ENTRY_PERCENT_FN(_type, _field) \ @@ -135,7 +179,7 @@ static int hpp__entry_##_type(struct perf_hpp_fmt *_fmt __maybe_unused, \ { \ const char *fmt = symbol_conf.field_sep ? " %.2f" : " %6.2f%%"; \ return __hpp__fmt(hpp, he, he_get_##_field, fmt, \ - scnprintf, true); \ + hpp_entry_scnprintf, true); \ } #define __HPP_ENTRY_RAW_FN(_type, _field) \ @@ -148,7 +192,8 @@ static int hpp__entry_##_type(struct perf_hpp_fmt *_fmt __maybe_unused, \ struct perf_hpp *hpp, struct hist_entry *he) \ { \ const char *fmt = symbol_conf.field_sep ? " %"PRIu64 : " %11"PRIu64; \ - return __hpp__fmt(hpp, he, he_get_raw_##_field, fmt, scnprintf, false); \ + return __hpp__fmt(hpp, he, he_get_raw_##_field, fmt, \ + hpp_entry_scnprintf, false); \ } #define HPP_PERCENT_FNS(_type, _str, _field, _min_width, _unit_width) \ diff --git a/tools/perf/ui/stdio/hist.c b/tools/perf/ui/stdio/hist.c index 831fbb77d1ff..9bad89228472 100644 --- a/tools/perf/ui/stdio/hist.c +++ b/tools/perf/ui/stdio/hist.c @@ -306,12 +306,6 @@ static size_t hist_entry__callchain_fprintf(struct hist_entry *he, return hist_entry_callchain__fprintf(he, total_period, left_margin, fp); } -static inline void advance_hpp(struct perf_hpp *hpp, int inc) -{ - hpp->buf += inc; - hpp->size -= inc; -} - static int hist_entry__period_snprintf(struct perf_hpp *hpp, struct hist_entry *he) { diff --git a/tools/perf/util/hist.h b/tools/perf/util/hist.h index 97f924ec6af9..d51ed9898229 100644 --- a/tools/perf/util/hist.h +++ b/tools/perf/util/hist.h @@ -166,12 +166,18 @@ void perf_hpp__init(void); void perf_hpp__column_register(struct perf_hpp_fmt *format); void perf_hpp__column_enable(unsigned col); -typedef int (*hpp_snprint_fn)(char *buf, size_t size, const char *fmt, ...); +typedef int (*hpp_snprint_fn)(struct perf_hpp *hpp, const char *fmt, ...); int __hpp__fmt(struct perf_hpp *hpp, struct hist_entry *he, u64 (*get_field)(struct hist_entry *), const char *fmt, hpp_snprint_fn print_fn, bool fmt_percent); +static inline void advance_hpp(struct perf_hpp *hpp, int inc) +{ + hpp->buf += inc; + hpp->size -= inc; +} + static inline size_t perf_hpp__use_color(void) { return !symbol_conf.field_sep; -- 1.8.1.4 ^ permalink raw reply related [flat|nested] 56+ messages in thread
* [PATCH 10/16] perf ui/tui: Reuse generic __hpp__fmt() code 2014-03-14 21:29 [GIT PULL 00/16] perf/core improvements and fixes Arnaldo Carvalho de Melo ` (8 preceding siblings ...) 2014-03-14 21:30 ` [PATCH 09/16] perf ui/hists: Pass struct hpp to print functions Arnaldo Carvalho de Melo @ 2014-03-14 21:30 ` Arnaldo Carvalho de Melo 2014-03-14 21:30 ` [PATCH 11/16] perf session: Change header.misc dump from decimal to hex Arnaldo Carvalho de Melo ` (6 subsequent siblings) 16 siblings, 0 replies; 56+ messages in thread From: Arnaldo Carvalho de Melo @ 2014-03-14 21:30 UTC (permalink / raw) To: Ingo Molnar Cc: linux-kernel, Namhyung Kim, Andi Kleen, David Ahern, Jiri Olsa, Namhyung Kim, Paul Mackerras, Peter Zijlstra, Arnaldo Carvalho de Melo From: Namhyung Kim <namhyung@kernel.org> The __hpp__color_fmt used in the TUI code can be replace by the generic code with small change in print_fn callback. And it also needs to move callback function to the generic __hpp__fmt(). No functional changes intended. Signed-off-by: Namhyung Kim <namhyung@kernel.org> Cc: Andi Kleen <andi@firstfloor.org> Cc: David Ahern <dsahern@gmail.com> Cc: Ingo Molnar <mingo@kernel.org> Cc: Jiri Olsa <jolsa@redhat.com> Cc: Namhyung Kim <namhyung.kim@lge.com> Cc: Paul Mackerras <paulus@samba.org> Cc: Peter Zijlstra <a.p.zijlstra@chello.nl> Link: http://lkml.kernel.org/r/1393809254-4480-5-git-send-email-namhyung@kernel.org Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com> --- tools/perf/ui/browsers/hists.c | 122 ++++++++++++++--------------------------- tools/perf/ui/gtk/hists.c | 2 +- tools/perf/ui/hist.c | 26 ++++++--- tools/perf/util/hist.h | 4 +- 4 files changed, 63 insertions(+), 91 deletions(-) diff --git a/tools/perf/ui/browsers/hists.c b/tools/perf/ui/browsers/hists.c index b720b92eba6e..7ec871af3f6f 100644 --- a/tools/perf/ui/browsers/hists.c +++ b/tools/perf/ui/browsers/hists.c @@ -587,95 +587,52 @@ struct hpp_arg { bool current_entry; }; -static int __hpp__color_callchain(struct hpp_arg *arg) +static int __hpp__overhead_callback(struct perf_hpp *hpp, bool front) { - if (!symbol_conf.use_callchain) - return 0; - - slsmg_printf("%c ", arg->folded_sign); - return 2; -} - -static int __hpp__color_fmt(struct perf_hpp *hpp, struct hist_entry *he, - u64 (*get_field)(struct hist_entry *), - int (*callchain_cb)(struct hpp_arg *)) -{ - int ret = 0; - double percent = 0.0; - struct hists *hists = he->hists; struct hpp_arg *arg = hpp->ptr; - if (hists->stats.total_period) - percent = 100.0 * get_field(he) / hists->stats.total_period; - - ui_browser__set_percent_color(arg->b, percent, arg->current_entry); - - if (callchain_cb) - ret += callchain_cb(arg); - - ret += scnprintf(hpp->buf, hpp->size, "%6.2f%%", percent); - slsmg_printf("%s", hpp->buf); - - if (symbol_conf.event_group) { - int prev_idx, idx_delta; - struct perf_evsel *evsel = hists_to_evsel(hists); - struct hist_entry *pair; - int nr_members = evsel->nr_members; - - if (nr_members <= 1) - goto out; + if (arg->current_entry && arg->b->navkeypressed) + ui_browser__set_color(arg->b, HE_COLORSET_SELECTED); + else + ui_browser__set_color(arg->b, HE_COLORSET_NORMAL); - prev_idx = perf_evsel__group_idx(evsel); + if (front) { + if (!symbol_conf.use_callchain) + return 0; - list_for_each_entry(pair, &he->pairs.head, pairs.node) { - u64 period = get_field(pair); - u64 total = pair->hists->stats.total_period; + slsmg_printf("%c ", arg->folded_sign); + return 2; + } - if (!total) - continue; + return 0; +} - evsel = hists_to_evsel(pair->hists); - idx_delta = perf_evsel__group_idx(evsel) - prev_idx - 1; +static int __hpp__color_callback(struct perf_hpp *hpp, bool front __maybe_unused) +{ + struct hpp_arg *arg = hpp->ptr; - while (idx_delta--) { - /* - * zero-fill group members in the middle which - * have no sample - */ - ui_browser__set_percent_color(arg->b, 0.0, - arg->current_entry); - ret += scnprintf(hpp->buf, hpp->size, - " %6.2f%%", 0.0); - slsmg_printf("%s", hpp->buf); - } + if (!arg->current_entry || !arg->b->navkeypressed) + ui_browser__set_color(arg->b, HE_COLORSET_NORMAL); + return 0; +} - percent = 100.0 * period / total; - ui_browser__set_percent_color(arg->b, percent, - arg->current_entry); - ret += scnprintf(hpp->buf, hpp->size, - " %6.2f%%", percent); - slsmg_printf("%s", hpp->buf); +static int __hpp__slsmg_color_printf(struct perf_hpp *hpp, const char *fmt, ...) +{ + struct hpp_arg *arg = hpp->ptr; + int ret; + va_list args; + double percent; - prev_idx = perf_evsel__group_idx(evsel); - } + va_start(args, fmt); + percent = va_arg(args, double); + va_end(args); - idx_delta = nr_members - prev_idx - 1; + ui_browser__set_percent_color(arg->b, percent, arg->current_entry); - while (idx_delta--) { - /* - * zero-fill group members at last which have no sample - */ - ui_browser__set_percent_color(arg->b, 0.0, - arg->current_entry); - ret += scnprintf(hpp->buf, hpp->size, - " %6.2f%%", 0.0); - slsmg_printf("%s", hpp->buf); - } - } -out: - if (!arg->current_entry || !arg->b->navkeypressed) - ui_browser__set_color(arg->b, HE_COLORSET_NORMAL); + ret = scnprintf(hpp->buf, hpp->size, fmt, percent); + slsmg_printf("%s", hpp->buf); + advance_hpp(hpp, ret); return ret; } @@ -690,14 +647,15 @@ hist_browser__hpp_color_##_type(struct perf_hpp_fmt *fmt __maybe_unused,\ struct perf_hpp *hpp, \ struct hist_entry *he) \ { \ - return __hpp__color_fmt(hpp, he, __hpp_get_##_field, _cb); \ + return __hpp__fmt(hpp, he, __hpp_get_##_field, _cb, " %6.2f%%", \ + __hpp__slsmg_color_printf, true); \ } -__HPP_COLOR_PERCENT_FN(overhead, period, __hpp__color_callchain) -__HPP_COLOR_PERCENT_FN(overhead_sys, period_sys, NULL) -__HPP_COLOR_PERCENT_FN(overhead_us, period_us, NULL) -__HPP_COLOR_PERCENT_FN(overhead_guest_sys, period_guest_sys, NULL) -__HPP_COLOR_PERCENT_FN(overhead_guest_us, period_guest_us, NULL) +__HPP_COLOR_PERCENT_FN(overhead, period, __hpp__overhead_callback) +__HPP_COLOR_PERCENT_FN(overhead_sys, period_sys, __hpp__color_callback) +__HPP_COLOR_PERCENT_FN(overhead_us, period_us, __hpp__color_callback) +__HPP_COLOR_PERCENT_FN(overhead_guest_sys, period_guest_sys, __hpp__color_callback) +__HPP_COLOR_PERCENT_FN(overhead_guest_us, period_guest_us, __hpp__color_callback) #undef __HPP_COLOR_PERCENT_FN diff --git a/tools/perf/ui/gtk/hists.c b/tools/perf/ui/gtk/hists.c index 430fd555b161..7912dab6e5fb 100644 --- a/tools/perf/ui/gtk/hists.c +++ b/tools/perf/ui/gtk/hists.c @@ -43,7 +43,7 @@ static int perf_gtk__hpp_color_##_type(struct perf_hpp_fmt *fmt __maybe_unused, struct perf_hpp *hpp, \ struct hist_entry *he) \ { \ - return __hpp__fmt(hpp, he, he_get_##_field, " %6.2f%%", \ + return __hpp__fmt(hpp, he, he_get_##_field, NULL, " %6.2f%%", \ __percent_color_snprintf, true); \ } diff --git a/tools/perf/ui/hist.c b/tools/perf/ui/hist.c index 0c427e5d37c7..ac39313a3f0c 100644 --- a/tools/perf/ui/hist.c +++ b/tools/perf/ui/hist.c @@ -16,15 +16,20 @@ }) int __hpp__fmt(struct perf_hpp *hpp, struct hist_entry *he, - u64 (*get_field)(struct hist_entry *), + hpp_field_fn get_field, hpp_callback_fn callback, const char *fmt, hpp_snprint_fn print_fn, bool fmt_percent) { - int ret; + int ret = 0; struct hists *hists = he->hists; struct perf_evsel *evsel = hists_to_evsel(hists); char *buf = hpp->buf; size_t size = hpp->size; + if (callback) { + ret = callback(hpp, true); + advance_hpp(hpp, ret); + } + if (fmt_percent) { double percent = 0.0; @@ -32,9 +37,9 @@ int __hpp__fmt(struct perf_hpp *hpp, struct hist_entry *he, percent = 100.0 * get_field(he) / hists->stats.total_period; - ret = hpp__call_print_fn(hpp, print_fn, fmt, percent); + ret += hpp__call_print_fn(hpp, print_fn, fmt, percent); } else - ret = hpp__call_print_fn(hpp, print_fn, fmt, get_field(he)); + ret += hpp__call_print_fn(hpp, print_fn, fmt, get_field(he)); if (perf_evsel__is_group_event(evsel)) { int prev_idx, idx_delta; @@ -94,6 +99,13 @@ int __hpp__fmt(struct perf_hpp *hpp, struct hist_entry *he, } } + if (callback) { + int __ret = callback(hpp, false); + + advance_hpp(hpp, __ret); + ret += __ret; + } + /* * Restore original buf and size as it's where caller expects * the result will be saved. @@ -169,7 +181,7 @@ static u64 he_get_##_field(struct hist_entry *he) \ static int hpp__color_##_type(struct perf_hpp_fmt *fmt __maybe_unused, \ struct perf_hpp *hpp, struct hist_entry *he) \ { \ - return __hpp__fmt(hpp, he, he_get_##_field, " %6.2f%%", \ + return __hpp__fmt(hpp, he, he_get_##_field, NULL, " %6.2f%%", \ hpp_color_scnprintf, true); \ } @@ -178,7 +190,7 @@ static int hpp__entry_##_type(struct perf_hpp_fmt *_fmt __maybe_unused, \ struct perf_hpp *hpp, struct hist_entry *he) \ { \ const char *fmt = symbol_conf.field_sep ? " %.2f" : " %6.2f%%"; \ - return __hpp__fmt(hpp, he, he_get_##_field, fmt, \ + return __hpp__fmt(hpp, he, he_get_##_field, NULL, fmt, \ hpp_entry_scnprintf, true); \ } @@ -192,7 +204,7 @@ static int hpp__entry_##_type(struct perf_hpp_fmt *_fmt __maybe_unused, \ struct perf_hpp *hpp, struct hist_entry *he) \ { \ const char *fmt = symbol_conf.field_sep ? " %"PRIu64 : " %11"PRIu64; \ - return __hpp__fmt(hpp, he, he_get_raw_##_field, fmt, \ + return __hpp__fmt(hpp, he, he_get_raw_##_field, NULL, fmt, \ hpp_entry_scnprintf, false); \ } diff --git a/tools/perf/util/hist.h b/tools/perf/util/hist.h index d51ed9898229..9e1cada45bb8 100644 --- a/tools/perf/util/hist.h +++ b/tools/perf/util/hist.h @@ -166,10 +166,12 @@ void perf_hpp__init(void); void perf_hpp__column_register(struct perf_hpp_fmt *format); void perf_hpp__column_enable(unsigned col); +typedef u64 (*hpp_field_fn)(struct hist_entry *he); +typedef int (*hpp_callback_fn)(struct perf_hpp *hpp, bool front); typedef int (*hpp_snprint_fn)(struct perf_hpp *hpp, const char *fmt, ...); int __hpp__fmt(struct perf_hpp *hpp, struct hist_entry *he, - u64 (*get_field)(struct hist_entry *), + hpp_field_fn get_field, hpp_callback_fn callback, const char *fmt, hpp_snprint_fn print_fn, bool fmt_percent); static inline void advance_hpp(struct perf_hpp *hpp, int inc) -- 1.8.1.4 ^ permalink raw reply related [flat|nested] 56+ messages in thread
* [PATCH 11/16] perf session: Change header.misc dump from decimal to hex 2014-03-14 21:29 [GIT PULL 00/16] perf/core improvements and fixes Arnaldo Carvalho de Melo ` (9 preceding siblings ...) 2014-03-14 21:30 ` [PATCH 10/16] perf ui/tui: Reuse generic __hpp__fmt() code Arnaldo Carvalho de Melo @ 2014-03-14 21:30 ` Arnaldo Carvalho de Melo 2014-03-14 21:30 ` [PATCH 12/16] perf symbols: Introduce thread__find_cpumode_addr_location Arnaldo Carvalho de Melo ` (5 subsequent siblings) 16 siblings, 0 replies; 56+ messages in thread From: Arnaldo Carvalho de Melo @ 2014-03-14 21:30 UTC (permalink / raw) To: Ingo Molnar Cc: linux-kernel, Don Zickus, Jiri Olsa, Stephane Eranian, Arnaldo Carvalho de Melo From: Don Zickus <dzickus@redhat.com> When printing the raw dump of a data file, the header.misc is printed as a decimal. Unfortunately, that field is a bit mask, so it is hard to interpret as a decimal. Print in hex, so the user can easily see what bits are set and more importantly what type of info it is conveying. V2: add 0x in front per Jiri Olsa Signed-off-by: Don Zickus <dzickus@redhat.com> Cc: Jiri Olsa <jolsa@redhat.com> Cc: Stephane Eranian <eranian@google.com> Link: http://lkml.kernel.org/r/1393386227-149412-3-git-send-email-dzickus@redhat.com Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com> --- tools/perf/util/session.c | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/tools/perf/util/session.c b/tools/perf/util/session.c index 1d555d652f58..55960f22233c 100644 --- a/tools/perf/util/session.c +++ b/tools/perf/util/session.c @@ -794,7 +794,7 @@ static void dump_sample(struct perf_evsel *evsel, union perf_event *event, if (!dump_trace) return; - printf("(IP, %d): %d/%d: %#" PRIx64 " period: %" PRIu64 " addr: %#" PRIx64 "\n", + printf("(IP, 0x%x): %d/%d: %#" PRIx64 " period: %" PRIu64 " addr: %#" PRIx64 "\n", event->header.misc, sample->pid, sample->tid, sample->ip, sample->period, sample->addr); -- 1.8.1.4 ^ permalink raw reply related [flat|nested] 56+ messages in thread
* [PATCH 12/16] perf symbols: Introduce thread__find_cpumode_addr_location 2014-03-14 21:29 [GIT PULL 00/16] perf/core improvements and fixes Arnaldo Carvalho de Melo ` (10 preceding siblings ...) 2014-03-14 21:30 ` [PATCH 11/16] perf session: Change header.misc dump from decimal to hex Arnaldo Carvalho de Melo @ 2014-03-14 21:30 ` Arnaldo Carvalho de Melo 2014-03-14 21:30 ` [PATCH 13/16] perf ui hists: Pass evsel to hpp->header/width functions explicitly Arnaldo Carvalho de Melo ` (4 subsequent siblings) 16 siblings, 0 replies; 56+ messages in thread From: Arnaldo Carvalho de Melo @ 2014-03-14 21:30 UTC (permalink / raw) To: Ingo Molnar Cc: linux-kernel, Arnaldo Carvalho de Melo, Adrian Hunter, Andi Kleen, David Ahern, Frederic Weisbecker, Jiri Olsa, Mike Galbraith, Namhyung Kim, Paul Mackerras, Peter Zijlstra, Stephane Eranian From: Arnaldo Carvalho de Melo <acme@redhat.com> Its one level up thread__find_addr_location, where it will look in different domains for a sample: user, kernel, hypervisor, etc. Will soon be used by a patchkit by Andi Kleen. Cc: Adrian Hunter <adrian.hunter@intel.com> Cc: Andi Kleen <andi@firstfloor.org> Cc: David Ahern <dsahern@gmail.com> Cc: Frederic Weisbecker <fweisbec@gmail.com> Cc: Jiri Olsa <jolsa@redhat.com> Cc: Mike Galbraith <efault@gmx.de> Cc: Namhyung Kim <namhyung@kernel.org> Cc: Paul Mackerras <paulus@samba.org> Cc: Peter Zijlstra <peterz@infradead.org> Cc: Stephane Eranian <eranian@google.com> Link: http://lkml.kernel.org/n/tip-so6nxkh7xj48bc5kq4jpj991@git.kernel.org Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com> --- tools/perf/util/machine.c | 33 ++++++++------------------------- tools/perf/util/thread.c | 21 +++++++++++++++++++++ tools/perf/util/thread.h | 5 +++++ 3 files changed, 34 insertions(+), 25 deletions(-) diff --git a/tools/perf/util/machine.c b/tools/perf/util/machine.c index 813e94e7cf29..a6799538069c 100644 --- a/tools/perf/util/machine.c +++ b/tools/perf/util/machine.c @@ -1184,39 +1184,22 @@ static bool symbol__match_regex(struct symbol *sym, regex_t *regex) return 0; } -static const u8 cpumodes[] = { - PERF_RECORD_MISC_USER, - PERF_RECORD_MISC_KERNEL, - PERF_RECORD_MISC_GUEST_USER, - PERF_RECORD_MISC_GUEST_KERNEL -}; -#define NCPUMODES (sizeof(cpumodes)/sizeof(u8)) - static void ip__resolve_ams(struct machine *machine, struct thread *thread, struct addr_map_symbol *ams, u64 ip) { struct addr_location al; - size_t i; - u8 m; memset(&al, 0, sizeof(al)); + /* + * We cannot use the header.misc hint to determine whether a + * branch stack address is user, kernel, guest, hypervisor. + * Branches may straddle the kernel/user/hypervisor boundaries. + * Thus, we have to try consecutively until we find a match + * or else, the symbol is unknown + */ + thread__find_cpumode_addr_location(thread, machine, MAP__FUNCTION, ip, &al); - for (i = 0; i < NCPUMODES; i++) { - m = cpumodes[i]; - /* - * We cannot use the header.misc hint to determine whether a - * branch stack address is user, kernel, guest, hypervisor. - * Branches may straddle the kernel/user/hypervisor boundaries. - * Thus, we have to try consecutively until we find a match - * or else, the symbol is unknown - */ - thread__find_addr_location(thread, machine, m, MAP__FUNCTION, - ip, &al); - if (al.map) - goto found; - } -found: ams->addr = ip; ams->al_addr = al.addr; ams->sym = al.sym; diff --git a/tools/perf/util/thread.c b/tools/perf/util/thread.c index 0358882c8910..3ce0498bdae6 100644 --- a/tools/perf/util/thread.c +++ b/tools/perf/util/thread.c @@ -142,3 +142,24 @@ int thread__fork(struct thread *thread, struct thread *parent, u64 timestamp) return 0; } + +void thread__find_cpumode_addr_location(struct thread *thread, + struct machine *machine, + enum map_type type, u64 addr, + struct addr_location *al) +{ + size_t i; + const u8 const cpumodes[] = { + PERF_RECORD_MISC_USER, + PERF_RECORD_MISC_KERNEL, + PERF_RECORD_MISC_GUEST_USER, + PERF_RECORD_MISC_GUEST_KERNEL + }; + + for (i = 0; i < ARRAY_SIZE(cpumodes); i++) { + thread__find_addr_location(thread, machine, cpumodes[i], type, + addr, al); + if (al->map) + break; + } +} diff --git a/tools/perf/util/thread.h b/tools/perf/util/thread.h index 5b856bf942e1..9a070743270c 100644 --- a/tools/perf/util/thread.h +++ b/tools/perf/util/thread.h @@ -58,6 +58,11 @@ void thread__find_addr_location(struct thread *thread, struct machine *machine, u8 cpumode, enum map_type type, u64 addr, struct addr_location *al); +void thread__find_cpumode_addr_location(struct thread *thread, + struct machine *machine, + enum map_type type, u64 addr, + struct addr_location *al); + static inline void *thread__priv(struct thread *thread) { return thread->priv; -- 1.8.1.4 ^ permalink raw reply related [flat|nested] 56+ messages in thread
* [PATCH 13/16] perf ui hists: Pass evsel to hpp->header/width functions explicitly 2014-03-14 21:29 [GIT PULL 00/16] perf/core improvements and fixes Arnaldo Carvalho de Melo ` (11 preceding siblings ...) 2014-03-14 21:30 ` [PATCH 12/16] perf symbols: Introduce thread__find_cpumode_addr_location Arnaldo Carvalho de Melo @ 2014-03-14 21:30 ` Arnaldo Carvalho de Melo 2014-03-14 21:30 ` [PATCH 14/16] perf kvm: introduce --list-cmds for use by scripts Arnaldo Carvalho de Melo ` (3 subsequent siblings) 16 siblings, 0 replies; 56+ messages in thread From: Arnaldo Carvalho de Melo @ 2014-03-14 21:30 UTC (permalink / raw) To: Ingo Molnar Cc: linux-kernel, Namhyung Kim, Andi Kleen, David Ahern, Jiri Olsa, Namhyung Kim, Paul Mackerras, Peter Zijlstra, Arnaldo Carvalho de Melo From: Namhyung Kim <namhyung@kernel.org> Those functions need evsel to investigate event group and it's passed via hpp->ptr. However as it can be missed easily so it's better to pass it via an argument IMHO. Signed-off-by: Namhyung Kim <namhyung@kernel.org> Cc: Andi Kleen <andi@firstfloor.org> Cc: David Ahern <dsahern@gmail.com> Cc: Ingo Molnar <mingo@kernel.org> Cc: Jiri Olsa <jolsa@redhat.com> Cc: Namhyung Kim <namhyung.kim@lge.com> Cc: Paul Mackerras <paulus@samba.org> Cc: Peter Zijlstra <a.p.zijlstra@chello.nl> Link: http://lkml.kernel.org/r/1394437440-11609-2-git-send-email-namhyung@kernel.org Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com> --- tools/perf/builtin-diff.c | 7 ++++--- tools/perf/ui/gtk/hists.c | 3 +-- tools/perf/ui/hist.c | 24 ++++++++++-------------- tools/perf/ui/stdio/hist.c | 5 ++--- tools/perf/util/hist.h | 6 ++++-- 5 files changed, 21 insertions(+), 24 deletions(-) diff --git a/tools/perf/builtin-diff.c b/tools/perf/builtin-diff.c index a77e31246c00..204fffe22532 100644 --- a/tools/perf/builtin-diff.c +++ b/tools/perf/builtin-diff.c @@ -952,8 +952,8 @@ static int hpp__entry_global(struct perf_hpp_fmt *_fmt, struct perf_hpp *hpp, dfmt->header_width, buf); } -static int hpp__header(struct perf_hpp_fmt *fmt, - struct perf_hpp *hpp) +static int hpp__header(struct perf_hpp_fmt *fmt, struct perf_hpp *hpp, + struct perf_evsel *evsel __maybe_unused) { struct diff_hpp_fmt *dfmt = container_of(fmt, struct diff_hpp_fmt, fmt); @@ -963,7 +963,8 @@ static int hpp__header(struct perf_hpp_fmt *fmt, } static int hpp__width(struct perf_hpp_fmt *fmt, - struct perf_hpp *hpp __maybe_unused) + struct perf_hpp *hpp __maybe_unused, + struct perf_evsel *evsel __maybe_unused) { struct diff_hpp_fmt *dfmt = container_of(fmt, struct diff_hpp_fmt, fmt); diff --git a/tools/perf/ui/gtk/hists.c b/tools/perf/ui/gtk/hists.c index 7912dab6e5fb..e395ef9b0ae0 100644 --- a/tools/perf/ui/gtk/hists.c +++ b/tools/perf/ui/gtk/hists.c @@ -165,7 +165,6 @@ static void perf_gtk__show_hists(GtkWidget *window, struct hists *hists, struct perf_hpp hpp = { .buf = s, .size = sizeof(s), - .ptr = hists_to_evsel(hists), }; nr_cols = 0; @@ -192,7 +191,7 @@ static void perf_gtk__show_hists(GtkWidget *window, struct hists *hists, col_idx = 0; perf_hpp__for_each_format(fmt) { - fmt->header(fmt, &hpp); + fmt->header(fmt, &hpp, hists_to_evsel(hists)); gtk_tree_view_insert_column_with_attributes(GTK_TREE_VIEW(view), -1, ltrim(s), diff --git a/tools/perf/ui/hist.c b/tools/perf/ui/hist.c index ac39313a3f0c..0f403b83e9d1 100644 --- a/tools/perf/ui/hist.c +++ b/tools/perf/ui/hist.c @@ -118,29 +118,27 @@ int __hpp__fmt(struct perf_hpp *hpp, struct hist_entry *he, #define __HPP_HEADER_FN(_type, _str, _min_width, _unit_width) \ static int hpp__header_##_type(struct perf_hpp_fmt *fmt __maybe_unused, \ - struct perf_hpp *hpp) \ + struct perf_hpp *hpp, \ + struct perf_evsel *evsel) \ { \ int len = _min_width; \ \ - if (symbol_conf.event_group) { \ - struct perf_evsel *evsel = hpp->ptr; \ - \ + if (symbol_conf.event_group) \ len = max(len, evsel->nr_members * _unit_width); \ - } \ + \ return scnprintf(hpp->buf, hpp->size, "%*s", len, _str); \ } #define __HPP_WIDTH_FN(_type, _min_width, _unit_width) \ static int hpp__width_##_type(struct perf_hpp_fmt *fmt __maybe_unused, \ - struct perf_hpp *hpp __maybe_unused) \ + struct perf_hpp *hpp __maybe_unused, \ + struct perf_evsel *evsel) \ { \ int len = _min_width; \ \ - if (symbol_conf.event_group) { \ - struct perf_evsel *evsel = hpp->ptr; \ - \ + if (symbol_conf.event_group) \ len = max(len, evsel->nr_members * _unit_width); \ - } \ + \ return len; \ } @@ -329,15 +327,13 @@ unsigned int hists__sort_list_width(struct hists *hists) struct perf_hpp_fmt *fmt; struct sort_entry *se; int i = 0, ret = 0; - struct perf_hpp dummy_hpp = { - .ptr = hists_to_evsel(hists), - }; + struct perf_hpp dummy_hpp; perf_hpp__for_each_format(fmt) { if (i) ret += 2; - ret += fmt->width(fmt, &dummy_hpp); + ret += fmt->width(fmt, &dummy_hpp, hists_to_evsel(hists)); } list_for_each_entry(se, &hist_entry__sort_list, list) diff --git a/tools/perf/ui/stdio/hist.c b/tools/perf/ui/stdio/hist.c index 9bad89228472..d59893edf031 100644 --- a/tools/perf/ui/stdio/hist.c +++ b/tools/perf/ui/stdio/hist.c @@ -379,7 +379,6 @@ size_t hists__fprintf(struct hists *hists, bool show_header, int max_rows, struct perf_hpp dummy_hpp = { .buf = bf, .size = sizeof(bf), - .ptr = hists_to_evsel(hists), }; bool first = true; size_t linesz; @@ -398,7 +397,7 @@ size_t hists__fprintf(struct hists *hists, bool show_header, int max_rows, else first = false; - fmt->header(fmt, &dummy_hpp); + fmt->header(fmt, &dummy_hpp, hists_to_evsel(hists)); fprintf(fp, "%s", bf); } @@ -443,7 +442,7 @@ size_t hists__fprintf(struct hists *hists, bool show_header, int max_rows, else first = false; - width = fmt->width(fmt, &dummy_hpp); + width = fmt->width(fmt, &dummy_hpp, hists_to_evsel(hists)); for (i = 0; i < width; i++) fprintf(fp, "."); } diff --git a/tools/perf/util/hist.h b/tools/perf/util/hist.h index 9e1cada45bb8..0c76bf972736 100644 --- a/tools/perf/util/hist.h +++ b/tools/perf/util/hist.h @@ -132,8 +132,10 @@ struct perf_hpp { }; struct perf_hpp_fmt { - int (*header)(struct perf_hpp_fmt *fmt, struct perf_hpp *hpp); - int (*width)(struct perf_hpp_fmt *fmt, struct perf_hpp *hpp); + int (*header)(struct perf_hpp_fmt *fmt, struct perf_hpp *hpp, + struct perf_evsel *evsel); + int (*width)(struct perf_hpp_fmt *fmt, struct perf_hpp *hpp, + struct perf_evsel *evsel); int (*color)(struct perf_hpp_fmt *fmt, struct perf_hpp *hpp, struct hist_entry *he); int (*entry)(struct perf_hpp_fmt *fmt, struct perf_hpp *hpp, -- 1.8.1.4 ^ permalink raw reply related [flat|nested] 56+ messages in thread
* [PATCH 14/16] perf kvm: introduce --list-cmds for use by scripts 2014-03-14 21:29 [GIT PULL 00/16] perf/core improvements and fixes Arnaldo Carvalho de Melo ` (12 preceding siblings ...) 2014-03-14 21:30 ` [PATCH 13/16] perf ui hists: Pass evsel to hpp->header/width functions explicitly Arnaldo Carvalho de Melo @ 2014-03-14 21:30 ` Arnaldo Carvalho de Melo 2014-03-14 21:30 ` [PATCH 15/16] perf tools: Speed up thread map generation Arnaldo Carvalho de Melo ` (2 subsequent siblings) 16 siblings, 0 replies; 56+ messages in thread From: Arnaldo Carvalho de Melo @ 2014-03-14 21:30 UTC (permalink / raw) To: Ingo Molnar Cc: linux-kernel, Ramkumar Ramachandra, David Ahern, Arnaldo Carvalho de Melo From: Ramkumar Ramachandra <artagnon@gmail.com> Introduce $ perf kvm --list-cmds to dump a raw list of commands for use by the completion script. In order to do this, introduce parse_options_subcommand() for handling subcommands as a special case in the parse-options machinery. Signed-off-by: Ramkumar Ramachandra <artagnon@gmail.com> Acked-by: David Ahern <dsahern@gmail.com> Acked-by: Jiri Olsa <jolsa@redhat.com> Cc: David Ahern <dsahern@gmail.com> Link: http://lkml.kernel.org/r/1393896396-10427-1-git-send-email-artagnon@gmail.com Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com> --- tools/perf/builtin-kvm.c | 12 +++++------- tools/perf/perf-completion.sh | 2 +- tools/perf/util/parse-options.c | 37 +++++++++++++++++++++++++++++++++---- tools/perf/util/parse-options.h | 8 +++++++- 4 files changed, 46 insertions(+), 13 deletions(-) diff --git a/tools/perf/builtin-kvm.c b/tools/perf/builtin-kvm.c index a7350519c63f..21c164b8f9db 100644 --- a/tools/perf/builtin-kvm.c +++ b/tools/perf/builtin-kvm.c @@ -1691,17 +1691,15 @@ int cmd_kvm(int argc, const char **argv, const char *prefix __maybe_unused) OPT_END() }; - - const char * const kvm_usage[] = { - "perf kvm [<options>] {top|record|report|diff|buildid-list|stat}", - NULL - }; + const char *const kvm_subcommands[] = { "top", "record", "report", "diff", + "buildid-list", "stat", NULL }; + const char *kvm_usage[] = { NULL, NULL }; perf_host = 0; perf_guest = 1; - argc = parse_options(argc, argv, kvm_options, kvm_usage, - PARSE_OPT_STOP_AT_NON_OPTION); + argc = parse_options_subcommand(argc, argv, kvm_options, kvm_subcommands, kvm_usage, + PARSE_OPT_STOP_AT_NON_OPTION); if (!argc) usage_with_options(kvm_usage, kvm_options); diff --git a/tools/perf/perf-completion.sh b/tools/perf/perf-completion.sh index 496e2abb5482..ae3a57694b6b 100644 --- a/tools/perf/perf-completion.sh +++ b/tools/perf/perf-completion.sh @@ -123,7 +123,7 @@ __perf_main () __perfcomp_colon "$evts" "$cur" # List subcommands for 'perf kvm' elif [[ $prev == "kvm" ]]; then - subcmds="top record report diff buildid-list stat" + subcmds=$($cmd $prev --list-cmds) __perfcomp_colon "$subcmds" "$cur" # List long option names elif [[ $cur == --* ]]; then diff --git a/tools/perf/util/parse-options.c b/tools/perf/util/parse-options.c index d22e3f8017dc..bf48092983c6 100644 --- a/tools/perf/util/parse-options.c +++ b/tools/perf/util/parse-options.c @@ -407,7 +407,9 @@ int parse_options_step(struct parse_opt_ctx_t *ctx, if (internal_help && !strcmp(arg + 2, "help")) return usage_with_options_internal(usagestr, options, 0); if (!strcmp(arg + 2, "list-opts")) - return PARSE_OPT_LIST; + return PARSE_OPT_LIST_OPTS; + if (!strcmp(arg + 2, "list-cmds")) + return PARSE_OPT_LIST_SUBCMDS; switch (parse_long_opt(ctx, arg + 2, options)) { case -1: return parse_options_usage(usagestr, options, arg + 2, 0); @@ -433,25 +435,45 @@ int parse_options_end(struct parse_opt_ctx_t *ctx) return ctx->cpidx + ctx->argc; } -int parse_options(int argc, const char **argv, const struct option *options, - const char * const usagestr[], int flags) +int parse_options_subcommand(int argc, const char **argv, const struct option *options, + const char *const subcommands[], const char *usagestr[], int flags) { struct parse_opt_ctx_t ctx; perf_header__set_cmdline(argc, argv); + /* build usage string if it's not provided */ + if (subcommands && !usagestr[0]) { + struct strbuf buf = STRBUF_INIT; + + strbuf_addf(&buf, "perf %s [<options>] {", argv[0]); + for (int i = 0; subcommands[i]; i++) { + if (i) + strbuf_addstr(&buf, "|"); + strbuf_addstr(&buf, subcommands[i]); + } + strbuf_addstr(&buf, "}"); + + usagestr[0] = strdup(buf.buf); + strbuf_release(&buf); + } + parse_options_start(&ctx, argc, argv, flags); switch (parse_options_step(&ctx, options, usagestr)) { case PARSE_OPT_HELP: exit(129); case PARSE_OPT_DONE: break; - case PARSE_OPT_LIST: + case PARSE_OPT_LIST_OPTS: while (options->type != OPTION_END) { printf("--%s ", options->long_name); options++; } exit(130); + case PARSE_OPT_LIST_SUBCMDS: + for (int i = 0; subcommands[i]; i++) + printf("%s ", subcommands[i]); + exit(130); default: /* PARSE_OPT_UNKNOWN */ if (ctx.argv[0][1] == '-') { error("unknown option `%s'", ctx.argv[0] + 2); @@ -464,6 +486,13 @@ int parse_options(int argc, const char **argv, const struct option *options, return parse_options_end(&ctx); } +int parse_options(int argc, const char **argv, const struct option *options, + const char * const usagestr[], int flags) +{ + return parse_options_subcommand(argc, argv, options, NULL, + (const char **) usagestr, flags); +} + #define USAGE_OPTS_WIDTH 24 #define USAGE_GAP 2 diff --git a/tools/perf/util/parse-options.h b/tools/perf/util/parse-options.h index cbf0149cf221..d8dac8ac5f37 100644 --- a/tools/perf/util/parse-options.h +++ b/tools/perf/util/parse-options.h @@ -140,6 +140,11 @@ extern int parse_options(int argc, const char **argv, const struct option *options, const char * const usagestr[], int flags); +extern int parse_options_subcommand(int argc, const char **argv, + const struct option *options, + const char *const subcommands[], + const char *usagestr[], int flags); + extern NORETURN void usage_with_options(const char * const *usagestr, const struct option *options); @@ -148,7 +153,8 @@ extern NORETURN void usage_with_options(const char * const *usagestr, enum { PARSE_OPT_HELP = -1, PARSE_OPT_DONE, - PARSE_OPT_LIST, + PARSE_OPT_LIST_OPTS, + PARSE_OPT_LIST_SUBCMDS, PARSE_OPT_UNKNOWN, }; -- 1.8.1.4 ^ permalink raw reply related [flat|nested] 56+ messages in thread
* [PATCH 15/16] perf tools: Speed up thread map generation 2014-03-14 21:29 [GIT PULL 00/16] perf/core improvements and fixes Arnaldo Carvalho de Melo ` (13 preceding siblings ...) 2014-03-14 21:30 ` [PATCH 14/16] perf kvm: introduce --list-cmds for use by scripts Arnaldo Carvalho de Melo @ 2014-03-14 21:30 ` Arnaldo Carvalho de Melo 2014-03-14 21:30 ` [PATCH 16/16] perf machine: Factor machine__find_thread to take tid argument Arnaldo Carvalho de Melo 2014-03-18 8:24 ` [GIT PULL 00/16] perf/core improvements and fixes Ingo Molnar 16 siblings, 0 replies; 56+ messages in thread From: Arnaldo Carvalho de Melo @ 2014-03-14 21:30 UTC (permalink / raw) To: Ingo Molnar Cc: linux-kernel, Don Zickus, Jiri Olsa, Joe Mario, Arnaldo Carvalho de Melo From: Don Zickus <dzickus@redhat.com> When trying to capture perf data on a system running spejbb2013, perf hung for about 15 minutes. This is because it took that long to gather about 10,000 thread maps and process them. I don't think a user wants to wait that long. Instead, recognize that thread maps are roughly equivalent to pid maps and just quickly copy those instead. To do this, I synthesize 'fork' events, this eventually calls thread__fork() and copies the maps over. The overhead goes from 15 minutes down to about a few seconds. -- V2: based on Jiri's comments, moved malloc up a level and made sure the memory was freed Signed-off-by: Don Zickus <dzickus@redhat.com> Acked-by: Jiri Olsa <jolsa@redhat.com> Cc: Jiri Olsa <jolsa@redhat.com> Cc: Joe Mario <jmario@redhat.com> Link: http://lkml.kernel.org/r/1394808224-113774-1-git-send-email-dzickus@redhat.com Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com> --- tools/perf/util/event.c | 59 ++++++++++++++++++++++++++++++++++++++++++------- 1 file changed, 51 insertions(+), 8 deletions(-) diff --git a/tools/perf/util/event.c b/tools/perf/util/event.c index 55eebe936513..3e580be0f6fb 100644 --- a/tools/perf/util/event.c +++ b/tools/perf/util/event.c @@ -129,6 +129,28 @@ out: return tgid; } +static int perf_event__synthesize_fork(struct perf_tool *tool, + union perf_event *event, pid_t pid, + pid_t tgid, perf_event__handler_t process, + struct machine *machine) +{ + memset(&event->fork, 0, sizeof(event->fork) + machine->id_hdr_size); + + /* this is really a clone event but we use fork to synthesize it */ + event->fork.ppid = tgid; + event->fork.ptid = tgid; + event->fork.pid = tgid; + event->fork.tid = pid; + event->fork.header.type = PERF_RECORD_FORK; + + event->fork.header.size = (sizeof(event->fork) + machine->id_hdr_size); + + if (process(tool, event, &synth_sample, machine) != 0) + return -1; + + return 0; +} + int perf_event__synthesize_mmap_events(struct perf_tool *tool, union perf_event *event, pid_t pid, pid_t tgid, @@ -278,6 +300,7 @@ int perf_event__synthesize_modules(struct perf_tool *tool, static int __event__synthesize_thread(union perf_event *comm_event, union perf_event *mmap_event, + union perf_event *fork_event, pid_t pid, int full, perf_event__handler_t process, struct perf_tool *tool, @@ -326,9 +349,15 @@ static int __event__synthesize_thread(union perf_event *comm_event, if (tgid == -1) return -1; - /* process the thread's maps too */ - rc = perf_event__synthesize_mmap_events(tool, mmap_event, _pid, tgid, - process, machine, mmap_data); + if (_pid == pid) { + /* process the parent's maps too */ + rc = perf_event__synthesize_mmap_events(tool, mmap_event, pid, tgid, + process, machine, mmap_data); + } else { + /* only fork the tid's map, to save time */ + rc = perf_event__synthesize_fork(tool, fork_event, _pid, tgid, + process, machine); + } if (rc) return rc; @@ -344,7 +373,7 @@ int perf_event__synthesize_thread_map(struct perf_tool *tool, struct machine *machine, bool mmap_data) { - union perf_event *comm_event, *mmap_event; + union perf_event *comm_event, *mmap_event, *fork_event; int err = -1, thread, j; comm_event = malloc(sizeof(comm_event->comm) + machine->id_hdr_size); @@ -355,9 +384,14 @@ int perf_event__synthesize_thread_map(struct perf_tool *tool, if (mmap_event == NULL) goto out_free_comm; + fork_event = malloc(sizeof(fork_event->fork) + machine->id_hdr_size); + if (fork_event == NULL) + goto out_free_mmap; + err = 0; for (thread = 0; thread < threads->nr; ++thread) { if (__event__synthesize_thread(comm_event, mmap_event, + fork_event, threads->map[thread], 0, process, tool, machine, mmap_data)) { @@ -383,6 +417,7 @@ int perf_event__synthesize_thread_map(struct perf_tool *tool, /* if not, generate events for it */ if (need_leader && __event__synthesize_thread(comm_event, mmap_event, + fork_event, comm_event->comm.pid, 0, process, tool, machine, mmap_data)) { @@ -391,6 +426,8 @@ int perf_event__synthesize_thread_map(struct perf_tool *tool, } } } + free(fork_event); +out_free_mmap: free(mmap_event); out_free_comm: free(comm_event); @@ -405,7 +442,7 @@ int perf_event__synthesize_threads(struct perf_tool *tool, DIR *proc; char proc_path[PATH_MAX]; struct dirent dirent, *next; - union perf_event *comm_event, *mmap_event; + union perf_event *comm_event, *mmap_event, *fork_event; int err = -1; comm_event = malloc(sizeof(comm_event->comm) + machine->id_hdr_size); @@ -416,6 +453,10 @@ int perf_event__synthesize_threads(struct perf_tool *tool, if (mmap_event == NULL) goto out_free_comm; + fork_event = malloc(sizeof(fork_event->fork) + machine->id_hdr_size); + if (fork_event == NULL) + goto out_free_mmap; + if (machine__is_default_guest(machine)) return 0; @@ -423,7 +464,7 @@ int perf_event__synthesize_threads(struct perf_tool *tool, proc = opendir(proc_path); if (proc == NULL) - goto out_free_mmap; + goto out_free_fork; while (!readdir_r(proc, &dirent, &next) && next) { char *end; @@ -435,12 +476,14 @@ int perf_event__synthesize_threads(struct perf_tool *tool, * We may race with exiting thread, so don't stop just because * one thread couldn't be synthesized. */ - __event__synthesize_thread(comm_event, mmap_event, pid, 1, - process, tool, machine, mmap_data); + __event__synthesize_thread(comm_event, mmap_event, fork_event, pid, + 1, process, tool, machine, mmap_data); } err = 0; closedir(proc); +out_free_fork: + free(fork_event); out_free_mmap: free(mmap_event); out_free_comm: -- 1.8.1.4 ^ permalink raw reply related [flat|nested] 56+ messages in thread
* [PATCH 16/16] perf machine: Factor machine__find_thread to take tid argument 2014-03-14 21:29 [GIT PULL 00/16] perf/core improvements and fixes Arnaldo Carvalho de Melo ` (14 preceding siblings ...) 2014-03-14 21:30 ` [PATCH 15/16] perf tools: Speed up thread map generation Arnaldo Carvalho de Melo @ 2014-03-14 21:30 ` Arnaldo Carvalho de Melo 2014-03-18 8:24 ` [GIT PULL 00/16] perf/core improvements and fixes Ingo Molnar 16 siblings, 0 replies; 56+ messages in thread From: Arnaldo Carvalho de Melo @ 2014-03-14 21:30 UTC (permalink / raw) To: Ingo Molnar Cc: linux-kernel, Jiri Olsa, Corey Ashford, David Ahern, Don Zickus, Frederic Weisbecker, Namhyung Kim, Paul Mackerras, Peter Zijlstra, Arnaldo Carvalho de Melo From: Jiri Olsa <jolsa@redhat.com> Forcing the code to always search thread by pid/tid pair. The PID value will be needed in future to determine the process thread leader for map groups sharing. Signed-off-by: Jiri Olsa <jolsa@redhat.com> Acked-by: Adrian Hunter <adrian.hunter@intel.com> Cc: Corey Ashford <cjashfor@linux.vnet.ibm.com> Cc: David Ahern <dsahern@gmail.com> Cc: Don Zickus <dzickus@redhat.com> Cc: Frederic Weisbecker <fweisbec@gmail.com> Cc: Ingo Molnar <mingo@kernel.org> Cc: Namhyung Kim <namhyung@kernel.org> Cc: Paul Mackerras <paulus@samba.org> Cc: Peter Zijlstra <a.p.zijlstra@chello.nl> Link: http://lkml.kernel.org/r/1394805606-25883-3-git-send-email-jolsa@redhat.com Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com> --- tools/perf/tests/dwarf-unwind.c | 2 +- tools/perf/util/machine.c | 13 +++++++++---- tools/perf/util/machine.h | 3 ++- 3 files changed, 12 insertions(+), 6 deletions(-) diff --git a/tools/perf/tests/dwarf-unwind.c b/tools/perf/tests/dwarf-unwind.c index f16ea2808a75..c059ee81c038 100644 --- a/tools/perf/tests/dwarf-unwind.c +++ b/tools/perf/tests/dwarf-unwind.c @@ -128,7 +128,7 @@ int test__dwarf_unwind(void) if (verbose > 1) machine__fprintf(machine, stderr); - thread = machine__find_thread(machine, getpid()); + thread = machine__find_thread(machine, getpid(), getpid()); if (!thread) { pr_err("Could not get thread\n"); goto out; diff --git a/tools/perf/util/machine.c b/tools/perf/util/machine.c index a6799538069c..5cecd98c1bc0 100644 --- a/tools/perf/util/machine.c +++ b/tools/perf/util/machine.c @@ -327,9 +327,10 @@ struct thread *machine__findnew_thread(struct machine *machine, pid_t pid, return __machine__findnew_thread(machine, pid, tid, true); } -struct thread *machine__find_thread(struct machine *machine, pid_t tid) +struct thread *machine__find_thread(struct machine *machine, pid_t pid, + pid_t tid) { - return __machine__findnew_thread(machine, 0, tid, false); + return __machine__findnew_thread(machine, pid, tid, false); } int machine__process_comm_event(struct machine *machine, union perf_event *event, @@ -1114,7 +1115,9 @@ static void machine__remove_thread(struct machine *machine, struct thread *th) int machine__process_fork_event(struct machine *machine, union perf_event *event, struct perf_sample *sample) { - struct thread *thread = machine__find_thread(machine, event->fork.tid); + struct thread *thread = machine__find_thread(machine, + event->fork.pid, + event->fork.tid); struct thread *parent = machine__findnew_thread(machine, event->fork.ppid, event->fork.ptid); @@ -1140,7 +1143,9 @@ int machine__process_fork_event(struct machine *machine, union perf_event *event int machine__process_exit_event(struct machine *machine, union perf_event *event, struct perf_sample *sample __maybe_unused) { - struct thread *thread = machine__find_thread(machine, event->fork.tid); + struct thread *thread = machine__find_thread(machine, + event->fork.pid, + event->fork.tid); if (dump_trace) perf_event__fprintf_task(event, stdout); diff --git a/tools/perf/util/machine.h b/tools/perf/util/machine.h index 2e6c248c870f..c8c74a119398 100644 --- a/tools/perf/util/machine.h +++ b/tools/perf/util/machine.h @@ -41,7 +41,8 @@ struct map *machine__kernel_map(struct machine *machine, enum map_type type) return machine->vmlinux_maps[type]; } -struct thread *machine__find_thread(struct machine *machine, pid_t tid); +struct thread *machine__find_thread(struct machine *machine, pid_t pid, + pid_t tid); int machine__process_comm_event(struct machine *machine, union perf_event *event, struct perf_sample *sample); -- 1.8.1.4 ^ permalink raw reply related [flat|nested] 56+ messages in thread
* Re: [GIT PULL 00/16] perf/core improvements and fixes 2014-03-14 21:29 [GIT PULL 00/16] perf/core improvements and fixes Arnaldo Carvalho de Melo ` (15 preceding siblings ...) 2014-03-14 21:30 ` [PATCH 16/16] perf machine: Factor machine__find_thread to take tid argument Arnaldo Carvalho de Melo @ 2014-03-18 8:24 ` Ingo Molnar 16 siblings, 0 replies; 56+ messages in thread From: Ingo Molnar @ 2014-03-18 8:24 UTC (permalink / raw) To: Arnaldo Carvalho de Melo Cc: linux-kernel, Arnaldo Carvalho de Melo, Adrian Hunter, Andi Kleen, Aswin Chandramouleeswaran, Corey Ashford, Darren Hart, David Ahern, Davidlohr Bueso, Don Zickus, Frederic Weisbecker, Jason Low, Jiri Olsa, Joe Mario, Masami Hiramatsu, Mike Galbraith, Namhyung Kim, Paul Mackerras, Pekka Enberg, Peter Zijlstra, Ramkumar Ramachandra, Scott J Norton, Stephane Eranian, Thomas Gleixner, Waiman Long, Arnaldo Carvalho de Melo * Arnaldo Carvalho de Melo <acme@infradead.org> wrote: > From: Arnaldo Carvalho de Melo <acme@ghostprotocols.net> > > Hi Ingo, > > Please consider pulling, > > - Arnaldo > > The following changes since commit 81827ed8d85e892311965dc9ec4120b2b2e745bd: > > perf/x86/uncore: Fix missing end markers for SNB/IVB/HSW IMC PMU (2014-03-14 09:25:25 +0100) > > are available in the git repository at: > > git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux tags/perf-core-for-mingo > > for you to fetch changes up to d75e6097ef1f7669deb500fbbdf53cfe524f1b53: > > perf machine: Factor machine__find_thread to take tid argument (2014-03-14 18:08:42 -0300) > > ---------------------------------------------------------------- > perf/core improvements and fixes: > > User visible: > > . Add several futex 'perf bench' microbenchmarks (Davidlohr Bueso) > > . Speed up thread map generation (Don Zickus) > > . Fix synthesizing mmaps for threads (Don Zickus) > > . Fix invalid output on event group stdio report (Namhyung Kim) > > . Introduce 'perf kvm --list-cmds' command line option for use by > scripts (Ramkumar Ramachandra) > > Documentation: > > . Clarify load-latency information in the 'perf mem' docs (Andi Kleen) > > . Clarify x86 register naming in 'perf probe' docs (Andi Kleen) > > Refactorigns: > > . hists browser refactorings to reuse code accross UIs (Namhyung Kim) > > Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com> > > ---------------------------------------------------------------- > Andi Kleen (2): > perf mem: Clarify load-latency in documentation > perf probe: Clarify x86 register naming for perf probe > > Arnaldo Carvalho de Melo (1): > perf symbols: Introduce thread__find_cpumode_addr_location > > Davidlohr Bueso (3): > perf bench: Add futex-hash microbenchmark > perf bench: Add futex-wake microbenchmark > perf bench: Add futex-requeue microbenchmark > > Don Zickus (3): > perf tools: Fix synthesizing mmaps for threads > perf session: Change header.misc dump from decimal to hex > perf tools: Speed up thread map generation > > Jiri Olsa (1): > perf machine: Factor machine__find_thread to take tid argument > > Namhyung Kim (5): > perf ui/stdio: Fix invalid output on event group report > perf ui/gtk: Reuse generic __hpp__fmt() code > perf ui/hists: Pass struct hpp to print functions > perf ui/tui: Reuse generic __hpp__fmt() code > perf ui hists: Pass evsel to hpp->header/width functions explicitly > > Ramkumar Ramachandra (1): > perf kvm: introduce --list-cmds for use by scripts > > tools/perf/Documentation/perf-mem.txt | 4 + > tools/perf/Documentation/perf-probe.txt | 2 + > tools/perf/Makefile.perf | 3 + > tools/perf/bench/bench.h | 3 + > tools/perf/bench/futex-hash.c | 212 ++++++++++++++++++++++++++++++++ > tools/perf/bench/futex-requeue.c | 211 +++++++++++++++++++++++++++++++ > tools/perf/bench/futex-wake.c | 201 ++++++++++++++++++++++++++++++ > tools/perf/bench/futex.h | 71 +++++++++++ > tools/perf/builtin-bench.c | 12 +- > tools/perf/builtin-diff.c | 7 +- > tools/perf/builtin-kvm.c | 12 +- > tools/perf/perf-completion.sh | 2 +- > tools/perf/perf.h | 6 + > tools/perf/tests/dwarf-unwind.c | 2 +- > tools/perf/ui/browsers/hists.c | 122 ++++++------------ > tools/perf/ui/gtk/hists.c | 78 ++---------- > tools/perf/ui/hist.c | 138 +++++++++++++++------ > tools/perf/ui/stdio/hist.c | 11 +- > tools/perf/util/event.c | 156 ++++++++++++++--------- > tools/perf/util/hist.h | 20 ++- > tools/perf/util/machine.c | 46 +++---- > tools/perf/util/machine.h | 3 +- > tools/perf/util/parse-options.c | 37 +++++- > tools/perf/util/parse-options.h | 8 +- > tools/perf/util/session.c | 2 +- > tools/perf/util/thread.c | 21 ++++ > tools/perf/util/thread.h | 5 + > 27 files changed, 1094 insertions(+), 301 deletions(-) > create mode 100644 tools/perf/bench/futex-hash.c > create mode 100644 tools/perf/bench/futex-requeue.c > create mode 100644 tools/perf/bench/futex-wake.c > create mode 100644 tools/perf/bench/futex.h Pulled, thanks a lot Arnaldo! Ingo ^ permalink raw reply [flat|nested] 56+ messages in thread
* [GIT PULL 00/16] perf/core improvements and fixes @ 2019-01-04 18:33 Arnaldo Carvalho de Melo 2019-01-08 15:32 ` Ingo Molnar 0 siblings, 1 reply; 56+ messages in thread From: Arnaldo Carvalho de Melo @ 2019-01-04 18:33 UTC (permalink / raw) To: Ingo Molnar Cc: Clark Williams, linux-kernel, linux-perf-users, Arnaldo Carvalho de Melo, Arnaldo Carvalho de Melo, Ivan Krylov, Jin Yao, Mattias Jacobsson Hi Ingo, Please consider pulling, Best regards, - Arnaldo Test results at the end of this message, as usual. The following changes since commit 2573be22e5b6f24a0cabc97715c808c47e29eaaf: Merge tag 'perf-core-for-mingo-4.21-20190103' of git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux into perf/urgent (2019-01-03 14:05:16 +0100) are available in the Git repository at: git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux.git tags/perf-core-for-mingo-4.21-20190104 for you to fetch changes up to 03fa483821c0b4db7c2b1453d3332f397d82313f: perf test shell: Use a fallback to get the pathname in vfs_getname (2019-01-04 15:12:29 -0300) ---------------------------------------------------------------- perf/core improvements and fixes: perf annotate: Ivan Krylov: - Pass filename to objdump via execl, fixing usage with filenames with special characters. perf report: Jin Yao: Fix wrong iteration count in --branch-history perf stat: Jin Yao: - Fix endless wait for child process perf test: Arnaldo Carvalho de Melo: - Use a fallback to get the pathname in vfs_getname in tools build: Jiri Olsa: - Allow overriding CFLAGS assignments. Misc: Arnaldo Carvalho de Melo: - Syncronize UAPI headers Mattias Jacobsson: - Remove redundant va_end() in strbuf_addv() Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com> ---------------------------------------------------------------- Arnaldo Carvalho de Melo (10): tools headers x86: Sync disabled-features.h tools headers uapi: Sync prctl.h with the kernel sources tools beauty: Make the prctl option table generator catch all PR_ options tools headers uapi: Update i915_drm.h tools headers uapi: Sync linux/in.h copy from the kernel sources tools headers uapi: Sync linux/kvm.h with the kernel sources tools headers uapi: Sync copy of asm-generic/unistd.h with the kernel sources tools headers x86: Sync asm/cpufeatures.h copy with the kernel sources perf python: Make sure the python binding output directory is in place perf test shell: Use a fallback to get the pathname in vfs_getname Ivan Krylov (1): perf annotate: Pass filename to objdump via execl Jin Yao (2): perf stat: Fix endless wait for child process perf report: Fix wrong iteration count in --branch-history Jiri Olsa (2): tools thermal tmon: Use -O3 instead of -O1 if available tools iio: Override CFLAGS assignments Mattias Jacobsson (1): perf strbuf: Remove redundant va_end() in strbuf_addv() tools/arch/x86/include/asm/cpufeatures.h | 2 ++ tools/arch/x86/include/asm/disabled-features.h | 8 ++++++- tools/iio/Makefile | 2 +- tools/include/uapi/asm-generic/unistd.h | 4 +++- tools/include/uapi/drm/i915_drm.h | 8 +++++++ tools/include/uapi/linux/in.h | 10 +++++--- tools/include/uapi/linux/kvm.h | 19 +++++++++++++++ tools/include/uapi/linux/prctl.h | 8 +++++++ tools/perf/Makefile.perf | 4 +++- tools/perf/builtin-stat.c | 3 ++- tools/perf/tests/shell/lib/probe_vfs_getname.sh | 3 ++- tools/perf/trace/beauty/prctl_option.sh | 2 +- tools/perf/util/annotate.c | 8 +++---- tools/perf/util/callchain.c | 32 +++++++++++++++---------- tools/perf/util/callchain.h | 1 + tools/perf/util/machine.c | 2 +- tools/perf/util/strbuf.c | 1 - tools/thermal/tmon/Makefile | 2 +- 18 files changed, 90 insertions(+), 29 deletions(-) Test results: The first ones are container based builds of tools/perf with and without libelf support. Where clang is available, it is also used to build perf with/without libelf, and building with LIBCLANGLLVM=1 (built-in clang) with gcc and clang when clang and its devel libraries are installed. The objtool and samples/bpf/ builds are disabled now that I'm switching from using the sources in a local volume to fetching them from a http server to build it inside the container, to make it easier to build in a container cluster. Those will come back later. Several are cross builds, the ones with -x-ARCH and the android one, and those may not have all the features built, due to lack of multi-arch devel packages, available and being used so far on just a few, like debian:experimental-x-{arm64,mipsel}. The 'perf test' one will perform a variety of tests exercising tools/perf/util/, tools/lib/{bpf,traceevent,etc}, as well as run perf commands with a variety of command line event specifications to then intercept the sys_perf_event syscall to check that the perf_event_attr fields are set up as expected, among a variety of other unit tests. Then there is the 'make -C tools/perf build-test' ones, that build tools/perf/ with a variety of feature sets, exercising the build with an incomplete set of features as well as with a complete one. It is planned to have it run on each of the containers mentioned above, using some container orchestration infrastructure. Get in contact if interested in helping having this in place. $ dm 1 alpine:3.4 : Ok gcc (Alpine 5.3.0) 5.3.0 2 alpine:3.5 : Ok gcc (Alpine 6.2.1) 6.2.1 20160822 3 alpine:3.6 : Ok gcc (Alpine 6.3.0) 6.3.0 4 alpine:3.7 : Ok gcc (Alpine 6.4.0) 6.4.0 5 alpine:3.8 : Ok gcc (Alpine 6.4.0) 6.4.0 6 alpine:edge : Ok gcc (Alpine 8.2.0) 8.2.0 7 amazonlinux:1 : Ok gcc (GCC) 4.8.5 20150623 (Red Hat 4.8.5-28) 8 amazonlinux:2 : Ok gcc (GCC) 7.3.1 20180303 (Red Hat 7.3.1-5) 9 android-ndk:r12b-arm : Ok arm-linux-androideabi-gcc (GCC) 4.9.x 20150123 (prerelease) 10 android-ndk:r15c-arm : Ok arm-linux-androideabi-gcc (GCC) 4.9.x 20150123 (prerelease) 11 centos:5 : Ok gcc (GCC) 4.1.2 20080704 (Red Hat 4.1.2-55) 12 centos:6 : Ok gcc (GCC) 4.4.7 20120313 (Red Hat 4.4.7-23) 13 centos:7 : Ok gcc (GCC) 4.8.5 20150623 (Red Hat 4.8.5-36) 14 clearlinux:latest : Ok gcc (Clear Linux OS for Intel Architecture) 8.2.1 20180502 15 debian:7 : Ok gcc (Debian 4.7.2-5) 4.7.2 16 debian:8 : Ok gcc (Debian 4.9.2-10+deb8u2) 4.9.2 17 debian:9 : Ok gcc (Debian 6.3.0-18+deb9u1) 6.3.0 20170516 18 debian:experimental : Ok gcc (Debian 8.2.0-13) 8.2.0 19 debian:experimental-x-arm64 : Ok aarch64-linux-gnu-gcc (Debian 8.2.0-11) 8.2.0 20 debian:experimental-x-mips : Ok mips-linux-gnu-gcc (Debian 8.2.0-11) 8.2.0 21 debian:experimental-x-mips64 : Ok mips64-linux-gnuabi64-gcc (Debian 8.2.0-11) 8.2.0 22 debian:experimental-x-mipsel : Ok mipsel-linux-gnu-gcc (Debian 8.2.0-11) 8.2.0 23 fedora:20 : Ok gcc (GCC) 4.8.3 20140911 (Red Hat 4.8.3-7) 24 fedora:22 : Ok gcc (GCC) 5.3.1 20160406 (Red Hat 5.3.1-6) 25 fedora:23 : Ok gcc (GCC) 5.3.1 20160406 (Red Hat 5.3.1-6) 26 fedora:24 : Ok gcc (GCC) 6.3.1 20161221 (Red Hat 6.3.1-1) 27 fedora:24-x-ARC-uClibc : Ok arc-linux-gcc (ARCompact ISA Linux uClibc toolchain 2017.09-rc2) 7.1.1 20170710 28 fedora:25 : Ok gcc (GCC) 6.4.1 20170727 (Red Hat 6.4.1-1) 29 fedora:26 : Ok gcc (GCC) 7.3.1 20180130 (Red Hat 7.3.1-2) 30 fedora:27 : Ok gcc (GCC) 7.3.1 20180712 (Red Hat 7.3.1-6) 31 fedora:28 : Ok gcc (GCC) 8.2.1 20181105 (Red Hat 8.2.1-5) 32 fedora:29 : Ok gcc (GCC) 8.2.1 20181105 (Red Hat 8.2.1-5) 33 fedora:rawhide : Ok gcc (GCC) 8.2.1 20181105 (Red Hat 8.2.1-5) 34 gentoo-stage3-amd64:latest : Ok gcc (Gentoo 7.3.0-r3 p1.4) 7.3.0 35 mageia:5 : Ok gcc (GCC) 4.9.2 36 mageia:6 : Ok gcc (Mageia 5.5.0-1.mga6) 5.5.0 37 opensuse:13.2 : Ok gcc (SUSE Linux) 4.8.3 20140627 [gcc-4_8-branch revision 212064] 38 opensuse:15.0 : Ok gcc (SUSE Linux) 7.3.1 20180323 [gcc-7-branch revision 258812] 39 opensuse:42.1 : Ok gcc (SUSE Linux) 4.8.5 40 opensuse:42.2 : Ok gcc (SUSE Linux) 4.8.5 41 opensuse:42.3 : Ok gcc (SUSE Linux) 4.8.5 42 opensuse:tumbleweed : Ok gcc (SUSE Linux) 8.2.1 20181108 [gcc-8-branch revision 265914] 43 oraclelinux:6 : Ok gcc (GCC) 4.4.7 20120313 (Red Hat 4.4.7-23.0.1) 44 oraclelinux:7 : Ok gcc (GCC) 4.8.5 20150623 (Red Hat 4.8.5-36.0.1) 45 ubuntu:12.04.5 : Ok gcc (Ubuntu/Linaro 4.6.3-1ubuntu5) 4.6.3 46 ubuntu:14.04.4 : Ok gcc (Ubuntu 4.8.4-2ubuntu1~14.04.4) 4.8.4 47 ubuntu:14.04.4-x-linaro-arm64 : Ok aarch64-linux-gnu-gcc (Linaro GCC 5.5-2017.10) 5.5.0 48 ubuntu:16.04 : Ok gcc (Ubuntu 5.4.0-6ubuntu1~16.04.10) 5.4.0 20160609 49 ubuntu:16.04-x-arm : Ok arm-linux-gnueabihf-gcc (Ubuntu/Linaro 5.4.0-6ubuntu1~16.04.9) 5.4.0 20160609 50 ubuntu:16.04-x-arm64 : Ok aarch64-linux-gnu-gcc (Ubuntu/Linaro 5.4.0-6ubuntu1~16.04.9) 5.4.0 20160609 51 ubuntu:16.04-x-powerpc : Ok powerpc-linux-gnu-gcc (Ubuntu 5.4.0-6ubuntu1~16.04.9) 5.4.0 20160609 52 ubuntu:16.04-x-powerpc64 : Ok powerpc64-linux-gnu-gcc (Ubuntu/IBM 5.4.0-6ubuntu1~16.04.9) 5.4.0 20160609 53 ubuntu:16.04-x-powerpc64el : Ok powerpc64le-linux-gnu-gcc (Ubuntu/IBM 5.4.0-6ubuntu1~16.04.9) 5.4.0 20160609 54 ubuntu:16.04-x-s390 : Ok s390x-linux-gnu-gcc (Ubuntu 5.4.0-6ubuntu1~16.04.9) 5.4.0 20160609 55 ubuntu:17.10 : Ok gcc (Ubuntu 7.2.0-8ubuntu3.2) 7.2.0 56 ubuntu:18.04 : Ok gcc (Ubuntu 7.3.0-27ubuntu1~18.04) 7.3.0 57 ubuntu:18.04-x-arm : Ok arm-linux-gnueabihf-gcc (Ubuntu/Linaro 7.3.0-27ubuntu1~18.04) 7.3.0 58 ubuntu:18.04-x-arm64 : Ok aarch64-linux-gnu-gcc (Ubuntu/Linaro 7.3.0-27ubuntu1~18.04) 7.3.0 59 ubuntu:18.04-x-m68k : Ok m68k-linux-gnu-gcc (Ubuntu 7.3.0-27ubuntu1~18.04) 7.3.0 60 ubuntu:18.04-x-powerpc : Ok powerpc-linux-gnu-gcc (Ubuntu 7.3.0-27ubuntu1~18.04) 7.3.0 61 ubuntu:18.04-x-powerpc64 : Ok powerpc64-linux-gnu-gcc (Ubuntu 7.3.0-27ubuntu1~18.04) 7.3.0 62 ubuntu:18.04-x-powerpc64el : Ok powerpc64le-linux-gnu-gcc (Ubuntu 7.3.0-27ubuntu1~18.04) 7.3.0 63 ubuntu:18.04-x-riscv64 : Ok riscv64-linux-gnu-gcc (Ubuntu 7.3.0-27ubuntu1~18.04) 7.3.0 64 ubuntu:18.04-x-s390 : Ok s390x-linux-gnu-gcc (Ubuntu 7.3.0-27ubuntu1~18.04) 7.3.0 65 ubuntu:18.04-x-sh4 : Ok sh4-linux-gnu-gcc (Ubuntu 7.3.0-27ubuntu1~18.04) 7.3.0 66 ubuntu:18.04-x-sparc64 : Ok sparc64-linux-gnu-gcc (Ubuntu 7.3.0-27ubuntu1~18.04) 7.3.0 67 ubuntu:18.10 : Ok gcc (Ubuntu 8.2.0-7ubuntu1) 8.2.0 68 ubuntu:19.04 : Ok gcc (Ubuntu 8.2.0-12ubuntu1) 8.2.0 69 ubuntu:19.04-x-alpha : Ok alpha-linux-gnu-gcc (Ubuntu 8.2.0-11ubuntu1) 8.2.0 70 ubuntu:19.04-x-hppa : Ok hppa-linux-gnu-gcc (Ubuntu 8.2.0-11ubuntu1) 8.2.0 # uname -a Linux quaco 4.19.13-300.fc29.x86_64 #1 SMP Sat Dec 29 22:54:28 UTC 2018 x86_64 x86_64 x86_64 GNU/Linux # git log --oneline -1 03fa483821c0 perf test shell: Use a fallback to get the pathname in vfs_getname # perf version --build-options perf version 4.20.g03fa48 dwarf: [ on ] # HAVE_DWARF_SUPPORT dwarf_getlocations: [ on ] # HAVE_DWARF_GETLOCATIONS_SUPPORT glibc: [ on ] # HAVE_GLIBC_SUPPORT gtk2: [ on ] # HAVE_GTK2_SUPPORT syscall_table: [ on ] # HAVE_SYSCALL_TABLE_SUPPORT libbfd: [ on ] # HAVE_LIBBFD_SUPPORT libelf: [ on ] # HAVE_LIBELF_SUPPORT libnuma: [ on ] # HAVE_LIBNUMA_SUPPORT numa_num_possible_cpus: [ on ] # HAVE_LIBNUMA_SUPPORT libperl: [ on ] # HAVE_LIBPERL_SUPPORT libpython: [ on ] # HAVE_LIBPYTHON_SUPPORT libslang: [ on ] # HAVE_SLANG_SUPPORT libcrypto: [ on ] # HAVE_LIBCRYPTO_SUPPORT libunwind: [ on ] # HAVE_LIBUNWIND_SUPPORT libdw-dwarf-unwind: [ on ] # HAVE_DWARF_SUPPORT zlib: [ on ] # HAVE_ZLIB_SUPPORT lzma: [ on ] # HAVE_LZMA_SUPPORT get_cpuid: [ on ] # HAVE_AUXTRACE_SUPPORT bpf: [ on ] # HAVE_LIBBPF_SUPPORT # perf test 1: vmlinux symtab matches kallsyms : Ok 2: Detect openat syscall event : Ok 3: Detect openat syscall event on all cpus : Ok 4: Read samples using the mmap interface : Ok 5: Test data source output : Ok 6: Parse event definition strings : Ok 7: Simple expression parser : Ok 8: PERF_RECORD_* events & perf_sample fields : Ok 9: Parse perf pmu format : Ok 10: DSO data read : Ok 11: DSO data cache : Ok 12: DSO data reopen : Ok 13: Roundtrip evsel->name : Ok 14: Parse sched tracepoints fields : Ok 15: syscalls:sys_enter_openat event fields : Ok 16: Setup struct perf_event_attr : Ok 17: Match and link multiple hists : Ok 18: 'import perf' in python : Ok 19: Breakpoint overflow signal handler : Ok 20: Breakpoint overflow sampling : Ok 21: Breakpoint accounting : Ok 22: Watchpoint : 22.1: Read Only Watchpoint : Skip 22.2: Write Only Watchpoint : Ok 22.3: Read / Write Watchpoint : Ok 22.4: Modify Watchpoint : Ok 23: Number of exit events of a simple workload : Ok 24: Software clock events period values : Ok 25: Object code reading : Ok 26: Sample parsing : Ok 27: Use a dummy software event to keep tracking : Ok 28: Parse with no sample_id_all bit set : Ok 29: Filter hist entries : Ok 30: Lookup mmap thread : Ok 31: Share thread mg : Ok 32: Sort output of hist entries : Ok 33: Cumulate child hist entries : Ok 34: Track with sched_switch : Ok 35: Filter fds with revents mask in a fdarray : Ok 36: Add fd to a fdarray, making it autogrow : Ok 37: kmod_path__parse : Ok 38: Thread map : Ok 39: LLVM search and compile : 39.1: Basic BPF llvm compile : Ok 39.2: kbuild searching : Ok 39.3: Compile source for BPF prologue generation : Ok 39.4: Compile source for BPF relocation : Ok 40: Session topology : Ok 41: BPF filter : 41.1: Basic BPF filtering : Ok 41.2: BPF pinning : Ok 41.3: BPF prologue generation : Ok 41.4: BPF relocation checker : Ok 42: Synthesize thread map : Ok 43: Remove thread map : Ok 44: Synthesize cpu map : Ok 45: Synthesize stat config : Ok 46: Synthesize stat : Ok 47: Synthesize stat round : Ok 48: Synthesize attr update : Ok 49: Event times : Ok 50: Read backward ring buffer : Ok 51: Print cpu map : Ok 52: Probe SDT events : Ok 53: is_printable_array : Ok 54: Print bitmap : Ok 55: perf hooks : Ok 56: builtin clang support : Skip (not compiled in) 57: unit_number__scnprintf : Ok 58: mem2node : Ok 59: x86 rdpmc : Ok 60: Convert perf time to TSC : Ok 61: DWARF unwind : Ok 62: x86 instruction decoder - new instructions : Ok 63: x86 bp modify : Ok 64: probe libc's inet_pton & backtrace it with ping : Ok 65: Use vfs_getname probe to get syscall args filenames : Ok 66: Add vfs_getname probe to get syscall args filenames : Ok 67: Check open filename arg using perf trace + vfs_getname: Ok $ make -C tools/perf build-test make: Entering directory '/home/acme/git/perf/tools/perf' - tarpkg: ./tests/perf-targz-src-pkg . make_no_demangle_O: make NO_DEMANGLE=1 make_no_libperl_O: make NO_LIBPERL=1 make_no_newt_O: make NO_NEWT=1 make_no_gtk2_O: make NO_GTK2=1 make_no_libbpf_O: make NO_LIBBPF=1 make_static_O: make LDFLAGS=-static make_no_ui_O: make NO_NEWT=1 NO_SLANG=1 NO_GTK2=1 make_cscope_O: make cscope make_no_libaudit_O: make NO_LIBAUDIT=1 make_help_O: make help make_no_auxtrace_O: make NO_AUXTRACE=1 make_no_libnuma_O: make NO_LIBNUMA=1 make_perf_o_O: make perf.o make_no_libelf_O: make NO_LIBELF=1 make_install_prefix_O: make install prefix=/tmp/krava make_tags_O: make tags make_no_backtrace_O: make NO_BACKTRACE=1 make_pure_O: make make_no_libunwind_O: make NO_LIBUNWIND=1 make_no_slang_O: make NO_SLANG=1 make_install_O: make install make_clean_all_O: make clean all make_install_bin_O: make install-bin make_no_libpython_O: make NO_LIBPYTHON=1 make_debug_O: make DEBUG=1 make_doc_O: make doc make_minimal_O: make NO_LIBPERL=1 NO_LIBPYTHON=1 NO_NEWT=1 NO_GTK2=1 NO_DEMANGLE=1 NO_LIBELF=1 NO_LIBUNWIND=1 NO_BACKTRACE=1 NO_LIBNUMA=1 NO_LIBAUDIT=1 NO_LIBBIONIC=1 NO_LIBDW_DWARF_UNWIND=1 NO_AUXTRACE=1 NO_LIBBPF=1 NO_LIBCRYPTO=1 NO_SDT=1 NO_JVMTI=1 make_with_clangllvm_O: make LIBCLANGLLVM=1 make_util_pmu_bison_o_O: make util/pmu-bison.o make_no_libbionic_O: make NO_LIBBIONIC=1 make_no_scripts_O: make NO_LIBPYTHON=1 NO_LIBPERL=1 make_util_map_o_O: make util/map.o make_with_babeltrace_O: make LIBBABELTRACE=1 make_install_prefix_slash_O: make install prefix=/tmp/krava/ make_no_libdw_dwarf_unwind_O: make NO_LIBDW_DWARF_UNWIND=1 OK make: Leaving directory '/home/acme/git/perf/tools/perf' $ ^ permalink raw reply [flat|nested] 56+ messages in thread
* Re: [GIT PULL 00/16] perf/core improvements and fixes 2019-01-04 18:33 Arnaldo Carvalho de Melo @ 2019-01-08 15:32 ` Ingo Molnar 0 siblings, 0 replies; 56+ messages in thread From: Ingo Molnar @ 2019-01-08 15:32 UTC (permalink / raw) To: Arnaldo Carvalho de Melo Cc: Clark Williams, linux-kernel, linux-perf-users, Arnaldo Carvalho de Melo, Ivan Krylov, Jin Yao, Mattias Jacobsson * Arnaldo Carvalho de Melo <acme@kernel.org> wrote: > Hi Ingo, > > Please consider pulling, > > Best regards, > > - Arnaldo > > Test results at the end of this message, as usual. > > The following changes since commit 2573be22e5b6f24a0cabc97715c808c47e29eaaf: > > Merge tag 'perf-core-for-mingo-4.21-20190103' of git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux into perf/urgent (2019-01-03 14:05:16 +0100) > > are available in the Git repository at: > > git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux.git tags/perf-core-for-mingo-4.21-20190104 > > for you to fetch changes up to 03fa483821c0b4db7c2b1453d3332f397d82313f: > > perf test shell: Use a fallback to get the pathname in vfs_getname (2019-01-04 15:12:29 -0300) > > ---------------------------------------------------------------- > perf/core improvements and fixes: > > perf annotate: > > Ivan Krylov: > > - Pass filename to objdump via execl, fixing usage with filenames > with special characters. > > perf report: > > Jin Yao: > > Fix wrong iteration count in --branch-history > > perf stat: > > Jin Yao: > > - Fix endless wait for child process > > perf test: > > Arnaldo Carvalho de Melo: > > - Use a fallback to get the pathname in vfs_getname in > > tools build: > > Jiri Olsa: > > - Allow overriding CFLAGS assignments. > > Misc: > > Arnaldo Carvalho de Melo: > > - Syncronize UAPI headers > > Mattias Jacobsson: > > - Remove redundant va_end() in strbuf_addv() > > Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com> > > ---------------------------------------------------------------- > Arnaldo Carvalho de Melo (10): > tools headers x86: Sync disabled-features.h > tools headers uapi: Sync prctl.h with the kernel sources > tools beauty: Make the prctl option table generator catch all PR_ options > tools headers uapi: Update i915_drm.h > tools headers uapi: Sync linux/in.h copy from the kernel sources > tools headers uapi: Sync linux/kvm.h with the kernel sources > tools headers uapi: Sync copy of asm-generic/unistd.h with the kernel sources > tools headers x86: Sync asm/cpufeatures.h copy with the kernel sources > perf python: Make sure the python binding output directory is in place > perf test shell: Use a fallback to get the pathname in vfs_getname > > Ivan Krylov (1): > perf annotate: Pass filename to objdump via execl > > Jin Yao (2): > perf stat: Fix endless wait for child process > perf report: Fix wrong iteration count in --branch-history > > Jiri Olsa (2): > tools thermal tmon: Use -O3 instead of -O1 if available > tools iio: Override CFLAGS assignments > > Mattias Jacobsson (1): > perf strbuf: Remove redundant va_end() in strbuf_addv() > > tools/arch/x86/include/asm/cpufeatures.h | 2 ++ > tools/arch/x86/include/asm/disabled-features.h | 8 ++++++- > tools/iio/Makefile | 2 +- > tools/include/uapi/asm-generic/unistd.h | 4 +++- > tools/include/uapi/drm/i915_drm.h | 8 +++++++ > tools/include/uapi/linux/in.h | 10 +++++--- > tools/include/uapi/linux/kvm.h | 19 +++++++++++++++ > tools/include/uapi/linux/prctl.h | 8 +++++++ > tools/perf/Makefile.perf | 4 +++- > tools/perf/builtin-stat.c | 3 ++- > tools/perf/tests/shell/lib/probe_vfs_getname.sh | 3 ++- > tools/perf/trace/beauty/prctl_option.sh | 2 +- > tools/perf/util/annotate.c | 8 +++---- > tools/perf/util/callchain.c | 32 +++++++++++++++---------- > tools/perf/util/callchain.h | 1 + > tools/perf/util/machine.c | 2 +- > tools/perf/util/strbuf.c | 1 - > tools/thermal/tmon/Makefile | 2 +- > 18 files changed, 90 insertions(+), 29 deletions(-) Pulled, thanks a lot Arnaldo! Ingo ^ permalink raw reply [flat|nested] 56+ messages in thread
* [GIT PULL 00/16] perf/core improvements and fixes @ 2016-07-05 0:38 Arnaldo Carvalho de Melo 2016-07-05 10:14 ` Ingo Molnar 0 siblings, 1 reply; 56+ messages in thread From: Arnaldo Carvalho de Melo @ 2016-07-05 0:38 UTC (permalink / raw) To: Ingo Molnar Cc: linux-kernel, Arnaldo Carvalho de Melo, Adrian Hunter, Alexei Starovoitov, Ananth N Mavinakayanahalli, Brendan Gregg, Chris Phlipot, Christoph Hellwig, Daniel Borkmann, David Ahern, Eric Leblond, He Kuang, Hemant Kumar, Jiri Olsa, Masami Hiramatsu, Milian Wolff, Namhyung Kim, Peter Zijlstra, pi3orama, Taeung Song, Wang Nan, Zefan Li, Arnaldo Carvalho de Melo Hi Ingo, Please consider pulling, Build status: [root@jouet ~]# dm alpine:3.4: FAIL <--- strerror_r usage needs to be made XSI-compliant, Alpine Linux uses musl libc centos:5: FAIL <--- flex snafu, being investigated by Jiri centos:6: Ok centos:7: Ok debian:7: Ok debian:8: Ok debian:experimental: Ok fedora:21: Ok fedora:22: Ok fedora:23: Ok fedora:24: Ok fedora:rawhide: Ok mageia:5: Ok opensuse:13.2: Ok opensuse:42.1: Ok ubuntu:12.04.5: Ok ubuntu:14.04.4: Ok ubuntu:15.10: Ok ubuntu:16.04: Ok [root@jouet ~]# These images are now available at: https://hub.docker.com/search/?q=acmel And the 'dm' script at: https://fedorapeople.org/~acme/perf/dm Regards, - Arnaldo The following changes since commit dc29bb47a34130459fadd58f05e2acd051a6327d: Merge tag 'perf-core-for-mingo-20160630' of git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux into perf/core (2016-07-01 08:40:39 +0200) are available in the git repository at: git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux.git tags/perf-core-for-mingo-20160704 for you to fetch changes up to f3d082ceabe53177c98bfa4580a294c2844966e8: perf tools: Sync copy of syscall_64.tbl with the kernel (2016-07-04 20:29:40 -0300) ---------------------------------------------------------------- perf/core improvements and fixes: Documentation: - Update android build documentation (Chris Phlipot) Infrastructure: - Respect WERROR=0 in libapi and libsubcmd, to allow building on Android (Chris Phlipot) - Prep work to support SDT events in probe cache (Masami Hiramatsu) - ELF support for SDT (Hemant Kumar) - Add feature detection for libelf's elf_getshdrstrndx function (Arnaldo Carvalho de Melo) - Fix hist accumulation test (Jiri Olsa) - Unwind callchain fixes (Jiri Olsa) - Change internal representation of numa nodes obtained from perf.data header (Jiri Olsa) - Sync copy of syscall_64.tbl with the kernel (Arnaldo Carvalho de Melo) - Add LGPL 2.1 license header to libbpf source files (Wang Nan) Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com> ---------------------------------------------------------------- Arnaldo Carvalho de Melo (2): perf build: Add feature detection for libelf's elf_getshdrstrndx() perf tools: Sync copy of syscall_64.tbl with the kernel Chris Phlipot (3): tools lib api: Respect WERROR=0 for build tools lib subcmd: Respect WERROR=0 for build perf tools: Update android build documentation Hemant Kumar (1): perf sdt: ELF support for SDT Jiri Olsa (4): perf header: Transform nodes string info to struct perf tests: Fix hist accumulation test perf unwind: Add initialized arg into unwind__prepare_access perf unwind: Call unwind__prepare_access for forked thread Masami Hiramatsu (5): perf probe: Use cache entry if possible perf probe: Show all cached probes perf probe: Remove caches when --cache is given perf probe: Add group name support perf buildid-cache: Scan and import user SDT events to probe cache Wang Nan (1): tools lib bpf: Add license header tools/build/Makefile.feature | 1 + tools/build/feature/Makefile | 4 + tools/build/feature/test-all.c | 5 + tools/build/feature/test-libelf-getshdrstrndx.c | 8 + tools/lib/api/Makefile | 8 +- tools/lib/bpf/bpf.c | 13 ++ tools/lib/bpf/bpf.h | 13 ++ tools/lib/bpf/libbpf.c | 13 ++ tools/lib/bpf/libbpf.h | 13 ++ tools/lib/subcmd/Makefile | 8 +- tools/perf/Documentation/android.txt | 16 +- tools/perf/Documentation/perf-buildid-cache.txt | 3 + tools/perf/Documentation/perf-probe.txt | 19 +- tools/perf/arch/x86/entry/syscalls/syscall_64.tbl | 2 + tools/perf/builtin-probe.c | 31 ++- tools/perf/config/Makefile | 4 + tools/perf/tests/hists_cumulate.c | 4 + tools/perf/util/build-id.c | 138 +++++++++++- tools/perf/util/build-id.h | 3 + tools/perf/util/callchain.h | 1 + tools/perf/util/env.c | 5 +- tools/perf/util/env.h | 10 +- tools/perf/util/header.c | 76 +++---- tools/perf/util/map.c | 9 +- tools/perf/util/map.h | 2 +- tools/perf/util/probe-event.c | 91 +++++++- tools/perf/util/probe-file.c | 175 ++++++++++++++- tools/perf/util/probe-file.h | 10 +- tools/perf/util/symbol-elf.c | 260 ++++++++++++++++++++++ tools/perf/util/symbol.h | 22 ++ tools/perf/util/thread.c | 39 +++- tools/perf/util/unwind-libunwind.c | 11 +- tools/perf/util/unwind.h | 9 +- tools/perf/util/util.c | 19 +- 34 files changed, 940 insertions(+), 105 deletions(-) create mode 100644 tools/build/feature/test-libelf-getshdrstrndx.c ^ permalink raw reply [flat|nested] 56+ messages in thread
* Re: [GIT PULL 00/16] perf/core improvements and fixes 2016-07-05 0:38 Arnaldo Carvalho de Melo @ 2016-07-05 10:14 ` Ingo Molnar 0 siblings, 0 replies; 56+ messages in thread From: Ingo Molnar @ 2016-07-05 10:14 UTC (permalink / raw) To: Arnaldo Carvalho de Melo Cc: linux-kernel, Adrian Hunter, Alexei Starovoitov, Ananth N Mavinakayanahalli, Brendan Gregg, Chris Phlipot, Christoph Hellwig, Daniel Borkmann, David Ahern, Eric Leblond, He Kuang, Hemant Kumar, Jiri Olsa, Masami Hiramatsu, Milian Wolff, Namhyung Kim, Peter Zijlstra, pi3orama, Taeung Song, Wang Nan, Zefan Li, Arnaldo Carvalho de Melo * Arnaldo Carvalho de Melo <acme@kernel.org> wrote: > Hi Ingo, > > Please consider pulling, > > Build status: > > [root@jouet ~]# dm > alpine:3.4: FAIL <--- strerror_r usage needs to be made XSI-compliant, Alpine Linux uses musl libc > centos:5: FAIL <--- flex snafu, being investigated by Jiri > centos:6: Ok > centos:7: Ok > debian:7: Ok > debian:8: Ok > debian:experimental: Ok > fedora:21: Ok > fedora:22: Ok > fedora:23: Ok > fedora:24: Ok > fedora:rawhide: Ok > mageia:5: Ok > opensuse:13.2: Ok > opensuse:42.1: Ok > ubuntu:12.04.5: Ok > ubuntu:14.04.4: Ok > ubuntu:15.10: Ok > ubuntu:16.04: Ok > [root@jouet ~]# > > These images are now available at: https://hub.docker.com/search/?q=acmel > > And the 'dm' script at: https://fedorapeople.org/~acme/perf/dm > > Regards, > > - Arnaldo > > The following changes since commit dc29bb47a34130459fadd58f05e2acd051a6327d: > > Merge tag 'perf-core-for-mingo-20160630' of git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux into perf/core (2016-07-01 08:40:39 +0200) > > are available in the git repository at: > > git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux.git tags/perf-core-for-mingo-20160704 > > for you to fetch changes up to f3d082ceabe53177c98bfa4580a294c2844966e8: > > perf tools: Sync copy of syscall_64.tbl with the kernel (2016-07-04 20:29:40 -0300) > > ---------------------------------------------------------------- > perf/core improvements and fixes: > > Documentation: > > - Update android build documentation (Chris Phlipot) > > Infrastructure: > > - Respect WERROR=0 in libapi and libsubcmd, to allow building on Android (Chris Phlipot) > > - Prep work to support SDT events in probe cache (Masami Hiramatsu) > > - ELF support for SDT (Hemant Kumar) > > - Add feature detection for libelf's elf_getshdrstrndx function (Arnaldo Carvalho de Melo) > > - Fix hist accumulation test (Jiri Olsa) > > - Unwind callchain fixes (Jiri Olsa) > > - Change internal representation of numa nodes obtained from > perf.data header (Jiri Olsa) > > - Sync copy of syscall_64.tbl with the kernel (Arnaldo Carvalho de Melo) > > - Add LGPL 2.1 license header to libbpf source files (Wang Nan) > > Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com> > > ---------------------------------------------------------------- > Arnaldo Carvalho de Melo (2): > perf build: Add feature detection for libelf's elf_getshdrstrndx() > perf tools: Sync copy of syscall_64.tbl with the kernel > > Chris Phlipot (3): > tools lib api: Respect WERROR=0 for build > tools lib subcmd: Respect WERROR=0 for build > perf tools: Update android build documentation > > Hemant Kumar (1): > perf sdt: ELF support for SDT > > Jiri Olsa (4): > perf header: Transform nodes string info to struct > perf tests: Fix hist accumulation test > perf unwind: Add initialized arg into unwind__prepare_access > perf unwind: Call unwind__prepare_access for forked thread > > Masami Hiramatsu (5): > perf probe: Use cache entry if possible > perf probe: Show all cached probes > perf probe: Remove caches when --cache is given > perf probe: Add group name support > perf buildid-cache: Scan and import user SDT events to probe cache > > Wang Nan (1): > tools lib bpf: Add license header > > tools/build/Makefile.feature | 1 + > tools/build/feature/Makefile | 4 + > tools/build/feature/test-all.c | 5 + > tools/build/feature/test-libelf-getshdrstrndx.c | 8 + > tools/lib/api/Makefile | 8 +- > tools/lib/bpf/bpf.c | 13 ++ > tools/lib/bpf/bpf.h | 13 ++ > tools/lib/bpf/libbpf.c | 13 ++ > tools/lib/bpf/libbpf.h | 13 ++ > tools/lib/subcmd/Makefile | 8 +- > tools/perf/Documentation/android.txt | 16 +- > tools/perf/Documentation/perf-buildid-cache.txt | 3 + > tools/perf/Documentation/perf-probe.txt | 19 +- > tools/perf/arch/x86/entry/syscalls/syscall_64.tbl | 2 + > tools/perf/builtin-probe.c | 31 ++- > tools/perf/config/Makefile | 4 + > tools/perf/tests/hists_cumulate.c | 4 + > tools/perf/util/build-id.c | 138 +++++++++++- > tools/perf/util/build-id.h | 3 + > tools/perf/util/callchain.h | 1 + > tools/perf/util/env.c | 5 +- > tools/perf/util/env.h | 10 +- > tools/perf/util/header.c | 76 +++---- > tools/perf/util/map.c | 9 +- > tools/perf/util/map.h | 2 +- > tools/perf/util/probe-event.c | 91 +++++++- > tools/perf/util/probe-file.c | 175 ++++++++++++++- > tools/perf/util/probe-file.h | 10 +- > tools/perf/util/symbol-elf.c | 260 ++++++++++++++++++++++ > tools/perf/util/symbol.h | 22 ++ > tools/perf/util/thread.c | 39 +++- > tools/perf/util/unwind-libunwind.c | 11 +- > tools/perf/util/unwind.h | 9 +- > tools/perf/util/util.c | 19 +- > 34 files changed, 940 insertions(+), 105 deletions(-) > create mode 100644 tools/build/feature/test-libelf-getshdrstrndx.c Pulled, thanks a lot Arnaldo! Ingo ^ permalink raw reply [flat|nested] 56+ messages in thread
* [GIT PULL 00/16] perf/core improvements and fixes @ 2016-01-29 21:17 Arnaldo Carvalho de Melo 2016-02-03 10:02 ` Ingo Molnar 0 siblings, 1 reply; 56+ messages in thread From: Arnaldo Carvalho de Melo @ 2016-01-29 21:17 UTC (permalink / raw) To: Ingo Molnar Cc: linux-kernel, Arnaldo Carvalho de Melo, Adrian Hunter, Alexander Yarygin, Alexei Starovoitov, Al Grant, Brendan Gregg, Chunyan Zhang, Daniel Borkmann, David Ahern, He Kuang, Hemant Kumar, Jiri Olsa, linux-arm-kernel, linux-doc, linuxppc-dev, Li Zefan, Masami Hiramatsu, Mathieu Poirier, Michael Ellerman, Mike Leach, Namhyung Kim, Naveen N . Rao, Paul Mackerras, Peter Zijlstra, pi3orama, Rabin Vincent, Scott Wood, Srikar Dronamraju, Tor Jeremiassen, Wang Nan, Will Deacon, Arnaldo Carvalho de Melo See http://www.infradead.org/rpr.html Hi Ingo, This is on top of the previously submitted perf-core-for-mingo tag, please consider applying, - Arnaldo The following changes since commit 5ac76283b32b116c58e362e99542182ddcfc8262: perf cpumap: Auto initialize cpu__max_{node,cpu} (2016-01-26 16:08:36 -0300) are available in the git repository at: git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux.git tags/perf-core-for-mingo-2 for you to fetch changes up to 814568db641f6587c1e98a3a85f214cb6a30fe10: perf build: Align the names of the build tests: (2016-01-29 17:51:04 -0300) ---------------------------------------------------------------- New features: - Port 'perf kvm stat' to PowerPC (Hemant Kumar) Infrastructure: - Use the 'feature-dump' target to do the feature checks just once and then add code to reuse that in the tests/make makefile, speeding up the 'make -C tools/perf build-test' target (Wang Nan) - Reduce the number of tests the 'build-test' target do to those that don't pollute the source tree (Arnaldo Carvalho de Melo) - Improve the output of the build tests a bit by aligning the name of the tests, more can be done to filter out uninteresting info in the output (Arnaldo Carvalho de Melo) - Add perf_evlist pointer to *info_priv_size(), more prep work for supporting the coresight architecture (Mathieu Poirier) - Improve the 'perf test bp_signal' test (Wang Nan) - Check environment before starting the BPF 'perf test', so that we can just 'Skip' older kernels instead of 'FAIL'ing them (Wang Nan) - Fix cpumode of synthesized buildid event (Wang Nan) Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com> ---------------------------------------------------------------- Arnaldo Carvalho de Melo (2): perf tools: Speed up build-tests by reducing the number of builds tested perf build: Align the names of the build tests: Hemant Kumar (4): perf kvm/{x86,s390}: Remove dependency on uapi/kvm_perf.h perf kvm/{x86,s390}: Remove const from kvm_events_tp perf kvm/powerpc: Port perf kvm stat to powerpc perf kvm/powerpc: Add support for HCALL reasons Jiri Olsa (1): perf build: Fix feature-dump checks, we need to test all features Mathieu Poirier (1): perf auxtrace: Add perf_evlist pointer to *info_priv_size() Wang Nan (8): tools build: Check basic headers for test-compile feature checker perf build: Remove all condition feature check {C,LD}FLAGS perf build: Use feature dump file for build-test perf buildid: Fix cpumode of buildid event perf test: Check environment before start real BPF test perf test: Improve bp_signal perf tools: Move timestamp creation to util perf record: Use OPT_BOOLEAN_SET for buildid cache related options tools/build/Makefile.feature | 8 ++ tools/build/feature/test-compile.c | 2 + tools/perf/Makefile | 11 +- tools/perf/arch/powerpc/Makefile | 2 + tools/perf/arch/powerpc/util/Build | 1 + tools/perf/arch/powerpc/util/book3s_hcalls.h | 123 ++++++++++++++++++ tools/perf/arch/powerpc/util/book3s_hv_exits.h | 33 +++++ tools/perf/arch/powerpc/util/kvm-stat.c | 170 +++++++++++++++++++++++++ tools/perf/arch/s390/util/kvm-stat.c | 10 +- tools/perf/arch/x86/util/intel-bts.c | 4 +- tools/perf/arch/x86/util/intel-pt.c | 4 +- tools/perf/arch/x86/util/kvm-stat.c | 16 ++- tools/perf/builtin-buildid-cache.c | 14 +- tools/perf/builtin-kvm.c | 38 ++++-- tools/perf/builtin-record.c | 12 +- tools/perf/config/Makefile | 101 +++++++-------- tools/perf/tests/bp_signal.c | 140 ++++++++++++++++---- tools/perf/tests/bpf.c | 37 ++++++ tools/perf/tests/make | 39 +++++- tools/perf/util/auxtrace.c | 7 +- tools/perf/util/auxtrace.h | 6 +- tools/perf/util/build-id.c | 6 +- tools/perf/util/kvm-stat.h | 8 +- tools/perf/util/util.c | 17 +++ tools/perf/util/util.h | 1 + 25 files changed, 688 insertions(+), 122 deletions(-) create mode 100644 tools/perf/arch/powerpc/util/book3s_hcalls.h create mode 100644 tools/perf/arch/powerpc/util/book3s_hv_exits.h create mode 100644 tools/perf/arch/powerpc/util/kvm-stat.c ^ permalink raw reply [flat|nested] 56+ messages in thread
* Re: [GIT PULL 00/16] perf/core improvements and fixes 2016-01-29 21:17 Arnaldo Carvalho de Melo @ 2016-02-03 10:02 ` Ingo Molnar 0 siblings, 0 replies; 56+ messages in thread From: Ingo Molnar @ 2016-02-03 10:02 UTC (permalink / raw) To: Arnaldo Carvalho de Melo Cc: linux-kernel, Adrian Hunter, Alexander Yarygin, Alexei Starovoitov, Al Grant, Brendan Gregg, Chunyan Zhang, Daniel Borkmann, David Ahern, He Kuang, Hemant Kumar, Jiri Olsa, linux-arm-kernel, linux-doc, linuxppc-dev, Li Zefan, Masami Hiramatsu, Mathieu Poirier, Michael Ellerman, Mike Leach, Namhyung Kim, Naveen N . Rao, Paul Mackerras, Peter Zijlstra, pi3orama, Rabin Vincent, Scott Wood, Srikar Dronamraju, Tor Jeremiassen, Wang Nan, Will Deacon, Arnaldo Carvalho de Melo * Arnaldo Carvalho de Melo <acme@kernel.org> wrote: > Hi Ingo, > > This is on top of the previously submitted perf-core-for-mingo tag, > please consider applying, > > - Arnaldo > > The following changes since commit 5ac76283b32b116c58e362e99542182ddcfc8262: > > perf cpumap: Auto initialize cpu__max_{node,cpu} (2016-01-26 16:08:36 -0300) > > are available in the git repository at: > > git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux.git tags/perf-core-for-mingo-2 > > for you to fetch changes up to 814568db641f6587c1e98a3a85f214cb6a30fe10: > > perf build: Align the names of the build tests: (2016-01-29 17:51:04 -0300) > > ---------------------------------------------------------------- > New features: > > - Port 'perf kvm stat' to PowerPC (Hemant Kumar) > > Infrastructure: > > - Use the 'feature-dump' target to do the feature checks just once and then > add code to reuse that in the tests/make makefile, speeding up the > 'make -C tools/perf build-test' target (Wang Nan) > > - Reduce the number of tests the 'build-test' target do to those that don't > pollute the source tree (Arnaldo Carvalho de Melo) > > - Improve the output of the build tests a bit by aligning the name of the > tests, more can be done to filter out uninteresting info in the output > (Arnaldo Carvalho de Melo) > > - Add perf_evlist pointer to *info_priv_size(), more prep work for > supporting the coresight architecture (Mathieu Poirier) > > - Improve the 'perf test bp_signal' test (Wang Nan) > > - Check environment before starting the BPF 'perf test', so that we can just > 'Skip' older kernels instead of 'FAIL'ing them (Wang Nan) > > - Fix cpumode of synthesized buildid event (Wang Nan) > > Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com> > > ---------------------------------------------------------------- > Arnaldo Carvalho de Melo (2): > perf tools: Speed up build-tests by reducing the number of builds tested > perf build: Align the names of the build tests: > > Hemant Kumar (4): > perf kvm/{x86,s390}: Remove dependency on uapi/kvm_perf.h > perf kvm/{x86,s390}: Remove const from kvm_events_tp > perf kvm/powerpc: Port perf kvm stat to powerpc > perf kvm/powerpc: Add support for HCALL reasons > > Jiri Olsa (1): > perf build: Fix feature-dump checks, we need to test all features > > Mathieu Poirier (1): > perf auxtrace: Add perf_evlist pointer to *info_priv_size() > > Wang Nan (8): > tools build: Check basic headers for test-compile feature checker > perf build: Remove all condition feature check {C,LD}FLAGS > perf build: Use feature dump file for build-test > perf buildid: Fix cpumode of buildid event > perf test: Check environment before start real BPF test > perf test: Improve bp_signal > perf tools: Move timestamp creation to util > perf record: Use OPT_BOOLEAN_SET for buildid cache related options > > tools/build/Makefile.feature | 8 ++ > tools/build/feature/test-compile.c | 2 + > tools/perf/Makefile | 11 +- > tools/perf/arch/powerpc/Makefile | 2 + > tools/perf/arch/powerpc/util/Build | 1 + > tools/perf/arch/powerpc/util/book3s_hcalls.h | 123 ++++++++++++++++++ > tools/perf/arch/powerpc/util/book3s_hv_exits.h | 33 +++++ > tools/perf/arch/powerpc/util/kvm-stat.c | 170 +++++++++++++++++++++++++ > tools/perf/arch/s390/util/kvm-stat.c | 10 +- > tools/perf/arch/x86/util/intel-bts.c | 4 +- > tools/perf/arch/x86/util/intel-pt.c | 4 +- > tools/perf/arch/x86/util/kvm-stat.c | 16 ++- > tools/perf/builtin-buildid-cache.c | 14 +- > tools/perf/builtin-kvm.c | 38 ++++-- > tools/perf/builtin-record.c | 12 +- > tools/perf/config/Makefile | 101 +++++++-------- > tools/perf/tests/bp_signal.c | 140 ++++++++++++++++---- > tools/perf/tests/bpf.c | 37 ++++++ > tools/perf/tests/make | 39 +++++- > tools/perf/util/auxtrace.c | 7 +- > tools/perf/util/auxtrace.h | 6 +- > tools/perf/util/build-id.c | 6 +- > tools/perf/util/kvm-stat.h | 8 +- > tools/perf/util/util.c | 17 +++ > tools/perf/util/util.h | 1 + > 25 files changed, 688 insertions(+), 122 deletions(-) > create mode 100644 tools/perf/arch/powerpc/util/book3s_hcalls.h > create mode 100644 tools/perf/arch/powerpc/util/book3s_hv_exits.h > create mode 100644 tools/perf/arch/powerpc/util/kvm-stat.c Pulled, thanks a lot Arnaldo! Ingo ^ permalink raw reply [flat|nested] 56+ messages in thread
* [GIT PULL 00/16] perf/core improvements and fixes @ 2015-12-14 17:44 Arnaldo Carvalho de Melo 0 siblings, 0 replies; 56+ messages in thread From: Arnaldo Carvalho de Melo @ 2015-12-14 17:44 UTC (permalink / raw) To: Ingo Molnar Cc: linux-kernel, Arnaldo Carvalho de Melo, Adrian Hunter, Alexei Starovoitov, Andi Kleen, David Ahern, He Kuang, Jiri Olsa, Josh Poimboeuf, Masami Hiramatsu, Namhyung Kim, Peter Zijlstra, pi3orama, Wang Nan, Zefan Li, Arnaldo Carvalho de Melo Hi Ingo, Please consider pulling, - Arnaldo The following changes since commit 0d76ded582c178d3cca55c9112eceb5b0f12f558: Merge tag 'perf-core-for-mingo' of git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux into perf/core (2015-12-14 09:31:39 +0100) are available in the git repository at: git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux.git tags/perf-core-for-mingo for you to fetch changes up to 7efe0e034c713716060bc7794c7e332589980c70: perf record: Support custom vmlinux path (2015-12-14 13:04:12 -0300) ---------------------------------------------------------------- perf/core improvements and fixes: User visible: - Make command line options always available, even when they depend on some feature being enabled, warning the user about use of such options (Wang Nan) - Support --vmlinux in perf record, useful, so far, for eBPF, where we will set up events that will be used in the record session (He Kuang) - Automatically disable collecting branch flags and cycles with --call-graph lbr. This allows avoiding a bunch of extra MSR reads in the PMI on Skylake. (Andi Kleen) Infrastructure: - Dump the stack when a 'perf test -v ' entry segfaults, so far we would have to run it under gdb with 'set follow-fork-mode child' set to get a proper backtrace (Arnaldo Carvalho de Melo) - Initialize the refcnt in 'struct thread' to 1 and fixup its users accordingly, so that we try to have the same refcount model accross the perf codebase (Arnaldo Carvalho de Melo) - More prep work for moving the subcmd infrastructure out of tools/perf/ and into tools/lib/subcmd/ to be used by other tools/ living utilities (Josh Poimboeuf) - Fix 'perf test' hist testcases when kptr_restrict is on (Namhyung Kim) Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com> ---------------------------------------------------------------- Andi Kleen (1): perf evsel: Disable branch flags/cycles for --callgraph lbr Arnaldo Carvalho de Melo (3): perf tools: Use same signal handling strategy as 'record' perf test: Dump the stack when test segfaults when in verbose mode perf thread: Fix reference count initial state He Kuang (1): perf record: Support custom vmlinux path Josh Poimboeuf (9): perf build: Remove unnecessary line in Makefile.feature perf test: Add Build file to dependencies for llvm-src-*.c perf test: Remove tarpkg at end of test perf build: Fix 'make clean' perf build: Rename LIB_PATH -> API_PATH perf tools: Create pager.h perf tools: Remove check for unused PERF_PAGER_IN_USE perf tools: Move help_unknown_cmd() to its own file perf tools: Convert parse-options.c internal functions to static Namhyung Kim (1): perf test: Fix hist testcases when kptr_restrict is on Wang Nan (1): perf tools: Make options always available, even if required libs not linked tools/build/Makefile.feature | 1 - tools/perf/Documentation/perf-record.txt | 10 ++- tools/perf/Makefile.perf | 15 ++-- tools/perf/builtin-probe.c | 15 +++- tools/perf/builtin-record.c | 25 +++++- tools/perf/tests/Build | 6 +- tools/perf/tests/builtin-test.c | 3 + tools/perf/tests/hists_common.c | 5 +- tools/perf/tests/make | 3 +- tools/perf/util/Build | 1 + tools/perf/util/cache.h | 5 +- tools/perf/util/evsel.c | 14 +++- tools/perf/util/help-unknown-cmd.c | 103 +++++++++++++++++++++++ tools/perf/util/help-unknown-cmd.h | 0 tools/perf/util/help.c | 107 +----------------------- tools/perf/util/help.h | 3 + tools/perf/util/intel-pt.c | 4 +- tools/perf/util/machine.c | 19 +++-- tools/perf/util/pager.c | 8 +- tools/perf/util/pager.h | 7 ++ tools/perf/util/parse-options.c | 136 +++++++++++++++++++++++++++---- tools/perf/util/parse-options.h | 14 ++-- tools/perf/util/thread.c | 10 ++- tools/perf/util/util.c | 3 +- 24 files changed, 341 insertions(+), 176 deletions(-) create mode 100644 tools/perf/util/help-unknown-cmd.c create mode 100644 tools/perf/util/help-unknown-cmd.h create mode 100644 tools/perf/util/pager.h ^ permalink raw reply [flat|nested] 56+ messages in thread
* [GIT PULL 00/16] perf/core improvements and fixes @ 2015-10-05 21:03 Arnaldo Carvalho de Melo 2015-10-06 7:09 ` Ingo Molnar 0 siblings, 1 reply; 56+ messages in thread From: Arnaldo Carvalho de Melo @ 2015-10-05 21:03 UTC (permalink / raw) To: Ingo Molnar Cc: linux-kernel, Arnaldo Carvalho de Melo, Adrian Hunter, Andi Kleen, Borislav Petkov, Chandler Carruth, Daniel Bristot de Oliveira, David Ahern, Don Zickus, Fenghua Yu, Frederic Weisbecker, Jiri Olsa, Kanaka Juvva, Kan Liang, Matt Fleming, Namhyung Kim, Peter Zijlstra, Stephane Eranian, Vikas Shivappa, Vince Weaver, Wang Nan, Arnaldo Carvalho de Melo Hi Ingo, Please consider pulling, - Arnaldo The following changes since commit e3b0ac1b7a8a590440a2030e7d10d48c59ab8a2a: Merge tag 'perf-core-for-mingo' of git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux into perf/core (2015-10-03 08:20:14 +0200) are available in the git repository at: git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux.git tags/perf-core-for-mingo for you to fetch changes up to 27bf90bf0690f55c3679bcc4c325823cf1cfd19d: perf tools: Fail properly in case pattern matching fails to find tracepoint (2015-10-05 17:59:50 -0300) ---------------------------------------------------------------- perf/core improvements and fixes: User visible: - Switch the default callchain output mode to 'graph,0.5,caller', to make it look like the default for other tools, reducing the learning curve for people used to 'caller' based viewing (Arnaldo Carvalho de Melo) - Implement column based horizontal scrolling in the hists browser (top, report), making it possible to use the TUI for things like 'perf mem report' where there are many more columns than can fit in a terminal (Arnaldo Carvalho de Melo) - Support sorting by symbol_iaddr with perf.data files produced by 'perf mem record' (Don Zickus) - Display DATA_SRC sample type bit, i.e. when running 'perf evlist -v' the "DATA_SRC" wasn't appearing when set, fix it to look like: (Jiri Olsa) cpu/mem-loads/pp: ...SNIP... sample_type: IP|TID|TIME|ADDR|CPU|PERIOD|DATA_SRC - Introduce 'P' event modified, meaning 'max precision level, please', i.e.: $ perf record -e cycles:P usleep 1 Is now similar to: $ perf record usleep 1 Useful, for instance, when specifying multiple events (Jiri Olsa) - Make 'perf -v' and 'perf -h' work (Jiri Olsa) - Fail properly when pattern matching fails to find a tracepoint, i.e. '-e non:existent' was being correctly handled, with a proper error message about that not being a valid event, but '-e non:existent*' wasn't, fix it (Jiri Olsa) Infrastructure: - Separate arch specific entries in 'perf test' and add an 'Intel CQM' one to be fun on x86 only (Matt Fleming) Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com> ---------------------------------------------------------------- Arnaldo Carvalho de Melo (3): perf callchain: Switch default to 'graph,0.5,caller' perf ui browser: Optional horizontal scrolling key binding perf hists browser: Implement horizontal scrolling Don Zickus (1): perf tools: Add support for sorting on the iaddr Jiri Olsa (9): tools lib api fs: No need to use PATH_MAX + 1 perf evlist: Display DATA_SRC sample type bit perf annotate: Fix sizeof_sym_hist overflow issue perf tools: Export perf_event_attr__set_max_precise_ip() perf tools: Introduce 'P' modifier to request max precision perf tests: Add parsing test for 'P' modifier perf tools: Setup proper width for symbol_iaddr field perf tools: Handle -h and -v options perf tools: Fail properly in case pattern matching fails to find tracepoint Matt Fleming (3): perf tests: Add arch tests perf tests: Move x86 tests into arch directory perf tests: Add Intel CQM test tools/lib/api/fs/fs.c | 2 +- tools/lib/api/fs/tracing_path.c | 6 +- tools/perf/Documentation/perf-list.txt | 1 + tools/perf/Documentation/perf-report.txt | 2 +- tools/perf/Documentation/perf.txt | 8 ++ tools/perf/arch/x86/Build | 2 +- tools/perf/arch/x86/include/arch-tests.h | 19 ++++ tools/perf/arch/x86/tests/Build | 10 +- tools/perf/arch/x86/tests/arch-tests.c | 34 ++++++ tools/perf/arch/x86/tests/dwarf-unwind.c | 1 + .../perf/{ => arch/x86}/tests/gen-insn-x86-dat.awk | 0 .../perf/{ => arch/x86}/tests/gen-insn-x86-dat.sh | 0 tools/perf/{ => arch/x86}/tests/insn-x86-dat-32.c | 0 tools/perf/{ => arch/x86}/tests/insn-x86-dat-64.c | 0 tools/perf/{ => arch/x86}/tests/insn-x86-dat-src.c | 0 tools/perf/{ => arch/x86}/tests/insn-x86.c | 3 +- tools/perf/arch/x86/tests/intel-cqm.c | 124 +++++++++++++++++++++ tools/perf/{ => arch/x86}/tests/perf-time-to-tsc.c | 4 +- tools/perf/{ => arch/x86}/tests/rdpmc.c | 7 +- tools/perf/builtin-report.c | 4 +- tools/perf/perf.c | 14 +++ tools/perf/tests/Build | 6 - tools/perf/tests/builtin-test.c | 56 ++++------ tools/perf/tests/dwarf-unwind.c | 4 + tools/perf/tests/parse-events.c | 16 +++ tools/perf/tests/tests.h | 10 +- tools/perf/ui/browser.c | 14 +++ tools/perf/ui/browser.h | 2 +- tools/perf/ui/browsers/hists.c | 22 +++- tools/perf/util/annotate.h | 2 +- tools/perf/util/evlist.c | 2 +- tools/perf/util/evlist.h | 2 + tools/perf/util/evsel.c | 5 +- tools/perf/util/evsel.h | 1 + tools/perf/util/hist.c | 13 +++ tools/perf/util/hist.h | 1 + tools/perf/util/parse-events.c | 17 ++- tools/perf/util/parse-events.l | 2 +- tools/perf/util/sort.c | 37 ++++++ tools/perf/util/sort.h | 1 + tools/perf/util/util.c | 4 +- 41 files changed, 381 insertions(+), 77 deletions(-) create mode 100644 tools/perf/arch/x86/include/arch-tests.h create mode 100644 tools/perf/arch/x86/tests/arch-tests.c rename tools/perf/{ => arch/x86}/tests/gen-insn-x86-dat.awk (100%) rename tools/perf/{ => arch/x86}/tests/gen-insn-x86-dat.sh (100%) rename tools/perf/{ => arch/x86}/tests/insn-x86-dat-32.c (100%) rename tools/perf/{ => arch/x86}/tests/insn-x86-dat-64.c (100%) rename tools/perf/{ => arch/x86}/tests/insn-x86-dat-src.c (100%) rename tools/perf/{ => arch/x86}/tests/insn-x86.c (98%) create mode 100644 tools/perf/arch/x86/tests/intel-cqm.c rename tools/perf/{ => arch/x86}/tests/perf-time-to-tsc.c (98%) rename tools/perf/{ => arch/x86}/tests/rdpmc.c (97%) ^ permalink raw reply [flat|nested] 56+ messages in thread
* Re: [GIT PULL 00/16] perf/core improvements and fixes 2015-10-05 21:03 Arnaldo Carvalho de Melo @ 2015-10-06 7:09 ` Ingo Molnar 0 siblings, 0 replies; 56+ messages in thread From: Ingo Molnar @ 2015-10-06 7:09 UTC (permalink / raw) To: Arnaldo Carvalho de Melo Cc: linux-kernel, Adrian Hunter, Andi Kleen, Borislav Petkov, Chandler Carruth, Daniel Bristot de Oliveira, David Ahern, Don Zickus, Fenghua Yu, Frederic Weisbecker, Jiri Olsa, Kanaka Juvva, Kan Liang, Matt Fleming, Namhyung Kim, Peter Zijlstra, Stephane Eranian, Vikas Shivappa, Vince Weaver, Wang Nan, Arnaldo Carvalho de Melo * Arnaldo Carvalho de Melo <acme@kernel.org> wrote: > Hi Ingo, > > Please consider pulling, > > - Arnaldo > > The following changes since commit e3b0ac1b7a8a590440a2030e7d10d48c59ab8a2a: > > Merge tag 'perf-core-for-mingo' of git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux into perf/core (2015-10-03 08:20:14 +0200) > > are available in the git repository at: > > git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux.git tags/perf-core-for-mingo > > for you to fetch changes up to 27bf90bf0690f55c3679bcc4c325823cf1cfd19d: > > perf tools: Fail properly in case pattern matching fails to find tracepoint (2015-10-05 17:59:50 -0300) > > ---------------------------------------------------------------- > perf/core improvements and fixes: > > User visible: > > - Switch the default callchain output mode to 'graph,0.5,caller', to make it > look like the default for other tools, reducing the learning curve for > people used to 'caller' based viewing (Arnaldo Carvalho de Melo) > > - Implement column based horizontal scrolling in the hists browser (top, report), > making it possible to use the TUI for things like 'perf mem report' where > there are many more columns than can fit in a terminal (Arnaldo Carvalho de Melo) > > - Support sorting by symbol_iaddr with perf.data files produced by > 'perf mem record' (Don Zickus) > > - Display DATA_SRC sample type bit, i.e. when running 'perf evlist -v' the > "DATA_SRC" wasn't appearing when set, fix it to look like: (Jiri Olsa) > > cpu/mem-loads/pp: ...SNIP... sample_type: IP|TID|TIME|ADDR|CPU|PERIOD|DATA_SRC > > - Introduce 'P' event modified, meaning 'max precision level, please', i.e.: > > $ perf record -e cycles:P usleep 1 > > Is now similar to: > > $ perf record usleep 1 > > Useful, for instance, when specifying multiple events (Jiri Olsa) > > - Make 'perf -v' and 'perf -h' work (Jiri Olsa) > > - Fail properly when pattern matching fails to find a tracepoint, i.e. > '-e non:existent' was being correctly handled, with a proper error message > about that not being a valid event, but '-e non:existent*' wasn't, > fix it (Jiri Olsa) > > Infrastructure: > > - Separate arch specific entries in 'perf test' and add an 'Intel CQM' one > to be fun on x86 only (Matt Fleming) > > Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com> > > ---------------------------------------------------------------- > Arnaldo Carvalho de Melo (3): > perf callchain: Switch default to 'graph,0.5,caller' > perf ui browser: Optional horizontal scrolling key binding > perf hists browser: Implement horizontal scrolling > > Don Zickus (1): > perf tools: Add support for sorting on the iaddr > > Jiri Olsa (9): > tools lib api fs: No need to use PATH_MAX + 1 > perf evlist: Display DATA_SRC sample type bit > perf annotate: Fix sizeof_sym_hist overflow issue > perf tools: Export perf_event_attr__set_max_precise_ip() > perf tools: Introduce 'P' modifier to request max precision > perf tests: Add parsing test for 'P' modifier > perf tools: Setup proper width for symbol_iaddr field > perf tools: Handle -h and -v options > perf tools: Fail properly in case pattern matching fails to find tracepoint > > Matt Fleming (3): > perf tests: Add arch tests > perf tests: Move x86 tests into arch directory > perf tests: Add Intel CQM test > > tools/lib/api/fs/fs.c | 2 +- > tools/lib/api/fs/tracing_path.c | 6 +- > tools/perf/Documentation/perf-list.txt | 1 + > tools/perf/Documentation/perf-report.txt | 2 +- > tools/perf/Documentation/perf.txt | 8 ++ > tools/perf/arch/x86/Build | 2 +- > tools/perf/arch/x86/include/arch-tests.h | 19 ++++ > tools/perf/arch/x86/tests/Build | 10 +- > tools/perf/arch/x86/tests/arch-tests.c | 34 ++++++ > tools/perf/arch/x86/tests/dwarf-unwind.c | 1 + > .../perf/{ => arch/x86}/tests/gen-insn-x86-dat.awk | 0 > .../perf/{ => arch/x86}/tests/gen-insn-x86-dat.sh | 0 > tools/perf/{ => arch/x86}/tests/insn-x86-dat-32.c | 0 > tools/perf/{ => arch/x86}/tests/insn-x86-dat-64.c | 0 > tools/perf/{ => arch/x86}/tests/insn-x86-dat-src.c | 0 > tools/perf/{ => arch/x86}/tests/insn-x86.c | 3 +- > tools/perf/arch/x86/tests/intel-cqm.c | 124 +++++++++++++++++++++ > tools/perf/{ => arch/x86}/tests/perf-time-to-tsc.c | 4 +- > tools/perf/{ => arch/x86}/tests/rdpmc.c | 7 +- > tools/perf/builtin-report.c | 4 +- > tools/perf/perf.c | 14 +++ > tools/perf/tests/Build | 6 - > tools/perf/tests/builtin-test.c | 56 ++++------ > tools/perf/tests/dwarf-unwind.c | 4 + > tools/perf/tests/parse-events.c | 16 +++ > tools/perf/tests/tests.h | 10 +- > tools/perf/ui/browser.c | 14 +++ > tools/perf/ui/browser.h | 2 +- > tools/perf/ui/browsers/hists.c | 22 +++- > tools/perf/util/annotate.h | 2 +- > tools/perf/util/evlist.c | 2 +- > tools/perf/util/evlist.h | 2 + > tools/perf/util/evsel.c | 5 +- > tools/perf/util/evsel.h | 1 + > tools/perf/util/hist.c | 13 +++ > tools/perf/util/hist.h | 1 + > tools/perf/util/parse-events.c | 17 ++- > tools/perf/util/parse-events.l | 2 +- > tools/perf/util/sort.c | 37 ++++++ > tools/perf/util/sort.h | 1 + > tools/perf/util/util.c | 4 +- > 41 files changed, 381 insertions(+), 77 deletions(-) > create mode 100644 tools/perf/arch/x86/include/arch-tests.h > create mode 100644 tools/perf/arch/x86/tests/arch-tests.c > rename tools/perf/{ => arch/x86}/tests/gen-insn-x86-dat.awk (100%) > rename tools/perf/{ => arch/x86}/tests/gen-insn-x86-dat.sh (100%) > rename tools/perf/{ => arch/x86}/tests/insn-x86-dat-32.c (100%) > rename tools/perf/{ => arch/x86}/tests/insn-x86-dat-64.c (100%) > rename tools/perf/{ => arch/x86}/tests/insn-x86-dat-src.c (100%) > rename tools/perf/{ => arch/x86}/tests/insn-x86.c (98%) > create mode 100644 tools/perf/arch/x86/tests/intel-cqm.c > rename tools/perf/{ => arch/x86}/tests/perf-time-to-tsc.c (98%) > rename tools/perf/{ => arch/x86}/tests/rdpmc.c (97%) Pulled, thanks a lot Arnaldo! Ingo ^ permalink raw reply [flat|nested] 56+ messages in thread
* [GIT PULL 00/16] perf/core improvements and fixes @ 2015-09-30 21:54 Arnaldo Carvalho de Melo 2015-10-01 7:05 ` Ingo Molnar 0 siblings, 1 reply; 56+ messages in thread From: Arnaldo Carvalho de Melo @ 2015-09-30 21:54 UTC (permalink / raw) To: Ingo Molnar Cc: linux-kernel, Arnaldo Carvalho de Melo, Adrian Hunter, Borislav Petkov, Chandler Carruth, David Ahern, Dominik Dingel, Frederic Weisbecker, He Kuang, H . Peter Anvin, Jiri Olsa, linuxppc-dev, linux-tip-commits, Masami Hiramatsu, Matt Fleming, Namhyung Kim, Naveen N . Rao, Peter Zijlstra, Raphael Beamonte, Stephane Eranian, Sukadev Bhattiprolu, Thomas Gleixner, Vinson Lee, Wang Nan, Arnaldo Carvalho de Melo Hi Ingo, Please consider pulling, - Arnaldo The following changes since commit 9c17dbc6eb73bdd8a6aaea1baefd37ff78d86148: Merge tag 'perf-core-for-mingo' of git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux into perf/core (2015-09-29 09:43:46 +0200) are available in the git repository at: git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux.git tags/perf-core-for-mingo for you to fetch changes up to 7f8d1ade1b19f684ed3a7c4fb1dc5d347127b438: perf tools: By default use the most precise "cycles" hw counter available (2015-09-30 18:34:39 -0300) ---------------------------------------------------------------- perf/core improvements and fixes: User visible: - By default use the most precise "cycles" hw counter available, i.e. when the user doesn't specify any event, it will try using cycles:ppp, cycles:pp, etc (Arnaldo Carvalho de Melo) - Remove blank lines, headers when piping output in 'perf list', so that it can be sanely used with 'wc -l', etc (Arnaldo Carvalho de Melo) - Amend documentation about max_stack and synthesized callchains (Adrian Hunter) - Fix 'perf probe -l' for probes added to kernel module functions (Masami Hiramatsu) Build fixes: - Fix shadowed declarations that break the build on older distros (Jiri Olsa) - Fix build break on powerpc due to sample_reg_masks (Sukadev Bhattiprolu) Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com> ---------------------------------------------------------------- Adrian Hunter (1): perf report: Amend documentation about max_stack and synthesized callchains Arnaldo Carvalho de Melo (7): perf maps: Introduce maps__find_symbol_by_name() perf machine: Use machine__kernel_map() thoroughly perf machine: Add method for common kernel_map(FUNCTION) operation tools lib symbol: Rename kallsyms2elf_type to kallsyms2elf_binding tools lib symbol: Introduce kallsyms2elf_type perf list: Remove blank lines, headers when piping output perf tools: By default use the most precise "cycles" hw counter available Jiri Olsa (2): tools: Fix shadowed declaration in err.h perf tools: Fix shadowed declaration in parse-events.c Masami Hiramatsu (5): perf probe: Fix to remove dot suffix from second or latter events perf probe: Begin and end libdwfl report session correctly perf probe: Show correct source lines of probes on kmodules perf probe: Fix a segfault bug in debuginfo_cache perf probe: Improve error message when %return is on inlined function Sukadev Bhattiprolu (1): perf tools: Fix build break on powerpc due to sample_reg_masks tools/include/linux/err.h | 4 +- tools/lib/symbol/kallsyms.c | 6 ++ tools/lib/symbol/kallsyms.h | 4 +- tools/perf/Documentation/perf-report.txt | 2 + tools/perf/builtin-kmem.c | 2 +- tools/perf/builtin-list.c | 2 +- tools/perf/builtin-report.c | 2 +- tools/perf/tests/code-reading.c | 2 +- tools/perf/tests/vmlinux-kallsyms.c | 4 +- tools/perf/util/Build | 2 +- tools/perf/util/event.c | 7 +-- tools/perf/util/evlist.c | 22 +++++++- tools/perf/util/intel-pt.c | 2 +- tools/perf/util/machine.c | 26 ++++----- tools/perf/util/machine.h | 8 ++- tools/perf/util/map.c | 21 ++++--- tools/perf/util/map.h | 2 + tools/perf/util/parse-events.c | 53 +++++++++--------- tools/perf/util/perf_regs.c | 2 + tools/perf/util/perf_regs.h | 1 + tools/perf/util/pmu.c | 2 +- tools/perf/util/probe-event.c | 96 ++++++++++++++++++++------------ tools/perf/util/probe-finder.c | 58 +++++++++++++++++-- tools/perf/util/symbol.c | 2 +- 24 files changed, 224 insertions(+), 108 deletions(-) ^ permalink raw reply [flat|nested] 56+ messages in thread
* Re: [GIT PULL 00/16] perf/core improvements and fixes 2015-09-30 21:54 Arnaldo Carvalho de Melo @ 2015-10-01 7:05 ` Ingo Molnar 0 siblings, 0 replies; 56+ messages in thread From: Ingo Molnar @ 2015-10-01 7:05 UTC (permalink / raw) To: Arnaldo Carvalho de Melo Cc: linux-kernel, Adrian Hunter, Borislav Petkov, Chandler Carruth, David Ahern, Dominik Dingel, Frederic Weisbecker, He Kuang, H . Peter Anvin, Jiri Olsa, linuxppc-dev, linux-tip-commits, Masami Hiramatsu, Matt Fleming, Namhyung Kim, Naveen N . Rao, Peter Zijlstra, Raphael Beamonte, Stephane Eranian, Sukadev Bhattiprolu, Thomas Gleixner, Vinson Lee, Wang Nan, Arnaldo Carvalho de Melo * Arnaldo Carvalho de Melo <acme@kernel.org> wrote: > Hi Ingo, > > Please consider pulling, > > - Arnaldo > > The following changes since commit 9c17dbc6eb73bdd8a6aaea1baefd37ff78d86148: > > Merge tag 'perf-core-for-mingo' of git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux into perf/core (2015-09-29 09:43:46 +0200) > > are available in the git repository at: > > git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux.git tags/perf-core-for-mingo > > for you to fetch changes up to 7f8d1ade1b19f684ed3a7c4fb1dc5d347127b438: > > perf tools: By default use the most precise "cycles" hw counter available (2015-09-30 18:34:39 -0300) > > ---------------------------------------------------------------- > perf/core improvements and fixes: > > User visible: > > - By default use the most precise "cycles" hw counter available, i.e. > when the user doesn't specify any event, it will try using cycles:ppp, > cycles:pp, etc (Arnaldo Carvalho de Melo) That looks really useful! > - Remove blank lines, headers when piping output in 'perf list', so that it can > be sanely used with 'wc -l', etc (Arnaldo Carvalho de Melo) > > - Amend documentation about max_stack and synthesized callchains (Adrian Hunter) > > - Fix 'perf probe -l' for probes added to kernel module functions (Masami Hiramatsu) > > Build fixes: > > - Fix shadowed declarations that break the build on older distros (Jiri Olsa) > > - Fix build break on powerpc due to sample_reg_masks (Sukadev Bhattiprolu) > > Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com> > > ---------------------------------------------------------------- > Adrian Hunter (1): > perf report: Amend documentation about max_stack and synthesized callchains > > Arnaldo Carvalho de Melo (7): > perf maps: Introduce maps__find_symbol_by_name() > perf machine: Use machine__kernel_map() thoroughly > perf machine: Add method for common kernel_map(FUNCTION) operation > tools lib symbol: Rename kallsyms2elf_type to kallsyms2elf_binding > tools lib symbol: Introduce kallsyms2elf_type > perf list: Remove blank lines, headers when piping output > perf tools: By default use the most precise "cycles" hw counter available > > Jiri Olsa (2): > tools: Fix shadowed declaration in err.h > perf tools: Fix shadowed declaration in parse-events.c > > Masami Hiramatsu (5): > perf probe: Fix to remove dot suffix from second or latter events > perf probe: Begin and end libdwfl report session correctly > perf probe: Show correct source lines of probes on kmodules > perf probe: Fix a segfault bug in debuginfo_cache > perf probe: Improve error message when %return is on inlined function > > Sukadev Bhattiprolu (1): > perf tools: Fix build break on powerpc due to sample_reg_masks > > tools/include/linux/err.h | 4 +- > tools/lib/symbol/kallsyms.c | 6 ++ > tools/lib/symbol/kallsyms.h | 4 +- > tools/perf/Documentation/perf-report.txt | 2 + > tools/perf/builtin-kmem.c | 2 +- > tools/perf/builtin-list.c | 2 +- > tools/perf/builtin-report.c | 2 +- > tools/perf/tests/code-reading.c | 2 +- > tools/perf/tests/vmlinux-kallsyms.c | 4 +- > tools/perf/util/Build | 2 +- > tools/perf/util/event.c | 7 +-- > tools/perf/util/evlist.c | 22 +++++++- > tools/perf/util/intel-pt.c | 2 +- > tools/perf/util/machine.c | 26 ++++----- > tools/perf/util/machine.h | 8 ++- > tools/perf/util/map.c | 21 ++++--- > tools/perf/util/map.h | 2 + > tools/perf/util/parse-events.c | 53 +++++++++--------- > tools/perf/util/perf_regs.c | 2 + > tools/perf/util/perf_regs.h | 1 + > tools/perf/util/pmu.c | 2 +- > tools/perf/util/probe-event.c | 96 ++++++++++++++++++++------------ > tools/perf/util/probe-finder.c | 58 +++++++++++++++++-- > tools/perf/util/symbol.c | 2 +- > 24 files changed, 224 insertions(+), 108 deletions(-) Pulled, thanks a lot Arnaldo! Ingo ^ permalink raw reply [flat|nested] 56+ messages in thread
* [GIT PULL 00/16] perf/core improvements and fixes @ 2015-08-25 16:14 Arnaldo Carvalho de Melo 2015-08-26 13:39 ` Arnaldo Carvalho de Melo 0 siblings, 1 reply; 56+ messages in thread From: Arnaldo Carvalho de Melo @ 2015-08-25 16:14 UTC (permalink / raw) To: Ingo Molnar Cc: linux-kernel, Arnaldo Carvalho de Melo, Adrian Hunter, Borislav Petkov, David Ahern, Frederic Weisbecker, Jiri Olsa, Masami Hiramatsu, Namhyung Kim, pi3orama, Stephane Eranian, Wang Nan, Arnaldo Carvalho de Melo Hi Ingo, Please consider pulling, now to get back looking at eBPF :-) - Arnaldo The following changes since commit 0e53909a1cf0153736fb52c216558a65530d8c40: Merge tag 'perf-core-for-mingo' of git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux into perf/core (2015-08-22 08:45:46 +0200) are available in the git repository at: git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux.git tags/perf-core-for-mingo for you to fetch changes up to 1cbebfc7abb549ed3f4080e12c4d7b07d1f16637: perf probe: Support probing at absolute addresses (2015-08-25 12:36:22 -0300) ---------------------------------------------------------------- perf/core improvements and fixes: User visible: - Add support for using several Intel PT features (CYC, MTC packets), the relevant documentation was updated: tools/perf/Documentation/intel-pt.txt, briefly describing those packets, its purposes, how to configure them in the event config terms and relevant external documentation for further reading. (Adrian Hunter) - Introduce support for probing at an absolut address, for user and kernel 'perf probe's, useful when one have the symbol maps on a developer machine but not on an embedded system (Wang Nan) - Clear the progress bar at the end of a ordered_events flush, fixing an UI artifact when, after ordering the events the screen doesn't get completely redraw, for instance, when an error window covers just the center of the screen and waits for user input. (Arnaldo Carvalho de Melo) - Fix 'annotate' segfault by resetting the dso find_symbol cache when removing symbols (Arnaldo Carvalho de Melo) Infrastructure: - Fix Intel PT 'instructions' sample period (Adrian Hunter) - Prevent segfault when reading probe point with absolute address (Wang Nan) Build fixes: - Fix tarball build broken by pt/bts (Adrian Hunter) Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com> ---------------------------------------------------------------- Adrian Hunter (11): perf tools: Fix tarball build broken by pt/bts perf tools: Fix Intel PT 'instructions' sample period perf tools: Add Intel PT support for PSB periods perf tools: Add new Intel PT packet definitions perf tools: Pass Intel PT information for decoding MTC and CYC perf tools: Add Intel PT support for decoding MTC packets perf tools: Add Intel PT support for using MTC packets perf tools: Add Intel PT support for decoding CYC packets perf tools: Add Intel PT support for using CYC packets perf tools: Add Intel PT support for decoding TRACESTOP packets perf tools: Update Intel PT documentation Arnaldo Carvalho de Melo (3): perf annotate: Reset the dso find_symbol cache when removing symbols perf ui tui progress: Implement the ui_progress_ops->finish() method perf ordered_events: Clear the progress bar at the end of a flush Wang Nan (2): perf probe: Prevent segfault when reading probe point with absolute address perf probe: Support probing at absolute addresses tools/perf/Documentation/intel-pt.txt | 194 ++++++- tools/perf/arch/x86/util/intel-pt.c | 271 +++++++++- tools/perf/builtin-annotate.c | 1 + tools/perf/ui/tui/progress.c | 19 +- tools/perf/util/dso.h | 2 + tools/perf/util/intel-pt-decoder/inat.c | 2 +- tools/perf/util/intel-pt-decoder/inat.h | 2 +- tools/perf/util/intel-pt-decoder/inat_types.h | 29 ++ tools/perf/util/intel-pt-decoder/insn.c | 4 +- tools/perf/util/intel-pt-decoder/insn.h | 2 +- .../perf/util/intel-pt-decoder/intel-pt-decoder.c | 555 ++++++++++++++++++++- .../perf/util/intel-pt-decoder/intel-pt-decoder.h | 5 + .../util/intel-pt-decoder/intel-pt-insn-decoder.c | 2 +- .../util/intel-pt-decoder/intel-pt-pkt-decoder.c | 142 +++++- .../util/intel-pt-decoder/intel-pt-pkt-decoder.h | 6 + tools/perf/util/intel-pt.c | 67 ++- tools/perf/util/intel-pt.h | 5 + tools/perf/util/ordered-events.c | 3 + tools/perf/util/probe-event.c | 152 +++++- tools/perf/util/probe-event.h | 3 + tools/perf/util/probe-finder.c | 21 +- tools/perf/util/symbol.c | 10 + 22 files changed, 1404 insertions(+), 93 deletions(-) create mode 100644 tools/perf/util/intel-pt-decoder/inat_types.h ^ permalink raw reply [flat|nested] 56+ messages in thread
* Re: [GIT PULL 00/16] perf/core improvements and fixes 2015-08-25 16:14 Arnaldo Carvalho de Melo @ 2015-08-26 13:39 ` Arnaldo Carvalho de Melo 0 siblings, 0 replies; 56+ messages in thread From: Arnaldo Carvalho de Melo @ 2015-08-26 13:39 UTC (permalink / raw) To: Ingo Molnar Cc: linux-kernel, Adrian Hunter, Borislav Petkov, David Ahern, Frederic Weisbecker, Jiri Olsa, Masami Hiramatsu, Namhyung Kim, pi3orama, Stephane Eranian, Wang Nan Em Tue, Aug 25, 2015 at 01:14:22PM -0300, Arnaldo Carvalho de Melo escreveu: > Hi Ingo, > > Please consider pulling, now to get back looking at eBPF :-) Ingo, please do not process this one, there was a NAK from Masami to some new syntax on adding non symbolic probes, will resubmit with the new patches provided by Wang Nan that already got Masami's approval. - Arnaldo > - Arnaldo > > The following changes since commit 0e53909a1cf0153736fb52c216558a65530d8c40: > > Merge tag 'perf-core-for-mingo' of git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux into perf/core (2015-08-22 08:45:46 +0200) > > are available in the git repository at: > > git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux.git tags/perf-core-for-mingo > > for you to fetch changes up to 1cbebfc7abb549ed3f4080e12c4d7b07d1f16637: > > perf probe: Support probing at absolute addresses (2015-08-25 12:36:22 -0300) > > ---------------------------------------------------------------- > perf/core improvements and fixes: > > User visible: > > - Add support for using several Intel PT features (CYC, MTC packets), the > relevant documentation was updated: tools/perf/Documentation/intel-pt.txt, > briefly describing those packets, its purposes, how to configure them in > the event config terms and relevant external documentation for further > reading. (Adrian Hunter) > > - Introduce support for probing at an absolut address, for user and kernel > 'perf probe's, useful when one have the symbol maps on a developer machine > but not on an embedded system (Wang Nan) > > - Clear the progress bar at the end of a ordered_events flush, fixing > an UI artifact when, after ordering the events the screen doesn't get > completely redraw, for instance, when an error window covers just the > center of the screen and waits for user input. (Arnaldo Carvalho de Melo) > > - Fix 'annotate' segfault by resetting the dso find_symbol cache when removing > symbols (Arnaldo Carvalho de Melo) > > Infrastructure: > > - Fix Intel PT 'instructions' sample period (Adrian Hunter) > > - Prevent segfault when reading probe point with absolute address (Wang Nan) > > Build fixes: > > - Fix tarball build broken by pt/bts (Adrian Hunter) > > Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com> > > ---------------------------------------------------------------- > Adrian Hunter (11): > perf tools: Fix tarball build broken by pt/bts > perf tools: Fix Intel PT 'instructions' sample period > perf tools: Add Intel PT support for PSB periods > perf tools: Add new Intel PT packet definitions > perf tools: Pass Intel PT information for decoding MTC and CYC > perf tools: Add Intel PT support for decoding MTC packets > perf tools: Add Intel PT support for using MTC packets > perf tools: Add Intel PT support for decoding CYC packets > perf tools: Add Intel PT support for using CYC packets > perf tools: Add Intel PT support for decoding TRACESTOP packets > perf tools: Update Intel PT documentation > > Arnaldo Carvalho de Melo (3): > perf annotate: Reset the dso find_symbol cache when removing symbols > perf ui tui progress: Implement the ui_progress_ops->finish() method > perf ordered_events: Clear the progress bar at the end of a flush > > Wang Nan (2): > perf probe: Prevent segfault when reading probe point with absolute address > perf probe: Support probing at absolute addresses > > tools/perf/Documentation/intel-pt.txt | 194 ++++++- > tools/perf/arch/x86/util/intel-pt.c | 271 +++++++++- > tools/perf/builtin-annotate.c | 1 + > tools/perf/ui/tui/progress.c | 19 +- > tools/perf/util/dso.h | 2 + > tools/perf/util/intel-pt-decoder/inat.c | 2 +- > tools/perf/util/intel-pt-decoder/inat.h | 2 +- > tools/perf/util/intel-pt-decoder/inat_types.h | 29 ++ > tools/perf/util/intel-pt-decoder/insn.c | 4 +- > tools/perf/util/intel-pt-decoder/insn.h | 2 +- > .../perf/util/intel-pt-decoder/intel-pt-decoder.c | 555 ++++++++++++++++++++- > .../perf/util/intel-pt-decoder/intel-pt-decoder.h | 5 + > .../util/intel-pt-decoder/intel-pt-insn-decoder.c | 2 +- > .../util/intel-pt-decoder/intel-pt-pkt-decoder.c | 142 +++++- > .../util/intel-pt-decoder/intel-pt-pkt-decoder.h | 6 + > tools/perf/util/intel-pt.c | 67 ++- > tools/perf/util/intel-pt.h | 5 + > tools/perf/util/ordered-events.c | 3 + > tools/perf/util/probe-event.c | 152 +++++- > tools/perf/util/probe-event.h | 3 + > tools/perf/util/probe-finder.c | 21 +- > tools/perf/util/symbol.c | 10 + > 22 files changed, 1404 insertions(+), 93 deletions(-) > create mode 100644 tools/perf/util/intel-pt-decoder/inat_types.h > -- > To unsubscribe from this list: send the line "unsubscribe linux-kernel" in > the body of a message to majordomo@vger.kernel.org > More majordomo info at http://vger.kernel.org/majordomo-info.html > Please read the FAQ at http://www.tux.org/lkml/ ^ permalink raw reply [flat|nested] 56+ messages in thread
* [GIT PULL 00/16] perf/core improvements and fixes @ 2015-04-07 16:40 Arnaldo Carvalho de Melo 0 siblings, 0 replies; 56+ messages in thread From: Arnaldo Carvalho de Melo @ 2015-04-07 16:40 UTC (permalink / raw) To: Ingo Molnar Cc: linux-kernel, Arnaldo Carvalho de Melo, Adrian Hunter, Andrew Morton, Borislav Petkov, David Ahern, Don Zickus, Frederic Weisbecker, He Kuang, H. Peter Anvin, Jiri Olsa, John Stultz, Joonsoo Kim, Linus Torvalds, linux-mm, Minchan Kim, Namhyung Kim, Paul Mackerras, Peter Zijlstra, pi3orama, Stephane Eranian, Steven Rostedt, Thomas Gleixner, Wang Nan, Yunlong Song, Zefan Li, Arnaldo Carvalho de Melo Hi Ingo, Please consider pulling, - Arnaldo The following changes since commit 6645f3187f5beb64f7a40515cfa18f3889264ece: Merge tag 'perf-core-for-mingo' of git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux into perf/core (2015-04-03 07:00:02 +0200) are available in the git repository at: git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux.git tags/perf-core-for-mingo for you to fetch changes up to d083e5ff09eccc0afd44e02ec85f10c06271e93b: perf tools: Merge all perf_event_attr print functions (2015-04-07 13:25:05 -0300) ---------------------------------------------------------------- perf/core improvements and fixes: - Teach about perf_event_attr.clockid to 'perf record' (Peter Zijlstra) - perf sched replay improvements for high CPU core count machines (Yunlong Song) - Consider PERF_RECORD_ events with cpumode == 0 in 'perf top', removing one cause of long term memory usage buildup, i.e. not processing PERF_RECORD_EXIT events (Arnaldo Carvalho de Melo) - Respect -i option 'in perf kmem' (Jiri Olsa) Infrastructure: - Honor operator priority in libtraceevent (Namhyung Kim) - Merge all perf_event_attr print functions (Peter Zijlstra) - Check kmaps access to make code more robust (Wang Nan) - Fix inverted logic in perf_mmap__empty() (He Kuang) Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com> ---------------------------------------------------------------- Arnaldo Carvalho de Melo (1): perf top: Consider PERF_RECORD_ events with cpumode == 0 He Kuang (1): perf evlist: Fix inverted logic in perf_mmap__empty Jiri Olsa (1): perf kmem: Respect -i option Namhyung Kim (1): tools lib traceevent: Honor operator priority Peter Zijlstra (2): perf record: Add clockid parameter perf tools: Merge all perf_event_attr print functions Wang Nan (1): perf kmaps: Check kmaps to make code more robust Yunlong Song (9): perf sched replay: Use struct task_desc instead of struct task_task for correct meaning perf sched replay: Increase the MAX_PID value to fix assertion failure problem perf sched replay: Alloc the memory of pid_to_task dynamically to adapt to the unexpected change of pid_max perf sched replay: Realloc the memory of pid_to_task stepwise to adapt to the different pid_max configurations perf sched replay: Fix the segmentation fault problem caused by pr_err in threads perf sched replay: Handle the dead halt of sem_wait when create_tasks() fails for any task perf sched replay: Fix the EMFILE error caused by the limitation of the maximum open files perf sched replay: Support using -f to override perf.data file ownership perf sched replay: Use replay_repeat to calculate the runavg of cpu usage instead of the default value 10 tools/lib/traceevent/event-parse.c | 17 +- tools/perf/Documentation/perf-record.txt | 7 + tools/perf/builtin-kmem.c | 3 +- tools/perf/builtin-record.c | 80 ++++++++ tools/perf/builtin-sched.c | 67 +++++-- tools/perf/builtin-top.c | 8 +- tools/perf/perf.h | 2 + tools/perf/util/evlist.c | 2 +- tools/perf/util/evsel.c | 325 ++++++++++++++++--------------- tools/perf/util/evsel.h | 6 + tools/perf/util/header.c | 28 +-- tools/perf/util/machine.c | 5 +- tools/perf/util/map.c | 20 ++ tools/perf/util/map.h | 6 +- tools/perf/util/probe-event.c | 2 + tools/perf/util/session.c | 3 + tools/perf/util/symbol-elf.c | 16 +- tools/perf/util/symbol.c | 34 +++- 18 files changed, 422 insertions(+), 209 deletions(-) ^ permalink raw reply [flat|nested] 56+ messages in thread
* [GIT PULL 00/16] perf/core improvements and fixes @ 2014-11-19 16:03 Arnaldo Carvalho de Melo 2014-11-20 7:33 ` Ingo Molnar 0 siblings, 1 reply; 56+ messages in thread From: Arnaldo Carvalho de Melo @ 2014-11-19 16:03 UTC (permalink / raw) To: Ingo Molnar Cc: linux-kernel, Arnaldo Carvalho de Melo, Adrian Hunter, Andi Kleen, Aravinda Prasad, Borislav Petkov, Brendan Gregg, Corey Ashford, David Ahern, Don Zickus, Frederic Weisbecker, Hemant Kumar, Jean Pihet, Jiri Olsa, Kan Liang, Masami Hiramatsu, Mike Galbraith, Namhyung Kim, Oleg Nesterov, Paul Mackerras, Pekka Enberg, Peter Zijlstra, Srikar Dronamraju, Stephane Eranian, Steven Rostedt, systemtap, Vasant Hegde, WANG Chao, Arnaldo Carvalho de Melo Hi Ingo, Please consider pulling, - Arnaldo The following changes since commit 2565711fb7d7c28e0cd93c8971b520d1b10b857c: perf: Improve the perf_sample_data struct layout (2014-11-16 11:42:04 +0100) are available in the git repository at: git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux.git tags/perf-core-for-mingo for you to fetch changes up to a84808083688d82d7f1e5786ccf5df0ff7d448cb: perf tools: Only override the default :tid comm entry (2014-11-19 12:37:26 -0300) ---------------------------------------------------------------- perf/core improvements and fixes: User visible fixes: - Fallback to kallsyms when using the minimal 'ELF' loader (Arnaldo Carvalho de Melo) - Fix annotation with kcore (Adrian Hunter) - Fix up srcline histogram key formatting (Arnaldo Carvalho de Melo) - Add missing handler for PERF_RECORD_MMAP2 events in 'perf diff' (Kan Liang) User visible changes/new features: - Only print base source file for srcline histogram sort key (Andi Kleen) - Support source line numbers in annotate using a hotkey (Andi Kleen) Infrastructure: - Do not poll events that use the system_wide flag (Adrian Hunter) - Add perf-read-vdso32 and perf-read-vdsox32 to .gitignore (Adrian Hunter) perf tools: Only override the default :tid comm entry (Adrian Hunter) - Factor out adding new call chain entries (Andi Kleen) - Use al.addr to set up call chain (Andi Kleen) - Use a common function to resolve symbol or name (Andi Kleen) - Fix ftrace:function event recording (Jiri Olsa) - Move disable_buildid_cache() to util/build-id.c (Namhyung Kim) - Clean up libelf feature support code (Namhyung Kim) - fix typo in python 'perf test' (WANG Chao) Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com> ---------------------------------------------------------------- Adrian Hunter (4): perf tools: Fix annotation with kcore perf evlist: Do not poll events that use the system_wide flag perf tools: Add perf-read-vdso32 and perf-read-vdsox32 to .gitignore perf tools: Only override the default :tid comm entry Andi Kleen (5): perf callchain: Factor out adding new call chain entries perf callchain: Use al.addr to set up call chain perf callchain: Use a common function to resolve symbol or name perf tools: Only print base source file for srcline perf annotate: Support source line numbers in annotate Arnaldo Carvalho de Melo (2): perf symbols: Fallback to kallsyms when using the minimal 'ELF' loader perf hists: Fix up srcline histogram key formatting Jiri Olsa (1): perf evsel: Fix ftrace:function event recording Kan Liang (1): perf diff: Add missing handler for PERF_RECORD_MMAP2 events Namhyung Kim (2): perf build-id: Move disable_buildid_cache() to util/build-id.c perf tools: Clean up libelf feature support code WANG Chao (1): perf test: fix typo in python test tools/perf/.gitignore | 2 ++ tools/perf/Makefile.perf | 2 -- tools/perf/builtin-diff.c | 1 + tools/perf/config/Makefile | 5 ++-- tools/perf/tests/builtin-test.c | 2 +- tools/perf/ui/browsers/annotate.c | 13 +++++++++- tools/perf/ui/browsers/hists.c | 17 ------------- tools/perf/ui/gtk/hists.c | 11 +-------- tools/perf/ui/stdio/hist.c | 23 +++++++----------- tools/perf/util/annotate.c | 32 ++++++++++++++++++++---- tools/perf/util/annotate.h | 1 + tools/perf/util/build-id.c | 11 +++++++++ tools/perf/util/build-id.h | 1 + tools/perf/util/callchain.c | 19 +++++++++++++++ tools/perf/util/callchain.h | 3 +++ tools/perf/util/evlist.c | 10 +++++++- tools/perf/util/evsel.c | 8 ++++++ tools/perf/util/header.c | 10 +------- tools/perf/util/machine.c | 51 ++++++++++++++++++++++++--------------- tools/perf/util/sort.c | 2 +- tools/perf/util/srcline.c | 2 +- tools/perf/util/symbol-minimal.c | 1 - tools/perf/util/thread.c | 5 ++-- tools/perf/util/util.h | 1 - 24 files changed, 145 insertions(+), 88 deletions(-) ^ permalink raw reply [flat|nested] 56+ messages in thread
* Re: [GIT PULL 00/16] perf/core improvements and fixes 2014-11-19 16:03 Arnaldo Carvalho de Melo @ 2014-11-20 7:33 ` Ingo Molnar 0 siblings, 0 replies; 56+ messages in thread From: Ingo Molnar @ 2014-11-20 7:33 UTC (permalink / raw) To: Arnaldo Carvalho de Melo Cc: linux-kernel, Adrian Hunter, Andi Kleen, Aravinda Prasad, Borislav Petkov, Brendan Gregg, Corey Ashford, David Ahern, Don Zickus, Frederic Weisbecker, Hemant Kumar, Jean Pihet, Jiri Olsa, Kan Liang, Masami Hiramatsu, Mike Galbraith, Namhyung Kim, Oleg Nesterov, Paul Mackerras, Pekka Enberg, Peter Zijlstra, Srikar Dronamraju, Stephane Eranian, Steven Rostedt, systemtap, Vasant Hegde, WANG Chao, Arnaldo Carvalho de Melo * Arnaldo Carvalho de Melo <acme@kernel.org> wrote: > Hi Ingo, > > Please consider pulling, > > - Arnaldo > > The following changes since commit 2565711fb7d7c28e0cd93c8971b520d1b10b857c: > > perf: Improve the perf_sample_data struct layout (2014-11-16 11:42:04 +0100) > > are available in the git repository at: > > git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux.git tags/perf-core-for-mingo > > for you to fetch changes up to a84808083688d82d7f1e5786ccf5df0ff7d448cb: > > perf tools: Only override the default :tid comm entry (2014-11-19 12:37:26 -0300) > > ---------------------------------------------------------------- > perf/core improvements and fixes: > > User visible fixes: > > - Fallback to kallsyms when using the minimal 'ELF' loader (Arnaldo Carvalho de Melo) > > - Fix annotation with kcore (Adrian Hunter) > > - Fix up srcline histogram key formatting (Arnaldo Carvalho de Melo) > > - Add missing handler for PERF_RECORD_MMAP2 events in 'perf diff' (Kan Liang) > > User visible changes/new features: > > - Only print base source file for srcline histogram sort key (Andi Kleen) > > - Support source line numbers in annotate using a hotkey (Andi Kleen) > > Infrastructure: > > - Do not poll events that use the system_wide flag (Adrian Hunter) > > - Add perf-read-vdso32 and perf-read-vdsox32 to .gitignore (Adrian Hunter) > > perf tools: Only override the default :tid comm entry (Adrian Hunter) > > - Factor out adding new call chain entries (Andi Kleen) > > - Use al.addr to set up call chain (Andi Kleen) > > - Use a common function to resolve symbol or name (Andi Kleen) > > - Fix ftrace:function event recording (Jiri Olsa) > > - Move disable_buildid_cache() to util/build-id.c (Namhyung Kim) > > - Clean up libelf feature support code (Namhyung Kim) > > - fix typo in python 'perf test' (WANG Chao) > > Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com> > > ---------------------------------------------------------------- > Adrian Hunter (4): > perf tools: Fix annotation with kcore > perf evlist: Do not poll events that use the system_wide flag > perf tools: Add perf-read-vdso32 and perf-read-vdsox32 to .gitignore > perf tools: Only override the default :tid comm entry > > Andi Kleen (5): > perf callchain: Factor out adding new call chain entries > perf callchain: Use al.addr to set up call chain > perf callchain: Use a common function to resolve symbol or name > perf tools: Only print base source file for srcline > perf annotate: Support source line numbers in annotate > > Arnaldo Carvalho de Melo (2): > perf symbols: Fallback to kallsyms when using the minimal 'ELF' loader > perf hists: Fix up srcline histogram key formatting > > Jiri Olsa (1): > perf evsel: Fix ftrace:function event recording > > Kan Liang (1): > perf diff: Add missing handler for PERF_RECORD_MMAP2 events > > Namhyung Kim (2): > perf build-id: Move disable_buildid_cache() to util/build-id.c > perf tools: Clean up libelf feature support code > > WANG Chao (1): > perf test: fix typo in python test > > tools/perf/.gitignore | 2 ++ > tools/perf/Makefile.perf | 2 -- > tools/perf/builtin-diff.c | 1 + > tools/perf/config/Makefile | 5 ++-- > tools/perf/tests/builtin-test.c | 2 +- > tools/perf/ui/browsers/annotate.c | 13 +++++++++- > tools/perf/ui/browsers/hists.c | 17 ------------- > tools/perf/ui/gtk/hists.c | 11 +-------- > tools/perf/ui/stdio/hist.c | 23 +++++++----------- > tools/perf/util/annotate.c | 32 ++++++++++++++++++++---- > tools/perf/util/annotate.h | 1 + > tools/perf/util/build-id.c | 11 +++++++++ > tools/perf/util/build-id.h | 1 + > tools/perf/util/callchain.c | 19 +++++++++++++++ > tools/perf/util/callchain.h | 3 +++ > tools/perf/util/evlist.c | 10 +++++++- > tools/perf/util/evsel.c | 8 ++++++ > tools/perf/util/header.c | 10 +------- > tools/perf/util/machine.c | 51 ++++++++++++++++++++++++--------------- > tools/perf/util/sort.c | 2 +- > tools/perf/util/srcline.c | 2 +- > tools/perf/util/symbol-minimal.c | 1 - > tools/perf/util/thread.c | 5 ++-- > tools/perf/util/util.h | 1 - > 24 files changed, 145 insertions(+), 88 deletions(-) Pulled into tip:perf/core, thanks a lot Arnaldo! Ingo ^ permalink raw reply [flat|nested] 56+ messages in thread
* [GIT PULL 00/16] perf/core improvements and fixes @ 2014-08-22 16:29 Arnaldo Carvalho de Melo 2014-08-24 10:11 ` Ingo Molnar 0 siblings, 1 reply; 56+ messages in thread From: Arnaldo Carvalho de Melo @ 2014-08-22 16:29 UTC (permalink / raw) To: Ingo Molnar Cc: linux-kernel, Arnaldo Carvalho de Melo, Adrian Hunter, Andi Kleen, Corey Ashford, David Ahern, Don Zickus, Frederic Weisbecker, Jean Pihet, Jiri Olsa, Joe Mario, Michael Ellerman, Namhyung Kim, Paul Mackerras, Peter Zijlstra, Stephane Eranian, Sukadev Bhattiprolu, Arnaldo Carvalho de Melo Hi Ingo, Please consider pulling, - Arnaldo The following changes since commit af924aa35129edf338ebc8da6a4eae08cf7cb297: Merge tag 'perf-core-for-mingo' of git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux into perf/core (2014-08-18 10:14:43 +0200) are available in the git repository at: git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux.git tags/perf-core-for-mingo for you to fetch changes up to 141da74d2eff8de134c5a5ffaf2800a8a1b27517: perf hists browser: Consolidate callchain print functions in TUI (2014-08-22 13:12:15 -0300) ---------------------------------------------------------------- perf/core improvements and fixes: User visible: o Don't try to find DSOs in SYSV maps (Don Zickus) o Fallback to MAP__FUNCTION if daddr maps are NULL, i.e. addresses get looked upon more maps (Don Zickus) o Kernel fix to properly handle exited tasks, by returning POLLHUP values on perf event file descriptors. Tooling changes will come next, but were tested with this kernel fix. (Jiri Olsa) o Add +field argument support for --field option, so that one can add fields to the default list of fields to show, i.e. now one can just do: perf report --fields +pid And the pid will appear in addition to the default fields. (Jiri Olsa) Developer stuff: o More Intel PT prep stuff, including: - Add a 'perf test' for tracking with sched_switch - Add 'flush' callback to scripting API o hists browser (used in top and report) refactorings, getting rid of unused variables and reducing source code size by handling similar cases in a fewer functions (Namhyung Kim). o Explicitly include util/debug.h for powerpc, was being indirectly included, broke the build when some change made it stop being included. (Sukadev Bhattiprolu) Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com> ---------------------------------------------------------------- Adrian Hunter (4): perf tests: Add a test for tracking with sched_switch perf scripting: Add 'flush' callback to scripting API perf machine: Rename machine__get_kernel_start_addr() method perf tools: Add machine__kernel_ip() Don Zickus (2): perf symbols: Don't try to find DSOs in SYSV maps perf machine: Fallback to MAP__FUNCTION if daddr maps are NULL Jiri Olsa (4): perf: Fix perf_poll to return proper POLLHUP value perf: Add PERF_EVENT_STATE_EXIT state for events with exited task perf top: Use set_term_quiet() instead of open coded equivalent perf tools: Add +field argument support for --field option Namhyung Kim (5): perf hists browser: Get rid of unused 'remaining' variable perf hists browser: Fix children overhead dump perf hists browser: Factor out hist_browser__show_callchain_entry() perf hists browser: Cleanup callchain print functions perf hists browser: Consolidate callchain print functions in TUI Sukadev Bhattiprolu (1): perf tools powerpc: Explicitly include util/debug.h include/linux/perf_event.h | 1 + kernel/events/core.c | 12 +- tools/perf/Makefile.perf | 1 + tools/perf/arch/powerpc/util/skip-callchain-idx.c | 1 + tools/perf/builtin-script.c | 13 + tools/perf/builtin-top.c | 9 +- tools/perf/tests/builtin-test.c | 4 + tools/perf/tests/switch-tracking.c | 572 +++++++++++++++++++++ tools/perf/tests/tests.h | 1 + tools/perf/ui/browsers/hists.c | 343 +++++------- tools/perf/ui/hist.c | 6 +- tools/perf/util/event.c | 6 +- tools/perf/util/machine.c | 41 +- tools/perf/util/machine.h | 17 + tools/perf/util/map.c | 1 + .../perf/util/scripting-engines/trace-event-perl.c | 6 + .../util/scripting-engines/trace-event-python.c | 6 + tools/perf/util/sort.c | 24 +- tools/perf/util/sort.h | 1 + tools/perf/util/trace-event-scripting.c | 7 + tools/perf/util/trace-event.h | 1 + 21 files changed, 827 insertions(+), 246 deletions(-) create mode 100644 tools/perf/tests/switch-tracking.c ^ permalink raw reply [flat|nested] 56+ messages in thread
* Re: [GIT PULL 00/16] perf/core improvements and fixes 2014-08-22 16:29 Arnaldo Carvalho de Melo @ 2014-08-24 10:11 ` Ingo Molnar 2014-08-24 11:16 ` Arnaldo Carvalho de Melo 0 siblings, 1 reply; 56+ messages in thread From: Ingo Molnar @ 2014-08-24 10:11 UTC (permalink / raw) To: Arnaldo Carvalho de Melo Cc: linux-kernel, Adrian Hunter, Andi Kleen, Corey Ashford, David Ahern, Don Zickus, Frederic Weisbecker, Jean Pihet, Jiri Olsa, Joe Mario, Michael Ellerman, Namhyung Kim, Paul Mackerras, Peter Zijlstra, Stephane Eranian, Sukadev Bhattiprolu, Arnaldo Carvalho de Melo * Arnaldo Carvalho de Melo <acme@kernel.org> wrote: > Hi Ingo, > > Please consider pulling, > > - Arnaldo > > The following changes since commit af924aa35129edf338ebc8da6a4eae08cf7cb297: > > Merge tag 'perf-core-for-mingo' of git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux into perf/core (2014-08-18 10:14:43 +0200) > > are available in the git repository at: > > > git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux.git tags/perf-core-for-mingo > > for you to fetch changes up to 141da74d2eff8de134c5a5ffaf2800a8a1b27517: > > perf hists browser: Consolidate callchain print functions in TUI (2014-08-22 13:12:15 -0300) > > ---------------------------------------------------------------- > perf/core improvements and fixes: > > User visible: > > o Don't try to find DSOs in SYSV maps (Don Zickus) > > o Fallback to MAP__FUNCTION if daddr maps are NULL, > i.e. addresses get looked upon more maps (Don Zickus) > > o Kernel fix to properly handle exited tasks, by returning POLLHUP values > on perf event file descriptors. Tooling changes will come next, but were > tested with this kernel fix. (Jiri Olsa) So the signoffs are really weird (and invalid) here: commit ce1d77c667e40631d6673717e65e72f72265bf10 Author: Jiri Olsa <jolsa@kernel.org> Date: Fri Aug 15 16:24:45 2014 -0300 perf top: Use set_term_quiet() instead of open coded equivalent ... Cc: Jiri Olsa <jolsa@kernel.org> Cc: Namhyung Kim <namhyung@kernel.org> Cc: Paul Mackerras <paulus@samba.org> Cc: Peter Zijlstra <a.p.zijlstra@chello.nl> Link: http://lkml.kernel.org/n/tip-h7n9te70flmaqfnj6l06ay6r@git.kernel.org [ Yanked this out of a patch containing this and some other change ] Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com> and in 7713886adc15 and f017bf0a8d92 - results of a rebase gone wrong? Thanks, Ingo ^ permalink raw reply [flat|nested] 56+ messages in thread
* Re: [GIT PULL 00/16] perf/core improvements and fixes 2014-08-24 10:11 ` Ingo Molnar @ 2014-08-24 11:16 ` Arnaldo Carvalho de Melo 2014-08-24 14:47 ` Ingo Molnar 0 siblings, 1 reply; 56+ messages in thread From: Arnaldo Carvalho de Melo @ 2014-08-24 11:16 UTC (permalink / raw) To: Ingo Molnar Cc: linux-kernel, Adrian Hunter, Andi Kleen, Corey Ashford, David Ahern, Don Zickus, Frederic Weisbecker, Jean Pihet, Jiri Olsa, Joe Mario, Michael Ellerman, Namhyung Kim, Paul Mackerras, Peter Zijlstra, Stephane Eranian, Sukadev Bhattiprolu Em Sun, Aug 24, 2014 at 12:11:24PM +0200, Ingo Molnar escreveu: > * Arnaldo Carvalho de Melo <acme@kernel.org> wrote: > > o Kernel fix to properly handle exited tasks, by returning POLLHUP values > > on perf event file descriptors. Tooling changes will come next, but were > > tested with this kernel fix. (Jiri Olsa) > So the signoffs are really weird (and invalid) here: > commit ce1d77c667e40631d6673717e65e72f72265bf10 > Author: Jiri Olsa <jolsa@kernel.org> > Date: Fri Aug 15 16:24:45 2014 -0300 > perf top: Use set_term_quiet() instead of open coded equivalent > ... > Cc: Jiri Olsa <jolsa@kernel.org> > Cc: Namhyung Kim <namhyung@kernel.org> > Cc: Paul Mackerras <paulus@samba.org> > Cc: Peter Zijlstra <a.p.zijlstra@chello.nl> > Link: http://lkml.kernel.org/n/tip-h7n9te70flmaqfnj6l06ay6r@git.kernel.org > [ Yanked this out of a patch containing this and some other change ] > Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com> > and in 7713886adc15 and f017bf0a8d92 - results of a rebase gone wrong? Should be ok now, I recreated the perf-core-for-mingo signed tag, please let me know if there are still any problems, Thanks! - Arnaldo ^ permalink raw reply [flat|nested] 56+ messages in thread
* Re: [GIT PULL 00/16] perf/core improvements and fixes 2014-08-24 11:16 ` Arnaldo Carvalho de Melo @ 2014-08-24 14:47 ` Ingo Molnar 0 siblings, 0 replies; 56+ messages in thread From: Ingo Molnar @ 2014-08-24 14:47 UTC (permalink / raw) To: Arnaldo Carvalho de Melo Cc: linux-kernel, Adrian Hunter, Andi Kleen, Corey Ashford, David Ahern, Don Zickus, Frederic Weisbecker, Jean Pihet, Jiri Olsa, Joe Mario, Michael Ellerman, Namhyung Kim, Paul Mackerras, Peter Zijlstra, Stephane Eranian, Sukadev Bhattiprolu * Arnaldo Carvalho de Melo <acme@kernel.org> wrote: > Em Sun, Aug 24, 2014 at 12:11:24PM +0200, Ingo Molnar escreveu: > > * Arnaldo Carvalho de Melo <acme@kernel.org> wrote: > > > o Kernel fix to properly handle exited tasks, by returning POLLHUP values > > > on perf event file descriptors. Tooling changes will come next, but were > > > tested with this kernel fix. (Jiri Olsa) > > > So the signoffs are really weird (and invalid) here: > > > commit ce1d77c667e40631d6673717e65e72f72265bf10 > > Author: Jiri Olsa <jolsa@kernel.org> > > Date: Fri Aug 15 16:24:45 2014 -0300 > > > perf top: Use set_term_quiet() instead of open coded equivalent > > > ... > > Cc: Jiri Olsa <jolsa@kernel.org> > > Cc: Namhyung Kim <namhyung@kernel.org> > > Cc: Paul Mackerras <paulus@samba.org> > > Cc: Peter Zijlstra <a.p.zijlstra@chello.nl> > > Link: http://lkml.kernel.org/n/tip-h7n9te70flmaqfnj6l06ay6r@git.kernel.org > > [ Yanked this out of a patch containing this and some other change ] > > Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com> > > > and in 7713886adc15 and f017bf0a8d92 - results of a rebase gone wrong? > > Should be ok now, I recreated the perf-core-for-mingo signed tag, please > let me know if there are still any problems, Looks good now. Pulled, thanks a lot Arnaldo! Ingo ^ permalink raw reply [flat|nested] 56+ messages in thread
* [GIT PULL 00/16] perf/core improvements and fixes @ 2013-11-18 20:27 Arnaldo Carvalho de Melo 0 siblings, 0 replies; 56+ messages in thread From: Arnaldo Carvalho de Melo @ 2013-11-18 20:27 UTC (permalink / raw) To: Ingo Molnar Cc: linux-kernel, Arnaldo Carvalho de Melo, Adrian Hunter, David Ahern, Frederic Weisbecker, Jiri Olsa, Mike Galbraith, Namhyung Kim, Paul Mackerras, Pekka Enberg, Peter Zijlstra, Ramkumar Ramachandra, Stephane Eranian, Steven Rostedt, Arnaldo Carvalho de Melo From: Arnaldo Carvalho de Melo <acme@ghostprotocols.net> Hi Ingo, Please consider pulling, - Arnaldo The following changes since commit 89b4be142bf2491a94af325f5206fc2f2aa18960: Merge tag 'perf-core-for-mingo' of git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux into perf/urgent (2013-11-15 07:35:50 +0100) are available in the git repository at: git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux tags/perf-core-for-mingo for you to fetch changes up to b4070e339cd39e4b56604eddc3e93ee355027d45: tools lib traceevent: Fix use of multiple options in processing field (2013-11-18 17:09:51 -0300) ---------------------------------------------------------------- perf/core improvements and fixes: . Make per-cpu mmaps the default, having inheritance turned on for all targets but --tid/-t. From Adrian Hunter. . Make 'perf top -g' refer to callchains, not --groups, to be consistent with 'perf record -g'. From David Ahern. . Tag thread comm as overriden, fix from Frederic Weisbecker. . Fix bogus group name when processing the perf.data file header, from Namhyung Kim. . Fix possible memory leaks in process_group_desc(), from Namhyung Kim. . Remove 'perf trace' thread summary coloring, from Pekka Enberg. . Generalize the bash completion scripts and make it work with zsh too, from Ramkumar Ramachandra. . Fix use of multiple options in processing field in libtraceevent, from Steven Rostedt. Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com> ---------------------------------------------------------------- Adrian Hunter (4): perf record: Make per-cpu mmaps the default. perf tools: Allow '--inherit' as the negation of '--no-inherit' perf tools: Add option macro OPT_BOOLEAN_SET perf record: Default -t option to no inheritance David Ahern (1): perf top: Make -g refer to callchains Frederic Weisbecker (1): perf tools: Tag thread comm as overriden Namhyung Kim (3): perf header: Fix bogus group name perf header: Fix possible memory leaks in process_group_desc() perf script: Move evname print code to process_event() Pekka Enberg (1): perf trace: Remove thread summary coloring Ramkumar Ramachandra (5): perf completion: introduce a layer of indirection perf completion: factor out compgen stuff perf completion: factor out call to __ltrim_colon_completions perf completion: introduce zsh support perf completion: rename file to reflect zsh support Steven Rostedt (1): tools lib traceevent: Fix use of multiple options in processing field tools/lib/traceevent/event-parse.c | 23 ++++- tools/perf/Documentation/perf-record.txt | 12 +-- tools/perf/Documentation/perf-top.txt | 5 +- tools/perf/Makefile.perf | 4 +- tools/perf/builtin-record.c | 13 ++- tools/perf/builtin-script.c | 11 ++- tools/perf/builtin-top.c | 4 +- tools/perf/builtin-trace.c | 13 +-- tools/perf/{bash_completion => perf-completion.sh} | 100 ++++++++++++++++++--- tools/perf/perf.h | 1 + tools/perf/tests/attr/test-record-no-inherit | 2 +- tools/perf/util/evlist.c | 6 +- tools/perf/util/evsel.c | 5 +- tools/perf/util/header.c | 6 +- tools/perf/util/parse-options.c | 21 +++++ tools/perf/util/parse-options.h | 8 ++ tools/perf/util/target.c | 11 ++- tools/perf/util/target.h | 4 +- tools/perf/util/thread.c | 11 ++- 19 files changed, 198 insertions(+), 62 deletions(-) rename tools/perf/{bash_completion => perf-completion.sh} (65%) ^ permalink raw reply [flat|nested] 56+ messages in thread
* [GIT PULL 00/16] perf/core improvements and fixes @ 2013-10-23 20:57 Arnaldo Carvalho de Melo 2013-10-24 6:52 ` Ingo Molnar 0 siblings, 1 reply; 56+ messages in thread From: Arnaldo Carvalho de Melo @ 2013-10-23 20:57 UTC (permalink / raw) To: Ingo Molnar Cc: linux-kernel, Arnaldo Carvalho de Melo, Adrian Hunter, Andi Kleen, David Ahern, Frederic Weisbecker, Hitoshi Mitake, Jiri Olsa, Linus Torvalds, Masami Hiramatsu, Mike Galbraith, Namhyung Kim, Paul Mackerras, Peter Zijlstra, Stephane Eranian, Arnaldo Carvalho de Melo From: Arnaldo Carvalho de Melo <acme@ghostprotocols.net> Hi Ingo, Please consider pulling, - Arnaldo The following changes since commit aa30a2e03a453aad9fd96c3f2d4a82c3497674e5: Merge tag 'perf-core-for-mingo' of git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux into perf/core (2013-10-23 09:45:50 +0200) are available in the git repository at: git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux tags/perf-core-for-mingo for you to fetch changes up to c1fb5651bb40f9efaf32d280f39e06df7e352673: perf tools: Show progress on histogram collapsing (2013-10-23 15:48:24 -0300) ---------------------------------------------------------------- perf/core improvements and fixes: . Show progress on histogram collapsing, that can take a long time, from Namhyung Kim. . Support "$vars" meta argument syntax for local variables, allowing asking for all possible variables at a given probe point to be collected when it hits, from Masami Hiramatsu. . Address the root cause of that 'perf sched' stack initialization build slowdown, by programmatically setting a big array after moving the global variable back to the stack. Fix from Adrian Hunter. . Do not repipe attributes to a perf.data file in 'perf inject', fix from Adrian Hunter . Change the procps visible command-name of invididual benchmark tests plus cleanups, from Ingo Molnar. . Do not accept parse_tag_value() overflow, fix from Adrian Hunter. . Validate that mmap_pages is not too big. From Adrian Hunter. . Fix non-debug build, from Adrian Hunter . Clarify the "sample parsing" test entry. . Consider PERF_SAMPLE_TRANSACTION in the "sample parsing" test. Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com> ---------------------------------------------------------------- Adrian Hunter (7): perf sched: Make struct perf_sched sched a local variable perf sched: Optimize build time perf script: Make perf_script a local variable perf inject: Do not repipe attributes to a perf.data file perf tools: Do not accept parse_tag_value() overflow perf evlist: Validate that mmap_pages is not too big perf tools: Fix non-debug build Arnaldo Carvalho de Melo (5): perf test: Clarify the "sample parsing" test entry perf test: Consider PERF_SAMPLE_TRANSACTION in the "sample parsing" test perf tools: Stop using 'self' in some more places perf ui: Rename ui_progress to ui_progress_ops perf ui progress: Per progress bar state Ingo Molnar (1): perf bench: Change the procps visible command-name of invididual benchmark tests plus cleanups Masami Hiramatsu (2): perf probe: Support "$vars" meta argument syntax for local variables perf probe: Find fentry mcount fuzzed parameter location Namhyung Kim (1): perf tools: Show progress on histogram collapsing tools/perf/Makefile.perf | 1 + tools/perf/builtin-annotate.c | 6 +- tools/perf/builtin-bench.c | 239 +++++++++++++++++++------------------- tools/perf/builtin-diff.c | 7 +- tools/perf/builtin-inject.c | 27 +++-- tools/perf/builtin-report.c | 24 ++-- tools/perf/builtin-sched.c | 44 +++---- tools/perf/builtin-script.c | 40 ++++--- tools/perf/builtin-top.c | 4 +- tools/perf/config/Makefile | 4 + tools/perf/tests/hists_link.c | 2 +- tools/perf/tests/sample-parsing.c | 4 +- tools/perf/ui/gtk/gtk.h | 2 +- tools/perf/ui/gtk/progress.c | 20 ++-- tools/perf/ui/gtk/setup.c | 2 +- tools/perf/ui/progress.c | 32 +++-- tools/perf/ui/progress.h | 19 +-- tools/perf/ui/tui/progress.c | 15 +-- tools/perf/ui/tui/setup.c | 3 +- tools/perf/ui/tui/tui.h | 6 + tools/perf/util/build-id.c | 6 +- tools/perf/util/evlist.c | 14 ++- tools/perf/util/hist.c | 23 ++-- tools/perf/util/hist.h | 3 +- tools/perf/util/probe-event.c | 1 - tools/perf/util/probe-finder.c | 133 ++++++++++++++++++--- tools/perf/util/probe-finder.h | 1 + tools/perf/util/session.c | 24 ++-- tools/perf/util/sort.c | 124 ++++++++++---------- tools/perf/util/strfilter.c | 46 ++++---- tools/perf/util/thread.c | 72 ++++++------ tools/perf/util/util.c | 2 + 32 files changed, 560 insertions(+), 390 deletions(-) create mode 100644 tools/perf/ui/tui/tui.h ^ permalink raw reply [flat|nested] 56+ messages in thread
* Re: [GIT PULL 00/16] perf/core improvements and fixes 2013-10-23 20:57 Arnaldo Carvalho de Melo @ 2013-10-24 6:52 ` Ingo Molnar 0 siblings, 0 replies; 56+ messages in thread From: Ingo Molnar @ 2013-10-24 6:52 UTC (permalink / raw) To: Arnaldo Carvalho de Melo Cc: linux-kernel, Arnaldo Carvalho de Melo, Adrian Hunter, Andi Kleen, David Ahern, Frederic Weisbecker, Hitoshi Mitake, Jiri Olsa, Linus Torvalds, Masami Hiramatsu, Mike Galbraith, Namhyung Kim, Paul Mackerras, Peter Zijlstra, Stephane Eranian, Arnaldo Carvalho de Melo * Arnaldo Carvalho de Melo <acme@infradead.org> wrote: > From: Arnaldo Carvalho de Melo <acme@ghostprotocols.net> > > Hi Ingo, > > Please consider pulling, > > - Arnaldo > > The following changes since commit aa30a2e03a453aad9fd96c3f2d4a82c3497674e5: > > Merge tag 'perf-core-for-mingo' of git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux into perf/core (2013-10-23 09:45:50 +0200) > > are available in the git repository at: > > > git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux tags/perf-core-for-mingo > > for you to fetch changes up to c1fb5651bb40f9efaf32d280f39e06df7e352673: > > perf tools: Show progress on histogram collapsing (2013-10-23 15:48:24 -0300) > > ---------------------------------------------------------------- > perf/core improvements and fixes: > > . Show progress on histogram collapsing, that can take a long time, from > Namhyung Kim. > > . Support "$vars" meta argument syntax for local variables, allowing > asking for all possible variables at a given probe point to be > collected when it hits, from Masami Hiramatsu. > > . Address the root cause of that 'perf sched' stack initialization build > slowdown, by programmatically setting a big array after moving the > global variable back to the stack. Fix from Adrian Hunter. > > . Do not repipe attributes to a perf.data file in 'perf inject', > fix from Adrian Hunter > > . Change the procps visible command-name of invididual benchmark tests > plus cleanups, from Ingo Molnar. > > . Do not accept parse_tag_value() overflow, fix from Adrian Hunter. > > . Validate that mmap_pages is not too big. From Adrian Hunter. > > . Fix non-debug build, from Adrian Hunter > > . Clarify the "sample parsing" test entry. > > . Consider PERF_SAMPLE_TRANSACTION in the "sample parsing" test. > > Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com> > > ---------------------------------------------------------------- > Adrian Hunter (7): > perf sched: Make struct perf_sched sched a local variable > perf sched: Optimize build time > perf script: Make perf_script a local variable > perf inject: Do not repipe attributes to a perf.data file > perf tools: Do not accept parse_tag_value() overflow > perf evlist: Validate that mmap_pages is not too big > perf tools: Fix non-debug build > > Arnaldo Carvalho de Melo (5): > perf test: Clarify the "sample parsing" test entry > perf test: Consider PERF_SAMPLE_TRANSACTION in the "sample parsing" test > perf tools: Stop using 'self' in some more places > perf ui: Rename ui_progress to ui_progress_ops > perf ui progress: Per progress bar state > > Ingo Molnar (1): > perf bench: Change the procps visible command-name of invididual benchmark tests plus cleanups > > Masami Hiramatsu (2): > perf probe: Support "$vars" meta argument syntax for local variables > perf probe: Find fentry mcount fuzzed parameter location > > Namhyung Kim (1): > perf tools: Show progress on histogram collapsing > > tools/perf/Makefile.perf | 1 + > tools/perf/builtin-annotate.c | 6 +- > tools/perf/builtin-bench.c | 239 +++++++++++++++++++------------------- > tools/perf/builtin-diff.c | 7 +- > tools/perf/builtin-inject.c | 27 +++-- > tools/perf/builtin-report.c | 24 ++-- > tools/perf/builtin-sched.c | 44 +++---- > tools/perf/builtin-script.c | 40 ++++--- > tools/perf/builtin-top.c | 4 +- > tools/perf/config/Makefile | 4 + > tools/perf/tests/hists_link.c | 2 +- > tools/perf/tests/sample-parsing.c | 4 +- > tools/perf/ui/gtk/gtk.h | 2 +- > tools/perf/ui/gtk/progress.c | 20 ++-- > tools/perf/ui/gtk/setup.c | 2 +- > tools/perf/ui/progress.c | 32 +++-- > tools/perf/ui/progress.h | 19 +-- > tools/perf/ui/tui/progress.c | 15 +-- > tools/perf/ui/tui/setup.c | 3 +- > tools/perf/ui/tui/tui.h | 6 + > tools/perf/util/build-id.c | 6 +- > tools/perf/util/evlist.c | 14 ++- > tools/perf/util/hist.c | 23 ++-- > tools/perf/util/hist.h | 3 +- > tools/perf/util/probe-event.c | 1 - > tools/perf/util/probe-finder.c | 133 ++++++++++++++++++--- > tools/perf/util/probe-finder.h | 1 + > tools/perf/util/session.c | 24 ++-- > tools/perf/util/sort.c | 124 ++++++++++---------- > tools/perf/util/strfilter.c | 46 ++++---- > tools/perf/util/thread.c | 72 ++++++------ > tools/perf/util/util.c | 2 + > 32 files changed, 560 insertions(+), 390 deletions(-) > create mode 100644 tools/perf/ui/tui/tui.h Pulled, thanks Arnaldo! Ingo ^ permalink raw reply [flat|nested] 56+ messages in thread
* [GIT PULL 00/16] perf/core improvements and fixes @ 2012-02-14 1:52 Arnaldo Carvalho de Melo 2012-02-14 2:50 ` Namhyung Kim 0 siblings, 1 reply; 56+ messages in thread From: Arnaldo Carvalho de Melo @ 2012-02-14 1:52 UTC (permalink / raw) To: Ingo Molnar Cc: linux-kernel, Arnaldo Carvalho de Melo, Andi Kleen, Anshuman Khandual, Arun Sharma, Corey Ashford, David Ahern, Frederic Weisbecker, Jason Wang, Jiri Olsa, Joerg Roedel, Lin Ming, Namhyung Kim, Namhyung Kim, Paul Mackerras, Peter Zijlstra, Roberto Agostino Vitillo, Robert Richter, Stephane Eranian, Thomas Gleixner, Vince Weaver, arnaldo.melo, Arnaldo Carvalho de Melo The following changes since commit c98fdeaa92731308ed80386261fa2589addefa47: x86/sched/perf/AMD: Set sched_clock_stable (2012-02-07 13:12:08 +0100) are available in the git repository at: git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux perf-core-for-mingo for you to fetch changes up to 7e1ccd3804281fc0755eb726b654469c40a96d89: perf tools: cleanup initialization of attr->size (2012-02-13 23:35:04 -0200) ---------------------------------------------------------------- perf/core fixes and improvements. Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com> ---------------------------------------------------------------- David Ahern (3): perf record: No build id option fails perf tools: Fix out of tree compiles perf tools: Allow multiple threads or processes in record, stat, top Jiri Olsa (3): perf tools: Remove unused functions from debugfs object perf tools: Add sysfs mountpoint interface perf tools: Add bitmap_or function into bitmap object Joerg Roedel (2): perf top: Don't process samples with no valid machine object perf tools: Change perf_guest default back to false Namhyung Kim (4): perf tools: Fix build dependency of perf python extension perf tools: Implement islower/isupper macro into util.h perf tools: ctype.c only wants util.h perf tools: Get rid of ctype.h in symbol.c Robert Richter (2): perf tools: Moving code in header.c perf tools: Factor out feature op to process header sections Stephane Eranian (2): perf tools: fix endianness detection in perf.data perf tools: cleanup initialization of attr->size tools/perf/Documentation/perf-record.txt | 4 +- tools/perf/Documentation/perf-stat.txt | 4 +- tools/perf/Documentation/perf-top.txt | 4 +- tools/perf/MANIFEST | 1 + tools/perf/Makefile | 7 +- tools/perf/builtin-record.c | 14 +- tools/perf/builtin-stat.c | 31 ++- tools/perf/builtin-test.c | 2 - tools/perf/builtin-top.c | 18 +- tools/perf/perf.h | 5 +- tools/perf/util/bitmap.c | 10 + tools/perf/util/ctype.c | 2 +- tools/perf/util/debugfs.c | 141 ---------- tools/perf/util/debugfs.h | 6 - tools/perf/util/evlist.c | 10 +- tools/perf/util/evlist.h | 4 +- tools/perf/util/evsel.c | 2 +- tools/perf/util/header.c | 421 +++++++++++++++++------------- tools/perf/util/hist.h | 1 + tools/perf/util/include/linux/bitmap.h | 11 + tools/perf/util/python-ext-sources | 19 ++ tools/perf/util/session.c | 10 + tools/perf/util/setup.py | 8 +- tools/perf/util/symbol.c | 2 +- tools/perf/util/sysfs.c | 60 +++++ tools/perf/util/sysfs.h | 6 + tools/perf/util/thread_map.c | 128 +++++++++ tools/perf/util/thread_map.h | 4 + tools/perf/util/top.c | 10 +- tools/perf/util/top.h | 2 +- tools/perf/util/usage.c | 6 +- tools/perf/util/util.c | 4 +- tools/perf/util/util.h | 6 +- 33 files changed, 571 insertions(+), 392 deletions(-) create mode 100644 tools/perf/util/python-ext-sources create mode 100644 tools/perf/util/sysfs.c create mode 100644 tools/perf/util/sysfs.h ^ permalink raw reply [flat|nested] 56+ messages in thread
* Re: [GIT PULL 00/16] perf/core improvements and fixes 2012-02-14 1:52 Arnaldo Carvalho de Melo @ 2012-02-14 2:50 ` Namhyung Kim 2012-02-14 3:07 ` Namhyung Kim 0 siblings, 1 reply; 56+ messages in thread From: Namhyung Kim @ 2012-02-14 2:50 UTC (permalink / raw) To: Arnaldo Carvalho de Melo Cc: Ingo Molnar, linux-kernel, Andi Kleen, Anshuman Khandual, Arun Sharma, Corey Ashford, David Ahern, Frederic Weisbecker, Jason Wang, Jiri Olsa, Joerg Roedel, Lin Ming, Namhyung Kim, Paul Mackerras, Peter Zijlstra, Roberto Agostino Vitillo, Robert Richter, Stephane Eranian, Thomas Gleixner, Vince Weaver, arnaldo.melo, Arnaldo Carvalho de Melo Hello, 2012-02-14 10:52 AM, Arnaldo Carvalho de Melo wrote: > The following changes since commit c98fdeaa92731308ed80386261fa2589addefa47: > > x86/sched/perf/AMD: Set sched_clock_stable (2012-02-07 13:12:08 +0100) > > are available in the git repository at: > > git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux perf-core-for-mingo > > for you to fetch changes up to 7e1ccd3804281fc0755eb726b654469c40a96d89: > > perf tools: cleanup initialization of attr->size (2012-02-13 23:35:04 -0200) > > ---------------------------------------------------------------- > perf/core fixes and improvements. > > Signed-off-by: Arnaldo Carvalho de Melo<acme@redhat.com> > I've got a following error during a simple test: $ ./perf record -- sleep 1 Error: sys_perf_event_open() syscall returned with 22 (Invalid argument). /bin/dmesg may provide additional information. Fatal: No CONFIG_PERF_EVENTS=y kernel support configured? sleep: Terminated Any thoughts? Thanks, Namhyung > ---------------------------------------------------------------- > David Ahern (3): > perf record: No build id option fails > perf tools: Fix out of tree compiles > perf tools: Allow multiple threads or processes in record, stat, top > > Jiri Olsa (3): > perf tools: Remove unused functions from debugfs object > perf tools: Add sysfs mountpoint interface > perf tools: Add bitmap_or function into bitmap object > > Joerg Roedel (2): > perf top: Don't process samples with no valid machine object > perf tools: Change perf_guest default back to false > > Namhyung Kim (4): > perf tools: Fix build dependency of perf python extension > perf tools: Implement islower/isupper macro into util.h > perf tools: ctype.c only wants util.h > perf tools: Get rid of ctype.h in symbol.c > > Robert Richter (2): > perf tools: Moving code in header.c > perf tools: Factor out feature op to process header sections > > Stephane Eranian (2): > perf tools: fix endianness detection in perf.data > perf tools: cleanup initialization of attr->size > > tools/perf/Documentation/perf-record.txt | 4 +- > tools/perf/Documentation/perf-stat.txt | 4 +- > tools/perf/Documentation/perf-top.txt | 4 +- > tools/perf/MANIFEST | 1 + > tools/perf/Makefile | 7 +- > tools/perf/builtin-record.c | 14 +- > tools/perf/builtin-stat.c | 31 ++- > tools/perf/builtin-test.c | 2 - > tools/perf/builtin-top.c | 18 +- > tools/perf/perf.h | 5 +- > tools/perf/util/bitmap.c | 10 + > tools/perf/util/ctype.c | 2 +- > tools/perf/util/debugfs.c | 141 ---------- > tools/perf/util/debugfs.h | 6 - > tools/perf/util/evlist.c | 10 +- > tools/perf/util/evlist.h | 4 +- > tools/perf/util/evsel.c | 2 +- > tools/perf/util/header.c | 421 +++++++++++++++++------------- > tools/perf/util/hist.h | 1 + > tools/perf/util/include/linux/bitmap.h | 11 + > tools/perf/util/python-ext-sources | 19 ++ > tools/perf/util/session.c | 10 + > tools/perf/util/setup.py | 8 +- > tools/perf/util/symbol.c | 2 +- > tools/perf/util/sysfs.c | 60 +++++ > tools/perf/util/sysfs.h | 6 + > tools/perf/util/thread_map.c | 128 +++++++++ > tools/perf/util/thread_map.h | 4 + > tools/perf/util/top.c | 10 +- > tools/perf/util/top.h | 2 +- > tools/perf/util/usage.c | 6 +- > tools/perf/util/util.c | 4 +- > tools/perf/util/util.h | 6 +- > 33 files changed, 571 insertions(+), 392 deletions(-) > create mode 100644 tools/perf/util/python-ext-sources > create mode 100644 tools/perf/util/sysfs.c > create mode 100644 tools/perf/util/sysfs.h ^ permalink raw reply [flat|nested] 56+ messages in thread
* Re: [GIT PULL 00/16] perf/core improvements and fixes 2012-02-14 2:50 ` Namhyung Kim @ 2012-02-14 3:07 ` Namhyung Kim 2012-02-14 5:10 ` Namhyung Kim 0 siblings, 1 reply; 56+ messages in thread From: Namhyung Kim @ 2012-02-14 3:07 UTC (permalink / raw) To: linux-kernel Cc: linux-kernel, Ingo Molnar, Andi Kleen, Anshuman Khandual, Arun Sharma, Corey Ashford, David Ahern, Frederic Weisbecker, Jason Wang, Jiri Olsa, Joerg Roedel, Lin Ming, Namhyung Kim, Paul Mackerras, Peter Zijlstra, Roberto Agostino Vitillo, Robert Richter, Stephane Eranian, Thomas Gleixner, Vince Weaver, arnaldo.melo, Arnaldo Carvalho de Melo 2012-02-14 오전 11:50, Namhyung Kim 쓴 글: > Hello, > > 2012-02-14 10:52 AM, Arnaldo Carvalho de Melo wrote: >> The following changes since commit c98fdeaa92731308ed80386261fa2589addefa47: >> >> x86/sched/perf/AMD: Set sched_clock_stable (2012-02-07 13:12:08 +0100) >> >> are available in the git repository at: >> >> git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux perf-core-for-mingo >> >> for you to fetch changes up to 7e1ccd3804281fc0755eb726b654469c40a96d89: >> >> perf tools: cleanup initialization of attr->size (2012-02-13 23:35:04 -0200) >> >> ---------------------------------------------------------------- >> perf/core fixes and improvements. >> >> Signed-off-by: Arnaldo Carvalho de Melo<acme@redhat.com> >> > > I've got a following error during a simple test: > > $ ./perf record -- sleep 1 > > Error: sys_perf_event_open() syscall returned with 22 (Invalid argument). > /bin/dmesg may provide additional information. > > Fatal: No CONFIG_PERF_EVENTS=y kernel support configured? > > sleep: Terminated > > > Any thoughts? > > Thanks, > Namhyung > git bisect told me: c4a7dca92bbb9881a5d678720f1d0c2153499749 is the first bad commit commit c4a7dca92bbb9881a5d678720f1d0c2153499749 Author: Joerg Roedel <joerg.roedel@amd.com> Date: Fri Feb 10 18:05:05 2012 +0100 perf tools: Change perf_guest default back to false Setting perf_guest to true by default makes no sense because the perf subcommands can not setup guest symbol information and thus not process and guest samples. The only exception is perf-kvm which changes the perf_guest value on its own. So change the default for perf_guest back to false. Cc: David Ahern <dsahern@gmail.com> Cc: Ingo Molnar <mingo@elte.hu> Cc: Jason Wang <jasowang@redhat.com> Cc: Paul Mackerras <paulus@samba.org> Cc: Peter Zijlstra <a.p.zijlstra@chello.nl> Link: http://lkml.kernel.org/r/1328893505-4115-3-git-send-email-joerg.roedel@amd.com Signed-off-by: Joerg Roedel <joerg.roedel@amd.com> Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com> :040000 040000 64d65944c9081dfd9ace65dbc19fca655cbdc708 4f0e86ab85314ba68f2295ae1e599f3bb5ab9484 M tools Thanks, Namhyung ^ permalink raw reply [flat|nested] 56+ messages in thread
* Re: [GIT PULL 00/16] perf/core improvements and fixes 2012-02-14 3:07 ` Namhyung Kim @ 2012-02-14 5:10 ` Namhyung Kim 2012-02-14 5:23 ` David Ahern 0 siblings, 1 reply; 56+ messages in thread From: Namhyung Kim @ 2012-02-14 5:10 UTC (permalink / raw) To: Namhyung Kim Cc: linux-kernel, Ingo Molnar, Andi Kleen, Anshuman Khandual, Arun Sharma, Corey Ashford, David Ahern, Frederic Weisbecker, Jason Wang, Jiri Olsa, Joerg Roedel, Lin Ming, Paul Mackerras, Peter Zijlstra, Roberto Agostino Vitillo, Robert Richter, Stephane Eranian, Thomas Gleixner, Vince Weaver, arnaldo.melo, Arnaldo Carvalho de Melo 2012-02-14 12:07 PM, Namhyung Kim wrote: > 2012-02-14 오전 11:50, Namhyung Kim 쓴 글: >> Hello, >> >> 2012-02-14 10:52 AM, Arnaldo Carvalho de Melo wrote: >>> The following changes since commit c98fdeaa92731308ed80386261fa2589addefa47: >>> >>> x86/sched/perf/AMD: Set sched_clock_stable (2012-02-07 13:12:08 +0100) >>> >>> are available in the git repository at: >>> >>> git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux perf-core-for-mingo >>> >>> for you to fetch changes up to 7e1ccd3804281fc0755eb726b654469c40a96d89: >>> >>> perf tools: cleanup initialization of attr->size (2012-02-13 23:35:04 -0200) >>> >>> ---------------------------------------------------------------- >>> perf/core fixes and improvements. >>> >>> Signed-off-by: Arnaldo Carvalho de Melo<acme@redhat.com> >>> >> >> I've got a following error during a simple test: >> >> $ ./perf record -- sleep 1 >> >> Error: sys_perf_event_open() syscall returned with 22 (Invalid argument). >> /bin/dmesg may provide additional information. >> >> Fatal: No CONFIG_PERF_EVENTS=y kernel support configured? >> >> sleep: Terminated >> >> >> Any thoughts? >> >> Thanks, >> Namhyung >> > > git bisect told me: > > c4a7dca92bbb9881a5d678720f1d0c2153499749 is the first bad commit > commit c4a7dca92bbb9881a5d678720f1d0c2153499749 > Author: Joerg Roedel<joerg.roedel@amd.com> > Date: Fri Feb 10 18:05:05 2012 +0100 > > perf tools: Change perf_guest default back to false > > Setting perf_guest to true by default makes no sense because the perf > subcommands can not setup guest symbol information and thus not process > and guest samples. The only exception is perf-kvm which changes the > perf_guest value on its own. So change the default for perf_guest back > to false. > > Cc: David Ahern<dsahern@gmail.com> > Cc: Ingo Molnar<mingo@elte.hu> > Cc: Jason Wang<jasowang@redhat.com> > Cc: Paul Mackerras<paulus@samba.org> > Cc: Peter Zijlstra<a.p.zijlstra@chello.nl> > Link: http://lkml.kernel.org/r/1328893505-4115-3-git-send-email-joerg.roedel@amd.com > Signed-off-by: Joerg Roedel<joerg.roedel@amd.com> > Signed-off-by: Arnaldo Carvalho de Melo<acme@redhat.com> > > :040000 040000 64d65944c9081dfd9ace65dbc19fca655cbdc708 4f0e86ab85314ba68f2295ae1e599f3bb5ab9484 M tools > > > Thanks, > Namhyung > It happened because I used old kernel version that isn't aware of the exclude_guest bit. So the check on attr->__reserved_1 in core.c::perf_copy_attr() failed and then ended up seeing EINVAL at perf_evsel__open(). So I think it should be handled in the same way of handling sample_id_all bit in perf record and top if we want to set perf_guest to false by default. I'll send patches for them soon. Thanks, Namhyung ^ permalink raw reply [flat|nested] 56+ messages in thread
* Re: [GIT PULL 00/16] perf/core improvements and fixes 2012-02-14 5:10 ` Namhyung Kim @ 2012-02-14 5:23 ` David Ahern 2012-02-14 5:48 ` Namhyung Kim 2012-02-14 10:50 ` Joerg Roedel 0 siblings, 2 replies; 56+ messages in thread From: David Ahern @ 2012-02-14 5:23 UTC (permalink / raw) To: Namhyung Kim, Joerg Roedel Cc: Namhyung Kim, linux-kernel, Ingo Molnar, Andi Kleen, Anshuman Khandual, Arun Sharma, Corey Ashford, Frederic Weisbecker, Jason Wang, Jiri Olsa, Lin Ming, Paul Mackerras, Peter Zijlstra, Roberto Agostino Vitillo, Robert Richter, Stephane Eranian, Thomas Gleixner, Vince Weaver, arnaldo.melo, Arnaldo Carvalho de Melo On 2/13/12 10:10 PM, Namhyung Kim wrote: >> > > It happened because I used old kernel version that isn't aware of the > exclude_guest bit. So the check on attr->__reserved_1 in > core.c::perf_copy_attr() failed and then ended up seeing EINVAL at > perf_evsel__open(). comes back to 1aed2671738785e8f5aea663a6fda91aa7ef59b5 again. See https://lkml.org/lkml/2012/2/8/234 > > So I think it should be handled in the same way of handling > sample_id_all bit in perf record and top if we want to set perf_guest to > false by default. I'll send patches for them soon. > > Thanks, > Namhyung ^ permalink raw reply [flat|nested] 56+ messages in thread
* Re: [GIT PULL 00/16] perf/core improvements and fixes 2012-02-14 5:23 ` David Ahern @ 2012-02-14 5:48 ` Namhyung Kim 2012-02-14 5:52 ` David Ahern 2012-02-14 10:50 ` Joerg Roedel 1 sibling, 1 reply; 56+ messages in thread From: Namhyung Kim @ 2012-02-14 5:48 UTC (permalink / raw) To: David Ahern Cc: Joerg Roedel, Namhyung Kim, linux-kernel, Ingo Molnar, Andi Kleen, Anshuman Khandual, Arun Sharma, Corey Ashford, Frederic Weisbecker, Jason Wang, Jiri Olsa, Lin Ming, Paul Mackerras, Peter Zijlstra, Roberto Agostino Vitillo, Robert Richter, Stephane Eranian, Thomas Gleixner, Vince Weaver, arnaldo.melo, Arnaldo Carvalho de Melo Hi, 2012-02-14 2:23 PM, David Ahern wrote: > On 2/13/12 10:10 PM, Namhyung Kim wrote: >>> >> >> It happened because I used old kernel version that isn't aware of the >> exclude_guest bit. So the check on attr->__reserved_1 in >> core.c::perf_copy_attr() failed and then ended up seeing EINVAL at >> perf_evsel__open(). > > comes back to 1aed2671738785e8f5aea663a6fda91aa7ef59b5 again. See > > https://lkml.org/lkml/2012/2/8/234 > Thanks for the pointer, then I'll wait until the guys figure out what the correct fix is :) Thanks, Namhyung ^ permalink raw reply [flat|nested] 56+ messages in thread
* Re: [GIT PULL 00/16] perf/core improvements and fixes 2012-02-14 5:48 ` Namhyung Kim @ 2012-02-14 5:52 ` David Ahern 2012-02-14 5:58 ` Namhyung Kim 0 siblings, 1 reply; 56+ messages in thread From: David Ahern @ 2012-02-14 5:52 UTC (permalink / raw) To: Namhyung Kim Cc: Joerg Roedel, Namhyung Kim, linux-kernel, Ingo Molnar, Andi Kleen, Anshuman Khandual, Arun Sharma, Corey Ashford, Frederic Weisbecker, Jason Wang, Jiri Olsa, Lin Ming, Paul Mackerras, Peter Zijlstra, Roberto Agostino Vitillo, Robert Richter, Stephane Eranian, Thomas Gleixner, Vince Weaver, arnaldo.melo, Arnaldo Carvalho de Melo On 2/13/12 10:48 PM, Namhyung Kim wrote: > Hi, > > 2012-02-14 2:23 PM, David Ahern wrote: >> On 2/13/12 10:10 PM, Namhyung Kim wrote: >>>> >>> >>> It happened because I used old kernel version that isn't aware of the >>> exclude_guest bit. So the check on attr->__reserved_1 in >>> core.c::perf_copy_attr() failed and then ended up seeing EINVAL at >>> perf_evsel__open(). >> >> comes back to 1aed2671738785e8f5aea663a6fda91aa7ef59b5 again. See >> >> https://lkml.org/lkml/2012/2/8/234 >> > > Thanks for the pointer, then I'll wait until the guys figure out what > the correct fix is :) > > Thanks, > Namhyung Resetting that default was it. :-) Does reverting the entire patch fix your problem? David ^ permalink raw reply [flat|nested] 56+ messages in thread
* Re: [GIT PULL 00/16] perf/core improvements and fixes 2012-02-14 5:52 ` David Ahern @ 2012-02-14 5:58 ` Namhyung Kim 0 siblings, 0 replies; 56+ messages in thread From: Namhyung Kim @ 2012-02-14 5:58 UTC (permalink / raw) To: David Ahern Cc: Joerg Roedel, Namhyung Kim, linux-kernel, Ingo Molnar, Andi Kleen, Anshuman Khandual, Arun Sharma, Corey Ashford, Frederic Weisbecker, Jason Wang, Jiri Olsa, Lin Ming, Paul Mackerras, Peter Zijlstra, Roberto Agostino Vitillo, Robert Richter, Stephane Eranian, Thomas Gleixner, Vince Weaver, arnaldo.melo, Arnaldo Carvalho de Melo 2012-02-14 2:52 PM, David Ahern wrote: > On 2/13/12 10:48 PM, Namhyung Kim wrote: >> Hi, >> >> 2012-02-14 2:23 PM, David Ahern wrote: >>> On 2/13/12 10:10 PM, Namhyung Kim wrote: >>>>> >>>> >>>> It happened because I used old kernel version that isn't aware of the >>>> exclude_guest bit. So the check on attr->__reserved_1 in >>>> core.c::perf_copy_attr() failed and then ended up seeing EINVAL at >>>> perf_evsel__open(). >>> >>> comes back to 1aed2671738785e8f5aea663a6fda91aa7ef59b5 again. See >>> >>> https://lkml.org/lkml/2012/2/8/234 >>> >> >> Thanks for the pointer, then I'll wait until the guys figure out what >> the correct fix is :) >> >> Thanks, >> Namhyung > > Resetting that default was it. :-) > > Does reverting the entire patch fix your problem? > > David Yes, of course. OK, then I'll start to rework perf target cleanups after reverting it. Thanks, Namhyung ^ permalink raw reply [flat|nested] 56+ messages in thread
* Re: [GIT PULL 00/16] perf/core improvements and fixes 2012-02-14 5:23 ` David Ahern 2012-02-14 5:48 ` Namhyung Kim @ 2012-02-14 10:50 ` Joerg Roedel 2012-02-14 13:10 ` Arnaldo Carvalho de Melo 1 sibling, 1 reply; 56+ messages in thread From: Joerg Roedel @ 2012-02-14 10:50 UTC (permalink / raw) To: David Ahern Cc: Namhyung Kim, Namhyung Kim, linux-kernel, Ingo Molnar, Andi Kleen, Anshuman Khandual, Arun Sharma, Corey Ashford, Frederic Weisbecker, Jason Wang, Jiri Olsa, Lin Ming, Paul Mackerras, Peter Zijlstra, Roberto Agostino Vitillo, Robert Richter, Stephane Eranian, Thomas Gleixner, Vince Weaver, arnaldo.melo, Arnaldo Carvalho de Melo On Mon, Feb 13, 2012 at 10:23:07PM -0700, David Ahern wrote: > On 2/13/12 10:10 PM, Namhyung Kim wrote: > >> > > > >It happened because I used old kernel version that isn't aware of the > >exclude_guest bit. So the check on attr->__reserved_1 in > >core.c::perf_copy_attr() failed and then ended up seeing EINVAL at > >perf_evsel__open(). > > comes back to 1aed2671738785e8f5aea663a6fda91aa7ef59b5 again. See > > https://lkml.org/lkml/2012/2/8/234 Grr. I totally forgot that issue. Is there some way to detect the perf kernel-features? Otherwise the only fix seems to be to drop the patch changing the default. The crashes should be fixed by the other patch already. Joerg -- AMD Operating System Research Center Advanced Micro Devices GmbH Einsteinring 24 85609 Dornach General Managers: Alberto Bozzo Registration: Dornach, Landkr. Muenchen; Registerger. Muenchen, HRB Nr. 43632 ^ permalink raw reply [flat|nested] 56+ messages in thread
* Re: [GIT PULL 00/16] perf/core improvements and fixes 2012-02-14 10:50 ` Joerg Roedel @ 2012-02-14 13:10 ` Arnaldo Carvalho de Melo 2012-02-14 14:38 ` Arnaldo Carvalho de Melo 0 siblings, 1 reply; 56+ messages in thread From: Arnaldo Carvalho de Melo @ 2012-02-14 13:10 UTC (permalink / raw) To: Joerg Roedel Cc: David Ahern, Namhyung Kim, Namhyung Kim, linux-kernel, Ingo Molnar, Andi Kleen, Anshuman Khandual, Arun Sharma, Corey Ashford, Frederic Weisbecker, Jason Wang, Jiri Olsa, Lin Ming, Paul Mackerras, Peter Zijlstra, Roberto Agostino Vitillo, Robert Richter, Stephane Eranian, Thomas Gleixner, Vince Weaver Em Tue, Feb 14, 2012 at 11:50:18AM +0100, Joerg Roedel escreveu: > On Mon, Feb 13, 2012 at 10:23:07PM -0700, David Ahern wrote: > > On 2/13/12 10:10 PM, Namhyung Kim wrote: > > >It happened because I used old kernel version that isn't aware of the > > >exclude_guest bit. So the check on attr->__reserved_1 in > > >core.c::perf_copy_attr() failed and then ended up seeing EINVAL at > > >perf_evsel__open(). > > comes back to 1aed2671738785e8f5aea663a6fda91aa7ef59b5 again. See > > https://lkml.org/lkml/2012/2/8/234 > Grr. I totally forgot that issue. Is there some way to detect the perf > kernel-features? Otherwise the only fix seems to be to drop the patch > changing the default. The crashes should be fixed by the other patch > already. Well, we need to detect it like we do with sample_id_all. In this case we would try with exclude_guest, if it fails with EINVAL, we realise it is not available, clean that bit and try again. - Arnaldo ^ permalink raw reply [flat|nested] 56+ messages in thread
* Re: [GIT PULL 00/16] perf/core improvements and fixes 2012-02-14 13:10 ` Arnaldo Carvalho de Melo @ 2012-02-14 14:38 ` Arnaldo Carvalho de Melo 2012-02-14 15:10 ` Joerg Roedel 0 siblings, 1 reply; 56+ messages in thread From: Arnaldo Carvalho de Melo @ 2012-02-14 14:38 UTC (permalink / raw) To: Joerg Roedel Cc: David Ahern, Namhyung Kim, Namhyung Kim, linux-kernel, Ingo Molnar, Andi Kleen, Anshuman Khandual, Arun Sharma, Corey Ashford, Frederic Weisbecker, Jason Wang, Jiri Olsa, Lin Ming, Paul Mackerras, Peter Zijlstra, Roberto Agostino Vitillo, Robert Richter, Stephane Eranian, Thomas Gleixner, Vince Weaver Em Tue, Feb 14, 2012 at 11:10:46AM -0200, Arnaldo Carvalho de Melo escreveu: > Em Tue, Feb 14, 2012 at 11:50:18AM +0100, Joerg Roedel escreveu: > > On Mon, Feb 13, 2012 at 10:23:07PM -0700, David Ahern wrote: > > > On 2/13/12 10:10 PM, Namhyung Kim wrote: > > > >It happened because I used old kernel version that isn't aware of the > > > >exclude_guest bit. So the check on attr->__reserved_1 in > > > >core.c::perf_copy_attr() failed and then ended up seeing EINVAL at > > > >perf_evsel__open(). > > > > comes back to 1aed2671738785e8f5aea663a6fda91aa7ef59b5 again. See > > > > https://lkml.org/lkml/2012/2/8/234 > > > Grr. I totally forgot that issue. Is there some way to detect the perf > > kernel-features? Otherwise the only fix seems to be to drop the patch > > changing the default. The crashes should be fixed by the other patch > > already. > > Well, we need to detect it like we do with sample_id_all. In this case > we would try with exclude_guest, if it fails with EINVAL, we realise it > is not available, clean that bit and try again. The following patch should fix it, still untested tho, booting a machine with an < 3.1 kernel to check... - Arnaldo diff --git a/tools/perf/builtin-record.c b/tools/perf/builtin-record.c index 08ed24b..d6c10e8 100644 --- a/tools/perf/builtin-record.c +++ b/tools/perf/builtin-record.c @@ -205,6 +205,9 @@ static void perf_record__open(struct perf_record *rec) if (opts->group && pos != first) group_fd = first->fd; +fallback_missing_features: + if (opts->exclude_guest_missing) + attr->exclude_guest = attr->exclude_host = 0; retry_sample_id: attr->sample_id_all = opts->sample_id_all_avail ? 1 : 0; try_again: @@ -218,15 +221,23 @@ try_again: } else if (err == ENODEV && opts->cpu_list) { die("No such device - did you specify" " an out-of-range profile CPU?\n"); - } else if (err == EINVAL && opts->sample_id_all_avail) { - /* - * Old kernel, no attr->sample_id_type_all field - */ - opts->sample_id_all_avail = false; - if (!opts->sample_time && !opts->raw_samples && !time_needed) - attr->sample_type &= ~PERF_SAMPLE_TIME; - - goto retry_sample_id; + } else if (err == EINVAL) { + if (!opts->exclude_guest_missing && + (attr->exclude_guest || attr->exclude_host)) { + pr_debug("Old kernel, cannot exclude " + "guest or host samples.\n"); + opts->exclude_guest_missing = true; + goto fallback_missing_features; + } else if (opts->sample_id_all_avail) { + /* + * Old kernel, no attr->sample_id_type_all field + */ + opts->sample_id_all_avail = false; + if (!opts->sample_time && !opts->raw_samples && !time_needed) + attr->sample_type &= ~PERF_SAMPLE_TIME; + + goto retry_sample_id; + } } /* diff --git a/tools/perf/builtin-top.c b/tools/perf/builtin-top.c index 5a88c0d..02e11ff 100644 --- a/tools/perf/builtin-top.c +++ b/tools/perf/builtin-top.c @@ -872,6 +872,9 @@ static void perf_top__start_counters(struct perf_top *top) attr->mmap = 1; attr->comm = 1; attr->inherit = top->inherit; +fallback_missing_features: + if (top->exclude_guest_missing) + attr->exclude_guest = attr->exclude_host = 0; retry_sample_id: attr->sample_id_all = top->sample_id_all_avail ? 1 : 0; try_again: @@ -883,12 +886,20 @@ try_again: if (err == EPERM || err == EACCES) { ui__error_paranoid(); goto out_err; - } else if (err == EINVAL && top->sample_id_all_avail) { - /* - * Old kernel, no attr->sample_id_type_all field - */ - top->sample_id_all_avail = false; - goto retry_sample_id; + } else if (err == EINVAL) { + if (!top->exclude_guest_missing && + (attr->exclude_guest || attr->exclude_host)) { + pr_debug("Old kernel, cannot exclude " + "guest or host samples.\n"); + top->exclude_guest_missing = true; + goto fallback_missing_features; + } else if (top->sample_id_all_avail) { + /* + * Old kernel, no attr->sample_id_type_all field + */ + top->sample_id_all_avail = false; + goto retry_sample_id; + } } /* * If it's cycles then fall back to hrtimer diff --git a/tools/perf/perf.h b/tools/perf/perf.h index 03a0456..8b9c436 100644 --- a/tools/perf/perf.h +++ b/tools/perf/perf.h @@ -199,6 +199,7 @@ struct perf_record_opts { bool sample_address; bool sample_time; bool sample_id_all_avail; + bool exclude_guest_missing; bool system_wide; bool period; unsigned int freq; diff --git a/tools/perf/util/top.h b/tools/perf/util/top.h index 49eb848..7dea891 100644 --- a/tools/perf/util/top.h +++ b/tools/perf/util/top.h @@ -35,6 +35,7 @@ struct perf_top { bool inherit; bool group; bool sample_id_all_avail; + bool exclude_guest_missing; bool dump_symtab; const char *cpu_list; struct hist_entry *sym_filter_entry; ^ permalink raw reply related [flat|nested] 56+ messages in thread
* Re: [GIT PULL 00/16] perf/core improvements and fixes 2012-02-14 14:38 ` Arnaldo Carvalho de Melo @ 2012-02-14 15:10 ` Joerg Roedel 2012-02-14 16:11 ` Arnaldo Carvalho de Melo 0 siblings, 1 reply; 56+ messages in thread From: Joerg Roedel @ 2012-02-14 15:10 UTC (permalink / raw) To: Arnaldo Carvalho de Melo Cc: David Ahern, Namhyung Kim, Namhyung Kim, linux-kernel, Ingo Molnar, Andi Kleen, Anshuman Khandual, Arun Sharma, Corey Ashford, Frederic Weisbecker, Jason Wang, Jiri Olsa, Lin Ming, Paul Mackerras, Peter Zijlstra, Roberto Agostino Vitillo, Robert Richter, Stephane Eranian, Thomas Gleixner, Vince Weaver On Tue, Feb 14, 2012 at 12:38:53PM -0200, Arnaldo Carvalho de Melo wrote: > diff --git a/tools/perf/builtin-record.c b/tools/perf/builtin-record.c > index 08ed24b..d6c10e8 100644 > --- a/tools/perf/builtin-record.c > +++ b/tools/perf/builtin-record.c > @@ -205,6 +205,9 @@ static void perf_record__open(struct perf_record *rec) > > if (opts->group && pos != first) > group_fd = first->fd; > +fallback_missing_features: > + if (opts->exclude_guest_missing) > + attr->exclude_guest = attr->exclude_host = 0; > retry_sample_id: > attr->sample_id_all = opts->sample_id_all_avail ? 1 : 0; > try_again: > @@ -218,15 +221,23 @@ try_again: > } else if (err == ENODEV && opts->cpu_list) { > die("No such device - did you specify" > " an out-of-range profile CPU?\n"); > - } else if (err == EINVAL && opts->sample_id_all_avail) { > - /* > - * Old kernel, no attr->sample_id_type_all field > - */ > - opts->sample_id_all_avail = false; > - if (!opts->sample_time && !opts->raw_samples && !time_needed) > - attr->sample_type &= ~PERF_SAMPLE_TIME; > - > - goto retry_sample_id; > + } else if (err == EINVAL) { > + if (!opts->exclude_guest_missing && > + (attr->exclude_guest || attr->exclude_host)) { > + pr_debug("Old kernel, cannot exclude " > + "guest or host samples.\n"); > + opts->exclude_guest_missing = true; > + goto fallback_missing_features; > + } else if (opts->sample_id_all_avail) { > + /* > + * Old kernel, no attr->sample_id_type_all field > + */ > + opts->sample_id_all_avail = false; > + if (!opts->sample_time && !opts->raw_samples && !time_needed) > + attr->sample_type &= ~PERF_SAMPLE_TIME; > + > + goto retry_sample_id; > + } > } > > /* > diff --git a/tools/perf/builtin-top.c b/tools/perf/builtin-top.c > index 5a88c0d..02e11ff 100644 > --- a/tools/perf/builtin-top.c > +++ b/tools/perf/builtin-top.c > @@ -872,6 +872,9 @@ static void perf_top__start_counters(struct perf_top *top) > attr->mmap = 1; > attr->comm = 1; > attr->inherit = top->inherit; > +fallback_missing_features: > + if (top->exclude_guest_missing) > + attr->exclude_guest = attr->exclude_host = 0; > retry_sample_id: > attr->sample_id_all = top->sample_id_all_avail ? 1 : 0; > try_again: > @@ -883,12 +886,20 @@ try_again: > if (err == EPERM || err == EACCES) { > ui__error_paranoid(); > goto out_err; > - } else if (err == EINVAL && top->sample_id_all_avail) { > - /* > - * Old kernel, no attr->sample_id_type_all field > - */ > - top->sample_id_all_avail = false; > - goto retry_sample_id; > + } else if (err == EINVAL) { > + if (!top->exclude_guest_missing && > + (attr->exclude_guest || attr->exclude_host)) { > + pr_debug("Old kernel, cannot exclude " > + "guest or host samples.\n"); > + top->exclude_guest_missing = true; > + goto fallback_missing_features; > + } else if (top->sample_id_all_avail) { > + /* > + * Old kernel, no attr->sample_id_type_all field > + */ > + top->sample_id_all_avail = false; > + goto retry_sample_id; > + } > } > /* > * If it's cycles then fall back to hrtimer > diff --git a/tools/perf/perf.h b/tools/perf/perf.h > index 03a0456..8b9c436 100644 > --- a/tools/perf/perf.h > +++ b/tools/perf/perf.h > @@ -199,6 +199,7 @@ struct perf_record_opts { > bool sample_address; > bool sample_time; > bool sample_id_all_avail; > + bool exclude_guest_missing; > bool system_wide; > bool period; > unsigned int freq; > diff --git a/tools/perf/util/top.h b/tools/perf/util/top.h > index 49eb848..7dea891 100644 > --- a/tools/perf/util/top.h > +++ b/tools/perf/util/top.h > @@ -35,6 +35,7 @@ struct perf_top { > bool inherit; > bool group; > bool sample_id_all_avail; > + bool exclude_guest_missing; > bool dump_symtab; > const char *cpu_list; > struct hist_entry *sym_filter_entry; I was about to prepare a similar patch :) But anyway, this one works too. Tested-by: Joerg Roedel <joerg.roedel@amd.com> (on 2.6.32) -- AMD Operating System Research Center Advanced Micro Devices GmbH Einsteinring 24 85609 Dornach General Managers: Alberto Bozzo Registration: Dornach, Landkr. Muenchen; Registerger. Muenchen, HRB Nr. 43632 ^ permalink raw reply [flat|nested] 56+ messages in thread
* Re: [GIT PULL 00/16] perf/core improvements and fixes 2012-02-14 15:10 ` Joerg Roedel @ 2012-02-14 16:11 ` Arnaldo Carvalho de Melo 0 siblings, 0 replies; 56+ messages in thread From: Arnaldo Carvalho de Melo @ 2012-02-14 16:11 UTC (permalink / raw) To: Joerg Roedel Cc: David Ahern, Namhyung Kim, Namhyung Kim, linux-kernel, Ingo Molnar, Andi Kleen, Anshuman Khandual, Arun Sharma, Corey Ashford, Frederic Weisbecker, Jason Wang, Jiri Olsa, Lin Ming, Paul Mackerras, Peter Zijlstra, Roberto Agostino Vitillo, Robert Richter, Stephane Eranian, Thomas Gleixner, Vince Weaver Em Tue, Feb 14, 2012 at 04:10:39PM +0100, Joerg Roedel escreveu: > On Tue, Feb 14, 2012 at 12:38:53PM -0200, Arnaldo Carvalho de Melo wrote: > > bool sample_id_all_avail; > > + bool exclude_guest_missing; > > bool dump_symtab; > > const char *cpu_list; > > struct hist_entry *sym_filter_entry; > > I was about to prepare a similar patch :) But anyway, this one works > too. > > Tested-by: Joerg Roedel <joerg.roedel@amd.com> > > (on 2.6.32) thanks, I'm commiting it now after testing it on a 3.0-rt kernel, - Arnaldo ^ permalink raw reply [flat|nested] 56+ messages in thread
* [GIT PULL 00/16] perf/core improvements and fixes @ 2011-09-29 22:47 Arnaldo Carvalho de Melo 2011-10-04 7:57 ` Ingo Molnar 0 siblings, 1 reply; 56+ messages in thread From: Arnaldo Carvalho de Melo @ 2011-09-29 22:47 UTC (permalink / raw) To: Ingo Molnar Cc: linux-kernel, Arnaldo Carvalho de Melo, Andi Kleen, David Ahern, Frederic Weisbecker, Jim Cromie, Jiri Olsa, Mike Galbraith, Neil Horman, Paul Mackerras, Peter Zijlstra, Stephane Eranian, Thomas Gleixner, arnaldo.melo Hi Ingo, Please consider pulling from: git://github.com/acmel/linux.git perf/core This was rebased against your perf/core on tesla. Regards, - Arnaldo Andi Kleen (2): perf tools: Make stat/record print fatal signals of the target program perf: Support setting the disassembler style Arnaldo Carvalho de Melo (6): perf symbols: Stop using 'self' in map_groups__ methods perf buildid-list: Add option to show the running kernel build id perf buildid-list: Support showing the build id in an ELF file perf top browser: Fix up line width calculation perf top: Improve lost events warning perf report: Fix stdio event name header printing Jim Cromie (5): perf stat: Add --log-fd <N> option to redirect stderr elsewhere perf stat: Fix +- nan% in --no-aggr runs perf stat: Suppress printing std-dev when its 0 perf stat: Allow tab as cvs delimiter perf stat: Fix spelling in comment Jiri Olsa (2): perf sched: Fix script command documentation perf symbols: Treat all memory maps without dso file as loaded Neil Horman (1): perf script: Add drop monitor script tools/perf/Documentation/perf-annotate.txt | 3 + tools/perf/Documentation/perf-buildid-list.txt | 6 + tools/perf/Documentation/perf-report.txt | 3 + tools/perf/Documentation/perf-sched.txt | 6 +- tools/perf/Documentation/perf-stat.txt | 11 ++- tools/perf/builtin-annotate.c | 2 + tools/perf/builtin-buildid-list.c | 52 ++++++++++- tools/perf/builtin-record.c | 19 ++++- tools/perf/builtin-report.c | 7 +- tools/perf/builtin-stat.c | 35 ++++++- tools/perf/builtin-top.c | 34 ++++---- .../perf/scripts/python/bin/net_dropmonitor-record | 2 + .../perf/scripts/python/bin/net_dropmonitor-report | 4 + tools/perf/scripts/python/net_dropmonitor.py | 72 ++++++++++++++ tools/perf/util/annotate.c | 6 +- tools/perf/util/annotate.h | 2 + tools/perf/util/map.c | 98 +++++++++++--------- tools/perf/util/map.h | 42 ++++---- tools/perf/util/top.h | 3 + tools/perf/util/ui/browsers/top.c | 28 +++++- 20 files changed, 333 insertions(+), 102 deletions(-) create mode 100755 tools/perf/scripts/python/bin/net_dropmonitor-record create mode 100755 tools/perf/scripts/python/bin/net_dropmonitor-report create mode 100755 tools/perf/scripts/python/net_dropmonitor.py ^ permalink raw reply [flat|nested] 56+ messages in thread
* Re: [GIT PULL 00/16] perf/core improvements and fixes 2011-09-29 22:47 Arnaldo Carvalho de Melo @ 2011-10-04 7:57 ` Ingo Molnar 0 siblings, 0 replies; 56+ messages in thread From: Ingo Molnar @ 2011-10-04 7:57 UTC (permalink / raw) To: Arnaldo Carvalho de Melo Cc: linux-kernel, Andi Kleen, David Ahern, Frederic Weisbecker, Jim Cromie, Jiri Olsa, Mike Galbraith, Neil Horman, Paul Mackerras, Peter Zijlstra, Stephane Eranian, Thomas Gleixner, arnaldo.melo * Arnaldo Carvalho de Melo <acme@infradead.org> wrote: > Hi Ingo, > > Please consider pulling from: > > git://github.com/acmel/linux.git perf/core > > This was rebased against your perf/core on tesla. > > Regards, > > - Arnaldo > > Andi Kleen (2): > perf tools: Make stat/record print fatal signals of the target program > perf: Support setting the disassembler style > > Arnaldo Carvalho de Melo (6): > perf symbols: Stop using 'self' in map_groups__ methods > perf buildid-list: Add option to show the running kernel build id > perf buildid-list: Support showing the build id in an ELF file > perf top browser: Fix up line width calculation > perf top: Improve lost events warning > perf report: Fix stdio event name header printing > > Jim Cromie (5): > perf stat: Add --log-fd <N> option to redirect stderr elsewhere > perf stat: Fix +- nan% in --no-aggr runs > perf stat: Suppress printing std-dev when its 0 > perf stat: Allow tab as cvs delimiter > perf stat: Fix spelling in comment > > Jiri Olsa (2): > perf sched: Fix script command documentation > perf symbols: Treat all memory maps without dso file as loaded > > Neil Horman (1): > perf script: Add drop monitor script > > tools/perf/Documentation/perf-annotate.txt | 3 + > tools/perf/Documentation/perf-buildid-list.txt | 6 + > tools/perf/Documentation/perf-report.txt | 3 + > tools/perf/Documentation/perf-sched.txt | 6 +- > tools/perf/Documentation/perf-stat.txt | 11 ++- > tools/perf/builtin-annotate.c | 2 + > tools/perf/builtin-buildid-list.c | 52 ++++++++++- > tools/perf/builtin-record.c | 19 ++++- > tools/perf/builtin-report.c | 7 +- > tools/perf/builtin-stat.c | 35 ++++++- > tools/perf/builtin-top.c | 34 ++++---- > .../perf/scripts/python/bin/net_dropmonitor-record | 2 + > .../perf/scripts/python/bin/net_dropmonitor-report | 4 + > tools/perf/scripts/python/net_dropmonitor.py | 72 ++++++++++++++ > tools/perf/util/annotate.c | 6 +- > tools/perf/util/annotate.h | 2 + > tools/perf/util/map.c | 98 +++++++++++--------- > tools/perf/util/map.h | 42 ++++---- > tools/perf/util/top.h | 3 + > tools/perf/util/ui/browsers/top.c | 28 +++++- > 20 files changed, 333 insertions(+), 102 deletions(-) > create mode 100755 tools/perf/scripts/python/bin/net_dropmonitor-record > create mode 100755 tools/perf/scripts/python/bin/net_dropmonitor-report > create mode 100755 tools/perf/scripts/python/net_dropmonitor.py Pulled, thanks a lot Arnaldo! Ingo ^ permalink raw reply [flat|nested] 56+ messages in thread
end of thread, other threads:[~2019-01-08 15:32 UTC | newest] Thread overview: 56+ messages (download: mbox.gz / follow: Atom feed) -- links below jump to the message on this page -- 2014-03-14 21:29 [GIT PULL 00/16] perf/core improvements and fixes Arnaldo Carvalho de Melo 2014-03-14 21:29 ` [PATCH 01/16] perf bench: Add futex-hash microbenchmark Arnaldo Carvalho de Melo 2014-03-14 21:29 ` [PATCH 02/16] perf bench: Add futex-wake microbenchmark Arnaldo Carvalho de Melo 2014-03-14 21:29 ` [PATCH 03/16] perf bench: Add futex-requeue microbenchmark Arnaldo Carvalho de Melo 2014-03-14 21:29 ` [PATCH 04/16] perf mem: Clarify load-latency in documentation Arnaldo Carvalho de Melo 2014-03-14 21:29 ` [PATCH 05/16] perf probe: Clarify x86 register naming for perf probe Arnaldo Carvalho de Melo 2014-03-14 21:30 ` [PATCH 06/16] perf tools: Fix synthesizing mmaps for threads Arnaldo Carvalho de Melo 2014-03-14 21:30 ` [PATCH 07/16] perf ui/stdio: Fix invalid output on event group report Arnaldo Carvalho de Melo 2014-03-14 21:30 ` [PATCH 08/16] perf ui/gtk: Reuse generic __hpp__fmt() code Arnaldo Carvalho de Melo 2014-03-14 21:30 ` [PATCH 09/16] perf ui/hists: Pass struct hpp to print functions Arnaldo Carvalho de Melo 2014-03-14 21:30 ` [PATCH 10/16] perf ui/tui: Reuse generic __hpp__fmt() code Arnaldo Carvalho de Melo 2014-03-14 21:30 ` [PATCH 11/16] perf session: Change header.misc dump from decimal to hex Arnaldo Carvalho de Melo 2014-03-14 21:30 ` [PATCH 12/16] perf symbols: Introduce thread__find_cpumode_addr_location Arnaldo Carvalho de Melo 2014-03-14 21:30 ` [PATCH 13/16] perf ui hists: Pass evsel to hpp->header/width functions explicitly Arnaldo Carvalho de Melo 2014-03-14 21:30 ` [PATCH 14/16] perf kvm: introduce --list-cmds for use by scripts Arnaldo Carvalho de Melo 2014-03-14 21:30 ` [PATCH 15/16] perf tools: Speed up thread map generation Arnaldo Carvalho de Melo 2014-03-14 21:30 ` [PATCH 16/16] perf machine: Factor machine__find_thread to take tid argument Arnaldo Carvalho de Melo 2014-03-18 8:24 ` [GIT PULL 00/16] perf/core improvements and fixes Ingo Molnar -- strict thread matches above, loose matches on Subject: below -- 2019-01-04 18:33 Arnaldo Carvalho de Melo 2019-01-08 15:32 ` Ingo Molnar 2016-07-05 0:38 Arnaldo Carvalho de Melo 2016-07-05 10:14 ` Ingo Molnar 2016-01-29 21:17 Arnaldo Carvalho de Melo 2016-02-03 10:02 ` Ingo Molnar 2015-12-14 17:44 Arnaldo Carvalho de Melo 2015-10-05 21:03 Arnaldo Carvalho de Melo 2015-10-06 7:09 ` Ingo Molnar 2015-09-30 21:54 Arnaldo Carvalho de Melo 2015-10-01 7:05 ` Ingo Molnar 2015-08-25 16:14 Arnaldo Carvalho de Melo 2015-08-26 13:39 ` Arnaldo Carvalho de Melo 2015-04-07 16:40 Arnaldo Carvalho de Melo 2014-11-19 16:03 Arnaldo Carvalho de Melo 2014-11-20 7:33 ` Ingo Molnar 2014-08-22 16:29 Arnaldo Carvalho de Melo 2014-08-24 10:11 ` Ingo Molnar 2014-08-24 11:16 ` Arnaldo Carvalho de Melo 2014-08-24 14:47 ` Ingo Molnar 2013-11-18 20:27 Arnaldo Carvalho de Melo 2013-10-23 20:57 Arnaldo Carvalho de Melo 2013-10-24 6:52 ` Ingo Molnar 2012-02-14 1:52 Arnaldo Carvalho de Melo 2012-02-14 2:50 ` Namhyung Kim 2012-02-14 3:07 ` Namhyung Kim 2012-02-14 5:10 ` Namhyung Kim 2012-02-14 5:23 ` David Ahern 2012-02-14 5:48 ` Namhyung Kim 2012-02-14 5:52 ` David Ahern 2012-02-14 5:58 ` Namhyung Kim 2012-02-14 10:50 ` Joerg Roedel 2012-02-14 13:10 ` Arnaldo Carvalho de Melo 2012-02-14 14:38 ` Arnaldo Carvalho de Melo 2012-02-14 15:10 ` Joerg Roedel 2012-02-14 16:11 ` Arnaldo Carvalho de Melo 2011-09-29 22:47 Arnaldo Carvalho de Melo 2011-10-04 7:57 ` Ingo Molnar
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).