All of lore.kernel.org
 help / color / mirror / Atom feed
* [GIT PULL 0/6] perf/core improvements and fixes
@ 2015-06-03 22:40 Arnaldo Carvalho de Melo
  2015-06-03 22:40 ` [PATCH 1/6] perf probe: Fix segfault when glob matching function without debuginfo Arnaldo Carvalho de Melo
                   ` (6 more replies)
  0 siblings, 7 replies; 43+ messages in thread
From: Arnaldo Carvalho de Melo @ 2015-06-03 22:40 UTC (permalink / raw)
  To: Ingo Molnar
  Cc: linux-kernel, Arnaldo Carvalho de Melo, Adrian Hunter,
	Alexei Starovoitov, Brendan Gregg, Daniel Borkmann, David Ahern,
	He Kuang, Jiri Olsa, Kaixu Xia, Madhavan Srinivasan,
	Masami Hiramatsu, Michael Ellerman, Namhyung Kim, Peter Zijlstra,
	pi3orama, Sukadev Bhattiprolu, Wang Nan, Zefan Li,
	Arnaldo Carvalho de Melo

Hi Ingo,

	Please consider applying.

	One of the next requests probably will have the eBPF work by Wang Nan,
but I am still going thru it and want to test it thoroughly.

	BTW: Have you looked at it lately? It is at:

http://lkml.kernel.org/r/1433144296-74992-1-git-send-email-wangnan0@huawei.com

Super summary from the above cover letter:

---------------------
It enables 'perf record' to filter events using eBPF programs like:

 # perf record --event bpf-file.o sleep 1

Events are selected and filtered according to definitions in bpf-file.o.
---------------------

	The first two patches from that series are in this pull req, as
they just move stuff into tools/include/linux/ from tools/perf/include.

Regards,

- Arnaldo

The following changes since commit 5c9b9bc67c684e40b3a5e7e9facde0fb7200cd8c:

  Merge tag 'perf-core-for-mingo' of git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux into perf/core (2015-05-29 20:19:02 +0200)

are available in the git repository at:

  git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux.git tags/perf-core-for-mingo

for you to fetch changes up to 1f121b03d058dd07199d8924373d3c52a207f63b:

  perf tools: Deal with kernel module names in '[]' correctly (2015-06-03 10:02:38 -0300)

----------------------------------------------------------------
perf/core improvements and fixes:

User visible:

- Fix 'perf probe' segfault when glob matching function without debuginfo (Wang Nan)

- Remove newline char when reading event scale and unit (Madhavan Srinivasan)

- Deal with kernel module names in '[]' correctly (Wang Nan)

Infrastructure:

- Fix the search for the kernel DSO on the unified list (Arnaldo Carvalho de Melo)

- Move tools/perf/util/include/linux/{kernel.h,list.h,poison.h} to tools/include,
  to be used in tools/lib/bpf/ (Wang Nan)

Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>

----------------------------------------------------------------
Arnaldo Carvalho de Melo (1):
      perf machine: Fix the search for the kernel DSO on the unified list

Madhavan Srinivasan (1):
      perf tools: Remove newline char when reading event scale and unit

Wang Nan (4):
      perf probe: Fix segfault when glob matching function without debuginfo
      perf tools: Move linux/kernel.h to tools/include
      tools: Move tools/perf/util/include/linux/{list.h,poison.h} to tools/include
      perf tools: Deal with kernel module names in '[]' correctly

 tools/{perf/util => }/include/linux/kernel.h |  4 +-
 tools/{perf/util => }/include/linux/list.h   |  6 +--
 tools/include/linux/poison.h                 |  1 +
 tools/perf/MANIFEST                          |  3 ++
 tools/perf/tests/kmod-path.c                 | 72 ++++++++++++++++++++++++++++
 tools/perf/util/dso.c                        | 47 ++++++++++++++++--
 tools/perf/util/dso.h                        |  2 +-
 tools/perf/util/header.c                     |  8 ++--
 tools/perf/util/include/linux/poison.h       |  1 -
 tools/perf/util/machine.c                    | 22 ++++++++-
 tools/perf/util/pmu.c                        | 11 ++++-
 tools/perf/util/probe-event.c                | 26 ++++++++--
 12 files changed, 179 insertions(+), 24 deletions(-)
 rename tools/{perf/util => }/include/linux/kernel.h (97%)
 rename tools/{perf/util => }/include/linux/list.h (90%)
 create mode 100644 tools/include/linux/poison.h
 delete mode 100644 tools/perf/util/include/linux/poison.h

^ permalink raw reply	[flat|nested] 43+ messages in thread

* [PATCH 1/6] perf probe: Fix segfault when glob matching function without debuginfo
  2015-06-03 22:40 [GIT PULL 0/6] perf/core improvements and fixes Arnaldo Carvalho de Melo
@ 2015-06-03 22:40 ` Arnaldo Carvalho de Melo
  2015-06-03 22:40 ` [PATCH 2/6] perf tools: Remove newline char when reading event scale and unit Arnaldo Carvalho de Melo
                   ` (5 subsequent siblings)
  6 siblings, 0 replies; 43+ messages in thread
From: Arnaldo Carvalho de Melo @ 2015-06-03 22:40 UTC (permalink / raw)
  To: Ingo Molnar
  Cc: linux-kernel, Wang Nan, Jiri Olsa, Namhyung Kim, Zefan Li,
	Arnaldo Carvalho de Melo

From: Wang Nan <wangnan0@huawei.com>

Commit 4c859351226c920b227fec040a3b447f0d482af3 ("perf probe: Support
glob wildcards for function name") introduces segfault problems when
debuginfo is not available:

 # perf probe 'sys_w*'
  Added new events:
  Segmentation fault

The first problem resides in find_probe_trace_events_from_map(). In
that function, find_probe_functions() is called to match each symbol
against glob to find the number of matching functions, but still use
map__for_each_symbol_by_name() to find 'struct symbol' for matching
functions. Unfortunately, map__for_each_symbol_by_name() does
exact matching by searching in an rbtree.

It doesn't know glob matching, and not easy for it to support it because
it use rbtree based binary search, but we are unable to ensure all names
matched by the glob (any glob passed by user) reside in one subtree.

This patch drops map__for_each_symbol_by_name(). Since there is no
rbtree again, re-matching all symbols costs a lot. This patch avoid it
by saving all matching results into an array (syms).

The second problem is the lost of tp->realname. In
__add_probe_trace_events(), if pev->point.function is glob, the event
name should be set to tev->point.realname. This patch ensures its
existence by strdup sym->name instead of leaving a NULL pointer there.

After this patch:

 # perf probe 'sys_w*'
 Added new events:
   probe:sys_waitid     (on sys_w*)
   probe:sys_wait4      (on sys_w*)
   probe:sys_waitpid    (on sys_w*)
   probe:sys_write      (on sys_w*)
   probe:sys_writev     (on sys_w*)

 You can now use it in all perf tools, such as:

         perf record -e probe:sys_writev -aR sleep 1

Signed-off-by: Wang Nan <wangnan0@huawei.com>
Acked-by: Masami Hiramatsu <masami.hiramatsu.pt@hitachi.com>
Cc: Jiri Olsa <jolsa@redhat.com>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Zefan Li <lizefan@huawei.com>
Link: http://lkml.kernel.org/r/1432892747-232506-1-git-send-email-wangnan0@huawei.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
---
 tools/perf/util/probe-event.c | 26 +++++++++++++++++++++-----
 1 file changed, 21 insertions(+), 5 deletions(-)

diff --git a/tools/perf/util/probe-event.c b/tools/perf/util/probe-event.c
index d27edef5eb5b..e6f215b7a052 100644
--- a/tools/perf/util/probe-event.c
+++ b/tools/perf/util/probe-event.c
@@ -2494,7 +2494,8 @@ close_out:
 	return ret;
 }
 
-static int find_probe_functions(struct map *map, char *name)
+static int find_probe_functions(struct map *map, char *name,
+				struct symbol **syms)
 {
 	int found = 0;
 	struct symbol *sym;
@@ -2504,8 +2505,11 @@ static int find_probe_functions(struct map *map, char *name)
 		return 0;
 
 	map__for_each_symbol(map, sym, tmp) {
-		if (strglobmatch(sym->name, name))
+		if (strglobmatch(sym->name, name)) {
 			found++;
+			if (syms && found < probe_conf.max_probes)
+				syms[found - 1] = sym;
+		}
 	}
 
 	return found;
@@ -2528,11 +2532,12 @@ static int find_probe_trace_events_from_map(struct perf_probe_event *pev,
 	struct map *map = NULL;
 	struct ref_reloc_sym *reloc_sym = NULL;
 	struct symbol *sym;
+	struct symbol **syms = NULL;
 	struct probe_trace_event *tev;
 	struct perf_probe_point *pp = &pev->point;
 	struct probe_trace_point *tp;
 	int num_matched_functions;
-	int ret, i;
+	int ret, i, j;
 
 	map = get_target_map(pev->target, pev->uprobes);
 	if (!map) {
@@ -2540,11 +2545,17 @@ static int find_probe_trace_events_from_map(struct perf_probe_event *pev,
 		goto out;
 	}
 
+	syms = malloc(sizeof(struct symbol *) * probe_conf.max_probes);
+	if (!syms) {
+		ret = -ENOMEM;
+		goto out;
+	}
+
 	/*
 	 * Load matched symbols: Since the different local symbols may have
 	 * same name but different addresses, this lists all the symbols.
 	 */
-	num_matched_functions = find_probe_functions(map, pp->function);
+	num_matched_functions = find_probe_functions(map, pp->function, syms);
 	if (num_matched_functions == 0) {
 		pr_err("Failed to find symbol %s in %s\n", pp->function,
 			pev->target ? : "kernel");
@@ -2575,7 +2586,9 @@ static int find_probe_trace_events_from_map(struct perf_probe_event *pev,
 
 	ret = 0;
 
-	map__for_each_symbol_by_name(map, pp->function, sym) {
+	for (j = 0; j < num_matched_functions; j++) {
+		sym = syms[j];
+
 		tev = (*tevs) + ret;
 		tp = &tev->point;
 		if (ret == num_matched_functions) {
@@ -2599,6 +2612,8 @@ static int find_probe_trace_events_from_map(struct perf_probe_event *pev,
 			tp->symbol = strdup_or_goto(sym->name, nomem_out);
 			tp->offset = pp->offset;
 		}
+		tp->realname = strdup_or_goto(sym->name, nomem_out);
+
 		tp->retprobe = pp->retprobe;
 		if (pev->target)
 			tev->point.module = strdup_or_goto(pev->target,
@@ -2629,6 +2644,7 @@ static int find_probe_trace_events_from_map(struct perf_probe_event *pev,
 
 out:
 	put_target_map(map, pev->uprobes);
+	free(syms);
 	return ret;
 
 nomem_out:
-- 
2.1.0


^ permalink raw reply related	[flat|nested] 43+ messages in thread

* [PATCH 2/6] perf tools: Remove newline char when reading event scale and unit
  2015-06-03 22:40 [GIT PULL 0/6] perf/core improvements and fixes Arnaldo Carvalho de Melo
  2015-06-03 22:40 ` [PATCH 1/6] perf probe: Fix segfault when glob matching function without debuginfo Arnaldo Carvalho de Melo
@ 2015-06-03 22:40 ` Arnaldo Carvalho de Melo
  2015-06-03 22:40 ` [PATCH 3/6] perf machine: Fix the search for the kernel DSO on the unified list Arnaldo Carvalho de Melo
                   ` (4 subsequent siblings)
  6 siblings, 0 replies; 43+ messages in thread
From: Arnaldo Carvalho de Melo @ 2015-06-03 22:40 UTC (permalink / raw)
  To: Ingo Molnar
  Cc: linux-kernel, Madhavan Srinivasan, Jiri Olsa, Michael Ellerman,
	Peter Zijlstra, Sukadev Bhattiprolu, Arnaldo Carvalho de Melo

From: Madhavan Srinivasan <maddy@linux.vnet.ibm.com>

The <fd979c013207> commit intruduced the perf_event_sysfs_show function
to display the event_str value of an attr in kernel/event/core.c. But
the function returns the value with a newline char.

So, if a event also carries a event.unit file, when printing the counter
data perf tool formatting goes for a spin.

That is, because of the event unit, event name is printed in the newline
because of perf_event_sysfs_show returns with a newline char.

Now fixing perf core will break API, hencing proposing a fix in the perf tool.

Signed-off-by: Madhavan Srinivasan <maddy@linux.vnet.ibm.com>
Cc: Jiri Olsa <jolsa@redhat.com>
Cc: Michael Ellerman <mpe@ellerman.id.au>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Sukadev Bhattiprolu <sukadev@linux.vnet.ibm.com>
Link: http://lkml.kernel.org/r/1433052383-21802-1-git-send-email-maddy@linux.vnet.ibm.com
[ Add spaces around operators ]
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
---
 tools/perf/util/pmu.c | 11 +++++++++--
 1 file changed, 9 insertions(+), 2 deletions(-)

diff --git a/tools/perf/util/pmu.c b/tools/perf/util/pmu.c
index 5d3ab7c8ceaf..0fcc624eb767 100644
--- a/tools/perf/util/pmu.c
+++ b/tools/perf/util/pmu.c
@@ -112,7 +112,11 @@ static int perf_pmu__parse_scale(struct perf_pmu_alias *alias, char *dir, char *
 	if (sret < 0)
 		goto error;
 
-	scale[sret] = '\0';
+	if (scale[sret - 1] == '\n')
+		scale[sret - 1] = '\0';
+	else
+		scale[sret] = '\0';
+
 	/*
 	 * save current locale
 	 */
@@ -154,7 +158,10 @@ static int perf_pmu__parse_unit(struct perf_pmu_alias *alias, char *dir, char *n
 
 	close(fd);
 
-	alias->unit[sret] = '\0';
+	if (alias->unit[sret - 1] == '\n')
+		alias->unit[sret - 1] = '\0';
+	else
+		alias->unit[sret] = '\0';
 
 	return 0;
 error:
-- 
2.1.0


^ permalink raw reply related	[flat|nested] 43+ messages in thread

* [PATCH 3/6] perf machine: Fix the search for the kernel DSO on the unified list
  2015-06-03 22:40 [GIT PULL 0/6] perf/core improvements and fixes Arnaldo Carvalho de Melo
  2015-06-03 22:40 ` [PATCH 1/6] perf probe: Fix segfault when glob matching function without debuginfo Arnaldo Carvalho de Melo
  2015-06-03 22:40 ` [PATCH 2/6] perf tools: Remove newline char when reading event scale and unit Arnaldo Carvalho de Melo
@ 2015-06-03 22:40 ` Arnaldo Carvalho de Melo
  2015-06-03 22:40 ` [PATCH 4/6] perf tools: Move linux/kernel.h to tools/include Arnaldo Carvalho de Melo
                   ` (3 subsequent siblings)
  6 siblings, 0 replies; 43+ messages in thread
From: Arnaldo Carvalho de Melo @ 2015-06-03 22:40 UTC (permalink / raw)
  To: Ingo Molnar
  Cc: linux-kernel, Arnaldo Carvalho de Melo, Adrian Hunter,
	David Ahern, Jiri Olsa, Namhyung Kim

From: Arnaldo Carvalho de Melo <acme@redhat.com>

When unifying the user_dsos and kernel_dsos a bug was introduced by
inverting the check for dso->kernel, fix it.

Fixes: 3d39ac538629 ("perf machine: No need to have two DSOs lists")
Cc: Adrian Hunter <adrian.hunter@intel.com>
Cc: David Ahern <dsahern@gmail.com>
Cc: Jiri Olsa <jolsa@redhat.com>
Cc: Namhyung Kim <namhyung@kernel.org>
Link: http://lkml.kernel.org/n/tip-xnrnq0kams3s2z9ek1wjb506@git.kernel.org
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
---
 tools/perf/util/machine.c | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/tools/perf/util/machine.c b/tools/perf/util/machine.c
index 2ed61f59d415..4e29e80932e5 100644
--- a/tools/perf/util/machine.c
+++ b/tools/perf/util/machine.c
@@ -1149,7 +1149,7 @@ static int machine__process_kernel_mmap_event(struct machine *machine,
 		struct dso *dso;
 
 		list_for_each_entry(dso, &machine->dsos.head, node) {
-			if (dso->kernel && is_kernel_module(dso->long_name))
+			if (!dso->kernel || is_kernel_module(dso->long_name))
 				continue;
 
 			kernel = dso;
-- 
2.1.0


^ permalink raw reply related	[flat|nested] 43+ messages in thread

* [PATCH 4/6] perf tools: Move linux/kernel.h to tools/include
  2015-06-03 22:40 [GIT PULL 0/6] perf/core improvements and fixes Arnaldo Carvalho de Melo
                   ` (2 preceding siblings ...)
  2015-06-03 22:40 ` [PATCH 3/6] perf machine: Fix the search for the kernel DSO on the unified list Arnaldo Carvalho de Melo
@ 2015-06-03 22:40 ` Arnaldo Carvalho de Melo
  2015-06-03 22:40 ` [PATCH 5/6] tools: Move tools/perf/util/include/linux/{list.h,poison.h} " Arnaldo Carvalho de Melo
                   ` (2 subsequent siblings)
  6 siblings, 0 replies; 43+ messages in thread
From: Arnaldo Carvalho de Melo @ 2015-06-03 22:40 UTC (permalink / raw)
  To: Ingo Molnar
  Cc: linux-kernel, Wang Nan, Brendan Gregg, Daniel Borkmann,
	David Ahern, He Kuang, Jiri Olsa, Kaixu Xia, Masami Hiramatsu,
	Namhyung Kim, Peter Zijlstra, Zefan Li, pi3orama,
	Arnaldo Carvalho de Melo

From: Wang Nan <wangnan0@huawei.com>

This patch moves kernel.h from tools/perf/util/include/linux/kernel.h
to tools/include/linux/kernel.h to enable other libraries use macros in
it, like libbpf which will be introduced by further patches.

MANIFEST is also updated for 'make perf-*-src-pkg'.

Signed-off-by: Wang Nan <wangnan0@huawei.com>
Acked-by: Alexei Starovoitov <ast@plumgrid.com>
Cc: Brendan Gregg <brendan.d.gregg@gmail.com>
Cc: Daniel Borkmann <daniel@iogearbox.net>
Cc: David Ahern <dsahern@gmail.com>
Cc: He Kuang <hekuang@huawei.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Cc: Kaixu Xia <xiakaixu@huawei.com>
Cc: Masami Hiramatsu <masami.hiramatsu.pt@hitachi.com>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>
Cc: Zefan Li <lizefan@huawei.com>
Cc: pi3orama@163.com
Link: http://lkml.kernel.org/r/1433144296-74992-2-git-send-email-wangnan0@huawei.com
[ Fixed up the ifdef guard to match other entries in tools/include/linux ]
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
---
 tools/include/linux/kernel.h           | 107 +++++++++++++++++++++++++++++++++
 tools/perf/MANIFEST                    |   1 +
 tools/perf/util/include/linux/kernel.h | 107 ---------------------------------
 3 files changed, 108 insertions(+), 107 deletions(-)
 create mode 100644 tools/include/linux/kernel.h
 delete mode 100644 tools/perf/util/include/linux/kernel.h

diff --git a/tools/include/linux/kernel.h b/tools/include/linux/kernel.h
new file mode 100644
index 000000000000..76df53539c2a
--- /dev/null
+++ b/tools/include/linux/kernel.h
@@ -0,0 +1,107 @@
+#ifndef __TOOLS_LINUX_KERNEL_H
+#define __TOOLS_LINUX_KERNEL_H
+
+#include <stdarg.h>
+#include <stdio.h>
+#include <stdlib.h>
+#include <assert.h>
+
+#define DIV_ROUND_UP(n,d) (((n) + (d) - 1) / (d))
+
+#define PERF_ALIGN(x, a)	__PERF_ALIGN_MASK(x, (typeof(x))(a)-1)
+#define __PERF_ALIGN_MASK(x, mask)	(((x)+(mask))&~(mask))
+
+#ifndef offsetof
+#define offsetof(TYPE, MEMBER) ((size_t) &((TYPE *)0)->MEMBER)
+#endif
+
+#ifndef container_of
+/**
+ * container_of - cast a member of a structure out to the containing structure
+ * @ptr:	the pointer to the member.
+ * @type:	the type of the container struct this is embedded in.
+ * @member:	the name of the member within the struct.
+ *
+ */
+#define container_of(ptr, type, member) ({			\
+	const typeof(((type *)0)->member) * __mptr = (ptr);	\
+	(type *)((char *)__mptr - offsetof(type, member)); })
+#endif
+
+#define BUILD_BUG_ON_ZERO(e) (sizeof(struct { int:-!!(e); }))
+
+#ifndef max
+#define max(x, y) ({				\
+	typeof(x) _max1 = (x);			\
+	typeof(y) _max2 = (y);			\
+	(void) (&_max1 == &_max2);		\
+	_max1 > _max2 ? _max1 : _max2; })
+#endif
+
+#ifndef min
+#define min(x, y) ({				\
+	typeof(x) _min1 = (x);			\
+	typeof(y) _min2 = (y);			\
+	(void) (&_min1 == &_min2);		\
+	_min1 < _min2 ? _min1 : _min2; })
+#endif
+
+#ifndef roundup
+#define roundup(x, y) (                                \
+{                                                      \
+	const typeof(y) __y = y;		       \
+	(((x) + (__y - 1)) / __y) * __y;	       \
+}                                                      \
+)
+#endif
+
+#ifndef BUG_ON
+#ifdef NDEBUG
+#define BUG_ON(cond) do { if (cond) {} } while (0)
+#else
+#define BUG_ON(cond) assert(!(cond))
+#endif
+#endif
+
+/*
+ * Both need more care to handle endianness
+ * (Don't use bitmap_copy_le() for now)
+ */
+#define cpu_to_le64(x)	(x)
+#define cpu_to_le32(x)	(x)
+
+static inline int
+vscnprintf(char *buf, size_t size, const char *fmt, va_list args)
+{
+	int i;
+	ssize_t ssize = size;
+
+	i = vsnprintf(buf, size, fmt, args);
+
+	return (i >= ssize) ? (ssize - 1) : i;
+}
+
+static inline int scnprintf(char * buf, size_t size, const char * fmt, ...)
+{
+	va_list args;
+	ssize_t ssize = size;
+	int i;
+
+	va_start(args, fmt);
+	i = vsnprintf(buf, size, fmt, args);
+	va_end(args);
+
+	return (i >= ssize) ? (ssize - 1) : i;
+}
+
+/*
+ * This looks more complex than it should be. But we need to
+ * get the type for the ~ right in round_down (it needs to be
+ * as wide as the result!), and we want to evaluate the macro
+ * arguments just once each.
+ */
+#define __round_mask(x, y) ((__typeof__(x))((y)-1))
+#define round_up(x, y) ((((x)-1) | __round_mask(x, y))+1)
+#define round_down(x, y) ((x) & ~__round_mask(x, y))
+
+#endif
diff --git a/tools/perf/MANIFEST b/tools/perf/MANIFEST
index a83cf75164e1..fce4a47347aa 100644
--- a/tools/perf/MANIFEST
+++ b/tools/perf/MANIFEST
@@ -40,6 +40,7 @@ tools/include/linux/bitops.h
 tools/include/linux/compiler.h
 tools/include/linux/export.h
 tools/include/linux/hash.h
+tools/include/linux/kernel.h
 tools/include/linux/log2.h
 tools/include/linux/types.h
 include/asm-generic/bitops/arch_hweight.h
diff --git a/tools/perf/util/include/linux/kernel.h b/tools/perf/util/include/linux/kernel.h
deleted file mode 100644
index 09e8e7aea7c6..000000000000
--- a/tools/perf/util/include/linux/kernel.h
+++ /dev/null
@@ -1,107 +0,0 @@
-#ifndef PERF_LINUX_KERNEL_H_
-#define PERF_LINUX_KERNEL_H_
-
-#include <stdarg.h>
-#include <stdio.h>
-#include <stdlib.h>
-#include <assert.h>
-
-#define DIV_ROUND_UP(n,d) (((n) + (d) - 1) / (d))
-
-#define PERF_ALIGN(x, a)	__PERF_ALIGN_MASK(x, (typeof(x))(a)-1)
-#define __PERF_ALIGN_MASK(x, mask)	(((x)+(mask))&~(mask))
-
-#ifndef offsetof
-#define offsetof(TYPE, MEMBER) ((size_t) &((TYPE *)0)->MEMBER)
-#endif
-
-#ifndef container_of
-/**
- * container_of - cast a member of a structure out to the containing structure
- * @ptr:	the pointer to the member.
- * @type:	the type of the container struct this is embedded in.
- * @member:	the name of the member within the struct.
- *
- */
-#define container_of(ptr, type, member) ({			\
-	const typeof(((type *)0)->member) * __mptr = (ptr);	\
-	(type *)((char *)__mptr - offsetof(type, member)); })
-#endif
-
-#define BUILD_BUG_ON_ZERO(e) (sizeof(struct { int:-!!(e); }))
-
-#ifndef max
-#define max(x, y) ({				\
-	typeof(x) _max1 = (x);			\
-	typeof(y) _max2 = (y);			\
-	(void) (&_max1 == &_max2);		\
-	_max1 > _max2 ? _max1 : _max2; })
-#endif
-
-#ifndef min
-#define min(x, y) ({				\
-	typeof(x) _min1 = (x);			\
-	typeof(y) _min2 = (y);			\
-	(void) (&_min1 == &_min2);		\
-	_min1 < _min2 ? _min1 : _min2; })
-#endif
-
-#ifndef roundup
-#define roundup(x, y) (                                \
-{                                                      \
-	const typeof(y) __y = y;		       \
-	(((x) + (__y - 1)) / __y) * __y;	       \
-}                                                      \
-)
-#endif
-
-#ifndef BUG_ON
-#ifdef NDEBUG
-#define BUG_ON(cond) do { if (cond) {} } while (0)
-#else
-#define BUG_ON(cond) assert(!(cond))
-#endif
-#endif
-
-/*
- * Both need more care to handle endianness
- * (Don't use bitmap_copy_le() for now)
- */
-#define cpu_to_le64(x)	(x)
-#define cpu_to_le32(x)	(x)
-
-static inline int
-vscnprintf(char *buf, size_t size, const char *fmt, va_list args)
-{
-	int i;
-	ssize_t ssize = size;
-
-	i = vsnprintf(buf, size, fmt, args);
-
-	return (i >= ssize) ? (ssize - 1) : i;
-}
-
-static inline int scnprintf(char * buf, size_t size, const char * fmt, ...)
-{
-	va_list args;
-	ssize_t ssize = size;
-	int i;
-
-	va_start(args, fmt);
-	i = vsnprintf(buf, size, fmt, args);
-	va_end(args);
-
-	return (i >= ssize) ? (ssize - 1) : i;
-}
-
-/*
- * This looks more complex than it should be. But we need to
- * get the type for the ~ right in round_down (it needs to be
- * as wide as the result!), and we want to evaluate the macro
- * arguments just once each.
- */
-#define __round_mask(x, y) ((__typeof__(x))((y)-1))
-#define round_up(x, y) ((((x)-1) | __round_mask(x, y))+1)
-#define round_down(x, y) ((x) & ~__round_mask(x, y))
-
-#endif
-- 
2.1.0


^ permalink raw reply related	[flat|nested] 43+ messages in thread

* [PATCH 5/6] tools: Move tools/perf/util/include/linux/{list.h,poison.h} to tools/include
  2015-06-03 22:40 [GIT PULL 0/6] perf/core improvements and fixes Arnaldo Carvalho de Melo
                   ` (3 preceding siblings ...)
  2015-06-03 22:40 ` [PATCH 4/6] perf tools: Move linux/kernel.h to tools/include Arnaldo Carvalho de Melo
@ 2015-06-03 22:40 ` Arnaldo Carvalho de Melo
  2015-06-03 22:40 ` [PATCH 6/6] perf tools: Deal with kernel module names in '[]' correctly Arnaldo Carvalho de Melo
  2015-06-04  5:48 ` [GIT PULL 0/6] perf/core improvements and fixes Ingo Molnar
  6 siblings, 0 replies; 43+ messages in thread
From: Arnaldo Carvalho de Melo @ 2015-06-03 22:40 UTC (permalink / raw)
  To: Ingo Molnar
  Cc: linux-kernel, Wang Nan, Alexei Starovoitov, Brendan Gregg,
	Daniel Borkmann, David Ahern, He Kuang, Jiri Olsa, Kaixu Xia,
	Masami Hiramatsu, Namhyung Kim, Peter Zijlstra, Zefan Li,
	pi3orama, Arnaldo Carvalho de Melo

From: Wang Nan <wangnan0@huawei.com>

This patch moves list.h from tools/perf/util/include/linux/list.h to
tools/include/linux/list.h to enable other libraries use macros in it,
like libbpf which will be introduced by further patches. Since list.h
depend on poison.h, poison.h is also moved.

Both file use relative path, so one '..' is removed for each header to
make them suit for new directory.

MANIFEST is also updated for 'make perf-*-src-pkg'.

Signed-off-by: Wang Nan <wangnan0@huawei.com>
Cc: Alexei Starovoitov <alexei.starovoitov@gmail.com>
Cc: Brendan Gregg <brendan.d.gregg@gmail.com>
Cc: Daniel Borkmann <daniel@iogearbox.net>
Cc: David Ahern <dsahern@gmail.com>
Cc: He Kuang <hekuang@huawei.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Cc: Kaixu Xia <xiakaixu@huawei.com>
Cc: Masami Hiramatsu <masami.hiramatsu.pt@hitachi.com>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>
Cc: Zefan Li <lizefan@huawei.com>
Cc: pi3orama@163.com
Link: http://lkml.kernel.org/r/1433144296-74992-3-git-send-email-wangnan0@huawei.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
---
 tools/include/linux/list.h             | 29 +++++++++++++++++++++++++++++
 tools/include/linux/poison.h           |  1 +
 tools/perf/MANIFEST                    |  2 ++
 tools/perf/util/include/linux/list.h   | 29 -----------------------------
 tools/perf/util/include/linux/poison.h |  1 -
 5 files changed, 32 insertions(+), 30 deletions(-)
 create mode 100644 tools/include/linux/list.h
 create mode 100644 tools/include/linux/poison.h
 delete mode 100644 tools/perf/util/include/linux/list.h
 delete mode 100644 tools/perf/util/include/linux/poison.h

diff --git a/tools/include/linux/list.h b/tools/include/linux/list.h
new file mode 100644
index 000000000000..76b014c96893
--- /dev/null
+++ b/tools/include/linux/list.h
@@ -0,0 +1,29 @@
+#include <linux/kernel.h>
+#include <linux/types.h>
+
+#include "../../../include/linux/list.h"
+
+#ifndef TOOLS_LIST_H
+#define TOOLS_LIST_H
+/**
+ * list_del_range - deletes range of entries from list.
+ * @begin: first element in the range to delete from the list.
+ * @end: last element in the range to delete from the list.
+ * Note: list_empty on the range of entries does not return true after this,
+ * the entries is in an undefined state.
+ */
+static inline void list_del_range(struct list_head *begin,
+				  struct list_head *end)
+{
+	begin->prev->next = end->next;
+	end->next->prev = begin->prev;
+}
+
+/**
+ * list_for_each_from	-	iterate over a list from one of its nodes
+ * @pos:  the &struct list_head to use as a loop cursor, from where to start
+ * @head: the head for your list.
+ */
+#define list_for_each_from(pos, head) \
+	for (; pos != (head); pos = pos->next)
+#endif
diff --git a/tools/include/linux/poison.h b/tools/include/linux/poison.h
new file mode 100644
index 000000000000..0c27bdf14233
--- /dev/null
+++ b/tools/include/linux/poison.h
@@ -0,0 +1 @@
+#include "../../../include/linux/poison.h"
diff --git a/tools/perf/MANIFEST b/tools/perf/MANIFEST
index fce4a47347aa..a0bdd6124583 100644
--- a/tools/perf/MANIFEST
+++ b/tools/perf/MANIFEST
@@ -41,7 +41,9 @@ tools/include/linux/compiler.h
 tools/include/linux/export.h
 tools/include/linux/hash.h
 tools/include/linux/kernel.h
+tools/include/linux/list.h
 tools/include/linux/log2.h
+tools/include/linux/poison.h
 tools/include/linux/types.h
 include/asm-generic/bitops/arch_hweight.h
 include/asm-generic/bitops/const_hweight.h
diff --git a/tools/perf/util/include/linux/list.h b/tools/perf/util/include/linux/list.h
deleted file mode 100644
index 76ddbc726343..000000000000
--- a/tools/perf/util/include/linux/list.h
+++ /dev/null
@@ -1,29 +0,0 @@
-#include <linux/kernel.h>
-#include <linux/types.h>
-
-#include "../../../../include/linux/list.h"
-
-#ifndef PERF_LIST_H
-#define PERF_LIST_H
-/**
- * list_del_range - deletes range of entries from list.
- * @begin: first element in the range to delete from the list.
- * @end: last element in the range to delete from the list.
- * Note: list_empty on the range of entries does not return true after this,
- * the entries is in an undefined state.
- */
-static inline void list_del_range(struct list_head *begin,
-				  struct list_head *end)
-{
-	begin->prev->next = end->next;
-	end->next->prev = begin->prev;
-}
-
-/**
- * list_for_each_from	-	iterate over a list from one of its nodes
- * @pos:  the &struct list_head to use as a loop cursor, from where to start
- * @head: the head for your list.
- */
-#define list_for_each_from(pos, head) \
-	for (; pos != (head); pos = pos->next)
-#endif
diff --git a/tools/perf/util/include/linux/poison.h b/tools/perf/util/include/linux/poison.h
deleted file mode 100644
index fef6dbc9ce13..000000000000
--- a/tools/perf/util/include/linux/poison.h
+++ /dev/null
@@ -1 +0,0 @@
-#include "../../../../include/linux/poison.h"
-- 
2.1.0


^ permalink raw reply related	[flat|nested] 43+ messages in thread

* [PATCH 6/6] perf tools: Deal with kernel module names in '[]' correctly
  2015-06-03 22:40 [GIT PULL 0/6] perf/core improvements and fixes Arnaldo Carvalho de Melo
                   ` (4 preceding siblings ...)
  2015-06-03 22:40 ` [PATCH 5/6] tools: Move tools/perf/util/include/linux/{list.h,poison.h} " Arnaldo Carvalho de Melo
@ 2015-06-03 22:40 ` Arnaldo Carvalho de Melo
  2015-06-04  5:48 ` [GIT PULL 0/6] perf/core improvements and fixes Ingo Molnar
  6 siblings, 0 replies; 43+ messages in thread
From: Arnaldo Carvalho de Melo @ 2015-06-03 22:40 UTC (permalink / raw)
  To: Ingo Molnar
  Cc: linux-kernel, Wang Nan, Namhyung Kim, Zefan Li, Arnaldo Carvalho de Melo

From: Wang Nan <wangnan0@huawei.com>

Before patch ba92732e9808 ('perf kmaps: Check kmaps to make code more
robust'), 'perf report' and 'perf annotate' will segfault if trace data
contains kernel module information like this:

 # perf report -D -i ./perf.data
 ...
 0 0 0x188 [0x50]: PERF_RECORD_MMAP -1/0: [0xffffffbff1018000(0xf068000) @ 0]: x [test_module]
 ...

 # perf report -i ./perf.data --objdump=/path/to/objdump --kallsyms=/path/to/kallsyms

 perf: Segmentation fault
 -------- backtrace --------
 /path/to/perf[0x503478]
 /lib64/libc.so.6(+0x3545f)[0x7fb201f3745f]
 /path/to/perf[0x499b56]
 /path/to/perf(dso__load_kallsyms+0x13c)[0x49b56c]
 /path/to/perf(dso__load+0x72e)[0x49c21e]
 /path/to/perf(map__load+0x6e)[0x4ae9ee]
 /path/to/perf(thread__find_addr_map+0x24c)[0x47deec]
 /path/to/perf(perf_event__preprocess_sample+0x88)[0x47e238]
 /path/to/perf[0x43ad02]
 /path/to/perf[0x4b55bc]
 /path/to/perf(ordered_events__flush+0xca)[0x4b57ea]
 /path/to/perf[0x4b1a01]
 /path/to/perf(perf_session__process_events+0x3be)[0x4b428e]
 /path/to/perf(cmd_report+0xf11)[0x43bfc1]
 /path/to/perf[0x474702]
 /path/to/perf(main+0x5f5)[0x42de95]
 /lib64/libc.so.6(__libc_start_main+0xf4)[0x7fb201f23bd4]
 /path/to/perf[0x42dfc4]

This is because __kmod_path__parse treats '[' leading names as kernel
name instead of names of kernel module.

If perf.data contains build information and the buildid of such modules
can be found, the dso->kernel of it will be set to DSO_TYPE_KERNEL by
__event_process_build_id(), not kernel module.

It will then be passed to dso__load() -> dso__load_kernel_sym() ->
dso__load_kcore() if --kallsyms is provided.

The refered patch adds NULL pointer checker to avoid segfault. However,
such kernel modules are still processed incorrectly.

This patch fixes __kmod_path__parse, makes it treat names like
'[test_module]' as kernel modules.

kmod-path.c is also update to reflect the above changes.

Signed-off-by: Wang Nan <wangnan0@huawei.com>
Acked-by: Jiri Olsa <jolsa@kernel.org>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Zefan Li <lizefan@huawei.com>
Link: http://lkml.kernel.org/r/1433321541-170245-1-git-send-email-wangnan0@huawei.com
[ Fixed the merged with 0443f36b0de0 ("perf machine: Fix the search
  for the kernel DSO on the unified list" ]
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
---
 tools/perf/tests/kmod-path.c | 72 ++++++++++++++++++++++++++++++++++++++++++++
 tools/perf/util/dso.c        | 47 ++++++++++++++++++++++++++---
 tools/perf/util/dso.h        |  2 +-
 tools/perf/util/header.c     |  8 ++---
 tools/perf/util/machine.c    | 22 +++++++++++++-
 5 files changed, 140 insertions(+), 11 deletions(-)

diff --git a/tools/perf/tests/kmod-path.c b/tools/perf/tests/kmod-path.c
index e8d7cbb9320c..08c433b4bf4f 100644
--- a/tools/perf/tests/kmod-path.c
+++ b/tools/perf/tests/kmod-path.c
@@ -34,9 +34,21 @@ static int test(const char *path, bool alloc_name, bool alloc_ext,
 	return 0;
 }
 
+static int test_is_kernel_module(const char *path, int cpumode, bool expect)
+{
+	TEST_ASSERT_VAL("is_kernel_module",
+			(!!is_kernel_module(path, cpumode)) == (!!expect));
+	pr_debug("%s (cpumode: %d) - is_kernel_module: %s\n",
+			path, cpumode, expect ? "true" : "false");
+	return 0;
+}
+
 #define T(path, an, ae, k, c, n, e) \
 	TEST_ASSERT_VAL("failed", !test(path, an, ae, k, c, n, e))
 
+#define M(path, c, e) \
+	TEST_ASSERT_VAL("failed", !test_is_kernel_module(path, c, e))
+
 int test__kmod_path__parse(void)
 {
 	/* path                alloc_name  alloc_ext   kmod  comp   name     ext */
@@ -44,30 +56,90 @@ int test__kmod_path__parse(void)
 	T("/xxxx/xxxx/x-x.ko", false     , true      , true, false, NULL   , NULL);
 	T("/xxxx/xxxx/x-x.ko", true      , false     , true, false, "[x_x]", NULL);
 	T("/xxxx/xxxx/x-x.ko", false     , false     , true, false, NULL   , NULL);
+	M("/xxxx/xxxx/x-x.ko", PERF_RECORD_MISC_CPUMODE_UNKNOWN, true);
+	M("/xxxx/xxxx/x-x.ko", PERF_RECORD_MISC_KERNEL, true);
+	M("/xxxx/xxxx/x-x.ko", PERF_RECORD_MISC_USER, false);
 
 	/* path                alloc_name  alloc_ext   kmod  comp  name   ext */
 	T("/xxxx/xxxx/x.ko.gz", true     , true      , true, true, "[x]", "gz");
 	T("/xxxx/xxxx/x.ko.gz", false    , true      , true, true, NULL , "gz");
 	T("/xxxx/xxxx/x.ko.gz", true     , false     , true, true, "[x]", NULL);
 	T("/xxxx/xxxx/x.ko.gz", false    , false     , true, true, NULL , NULL);
+	M("/xxxx/xxxx/x.ko.gz", PERF_RECORD_MISC_CPUMODE_UNKNOWN, true);
+	M("/xxxx/xxxx/x.ko.gz", PERF_RECORD_MISC_KERNEL, true);
+	M("/xxxx/xxxx/x.ko.gz", PERF_RECORD_MISC_USER, false);
 
 	/* path              alloc_name  alloc_ext  kmod   comp  name    ext */
 	T("/xxxx/xxxx/x.gz", true      , true     , false, true, "x.gz" ,"gz");
 	T("/xxxx/xxxx/x.gz", false     , true     , false, true, NULL   ,"gz");
 	T("/xxxx/xxxx/x.gz", true      , false    , false, true, "x.gz" , NULL);
 	T("/xxxx/xxxx/x.gz", false     , false    , false, true, NULL   , NULL);
+	M("/xxxx/xxxx/x.gz", PERF_RECORD_MISC_CPUMODE_UNKNOWN, false);
+	M("/xxxx/xxxx/x.gz", PERF_RECORD_MISC_KERNEL, false);
+	M("/xxxx/xxxx/x.gz", PERF_RECORD_MISC_USER, false);
 
 	/* path   alloc_name  alloc_ext  kmod   comp  name     ext */
 	T("x.gz", true      , true     , false, true, "x.gz", "gz");
 	T("x.gz", false     , true     , false, true, NULL  , "gz");
 	T("x.gz", true      , false    , false, true, "x.gz", NULL);
 	T("x.gz", false     , false    , false, true, NULL  , NULL);
+	M("x.gz", PERF_RECORD_MISC_CPUMODE_UNKNOWN, false);
+	M("x.gz", PERF_RECORD_MISC_KERNEL, false);
+	M("x.gz", PERF_RECORD_MISC_USER, false);
 
 	/* path      alloc_name  alloc_ext  kmod  comp  name  ext */
 	T("x.ko.gz", true      , true     , true, true, "[x]", "gz");
 	T("x.ko.gz", false     , true     , true, true, NULL , "gz");
 	T("x.ko.gz", true      , false    , true, true, "[x]", NULL);
 	T("x.ko.gz", false     , false    , true, true, NULL , NULL);
+	M("x.ko.gz", PERF_RECORD_MISC_CPUMODE_UNKNOWN, true);
+	M("x.ko.gz", PERF_RECORD_MISC_KERNEL, true);
+	M("x.ko.gz", PERF_RECORD_MISC_USER, false);
+
+	/* path            alloc_name  alloc_ext  kmod  comp   name             ext */
+	T("[test_module]", true      , true     , true, false, "[test_module]", NULL);
+	T("[test_module]", false     , true     , true, false, NULL           , NULL);
+	T("[test_module]", true      , false    , true, false, "[test_module]", NULL);
+	T("[test_module]", false     , false    , true, false, NULL           , NULL);
+	M("[test_module]", PERF_RECORD_MISC_CPUMODE_UNKNOWN, true);
+	M("[test_module]", PERF_RECORD_MISC_KERNEL, true);
+	M("[test_module]", PERF_RECORD_MISC_USER, false);
+
+	/* path            alloc_name  alloc_ext  kmod  comp   name             ext */
+	T("[test.module]", true      , true     , true, false, "[test.module]", NULL);
+	T("[test.module]", false     , true     , true, false, NULL           , NULL);
+	T("[test.module]", true      , false    , true, false, "[test.module]", NULL);
+	T("[test.module]", false     , false    , true, false, NULL           , NULL);
+	M("[test.module]", PERF_RECORD_MISC_CPUMODE_UNKNOWN, true);
+	M("[test.module]", PERF_RECORD_MISC_KERNEL, true);
+	M("[test.module]", PERF_RECORD_MISC_USER, false);
+
+	/* path     alloc_name  alloc_ext  kmod   comp   name      ext */
+	T("[vdso]", true      , true     , false, false, "[vdso]", NULL);
+	T("[vdso]", false     , true     , false, false, NULL    , NULL);
+	T("[vdso]", true      , false    , false, false, "[vdso]", NULL);
+	T("[vdso]", false     , false    , false, false, NULL    , NULL);
+	M("[vdso]", PERF_RECORD_MISC_CPUMODE_UNKNOWN, false);
+	M("[vdso]", PERF_RECORD_MISC_KERNEL, false);
+	M("[vdso]", PERF_RECORD_MISC_USER, false);
+
+	/* path         alloc_name  alloc_ext  kmod   comp   name          ext */
+	T("[vsyscall]", true      , true     , false, false, "[vsyscall]", NULL);
+	T("[vsyscall]", false     , true     , false, false, NULL        , NULL);
+	T("[vsyscall]", true      , false    , false, false, "[vsyscall]", NULL);
+	T("[vsyscall]", false     , false    , false, false, NULL        , NULL);
+	M("[vsyscall]", PERF_RECORD_MISC_CPUMODE_UNKNOWN, false);
+	M("[vsyscall]", PERF_RECORD_MISC_KERNEL, false);
+	M("[vsyscall]", PERF_RECORD_MISC_USER, false);
+
+	/* path                alloc_name  alloc_ext  kmod   comp   name      ext */
+	T("[kernel.kallsyms]", true      , true     , false, false, "[kernel.kallsyms]", NULL);
+	T("[kernel.kallsyms]", false     , true     , false, false, NULL               , NULL);
+	T("[kernel.kallsyms]", true      , false    , false, false, "[kernel.kallsyms]", NULL);
+	T("[kernel.kallsyms]", false     , false    , false, false, NULL               , NULL);
+	M("[kernel.kallsyms]", PERF_RECORD_MISC_CPUMODE_UNKNOWN, false);
+	M("[kernel.kallsyms]", PERF_RECORD_MISC_KERNEL, false);
+	M("[kernel.kallsyms]", PERF_RECORD_MISC_USER, false);
 
 	return 0;
 }
diff --git a/tools/perf/util/dso.c b/tools/perf/util/dso.c
index b335db3532a2..5ec9e892c89b 100644
--- a/tools/perf/util/dso.c
+++ b/tools/perf/util/dso.c
@@ -166,12 +166,28 @@ bool is_supported_compression(const char *ext)
 	return false;
 }
 
-bool is_kernel_module(const char *pathname)
+bool is_kernel_module(const char *pathname, int cpumode)
 {
 	struct kmod_path m;
-
-	if (kmod_path__parse(&m, pathname))
-		return NULL;
+	int mode = cpumode & PERF_RECORD_MISC_CPUMODE_MASK;
+
+	WARN_ONCE(mode != cpumode,
+		  "Internal error: passing unmasked cpumode (%x) to is_kernel_module",
+		  cpumode);
+
+	switch (mode) {
+	case PERF_RECORD_MISC_USER:
+	case PERF_RECORD_MISC_HYPERVISOR:
+	case PERF_RECORD_MISC_GUEST_USER:
+		return false;
+	/* Treat PERF_RECORD_MISC_CPUMODE_UNKNOWN as kernel */
+	default:
+		if (kmod_path__parse(&m, pathname)) {
+			pr_err("Failed to check whether %s is a kernel module or not. Assume it is.",
+					pathname);
+			return true;
+		}
+	}
 
 	return m.kmod;
 }
@@ -215,12 +231,33 @@ int __kmod_path__parse(struct kmod_path *m, const char *path,
 {
 	const char *name = strrchr(path, '/');
 	const char *ext  = strrchr(path, '.');
+	bool is_simple_name = false;
 
 	memset(m, 0x0, sizeof(*m));
 	name = name ? name + 1 : path;
 
+	/*
+	 * '.' is also a valid character for module name. For example:
+	 * [aaa.bbb] is a valid module name. '[' should have higher
+	 * priority than '.ko' suffix.
+	 *
+	 * The kernel names are from machine__mmap_name. Such
+	 * name should belong to kernel itself, not kernel module.
+	 */
+	if (name[0] == '[') {
+		is_simple_name = true;
+		if ((strncmp(name, "[kernel.kallsyms]", 17) == 0) ||
+		    (strncmp(name, "[guest.kernel.kallsyms", 22) == 0) ||
+		    (strncmp(name, "[vdso]", 6) == 0) ||
+		    (strncmp(name, "[vsyscall]", 10) == 0)) {
+			m->kmod = false;
+
+		} else
+			m->kmod = true;
+	}
+
 	/* No extension, just return name. */
-	if (ext == NULL) {
+	if ((ext == NULL) || is_simple_name) {
 		if (alloc_name) {
 			m->name = strdup(name);
 			return m->name ? 0 : -ENOMEM;
diff --git a/tools/perf/util/dso.h b/tools/perf/util/dso.h
index 24a507a54147..ba2d90ed881f 100644
--- a/tools/perf/util/dso.h
+++ b/tools/perf/util/dso.h
@@ -220,7 +220,7 @@ char dso__symtab_origin(const struct dso *dso);
 int dso__read_binary_type_filename(const struct dso *dso, enum dso_binary_type type,
 				   char *root_dir, char *filename, size_t size);
 bool is_supported_compression(const char *ext);
-bool is_kernel_module(const char *pathname);
+bool is_kernel_module(const char *pathname, int cpumode);
 bool decompress_to_file(const char *ext, const char *filename, int output_fd);
 bool dso__needs_decompress(struct dso *dso);
 
diff --git a/tools/perf/util/header.c b/tools/perf/util/header.c
index 851143a7988d..ac5aaaeed7ff 100644
--- a/tools/perf/util/header.c
+++ b/tools/perf/util/header.c
@@ -1239,7 +1239,7 @@ static int __event_process_build_id(struct build_id_event *bev,
 {
 	int err = -1;
 	struct machine *machine;
-	u16 misc;
+	u16 cpumode;
 	struct dso *dso;
 	enum dso_kernel_type dso_type;
 
@@ -1247,9 +1247,9 @@ static int __event_process_build_id(struct build_id_event *bev,
 	if (!machine)
 		goto out;
 
-	misc = bev->header.misc & PERF_RECORD_MISC_CPUMODE_MASK;
+	cpumode = bev->header.misc & PERF_RECORD_MISC_CPUMODE_MASK;
 
-	switch (misc) {
+	switch (cpumode) {
 	case PERF_RECORD_MISC_KERNEL:
 		dso_type = DSO_TYPE_KERNEL;
 		break;
@@ -1270,7 +1270,7 @@ static int __event_process_build_id(struct build_id_event *bev,
 
 		dso__set_build_id(dso, &bev->build_id);
 
-		if (!is_kernel_module(filename))
+		if (!is_kernel_module(filename, cpumode))
 			dso->kernel = dso_type;
 
 		build_id__sprintf(dso->build_id, sizeof(dso->build_id),
diff --git a/tools/perf/util/machine.c b/tools/perf/util/machine.c
index 4e29e80932e5..9e02c86f39f5 100644
--- a/tools/perf/util/machine.c
+++ b/tools/perf/util/machine.c
@@ -1149,9 +1149,29 @@ static int machine__process_kernel_mmap_event(struct machine *machine,
 		struct dso *dso;
 
 		list_for_each_entry(dso, &machine->dsos.head, node) {
-			if (!dso->kernel || is_kernel_module(dso->long_name))
+
+			/*
+			 * The cpumode passed to is_kernel_module is not the
+			 * cpumode of *this* event. If we insist on passing
+			 * correct cpumode to is_kernel_module, we should
+			 * record the cpumode when we adding this dso to the
+			 * linked list.
+			 *
+			 * However we don't really need passing correct
+			 * cpumode.  We know the correct cpumode must be kernel
+			 * mode (if not, we should not link it onto kernel_dsos
+			 * list).
+			 *
+			 * Therefore, we pass PERF_RECORD_MISC_CPUMODE_UNKNOWN.
+			 * is_kernel_module() treats it as a kernel cpumode.
+			 */
+
+			if (!dso->kernel ||
+			    is_kernel_module(dso->long_name,
+					     PERF_RECORD_MISC_CPUMODE_UNKNOWN))
 				continue;
 
+
 			kernel = dso;
 			break;
 		}
-- 
2.1.0


^ permalink raw reply related	[flat|nested] 43+ messages in thread

* Re: [GIT PULL 0/6] perf/core improvements and fixes
  2015-06-03 22:40 [GIT PULL 0/6] perf/core improvements and fixes Arnaldo Carvalho de Melo
                   ` (5 preceding siblings ...)
  2015-06-03 22:40 ` [PATCH 6/6] perf tools: Deal with kernel module names in '[]' correctly Arnaldo Carvalho de Melo
@ 2015-06-04  5:48 ` Ingo Molnar
  2015-06-04  6:07   ` Wangnan (F)
  6 siblings, 1 reply; 43+ messages in thread
From: Ingo Molnar @ 2015-06-04  5:48 UTC (permalink / raw)
  To: Arnaldo Carvalho de Melo
  Cc: linux-kernel, Adrian Hunter, Alexei Starovoitov, Brendan Gregg,
	Daniel Borkmann, David Ahern, He Kuang, Jiri Olsa, Kaixu Xia,
	Madhavan Srinivasan, Masami Hiramatsu, Michael Ellerman,
	Namhyung Kim, Peter Zijlstra, pi3orama, Sukadev Bhattiprolu,
	Wang Nan, Zefan Li, Arnaldo Carvalho de Melo


* Arnaldo Carvalho de Melo <acme@kernel.org> wrote:

> Hi Ingo,
> 
> 	Please consider applying.
> 
> 	One of the next requests probably will have the eBPF work by Wang Nan,
> but I am still going thru it and want to test it thoroughly.
> 
> 	BTW: Have you looked at it lately? It is at:
> 
> http://lkml.kernel.org/r/1433144296-74992-1-git-send-email-wangnan0@huawei.com
> 
> Super summary from the above cover letter:
> 
> ---------------------
> It enables 'perf record' to filter events using eBPF programs like:
> 
>  # perf record --event bpf-file.o sleep 1
> 
> Events are selected and filtered according to definitions in bpf-file.o.

Looks useful, but I think the UI needs one more tweak: could you fix it to be able 
to filter based on the eBPF _source_ file, not just the object file?

People want to tweak such filters as they profile, so we should use the eBPF 
source code as the primary interface. We can compile it internally to the .o just 
fine. The .o file is a totally uninteresting intermediate product in itself.

I.e. we need to first think through such profiling workflows from beginning to end 
before allowing them upstream.

> ---------------------
> 
> 	The first two patches from that series are in this pull req, as
> they just move stuff into tools/include/linux/ from tools/perf/include.
> 
> Regards,
> 
> - Arnaldo
> 
> The following changes since commit 5c9b9bc67c684e40b3a5e7e9facde0fb7200cd8c:
> 
>   Merge tag 'perf-core-for-mingo' of git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux into perf/core (2015-05-29 20:19:02 +0200)
> 
> are available in the git repository at:
> 
>   git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux.git tags/perf-core-for-mingo
> 
> for you to fetch changes up to 1f121b03d058dd07199d8924373d3c52a207f63b:
> 
>   perf tools: Deal with kernel module names in '[]' correctly (2015-06-03 10:02:38 -0300)
> 
> ----------------------------------------------------------------
> perf/core improvements and fixes:
> 
> User visible:
> 
> - Fix 'perf probe' segfault when glob matching function without debuginfo (Wang Nan)
> 
> - Remove newline char when reading event scale and unit (Madhavan Srinivasan)
> 
> - Deal with kernel module names in '[]' correctly (Wang Nan)
> 
> Infrastructure:
> 
> - Fix the search for the kernel DSO on the unified list (Arnaldo Carvalho de Melo)
> 
> - Move tools/perf/util/include/linux/{kernel.h,list.h,poison.h} to tools/include,
>   to be used in tools/lib/bpf/ (Wang Nan)
> 
> Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
> 
> ----------------------------------------------------------------
> Arnaldo Carvalho de Melo (1):
>       perf machine: Fix the search for the kernel DSO on the unified list
> 
> Madhavan Srinivasan (1):
>       perf tools: Remove newline char when reading event scale and unit
> 
> Wang Nan (4):
>       perf probe: Fix segfault when glob matching function without debuginfo
>       perf tools: Move linux/kernel.h to tools/include
>       tools: Move tools/perf/util/include/linux/{list.h,poison.h} to tools/include
>       perf tools: Deal with kernel module names in '[]' correctly
> 
>  tools/{perf/util => }/include/linux/kernel.h |  4 +-
>  tools/{perf/util => }/include/linux/list.h   |  6 +--
>  tools/include/linux/poison.h                 |  1 +
>  tools/perf/MANIFEST                          |  3 ++
>  tools/perf/tests/kmod-path.c                 | 72 ++++++++++++++++++++++++++++
>  tools/perf/util/dso.c                        | 47 ++++++++++++++++--
>  tools/perf/util/dso.h                        |  2 +-
>  tools/perf/util/header.c                     |  8 ++--
>  tools/perf/util/include/linux/poison.h       |  1 -
>  tools/perf/util/machine.c                    | 22 ++++++++-
>  tools/perf/util/pmu.c                        | 11 ++++-
>  tools/perf/util/probe-event.c                | 26 ++++++++--
>  12 files changed, 179 insertions(+), 24 deletions(-)
>  rename tools/{perf/util => }/include/linux/kernel.h (97%)
>  rename tools/{perf/util => }/include/linux/list.h (90%)
>  create mode 100644 tools/include/linux/poison.h
>  delete mode 100644 tools/perf/util/include/linux/poison.h

Pulled, thanks a lot Arnaldo!

	Ingo

^ permalink raw reply	[flat|nested] 43+ messages in thread

* Re: [GIT PULL 0/6] perf/core improvements and fixes
  2015-06-04  5:48 ` [GIT PULL 0/6] perf/core improvements and fixes Ingo Molnar
@ 2015-06-04  6:07   ` Wangnan (F)
  2015-06-04  7:21     ` Ingo Molnar
  2015-06-04 10:17     ` [EXPERIENCE] My experience on using perf record BPF filter on a real usecase Wangnan (F)
  0 siblings, 2 replies; 43+ messages in thread
From: Wangnan (F) @ 2015-06-04  6:07 UTC (permalink / raw)
  To: Ingo Molnar, Arnaldo Carvalho de Melo
  Cc: linux-kernel, Adrian Hunter, Alexei Starovoitov, Brendan Gregg,
	Daniel Borkmann, David Ahern, He Kuang, Jiri Olsa, Kaixu Xia,
	Madhavan Srinivasan, Masami Hiramatsu, Michael Ellerman,
	Namhyung Kim, Peter Zijlstra, pi3orama, Sukadev Bhattiprolu,
	Zefan Li, Arnaldo Carvalho de Melo, Alexei Starovoitov,
	Alexei Starovoitov



On 2015/6/4 13:48, Ingo Molnar wrote:
> * Arnaldo Carvalho de Melo <acme@kernel.org> wrote:
>
>> Hi Ingo,
>>
>> 	Please consider applying.
>>
>> 	One of the next requests probably will have the eBPF work by Wang Nan,
>> but I am still going thru it and want to test it thoroughly.
>>
>> 	BTW: Have you looked at it lately? It is at:
>>
>> http://lkml.kernel.org/r/1433144296-74992-1-git-send-email-wangnan0@huawei.com
>>
>> Super summary from the above cover letter:
>>
>> ---------------------
>> It enables 'perf record' to filter events using eBPF programs like:
>>
>>   # perf record --event bpf-file.o sleep 1
>>
>> Events are selected and filtered according to definitions in bpf-file.o.
> Looks useful, but I think the UI needs one more tweak: could you fix it to be able
> to filter based on the eBPF _source_ file, not just the object file?
>
> People want to tweak such filters as they profile, so we should use the eBPF
> source code as the primary interface. We can compile it internally to the .o just
> fine. The .o file is a totally uninteresting intermediate product in itself.
>
> I.e. we need to first think through such profiling workflows from beginning to end
> before allowing them upstream.

In a private mail Alexei Starovoitov disscussed with me about this. He 
said that he is working
on a shared object which can compile C program into BPF bytecode on the 
fly. After he done his
work, I think perf can support dtrace-like profiling that, users will be 
able to feed source
code to perf directly on cmdline. He said he can release it on June. I 
added him to the CC-list.

However I think the '.o' intermediate is still needed. I'd like to share 
a real profiling
experience using eBPF today, please keep an eye on it. In my experience, 
since we are using C
instead of dtrace, the code piece could be relative complex. Therefore, 
even if perf is able
to compile the C source on the fly, I think user still need to transfer 
the profiling scripts
to the target machine. Therefor, for him, precompiling and do some 
debugging on a high-end server
then transfer it into target machine (like a smartphone) is tolerable, 
and useful for me.

Thank you.

>> ---------------------
>>
>> 	The first two patches from that series are in this pull req, as
>> they just move stuff into tools/include/linux/ from tools/perf/include.
>>
>> Regards,
>>
>> - Arnaldo
>>
>> The following changes since commit 5c9b9bc67c684e40b3a5e7e9facde0fb7200cd8c:
>>
>>    Merge tag 'perf-core-for-mingo' of git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux into perf/core (2015-05-29 20:19:02 +0200)
>>
>> are available in the git repository at:
>>
>>    git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux.git tags/perf-core-for-mingo
>>
>> for you to fetch changes up to 1f121b03d058dd07199d8924373d3c52a207f63b:
>>
>>    perf tools: Deal with kernel module names in '[]' correctly (2015-06-03 10:02:38 -0300)
>>
>> ----------------------------------------------------------------
>> perf/core improvements and fixes:
>>
>> User visible:
>>
>> - Fix 'perf probe' segfault when glob matching function without debuginfo (Wang Nan)
>>
>> - Remove newline char when reading event scale and unit (Madhavan Srinivasan)
>>
>> - Deal with kernel module names in '[]' correctly (Wang Nan)
>>
>> Infrastructure:
>>
>> - Fix the search for the kernel DSO on the unified list (Arnaldo Carvalho de Melo)
>>
>> - Move tools/perf/util/include/linux/{kernel.h,list.h,poison.h} to tools/include,
>>    to be used in tools/lib/bpf/ (Wang Nan)
>>
>> Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
>>
>> ----------------------------------------------------------------
>> Arnaldo Carvalho de Melo (1):
>>        perf machine: Fix the search for the kernel DSO on the unified list
>>
>> Madhavan Srinivasan (1):
>>        perf tools: Remove newline char when reading event scale and unit
>>
>> Wang Nan (4):
>>        perf probe: Fix segfault when glob matching function without debuginfo
>>        perf tools: Move linux/kernel.h to tools/include
>>        tools: Move tools/perf/util/include/linux/{list.h,poison.h} to tools/include
>>        perf tools: Deal with kernel module names in '[]' correctly
>>
>>   tools/{perf/util => }/include/linux/kernel.h |  4 +-
>>   tools/{perf/util => }/include/linux/list.h   |  6 +--
>>   tools/include/linux/poison.h                 |  1 +
>>   tools/perf/MANIFEST                          |  3 ++
>>   tools/perf/tests/kmod-path.c                 | 72 ++++++++++++++++++++++++++++
>>   tools/perf/util/dso.c                        | 47 ++++++++++++++++--
>>   tools/perf/util/dso.h                        |  2 +-
>>   tools/perf/util/header.c                     |  8 ++--
>>   tools/perf/util/include/linux/poison.h       |  1 -
>>   tools/perf/util/machine.c                    | 22 ++++++++-
>>   tools/perf/util/pmu.c                        | 11 ++++-
>>   tools/perf/util/probe-event.c                | 26 ++++++++--
>>   12 files changed, 179 insertions(+), 24 deletions(-)
>>   rename tools/{perf/util => }/include/linux/kernel.h (97%)
>>   rename tools/{perf/util => }/include/linux/list.h (90%)
>>   create mode 100644 tools/include/linux/poison.h
>>   delete mode 100644 tools/perf/util/include/linux/poison.h
> Pulled, thanks a lot Arnaldo!
>
> 	Ingo



^ permalink raw reply	[flat|nested] 43+ messages in thread

* Re: [GIT PULL 0/6] perf/core improvements and fixes
  2015-06-04  6:07   ` Wangnan (F)
@ 2015-06-04  7:21     ` Ingo Molnar
  2015-06-04 10:00       ` Wangnan (F)
  2015-06-04 10:17     ` [EXPERIENCE] My experience on using perf record BPF filter on a real usecase Wangnan (F)
  1 sibling, 1 reply; 43+ messages in thread
From: Ingo Molnar @ 2015-06-04  7:21 UTC (permalink / raw)
  To: Wangnan (F)
  Cc: Arnaldo Carvalho de Melo, linux-kernel, Adrian Hunter,
	Alexei Starovoitov, Brendan Gregg, Daniel Borkmann, David Ahern,
	He Kuang, Jiri Olsa, Kaixu Xia, Madhavan Srinivasan,
	Masami Hiramatsu, Michael Ellerman, Namhyung Kim, Peter Zijlstra,
	pi3orama, Sukadev Bhattiprolu, Zefan Li,
	Arnaldo Carvalho de Melo, Alexei Starovoitov


* Wangnan (F) <wangnan0@huawei.com> wrote:

> On 2015/6/4 13:48, Ingo Molnar wrote:
> >* Arnaldo Carvalho de Melo <acme@kernel.org> wrote:
> >
> >>Hi Ingo,
> >>
> >>	Please consider applying.
> >>
> >>	One of the next requests probably will have the eBPF work by Wang Nan,
> >>but I am still going thru it and want to test it thoroughly.
> >>
> >>	BTW: Have you looked at it lately? It is at:
> >>
> >>http://lkml.kernel.org/r/1433144296-74992-1-git-send-email-wangnan0@huawei.com
> >>
> >>Super summary from the above cover letter:
> >>
> >>---------------------
> >>It enables 'perf record' to filter events using eBPF programs like:
> >>
> >>  # perf record --event bpf-file.o sleep 1
> >>
> >>Events are selected and filtered according to definitions in bpf-file.o.
> >Looks useful, but I think the UI needs one more tweak: could you fix it to be able
> >to filter based on the eBPF _source_ file, not just the object file?
> >
> >People want to tweak such filters as they profile, so we should use the eBPF
> >source code as the primary interface. We can compile it internally to the .o just
> >fine. The .o file is a totally uninteresting intermediate product in itself.
> >
> >I.e. we need to first think through such profiling workflows from beginning to end
> >before allowing them upstream.
> 
> In a private mail Alexei Starovoitov disscussed with me about this. He said that 
> he is working on a shared object which can compile C program into BPF bytecode 
> on the fly. After he done his work, I think perf can support dtrace-like 
> profiling that, users will be able to feed source code to perf directly on 
> cmdline. He said he can release it on June. I added him to the CC-list.
> 
> However I think the '.o' intermediate is still needed. [...]

So how do you generate the .o? Why cannot the tool, if it sees that the filter 
parameter is eBPF source code, do that automatically?

I.e. you are making the user jump through hoops for no good reason - that's a bad 
UI and a bad workflow. Please don't do that!

Thanks,

	Ingo

^ permalink raw reply	[flat|nested] 43+ messages in thread

* Re: [GIT PULL 0/6] perf/core improvements and fixes
  2015-06-04  7:21     ` Ingo Molnar
@ 2015-06-04 10:00       ` Wangnan (F)
  2015-06-04 12:40         ` Ingo Molnar
  0 siblings, 1 reply; 43+ messages in thread
From: Wangnan (F) @ 2015-06-04 10:00 UTC (permalink / raw)
  To: Ingo Molnar
  Cc: Arnaldo Carvalho de Melo, linux-kernel, Adrian Hunter,
	Alexei Starovoitov, Brendan Gregg, Daniel Borkmann, David Ahern,
	He Kuang, Jiri Olsa, Kaixu Xia, Madhavan Srinivasan,
	Masami Hiramatsu, Michael Ellerman, Namhyung Kim, Peter Zijlstra,
	pi3orama, Sukadev Bhattiprolu, Zefan Li,
	Arnaldo Carvalho de Melo, Alexei Starovoitov



On 2015/6/4 15:21, Ingo Molnar wrote:
> * Wangnan (F) <wangnan0@huawei.com> wrote:
>
>> On 2015/6/4 13:48, Ingo Molnar wrote:
>>> * Arnaldo Carvalho de Melo <acme@kernel.org> wrote:
>>>
>>>> Hi Ingo,
>>>>
>>>> 	Please consider applying.
>>>>
>>>> 	One of the next requests probably will have the eBPF work by Wang Nan,
>>>> but I am still going thru it and want to test it thoroughly.
>>>>
>>>> 	BTW: Have you looked at it lately? It is at:
>>>>
>>>> http://lkml.kernel.org/r/1433144296-74992-1-git-send-email-wangnan0@huawei.com
>>>>
>>>> Super summary from the above cover letter:
>>>>
>>>> ---------------------
>>>> It enables 'perf record' to filter events using eBPF programs like:
>>>>
>>>>   # perf record --event bpf-file.o sleep 1
>>>>
>>>> Events are selected and filtered according to definitions in bpf-file.o.
>>> Looks useful, but I think the UI needs one more tweak: could you fix it to be able
>>> to filter based on the eBPF _source_ file, not just the object file?
>>>
>>> People want to tweak such filters as they profile, so we should use the eBPF
>>> source code as the primary interface. We can compile it internally to the .o just
>>> fine. The .o file is a totally uninteresting intermediate product in itself.
>>>
>>> I.e. we need to first think through such profiling workflows from beginning to end
>>> before allowing them upstream.
>> In a private mail Alexei Starovoitov disscussed with me about this. He said that
>> he is working on a shared object which can compile C program into BPF bytecode
>> on the fly. After he done his work, I think perf can support dtrace-like
>> profiling that, users will be able to feed source code to perf directly on
>> cmdline. He said he can release it on June. I added him to the CC-list.
>>
>> However I think the '.o' intermediate is still needed. [...]
> So how do you generate the .o? Why cannot the tool, if it sees that the filter
> parameter is eBPF source code, do that automatically?

I think compiling on the fly is our goal, and Alexei is working on it. 
However,
it looks like some limitations are still exist. For example, the BPF 
program may
require the definition of kernel structure for it to derefrernce struct 
pointers.
Therefore, the machine on which compiling occures should have kernel 
header and
configuration installed. However, most of production environment doesn't 
have
them for security and reasons.

I think BPF should be used to profile not only development environment 
but also
production systems. If we only concern development systems, why not 
recompiling
kernel with stub core or using kProbe modules?

What about this:

Give perf the ability to generate the '.o' file (to memory or dump to a 
real file)
on development environment. If we are willing to profiling that system, 
we can make
load the generated object into kernel and start trace by (perf record 
--event bpf_file.c ...).
If we are willing to profile systems on which the compiling environment 
is missing, we can
make perf to create a '.o' file (perf bpf compile bpf_file.c -o 
bpf_file.o) then copy
it onto target environment, use perf record --event bpf_file.o ... to 
filter events.

Thank you.

> I.e. you are making the user jump through hoops for no good reason - that's a bad
> UI and a bad workflow. Please don't do that!
>
> Thanks,
>
> 	Ingo



^ permalink raw reply	[flat|nested] 43+ messages in thread

* [EXPERIENCE] My experience on using perf record BPF filter on a real usecase
  2015-06-04  6:07   ` Wangnan (F)
  2015-06-04  7:21     ` Ingo Molnar
@ 2015-06-04 10:17     ` Wangnan (F)
  2015-06-10  6:42       ` Alexei Starovoitov
  1 sibling, 1 reply; 43+ messages in thread
From: Wangnan (F) @ 2015-06-04 10:17 UTC (permalink / raw)
  To: Ingo Molnar, Arnaldo Carvalho de Melo
  Cc: linux-kernel, Adrian Hunter, Alexei Starovoitov, Brendan Gregg,
	Daniel Borkmann, David Ahern, He Kuang, Jiri Olsa, Kaixu Xia,
	Madhavan Srinivasan, Masami Hiramatsu, Michael Ellerman,
	Namhyung Kim, Peter Zijlstra, pi3orama, Sukadev Bhattiprolu,
	Zefan Li, Arnaldo Carvalho de Melo, Alexei Starovoitov

Hi all,

I'd like to share my exprience on using 'perf record' BPF filter in a
real usecase to show the power and shortcome in my patch series:

  https://lkml.kernel.org/r/1433144296-74992-1-git-send-email-wangnan0@huawei.com

and other works on eBPF.

My usecase shows that such filter is useful. Also, I hope it can help us
to find way to further improve it.

My task is to find the reason why iozone test result is bad on some
specific cases. The development environment is a x86_64 server, the target
machine is a smartphone with Android. By previous analysis I have
already got some useful information:

  1. iozone computes bandwidth by averaging time of each sys_write.

  2. In our case, 1% sys_write takes 75% of total time, so what I need
     to do now should be finding the reason why those sys_write take so
     long.

  3. By sampling call stack on sched:sched_switch, I find that those
     sys_write calls lock_page() and blocks on it.

I decide to use BPF filter to find the other side of this locking
contention. The idea is simple:

  1. For all calls of lock_page(), probe at entry and exit points of
     it. Measure the execution time of the lock_page() call. If it takes
     too long (longer than 0.1 second) then there should have a lock
     contention. Take the sample at exit point.

  2. For all calls of unlock_page(), if the page is acquiring by other
     on at least 0.1 second before, take a sample at this point.

Currently making the above idea work is possible but not very
straightforward. One problem I can identify is:

     Doesn't like ftrace, there is no way for eBPF program to access call
     stack information. Without extra information, eBPF programs are
     unable to match lock_page events and corresponding lock_page%return
     events. Currently the only way for passing information between
     programs are maps. To simulate call stack matching, I create a
     BPF_FUNC_git_tid() which returns current->pid, and a
     proc_locking_page_map map which records the acquired page and time
     of calling lock_page.

     Another problem is: at the entry of lock_page() and
     unlock_page(), for fetching the page pointer I have to directly
     use 'ctx->regs[0]' (I am on aarch64). Which is not protable.

The final program I used is attached at the bottom of this email. It
costs more than 100 lines of code. I have to do some debugging to
ensure it works correctly on a virtual machine.

It is compiled using:

  # $CLANG ${INCLUDE} -D__KERNEL__ -Wno-unused-value -Wno-pointer-sign -O2 \
        -emit-llvm -c lock_page.c -o - | $LLC -march=bpf -filetype=obj -o  \
        lock_page.o

Then the lock_page.o is transfered onto target system.

I loaded it using following command:

  # perf record -e syscalls:sys_enter_write -e syscalls:sys_exit_write \
          -e lock_page.o -a iozone ...

Here is another inconvenience. Currently I only concern on write
syscall issued by iozone. However, without '-a' I'm unable to collect
information of the locker. If I want to filter sys_{enter,exit}_write
belong to iozone out using eBPF, I need to implement another function
like BPF_FUNC_git_comm. Another method is to use perf '--filter' after
the two events. However it looks strange to use two filter mechanisms
together. This time I choose to do filtering offline using perf script.

The result is resonable. Finaly I found the two side of lock contention.
It shows the way to improve. I'm sorry I can't share the call stack in
this list.

One inconvenience in this stage is: the information is
printed into ring buffer while the samples are stored into perf.data.
By analysing perf.data without ftrace ring buffer I don't know how long
the lock_page() cost becasue I don't sample at the entry of
lock_page().

The final part is the BPF program I used. I think there should have
better way to do it. If any know how to make it shorter please let me
know.

Thank you.

/* ------------- START OF BPF PROGRAM ------------- */
     /* __lock_page pass to unlock_page, key is pid */
     struct proc_locking_page {
         unsigned long page;
         unsigned long time;
     };

     struct bpf_map_def SEC("maps") proc_locking_page_map = {
         .type = BPF_MAP_TYPE_HASH,
         .key_size = sizeof(unsigned long),
         .value_size = sizeof(struct proc_locking_page),
         .max_entries = 1000000,
     };

     /* from page get pid */
     struct page_being_locked_by_proc {
         unsigned long tid;
         unsigned long time;
     };

     struct bpf_map_def SEC("maps") page_being_locked_by_proc_map = {
         .type = BPF_MAP_TYPE_HASH,
         .key_size = sizeof(unsigned long),
         .value_size = sizeof(struct page_being_locked_by_proc),
         .max_entries = 1000000,
     };

     SEC("lock_page=__lock_page")
     int lock_page_recorder(struct pt_regs *ctx)
     {
         unsigned long tid = bpf_get_tid();
         unsigned long page = ctx->regs[0];
         unsigned long curr_ns = bpf_ktime_get_ns();

         struct proc_locking_page locking_page;
         struct page_being_locked_by_proc being_locked;

         locking_page.page = page;
         locking_page.time = curr_ns;
         being_locked.tid = tid;
         being_locked.time = curr_ns;

         bpf_map_update_elem(&proc_locking_page_map, &tid, 
&locking_page, BPF_ANY);
         bpf_map_update_elem(&page_being_locked_by_proc_map, &page, 
&being_locked, BPF_ANY);
         return 0;
     }

     SEC("lock_page_ret=__lock_page%return")
     int lock_page_return_recorder(struct pt_regs *ctx)
     {
         unsigned long tid = bpf_get_tid();
         unsigned long curr_ns = bpf_ktime_get_ns();
         unsigned long page;
         unsigned long diff_time;
         struct proc_locking_page *p_locking_page;


         p_locking_page = bpf_map_lookup_elem(&proc_locking_page_map, &tid);


         /* BAD!! */
         if (!p_locking_page)
                 return 0;

         page = p_locking_page->page;
         diff_time = curr_ns - p_locking_page->time;
         bpf_map_delete_elem(&proc_locking_page_map, &tid);
         bpf_map_delete_elem(&page_being_locked_by_proc_map, &page);

         if (diff_time > 10000000) {
                 char fmt[] = "tid %d get page %lx using %d ns\n";
                 bpf_trace_printk(fmt, sizeof(fmt), tid, page, diff_time);
                 return 1;
         }

         return 0;
     }

     SEC("unlock_page=unlock_page")
     int unlock_page_recorder(struct pt_regs *ctx)
     {
         unsigned long tid = bpf_get_tid();
         unsigned long page = ctx->regs[0];
         unsigned long time = bpf_ktime_get_ns();
         unsigned long diff_time;
         struct page_being_locked_by_proc *p_being_locked;
         char fmt[] = "%d vs %d, %d ns\n";

         p_being_locked = 
bpf_map_lookup_elem(&page_being_locked_by_proc_map, &page);
         if (!p_being_locked)
                 return 0;
         diff_time = time - p_being_locked->time;
         if (diff_time > 10000000) {
                 bpf_trace_printk(fmt, sizeof(fmt), tid, 
p_being_locked->tid, diff_time);
                 return 1;
         }
         return 0;
     }
/* ------------- END OF BPF PROGRAM ------------- */


^ permalink raw reply	[flat|nested] 43+ messages in thread

* Re: [GIT PULL 0/6] perf/core improvements and fixes
  2015-06-04 10:00       ` Wangnan (F)
@ 2015-06-04 12:40         ` Ingo Molnar
  2015-06-04 12:58           ` pi3orama
  0 siblings, 1 reply; 43+ messages in thread
From: Ingo Molnar @ 2015-06-04 12:40 UTC (permalink / raw)
  To: Wangnan (F)
  Cc: Arnaldo Carvalho de Melo, linux-kernel, Adrian Hunter,
	Alexei Starovoitov, Brendan Gregg, Daniel Borkmann, David Ahern,
	He Kuang, Jiri Olsa, Kaixu Xia, Madhavan Srinivasan,
	Masami Hiramatsu, Michael Ellerman, Namhyung Kim, Peter Zijlstra,
	pi3orama, Sukadev Bhattiprolu, Zefan Li,
	Arnaldo Carvalho de Melo, Alexei Starovoitov


* Wangnan (F) <wangnan0@huawei.com> wrote:

> > So how do you generate the .o? Why cannot the tool, if it sees that the filter 
> > parameter is eBPF source code, do that automatically?
> 
> I think compiling on the fly is our goal, and Alexei is working on it.

So what exact command line are you using to create the .o?

What exactly should users type to create a simple eBPF filter profile?

Thanks,

	Ingo

^ permalink raw reply	[flat|nested] 43+ messages in thread

* Re: [GIT PULL 0/6] perf/core improvements and fixes
  2015-06-04 12:40         ` Ingo Molnar
@ 2015-06-04 12:58           ` pi3orama
  2015-06-04 14:04             ` Ingo Molnar
  0 siblings, 1 reply; 43+ messages in thread
From: pi3orama @ 2015-06-04 12:58 UTC (permalink / raw)
  To: Ingo Molnar
  Cc: Wangnan (F),
	Arnaldo Carvalho de Melo, linux-kernel, Adrian Hunter,
	Alexei Starovoitov, Brendan Gregg, Daniel Borkmann, David Ahern,
	He Kuang, Jiri Olsa, Kaixu Xia, Madhavan Srinivasan,
	Masami Hiramatsu, Michael Ellerman, Namhyung Kim, Peter Zijlstra,
	Sukadev Bhattiprolu, Zefan Li, Arnaldo Carvalho de Melo,
	Alexei Starovoitov



发自我的 iPhone

> 在 2015年6月4日,下午8:40,Ingo Molnar <mingo@kernel.org> 写道:
> 
> 
> * Wangnan (F) <wangnan0@huawei.com> wrote:
> 
>>> So how do you generate the .o? Why cannot the tool, if it sees that the filter 
>>> parameter is eBPF source code, do that automatically?
>> 
>> I think compiling on the fly is our goal, and Alexei is working on it.
> 
> So what exact command line are you using to create the .o?
> 
> What exactly should users type to create a simple eBPF filter profile?
> 

I have mentioned in previous mail:

Use

 # perf record -e bpf_source.c cmdline 

 to create a eBPF filter from source,

Use

# perf record -e bpf_object.o cmdline 

to create a eBPF filter from object intermedia.

Use

# perf bpf compile bpf_source.c --kbuild=kernel-build-dir -o bpf_object.o

to create the .o

I think this should be enough. Currently only the second case has been implemented.

Thanks.

> Thanks,
> 
>    Ingo


^ permalink raw reply	[flat|nested] 43+ messages in thread

* Re: [GIT PULL 0/6] perf/core improvements and fixes
  2015-06-04 12:58           ` pi3orama
@ 2015-06-04 14:04             ` Ingo Molnar
  2015-06-04 16:22               ` Alexei Starovoitov
  0 siblings, 1 reply; 43+ messages in thread
From: Ingo Molnar @ 2015-06-04 14:04 UTC (permalink / raw)
  To: pi3orama
  Cc: Wangnan (F),
	Arnaldo Carvalho de Melo, linux-kernel, Adrian Hunter,
	Alexei Starovoitov, Brendan Gregg, Daniel Borkmann, David Ahern,
	He Kuang, Jiri Olsa, Kaixu Xia, Madhavan Srinivasan,
	Masami Hiramatsu, Michael Ellerman, Namhyung Kim, Peter Zijlstra,
	Sukadev Bhattiprolu, Zefan Li, Arnaldo Carvalho de Melo,
	Alexei Starovoitov


* pi3orama <pi3orama@163.com> wrote:

> 
> 
> 发自我的 iPhone
> 
> > 在 2015年6月4日,下午8:40,Ingo Molnar <mingo@kernel.org> 写道:
> > 
> > 
> > * Wangnan (F) <wangnan0@huawei.com> wrote:
> > 
> >>> So how do you generate the .o? Why cannot the tool, if it sees that the filter 
> >>> parameter is eBPF source code, do that automatically?
> >> 
> >> I think compiling on the fly is our goal, and Alexei is working on it.
> > 
> > So what exact command line are you using to create the .o?
> > 
> > What exactly should users type to create a simple eBPF filter profile?
> 
> I have mentioned in previous mail:
> 
> Use
> 
>  # perf record -e bpf_source.c cmdline 
> 
>  to create a eBPF filter from source,
> 
> Use
> 
> # perf record -e bpf_object.o cmdline 
> 
> to create a eBPF filter from object intermedia.
> 
> Use
> 
> # perf bpf compile bpf_source.c --kbuild=kernel-build-dir -o bpf_object.o
> 
> to create the .o
> 
> I think this should be enough. Currently only the second case has been implemented.

So if users cannot actually generate .o files then it's premature to merge this in 
such an incomplete form!

It should be possible to use a feature that we are merging.

Thanks,

	Ingo

^ permalink raw reply	[flat|nested] 43+ messages in thread

* Re: [GIT PULL 0/6] perf/core improvements and fixes
  2015-06-04 14:04             ` Ingo Molnar
@ 2015-06-04 16:22               ` Alexei Starovoitov
  2015-06-04 21:48                 ` Masami Hiramatsu
  2015-06-05  6:41                 ` Ingo Molnar
  0 siblings, 2 replies; 43+ messages in thread
From: Alexei Starovoitov @ 2015-06-04 16:22 UTC (permalink / raw)
  To: Ingo Molnar, pi3orama
  Cc: Wangnan (F),
	Arnaldo Carvalho de Melo, linux-kernel, Adrian Hunter,
	Brendan Gregg, Daniel Borkmann, David Ahern, He Kuang, Jiri Olsa,
	Kaixu Xia, Madhavan Srinivasan, Masami Hiramatsu,
	Michael Ellerman, Namhyung Kim, Peter Zijlstra,
	Sukadev Bhattiprolu, Zefan Li, Arnaldo Carvalho de Melo,
	Alexei Starovoitov

On 6/4/15 7:04 AM, Ingo Molnar wrote:
>> >  # perf record -e bpf_source.c cmdline
>> >
>> >  to create a eBPF filter from source,
>> >
>> >Use
>> >
>> ># perf record -e bpf_object.o cmdline
>> >
>> >to create a eBPF filter from object intermedia.
>> >
>> >Use
>> >
>> ># perf bpf compile bpf_source.c --kbuild=kernel-build-dir -o bpf_object.o
>> >
>> >to create the .o
>> >
>> >I think this should be enough. Currently only the second case has been implemented.
> So if users cannot actually generate .o files then it's premature to merge this in
> such an incomplete form!
>
> It should be possible to use a feature that we are merging.

of course it's usable :) There is some confusion here.
To compile .c into .o one can easily use
clang -O2 -emit-llvm -c file.c -o - | llc -march=bpf -o file.o
any version of clang is ok,
llc needs to be fresh with bpf backend.

For a lot of cases kernel headers are not needed, so above
will work fine.
For our TC examples we recommend to use 'bcc' alias:
bcc() {
   clang -O2 -emit-llvm -c $1 -o - | llc -march=bpf -filetype=obj -o 
"`basename $1 .c`.o"
}
then compiling as easy as 'bcc file.c'

What Wang mentioned that we're working on is fully integrated 'bcc'.
It will use clang/llvm as libraries, so no intermediate steps will
be needed, but some folks will always have concerns about
ultra-embedded environments where even 20Mb of libllvm.so is too much.

So I think we need to support both 'perf record -e file.[co]'


^ permalink raw reply	[flat|nested] 43+ messages in thread

* Re: [GIT PULL 0/6] perf/core improvements and fixes
  2015-06-04 16:22               ` Alexei Starovoitov
@ 2015-06-04 21:48                 ` Masami Hiramatsu
  2015-06-04 22:07                   ` Alexei Starovoitov
  2015-06-05  6:41                 ` Ingo Molnar
  1 sibling, 1 reply; 43+ messages in thread
From: Masami Hiramatsu @ 2015-06-04 21:48 UTC (permalink / raw)
  To: Alexei Starovoitov, Ingo Molnar, pi3orama
  Cc: Wangnan (F),
	Arnaldo Carvalho de Melo, linux-kernel, Adrian Hunter,
	Brendan Gregg, Daniel Borkmann, David Ahern, He Kuang, Jiri Olsa,
	Kaixu Xia, Madhavan Srinivasan, Michael Ellerman, Namhyung Kim,
	Peter Zijlstra, Sukadev Bhattiprolu, Zefan Li,
	Arnaldo Carvalho de Melo, Alexei Starovoitov,
	cti.systems-productivity-manager.ts

On 2015/06/05 1:22, Alexei Starovoitov wrote:
> On 6/4/15 7:04 AM, Ingo Molnar wrote:
>>>>  # perf record -e bpf_source.c cmdline
>>>>
>>>>  to create a eBPF filter from source,
>>>>
>>>> Use
>>>>
>>>> # perf record -e bpf_object.o cmdline
>>>>
>>>> to create a eBPF filter from object intermedia.
>>>>
>>>> Use
>>>>
>>>> # perf bpf compile bpf_source.c --kbuild=kernel-build-dir -o bpf_object.o
>>>>
>>>> to create the .o
>>>>
>>>> I think this should be enough. Currently only the second case has been implemented.
>> So if users cannot actually generate .o files then it's premature to merge this in
>> such an incomplete form!
>>
>> It should be possible to use a feature that we are merging.
> 
> of course it's usable :) There is some confusion here.
> To compile .c into .o one can easily use
> clang -O2 -emit-llvm -c file.c -o - | llc -march=bpf -o file.o
> any version of clang is ok,
> llc needs to be fresh with bpf backend.
> 
> For a lot of cases kernel headers are not needed, so above
> will work fine.
> For our TC examples we recommend to use 'bcc' alias:
> bcc() {
>    clang -O2 -emit-llvm -c $1 -o - | llc -march=bpf -filetype=obj -o 
> "`basename $1 .c`.o"
> }
> then compiling as easy as 'bcc file.c'
> 
> What Wang mentioned that we're working on is fully integrated 'bcc'.
> It will use clang/llvm as libraries, so no intermediate steps will
> be needed, but some folks will always have concerns about
> ultra-embedded environments where even 20Mb of libllvm.so is too much.
> 
> So I think we need to support both 'perf record -e file.[co]'

I think we'd better make 'perf record -e file.c' default and '-e file.o'
should be an option.

Thank you,

> 
> 


-- 
Masami HIRAMATSU
Linux Technology Research Center, System Productivity Research Dept.
Center for Technology Innovation - Systems Engineering
Hitachi, Ltd., Research & Development Group
E-mail: masami.hiramatsu.pt@hitachi.com

^ permalink raw reply	[flat|nested] 43+ messages in thread

* Re: [GIT PULL 0/6] perf/core improvements and fixes
  2015-06-04 21:48                 ` Masami Hiramatsu
@ 2015-06-04 22:07                   ` Alexei Starovoitov
  0 siblings, 0 replies; 43+ messages in thread
From: Alexei Starovoitov @ 2015-06-04 22:07 UTC (permalink / raw)
  To: Masami Hiramatsu, Ingo Molnar, pi3orama
  Cc: Wangnan (F),
	Arnaldo Carvalho de Melo, linux-kernel, Adrian Hunter,
	Brendan Gregg, Daniel Borkmann, David Ahern, He Kuang, Jiri Olsa,
	Kaixu Xia, Madhavan Srinivasan, Michael Ellerman, Namhyung Kim,
	Peter Zijlstra, Sukadev Bhattiprolu, Zefan Li,
	Arnaldo Carvalho de Melo, Alexei Starovoitov,
	cti.systems-productivity-manager.ts

On 6/4/15 2:48 PM, Masami Hiramatsu wrote:
> On 2015/06/05 1:22, Alexei Starovoitov wrote:
>> On 6/4/15 7:04 AM, Ingo Molnar wrote:
>>>>>   # perf record -e bpf_source.c cmdline
>>>>>
>>>>>   to create a eBPF filter from source,
>>>>>
>>>>> Use
>>>>>
>>>>> # perf record -e bpf_object.o cmdline
>>>>>
>>>>> to create a eBPF filter from object intermedia.
>>>>>
>>>>> Use
>>>>>
>>>>> # perf bpf compile bpf_source.c --kbuild=kernel-build-dir -o bpf_object.o
>>>>>
>>>>> to create the .o
>>>>>
>>>>> I think this should be enough. Currently only the second case has been implemented.
>>> So if users cannot actually generate .o files then it's premature to merge this in
>>> such an incomplete form!
>>>
>>> It should be possible to use a feature that we are merging.
>>
>> of course it's usable :) There is some confusion here.
>> To compile .c into .o one can easily use
>> clang -O2 -emit-llvm -c file.c -o - | llc -march=bpf -o file.o
>> any version of clang is ok,
>> llc needs to be fresh with bpf backend.
>>
>> For a lot of cases kernel headers are not needed, so above
>> will work fine.
>> For our TC examples we recommend to use 'bcc' alias:
>> bcc() {
>>     clang -O2 -emit-llvm -c $1 -o - | llc -march=bpf -filetype=obj -o
>> "`basename $1 .c`.o"
>> }
>> then compiling as easy as 'bcc file.c'
>>
>> What Wang mentioned that we're working on is fully integrated 'bcc'.
>> It will use clang/llvm as libraries, so no intermediate steps will
>> be needed, but some folks will always have concerns about
>> ultra-embedded environments where even 20Mb of libllvm.so is too much.
>>
>> So I think we need to support both 'perf record -e file.[co]'
>
> I think we'd better make 'perf record -e file.c' default and '-e file.o'
> should be an option.

what do you mean 'default' ? It's a command line :)
.c is easier to use of course, no question.


^ permalink raw reply	[flat|nested] 43+ messages in thread

* Re: [GIT PULL 0/6] perf/core improvements and fixes
  2015-06-04 16:22               ` Alexei Starovoitov
  2015-06-04 21:48                 ` Masami Hiramatsu
@ 2015-06-05  6:41                 ` Ingo Molnar
  2015-06-05  8:53                   ` Wangnan (F)
  1 sibling, 1 reply; 43+ messages in thread
From: Ingo Molnar @ 2015-06-05  6:41 UTC (permalink / raw)
  To: Alexei Starovoitov
  Cc: pi3orama, Wangnan (F),
	Arnaldo Carvalho de Melo, linux-kernel, Adrian Hunter,
	Brendan Gregg, Daniel Borkmann, David Ahern, He Kuang, Jiri Olsa,
	Kaixu Xia, Madhavan Srinivasan, Masami Hiramatsu,
	Michael Ellerman, Namhyung Kim, Peter Zijlstra,
	Sukadev Bhattiprolu, Zefan Li, Arnaldo Carvalho de Melo,
	Alexei Starovoitov


* Alexei Starovoitov <ast@plumgrid.com> wrote:

> On 6/4/15 7:04 AM, Ingo Molnar wrote:
> >>>  # perf record -e bpf_source.c cmdline
> >>>
> >>>  to create a eBPF filter from source,
> >>>
> >>>Use
> >>>
> >>># perf record -e bpf_object.o cmdline
> >>>
> >>>to create a eBPF filter from object intermedia.
> >>>
> >>>Use
> >>>
> >>># perf bpf compile bpf_source.c --kbuild=kernel-build-dir -o bpf_object.o
> >>>
> >>>to create the .o
> >>>
> >>>I think this should be enough. Currently only the second case has been implemented.
> >
> > So if users cannot actually generate .o files then it's premature to merge 
> > this in such an incomplete form!
> >
> > It should be possible to use a feature that we are merging.
> 
> of course it's usable :) There is some confusion here.
> To compile .c into .o one can easily use
> clang -O2 -emit-llvm -c file.c -o - | llc -march=bpf -o file.o

There's no confusion here: you guys are trying to sell me what at this stage is 
incomplete and hard to use, and I'm resisting it as I should! :-)

We also have different definitions of 'easily'. It might be 'easy' to type:
 
    clang -O2 -emit-llvm -c file.c -o - | llc -march=bpf -o file.o

... for some tooling developer intimate with eBPF, but to the first time user who 
found an interesting looking eBPF scriptlet on the net or in the documentation and 
wants to try his luck? It's absolutely non-obvious!

The current usage to get a _minimal_ eBPF script running is non-obvious and 
obscure to the level of being a show stopper.

I don't understand why you guys are even wasting time arguing about it: it's not 
that hard to auto-build from source code. It's one of the basic features of 
tooling. If you ever built perf you'll know that typing 'make install' will type 
in all those quirky build lines automatically for you, without requiring you to 
perform any other step, no matter how trivial.

Doubly annoying, you seem to have the UI principles wrong, you seem to think that 
a .o is a proper user interface. It absolutely is _not_ okay.

The Linux kernel project and as an extension the perf project deals with source 
code, and I'm 100% suspicious of approaches that somehow think that .o objects are 
the right UI for _anything_ except temporary files that sometimes show up in 
object directories...

Fix the 'newbie user' UI flow as a _first_ priority, not as a second thought!

Every single quirky line or nonsensical option you require a first time user to 
type halves the number of new users we'll get. You need to understand why dtrace 
is so popular:

   - it's bloody easy to use

   - it's a safe environment you can deploy in critical environments

   - it's flexible

   - instrumentation hacks are very easy to share

eBPF based scripting got 3 out of those 4 right, but please don't forget item 1 
either, because without that we have nothing but a bunch of unusable functionality 
in the kernel and in tooling that benefits only very few people. Okay?

> So I think we need to support both 'perf record -e file.[co]'

Why do you even need to ask? Of course!

Think through how users will meet eBPF scripts and how they will interact with 
them:

  - they'll see or download an eBPF scriptlet somewhere and will have a .c file.

  - ideally there will be built-in eBPF scriptlets just like we have tracing 
    plugins, and there's a good UI to query them and see their description and 
    source code.

  - then they will want to use it all with the minimum amount of fuss

  - they don't care how the eBPF scriptlet gets to the kernel: whether the kernel 
    can read and build the .c files, or whether there's some user tooling that
    turns it into bytecode. Most humans don't read bytecode!

  - they will absolutely not download random .o's and we should not encourage that
    in any case - these things should be source code based.

These things compile in an eye blink, there's very little reason to ever deal with 
a .o, except some weird and rare usecases...

In fact I'm NAK-ing the whole .o based interface until the .c interface is made 
the _primary_ one and works well and until I see that you have thought through 
basic usability questions...

Thanks,

	Ingo

^ permalink raw reply	[flat|nested] 43+ messages in thread

* Re: [GIT PULL 0/6] perf/core improvements and fixes
  2015-06-05  6:41                 ` Ingo Molnar
@ 2015-06-05  8:53                   ` Wangnan (F)
  2015-06-05 12:05                     ` Ingo Molnar
  2015-06-05 13:59                     ` Arnaldo Carvalho de Melo
  0 siblings, 2 replies; 43+ messages in thread
From: Wangnan (F) @ 2015-06-05  8:53 UTC (permalink / raw)
  To: Ingo Molnar, Alexei Starovoitov
  Cc: pi3orama, Arnaldo Carvalho de Melo, linux-kernel, Adrian Hunter,
	Brendan Gregg, Daniel Borkmann, David Ahern, He Kuang, Jiri Olsa,
	Kaixu Xia, Madhavan Srinivasan, Masami Hiramatsu,
	Michael Ellerman, Namhyung Kim, Peter Zijlstra,
	Sukadev Bhattiprolu, Zefan Li, Arnaldo Carvalho de Melo,
	Alexei Starovoitov



On 2015/6/5 14:41, Ingo Molnar wrote:
> * Alexei Starovoitov <ast@plumgrid.com> wrote:
>
>> On 6/4/15 7:04 AM, Ingo Molnar wrote:
>>>>>   # perf record -e bpf_source.c cmdline
>>>>>
>>>>>   to create a eBPF filter from source,
>>>>>
>>>>> Use
>>>>>
>>>>> # perf record -e bpf_object.o cmdline
>>>>>
>>>>> to create a eBPF filter from object intermedia.
>>>>>
>>>>> Use
>>>>>
>>>>> # perf bpf compile bpf_source.c --kbuild=kernel-build-dir -o bpf_object.o
>>>>>
>>>>> to create the .o
>>>>>
>>>>> I think this should be enough. Currently only the second case has been implemented.
>>> So if users cannot actually generate .o files then it's premature to merge
>>> this in such an incomplete form!
>>>
>>> It should be possible to use a feature that we are merging.
>> of course it's usable :) There is some confusion here.
>> To compile .c into .o one can easily use
>> clang -O2 -emit-llvm -c file.c -o - | llc -march=bpf -o file.o
> There's no confusion here: you guys are trying to sell me what at this stage is
> incomplete and hard to use, and I'm resisting it as I should! :-)
>
> We also have different definitions of 'easily'. It might be 'easy' to type:
>   
>      clang -O2 -emit-llvm -c file.c -o - | llc -march=bpf -o file.o
>
> ... for some tooling developer intimate with eBPF, but to the first time user who
> found an interesting looking eBPF scriptlet on the net or in the documentation and
> wants to try his luck? It's absolutely non-obvious!
>
> The current usage to get a _minimal_ eBPF script running is non-obvious and
> obscure to the level of being a show stopper.
>
> I don't understand why you guys are even wasting time arguing about it: it's not
> that hard to auto-build from source code. It's one of the basic features of
> tooling. If you ever built perf you'll know that typing 'make install' will type
> in all those quirky build lines automatically for you, without requiring you to
> perform any other step, no matter how trivial.
>
> Doubly annoying, you seem to have the UI principles wrong, you seem to think that
> a .o is a proper user interface. It absolutely is _not_ okay.
>
> The Linux kernel project and as an extension the perf project deals with source
> code, and I'm 100% suspicious of approaches that somehow think that .o objects are
> the right UI for _anything_ except temporary files that sometimes show up in
> object directories...
>
> Fix the 'newbie user' UI flow as a _first_ priority, not as a second thought!
>
> Every single quirky line or nonsensical option you require a first time user to
> type halves the number of new users we'll get. You need to understand why dtrace
> is so popular:
>
>     - it's bloody easy to use
>
>     - it's a safe environment you can deploy in critical environments
>
>     - it's flexible
>
>     - instrumentation hacks are very easy to share
>
> eBPF based scripting got 3 out of those 4 right, but please don't forget item 1
> either, because without that we have nothing but a bunch of unusable functionality
> in the kernel and in tooling that benefits only very few people. Okay?
>
>> So I think we need to support both 'perf record -e file.[co]'
> Why do you even need to ask? Of course!
>
> Think through how users will meet eBPF scripts and how they will interact with
> them:
>
>    - they'll see or download an eBPF scriptlet somewhere and will have a .c file.
>
>    - ideally there will be built-in eBPF scriptlets just like we have tracing
>      plugins, and there's a good UI to query them and see their description and
>      source code.
>
>    - then they will want to use it all with the minimum amount of fuss
>
>    - they don't care how the eBPF scriptlet gets to the kernel: whether the kernel
>      can read and build the .c files, or whether there's some user tooling that
>      turns it into bytecode. Most humans don't read bytecode!
>
>    - they will absolutely not download random .o's and we should not encourage that
>      in any case - these things should be source code based.
>
> These things compile in an eye blink, there's very little reason to ever deal with
> a .o, except some weird and rare usecases...
>
> In fact I'm NAK-ing the whole .o based interface until the .c interface is made
> the _primary_ one and works well and until I see that you have thought through
> basic usability questions...

OK. Let's start making a nice UI.

At this stage, what about wrapping current clang and llc workflow into perf,
let it call them to compile '.c' scripts? This is the way 'perf 
annotate' using
objdump. I can do this job, but firstly I'd like to know people's opinion on
it, and the value of the wrapper if Alexei Starovoitov's dynamic compiler
shared object is coming.

Following functions will be added:

  - perf searches clang and llc under current $PATH,

  - Users are allowed to pass the position of those programs,
    if perf failed to find the automatically,

  - Users are allowed to pass extra compiling options to clang and llc, like
    include directories,

  - 'perf record' automatically calls them if a '.c' is passed using 
'--event'.

  - 'perf bpf compile' command will be added to compile a '.c' find into 
'.o',

Further, basic header files should be shipped with kernel headers.

User interface update:

  - --llc, --clang, --llc-opt and --clang-opt option will be added to 
'perf record'
    to indicate the position and extra options to them. Ideally they can 
be leave
    blank.

  - 'perf bpf compile' will be added.

One problem I can find is that, the wrapper will make perf depend on 
llvm. I don't
think the compiler will be deployed in production environments... And 
also, the
embedded case...

Any suggestion? Do you think the above idea is on the right way?

Thanks.

> Thanks,
>
> 	Ingo



^ permalink raw reply	[flat|nested] 43+ messages in thread

* Re: [GIT PULL 0/6] perf/core improvements and fixes
  2015-06-05  8:53                   ` Wangnan (F)
@ 2015-06-05 12:05                     ` Ingo Molnar
  2015-06-05 14:06                       ` Arnaldo Carvalho de Melo
  2015-06-05 13:59                     ` Arnaldo Carvalho de Melo
  1 sibling, 1 reply; 43+ messages in thread
From: Ingo Molnar @ 2015-06-05 12:05 UTC (permalink / raw)
  To: Wangnan (F)
  Cc: Alexei Starovoitov, pi3orama, Arnaldo Carvalho de Melo,
	linux-kernel, Adrian Hunter, Brendan Gregg, Daniel Borkmann,
	David Ahern, He Kuang, Jiri Olsa, Kaixu Xia, Madhavan Srinivasan,
	Masami Hiramatsu, Michael Ellerman, Namhyung Kim, Peter Zijlstra,
	Sukadev Bhattiprolu, Zefan Li, Arnaldo Carvalho de Melo,
	Alexei Starovoitov


* Wangnan (F) <wangnan0@huawei.com> wrote:

> OK. Let's start making a nice UI.

Thanks!

> [...]
>
> One problem I can find is that, the wrapper will make perf depend on llvm. I 
> don't think the compiler will be deployed in production environments... And 
> also, the embedded case...

What dependencies are there?

On the usage side there should be very few outright dependencies: if the llvm 
binary is not available, or doesn't support what you need, or there's no runtime 
environment you can use to build the bytecode, you should display an informative 
error message so that the user knows what is missing and how to install it.

Thanks,

	Ingo

^ permalink raw reply	[flat|nested] 43+ messages in thread

* Re: [GIT PULL 0/6] perf/core improvements and fixes
  2015-06-05  8:53                   ` Wangnan (F)
  2015-06-05 12:05                     ` Ingo Molnar
@ 2015-06-05 13:59                     ` Arnaldo Carvalho de Melo
  1 sibling, 0 replies; 43+ messages in thread
From: Arnaldo Carvalho de Melo @ 2015-06-05 13:59 UTC (permalink / raw)
  To: Wangnan (F)
  Cc: Ingo Molnar, Alexei Starovoitov, pi3orama, linux-kernel,
	Adrian Hunter, Brendan Gregg, Daniel Borkmann, David Ahern,
	He Kuang, Jiri Olsa, Kaixu Xia, Madhavan Srinivasan,
	Masami Hiramatsu, Michael Ellerman, Namhyung Kim, Peter Zijlstra,
	Sukadev Bhattiprolu, Zefan Li, Alexei Starovoitov

Em Fri, Jun 05, 2015 at 04:53:03PM +0800, Wangnan (F) escreveu:
> On 2015/6/5 14:41, Ingo Molnar wrote:
> >* Alexei Starovoitov <ast@plumgrid.com> wrote:
> >>On 6/4/15 7:04 AM, Ingo Molnar wrote:
> >In fact I'm NAK-ing the whole .o based interface until the .c interface is made
> >the _primary_ one and works well and until I see that you have thought through
> >basic usability questions...
 
> OK. Let's start making a nice UI.
 
> At this stage, what about wrapping current clang and llc workflow into perf,
> let it call them to compile '.c' scripts? This is the way 'perf annotate'
> using
> objdump. I can do this job, but firstly I'd like to know people's opinion on

Right, no need for, at a first step, to save this into a cache, or use
libraries, etc, just automate the bpf.c into bpf.o, load it and use it
as an event.

> it, and the value of the wrapper if Alexei Starovoitov's dynamic compiler
> shared object is coming.

No need to wait for that, when it comes we can use it, but Ingo
established as the door for this to be accepted is that we could use:

  perf record -e foo.c usleep

Right?
 
> Following functions will be added:
> 
>  - perf searches clang and llc under current $PATH,

Good for a first step
 
>  - Users are allowed to pass the position of those programs,
>    if perf failed to find the automatically,

I would leave all this configurabilty for later, stating that it has to
be in the PATH should be ok for a first step.
 
>  - Users are allowed to pass extra compiling options to clang and llc, like
>    include directories,

For later too?
 
>  - 'perf record' automatically calls them if a '.c' is passed using
> '--event'.

Right.
 
>  - 'perf bpf compile' command will be added to compile a '.c' find into
> '.o',

for later? I.e. 'perf record -e foo.c usleep' could start by generating
the foo.o and not deleting it, so:

   perf record -e foo.c usleep

Followed by:

   perf record -e foo.o usleep

Would work, the later would be like a quick hack so that we could have
access to a pre-compiled foo.o quickly, at this introductory stage.

> Further, basic header files should be shipped with kernel headers.
> 
> User interface update:
> 
>  - --llc, --clang, --llc-opt and --clang-opt option will be added to 'perf
> record'
>    to indicate the position and extra options to them. Ideally they can be
> leave
>    blank.

Couldn't this be left to a section in a .perfconfig file to avoid having
so many command line options? We could have one of those files per
"project", after we add a --config option to 'perf record', to override
whatever it finds in the config file search it already does, i.e.:

[clang]

  path = /a/b/clang
  opt = -a -b -c -d

[llc]

  path = /d/e/llc
  opt = -r -t -y -u

But even this can be left for a second step.
 
>  - 'perf bpf compile' will be added.

> One problem I can find is that, the wrapper will make perf depend on
> llvm. I don't think the compiler will be deployed in production
> environments... And also, the embedded case...

Well, we can always build a subset of perf, using the command line
options to disable certain features.
 
- Arnaldo

^ permalink raw reply	[flat|nested] 43+ messages in thread

* Re: [GIT PULL 0/6] perf/core improvements and fixes
  2015-06-05 12:05                     ` Ingo Molnar
@ 2015-06-05 14:06                       ` Arnaldo Carvalho de Melo
  2015-06-07 13:11                         ` Ingo Molnar
  0 siblings, 1 reply; 43+ messages in thread
From: Arnaldo Carvalho de Melo @ 2015-06-05 14:06 UTC (permalink / raw)
  To: Ingo Molnar
  Cc: Wangnan (F),
	Alexei Starovoitov, pi3orama, linux-kernel, Adrian Hunter,
	Brendan Gregg, Daniel Borkmann, David Ahern, He Kuang, Jiri Olsa,
	Kaixu Xia, Madhavan Srinivasan, Masami Hiramatsu,
	Michael Ellerman, Namhyung Kim, Peter Zijlstra,
	Sukadev Bhattiprolu, Zefan Li, Alexei Starovoitov

Em Fri, Jun 05, 2015 at 02:05:50PM +0200, Ingo Molnar escreveu:
> * Wangnan (F) <wangnan0@huawei.com> wrote:
<SNIP>
> > One problem I can find is that, the wrapper will make perf depend on llvm. I 
> > don't think the compiler will be deployed in production environments... And 
> > also, the embedded case...
 
> What dependencies are there?
 
> On the usage side there should be very few outright dependencies: if the llvm 
> binary is not available, or doesn't support what you need, or there's no runtime 
> environment you can use to build the bytecode, you should display an informative 
> error message so that the user knows what is missing and how to install it.

Right, something like:

[acme@zoo ~]$ perf trace -e nanosleep usleep 1
Error:	No permissions to read /sys/kernel/debug/tracing/events/raw_syscalls/sys_(enter|exit)
Hint:	Try 'sudo mount -o remount,mode=755 /sys/kernel/debug'
[acme@zoo ~]$ sudo mount -o remount,mode=755 /sys/kernel/debug
[sudo] password for acme: 
[acme@zoo ~]$ perf trace -e nanosleep usleep 1
     0.565 ( 0.060 ms): usleep/17648 nanosleep(rqtp: 0x7fff22baebf0) = 0
[acme@zoo ~]$ perf trace --all-cpus
Error:	Operation not permitted.
Hint:	Check /proc/sys/kernel/perf_event_paranoid setting.
Hint:	For system wide tracing it needs to be set to -1.
Hint:	Try: 'sudo sh -c "echo -1 > /proc/sys/kernel/perf_event_paranoid"'
Hint:	The current value is 1.
[acme@zoo ~]$ 
[acme@zoo ~]$ trace -a -e poll usleep 1
[acme@zoo ~]$ trace -a -e poll usleep 1
 0.041 ( 0.000 ms): firefox/1458  ... [continued]: poll()) = 1
 0.267 ( 0.003 ms): firefox/1458 poll(ufds: 0x7f43d6ea1340, nfds: 5) = 0 Timeout
 0.275 ( 0.001 ms): firefox/1458 poll(ufds: 0x7f43d6ea1340, nfds: 5) = 0 Timeout
 0.283 ( 0.001 ms): firefox/1458 poll(ufds: 0x7f43d6ea1340, nfds: 5) = 0 Timeout
 0.979 ( 0.000 ms): gnome-terminal/2572  ... [continued]: poll()) = 1
 1.056 ( 0.768 ms): firefox/1458 poll(ufds: 0x7f43d6ea1340, nfds: 5, timeout_msecs: 4294967295) ...
 1.065 ( 0.009 ms): gnome-terminal/2572 poll(ufds: 0x1934250, nfds: 23, timeout_msecs: 10) = 1
 1.087 ( 0.007 ms): gnome-terminal/2572 poll(ufds: 0x1934250, nfds: 23, timeout_msecs: 10) = 2
 1.132 ( 0.007 ms): gnome-terminal/2572 poll(ufds: 0x1934250, nfds: 23, timeout_msecs: 10) = 1
 1.161 ( 0.013 ms): gnome-terminal/2572 poll(ufds: 0x1934250, nfds: 23, timeout_msecs: 10) = 1
[acme@zoo ~]$

I.e. Explain the mistake and provide a hint to solve it, as close to the actual
commands needed to perform such corrective/enabling action as possible.

- Arnaldo

^ permalink raw reply	[flat|nested] 43+ messages in thread

* Re: [GIT PULL 0/6] perf/core improvements and fixes
  2015-06-05 14:06                       ` Arnaldo Carvalho de Melo
@ 2015-06-07 13:11                         ` Ingo Molnar
  0 siblings, 0 replies; 43+ messages in thread
From: Ingo Molnar @ 2015-06-07 13:11 UTC (permalink / raw)
  To: Arnaldo Carvalho de Melo
  Cc: Wangnan (F),
	Alexei Starovoitov, pi3orama, linux-kernel, Adrian Hunter,
	Brendan Gregg, Daniel Borkmann, David Ahern, He Kuang, Jiri Olsa,
	Kaixu Xia, Madhavan Srinivasan, Masami Hiramatsu,
	Michael Ellerman, Namhyung Kim, Peter Zijlstra,
	Sukadev Bhattiprolu, Zefan Li, Alexei Starovoitov


* Arnaldo Carvalho de Melo <acme@kernel.org> wrote:

> Em Fri, Jun 05, 2015 at 02:05:50PM +0200, Ingo Molnar escreveu:
> > * Wangnan (F) <wangnan0@huawei.com> wrote:
> <SNIP>
> > > One problem I can find is that, the wrapper will make perf depend on llvm. I 
> > > don't think the compiler will be deployed in production environments... And 
> > > also, the embedded case...
>  
> > What dependencies are there?
>  
> > On the usage side there should be very few outright dependencies: if the llvm 
> > binary is not available, or doesn't support what you need, or there's no runtime 
> > environment you can use to build the bytecode, you should display an informative 
> > error message so that the user knows what is missing and how to install it.
> 
> Right, something like:
> 
> [acme@zoo ~]$ perf trace -e nanosleep usleep 1
> Error:	No permissions to read /sys/kernel/debug/tracing/events/raw_syscalls/sys_(enter|exit)
> Hint:	Try 'sudo mount -o remount,mode=755 /sys/kernel/debug'

> [acme@zoo ~]$ sudo mount -o remount,mode=755 /sys/kernel/debug
> [sudo] password for acme: 
> [acme@zoo ~]$ perf trace -e nanosleep usleep 1
>      0.565 ( 0.060 ms): usleep/17648 nanosleep(rqtp: 0x7fff22baebf0) = 0
> [acme@zoo ~]$ perf trace --all-cpus
> Error:	Operation not permitted.
> Hint:	Check /proc/sys/kernel/perf_event_paranoid setting.
> Hint:	For system wide tracing it needs to be set to -1.
> Hint:	Try: 'sudo sh -c "echo -1 > /proc/sys/kernel/perf_event_paranoid"'
> Hint:	The current value is 1.
> [acme@zoo ~]$ 
> [acme@zoo ~]$ trace -a -e poll usleep 1
> [acme@zoo ~]$ trace -a -e poll usleep 1
>  0.041 ( 0.000 ms): firefox/1458  ... [continued]: poll()) = 1
>  0.267 ( 0.003 ms): firefox/1458 poll(ufds: 0x7f43d6ea1340, nfds: 5) = 0 Timeout
>  0.275 ( 0.001 ms): firefox/1458 poll(ufds: 0x7f43d6ea1340, nfds: 5) = 0 Timeout
>  0.283 ( 0.001 ms): firefox/1458 poll(ufds: 0x7f43d6ea1340, nfds: 5) = 0 Timeout
>  0.979 ( 0.000 ms): gnome-terminal/2572  ... [continued]: poll()) = 1
>  1.056 ( 0.768 ms): firefox/1458 poll(ufds: 0x7f43d6ea1340, nfds: 5, timeout_msecs: 4294967295) ...
>  1.065 ( 0.009 ms): gnome-terminal/2572 poll(ufds: 0x1934250, nfds: 23, timeout_msecs: 10) = 1
>  1.087 ( 0.007 ms): gnome-terminal/2572 poll(ufds: 0x1934250, nfds: 23, timeout_msecs: 10) = 2
>  1.132 ( 0.007 ms): gnome-terminal/2572 poll(ufds: 0x1934250, nfds: 23, timeout_msecs: 10) = 1
>  1.161 ( 0.013 ms): gnome-terminal/2572 poll(ufds: 0x1934250, nfds: 23, timeout_msecs: 10) = 1
> [acme@zoo ~]$
> 
> I.e. Explain the mistake and provide a hint to solve it, as close to the actual
> commands needed to perform such corrective/enabling action as possible.

Yeah, I absolutely love such tooling hints.

Thanks,

	Ingo

^ permalink raw reply	[flat|nested] 43+ messages in thread

* Re: [EXPERIENCE] My experience on using perf record BPF filter on a real usecase
  2015-06-04 10:17     ` [EXPERIENCE] My experience on using perf record BPF filter on a real usecase Wangnan (F)
@ 2015-06-10  6:42       ` Alexei Starovoitov
  2015-06-10  6:48         ` Wangnan (F)
  0 siblings, 1 reply; 43+ messages in thread
From: Alexei Starovoitov @ 2015-06-10  6:42 UTC (permalink / raw)
  To: Wangnan (F), Ingo Molnar, Arnaldo Carvalho de Melo
  Cc: linux-kernel, Adrian Hunter, Brendan Gregg, Daniel Borkmann,
	David Ahern, He Kuang, Jiri Olsa, Kaixu Xia, Madhavan Srinivasan,
	Masami Hiramatsu, Michael Ellerman, Namhyung Kim, Peter Zijlstra,
	pi3orama, Sukadev Bhattiprolu, Zefan Li,
	Arnaldo Carvalho de Melo

On 6/4/15 3:17 AM, Wangnan (F) wrote:
> Hi all,
>
> I'd like to share my exprience on using 'perf record' BPF filter in a
> real usecase to show the power and shortcome in my patch series:

thanks for sharing!

> Here is another inconvenience. Currently I only concern on write
> syscall issued by iozone. However, without '-a' I'm unable to collect
> information of the locker. If I want to filter sys_{enter,exit}_write
> belong to iozone out using eBPF, I need to implement another function
> like BPF_FUNC_git_comm. Another method is to use perf '--filter' after
> the two events. However it looks strange to use two filter mechanisms
> together. This time I choose to do filtering offline using perf script.

that doesn't sound clean.
btw, I've been playing for a while with
bpf_get_current_task_info() helper:
https://git.kernel.org/cgit/linux/kernel/git/ast/bpf.git/commit/?id=c5453ffa107ddf95a91920cc947bb8bf9eab16d6
I think it's a better mechanism.
The user can get pid only via:
u32 pid = 0;
bpf_get_current_task_info(&pid, sizeof(pid));
or full pid + comm + future fields via full 'struct bpf_task_info'
Thoughts?


^ permalink raw reply	[flat|nested] 43+ messages in thread

* Re: [EXPERIENCE] My experience on using perf record BPF filter on a real usecase
  2015-06-10  6:42       ` Alexei Starovoitov
@ 2015-06-10  6:48         ` Wangnan (F)
  0 siblings, 0 replies; 43+ messages in thread
From: Wangnan (F) @ 2015-06-10  6:48 UTC (permalink / raw)
  To: Alexei Starovoitov, Ingo Molnar, Arnaldo Carvalho de Melo
  Cc: linux-kernel, Adrian Hunter, Brendan Gregg, Daniel Borkmann,
	David Ahern, He Kuang, Jiri Olsa, Kaixu Xia, Madhavan Srinivasan,
	Masami Hiramatsu, Michael Ellerman, Namhyung Kim, Peter Zijlstra,
	pi3orama, Sukadev Bhattiprolu, Zefan Li,
	Arnaldo Carvalho de Melo



On 2015/6/10 14:42, Alexei Starovoitov wrote:
> On 6/4/15 3:17 AM, Wangnan (F) wrote:
>> Hi all,
>>
>> I'd like to share my exprience on using 'perf record' BPF filter in a
>> real usecase to show the power and shortcome in my patch series:
>
> thanks for sharing!
>
>> Here is another inconvenience. Currently I only concern on write
>> syscall issued by iozone. However, without '-a' I'm unable to collect
>> information of the locker. If I want to filter sys_{enter,exit}_write
>> belong to iozone out using eBPF, I need to implement another function
>> like BPF_FUNC_git_comm. Another method is to use perf '--filter' after
>> the two events. However it looks strange to use two filter mechanisms
>> together. This time I choose to do filtering offline using perf script.
>
> that doesn't sound clean.
> btw, I've been playing for a while with
> bpf_get_current_task_info() helper:
> https://git.kernel.org/cgit/linux/kernel/git/ast/bpf.git/commit/?id=c5453ffa107ddf95a91920cc947bb8bf9eab16d6 
>
> I think it's a better mechanism.
> The user can get pid only via:
> u32 pid = 0;
> bpf_get_current_task_info(&pid, sizeof(pid));
> or full pid + comm + future fields via full 'struct bpf_task_info'
> Thoughts?
>

Looks good. Thank you for your information!



^ permalink raw reply	[flat|nested] 43+ messages in thread

* Re: [GIT PULL 0/6] perf/core improvements and fixes
  2017-03-16 16:09 [GIT PULL 0/6] perf/core improvements and fixes Arnaldo Carvalho de Melo
@ 2017-03-16 16:30 ` Ingo Molnar
  0 siblings, 0 replies; 43+ messages in thread
From: Ingo Molnar @ 2017-03-16 16:30 UTC (permalink / raw)
  To: Arnaldo Carvalho de Melo
  Cc: linux-kernel, Adrian Hunter, Alexander Shishkin,
	Alexei Starovoitov, Ananth N Mavinakayanahalli, Andi Kleen,
	Andy Lutomirski, Anton Blanchard, Borislav Petkov, Brendan Gregg,
	David Ahern, David Miller, He Kuang, Hemant Kumar, Jiri Olsa,
	linuxppc-dev, Masami Hiramatsu, Mathieu Poirier,
	Michael Ellerman, Namhyung Kim, Naveen N . Rao, Peter Zijlstra,
	Ravi Bangoria, Srikar Dronamraju, Stephane Eranian,
	Steven Rostedt, Sukadev Bhattiprolu, Taeung Song,
	Thomas Gleixner, Wang Nan, Arnaldo Carvalho de Melo


* Arnaldo Carvalho de Melo <acme@kernel.org> wrote:

> Hi Ingo,
> 
> 	Please consider pulling,
> 
> - Arnaldo
> 
> Test results at the end of this message, as usual.
> 
> The following changes since commit ffa86c2f1a8862cf58c873f6f14d4b2c3250fb48:
> 
>   Merge tag 'perf-core-for-mingo-4.12-20170314' of git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux into perf/core (2017-03-15 19:27:27 +0100)
> 
> are available in the git repository at:
> 
>   git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux.git tags/perf-core-for-mingo-4.12-20170316
> 
> for you to fetch changes up to 61f35d750683b21e9e3836e309195c79c1daed74:
> 
>   uprobes: Default UPROBES_EVENTS to Y (2017-03-16 12:42:02 -0300)
> 
> ----------------------------------------------------------------
> perf/core improvements and fixes:
> 
> New features:
> 
> - Add 'brstackinsn' field in 'perf script' to reuse the x86 instruction
>   decoder used in the Intel PT code to study hot paths to samples (Andi Kleen)
> 
> Kernel:
> 
> - Default UPROBES_EVENTS to Y (Alexei Starovoitov)
> 
> - Fix check for kretprobe offset within function entry (Naveen N. Rao)
> 
> Infrastructure:
> 
> - Introduce util func is_sdt_event() (Ravi Bangoria)
> 
> - Make perf_event__synthesize_mmap_events() scale on older kernels where
>   reading /proc/pid/maps is way slower than reading /proc/pid/task/pid/maps (Stephane Eranian)
> 
> Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
> 
> ----------------------------------------------------------------
> Andi Kleen (1):
>       perf script: Add 'brstackinsn' for branch stacks
> 
> Arnaldo Carvalho de Melo (2):
>       tools headers: Sync {tools/,}arch/x86/include/asm/cpufeatures.h
>       uprobes: Default UPROBES_EVENTS to Y
> 
> Naveen N. Rao (1):
>       trace/kprobes: Fix check for kretprobe offset within function entry
> 
> Ravi Bangoria (1):
>       perf probe: Introduce util func is_sdt_event()
> 
> Stephane Eranian (1):
>       perf tools: Make perf_event__synthesize_mmap_events() scale
> 
>  include/linux/kprobes.h                            |   1 +
>  kernel/kprobes.c                                   |  40 ++--
>  kernel/trace/Kconfig                               |   2 +-
>  kernel/trace/trace_kprobe.c                        |   2 +-
>  tools/arch/x86/include/asm/cpufeatures.h           |   5 +-
>  tools/perf/Documentation/perf-script.txt           |  13 +-
>  tools/perf/builtin-script.c                        | 264 ++++++++++++++++++++-
>  tools/perf/util/Build                              |   1 +
>  tools/perf/util/dump-insn.c                        |  14 ++
>  tools/perf/util/dump-insn.h                        |  22 ++
>  tools/perf/util/event.c                            |   4 +-
>  .../util/intel-pt-decoder/intel-pt-insn-decoder.c  |  24 ++
>  tools/perf/util/parse-events.h                     |  20 ++
>  tools/perf/util/probe-event.c                      |   9 +-
>  14 files changed, 381 insertions(+), 40 deletions(-)
>  create mode 100644 tools/perf/util/dump-insn.c
>  create mode 100644 tools/perf/util/dump-insn.h

Pulled, thanks a lot Arnaldo!

	Ingo

^ permalink raw reply	[flat|nested] 43+ messages in thread

* [GIT PULL 0/6] perf/core improvements and fixes
@ 2017-03-16 16:09 Arnaldo Carvalho de Melo
  2017-03-16 16:30 ` Ingo Molnar
  0 siblings, 1 reply; 43+ messages in thread
From: Arnaldo Carvalho de Melo @ 2017-03-16 16:09 UTC (permalink / raw)
  To: Ingo Molnar
  Cc: linux-kernel, Arnaldo Carvalho de Melo, Adrian Hunter,
	Alexander Shishkin, Alexei Starovoitov,
	Ananth N Mavinakayanahalli, Andi Kleen, Andy Lutomirski,
	Anton Blanchard, Borislav Petkov, Brendan Gregg, David Ahern,
	David Miller, He Kuang, Hemant Kumar, Jiri Olsa, linuxppc-dev,
	Masami Hiramatsu, Mathieu Poirier, Michael Ellerman,
	Namhyung Kim, Naveen N . Rao, Peter Zijlstra, Ravi Bangoria,
	Srikar Dronamraju, Stephane Eranian, Steven Rostedt,
	Sukadev Bhattiprolu, Taeung Song, Thomas Gleixner, Wang Nan,
	Arnaldo Carvalho de Melo

Hi Ingo,

	Please consider pulling,

- Arnaldo

Test results at the end of this message, as usual.

The following changes since commit ffa86c2f1a8862cf58c873f6f14d4b2c3250fb48:

  Merge tag 'perf-core-for-mingo-4.12-20170314' of git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux into perf/core (2017-03-15 19:27:27 +0100)

are available in the git repository at:

  git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux.git tags/perf-core-for-mingo-4.12-20170316

for you to fetch changes up to 61f35d750683b21e9e3836e309195c79c1daed74:

  uprobes: Default UPROBES_EVENTS to Y (2017-03-16 12:42:02 -0300)

----------------------------------------------------------------
perf/core improvements and fixes:

New features:

- Add 'brstackinsn' field in 'perf script' to reuse the x86 instruction
  decoder used in the Intel PT code to study hot paths to samples (Andi Kleen)

Kernel:

- Default UPROBES_EVENTS to Y (Alexei Starovoitov)

- Fix check for kretprobe offset within function entry (Naveen N. Rao)

Infrastructure:

- Introduce util func is_sdt_event() (Ravi Bangoria)

- Make perf_event__synthesize_mmap_events() scale on older kernels where
  reading /proc/pid/maps is way slower than reading /proc/pid/task/pid/maps (Stephane Eranian)

Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>

----------------------------------------------------------------
Andi Kleen (1):
      perf script: Add 'brstackinsn' for branch stacks

Arnaldo Carvalho de Melo (2):
      tools headers: Sync {tools/,}arch/x86/include/asm/cpufeatures.h
      uprobes: Default UPROBES_EVENTS to Y

Naveen N. Rao (1):
      trace/kprobes: Fix check for kretprobe offset within function entry

Ravi Bangoria (1):
      perf probe: Introduce util func is_sdt_event()

Stephane Eranian (1):
      perf tools: Make perf_event__synthesize_mmap_events() scale

 include/linux/kprobes.h                            |   1 +
 kernel/kprobes.c                                   |  40 ++--
 kernel/trace/Kconfig                               |   2 +-
 kernel/trace/trace_kprobe.c                        |   2 +-
 tools/arch/x86/include/asm/cpufeatures.h           |   5 +-
 tools/perf/Documentation/perf-script.txt           |  13 +-
 tools/perf/builtin-script.c                        | 264 ++++++++++++++++++++-
 tools/perf/util/Build                              |   1 +
 tools/perf/util/dump-insn.c                        |  14 ++
 tools/perf/util/dump-insn.h                        |  22 ++
 tools/perf/util/event.c                            |   4 +-
 .../util/intel-pt-decoder/intel-pt-insn-decoder.c  |  24 ++
 tools/perf/util/parse-events.h                     |  20 ++
 tools/perf/util/probe-event.c                      |   9 +-
 14 files changed, 381 insertions(+), 40 deletions(-)
 create mode 100644 tools/perf/util/dump-insn.c
 create mode 100644 tools/perf/util/dump-insn.h

Test results:

The first ones are container (docker) based builds of tools/perf with and
without libelf support, objtool where it is supported and samples/bpf/, ditto.
Where clang is available, it is also used to build perf with/without libelf.

Several are cross builds, the ones with -x-ARCH, and the android one, and those
may not have all the features built, due to lack of multi-arch devel packages,
available and being used so far on just a few, like
debian:experimental-x-{arm64,mipsel}.

The 'perf test' one will perform a variety of tests exercising
tools/perf/util/, tools/lib/{bpf,traceevent,etc}, as well as run perf commands
with a variety of command line event specifications to then intercept the
sys_perf_event syscall to check that the perf_event_attr fields are set up as
expected, among a variety of other unit tests.

Then there is the 'make -C tools/perf build-test' ones, that build tools/perf/
with a variety of feature sets, exercising the build with an incomplete set of
features as well as with a complete one. It is planned to have it run on each
of the containers mentioned above, using some container orchestration
infrastructure. Get in contact if interested in helping having this in place.

  # dm
   1 alpine:3.4: Ok
   2 alpine:3.5: Ok
   3 alpine:edge: Ok
   4 android-ndk:r12b-arm: Ok
   5 archlinux:latest: Ok
   6 centos:5: Ok
   7 centos:6: Ok
   8 centos:7: Ok
   9 debian:7: Ok
  10 debian:8: Ok
  11 debian:experimental: Ok
  12 debian:experimental-x-arm64: Ok
  13 debian:experimental-x-mips: Ok
  14 debian:experimental-x-mips64: Ok
  15 debian:experimental-x-mipsel: Ok
  16 fedora:20: Ok
  17 fedora:21: Ok
  18 fedora:22: Ok
  19 fedora:23: Ok
  20 fedora:24: Ok
  21 fedora:24-x-ARC-uClibc: Ok
  22 fedora:25: Ok
  23 fedora:rawhide: Ok
  24 mageia:5: Ok
  25 opensuse:13.2: Ok
  26 opensuse:42.1: Ok
  27 opensuse:tumbleweed: Ok
  28 ubuntu:12.04.5: Ok
  29 ubuntu:14.04.4: Ok
  30 ubuntu:14.04.4-x-linaro-arm64: Ok
  31 ubuntu:15.10: Ok
  32 ubuntu:16.04: Ok
  33 ubuntu:16.04-x-arm: Ok
  34 ubuntu:16.04-x-arm64: Ok
  35 ubuntu:16.04-x-powerpc: Ok
  36 ubuntu:16.04-x-powerpc64: Ok
  37 ubuntu:16.04-x-s390: Ok
  38 ubuntu:16.10: Ok
  39 ubuntu:17.04: Ok

  # uname -a
  Linux zoo 4.9.13-100.fc24.x86_64 #1 SMP Mon Feb 27 16:57:22 UTC 2017 x86_64 x86_64 x86_64 GNU/Linux
  # perf test
   1: vmlinux symtab matches kallsyms            : Ok
   2: Detect openat syscall event                : Ok
   3: Detect openat syscall event on all cpus    : Ok
   4: Read samples using the mmap interface      : Ok
   5: Parse event definition strings             : Ok
   6: PERF_RECORD_* events & perf_sample fields  : Ok
   7: Parse perf pmu format                      : Ok
   8: DSO data read                              : Ok
   9: DSO data cache                             : Ok
  10: DSO data reopen                            : Ok
  11: Roundtrip evsel->name                      : Ok
  12: Parse sched tracepoints fields             : Ok
  13: syscalls:sys_enter_openat event fields     : Ok
  14: Setup struct perf_event_attr               : Ok
  15: Match and link multiple hists              : Ok
  16: 'import perf' in python                    : Ok
  17: Breakpoint overflow signal handler         : Ok
  18: Breakpoint overflow sampling               : Ok
  19: Number of exit events of a simple workload : Ok
  20: Software clock events period values        : Ok
  21: Object code reading                        : Ok
  22: Sample parsing                             : Ok
  23: Use a dummy software event to keep tracking: Ok
  24: Parse with no sample_id_all bit set        : Ok
  25: Filter hist entries                        : Ok
  26: Lookup mmap thread                         : Ok
  27: Share thread mg                            : Ok
  28: Sort output of hist entries                : Ok
  29: Cumulate child hist entries                : Ok
  30: Track with sched_switch                    : Ok
  31: Filter fds with revents mask in a fdarray  : Ok
  32: Add fd to a fdarray, making it autogrow    : Ok
  33: kmod_path__parse                           : Ok
  34: Thread map                                 : Ok
  35: LLVM search and compile                    :
  35.1: Basic BPF llvm compile                    : Ok
  35.2: kbuild searching                          : Ok
  35.3: Compile source for BPF prologue generation: Ok
  35.4: Compile source for BPF relocation         : Ok
  36: Session topology                           : Ok
  37: BPF filter                                 :
  37.1: Basic BPF filtering                      : Ok
  37.2: BPF pinning                              : Ok
  37.3: BPF prologue generation                  : Ok
  37.4: BPF relocation checker                   : Ok
  38: Synthesize thread map                      : Ok
  39: Remove thread map                          : Ok
  40: Synthesize cpu map                         : Ok
  41: Synthesize stat config                     : Ok
  42: Synthesize stat                            : Ok
  43: Synthesize stat round                      : Ok
  44: Synthesize attr update                     : Ok
  45: Event times                                : Ok
  46: Read backward ring buffer                  : Ok
  47: Print cpu map                              : Ok
  48: Probe SDT events                           : Ok
  49: is_printable_array                         : Ok
  50: Print bitmap                               : Ok
  51: perf hooks                                 : Ok
  52: builtin clang support                      : Skip (not compiled in)
  53: unit_number__scnprintf                     : Ok
  54: x86 rdpmc                                  : Ok
  55: Convert perf time to TSC                   : Ok
  56: DWARF unwind                               : Ok
  57: x86 instruction decoder - new instructions : Ok
  58: Intel cqm nmi context read                 : Skip
  # 
  $ make -C tools/perf build-test
  make: Entering directory '/home/acme/git/linux/tools/perf'
  - tarpkg: ./tests/perf-targz-src-pkg .
              make_no_libelf_O: make NO_LIBELF=1
             make_no_libperl_O: make NO_LIBPERL=1
             make_no_scripts_O: make NO_LIBPYTHON=1 NO_LIBPERL=1
             make_no_libnuma_O: make NO_LIBNUMA=1
                   make_pure_O: make
                 make_perf_o_O: make perf.o
                make_no_newt_O: make NO_NEWT=1
                   make_help_O: make help
            make_no_libaudit_O: make NO_LIBAUDIT=1
                 make_static_O: make LDFLAGS=-static
                    make_doc_O: make doc
              make_clean_all_O: make clean all
                  make_debug_O: make DEBUG=1
                make_minimal_O: make NO_LIBPERL=1 NO_LIBPYTHON=1 NO_NEWT=1 NO_GTK2=1 NO_DEMANGLE=1 NO_LIBELF=1 NO_LIBUNWIND=1 NO_BACKTRACE=1 NO_LIBNUMA=1 NO_LIBAUDIT=1 NO_LIBBIONIC=1 NO_LIBDW_DWARF_UNWIND=1 NO_AUXTRACE=1 NO_LIBBPF=1 NO_LIBCRYPTO=1 NO_SDT=1 NO_JVMTI=1
           make_no_libunwind_O: make NO_LIBUNWIND=1
           make_no_libbionic_O: make NO_LIBBIONIC=1
           make_no_libpython_O: make NO_LIBPYTHON=1
               make_no_slang_O: make NO_SLANG=1
        make_with_babeltrace_O: make LIBBABELTRACE=1
         make_install_prefix_O: make install prefix=/tmp/krava
                  make_no_ui_O: make NO_NEWT=1 NO_SLANG=1 NO_GTK2=1
       make_util_pmu_bison_o_O: make util/pmu-bison.o
            make_no_demangle_O: make NO_DEMANGLE=1
   make_install_prefix_slash_O: make install prefix=/tmp/krava/
  make_no_libdw_dwarf_unwind_O: make NO_LIBDW_DWARF_UNWIND=1
            make_no_auxtrace_O: make NO_AUXTRACE=1
            make_install_bin_O: make install-bin
             make_util_map_o_O: make util/map.o
                   make_tags_O: make tags
              make_no_libbpf_O: make NO_LIBBPF=1
                make_install_O: make install
                make_no_gtk2_O: make NO_GTK2=1
           make_no_backtrace_O: make NO_BACKTRACE=1
         make_with_clangllvm_O: make LIBCLANGLLVM=1
  OK
  make: Leaving directory '/home/acme/git/linux/tools/perf'

^ permalink raw reply	[flat|nested] 43+ messages in thread

* Re: [GIT PULL 0/6] perf/core improvements and fixes
  2015-08-31 21:36 Arnaldo Carvalho de Melo
@ 2015-09-01  8:26 ` Ingo Molnar
  0 siblings, 0 replies; 43+ messages in thread
From: Ingo Molnar @ 2015-09-01  8:26 UTC (permalink / raw)
  To: Arnaldo Carvalho de Melo
  Cc: linux-kernel, Adrian Hunter, Alexei Starovoitov, Andi Kleen,
	Brendan Gregg, Daniel Borkmann, David Ahern, He Kuang, Jiri Olsa,
	Kaixu Xia, Kan Liang, Masami Hiramatsu, Namhyung Kim,
	Peter Zijlstra, pi3orama, Stephane Eranian, Sukadev Bhattiprolu,
	Wang Nan, Zefan Li, Arnaldo Carvalho de Melo


* Arnaldo Carvalho de Melo <acme@kernel.org> wrote:

> Hi Ingo,
> 
> 	Please consider pulling,
> 
> - Arnaldo
> 
> The following changes since commit bac2e4a96d1c0bcce5e9654dcc902f75576b9b03:
> 
>   Merge tag 'perf-core-for-mingo' of git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux into perf/core (2015-08-31 10:25:46 +0200)
> 
> are available in the git repository at:
> 
>   git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux.git tags/perf-core-for-mingo
> 
> for you to fetch changes up to bcc84ec65ad1bd9f777a1fade6f8e5e0c5808fa5:
> 
>   perf record: Add ability to name registers to record (2015-08-31 18:01:33 -0300)
> 
> ----------------------------------------------------------------
> perf/core improvements and fixes:
> 
> User visible:
> 
> - Add ability to specify to select which registers to record,
>   to reduce the size of perf.data files, and also allow printing
>   the registers in 'perf script': (Stephane Eranian)
> 
>     # perf record --intr-regs=AX,SP usleep 1
>     [ perf record: Woken up 1 times to write data ]
>     [ perf record: Captured and wrote 0.016 MB perf.data (8 samples) ]
>     # perf script -F ip,sym,iregs | tail -5
>      ffffffff8105f42a native_write_msr_safe   AX:0xf    SP:0xffff8802629c3c00
>      ffffffff8105f42a native_write_msr_safe   AX:0xf    SP:0xffff8802629c3c00
>      ffffffff81761ac0 _raw_spin_lock   AX:0xffff8801bfcf8020    SP:0xffff8802629c3ce8
>      ffffffff81202bf8 __vma_adjust_trans_huge   AX:0x7ffc75200000    SP:0xffff8802629c3b30
>      ffffffff8122b089 dput   AX:0x101    SP:0xffff8802629c3c78
>     #
> 
> Infrastructure:
> 
> - Open event on evsel cpus and threads (Kan Liang)
> 
> - New bpf API to get name from a BPF object (Wang Nan)
> 
> Build fixes:
> 
> - Fix build on powerpc broken by pt/bts (Adrian Hunter)
> 
> Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
> 
> ----------------------------------------------------------------
> Adrian Hunter (1):
>       perf tools: Fix build on powerpc broken by pt/bts
> 
> Kan Liang (1):
>       perf evlist: Open event on evsel cpus and threads
> 
> Stephane Eranian (3):
>       perf script: Enable printing of interrupted machine state
>       perf/x86: Add list of register names
>       perf record: Add ability to name registers to record
> 
> Wang Nan (1):
>       bpf tools: New API to get name from a BPF object
> 
>  tools/lib/bpf/libbpf.c                             | 25 +++++++-
>  tools/lib/bpf/libbpf.h                             |  4 +-
>  tools/perf/Documentation/perf-record.txt           |  6 +-
>  tools/perf/Documentation/perf-script.txt           |  2 +-
>  tools/perf/arch/x86/util/Build                     |  1 +
>  tools/perf/arch/x86/util/perf_regs.c               | 30 +++++++++
>  tools/perf/builtin-record.c                        |  9 ++-
>  tools/perf/builtin-script.c                        | 31 +++++++++-
>  tools/perf/perf.h                                  |  2 +-
>  tools/perf/tests/llvm.c                            |  2 +-
>  tools/perf/util/Build                              |  2 +
>  tools/perf/util/evlist.c                           |  4 ++
>  tools/perf/util/evsel.c                            |  2 +-
>  .../util/intel-pt-decoder/intel-pt-insn-decoder.c  |  3 +
>  tools/perf/util/parse-regs-options.c               | 71 ++++++++++++++++++++++
>  tools/perf/util/parse-regs-options.h               |  5 ++
>  tools/perf/util/perf_regs.h                        |  7 +++
>  17 files changed, 193 insertions(+), 13 deletions(-)
>  create mode 100644 tools/perf/arch/x86/util/perf_regs.c
>  create mode 100644 tools/perf/util/parse-regs-options.c
>  create mode 100644 tools/perf/util/parse-regs-options.h

Pulled into tip:perf/urgent, thanks a lot Arnaldo!

	Ingo

^ permalink raw reply	[flat|nested] 43+ messages in thread

* [GIT PULL 0/6] perf/core improvements and fixes
@ 2015-08-31 21:36 Arnaldo Carvalho de Melo
  2015-09-01  8:26 ` Ingo Molnar
  0 siblings, 1 reply; 43+ messages in thread
From: Arnaldo Carvalho de Melo @ 2015-08-31 21:36 UTC (permalink / raw)
  To: Ingo Molnar
  Cc: linux-kernel, Arnaldo Carvalho de Melo, Adrian Hunter,
	Alexei Starovoitov, Andi Kleen, Brendan Gregg, Daniel Borkmann,
	David Ahern, He Kuang, Jiri Olsa, Kaixu Xia, Kan Liang,
	Masami Hiramatsu, Namhyung Kim, Peter Zijlstra, pi3orama,
	Stephane Eranian, Sukadev Bhattiprolu, Wang Nan, Zefan Li,
	Arnaldo Carvalho de Melo

Hi Ingo,

	Please consider pulling,

- Arnaldo

The following changes since commit bac2e4a96d1c0bcce5e9654dcc902f75576b9b03:

  Merge tag 'perf-core-for-mingo' of git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux into perf/core (2015-08-31 10:25:46 +0200)

are available in the git repository at:

  git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux.git tags/perf-core-for-mingo

for you to fetch changes up to bcc84ec65ad1bd9f777a1fade6f8e5e0c5808fa5:

  perf record: Add ability to name registers to record (2015-08-31 18:01:33 -0300)

----------------------------------------------------------------
perf/core improvements and fixes:

User visible:

- Add ability to specify to select which registers to record,
  to reduce the size of perf.data files, and also allow printing
  the registers in 'perf script': (Stephane Eranian)

    # perf record --intr-regs=AX,SP usleep 1
    [ perf record: Woken up 1 times to write data ]
    [ perf record: Captured and wrote 0.016 MB perf.data (8 samples) ]
    # perf script -F ip,sym,iregs | tail -5
     ffffffff8105f42a native_write_msr_safe   AX:0xf    SP:0xffff8802629c3c00
     ffffffff8105f42a native_write_msr_safe   AX:0xf    SP:0xffff8802629c3c00
     ffffffff81761ac0 _raw_spin_lock   AX:0xffff8801bfcf8020    SP:0xffff8802629c3ce8
     ffffffff81202bf8 __vma_adjust_trans_huge   AX:0x7ffc75200000    SP:0xffff8802629c3b30
     ffffffff8122b089 dput   AX:0x101    SP:0xffff8802629c3c78
    #

Infrastructure:

- Open event on evsel cpus and threads (Kan Liang)

- New bpf API to get name from a BPF object (Wang Nan)

Build fixes:

- Fix build on powerpc broken by pt/bts (Adrian Hunter)

Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>

----------------------------------------------------------------
Adrian Hunter (1):
      perf tools: Fix build on powerpc broken by pt/bts

Kan Liang (1):
      perf evlist: Open event on evsel cpus and threads

Stephane Eranian (3):
      perf script: Enable printing of interrupted machine state
      perf/x86: Add list of register names
      perf record: Add ability to name registers to record

Wang Nan (1):
      bpf tools: New API to get name from a BPF object

 tools/lib/bpf/libbpf.c                             | 25 +++++++-
 tools/lib/bpf/libbpf.h                             |  4 +-
 tools/perf/Documentation/perf-record.txt           |  6 +-
 tools/perf/Documentation/perf-script.txt           |  2 +-
 tools/perf/arch/x86/util/Build                     |  1 +
 tools/perf/arch/x86/util/perf_regs.c               | 30 +++++++++
 tools/perf/builtin-record.c                        |  9 ++-
 tools/perf/builtin-script.c                        | 31 +++++++++-
 tools/perf/perf.h                                  |  2 +-
 tools/perf/tests/llvm.c                            |  2 +-
 tools/perf/util/Build                              |  2 +
 tools/perf/util/evlist.c                           |  4 ++
 tools/perf/util/evsel.c                            |  2 +-
 .../util/intel-pt-decoder/intel-pt-insn-decoder.c  |  3 +
 tools/perf/util/parse-regs-options.c               | 71 ++++++++++++++++++++++
 tools/perf/util/parse-regs-options.h               |  5 ++
 tools/perf/util/perf_regs.h                        |  7 +++
 17 files changed, 193 insertions(+), 13 deletions(-)
 create mode 100644 tools/perf/arch/x86/util/perf_regs.c
 create mode 100644 tools/perf/util/parse-regs-options.c
 create mode 100644 tools/perf/util/parse-regs-options.h

^ permalink raw reply	[flat|nested] 43+ messages in thread

* Re: [GIT PULL 0/6] perf/core improvements and fixes
  2015-03-31 20:59 Arnaldo Carvalho de Melo
@ 2015-04-01  7:56 ` Ingo Molnar
  0 siblings, 0 replies; 43+ messages in thread
From: Ingo Molnar @ 2015-04-01  7:56 UTC (permalink / raw)
  To: Arnaldo Carvalho de Melo
  Cc: linux-kernel, Adrian Hunter, Andi Kleen, Borislav Petkov,
	David Ahern, David Hildenbrand, Don Zickus, Frederic Weisbecker,
	Hendrik Brueckner, Jeremie Galarneau, Jiri Olsa, Joe Mario,
	Kan Liang, Martin Schwidefsky, Namhyung Kim, Paul Mackerras,
	Peter Zijlstra, Sebastian Andrzej Siewior, Stephane Eranian,
	Tom Zanussi, Wang Nan, Arnaldo Carvalho de Melo


* Arnaldo Carvalho de Melo <acme@kernel.org> wrote:

> Hi Ingo,
> 
> 	Please consider pulling,
> 
> - Arnaldo
> 
> The following changes since commit ccd41c86ad4d464d0ed4e48d80759ff85c2115b0:
> 
>   perf: Fix racy group access (2015-03-27 09:49:45 +0100)
> 
> are available in the git repository at:
> 
>   git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux.git tags/perf-core-for-mingo
> 
> for you to fetch changes up to 9870d7809575515e26194f4b3df1750872a6ee60:
> 
>   perf ordered_samples: Remove references to perf_{evlist,tool} and machines (2015-03-31 17:52:32 -0300)
> 
> ----------------------------------------------------------------
> perf/core improvements and fixes:
> 
> User visible:
> 
> - Fix 'perf script' pipe mode segfault, by always initializing ordered_events in
>   perf_session__new. (Arnaldo Carvalho de Melo)
> 
> - Fix ppid for synthesized fork events (David Ahern)
> 
> - Fix kernel symbol resolution of callchains in S/390 by remembering the
>   cpumode. (David Hildenbrand)
> 
> Infrastructure:
> 
> - Disable libbabeltrace check by default in the build system (Jiri Olsa)
> 
> Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
> 
> ----------------------------------------------------------------
> Arnaldo Carvalho de Melo (2):
>       perf session: Always initialize ordered_events
>       perf ordered_samples: Remove references to perf_{evlist,tool} and machines
> 
> David Ahern (2):
>       perf tools: Refactor comm/tgid lookup
>       perf tools: Fix ppid for synthesized fork events
> 
> David Hildenbrand (1):
>       perf callchain: Fix kernel symbol resolution by remembering the cpumode
> 
> Jiri Olsa (1):
>       perf build: Disable libbabeltrace check by default
> 
>  tools/perf/Makefile.perf         |   2 +-
>  tools/perf/config/Makefile       |   5 +-
>  tools/perf/util/event.c          | 147 ++++++++++++++++++++++++---------------
>  tools/perf/util/event.h          |   1 -
>  tools/perf/util/machine.c        |  28 ++++----
>  tools/perf/util/ordered-events.c |  21 ++----
>  tools/perf/util/ordered-events.h |  14 +---
>  tools/perf/util/session.c        |  48 ++++++++-----
>  tools/perf/util/session.h        |   1 +
>  9 files changed, 144 insertions(+), 123 deletions(-)

Pulled, thanks a lot Arnaldo!

	Ingo

^ permalink raw reply	[flat|nested] 43+ messages in thread

* [GIT PULL 0/6] perf/core improvements and fixes
@ 2015-03-31 20:59 Arnaldo Carvalho de Melo
  2015-04-01  7:56 ` Ingo Molnar
  0 siblings, 1 reply; 43+ messages in thread
From: Arnaldo Carvalho de Melo @ 2015-03-31 20:59 UTC (permalink / raw)
  To: Ingo Molnar
  Cc: linux-kernel, Arnaldo Carvalho de Melo, Adrian Hunter,
	Andi Kleen, Borislav Petkov, David Ahern, David Hildenbrand,
	Don Zickus, Frederic Weisbecker, Hendrik Brueckner,
	Jeremie Galarneau, Jiri Olsa, Joe Mario, Kan Liang,
	Martin Schwidefsky, Namhyung Kim, Paul Mackerras, Peter Zijlstra,
	Sebastian Andrzej Siewior, Stephane Eranian, Tom Zanussi,
	Wang Nan, Arnaldo Carvalho de Melo

Hi Ingo,

	Please consider pulling,

- Arnaldo

The following changes since commit ccd41c86ad4d464d0ed4e48d80759ff85c2115b0:

  perf: Fix racy group access (2015-03-27 09:49:45 +0100)

are available in the git repository at:

  git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux.git tags/perf-core-for-mingo

for you to fetch changes up to 9870d7809575515e26194f4b3df1750872a6ee60:

  perf ordered_samples: Remove references to perf_{evlist,tool} and machines (2015-03-31 17:52:32 -0300)

----------------------------------------------------------------
perf/core improvements and fixes:

User visible:

- Fix 'perf script' pipe mode segfault, by always initializing ordered_events in
  perf_session__new. (Arnaldo Carvalho de Melo)

- Fix ppid for synthesized fork events (David Ahern)

- Fix kernel symbol resolution of callchains in S/390 by remembering the
  cpumode. (David Hildenbrand)

Infrastructure:

- Disable libbabeltrace check by default in the build system (Jiri Olsa)

Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>

----------------------------------------------------------------
Arnaldo Carvalho de Melo (2):
      perf session: Always initialize ordered_events
      perf ordered_samples: Remove references to perf_{evlist,tool} and machines

David Ahern (2):
      perf tools: Refactor comm/tgid lookup
      perf tools: Fix ppid for synthesized fork events

David Hildenbrand (1):
      perf callchain: Fix kernel symbol resolution by remembering the cpumode

Jiri Olsa (1):
      perf build: Disable libbabeltrace check by default

 tools/perf/Makefile.perf         |   2 +-
 tools/perf/config/Makefile       |   5 +-
 tools/perf/util/event.c          | 147 ++++++++++++++++++++++++---------------
 tools/perf/util/event.h          |   1 -
 tools/perf/util/machine.c        |  28 ++++----
 tools/perf/util/ordered-events.c |  21 ++----
 tools/perf/util/ordered-events.h |  14 +---
 tools/perf/util/session.c        |  48 ++++++++-----
 tools/perf/util/session.h        |   1 +
 9 files changed, 144 insertions(+), 123 deletions(-)

^ permalink raw reply	[flat|nested] 43+ messages in thread

* Re: [GIT PULL 0/6] perf/core improvements and fixes
  2015-03-26 15:40 Arnaldo Carvalho de Melo
@ 2015-03-27  7:37 ` Ingo Molnar
  0 siblings, 0 replies; 43+ messages in thread
From: Ingo Molnar @ 2015-03-27  7:37 UTC (permalink / raw)
  To: Arnaldo Carvalho de Melo
  Cc: linux-kernel, Adrian Hunter, Borislav Petkov, David Ahern,
	David Ahern, Don Zickus, Frederic Weisbecker, Guilherme Cox,
	Jiri Olsa, Jiri Olsa, Namhyung Kim, Stephane Eranian,
	Steven Rostedt, Arnaldo Carvalho de Melo


* Arnaldo Carvalho de Melo <acme@kernel.org> wrote:

> Hi Ingo,
> 
> 	Please consider pulling,
> 
> - Arnaldo
> 
> The following changes since commit baa5a7bc5dd069bb37de9c8bdb5ea7f4e2e939e9:
> 
>   Merge tag 'perf-core-for-mingo' of git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux into perf/core (2015-03-24 17:22:44 +0100)
> 
> are available in the git repository at:
> 
> 
>   git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux.git tags/perf-core-for-mingo
> 
> for you to fetch changes up to 7c27f78a297b54c3c2f5075cb15d33431b7f6333:
> 
>   tools lib traceevent: Zero should not be considered "not found" in eval_flag() (2015-03-26 10:52:29 -0300)
> 
> ----------------------------------------------------------------
> perf/core improvements and fixes:
> 
> User visible:
> 
> - Show the first event with an invalid filter (David Ahern, Arnaldo Carvalho de Melo)
> 
> - Fix garbage output when intermixing syscalls from different threads in 'perf trace' (Arnaldo Carvalho de Melo)
> 
> - Fix 'perf timechart' SIBGUS error on sparc64 (David Ahern)
> 
> Infrastructure:
> 
> - Set JOBS based on CPU or processor, making it work on SPARC, where
>   /proc/cpuinfo has "CPU", not "processor" (David Ahern)
> 
> - Zero should not be considered "not found" in libtraceevent's eval_flag() (Steven Rostedt)
> 
> Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
> 
> ----------------------------------------------------------------
> Arnaldo Carvalho de Melo (2):
>       perf evlist: Return the first evsel with an invalid filter in apply_filters()
>       perf trace: Fix syscall enter formatting bug
> 
> David Ahern (3):
>       perf timechart: Fix SIBGUS error on sparc64
>       perf: Bump max number of cpus to 1024
>       perf tools: Set JOBS based on CPU or processor
> 
> Steven Rostedt (1):
>       tools lib traceevent: Zero should not be considered "not found" in eval_flag()
> 
>  tools/lib/traceevent/event-parse.c | 10 +++++-----
>  tools/perf/Makefile                |  2 +-
>  tools/perf/builtin-record.c        |  5 +++--
>  tools/perf/builtin-stat.c          |  5 +++--
>  tools/perf/builtin-trace.c         |  2 +-
>  tools/perf/perf.h                  |  2 +-
>  tools/perf/util/evlist.c           |  6 ++++--
>  tools/perf/util/evlist.h           |  2 +-
>  tools/perf/util/evsel.c            |  2 +-
>  9 files changed, 20 insertions(+), 16 deletions(-)

Pulled, thanks a lot Arnaldo!

Btw., I still see:

config/Makefile:609: No libbabeltrace found, disables 'perf data' CTF format support, please install libbabeltrace-dev[el]/libbabeltrace-ctf-dev

this annoyance needs to be fixed ASAP, before I can send perf/core to 
Linus.

	Ingo

^ permalink raw reply	[flat|nested] 43+ messages in thread

* [GIT PULL 0/6] perf/core improvements and fixes
@ 2015-03-26 15:40 Arnaldo Carvalho de Melo
  2015-03-27  7:37 ` Ingo Molnar
  0 siblings, 1 reply; 43+ messages in thread
From: Arnaldo Carvalho de Melo @ 2015-03-26 15:40 UTC (permalink / raw)
  To: Ingo Molnar
  Cc: linux-kernel, Arnaldo Carvalho de Melo, Adrian Hunter,
	Borislav Petkov, David Ahern, David Ahern, Don Zickus,
	Frederic Weisbecker, Guilherme Cox, Jiri Olsa, Jiri Olsa,
	Namhyung Kim, Stephane Eranian, Steven Rostedt,
	Arnaldo Carvalho de Melo

Hi Ingo,

	Please consider pulling,

- Arnaldo

The following changes since commit baa5a7bc5dd069bb37de9c8bdb5ea7f4e2e939e9:

  Merge tag 'perf-core-for-mingo' of git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux into perf/core (2015-03-24 17:22:44 +0100)

are available in the git repository at:


  git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux.git tags/perf-core-for-mingo

for you to fetch changes up to 7c27f78a297b54c3c2f5075cb15d33431b7f6333:

  tools lib traceevent: Zero should not be considered "not found" in eval_flag() (2015-03-26 10:52:29 -0300)

----------------------------------------------------------------
perf/core improvements and fixes:

User visible:

- Show the first event with an invalid filter (David Ahern, Arnaldo Carvalho de Melo)

- Fix garbage output when intermixing syscalls from different threads in 'perf trace' (Arnaldo Carvalho de Melo)

- Fix 'perf timechart' SIBGUS error on sparc64 (David Ahern)

Infrastructure:

- Set JOBS based on CPU or processor, making it work on SPARC, where
  /proc/cpuinfo has "CPU", not "processor" (David Ahern)

- Zero should not be considered "not found" in libtraceevent's eval_flag() (Steven Rostedt)

Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>

----------------------------------------------------------------
Arnaldo Carvalho de Melo (2):
      perf evlist: Return the first evsel with an invalid filter in apply_filters()
      perf trace: Fix syscall enter formatting bug

David Ahern (3):
      perf timechart: Fix SIBGUS error on sparc64
      perf: Bump max number of cpus to 1024
      perf tools: Set JOBS based on CPU or processor

Steven Rostedt (1):
      tools lib traceevent: Zero should not be considered "not found" in eval_flag()

 tools/lib/traceevent/event-parse.c | 10 +++++-----
 tools/perf/Makefile                |  2 +-
 tools/perf/builtin-record.c        |  5 +++--
 tools/perf/builtin-stat.c          |  5 +++--
 tools/perf/builtin-trace.c         |  2 +-
 tools/perf/perf.h                  |  2 +-
 tools/perf/util/evlist.c           |  6 ++++--
 tools/perf/util/evlist.h           |  2 +-
 tools/perf/util/evsel.c            |  2 +-
 9 files changed, 20 insertions(+), 16 deletions(-)

^ permalink raw reply	[flat|nested] 43+ messages in thread

* Re: [GIT PULL 0/6] perf/core improvements and fixes
  2014-05-12  9:27 Jiri Olsa
@ 2014-05-12 15:59 ` Ingo Molnar
  0 siblings, 0 replies; 43+ messages in thread
From: Ingo Molnar @ 2014-05-12 15:59 UTC (permalink / raw)
  To: Jiri Olsa
  Cc: linux-kernel, Adrian Hunter, Arnaldo Carvalho de Melo,
	Bernhard Rosenkraenzer, Corey Ashford, David Ahern, Dongsheng,
	Frederic Weisbecker, Irina Tirdea, Namhyung Kim, Paul Mackerras,
	Peter Zijlstra, Stephane Eranian


* Jiri Olsa <jolsa@kernel.org> wrote:

> hi Ingo,
> please consider pulling
> 
> thanks,
> jirka
> 
> 
> The following changes since commit 3e46d21285577a8c9e4c37f9b1002e567c440b28:
> 
>   Merge tag 'perf-core-for-mingo' of git://git.kernel.org/pub/scm/linux/kernel/git/jolsa/perf into perf/core (2014-05-05 19:37:51 +0200)
> 
> are available in the git repository at:
> 
> 
>   git://git.kernel.org/pub/scm/linux/kernel/git/jolsa/perf.git tags/perf-core-for-mingo
> 
> for you to fetch changes up to 13ce34df11833482cd698331fdbb3f8ced06340d:
> 
>   perf tools: Use tid for finding thread (2014-05-12 11:09:50 +0200)
> 
> ----------------------------------------------------------------
> perf/core improvements and fixes:
> 
> . Propagate exit status of a command line workload for
>   record command (Namhyung Kim)
> 
> . Use tid for finding thread (Namhyung Kim)
> 
> . Clarify the output of perf sched map plus small sched
>   command fixies (Dongsheng Yang)
> 
> Signed-off-by: Jiri Olsa <jolsa@kernel.org>
> 
> ----------------------------------------------------------------
> Dongsheng (3):
>       perf tools: Add missing event for perf sched record.
>       perf tools: Adapt the TASK_STATE_TO_CHAR_STR to new value in kernel space.
>       perf tools: Clarify the output of perf sched map.
> 
> Namhyung Kim (3):
>       perf record: Propagate exit status of a command line workload
>       perf tools: Get rid of on_exit() feature test
>       perf tools: Use tid for finding thread
> 
>  tools/perf/builtin-inject.c                     |   2 +-
>  tools/perf/builtin-kmem.c                       |   2 +-
>  tools/perf/builtin-record.c                     | 158 +++++++++---------------
>  tools/perf/builtin-sched.c                      |  38 +++---
>  tools/perf/config/Makefile                      |   8 --
>  tools/perf/config/feature-checks/Makefile       |   4 -
>  tools/perf/config/feature-checks/test-all.c     |   5 -
>  tools/perf/config/feature-checks/test-on-exit.c |  16 ---
>  tools/perf/tests/code-reading.c                 |   2 +-
>  tools/perf/tests/hists_filter.c                 |   1 +
>  tools/perf/tests/hists_link.c                   |   2 +
>  tools/perf/util/build-id.c                      |   2 +-
>  tools/perf/util/event.c                         |   2 +-
>  13 files changed, 90 insertions(+), 152 deletions(-)
>  delete mode 100644 tools/perf/config/feature-checks/test-on-exit.c

Pulled, thanks a lot Jiri!

	Ingo

^ permalink raw reply	[flat|nested] 43+ messages in thread

* [GIT PULL 0/6] perf/core improvements and fixes
@ 2014-05-12  9:27 Jiri Olsa
  2014-05-12 15:59 ` Ingo Molnar
  0 siblings, 1 reply; 43+ messages in thread
From: Jiri Olsa @ 2014-05-12  9:27 UTC (permalink / raw)
  To: Ingo Molnar
  Cc: linux-kernel, Adrian Hunter, Arnaldo Carvalho de Melo,
	Bernhard Rosenkraenzer, Corey Ashford, David Ahern, Dongsheng,
	Frederic Weisbecker, Irina Tirdea, Jiri Olsa, Namhyung Kim,
	Paul Mackerras, Peter Zijlstra, Stephane Eranian

hi Ingo,
please consider pulling

thanks,
jirka


The following changes since commit 3e46d21285577a8c9e4c37f9b1002e567c440b28:

  Merge tag 'perf-core-for-mingo' of git://git.kernel.org/pub/scm/linux/kernel/git/jolsa/perf into perf/core (2014-05-05 19:37:51 +0200)

are available in the git repository at:


  git://git.kernel.org/pub/scm/linux/kernel/git/jolsa/perf.git tags/perf-core-for-mingo

for you to fetch changes up to 13ce34df11833482cd698331fdbb3f8ced06340d:

  perf tools: Use tid for finding thread (2014-05-12 11:09:50 +0200)

----------------------------------------------------------------
perf/core improvements and fixes:

. Propagate exit status of a command line workload for
  record command (Namhyung Kim)

. Use tid for finding thread (Namhyung Kim)

. Clarify the output of perf sched map plus small sched
  command fixies (Dongsheng Yang)

Signed-off-by: Jiri Olsa <jolsa@kernel.org>

----------------------------------------------------------------
Dongsheng (3):
      perf tools: Add missing event for perf sched record.
      perf tools: Adapt the TASK_STATE_TO_CHAR_STR to new value in kernel space.
      perf tools: Clarify the output of perf sched map.

Namhyung Kim (3):
      perf record: Propagate exit status of a command line workload
      perf tools: Get rid of on_exit() feature test
      perf tools: Use tid for finding thread

 tools/perf/builtin-inject.c                     |   2 +-
 tools/perf/builtin-kmem.c                       |   2 +-
 tools/perf/builtin-record.c                     | 158 +++++++++---------------
 tools/perf/builtin-sched.c                      |  38 +++---
 tools/perf/config/Makefile                      |   8 --
 tools/perf/config/feature-checks/Makefile       |   4 -
 tools/perf/config/feature-checks/test-all.c     |   5 -
 tools/perf/config/feature-checks/test-on-exit.c |  16 ---
 tools/perf/tests/code-reading.c                 |   2 +-
 tools/perf/tests/hists_filter.c                 |   1 +
 tools/perf/tests/hists_link.c                   |   2 +
 tools/perf/util/build-id.c                      |   2 +-
 tools/perf/util/event.c                         |   2 +-
 13 files changed, 90 insertions(+), 152 deletions(-)
 delete mode 100644 tools/perf/config/feature-checks/test-on-exit.c

^ permalink raw reply	[flat|nested] 43+ messages in thread

* Re: [GIT PULL 0/6] perf/core improvements and fixes
  2014-04-30 10:11 Jiri Olsa
@ 2014-05-01  6:26 ` Ingo Molnar
  0 siblings, 0 replies; 43+ messages in thread
From: Ingo Molnar @ 2014-05-01  6:26 UTC (permalink / raw)
  To: Jiri Olsa
  Cc: Alexander Yarygin, Arnaldo Carvalho de Melo, Borislav Petkov,
	Christian Borntraeger, Corey Ashford, David Ahern,
	Frederic Weisbecker, Jean Pihet, Namhyung Kim, Paul Mackerras,
	Peter Zijlstra, Stephane Eranian, Will Deacon, linux-kernel


* Jiri Olsa <jolsa@kernel.org> wrote:

> hi Ingo,
> please consider pulling
> 
> resending 0/6.. it got lost somehow from previous post..
> 
> thanks,
> jirka
> 
> 
> The following changes since commit 201131998fbf074b03679afedcc29948e63331ef:
> 
>   Merge tag 'perf-core-for-mingo' of git://git.kernel.org/pub/scm/linux/kernel/git/jolsa/perf into perf/core (2014-04-29 08:41:21 +0200)
> 
> are available in the git repository at:
> 
> 
>   git://git.kernel.org/pub/scm/linux/kernel/git/jolsa/perf.git tags/perf-core-for-mingo
> 
> for you to fetch changes up to 8ab596afb97bc9e2f9140dc1d993e81749acff42:
> 
>   perf tools ARM64: Wire up perf_regs and unwind support (2014-04-29 15:31:19 +0200)
> 
> ----------------------------------------------------------------
> perf/core improvements and fixes:
> 
> . Wire up perf_regs and unwind support for ARM64 (Jean Pihet)
> 
> . Move u64_swap union to its single user's header, evsel.h (Borislav Petkov)
> 
> . Fix for s390 to properly parse tracepoints plus test code (Alexander Yarygin)
> 
> . Handle EINTR error for readn/writen (Namhyung Kim)
> 
> Signed-off-by: Jiri Olsa <jolsa@kernel.org>
> 
> ----------------------------------------------------------------
> Alexander Yarygin (3):
>       perf tools: Parse tracepoints with '-' in system name
>       perf tests: Add numeric identifier to evlist_test
>       perf tests: Add a test of kvm-390: trace event
> 
> Borislav Petkov (1):
>       perf tools: Move u64_swap union
> 
> Jean Pihet (1):
>       perf tools ARM64: Wire up perf_regs and unwind support
> 
> Namhyung Kim (1):
>       perf tools: Handle EINTR error for readn/writen
> 
>  tools/perf/arch/arm64/Makefile                |   7 ++
>  tools/perf/arch/arm64/include/perf_regs.h     |  88 ++++++++++++++++
>  tools/perf/arch/arm64/util/dwarf-regs.c       |  80 +++++++++++++++
>  tools/perf/arch/arm64/util/unwind-libunwind.c |  82 +++++++++++++++
>  tools/perf/config/Makefile                    |   8 +-
>  tools/perf/tests/parse-events.c               | 142 ++++++++++++++++++--------
>  tools/perf/util/evsel.h                       |   5 +
>  tools/perf/util/parse-events.y                |  12 +++
>  tools/perf/util/types.h                       |   5 -
>  tools/perf/util/util.c                        |   2 +
>  10 files changed, 380 insertions(+), 51 deletions(-)
>  create mode 100644 tools/perf/arch/arm64/Makefile
>  create mode 100644 tools/perf/arch/arm64/include/perf_regs.h
>  create mode 100644 tools/perf/arch/arm64/util/dwarf-regs.c
>  create mode 100644 tools/perf/arch/arm64/util/unwind-libunwind.c

Pulled, thanks a lot Jiri!

I also merged perf/urgent into perf/core, because a conflict developed 
between the two.

Thanks,

	Ingo

^ permalink raw reply	[flat|nested] 43+ messages in thread

* [GIT PULL 0/6] perf/core improvements and fixes
@ 2014-04-30 10:11 Jiri Olsa
  2014-05-01  6:26 ` Ingo Molnar
  0 siblings, 1 reply; 43+ messages in thread
From: Jiri Olsa @ 2014-04-30 10:11 UTC (permalink / raw)
  To: Ingo Molnar
  Cc: Alexander Yarygin, Arnaldo Carvalho de Melo, Borislav Petkov,
	Christian Borntraeger, Corey Ashford, David Ahern,
	Frederic Weisbecker, Ingo Molnar, Jean Pihet, Namhyung Kim,
	Paul Mackerras, Peter Zijlstra, Stephane Eranian, Will Deacon,
	linux-kernel

hi Ingo,
please consider pulling

resending 0/6.. it got lost somehow from previous post..

thanks,
jirka


The following changes since commit 201131998fbf074b03679afedcc29948e63331ef:

  Merge tag 'perf-core-for-mingo' of git://git.kernel.org/pub/scm/linux/kernel/git/jolsa/perf into perf/core (2014-04-29 08:41:21 +0200)

are available in the git repository at:


  git://git.kernel.org/pub/scm/linux/kernel/git/jolsa/perf.git tags/perf-core-for-mingo

for you to fetch changes up to 8ab596afb97bc9e2f9140dc1d993e81749acff42:

  perf tools ARM64: Wire up perf_regs and unwind support (2014-04-29 15:31:19 +0200)

----------------------------------------------------------------
perf/core improvements and fixes:

. Wire up perf_regs and unwind support for ARM64 (Jean Pihet)

. Move u64_swap union to its single user's header, evsel.h (Borislav Petkov)

. Fix for s390 to properly parse tracepoints plus test code (Alexander Yarygin)

. Handle EINTR error for readn/writen (Namhyung Kim)

Signed-off-by: Jiri Olsa <jolsa@kernel.org>

----------------------------------------------------------------
Alexander Yarygin (3):
      perf tools: Parse tracepoints with '-' in system name
      perf tests: Add numeric identifier to evlist_test
      perf tests: Add a test of kvm-390: trace event

Borislav Petkov (1):
      perf tools: Move u64_swap union

Jean Pihet (1):
      perf tools ARM64: Wire up perf_regs and unwind support

Namhyung Kim (1):
      perf tools: Handle EINTR error for readn/writen

 tools/perf/arch/arm64/Makefile                |   7 ++
 tools/perf/arch/arm64/include/perf_regs.h     |  88 ++++++++++++++++
 tools/perf/arch/arm64/util/dwarf-regs.c       |  80 +++++++++++++++
 tools/perf/arch/arm64/util/unwind-libunwind.c |  82 +++++++++++++++
 tools/perf/config/Makefile                    |   8 +-
 tools/perf/tests/parse-events.c               | 142 ++++++++++++++++++--------
 tools/perf/util/evsel.h                       |   5 +
 tools/perf/util/parse-events.y                |  12 +++
 tools/perf/util/types.h                       |   5 -
 tools/perf/util/util.c                        |   2 +
 10 files changed, 380 insertions(+), 51 deletions(-)
 create mode 100644 tools/perf/arch/arm64/Makefile
 create mode 100644 tools/perf/arch/arm64/include/perf_regs.h
 create mode 100644 tools/perf/arch/arm64/util/dwarf-regs.c
 create mode 100644 tools/perf/arch/arm64/util/unwind-libunwind.c

^ permalink raw reply	[flat|nested] 43+ messages in thread

* [GIT PULL 0/6] perf/core improvements and fixes
@ 2012-10-31 14:48 Arnaldo Carvalho de Melo
  0 siblings, 0 replies; 43+ messages in thread
From: Arnaldo Carvalho de Melo @ 2012-10-31 14:48 UTC (permalink / raw)
  To: Ingo Molnar
  Cc: linux-kernel, Arnaldo Carvalho de Melo, Andi Kleen, Andrew Vagin,
	Borislav Petkov, David Ahern, David Howells, Frederic Weisbecker,
	Irina Tirdea, Jiri Olsa, Joonsoo Kim, Mike Galbraith,
	Namhyung Kim, Namhyung Kim, Paul Mackerras, Pekka Enberg,
	Peter Zijlstra, Peter Zijlstra, Stephane Eranian, Steven Rostedt,
	arnaldo.melo, Arnaldo Carvalho de Melo

Hi Ingo,

	Please consider pulling,

- Arnaldo

The following changes since commit 95d18aa2b6c05351181934b3bc34ce038cc7b637:

  perf/x86: Fix sparse warnings (2012-10-30 10:10:52 +0100)

are available in the git repository at:

  git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux tags/perf-core-for-mingo

for you to fetch changes up to ffadcf090d468e9c4938b718649f38dd10cfdb02:

  perf annotate: Handle XBEGIN like a jump (2012-10-31 12:18:26 -0200)

----------------------------------------------------------------
perf/core improvements and fixes

. Handle XBEGIN like a jump in the annotate browser, from Andi Kleen

. Build speedups for --version, from Ingo Molnar.

. Add info about cross compiling for Android ARM, from Joonsoo Kim.

. Warn about missing libelf, from Namhyung Kim.

Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>

----------------------------------------------------------------
Andi Kleen (1):
      perf annotate: Handle XBEGIN like a jump

Arnaldo Carvalho de Melo (1):
      perf tools: Handle --version string generation on machines without git

Ingo Molnar (2):
      perf tools: Speed up the perf build time by simplifying the perf --version string generation
      perf tools: Further speed up the perf build

Joonsoo Kim (1):
      perf tools: Add info about cross compiling for Android ARM

Namhyung Kim (1):
      perf tools: Warn about missing libelf

 tools/perf/Documentation/android.txt |    5 ++++-
 tools/perf/Makefile                  |    2 ++
 tools/perf/util/PERF-VERSION-GEN     |   14 +++++---------
 tools/perf/util/annotate.c           |    2 ++
 4 files changed, 13 insertions(+), 10 deletions(-)

^ permalink raw reply	[flat|nested] 43+ messages in thread

* Re: [GIT PULL 0/6] perf/core improvements and fixes
  2011-04-20 16:23 Arnaldo Carvalho de Melo
@ 2011-04-21  8:58 ` Ingo Molnar
  0 siblings, 0 replies; 43+ messages in thread
From: Ingo Molnar @ 2011-04-21  8:58 UTC (permalink / raw)
  To: Arnaldo Carvalho de Melo
  Cc: linux-kernel, Arun Sharma, Frederic Weisbecker, Michael Witten,
	Mike Galbraith, Paul Mackerras, Peter Zijlstra,
	Raghavendra D Prabhu, Stephane Eranian, Thomas Gleixner,
	Tom Zanussi, Arnaldo Carvalho de Melo


* Arnaldo Carvalho de Melo <acme@infradead.org> wrote:

> Hi Ingo,
> 
>         Please consider pulling from:
> 
> git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux-2.6 perf/core
> 
> Regards,
> 
> - Arnaldo
> 
> Arnaldo Carvalho de Melo (1):
>   perf symbols: Give more useful names to 'self' parameters
> 
> Arun Sharma (1):
>   perf script: Add support for PERF_TYPE_RAW
> 
> Michael Witten (4):
>   perf tools: Makefile: Clean up `python/perf.so' rule
>   perf tools: Makefile: PYTHON{,_CONFIG} to bandage Python 3 incompatibility
>   perf tools: Move `try-cc'
>   perf tools: git mv tools/perf/{features-tests.mak,config/}
> 
>  tools/perf/Makefile                 |  107 +++++--
>  tools/perf/builtin-script.c         |   14 +-
>  tools/perf/config/feature-tests.mak |  128 +++++++
>  tools/perf/config/utilities.mak     |  188 +++++++++++
>  tools/perf/feature-tests.mak        |  130 -------
>  tools/perf/util/symbol.c            |  629 ++++++++++++++++++-----------------
>  tools/perf/util/symbol.h            |   78 +++--
>  7 files changed, 771 insertions(+), 503 deletions(-)
>  create mode 100644 tools/perf/config/feature-tests.mak
>  create mode 100644 tools/perf/config/utilities.mak
>  delete mode 100644 tools/perf/feature-tests.mak

Pulled, thanks a lot Arnaldo!

	Ingo

^ permalink raw reply	[flat|nested] 43+ messages in thread

* [GIT PULL 0/6] perf/core improvements and fixes
@ 2011-04-20 16:23 Arnaldo Carvalho de Melo
  2011-04-21  8:58 ` Ingo Molnar
  0 siblings, 1 reply; 43+ messages in thread
From: Arnaldo Carvalho de Melo @ 2011-04-20 16:23 UTC (permalink / raw)
  To: Ingo Molnar
  Cc: linux-kernel, Arnaldo Carvalho de Melo, Arun Sharma,
	Frederic Weisbecker, Michael Witten, Mike Galbraith,
	Paul Mackerras, Peter Zijlstra, Raghavendra D Prabhu,
	Stephane Eranian, Thomas Gleixner, Tom Zanussi,
	Arnaldo Carvalho de Melo

Hi Ingo,

        Please consider pulling from:

git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux-2.6 perf/core

Regards,

- Arnaldo

Arnaldo Carvalho de Melo (1):
  perf symbols: Give more useful names to 'self' parameters

Arun Sharma (1):
  perf script: Add support for PERF_TYPE_RAW

Michael Witten (4):
  perf tools: Makefile: Clean up `python/perf.so' rule
  perf tools: Makefile: PYTHON{,_CONFIG} to bandage Python 3 incompatibility
  perf tools: Move `try-cc'
  perf tools: git mv tools/perf/{features-tests.mak,config/}

 tools/perf/Makefile                 |  107 +++++--
 tools/perf/builtin-script.c         |   14 +-
 tools/perf/config/feature-tests.mak |  128 +++++++
 tools/perf/config/utilities.mak     |  188 +++++++++++
 tools/perf/feature-tests.mak        |  130 -------
 tools/perf/util/symbol.c            |  629 ++++++++++++++++++-----------------
 tools/perf/util/symbol.h            |   78 +++--
 7 files changed, 771 insertions(+), 503 deletions(-)
 create mode 100644 tools/perf/config/feature-tests.mak
 create mode 100644 tools/perf/config/utilities.mak
 delete mode 100644 tools/perf/feature-tests.mak


^ permalink raw reply	[flat|nested] 43+ messages in thread

* Re: [GIT PULL 0/6] perf/core improvements and fixes
  2011-02-16 22:54 Arnaldo Carvalho de Melo
@ 2011-02-17 13:50 ` Ingo Molnar
  0 siblings, 0 replies; 43+ messages in thread
From: Ingo Molnar @ 2011-02-17 13:50 UTC (permalink / raw)
  To: Arnaldo Carvalho de Melo
  Cc: linux-kernel, 2nddept-manager, Franck Bui-Huu,
	Frederic Weisbecker, Jesse Brandeburg, Masami Hiramatsu,
	Mike Galbraith, Paul Mackerras, Peter Zijlstra, Stephane Eranian,
	Tom Zanussi, Arnaldo Carvalho de Melo


* Arnaldo Carvalho de Melo <acme@infradead.org> wrote:

> Hi Ingo,
> 
>         Please consider pulling from:
> 
> git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux-2.6 perf/core
> 
> Regards,
> 
> - Arnaldo
> 
> Arnaldo Carvalho de Melo (3):
>   perf ui: Serialize screen updates
>   perf annotate: Check if offset is less than symbol size
>   perf annotate browser: Use the percent color for the whole line
> 
> Jesse Brandeburg (1):
>   perf tools: Update Makefile with some help
> 
> Masami Hiramatsu (2):
>   perf probe: Support function@filename syntax for --line
>   perf probe: Show filename which contains target function
> 
>  tools/perf/Documentation/Makefile       |   19 ++++++++++---------
>  tools/perf/Documentation/perf-probe.txt |    7 ++++---
>  tools/perf/Makefile                     |   31 +++++++++++++++++++++++++++++++
>  tools/perf/util/annotate.c              |    3 ++-
>  tools/perf/util/probe-event.c           |   17 +++++++++++++----
>  tools/perf/util/ui/browser.c            |    7 +++++++
>  tools/perf/util/ui/browsers/annotate.c  |    5 +++--
>  tools/perf/util/ui/helpline.c           |    5 ++++-
>  tools/perf/util/ui/setup.c              |    3 +++
>  tools/perf/util/ui/ui.h                 |    8 ++++++++
>  10 files changed, 85 insertions(+), 20 deletions(-)
>  create mode 100644 tools/perf/util/ui/ui.h

Pulled, thanks a lot Arnaldo!

	Ingo

^ permalink raw reply	[flat|nested] 43+ messages in thread

* [GIT PULL 0/6] perf/core improvements and fixes
@ 2011-02-16 22:54 Arnaldo Carvalho de Melo
  2011-02-17 13:50 ` Ingo Molnar
  0 siblings, 1 reply; 43+ messages in thread
From: Arnaldo Carvalho de Melo @ 2011-02-16 22:54 UTC (permalink / raw)
  To: Ingo Molnar
  Cc: linux-kernel, Arnaldo Carvalho de Melo, 2nddept-manager,
	Franck Bui-Huu, Frederic Weisbecker, Ingo Molnar,
	Jesse Brandeburg, Masami Hiramatsu, Mike Galbraith,
	Paul Mackerras, Peter Zijlstra, Stephane Eranian, Tom Zanussi,
	Arnaldo Carvalho de Melo

Hi Ingo,

        Please consider pulling from:

git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux-2.6 perf/core

Regards,

- Arnaldo

Arnaldo Carvalho de Melo (3):
  perf ui: Serialize screen updates
  perf annotate: Check if offset is less than symbol size
  perf annotate browser: Use the percent color for the whole line

Jesse Brandeburg (1):
  perf tools: Update Makefile with some help

Masami Hiramatsu (2):
  perf probe: Support function@filename syntax for --line
  perf probe: Show filename which contains target function

 tools/perf/Documentation/Makefile       |   19 ++++++++++---------
 tools/perf/Documentation/perf-probe.txt |    7 ++++---
 tools/perf/Makefile                     |   31 +++++++++++++++++++++++++++++++
 tools/perf/util/annotate.c              |    3 ++-
 tools/perf/util/probe-event.c           |   17 +++++++++++++----
 tools/perf/util/ui/browser.c            |    7 +++++++
 tools/perf/util/ui/browsers/annotate.c  |    5 +++--
 tools/perf/util/ui/helpline.c           |    5 ++++-
 tools/perf/util/ui/setup.c              |    3 +++
 tools/perf/util/ui/ui.h                 |    8 ++++++++
 10 files changed, 85 insertions(+), 20 deletions(-)
 create mode 100644 tools/perf/util/ui/ui.h


^ permalink raw reply	[flat|nested] 43+ messages in thread

end of thread, other threads:[~2017-03-16 16:30 UTC | newest]

Thread overview: 43+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2015-06-03 22:40 [GIT PULL 0/6] perf/core improvements and fixes Arnaldo Carvalho de Melo
2015-06-03 22:40 ` [PATCH 1/6] perf probe: Fix segfault when glob matching function without debuginfo Arnaldo Carvalho de Melo
2015-06-03 22:40 ` [PATCH 2/6] perf tools: Remove newline char when reading event scale and unit Arnaldo Carvalho de Melo
2015-06-03 22:40 ` [PATCH 3/6] perf machine: Fix the search for the kernel DSO on the unified list Arnaldo Carvalho de Melo
2015-06-03 22:40 ` [PATCH 4/6] perf tools: Move linux/kernel.h to tools/include Arnaldo Carvalho de Melo
2015-06-03 22:40 ` [PATCH 5/6] tools: Move tools/perf/util/include/linux/{list.h,poison.h} " Arnaldo Carvalho de Melo
2015-06-03 22:40 ` [PATCH 6/6] perf tools: Deal with kernel module names in '[]' correctly Arnaldo Carvalho de Melo
2015-06-04  5:48 ` [GIT PULL 0/6] perf/core improvements and fixes Ingo Molnar
2015-06-04  6:07   ` Wangnan (F)
2015-06-04  7:21     ` Ingo Molnar
2015-06-04 10:00       ` Wangnan (F)
2015-06-04 12:40         ` Ingo Molnar
2015-06-04 12:58           ` pi3orama
2015-06-04 14:04             ` Ingo Molnar
2015-06-04 16:22               ` Alexei Starovoitov
2015-06-04 21:48                 ` Masami Hiramatsu
2015-06-04 22:07                   ` Alexei Starovoitov
2015-06-05  6:41                 ` Ingo Molnar
2015-06-05  8:53                   ` Wangnan (F)
2015-06-05 12:05                     ` Ingo Molnar
2015-06-05 14:06                       ` Arnaldo Carvalho de Melo
2015-06-07 13:11                         ` Ingo Molnar
2015-06-05 13:59                     ` Arnaldo Carvalho de Melo
2015-06-04 10:17     ` [EXPERIENCE] My experience on using perf record BPF filter on a real usecase Wangnan (F)
2015-06-10  6:42       ` Alexei Starovoitov
2015-06-10  6:48         ` Wangnan (F)
  -- strict thread matches above, loose matches on Subject: below --
2017-03-16 16:09 [GIT PULL 0/6] perf/core improvements and fixes Arnaldo Carvalho de Melo
2017-03-16 16:30 ` Ingo Molnar
2015-08-31 21:36 Arnaldo Carvalho de Melo
2015-09-01  8:26 ` Ingo Molnar
2015-03-31 20:59 Arnaldo Carvalho de Melo
2015-04-01  7:56 ` Ingo Molnar
2015-03-26 15:40 Arnaldo Carvalho de Melo
2015-03-27  7:37 ` Ingo Molnar
2014-05-12  9:27 Jiri Olsa
2014-05-12 15:59 ` Ingo Molnar
2014-04-30 10:11 Jiri Olsa
2014-05-01  6:26 ` Ingo Molnar
2012-10-31 14:48 Arnaldo Carvalho de Melo
2011-04-20 16:23 Arnaldo Carvalho de Melo
2011-04-21  8:58 ` Ingo Molnar
2011-02-16 22:54 Arnaldo Carvalho de Melo
2011-02-17 13:50 ` Ingo Molnar

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.