All of lore.kernel.org
 help / color / mirror / Atom feed
From: Frederic Weisbecker <fweisbec@gmail.com>
To: Arnaldo Carvalho de Melo <acme@ghostprotocols.net>,
	Andi Kleen <ak@linux.intel.com>
Cc: Ingo Molnar <mingo@kernel.org>,
	linux-kernel@vger.kernel.org,
	Arnaldo Carvalho de Melo <acme@redhat.com>
Subject: Re: [PATCH 04/11] perf tools: Support callchain sorting based on addresses
Date: Mon, 22 Jul 2013 22:33:12 +0200	[thread overview]
Message-ID: <20130722203310.GC21711@somewhere> (raw)
In-Reply-To: <1374524559-9839-5-git-send-email-acme@infradead.org>

On Mon, Jul 22, 2013 at 05:22:32PM -0300, Arnaldo Carvalho de Melo wrote:
> From: Andi Kleen <ak@linux.intel.com>
> 
> With programs with very large functions it can be useful to distinguish
> the callgraph nodes on more than just function names. So for example if
> you have multiple calls to the same function, it ends up being separate
> nodes in the chain.
> 
> This patch adds a new key field to the callgraph options, that allows
> comparing nodes on functions (as today, default) and addresses.

Looks useful but you probably don't want the leaf node (inner most callee)
to be the precise ip/adress rather instead of the function. Otherwise you'll
have way too much branch per hist entry.

Also how do you display callchains in this mode? It seems that it uses
the usual function names. How do you dinstinguish the various branches
in this case?

It would be probably nice to display the nodes as func+offset in this mode?

> 
> Longer term it would be nice to also handle src lines, but that would
> need more changes and address is a reasonable proxy for it today.
> 
> I right now reference the global params, as there was no simple way to
> register a params pointer.
> 
> Signed-off-by: Andi Kleen <ak@linux.intel.com>
> Cc: Frederic Weisbecker <fweisbec@gmail.com>
> Link: http://lkml.kernel.org/n/tip-0uskktybf0e7wrnoi5e9b9it@git.kernel.org
> Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
> ---
>  tools/perf/Documentation/perf-report.txt |  8 ++++++--
>  tools/perf/builtin-report.c              | 19 +++++++++++++++----
>  tools/perf/util/callchain.c              |  7 +++++--
>  tools/perf/util/callchain.h              |  6 ++++++
>  tools/perf/util/hist.c                   |  3 ++-
>  5 files changed, 34 insertions(+), 9 deletions(-)
> 
> diff --git a/tools/perf/Documentation/perf-report.txt b/tools/perf/Documentation/perf-report.txt
> index 747ff50..2b8097e 100644
> --- a/tools/perf/Documentation/perf-report.txt
> +++ b/tools/perf/Documentation/perf-report.txt
> @@ -115,7 +115,7 @@ OPTIONS
>  --dump-raw-trace::
>          Dump raw trace in ASCII.
>  
> --g [type,min[,limit],order]::
> +-g [type,min[,limit],order[,key]]::
>  --call-graph::
>          Display call chains using type, min percent threshold, optional print
>  	limit and order.
> @@ -129,7 +129,11 @@ OPTIONS
>  	- callee: callee based call graph.
>  	- caller: inverted caller based call graph.
>  
> -	Default: fractal,0.5,callee.
> +	key can be:
> +	- function: compare on functions
> +	- address: compare on individual code addresses
> +
> +	Default: fractal,0.5,callee,function.
>  
>  -G::
>  --inverted::
> diff --git a/tools/perf/builtin-report.c b/tools/perf/builtin-report.c
> index a34c587..d785d89 100644
> --- a/tools/perf/builtin-report.c
> +++ b/tools/perf/builtin-report.c
> @@ -667,12 +667,23 @@ parse_callchain_opt(const struct option *opt, const char *arg, int unset)
>  	}
>  
>  	/* get the call chain order */
> -	if (!strcmp(tok2, "caller"))
> +	if (!strncmp(tok2, "caller", strlen("caller")))
>  		callchain_param.order = ORDER_CALLER;
> -	else if (!strcmp(tok2, "callee"))
> +	else if (!strncmp(tok2, "callee", strlen("callee")))
>  		callchain_param.order = ORDER_CALLEE;
>  	else
>  		return -1;
> +
> +	/* Get the sort key */
> +	tok2 = strtok(NULL, ",");
> +	if (!tok2)
> +		goto setup;
> +	if (!strncmp(tok2, "function", strlen("function")))
> +		callchain_param.key = CCKEY_FUNCTION;
> +	else if (!strncmp(tok2, "address", strlen("address")))
> +		callchain_param.key = CCKEY_ADDRESS;
> +	else
> +		return -1;
>  setup:
>  	if (callchain_register_param(&callchain_param) < 0) {
>  		fprintf(stderr, "Can't register callchain params\n");
> @@ -784,8 +795,8 @@ int cmd_report(int argc, const char **argv, const char *prefix __maybe_unused)
>  	OPT_BOOLEAN('x', "exclude-other", &symbol_conf.exclude_other,
>  		    "Only display entries with parent-match"),
>  	OPT_CALLBACK_DEFAULT('g', "call-graph", &report, "output_type,min_percent[,print_limit],call_order",
> -		     "Display callchains using output_type (graph, flat, fractal, or none) , min percent threshold, optional print limit and callchain order. "
> -		     "Default: fractal,0.5,callee", &parse_callchain_opt, callchain_default_opt),
> +		     "Display callchains using output_type (graph, flat, fractal, or none) , min percent threshold, optional print limit, callchain order, key (function or address). "
> +		     "Default: fractal,0.5,callee,function", &parse_callchain_opt, callchain_default_opt),
>  	OPT_BOOLEAN('G', "inverted", &report.inverted_callchain,
>  		    "alias for inverted call graph"),
>  	OPT_CALLBACK(0, "ignore-callees", NULL, "regex",
> diff --git a/tools/perf/util/callchain.c b/tools/perf/util/callchain.c
> index 42b6a63..4fee33b 100644
> --- a/tools/perf/util/callchain.c
> +++ b/tools/perf/util/callchain.c
> @@ -15,6 +15,7 @@
>  #include <errno.h>
>  #include <math.h>
>  
> +#include "hist.h"
>  #include "util.h"
>  #include "callchain.h"
>  
> @@ -327,7 +328,8 @@ append_chain(struct callchain_node *root,
>  	/*
>  	 * Lookup in the current node
>  	 * If we have a symbol, then compare the start to match
> -	 * anywhere inside a function.
> +	 * anywhere inside a function, unless function
> +	 * mode is disabled.
>  	 */
>  	list_for_each_entry(cnode, &root->val, list) {
>  		struct callchain_cursor_node *node;
> @@ -339,7 +341,8 @@ append_chain(struct callchain_node *root,
>  
>  		sym = node->sym;
>  
> -		if (cnode->ms.sym && sym) {
> +		if (cnode->ms.sym && sym &&
> +		    callchain_param.key == CCKEY_FUNCTION) {
>  			if (cnode->ms.sym->start != sym->start)
>  				break;
>  		} else if (cnode->ip != node->ip)
> diff --git a/tools/perf/util/callchain.h b/tools/perf/util/callchain.h
> index 3ee9f67..812d5a0 100644
> --- a/tools/perf/util/callchain.h
> +++ b/tools/perf/util/callchain.h
> @@ -41,12 +41,18 @@ struct callchain_param;
>  typedef void (*sort_chain_func_t)(struct rb_root *, struct callchain_root *,
>  				 u64, struct callchain_param *);
>  
> +enum chain_key {
> +	CCKEY_FUNCTION,
> +	CCKEY_ADDRESS
> +};
> +
>  struct callchain_param {
>  	enum chain_mode 	mode;
>  	u32			print_limit;
>  	double			min_percent;
>  	sort_chain_func_t	sort;
>  	enum chain_order	order;
> +	enum chain_key		key;
>  };
>  
>  struct callchain_list {
> diff --git a/tools/perf/util/hist.c b/tools/perf/util/hist.c
> index a9dd1b9..46a0d35 100644
> --- a/tools/perf/util/hist.c
> +++ b/tools/perf/util/hist.c
> @@ -24,7 +24,8 @@ enum hist_filter {
>  struct callchain_param	callchain_param = {
>  	.mode	= CHAIN_GRAPH_REL,
>  	.min_percent = 0.5,
> -	.order  = ORDER_CALLEE
> +	.order  = ORDER_CALLEE,
> +	.key	= CCKEY_FUNCTION
>  };
>  
>  u16 hists__col_len(struct hists *hists, enum hist_column col)
> -- 
> 1.8.1.4
> 

  reply	other threads:[~2013-07-22 20:33 UTC|newest]

Thread overview: 14+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2013-07-22 20:22 [GIT PULL 00/11] perf/core improvements and fixes Arnaldo Carvalho de Melo
2013-07-22 20:22 ` [PATCH 01/11] perf script: Fix named threads support Arnaldo Carvalho de Melo
2013-07-22 20:22 ` [PATCH 02/11] perf evsel: Handle ENODEV on default cycles event Arnaldo Carvalho de Melo
2013-07-22 20:22 ` [PATCH 03/11] perf bench: Fix memcpy benchmark for large sizes Arnaldo Carvalho de Melo
2013-07-22 20:22 ` [PATCH 04/11] perf tools: Support callchain sorting based on addresses Arnaldo Carvalho de Melo
2013-07-22 20:33   ` Frederic Weisbecker [this message]
2013-07-22 20:22 ` [PATCH 05/11] perf tools: Fix build with perl 5.18 Arnaldo Carvalho de Melo
2013-07-22 20:22 ` [PATCH 06/11] perf tests: Run ctags/cscope make tests only with needed binaries Arnaldo Carvalho de Melo
2013-07-22 20:22 ` [PATCH 07/11] perf tests: Rename TMP to TMP_O tests/make variable Arnaldo Carvalho de Melo
2013-07-22 20:22 ` [PATCH 08/11] perf tests: Add DESTDIR=TMP_DEST " Arnaldo Carvalho de Melo
2013-07-22 20:22 ` [PATCH 09/11] perf tests: Add 'make install/install-bin' tests into tests/make Arnaldo Carvalho de Melo
2013-07-22 20:22 ` [PATCH 10/11] perf tests: Add broken install-* " Arnaldo Carvalho de Melo
2013-07-22 20:22 ` [PATCH 11/11] perf tools: Move weight back to common sort keys Arnaldo Carvalho de Melo
2013-07-23  7:38 ` [GIT PULL 00/11] perf/core improvements and fixes Ingo Molnar

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20130722203310.GC21711@somewhere \
    --to=fweisbec@gmail.com \
    --cc=acme@ghostprotocols.net \
    --cc=acme@redhat.com \
    --cc=ak@linux.intel.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=mingo@kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.