linux-kernel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* `perf report` about 1000x(!) slower in linux 4.15
@ 2018-03-20 11:57 Jan-Oliver Kaiser
  2018-03-20 13:38 ` Arnaldo Carvalho de Melo
  0 siblings, 1 reply; 3+ messages in thread
From: Jan-Oliver Kaiser @ 2018-03-20 11:57 UTC (permalink / raw)
  To: Peter Zijlstra, Ingo Molnar, Arnaldo Carvalho de Melo
  Cc: linux-kernel, milian.wolff

Dear perf maintainers,

After upgrading my system to linux 4.15 (from 4.14), `perf report` 
became unusably slow. I estimate a decrease in performance by a factor 
of 100x-1000x. Some 21M perf.data files take about 30 seconds in the 
"Processing events" step.  `git bisect` points to

commit d8a88dd243a170a226aba33e7c53704db2f82aa6 (HEAD, refs/bisect/bad)
Author: Milian Wolff <milian.wolff@kdab.com>
     perf util: Enable handling of inlined frames by default

The slowdown can be worked around with `--no-inline`. If the slowdown is 
expected, I would suggest reverting the default setting here or maybe 
printing a warning if a lot of time is spent on this feature.

Do you need any additional information about my system or the recorded 
data I am looking at?

Thanks,
Janno

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: `perf report` about 1000x(!) slower in linux 4.15
  2018-03-20 11:57 `perf report` about 1000x(!) slower in linux 4.15 Jan-Oliver Kaiser
@ 2018-03-20 13:38 ` Arnaldo Carvalho de Melo
  2018-03-20 16:54   ` Jan-Oliver Kaiser
  0 siblings, 1 reply; 3+ messages in thread
From: Arnaldo Carvalho de Melo @ 2018-03-20 13:38 UTC (permalink / raw)
  To: Jan-Oliver Kaiser; +Cc: Peter Zijlstra, Ingo Molnar, linux-kernel, milian.wolff

Em Tue, Mar 20, 2018 at 12:57:29PM +0100, Jan-Oliver Kaiser escreveu:
> After upgrading my system to linux 4.15 (from 4.14), `perf report` became
> unusably slow. I estimate a decrease in performance by a factor of
> 100x-1000x. Some 21M perf.data files take about 30 seconds in the
> "Processing events" step.  `git bisect` points to
 
> commit d8a88dd243a170a226aba33e7c53704db2f82aa6 (HEAD, refs/bisect/bad)
> Author: Milian Wolff <milian.wolff@kdab.com>
>     perf util: Enable handling of inlined frames by default
 
> The slowdown can be worked around with `--no-inline`. If the slowdown is
> expected, I would suggest reverting the default setting here or maybe
> printing a warning if a lot of time is spent on this feature.
 
> Do you need any additional information about my system or the recorded data
> I am looking at?

Can you try with the latest perf tool? 

[acme@jouet perf]$ make perf-tarxz-src-pkg ; ls -la perf-4*
  TAR
  PERF_VERSION = 4.16.rc6.gecd380
-rw-rw-r--. 1 acme acme 1323568 Mar 20 10:30 perf-4.16.0-rc6.tar.xz
[acme@jouet perf]$

With a recently checked out kernel sources, or, as a convenience, I'm
pushing this to:

  http://vger.kernel.org/~acme/perf/perf-4.16.0-rc6.tar.xz

You just expand it and then:

[acme@jouet tmp]$ tar xf perf-4.16.0-rc6.tar.xz 
[acme@jouet tmp]$ cd perf-4.16.0-rc6/
[acme@jouet perf-4.16.0-rc6]$ make -C tools/perf install-bin


And check if the problem is present there as well.

If it is, please tell us what is your distro, the output of:

perf report --header-only

Thanks,

- Arnaldo

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: `perf report` about 1000x(!) slower in linux 4.15
  2018-03-20 13:38 ` Arnaldo Carvalho de Melo
@ 2018-03-20 16:54   ` Jan-Oliver Kaiser
  0 siblings, 0 replies; 3+ messages in thread
From: Jan-Oliver Kaiser @ 2018-03-20 16:54 UTC (permalink / raw)
  To: Arnaldo Carvalho de Melo
  Cc: Peter Zijlstra, Ingo Molnar, linux-kernel, milian.wolff

The behavior persists with the most recent head of linux/master 
(1b5f3ba415fe4cf8b8b39c8d104ed44cde330658).

$ ./perf --version
perf version 4.16.rc6.g1b5f3ba4

$ uname -r
4.15.9-towo.1-siduction-amd64

(This is a debian unstable variant.)

$ ./perf report --header-only -i <my_perf.data>
# ========
# captured on: Fri Mar 16 18:14:05 2018
# hostname : blackbox
# os release : 4.15.9-towo.1-siduction-amd64
# perf version : 4.15.4
# arch : x86_64
# nrcpus online : 4
# nrcpus avail : 4
# cpudesc : Intel(R) Core(TM) i7-5500U CPU @ 2.40GHz
# cpuid : GenuineIntel,6,61,4
# total memory : 16343572 kB
# cmdline : /usr/bin/perf_4.15 record -F 99 --call-graph dwarf -- coqc 
-q -I /home/janno/.opam/iris-mtac2/lib/coq//user-contrib/Unicoq -I src 
-Q tests Mtac2Tests -R theories Mtac2 timings/decapp_vs_mmatch.v
# event : name = cycles:uppp, , size = 112, { sample_period, sample_freq 
} = 99, sample_type = 
IP|TID|TIME|ADDR|CALLCHAIN|PERIOD|REGS_USER|STACK_USER|DATA_SRC, 
disabled = 1, inherit = 1, exclude_kernel = 1, mma$
# CPU_TOPOLOGY info available, use -I to display
# NUMA_TOPOLOGY info available, use -I to display
# pmu mappings: intel_pt = 6, uncore_arb = 11, cstate_pkg = 14, 
breakpoint = 5, uncore_cbox_1 = 10, power = 12, cpu = 4, software = 1, 
uncore_imc = 8, uncore_cbox_0 = 9, cstate_core = 13, msr = 7
# CACHE info available, use -I to display
# missing features: TRACING_DATA BRANCH_STACK GROUP_DESC AUXTRACE STAT 
SAMPLE_TIME
# ========
#

Best,
Janno

On 03/20/2018 02:38 PM, Arnaldo Carvalho de Melo wrote:
> Em Tue, Mar 20, 2018 at 12:57:29PM +0100, Jan-Oliver Kaiser escreveu:
>> After upgrading my system to linux 4.15 (from 4.14), `perf report` became
>> unusably slow. I estimate a decrease in performance by a factor of
>> 100x-1000x. Some 21M perf.data files take about 30 seconds in the
>> "Processing events" step.  `git bisect` points to
>   
>> commit d8a88dd243a170a226aba33e7c53704db2f82aa6 (HEAD, refs/bisect/bad)
>> Author: Milian Wolff <milian.wolff@kdab.com>
>>      perf util: Enable handling of inlined frames by default
>   
>> The slowdown can be worked around with `--no-inline`. If the slowdown is
>> expected, I would suggest reverting the default setting here or maybe
>> printing a warning if a lot of time is spent on this feature.
>   
>> Do you need any additional information about my system or the recorded data
>> I am looking at?
> 
> Can you try with the latest perf tool?
> 
> [acme@jouet perf]$ make perf-tarxz-src-pkg ; ls -la perf-4*
>    TAR
>    PERF_VERSION = 4.16.rc6.gecd380
> -rw-rw-r--. 1 acme acme 1323568 Mar 20 10:30 perf-4.16.0-rc6.tar.xz
> [acme@jouet perf]$
> 
> With a recently checked out kernel sources, or, as a convenience, I'm
> pushing this to:
> 
>    http://vger.kernel.org/~acme/perf/perf-4.16.0-rc6.tar.xz
> 
> You just expand it and then:
> 
> [acme@jouet tmp]$ tar xf perf-4.16.0-rc6.tar.xz
> [acme@jouet tmp]$ cd perf-4.16.0-rc6/
> [acme@jouet perf-4.16.0-rc6]$ make -C tools/perf install-bin
> 
> 
> And check if the problem is present there as well.
> 
> If it is, please tell us what is your distro, the output of:
> 
> perf report --header-only
> 
> Thanks,
> 
> - Arnaldo
> 

^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2018-03-20 16:54 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2018-03-20 11:57 `perf report` about 1000x(!) slower in linux 4.15 Jan-Oliver Kaiser
2018-03-20 13:38 ` Arnaldo Carvalho de Melo
2018-03-20 16:54   ` Jan-Oliver Kaiser

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).