* [GIT PULL] perf tools changes for v5.19: 3rd batch
@ 2022-06-03 22:31 Arnaldo Carvalho de Melo
From: Arnaldo Carvalho de Melo @ 2022-06-03 22:31 UTC (permalink / raw)
To: Linus Torvalds
Cc: Ingo Molnar, Thomas Gleixner, Jiri Olsa, Namhyung Kim,
Clark Williams, Kate Carcia, linux-kernel, linux-perf-users,
Arnaldo Carvalho de Melo, Fangrui Song, German Gomez, Ian Rogers,
Kevin Nomura, Leo Yan, Sebastian Ullrich, Thomas Richter,
Zhengjun Xing, Arnaldo Carvalho de Melo
Hi Linus,
Please consider pulling,
Best regards,
- Arnaldo
Reduced set of tests at the end of this message.
The following changes since commit 9be4cbd09da820a20d400670a45fc1571f6a13b8:
driver core: Set default deferred_probe_timeout back to 0. (2022-06-03 11:58:54 -0700)
are available in the Git repository at:
git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux.git tags/perf-tools-for-v5.19-2022-06-04
for you to fetch changes up to 1bcca2b1bd67f3c0e5c3a88ed16c6389f01a5b31:
perf vendor events intel: Update metrics for Alderlake (2022-06-03 21:45:32 +0200)
----------------------------------------------------------------
perf tools changes for v5.19: 3rd batch
- Synthesize task events for pre-existing threads when using 'perf lock --threads',
as we need to show task names.
- Fix unwinding with ld.lld (>= version 10.0) linked objects, where
.eh_frame_hdr and .text are in different PT_LOAD program headers, which makes
'perf record --call-graph dwarf' fail with such objects.
- Check if 'perf record' hangs in the Arm SPE (Statistical Profiling Extension)
'perf test' entry when recording a workload with forks.
- Trace physical address for Arm SPE events, needed for 'perf c2c' to locate
the memory node for samples.
- Fix sorting in percent_rmt_hitm_cmp() in 'perf c2c'.
- Further support for Intel hybrid systems in the evlist and 'perf record' code.
- Update IBM s/390 vendor event JSON tables.
- Add metrics (JSON) for Intel Sapphirerapids.
- Update metrics for Intel Alderlake.
- Correct typo of sysfs 'event_source' directory in the documentation.
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
----------------------------------------------------------------
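To illustrate the ld.lld item above: when .eh_frame_hdr and .text sit in different PT_LOAD program headers, the address-to-file-offset delta of the text segment cannot be reused for .eh_frame_hdr. The following is a minimal Python sketch of that idea only, not the code in tools/perf/util/unwind-libunwind-local.c. The LoadSegment type, the helper names and the segment layout are all made up for the example.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class LoadSegment:
    """Hypothetical, simplified view of one PT_LOAD program header."""
    p_offset: int  # file offset where the segment starts
    p_vaddr: int   # virtual address where the segment starts
    p_filesz: int  # number of bytes of the segment backed by the file


def segment_for_vaddr(segments: Sequence[LoadSegment],
                      vaddr: int) -> Optional[LoadSegment]:
    """Return the PT_LOAD segment whose file-backed range contains vaddr."""
    for seg in segments:
        if seg.p_vaddr <= vaddr < seg.p_vaddr + seg.p_filesz:
            return seg
    return None


def vaddr_to_file_offset(segments: Sequence[LoadSegment],
                         vaddr: int) -> Optional[int]:
    """Translate vaddr using the segment that actually contains it."""
    seg = segment_for_vaddr(segments, vaddr)
    if seg is None:
        return None
    return vaddr - seg.p_vaddr + seg.p_offset


# Made-up layout: a read-only segment holding .eh_frame_hdr, followed by
# a separate executable segment holding .text.
segments = [
    LoadSegment(p_offset=0x0000, p_vaddr=0x200000, p_filesz=0x0800),
    LoadSegment(p_offset=0x0800, p_vaddr=0x201800, p_filesz=0x1000),
]
eh_frame_hdr_vaddr = 0x200700  # assumed address inside the first segment
text_vaddr = 0x201800          # assumed address inside the second segment

same_segment = (segment_for_vaddr(segments, eh_frame_hdr_vaddr)
                is segment_for_vaddr(segments, text_vaddr))

# Reusing the text segment's delta for .eh_frame_hdr gives a wrong offset.
text_seg = segment_for_vaddr(segments, text_vaddr)
naive_offset = eh_frame_hdr_vaddr - text_seg.p_vaddr + text_seg.p_offset
correct_offset = vaddr_to_file_offset(segments, eh_frame_hdr_vaddr)
```

With this layout the two sections land in different segments, so the per-segment lookup and the naive calculation disagree.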
Fangrui Song (1):
perf unwind: Fix segbase for ld.lld linked objects
German Gomez (1):
perf test arm-spe: Check if perf-record hangs when recording workload with forks
Ian Rogers (1):
perf docs: Correct typo of event_sources
Leo Yan (2):
perf mem: Trace physical address for Arm SPE events
perf c2c: Fix sorting in percent_rmt_hitm_cmp()
Namhyung Kim (1):
perf lock: Change to synthesize task events
Thomas Richter (7):
perf list: Add IBM z16 event description for s390
perf list: Update event description for IBM z10 to latest level
perf list: Update event description for IBM z13 to latest level
perf list: Update event description for IBM z14 to latest level
perf list: Update event description for IBM z15 to latest level
perf list: Update event description for IBM z196/z114 to latest level
perf list: Update event description for IBM zEC12/zBC12 to latest level
Zhengjun Xing (4):
perf evlist: Extend arch_evsel__must_be_in_group to support hybrid systems
perf record: Support sample-read topdown metric group for hybrid platforms
perf vendor events intel: Add metrics for Sapphirerapids
perf vendor events intel: Update metrics for Alderlake
tools/perf/Documentation/perf-record.txt | 2 +-
tools/perf/Documentation/perf-stat.txt | 2 +-
tools/perf/Documentation/perf-top.txt | 2 +-
tools/perf/arch/arm64/util/mem-events.c | 6 +-
tools/perf/arch/x86/util/evsel.c | 5 +-
tools/perf/arch/x86/util/evsel.h | 7 +
tools/perf/arch/x86/util/topdown.c | 21 +-
tools/perf/builtin-c2c.c | 4 +-
tools/perf/builtin-lock.c | 2 +-
tools/perf/pmu-events/arch/s390/cf_z10/basic.json | 48 +-
tools/perf/pmu-events/arch/s390/cf_z10/crypto.json | 64 +--
.../perf/pmu-events/arch/s390/cf_z10/extended.json | 36 +-
tools/perf/pmu-events/arch/s390/cf_z13/basic.json | 48 +-
tools/perf/pmu-events/arch/s390/cf_z13/crypto.json | 64 +--
.../perf/pmu-events/arch/s390/cf_z13/extended.json | 100 ++--
tools/perf/pmu-events/arch/s390/cf_z14/basic.json | 32 +-
tools/perf/pmu-events/arch/s390/cf_z14/crypto.json | 64 +--
.../perf/pmu-events/arch/s390/cf_z14/extended.json | 102 ++--
tools/perf/pmu-events/arch/s390/cf_z15/basic.json | 32 +-
tools/perf/pmu-events/arch/s390/cf_z15/crypto.json | 114 -----
.../perf/pmu-events/arch/s390/cf_z15/crypto6.json | 112 +++++
.../perf/pmu-events/arch/s390/cf_z15/extended.json | 108 ++---
tools/perf/pmu-events/arch/s390/cf_z16/basic.json | 58 +++
.../perf/pmu-events/arch/s390/cf_z16/crypto6.json | 142 ++++++
.../perf/pmu-events/arch/s390/cf_z16/extended.json | 492 +++++++++++++++++++
.../pmu-events/arch/s390/cf_z16/transaction.json | 7 +
tools/perf/pmu-events/arch/s390/cf_z196/basic.json | 48 +-
.../perf/pmu-events/arch/s390/cf_z196/crypto.json | 64 +--
.../pmu-events/arch/s390/cf_z196/extended.json | 44 +-
.../perf/pmu-events/arch/s390/cf_zec12/basic.json | 48 +-
.../perf/pmu-events/arch/s390/cf_zec12/crypto.json | 64 +--
.../pmu-events/arch/s390/cf_zec12/extended.json | 66 +--
tools/perf/pmu-events/arch/s390/mapfile.csv | 1 +
.../pmu-events/arch/x86/alderlake/adl-metrics.json | 163 +++++--
.../arch/x86/sapphirerapids/spr-metrics.json | 530 +++++++++++++++++++++
tools/perf/tests/shell/test_arm_spe_fork.sh | 92 ++++
tools/perf/util/dso.h | 2 +
tools/perf/util/unwind-libunwind-local.c | 105 ++--
38 files changed, 2163 insertions(+), 738 deletions(-)
create mode 100644 tools/perf/arch/x86/util/evsel.h
delete mode 100644 tools/perf/pmu-events/arch/s390/cf_z15/crypto.json
create mode 100644 tools/perf/pmu-events/arch/s390/cf_z16/basic.json
create mode 100644 tools/perf/pmu-events/arch/s390/cf_z16/crypto6.json
create mode 100644 tools/perf/pmu-events/arch/s390/cf_z16/extended.json
create mode 100644 tools/perf/pmu-events/arch/s390/cf_z16/transaction.json
create mode 100644 tools/perf/pmu-events/arch/x86/sapphirerapids/spr-metrics.json
create mode 100755 tools/perf/tests/shell/test_arm_spe_fork.sh
Test results:
The container-based builds will return once I get back home.
The 'perf test' one will perform a variety of tests exercising
tools/perf/util/, tools/lib/{bpf,traceevent,etc}, as well as run perf commands
with a variety of command line event specifications to then intercept the
sys_perf_event_open syscall to check that the perf_event_attr fields are set up as
expected, among a variety of other unit tests.
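The attribute check described above can be sketched roughly as follows. This is a simplified Python illustration, not the actual test code: the expectation format (where '*' accepts any value and 'a|b' lists alternatives), the function names and the sample field values are assumptions made for the example.

```python
def field_matches(expected: str, actual: str) -> bool:
    """Compare one captured field against its expectation.

    Assumed format: '*' accepts anything, 'a|b' accepts either value.
    """
    if expected == "*":
        return True
    return actual in expected.split("|")


def mismatched_fields(expected: dict, actual: dict) -> list:
    """Return the sorted names of fields that are missing or do not match."""
    return sorted(
        name
        for name, want in expected.items()
        if name not in actual or not field_matches(want, actual[name])
    )


# Hypothetical expectation for one event and the values captured when the
# tool opened it. Only a few perf_event_attr fields are shown.
expected = {"type": "0", "config": "0", "sample_period": "*", "disabled": "0|1"}
captured = {"type": "0", "config": "1", "sample_period": "4000", "disabled": "1"}

bad = mismatched_fields(expected, captured)
```

Here only 'config' differs from what was expected, so it is the single field reported.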
Then there are the 'make -C tools/perf build-test' ones, which build tools/perf/
with a variety of feature sets, exercising the build with an incomplete set of
features as well as with a complete one.
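Each build-test variant is reported in the log as '<target>: make <arguments>'. A small Python sketch, assuming only that line format, can split such a line into the target name, the make variables that select the feature set, and the make goals. The function name is hypothetical and the sample lines are copied from the run in this message.

```python
def parse_build_test_line(line: str):
    """Split '<target>: make <args...>' into (target, variables, goals)."""
    target, _, command = line.partition(":")
    words = command.split()
    if not words or words[0] != "make":
        raise ValueError(f"not a build-test line: {line!r}")
    variables, goals = {}, []
    for word in words[1:]:
        name, sep, value = word.partition("=")
        if sep:
            # VAR=value assignments such as NO_LIBELF=1 or prefix=/tmp/krava
            variables[name] = value
        elif not word.startswith("-"):
            # Anything else that is not an option (e.g. -j8) is a make goal
            goals.append(word)
    return target.strip(), variables, goals


lines = [
    "make_no_ui_O: make NO_NEWT=1 NO_SLANG=1 NO_GTK2=1",
    "make_install_prefix_O: make install prefix=/tmp/krava",
    "make_pure_O: make",
]
parsed = [parse_build_test_line(line) for line in lines]
```

A variant with no variables, such as make_pure_O, is the build with the complete feature set that was detected.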
$ uname -a
Linux quaco 5.17.11-300.fc36.x86_64 #1 SMP PREEMPT Wed May 25 15:04:05 UTC 2022 x86_64 x86_64 x86_64 GNU/Linux
$ git log --oneline -1
1bcca2b1bd67f3c0 (HEAD -> perf/core) perf vendor events intel: Update metrics for Alderlake
$ perf -v
perf version 5.18.g1bcca2b1bd67
$ sudo su -
# perf -vv
perf version 5.18.g1bcca2b1bd67
dwarf: [ on ] # HAVE_DWARF_SUPPORT
dwarf_getlocations: [ on ] # HAVE_DWARF_GETLOCATIONS_SUPPORT
glibc: [ on ] # HAVE_GLIBC_SUPPORT
syscall_table: [ on ] # HAVE_SYSCALL_TABLE_SUPPORT
libbfd: [ on ] # HAVE_LIBBFD_SUPPORT
debuginfod: [ on ] # HAVE_DEBUGINFOD_SUPPORT
libelf: [ on ] # HAVE_LIBELF_SUPPORT
libnuma: [ on ] # HAVE_LIBNUMA_SUPPORT
numa_num_possible_cpus: [ on ] # HAVE_LIBNUMA_SUPPORT
libperl: [ on ] # HAVE_LIBPERL_SUPPORT
libpython: [ on ] # HAVE_LIBPYTHON_SUPPORT
libslang: [ on ] # HAVE_SLANG_SUPPORT
libcrypto: [ OFF ] # HAVE_LIBCRYPTO_SUPPORT
libunwind: [ on ] # HAVE_LIBUNWIND_SUPPORT
libdw-dwarf-unwind: [ on ] # HAVE_DWARF_SUPPORT
zlib: [ on ] # HAVE_ZLIB_SUPPORT
lzma: [ on ] # HAVE_LZMA_SUPPORT
get_cpuid: [ on ] # HAVE_AUXTRACE_SUPPORT
bpf: [ on ] # HAVE_LIBBPF_SUPPORT
aio: [ on ] # HAVE_AIO_SUPPORT
zstd: [ on ] # HAVE_ZSTD_SUPPORT
libpfm4: [ OFF ] # HAVE_LIBPFM
# perf test
1: vmlinux symtab matches kallsyms : Ok
2: Detect openat syscall event : Ok
3: Detect openat syscall event on all cpus : Ok
4: Read samples using the mmap interface : Ok
5: Test data source output : Ok
6: Parse event definition strings :
6.1: Test event parsing : Ok
6.2: Test parsing of "hybrid" CPU events : Skip (not hybrid)
6.3: Parsing of all PMU events from sysfs : Ok
6.4: Parsing of given PMU events from sysfs : Ok
6.5: Parsing of aliased events from sysfs : Skip (no aliases in sysfs)
6.6: Parsing of aliased events : Ok
6.7: Parsing of terms (event modifiers) : Ok
7: Simple expression parser : Ok
8: PERF_RECORD_* events & perf_sample fields : Ok
9: Parse perf pmu format : Ok
10: PMU events :
10.1: PMU event table sanity : Ok
10.2: PMU event map aliases : Ok
10.3: Parsing of PMU event table metrics : Ok
10.4: Parsing of PMU event table metrics with fake PMUs : Ok
11: DSO data read : Ok
12: DSO data cache : Ok
13: DSO data reopen : Ok
14: Roundtrip evsel->name : Ok
15: Parse sched tracepoints fields : Ok
16: syscalls:sys_enter_openat event fields : Ok
17: Setup struct perf_event_attr : Ok
18: Match and link multiple hists : Ok
19: 'import perf' in python : Ok
20: Breakpoint overflow signal handler : Ok
21: Breakpoint overflow sampling : Ok
22: Breakpoint accounting : Ok
23: Watchpoint :
23.1: Read Only Watchpoint : Skip (missing hardware support)
23.2: Write Only Watchpoint : Ok
23.3: Read / Write Watchpoint : Ok
23.4: Modify Watchpoint : Ok
24: Number of exit events of a simple workload : Ok
25: Software clock events period values : Ok
26: Object code reading : Ok
27: Sample parsing : Ok
28: Use a dummy software event to keep tracking : Ok
29: Parse with no sample_id_all bit set : Ok
30: Filter hist entries : Ok
31: Lookup mmap thread : Ok
32: Share thread maps : Ok
33: Sort output of hist entries : Ok
34: Cumulate child hist entries : Ok
35: Track with sched_switch : Ok
36: Filter fds with revents mask in a fdarray : Ok
37: Add fd to a fdarray, making it autogrow : Ok
38: kmod_path__parse : Ok
39: Thread map : Ok
40: LLVM search and compile :
40.1: Basic BPF llvm compile : Ok
40.2: kbuild searching : Ok
40.3: Compile source for BPF prologue generation : Ok
40.4: Compile source for BPF relocation : Ok
41: Session topology : Ok
42: BPF filter :
42.1: Basic BPF filtering : Ok
42.2: BPF pinning : Ok
42.3: BPF prologue generation : Ok
43: Synthesize thread map : Ok
44: Remove thread map : Ok
45: Synthesize cpu map : Ok
46: Synthesize stat config : Ok
47: Synthesize stat : Ok
48: Synthesize stat round : Ok
49: Synthesize attr update : Ok
50: Event times : Ok
51: Read backward ring buffer : Ok
52: Print cpu map : Ok
53: Merge cpu map : Ok
54: Probe SDT events : Ok
55: is_printable_array : Ok
56: Print bitmap : Ok
57: perf hooks : Ok
58: builtin clang support :
58.1: builtin clang compile C source to IR : Skip (not compiled in)
58.2: builtin clang compile C source to ELF object : Skip (not compiled in)
59: unit_number__scnprintf : Ok
60: mem2node : Ok
61: time utils : Ok
62: Test jit_write_elf : Ok
63: Test libpfm4 support :
63.1: test of individual --pfm-events : Skip (not compiled in)
63.2: test groups of --pfm-events : Skip (not compiled in)
64: Test api io : Ok
65: maps__merge_in : Ok
66: Demangle Java : Ok
67: Demangle OCaml : Ok
68: Parse and process metrics : Ok
69: PE file support : Ok
70: Event expansion for cgroups : Ok
71: Convert perf time to TSC :
71.1: TSC support : Ok
71.2: Perf time to TSC : Ok
72: dlfilter C API : Ok
73: Sigtrap : Ok
74: x86 rdpmc : Ok
75: Test dwarf unwind : Ok
76: x86 instruction decoder - new instructions : Ok
77: Intel PT packet decoder : Ok
78: x86 bp modify : Ok
79: x86 Sample parsing : Ok
80: build id cache operations : Ok
81: daemon operations : Ok
82: perf pipe recording and injection test : Ok
83: Add vfs_getname probe to get syscall args filenames : Ok
84: probe libc's inet_pton & backtrace it with ping : Ok
85: Use vfs_getname probe to get syscall args filenames : Ok
86: Zstd perf.data compression/decompression : Ok
87: perf record tests : Ok
88: perf record offcpu profiling tests : Ok
89: perf stat CSV output linter : Skip
90: perf stat csv summary test : Ok
91: perf stat metrics (shadow stat) test : Ok
92: perf stat tests : Ok
93: perf all metricgroups test : Ok
94: perf all metrics test : FAILED!
95: perf all PMU test : Ok
96: perf stat --bpf-counters test : Ok
97: Check Arm64 callgraphs are complete in fp mode : Skip
98: Check Arm CoreSight trace data recording and synthesized samples: Skip
99: Check Arm SPE trace data recording and synthesized samples : Skip
100: Check Arm SPE doesn't hang when there are forks : Skip
101: Miscellaneous Intel PT testing : Ok
102: Check open filename arg using perf trace + vfs_getname : Ok
#
The CORESIGHT=1 build is failing and is being investigated; I couldn't reproduce it
building it out of 'make -C tools/perf build-test', as below.
Also, LIBBPF_DYNAMIC=1 fails on fedora:36 because libbpf-devel is old there. It
should work with 0.8.0, which is in fedora:rawhide and should be in 36 soon,
according to Jiri Olsa.
$ cat /proc/cpuinfo | grep "model name" -m1
model name : Intel(R) Core(TM) i7-8650U CPU @ 1.90GHz
$ git log --oneline -1 ; time make -C tools/perf/ build-test
1bcca2b1bd67f3c0 (HEAD -> perf/core, acme.korg/tmp.perf/core) perf vendor events intel: Update metrics for Alderlake
make: Entering directory '/home/acme/git/perf/tools/perf'
- tarpkg: ./tests/perf-targz-src-pkg .
make_static: make LDFLAGS=-static NO_PERF_READ_VDSO32=1 NO_PERF_READ_VDSOX32=1 NO_JVMTI=1 -j8 DESTDIR=/tmp/tmp.x5DTDzzkiO
make_with_gtk2: make GTK2=1 -j8 DESTDIR=/tmp/tmp.3FlErTijHh
make_minimal_O: make NO_LIBPERL=1 NO_LIBPYTHON=1 NO_NEWT=1 NO_GTK2=1 NO_DEMANGLE=1 NO_LIBELF=1 NO_LIBUNWIND=1 NO_BACKTRACE=1 NO_LIBNUMA=1 NO_LIBAUDIT=1 NO_LIBBIONIC=1 NO_LIBDW_DWARF_UNWIND=1 NO_AUXTRACE=1 NO_LIBBPF=1 NO_LIBCRYPTO=1 NO_SDT=1 NO_JVMTI=1 NO_LIBZSTD=1 NO_LIBCAP=1 NO_SYSCALL_TABLE=1
make_no_newt_O: make NO_NEWT=1
make_no_ui_O: make NO_NEWT=1 NO_SLANG=1 NO_GTK2=1
make_debug_O: make DEBUG=1
make_no_auxtrace_O: make NO_AUXTRACE=1
make_install_prefix_slash_O: make install prefix=/tmp/krava/
make_util_pmu_bison_o_O: make util/pmu-bison.o
make_no_backtrace_O: make NO_BACKTRACE=1
make_no_libperl_O: make NO_LIBPERL=1
make_tags_O: make tags
make_no_libdw_dwarf_unwind_O: make NO_LIBDW_DWARF_UNWIND=1
make_no_libbionic_O: make NO_LIBBIONIC=1
make_no_sdt_O: make NO_SDT=1
make_no_libaudit_O: make NO_LIBAUDIT=1
make_perf_o_O: make perf.o
make_cscope_O: make cscope
make_install_O: make install
make_no_slang_O: make NO_SLANG=1
make_no_syscall_tbl_O: make NO_SYSCALL_TABLE=1
make_no_libelf_O: make NO_LIBELF=1
make_no_libbpf_DEBUG_O: make NO_LIBBPF=1 DEBUG=1
make_no_libbpf_O: make NO_LIBBPF=1
make_no_demangle_O: make NO_DEMANGLE=1
make_install_bin_O: make install-bin
make_install_prefix_O: make install prefix=/tmp/krava
make_clean_all_O: make clean all
make_pure_O: make
make_no_libnuma_O: make NO_LIBNUMA=1
make_doc_O: make doc
make_no_libcrypto_O: make NO_LIBCRYPTO=1
make_no_libpython_O: make NO_LIBPYTHON=1
make_help_O: make help
make_no_gtk2_O: make NO_GTK2=1
make_with_clangllvm_O: make LIBCLANGLLVM=1
make_no_scripts_O: make NO_LIBPYTHON=1 NO_LIBPERL=1
make_util_map_o_O: make util/map.o
make_with_libpfm4_O: make LIBPFM4=1
make_with_babeltrace_O: make LIBBABELTRACE=1
make_no_libunwind_O: make NO_LIBUNWIND=1
OK
make: Leaving directory '/home/acme/git/perf/tools/perf'
real 18m58.348s
user 112m8.915s
sys 15m59.473s
$
* Re: [GIT PULL] perf tools changes for v5.19: 3rd batch
From: pr-tracker-bot @ 2022-06-04 20:56 UTC (permalink / raw)
To: Arnaldo Carvalho de Melo
Cc: Linus Torvalds, Ingo Molnar, Thomas Gleixner, Jiri Olsa,
Namhyung Kim, Clark Williams, Kate Carcia, linux-kernel,
linux-perf-users, Arnaldo Carvalho de Melo, Fangrui Song,
German Gomez, Ian Rogers, Kevin Nomura, Leo Yan,
Sebastian Ullrich, Thomas Richter, Zhengjun Xing,
Arnaldo Carvalho de Melo
The pull request you sent on Sat, 4 Jun 2022 00:31:36 +0200:
> git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux.git tags/perf-tools-for-v5.19-2022-06-04
has been merged into torvalds/linux.git:
https://git.kernel.org/torvalds/c/45b2e5ad6837dfe4de6b9028c575bd57c132774c
Thank you!
--
Deet-doot-dot, I am a bot.
https://korg.docs.kernel.org/prtracker.html