All of lore.kernel.org
 help / color / mirror / Atom feed
* [GIT PULL] perf tools changes for v5.19: 3rd batch
@ 2022-06-03 22:31 Arnaldo Carvalho de Melo
  2022-06-04 20:56 ` pr-tracker-bot
  0 siblings, 1 reply; 2+ messages in thread
From: Arnaldo Carvalho de Melo @ 2022-06-03 22:31 UTC (permalink / raw)
  To: Linus Torvalds
  Cc: Ingo Molnar, Thomas Gleixner, Jiri Olsa, Namhyung Kim,
	Clark Williams, Kate Carcia, linux-kernel, linux-perf-users,
	Arnaldo Carvalho de Melo, Fangrui Song, German Gomez, Ian Rogers,
	Kevin Nomura, Leo Yan, Sebastian Ullrich, Thomas Richter,
	Zhengjun Xing, Arnaldo Carvalho de Melo

Hi Linus,

	Please consider pulling,

Best regards,

- Arnaldo

Reduced set of tests at the end of this message.

The following changes since commit 9be4cbd09da820a20d400670a45fc1571f6a13b8:

  driver core: Set default deferred_probe_timeout back to 0. (2022-06-03 11:58:54 -0700)

are available in the Git repository at:

  git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux.git tags/perf-tools-for-v5.19-2022-06-04

for you to fetch changes up to 1bcca2b1bd67f3c0e5c3a88ed16c6389f01a5b31:

  perf vendor events intel: Update metrics for Alderlake (2022-06-03 21:45:32 +0200)

----------------------------------------------------------------
perf tools changes for v5.19: 3rd batch

- Synthesize task events for pre-existing threads when using 'perf lock --threads',
  as we need to show task names.

- Fix unwinding with ld.lld (>= version 10.0) linked objects, where
  .eh_frame_hdr and .text are in different PT_LOAD program headers, which makes
  perf record --call-graph dwarf fail with such obkects.

- Check if 'perf record' hangs in the ARM SPE (Statistical Profiling Extensions)
  'perf test' entry when recording a workload with forks.

- Trace physical address for Arm SPE events, needed for 'perf c2c' to locate
  the memory node for samples.

- Fix sorting in percent_rmt_hitm_cmp() in 'perf c2c'.

- Further support for Intel hybrid systems in the evlist and 'perf record' code.

- Update IBM s/390 vendor event JSON tables.

- Add metrics (JSON) for Intel Sapphirerapids.

- Update metrics for Intel Alderlake.

- Correct typo of sysf 'event_source' directory in the documentation.

Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>

----------------------------------------------------------------
Fangrui Song (1):
      perf unwind: Fix segbase for ld.lld linked objects

German Gomez (1):
      perf test arm-spe: Check if perf-record hangs when recording workload with forks

Ian Rogers (1):
      perf docs: Correct typo of event_sources

Leo Yan (2):
      perf mem: Trace physical address for Arm SPE events
      perf c2c: Fix sorting in percent_rmt_hitm_cmp()

Namhyung Kim (1):
      perf lock: Change to synthesize task events

Thomas Richter (7):
      perf list: Add IBM z16 event description for s390
      perf list: Update event description for IBM z10 to latest level
      perf list: Update event description for IBM z13 to latest level
      perf list: Update event description for IBM z14 to latest level
      perf list: Update event description for IBM z15 to latest level
      perf list: Update event description for IBM z196/z114 to latest level
      perf list: Update event description for IBM zEC12/zBC12 to latest level

Zhengjun Xing (4):
      perf evlist: Extend arch_evsel__must_be_in_group to support hybrid systems
      perf record: Support sample-read topdown metric group for hybrid platforms
      perf vendor events intel: Add metrics for Sapphirerapids
      perf vendor events intel: Update metrics for Alderlake

 tools/perf/Documentation/perf-record.txt           |   2 +-
 tools/perf/Documentation/perf-stat.txt             |   2 +-
 tools/perf/Documentation/perf-top.txt              |   2 +-
 tools/perf/arch/arm64/util/mem-events.c            |   6 +-
 tools/perf/arch/x86/util/evsel.c                   |   5 +-
 tools/perf/arch/x86/util/evsel.h                   |   7 +
 tools/perf/arch/x86/util/topdown.c                 |  21 +-
 tools/perf/builtin-c2c.c                           |   4 +-
 tools/perf/builtin-lock.c                          |   2 +-
 tools/perf/pmu-events/arch/s390/cf_z10/basic.json  |  48 +-
 tools/perf/pmu-events/arch/s390/cf_z10/crypto.json |  64 +--
 .../perf/pmu-events/arch/s390/cf_z10/extended.json |  36 +-
 tools/perf/pmu-events/arch/s390/cf_z13/basic.json  |  48 +-
 tools/perf/pmu-events/arch/s390/cf_z13/crypto.json |  64 +--
 .../perf/pmu-events/arch/s390/cf_z13/extended.json | 100 ++--
 tools/perf/pmu-events/arch/s390/cf_z14/basic.json  |  32 +-
 tools/perf/pmu-events/arch/s390/cf_z14/crypto.json |  64 +--
 .../perf/pmu-events/arch/s390/cf_z14/extended.json | 102 ++--
 tools/perf/pmu-events/arch/s390/cf_z15/basic.json  |  32 +-
 tools/perf/pmu-events/arch/s390/cf_z15/crypto.json | 114 -----
 .../perf/pmu-events/arch/s390/cf_z15/crypto6.json  | 112 +++++
 .../perf/pmu-events/arch/s390/cf_z15/extended.json | 108 ++---
 tools/perf/pmu-events/arch/s390/cf_z16/basic.json  |  58 +++
 .../perf/pmu-events/arch/s390/cf_z16/crypto6.json  | 142 ++++++
 .../perf/pmu-events/arch/s390/cf_z16/extended.json | 492 +++++++++++++++++++
 .../pmu-events/arch/s390/cf_z16/transaction.json   |   7 +
 tools/perf/pmu-events/arch/s390/cf_z196/basic.json |  48 +-
 .../perf/pmu-events/arch/s390/cf_z196/crypto.json  |  64 +--
 .../pmu-events/arch/s390/cf_z196/extended.json     |  44 +-
 .../perf/pmu-events/arch/s390/cf_zec12/basic.json  |  48 +-
 .../perf/pmu-events/arch/s390/cf_zec12/crypto.json |  64 +--
 .../pmu-events/arch/s390/cf_zec12/extended.json    |  66 +--
 tools/perf/pmu-events/arch/s390/mapfile.csv        |   1 +
 .../pmu-events/arch/x86/alderlake/adl-metrics.json | 163 +++++--
 .../arch/x86/sapphirerapids/spr-metrics.json       | 530 +++++++++++++++++++++
 tools/perf/tests/shell/test_arm_spe_fork.sh        |  92 ++++
 tools/perf/util/dso.h                              |   2 +
 tools/perf/util/unwind-libunwind-local.c           | 105 ++--
 38 files changed, 2163 insertions(+), 738 deletions(-)
 create mode 100644 tools/perf/arch/x86/util/evsel.h
 delete mode 100644 tools/perf/pmu-events/arch/s390/cf_z15/crypto.json
 create mode 100644 tools/perf/pmu-events/arch/s390/cf_z16/basic.json
 create mode 100644 tools/perf/pmu-events/arch/s390/cf_z16/crypto6.json
 create mode 100644 tools/perf/pmu-events/arch/s390/cf_z16/extended.json
 create mode 100644 tools/perf/pmu-events/arch/s390/cf_z16/transaction.json
 create mode 100644 tools/perf/pmu-events/arch/x86/sapphirerapids/spr-metrics.json
 create mode 100755 tools/perf/tests/shell/test_arm_spe_fork.sh

Test results:

The container based builds will return when I get back home.

The 'perf test' one will perform a variety of tests exercising
tools/perf/util/, tools/lib/{bpf,traceevent,etc}, as well as run perf commands
with a variety of command line event specifications to then intercept the
sys_perf_event syscall to check that the perf_event_attr fields are set up as
expected, among a variety of other unit tests.

Then there is the 'make -C tools/perf build-test' ones, that build tools/perf/
with a variety of feature sets, exercising the build with an incomplete set of
features as well as with a complete one.

  $ uname -a
  Linux quaco 5.17.11-300.fc36.x86_64 #1 SMP PREEMPT Wed May 25 15:04:05 UTC 2022 x86_64 x86_64 x86_64 GNU/Linux
  $ git log --oneline -1
  1bcca2b1bd67f3c0 (HEAD -> perf/core) perf vendor events intel: Update metrics for Alderlake
  $ perf -v
  perf version 5.18.g1bcca2b1bd67
  $ sudo su -
  # perf -vv
  perf version 5.18.g1bcca2b1bd67
                   dwarf: [ on  ]  # HAVE_DWARF_SUPPORT
      dwarf_getlocations: [ on  ]  # HAVE_DWARF_GETLOCATIONS_SUPPORT
                   glibc: [ on  ]  # HAVE_GLIBC_SUPPORT
           syscall_table: [ on  ]  # HAVE_SYSCALL_TABLE_SUPPORT
                  libbfd: [ on  ]  # HAVE_LIBBFD_SUPPORT
              debuginfod: [ on  ]  # HAVE_DEBUGINFOD_SUPPORT
                  libelf: [ on  ]  # HAVE_LIBELF_SUPPORT
                 libnuma: [ on  ]  # HAVE_LIBNUMA_SUPPORT
  numa_num_possible_cpus: [ on  ]  # HAVE_LIBNUMA_SUPPORT
                 libperl: [ on  ]  # HAVE_LIBPERL_SUPPORT
               libpython: [ on  ]  # HAVE_LIBPYTHON_SUPPORT
                libslang: [ on  ]  # HAVE_SLANG_SUPPORT
               libcrypto: [ OFF ]  # HAVE_LIBCRYPTO_SUPPORT
               libunwind: [ on  ]  # HAVE_LIBUNWIND_SUPPORT
      libdw-dwarf-unwind: [ on  ]  # HAVE_DWARF_SUPPORT
                    zlib: [ on  ]  # HAVE_ZLIB_SUPPORT
                    lzma: [ on  ]  # HAVE_LZMA_SUPPORT
               get_cpuid: [ on  ]  # HAVE_AUXTRACE_SUPPORT
                     bpf: [ on  ]  # HAVE_LIBBPF_SUPPORT
                     aio: [ on  ]  # HAVE_AIO_SUPPORT
                    zstd: [ on  ]  # HAVE_ZSTD_SUPPORT
                 libpfm4: [ OFF ]  # HAVE_LIBPFM
  # perf test
    1: vmlinux symtab matches kallsyms                                 : Ok
    2: Detect openat syscall event                                     : Ok
    3: Detect openat syscall event on all cpus                         : Ok
    4: Read samples using the mmap interface                           : Ok
    5: Test data source output                                         : Ok
    6: Parse event definition strings                                  :
    6.1: Test event parsing                                            : Ok
    6.2: Test parsing of "hybrid" CPU events                           : Skip (not hybrid)
    6.3: Parsing of all PMU events from sysfs                          : Ok
    6.4: Parsing of given PMU events from sysfs                        : Ok
    6.5: Parsing of aliased events from sysfs                          : Skip (no aliases in sysfs)
    6.6: Parsing of aliased events                                     : Ok
    6.7: Parsing of terms (event modifiers)                            : Ok
    7: Simple expression parser                                        : Ok
    8: PERF_RECORD_* events & perf_sample fields                       : Ok
    9: Parse perf pmu format                                           : Ok
   10: PMU events                                                      :
   10.1: PMU event table sanity                                        : Ok
   10.2: PMU event map aliases                                         : Ok
   10.3: Parsing of PMU event table metrics                            : Ok
   10.4: Parsing of PMU event table metrics with fake PMUs             : Ok
   11: DSO data read                                                   : Ok
   12: DSO data cache                                                  : Ok
   13: DSO data reopen                                                 : Ok
   14: Roundtrip evsel->name                                           : Ok
   15: Parse sched tracepoints fields                                  : Ok
   16: syscalls:sys_enter_openat event fields                          : Ok
   17: Setup struct perf_event_attr                                    : Ok
   18: Match and link multiple hists                                   : Ok
   19: 'import perf' in python                                         : Ok
   20: Breakpoint overflow signal handler                              : Ok
   21: Breakpoint overflow sampling                                    : Ok
   22: Breakpoint accounting                                           : Ok
   23: Watchpoint                                                      :
   23.1: Read Only Watchpoint                                          : Skip (missing hardware support)
   23.2: Write Only Watchpoint                                         : Ok
   23.3: Read / Write Watchpoint                                       : Ok
   23.4: Modify Watchpoint                                             : Ok
   24: Number of exit events of a simple workload                      : Ok
   25: Software clock events period values                             : Ok
   26: Object code reading                                             : Ok
   27: Sample parsing                                                  : Ok
   28: Use a dummy software event to keep tracking                     : Ok
   29: Parse with no sample_id_all bit set                             : Ok
   30: Filter hist entries                                             : Ok
   31: Lookup mmap thread                                              : Ok
   32: Share thread maps                                               : Ok
   33: Sort output of hist entries                                     : Ok
   34: Cumulate child hist entries                                     : Ok
   35: Track with sched_switch                                         : Ok
   36: Filter fds with revents mask in a fdarray                       : Ok
   37: Add fd to a fdarray, making it autogrow                         : Ok
   38: kmod_path__parse                                                : Ok
   39: Thread map                                                      : Ok
   40: LLVM search and compile                                         :
   40.1: Basic BPF llvm compile                                        : Ok
   40.2: kbuild searching                                              : Ok
   40.3: Compile source for BPF prologue generation                    : Ok
   40.4: Compile source for BPF relocation                             : Ok
   41: Session topology                                                : Ok
   42: BPF filter                                                      :
   42.1: Basic BPF filtering                                           : Ok
   42.2: BPF pinning                                                   : Ok
   42.3: BPF prologue generation                                       : Ok
   43: Synthesize thread map                                           : Ok
   44: Remove thread map                                               : Ok
   45: Synthesize cpu map                                              : Ok
   46: Synthesize stat config                                          : Ok
   47: Synthesize stat                                                 : Ok
   48: Synthesize stat round                                           : Ok
   49: Synthesize attr update                                          : Ok
   50: Event times                                                     : Ok
   51: Read backward ring buffer                                       : Ok
   52: Print cpu map                                                   : Ok
   53: Merge cpu map                                                   : Ok
   54: Probe SDT events                                                : Ok
   55: is_printable_array                                              : Ok
   56: Print bitmap                                                    : Ok
   57: perf hooks                                                      : Ok
   58: builtin clang support                                           :
   58.1: builtin clang compile C source to IR                          : Skip (not compiled in)
   58.2: builtin clang compile C source to ELF object                  : Skip (not compiled in)
   59: unit_number__scnprintf                                          : Ok
   60: mem2node                                                        : Ok
   61: time utils                                                      : Ok
   62: Test jit_write_elf                                              : Ok
   63: Test libpfm4 support                                            :
   63.1: test of individual --pfm-events                               : Skip (not compiled in)
   63.2: test groups of --pfm-events                                   : Skip (not compiled in)
   64: Test api io                                                     : Ok
   65: maps__merge_in                                                  : Ok
   66: Demangle Java                                                   : Ok
   67: Demangle OCaml                                                  : Ok
   68: Parse and process metrics                                       : Ok
   69: PE file support                                                 : Ok
   70: Event expansion for cgroups                                     : Ok
   71: Convert perf time to TSC                                        :
   71.1: TSC support                                                   : Ok
   71.2: Perf time to TSC                                              : Ok
   72: dlfilter C API                                                  : Ok
   73: Sigtrap                                                         : Ok
   74: x86 rdpmc                                                       : Ok
   75: Test dwarf unwind                                               : Ok
   76: x86 instruction decoder - new instructions                      : Ok
   77: Intel PT packet decoder                                         : Ok
   78: x86 bp modify                                                   : Ok
   79: x86 Sample parsing                                              : Ok
   80: build id cache operations                                       : Ok
   81: daemon operations                                               : Ok
   82: perf pipe recording and injection test                          : Ok
   83: Add vfs_getname probe to get syscall args filenames             : Ok
   84: probe libc's inet_pton & backtrace it with ping                 : Ok
   85: Use vfs_getname probe to get syscall args filenames             : Ok
   86: Zstd perf.data compression/decompression                        : Ok
   87: perf record tests                                               : Ok
   88: perf record offcpu profiling tests                              : Ok
   89: perf stat CSV output linter                                     : Skip
   90: perf stat csv summary test                                      : Ok
   91: perf stat metrics (shadow stat) test                            : Ok
   92: perf stat tests                                                 : Ok
   93: perf all metricgroups test                                      : Ok
   94: perf all metrics test                                           : FAILED!
   95: perf all PMU test                                               : Ok
   96: perf stat --bpf-counters test                                   : Ok
   97: Check Arm64 callgraphs are complete in fp mode                  : Skip
   98: Check Arm CoreSight trace data recording and synthesized samples: Skip
   99: Check Arm SPE trace data recording and synthesized samples      : Skip
  100: Check Arm SPE doesn't hang when there are forks                 : Skip
  101: Miscellaneous Intel PT testing                                  : Ok
  102: Check open filename arg using perf trace + vfs_getname          : Ok
  #

The CORESIGHT=1 build is failing, being investigated, couldn't reproduce it
building it out of 'make -C tools/perf build-test', as below.

Also LIBBPF_DYNAMIC=1 fails on fedora:36, as libbpf-devel is old, should work
with 0.8.0, that is in fedora:rawhide and should be on 36 soon according to
Jiri Olsa.

  $ cat /proc/cpuinfo | grep "model name" -m1
  model name	: Intel(R) Core(TM) i7-8650U CPU @ 1.90GHz
  $ git log --oneline -1 ; time make -C tools/perf/ build-test
  1bcca2b1bd67f3c0 (HEAD -> perf/core, acme.korg/tmp.perf/core) perf vendor events intel: Update metrics for Alderlake
  make: Entering directory '/home/acme/git/perf/tools/perf'
  - tarpkg: ./tests/perf-targz-src-pkg .
                   make_static: make LDFLAGS=-static NO_PERF_READ_VDSO32=1 NO_PERF_READ_VDSOX32=1 NO_JVMTI=1 -j8  DESTDIR=/tmp/tmp.x5DTDzzkiO
                make_with_gtk2: make GTK2=1 -j8  DESTDIR=/tmp/tmp.3FlErTijHh
                make_minimal_O: make NO_LIBPERL=1 NO_LIBPYTHON=1 NO_NEWT=1 NO_GTK2=1 NO_DEMANGLE=1 NO_LIBELF=1 NO_LIBUNWIND=1 NO_BACKTRACE=1 NO_LIBNUMA=1 NO_LIBAUDIT=1 NO_LIBBIONIC=1 NO_LIBDW_DWARF_UNWIND=1 NO_AUXTRACE=1 NO_LIBBPF=1 NO_LIBCRYPTO=1 NO_SDT=1 NO_JVMTI=1 NO_LIBZSTD=1 NO_LIBCAP=1 NO_SYSCALL_TABLE=1
                make_no_newt_O: make NO_NEWT=1
                  make_no_ui_O: make NO_NEWT=1 NO_SLANG=1 NO_GTK2=1
                  make_debug_O: make DEBUG=1
            make_no_auxtrace_O: make NO_AUXTRACE=1
   make_install_prefix_slash_O: make install prefix=/tmp/krava/
       make_util_pmu_bison_o_O: make util/pmu-bison.o
           make_no_backtrace_O: make NO_BACKTRACE=1
             make_no_libperl_O: make NO_LIBPERL=1
                   make_tags_O: make tags
  make_no_libdw_dwarf_unwind_O: make NO_LIBDW_DWARF_UNWIND=1
           make_no_libbionic_O: make NO_LIBBIONIC=1
                 make_no_sdt_O: make NO_SDT=1
            make_no_libaudit_O: make NO_LIBAUDIT=1
                 make_perf_o_O: make perf.o
                 make_cscope_O: make cscope
                make_install_O: make install
               make_no_slang_O: make NO_SLANG=1
         make_no_syscall_tbl_O: make NO_SYSCALL_TABLE=1
              make_no_libelf_O: make NO_LIBELF=1
        make_no_libbpf_DEBUG_O: make NO_LIBBPF=1 DEBUG=1
              make_no_libbpf_O: make NO_LIBBPF=1
            make_no_demangle_O: make NO_DEMANGLE=1
            make_install_bin_O: make install-bin
         make_install_prefix_O: make install prefix=/tmp/krava
              make_clean_all_O: make clean all
                   make_pure_O: make
             make_no_libnuma_O: make NO_LIBNUMA=1
                    make_doc_O: make doc
           make_no_libcrypto_O: make NO_LIBCRYPTO=1
           make_no_libpython_O: make NO_LIBPYTHON=1
                   make_help_O: make help
                make_no_gtk2_O: make NO_GTK2=1
         make_with_clangllvm_O: make LIBCLANGLLVM=1
             make_no_scripts_O: make NO_LIBPYTHON=1 NO_LIBPERL=1
             make_util_map_o_O: make util/map.o
           make_with_libpfm4_O: make LIBPFM4=1
        make_with_babeltrace_O: make LIBBABELTRACE=1
           make_no_libunwind_O: make NO_LIBUNWIND=1
  OK
  make: Leaving directory '/home/acme/git/perf/tools/perf'
  
  real	18m58.348s
  user	112m8.915s
  sys	15m59.473s
  $

^ permalink raw reply	[flat|nested] 2+ messages in thread

* Re: [GIT PULL] perf tools changes for v5.19: 3rd batch
  2022-06-03 22:31 [GIT PULL] perf tools changes for v5.19: 3rd batch Arnaldo Carvalho de Melo
@ 2022-06-04 20:56 ` pr-tracker-bot
  0 siblings, 0 replies; 2+ messages in thread
From: pr-tracker-bot @ 2022-06-04 20:56 UTC (permalink / raw)
  To: Arnaldo Carvalho de Melo
  Cc: Linus Torvalds, Ingo Molnar, Thomas Gleixner, Jiri Olsa,
	Namhyung Kim, Clark Williams, Kate Carcia, linux-kernel,
	linux-perf-users, Arnaldo Carvalho de Melo, Fangrui Song,
	German Gomez, Ian Rogers, Kevin Nomura, Leo Yan,
	Sebastian Ullrich, Thomas Richter, Zhengjun Xing,
	Arnaldo Carvalho de Melo

The pull request you sent on Sat,  4 Jun 2022 00:31:36 +0200:

> git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux.git tags/perf-tools-for-v5.19-2022-06-04

has been merged into torvalds/linux.git:
https://git.kernel.org/torvalds/c/45b2e5ad6837dfe4de6b9028c575bd57c132774c

Thank you!

-- 
Deet-doot-dot, I am a bot.
https://korg.docs.kernel.org/prtracker.html

^ permalink raw reply	[flat|nested] 2+ messages in thread

end of thread, other threads:[~2022-06-04 20:56 UTC | newest]

Thread overview: 2+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2022-06-03 22:31 [GIT PULL] perf tools changes for v5.19: 3rd batch Arnaldo Carvalho de Melo
2022-06-04 20:56 ` pr-tracker-bot

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.