* [PATCH v2 00/16] Compress the pmu_event tables
From: Ian Rogers @ 2022-07-28 22:28 UTC
  To: John Garry, Will Deacon, James Clark, Mike Leach, Leo Yan,
	Peter Zijlstra, Ingo Molnar, Arnaldo Carvalho de Melo,
	Mark Rutland, Alexander Shishkin, Jiri Olsa, Namhyung Kim,
	Andi Kleen, Zhengjun Xing, Ravi Bangoria, Kan Liang,
	Adrian Hunter, linux-kernel, linux-arm-kernel, linux-perf-users
  Cc: Stephane Eranian, Ian Rogers

jevents.py creates a number of large arrays from the JSON events. The
arrays contain pointers to strings, and those pointers need relocating.
The relocations have file size, run time and memory costs. These changes
refactor the pmu_events API so that the storage of the pmu_event
struct isn't exposed. The format is then changed to an offset within a
single combined string: the variables of a pmu_event struct are stored
next to each other in that string, separated by \0, so only the offset
of the struct's first variable needs to be recorded.
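
As a rough illustration of the layout described above (not the code in
the patches), the following sketch packs events into one string and
keeps a single offset per event. The field list, build_table() and
decode_event() are hypothetical names chosen for the example; the real
pmu_event struct has more members, and the sketch assumes ASCII strings
so that character offsets equal byte offsets.

```python
# Hypothetical, shortened field order; the real struct has more members.
FIELDS = ("name", "event", "desc")


def build_table(events):
    """Pack events into one big string and return (big_string, offsets).

    Each event contributes its fields back to back, each terminated by
    NUL, so one offset per event is enough to find every field.
    """
    pieces = []
    offsets = []
    pos = 0
    for ev in events:
        offsets.append(pos)  # offset of the event's first field only
        for field in FIELDS:
            s = ev.get(field, "")  # a missing field becomes an empty string
            pieces.append(s + "\0")
            pos += len(s) + 1
    return "".join(pieces), offsets


def decode_event(big_string, offset):
    """Recover all fields by walking consecutive NUL-terminated strings."""
    ev = {}
    for field in FIELDS:
        end = big_string.index("\0", offset)
        ev[field] = big_string[offset:end]
        offset = end + 1  # the next field starts right after the NUL
    return ev


if __name__ == "__main__":
    events = [
        {"name": "inst_retired.any", "event": "event=0xc0",
         "desc": "Instructions retired"},
        {"name": "cpu_clk", "event": "event=0x3c"},
    ]
    big, offs = build_table(events)
    for off in offs:
        print(off, decode_event(big, off))
```

The table then holds plain integers rather than pointers, which is why
no relocations are needed for it.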

Some related fixes are included in the patches. The architecture that
jevents.py creates tables for can now be set with the JEVENTS_ARCH make
variable, which also accepts a new 'all' value that generates the
events and metrics for all architectures.

An example of the improvement to the file size on x86, built with the
default build options plus NO_LIBBFD=1:
no jevents  - unchanged at 19,788,464 bytes
x86 jevents - ~16.7% file size saving, 23,744,288 bytes vs 28,502,632 bytes
all jevents - ~19.5% file size saving, 24,469,056 bytes vs 30,379,920 bytes
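
The percentages can be re-derived from the byte counts quoted above. In
this small sketch, saving_percent() is a helper named only for the
example, and it assumes the smaller figure in each pair is the size
with this series applied.

```python
def saving_percent(new_bytes, old_bytes):
    """Return the size reduction as a percentage of the old size."""
    return 100.0 * (old_bytes - new_bytes) / old_bytes


# (size with the series, size without), taken from the figures above.
sizes = {
    "x86 jevents": (23_744_288, 28_502_632),
    "all jevents": (24_469_056, 30_379_920),
}

if __name__ == "__main__":
    for label, (new, old) in sizes.items():
        print(f"{label}: {saving_percent(new, old):.1f}% smaller")
```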

I originally suggested fixing this problem in:
https://lore.kernel.org/linux-perf-users/CAP-5=fVB8G4bdb9T=FncRTh9oBVKCS=+=eowAO+YSgAhab+Dtg@mail.gmail.com/

v2. Split the substring folding optimization into its own patch and
    tweak comments as suggested by Namhyung Kim
    <namhyung@kernel.org>. Recompute the file size savings with the
    latest JSON events and metrics.

Ian Rogers (16):
  perf jevents: Simplify generation of C-string
  perf jevents: Add JEVENTS_ARCH make option
  perf jevent: Add an 'all' architecture argument
  perf jevents: Remove the type/version variables
  perf jevents: Provide path to json file on error
  perf jevents: Sort json files entries
  perf pmu-events: Hide pmu_sys_event_tables
  perf pmu-events: Avoid passing pmu_events_map
  perf pmu-events: Hide pmu_events_map
  perf test: Use full metric resolution
  perf pmu-events: Move test events/metrics to json
  perf pmu-events: Don't assume pmu_event is an array
  perf pmu-events: Hide the pmu_events
  perf metrics: Copy entire pmu_event in find metric
  perf jevents: Compress the pmu_events_table
  perf jevents: Fold strings optimization

 tools/perf/arch/arm64/util/pmu.c              |   4 +-
 tools/perf/pmu-events/Build                   |   6 +-
 .../arch/test/test_soc/cpu/metrics.json       |  64 +++
 tools/perf/pmu-events/empty-pmu-events.c      | 204 +++++++-
 tools/perf/pmu-events/jevents.py              | 485 +++++++++++++++---
 tools/perf/pmu-events/pmu-events.h            |  40 +-
 tools/perf/tests/expand-cgroup.c              |  25 +-
 tools/perf/tests/parse-metric.c               |  77 +--
 tools/perf/tests/pmu-events.c                 | 466 +++++++----------
 tools/perf/util/metricgroup.c                 | 275 ++++++----
 tools/perf/util/metricgroup.h                 |   5 +-
 tools/perf/util/pmu.c                         | 139 ++---
 tools/perf/util/pmu.h                         |   8 +-
 tools/perf/util/s390-sample-raw.c             |  50 +-
 14 files changed, 1135 insertions(+), 713 deletions(-)
 create mode 100644 tools/perf/pmu-events/arch/test/test_soc/cpu/metrics.json

-- 
2.37.1.455.g008518b4e5-goog

