Re: perf stat report segfaults

* Re: perf stat report segfaults
       [not found] ` <CAP-5=fU=kydiXW2Q_zc21dywD-wPWtwA8-dvcnu-e0VzOch_Hg@mail.gmail.com>
@ 2022-05-18 15:45   ` Ian Rogers
  2022-05-18 21:47   ` Michael Petlan
  1 sibling, 0 replies; 3+ messages in thread
From: Ian Rogers @ 2022-05-18 15:45 UTC (permalink / raw)
  To: Michael Petlan; +Cc: Arnaldo de Melo, Jiri Olsa, linux-perf-users

Resending with corrected linux-perf-users mailing list address.

Thanks,
Ian

On Wed, May 18, 2022 at 8:43 AM Ian Rogers <irogers@google.com> wrote:
>
> On Wed, May 18, 2022, 12:41 AM Michael Petlan <mpetlan@redhat.com> wrote:
>>
>> Hello Ian.
>>
>> I have been rebasing perf in RHEL to v5.17 and I have hit the following
>> problem:
>>
>> # perf stat record -- ls
>> [...]
>> # perf stat report
>> Segmentation fault (core dumped)
>>
>> Investigation led me to your patch:
>>
>> commit 7ac0089d138f80dcd7ba8ca368a9b2bdfe780b16
>> Author: Ian Rogers <irogers@google.com>
>> Date: Tue Jan 4 22:13:38 2022 -0800
>>
>>   perf evsel: Pass cpu not cpu map index to synthesize
>>
>> This results in that perf_event__synthesize_stat()'s second argument
>> is -1 instead of 0 before.
>>
>> With that, the cpu field is stored as ff ff ff ff, as you can see in the
>> raw report:
>>
>> -1 -1 0x630 [0x30]: PERF_RECORD_STAT
>> ... id 9597, cpu -1, thread 1
>> ... value 3431216, enabled 3431216, running 3431216
>> : unhandled!
>>
>> 0x660 [0x30]: event: 76
>> .
>> . ... raw event: size 48 bytes
>> .  0000:  00 00 00 4c 00 00 00 30 00 00 00 00 00 00 25 7e  ...L...0......%~
>> .+>0010:  ff ff ff ff 00 00 00 00 00 00 00 00 00 00 00 36  ...............6
>> .| 0020:  00 00 00 00 01 44 6e 89 00 00 00 00 01 44 6e 89  .....Dn......Dn.
>>  |
>>  here
>>
>> This value is later loaded and used as an index to xyarray in libperf, which
>> causes the segfault.
>
>
> Thanks for reporting this and sorry for the breakage! This kind of problem is something I've been trying  to fix. Basically perf has cpu maps that are sorted arrays of CPU numbers. For a counter that is available on every CPU this gets reported as say [0,1,2,3] on a 4 CPU system. Some counters are available on fewer CPUs, for example my 2 socket skylake machine has a memory controller counter with a cpu map of [0,18]. The indices into the CPU map are more densely encoded than just using the CPU number, so rather than have arrays (for things like file descriptors, saved values, etc.) sized by the number of CPUs they are sized by the CPU map size and indexed using the CPU map index. The problem I've been trying to fix is when the code has inadvertently swapped the two values. This can work if the CPU map has an entry for every CPU in the system, but fails for say my Skylake memory controller.
>
> In your example here the CPU value of -1 is a special "any" CPU marker that is described in the perf_event_open man page. When trying to work out the intent of the code in fixing CPU vs index I'd push the value through the API and try to use an intention revealing name, so typically cpu_map_idx. In this case this appears to have been done on the writing side but not on the reading side.
>
>> This is my "hotfix":
>>
>> diff --git a/tools/perf/util/stat.c b/tools/perf/util/stat.c
>> index 817a2de264b4..b6ca193ff972 100644
>> --- a/tools/perf/util/stat.c
>> +++ b/tools/perf/util/stat.c
>> @@ -472,9 +472,10 @@ int perf_stat_process_counter(struct perf_stat_config *config,
>>  int perf_event__process_stat_event(struct perf_session *session,
>>                                    union perf_event *event)
>>  {
>> -       struct perf_counts_values count;
>> +  struct perf_counts_values count, *ptr;
>>         struct perf_record_stat *st = &event->stat;
>>         struct evsel *counter;
>> +       int cpu = st->cpu;
>>
>>         count.val = st->val;
>>         count.ena = st->ena;
>> @@ -486,7 +487,9 @@ int perf_event__process_stat_event(struct perf_session *session,
>>                 return -EINVAL;
>>         }
>>
>> -       *perf_counts(counter->counts, st->cpu, st->thread) = count;
>> +       if (cpu == -1) cpu = 0;
>> +       ptr = perf_counts(counter->counts, cpu, st->thread);
>> +       if (ptr) *ptr = count;
>>         counter->supported = true;
>>         return 0;
>>  }
>>
>>
>> It needs to be reworked, but checking whether perf_counts() returns
>> not-NULL pointer is necessary anyway.
>>
>> My question here is what the -1 actually means and whether I can simply
>> do "if (cpu == -1) cpu = 0;", since the data lies on "offset" 0 anyway,
>> even when it's recorded by perf with your patch...
>
>
> So I'd rather this wasn't fixed this way. The "cpu" here is really a cpu_map_idx, we need to work backward to make sure that is the case. To try to make this safer I created a struct perf_cpu to disambiguate indices from CPU values. When we load the -1 from the file we need to turn it into an index using the cpu map. As the array is sorted the index will always be 0, but it would be nicer to use perf_cpu_map__idx [1] to show we're translating from what to what and how it relates to a particular cpu map.
>
> [1] https://git.kernel.org/pub/scm/linux/kernel/git/acme/linux.git/tree/tools/lib/perf/include/internal/cpumap.h?h=perf/core#n27
>
>>
>> I think the -1 comes from perf_event_open syscall, where cpu == -1 means
>> all cpus. Am I right?
>
>
> Yep, I'm trying to distinguish "all" from "any". On a 4 CPU system all would be [0, 1, 2, 3] while any would be [-1].
>
>>
>> In your opinion, what should be done to fix this problem the best way?
>
>
> Notes above but I'll also try to take a look at this using your very simple reproducer (which should definitely be a test).
>
> Thanks,
> Ian
>
>>
>> Thank you.
>> Michael

^ permalink raw reply	[flat|nested] 3+ messages in thread