From: Arnaldo Carvalho de Melo <acme@kernel.org>
To: Leo Yan <leo.yan@linaro.org>
Cc: James Clark <james.clark@arm.com>, John Garry <john.garry@huawei.com>,
    Will Deacon <will@kernel.org>, Mathieu Poirier <mathieu.poirier@linaro.org>,
    Peter Zijlstra <peterz@infradead.org>, Ingo Molnar <mingo@redhat.com>,
    Mark Rutland <mark.rutland@arm.com>,
    Alexander Shishkin <alexander.shishkin@linux.intel.com>,
    Jiri Olsa <jolsa@redhat.com>, Namhyung Kim <namhyung@kernel.org>,
    Dave Martin <Dave.Martin@arm.com>, Al Grant <Al.Grant@arm.com>,
    linux-arm-kernel@lists.infradead.org, linux-perf-users@vger.kernel.org,
    linux-kernel@vger.kernel.org
Subject: Re: [PATCH v5 5/5] perf arm-spe: Don't wait for PERF_RECORD_EXIT event
Date: Thu, 1 Jul 2021 14:03:16 -0300
Message-ID: <YN31VC9e/33QYhwn@kernel.org>
In-Reply-To: <20210628121217.GD200044@leoy-ThinkPad-X240s>

On Mon, Jun 28, 2021 at 08:12:17PM +0800, Leo Yan wrote:
> On Fri, Jun 25, 2021 at 02:25:15PM +0100, James Clark wrote:
> > For the whole set:
> > Reviewed-by: James Clark <james.clark@arm.com>
> > Tested-by: James Clark <james.clark@arm.com>
> >
> > I see a big improvement in decoding involving multiple processes because the timestamps are now
> >
> > For example perf-exec samples are visible right before the exec is done, and on an
> > application that forks, samples are visible from all processes.
> > For example:
> >
> > perf record -e arm_spe// -- bash -c "stress -c 1"
> > perf script
> >
> > perf-exec 4502 [003] 259755.050409: 1 l1d-access: ffff80001014b840 sched_clock+0x40 ([kernel.kallsyms])
> > perf-exec 4502 [003] 259755.050409: 1 tlb-access: ffff80001014b840 sched_clock+0x40 ([kernel.kallsyms])
> > perf-exec 4502 [003] 259755.050409: 1 memory: ffff80001014b840 sched_clock+0x40 ([kernel.kallsyms])
> > perf-exec 4502 [003] 259755.050411: 1 tlb-access: ffff800010120fb8 __rcu_read_lock+0x0 ([kernel.kallsyms])
> > bash 4502 [003] 259755.050411: 1 branch-miss: ffff8000105b2a40 memcpy+0x80 ([kernel.kallsyms])
> > bash 4502 [003] 259755.050411: 1 tlb-access: 0 [unknown] ([unknown])
> > ...
> > stress 4502 [003] 259755.051468: 1 l1d-access: ffff800010259a24 __vma_adjust+0x1f4 ([kernel.kallsyms])
> > stress 4502 [003] 259755.051468: 1 tlb-access: ffff800010259a24 __vma_adjust+0x1f4 ([kernel.kallsyms])
> > stress 4502 [003] 259755.051468: 1 memory: ffff800010259a24 __vma_adjust+0x1f4 ([kernel.kallsyms])
> >
> > Previously samples were only attributed to 'stress', which was obviously wrong.
>
> Thanks a lot for the review and testing, James!
>
> Hi Arnaldo, I confirmed this patch set can be cleanly applied on
> the latest acme/perf/core branch, so could you pick up this patch
> set?

Applied, thanks, please let me know if there is still something
outstanding,

- Arnaldo
Thread overview: 20+ messages
2021-05-19  7:19 [PATCH v5 0/5] perf arm-spe: Enable timestamp Leo Yan
2021-05-19  7:19 ` Leo Yan
2021-05-19  7:19 ` [PATCH v5 1/5] perf arm-spe: Save clock parameters from TIME_CONV event Leo Yan
2021-05-19  7:19 ` Leo Yan
2021-05-19  7:19 ` [PATCH v5 2/5] perf arm-spe: Convert event kernel time to counter value Leo Yan
2021-05-19  7:19 ` Leo Yan
2021-05-19  7:19 ` [PATCH v5 3/5] perf arm-spe: Assign kernel time to synthesized event Leo Yan
2021-05-19  7:19 ` Leo Yan
2021-05-19  7:19 ` [PATCH v5 4/5] perf arm-spe: Bail out if the trace is later than perf event Leo Yan
2021-05-19  7:19 ` Leo Yan
2021-05-19  7:19 ` [PATCH v5 5/5] perf arm-spe: Don't wait for PERF_RECORD_EXIT event Leo Yan
2021-05-19  7:19 ` Leo Yan
2021-06-25 13:25 ` James Clark
2021-06-25 13:25 ` James Clark
2021-06-28 12:12 ` Leo Yan
2021-06-28 12:12 ` Leo Yan
2021-07-01 17:03 ` Arnaldo Carvalho de Melo [this message]
2021-07-01 17:03 ` Arnaldo Carvalho de Melo
2021-07-02  1:31 ` Leo Yan
2021-07-02  1:31 ` Leo Yan