From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1754095AbcKVHCA (ORCPT ); Tue, 22 Nov 2016 02:02:00 -0500 Received: from mail-pg0-f65.google.com ([74.125.83.65]:33617 "EHLO mail-pg0-f65.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751328AbcKVHB7 (ORCPT ); Tue, 22 Nov 2016 02:01:59 -0500 Date: Tue, 22 Nov 2016 16:00:57 +0900 From: Namhyung Kim To: Arnaldo Carvalho de Melo , David Ahern Cc: Ingo Molnar , Peter Zijlstra , Jiri Olsa , LKML , Stephane Eranian , Andi Kleen Subject: Re: [PATCHSET 0/7] perf sched: Introduce timehist command, again (v2) Message-ID: <20161122070057.GD10108@danjae.aot.lge.com> References: <20161116060634.28477-1-namhyung@kernel.org> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline In-Reply-To: <20161116060634.28477-1-namhyung@kernel.org> User-Agent: Mutt/1.7.1 (2016-10-04) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi Arnaldo, Could you please take a look at this? Thanks, Namhyung On Wed, Nov 16, 2016 at 03:06:26PM +0900, Namhyung Kim wrote: > Hello, > > This patchset is a rebased version of David's sched timehist work [1]. > I plan to improve perf sched command more and think that having > timehist command before the work looks good. It seems David is busy > these days, so I'm retrying it by myself. > > * changes in v2) > - change name 'b/n time' to 'wait time' (Ingo) > - show arrow between functions in the callchain (Ingo) > - fix a bug in calculating initial run time > > This implements only basic feature and a few options. I just split > the patch to make it easier to review and did some cosmetic changes. > More patches will come later. > > The below is from the David's original description (w/ slight change): > > ------------------------8<------------------------- > 'perf sched timehist' provides an analysis of scheduling events. > > Example usage: > perf sched record -- sleep 1 > perf sched timehist > > By default it shows the individual schedule events, including the time between > sched-in events for the task, the task scheduling delay (time between wakeup > and actually running) and run time for the task: > > time cpu task name[tid/pid] wait time sch delay run time > ------------- ---- -------------------- --------- --------- --------- > 79371.874569 [11] gcc[31949] 0.014 0.000 1.148 > 79371.874591 [10] gcc[31951] 0.000 0.000 0.024 > 79371.874603 [10] migration/10[59] 3.350 0.004 0.011 > 79371.874604 [11] 1.148 0.000 0.035 > 79371.874723 [05] 0.016 0.000 1.383 > 79371.874746 [05] gcc[31949] 0.153 0.078 0.022 > ... > > Times are in msec.usec. > > If callchains were recorded they are appended to the line with a default stack depth of 5: > > 79371.874569 [11] gcc[31949] 0.014 0.000 1.148 wait_for_completion_killable <- do_fork <- sys_vfork <- stub_vfork <- __vfork > 79371.874591 [10] gcc[31951] 0.000 0.000 0.024 __cond_resched <- _cond_resched <- wait_for_completion <- stop_one_cpu <- sched_exec > 79371.874603 [10] migration/10[59] 3.350 0.004 0.011 smpboot_thread_fn <- kthread <- ret_from_fork > 79371.874604 [11] 1.148 0.000 0.035 cpu_startup_entry <- start_secondary > 79371.874723 [05] 0.016 0.000 1.383 cpu_startup_entry <- start_secondary > 79371.874746 [05] gcc[31949] 0.153 0.078 0.022 do_wait sys_wait4 <- system_call_fastpath <- __GI___waitpid > > --no-call-graph can be used to not show the callchains. --max-stack is used > to control the number of frames shown (default of 5). -x/--excl options can > be used to collapse redundant callchains to get more relevant data on screen. > > Similar to perf-trace -s and -S can be used to dump a statistical summary > without or with events (respectively). Statistics include min run time, > average run time and max run time. Stats are also shown for run time by > cpu. > > The cpu-visual option provides a visual aid for sched switches by cpu: > ... > 79371.874569 [11] s gcc[31949] 0.014 0.000 1.148 > 79371.874591 [10] s gcc[31951] 0.000 0.000 0.024 > 79371.874603 [10] s migration/10[59] 3.350 0.004 0.011 > 79371.874604 [11] i 1.148 0.000 0.035 > 79371.874723 [05] i 0.016 0.000 1.383 > 79371.874746 [05] s gcc[31949] 0.153 0.078 0.022 > ... > > ------------------------8<------------------------- > > This code is available at 'perf/timehist-v2' branch in my tree > > git://git.kernel.org/pub/scm/linux/kernel/git/namhyung/linux-perf.git > > Any feedback is welcomed. > > Thanks, > Namhyung > > > [1] https://lkml.org/lkml/2013/12/1/129 > > David Ahern (6): > perf sched timehist: Introduce timehist command > perf sched timehist: Add summary options > perf sched timehist: Add -w/--wakeups option > perf sched timehist: Add call graph options > perf sched timehist: Add -V/--cpu-visual option > perf sched: Add documentation for timehist options > > Namhyung Kim (2): > perf symbol: Print symbol offsets conditionally > perf tools: Support printing callchains with arrows > > tools/perf/Documentation/perf-sched.txt | 46 +- > tools/perf/builtin-sched.c | 914 +++++++++++++++++++++++++++++++- > tools/perf/util/evsel.h | 1 + > tools/perf/util/evsel_fprintf.c | 12 +- > tools/perf/util/symbol.h | 3 +- > tools/perf/util/symbol_fprintf.c | 11 +- > 6 files changed, 972 insertions(+), 15 deletions(-) > > -- > 2.10.1 >