From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753475AbbCJOZe (ORCPT ); Tue, 10 Mar 2015 10:25:34 -0400 Received: from mail-ig0-f182.google.com ([209.85.213.182]:41752 "EHLO mail-ig0-f182.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752798AbbCJOZd (ORCPT ); Tue, 10 Mar 2015 10:25:33 -0400 Message-ID: <54FEFED7.5000503@gmail.com> Date: Tue, 10 Mar 2015 08:25:27 -0600 From: David Ahern User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.10; rv:31.0) Gecko/20100101 Thunderbird/31.5.0 MIME-Version: 1.0 To: Namhyung Kim , Arnaldo Carvalho de Melo CC: Adrian Hunter , Jiri Olsa , Ingo Molnar , =?UTF-8?B?RnLDqWTDqXJpYyBXZWlzYmVja2Vy?= , Linux Kernel Mailing List Subject: Re: heads up/RFC: 'perf trace' using ordered_events References: <20150303164940.GM5187@kernel.org> <20150304010123.GG27046@danjae> <20150304010709.GQ5187@kernel.org> <54FAF2E4.3030109@intel.com> <20150309132135.GG5187@kernel.org> <20150310060644.GC943@sejong> In-Reply-To: <20150310060644.GC943@sejong> Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 3/10/15 12:06 AM, Namhyung Kim wrote: > Hi Arnaldo, > > On Mon, Mar 09, 2015 at 10:21:35AM -0300, Arnaldo Carvalho de Melo wrote: >> For trace I need to take advantage of the fact that each mmap is ordered >> already and then just sort by the timestamp in the mmap head, etc. >> >> In retrospect, the perf.data file should have kept that ordering, i.e. >> have one file per mmap, that would be saved in parallel, without any of >> those PERF_RECORD_FINISHED_ROUND records. >> >> But I have to experiment with that, leaving the existing code around to >> deal with older files. > > It seems like what you said is almost same as my multi-thread work. > It saves data files per mmap and then merges them with an index > table so that they can be processed in parallel. I think you and Jiri both have worked on saving a file per mmap. Where does that stand? I would like to try it out. With the 1024 cpu systems I am seeing 5GB files in 1 second runs and perf is not handling it well. The perf-script/perf-report (stdio) will 'hang' for 45 minutes munging through the file. I have to connect gdb from time to time to verify it is making progress (file_offset is increasing). I believe what happens is that there is 'no round' -- it has to process all mmaps (1024 cpus and 6 or 7 events) through the ordered events queue before it can push out results. I need to look at that once I figure out the task scheduler problem. David