From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-5.3 required=3.0 tests=BAYES_00, HEADER_FROM_DIFFERENT_DOMAINS,MAILING_LIST_MULTI,NICE_REPLY_A,SPF_HELO_NONE, SPF_PASS,USER_AGENT_SANE_1 autolearn=no autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id CBE71C433B4 for ; Thu, 6 May 2021 14:17:48 +0000 (UTC) Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by mail.kernel.org (Postfix) with ESMTP id 976E761075 for ; Thu, 6 May 2021 14:17:48 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S234561AbhEFOSp (ORCPT ); Thu, 6 May 2021 10:18:45 -0400 Received: from mga06.intel.com ([134.134.136.31]:48746 "EHLO mga06.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S233737AbhEFOSn (ORCPT ); Thu, 6 May 2021 10:18:43 -0400 IronPort-SDR: 33ZLxfnA7o8IDcgkDqHAkDvQtilmnt//Uthm41OKf/AmlSinh7oQtYbEpTeDJP9FrtaUbe3AR8 t4QFmyTTNrnA== X-IronPort-AV: E=McAfee;i="6200,9189,9976"; a="259761859" X-IronPort-AV: E=Sophos;i="5.82,277,1613462400"; d="scan'208";a="259761859" Received: from orsmga004.jf.intel.com ([10.7.209.38]) by orsmga104.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 06 May 2021 07:17:18 -0700 IronPort-SDR: hn02Kj8lMN1tRBnSxSDHsBy1bNo+nrUylKglaI8arSXJDe0mBho9E15Cs/Nm/LWV3A4FDH0h7H e2AN+d5DHQuQ== X-IronPort-AV: E=Sophos;i="5.82,277,1613462400"; d="scan'208";a="539970015" Received: from akleen-mobl1.amr.corp.intel.com (HELO [10.212.219.76]) ([10.212.219.76]) by orsmga004-auth.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 06 May 2021 07:17:18 -0700 Subject: Re: [PATCH v5 00/20] Introduce threaded trace streaming for basic perf record operation To: Namhyung Kim , Alexey Bayduraev Cc: Arnaldo Carvalho de Melo , Jiri Olsa , Alexander Shishkin , Peter Zijlstra , Ingo Molnar , linux-kernel , Adrian Hunter , Alexander Antonov , Alexei Budankov References: From: Andi Kleen Message-ID: <4926ae4f-b14b-1048-229b-0b789204c192@linux.intel.com> Date: Thu, 6 May 2021 07:17:17 -0700 User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:78.0) Gecko/20100101 Thunderbird/78.10.0 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 7bit Content-Language: en-US Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 5/5/2021 11:20 PM, Namhyung Kim wrote: > > Do you have an idea how to improve it? > > I have to say again that I don't like merely adding more threads to > record. Yeah, parallelizing the perf record is good, but we have to > think about the perf report (and others) too. perf report/script can be already parallelized with --time xx/x% and a simple shell script that runs multiple processes. While that's a bit awkward for interactive use it works fine for scripting. I use it all the time for PT batch processing for example. The real bottleneck we have is really record on systems with many CPUs (which are more and more common), and that can only be fixed with some variant of this patch kit. -Andi