From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by smtp.lore.kernel.org (Postfix) with ESMTP id 77DCFC433EF for ; Mon, 13 Dec 2021 16:10:18 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S240404AbhLMQKR (ORCPT ); Mon, 13 Dec 2021 11:10:17 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:36276 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S234908AbhLMQKQ (ORCPT ); Mon, 13 Dec 2021 11:10:16 -0500 Received: from mail-io1-xd34.google.com (mail-io1-xd34.google.com [IPv6:2607:f8b0:4864:20::d34]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 2158CC061574 for ; Mon, 13 Dec 2021 08:10:16 -0800 (PST) Received: by mail-io1-xd34.google.com with SMTP id q72so19007583iod.12 for ; Mon, 13 Dec 2021 08:10:16 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20210112; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=DtD0mEb50I7aZGSUK6tagzUKH6gP/G5BXylrgwHSko8=; b=jhyho1evvUYVgGY6Y/EMFo0OWrpcH5OEFzSnbhkAVsMmFL/XOxgyu8nmSMBfPGr1IG hDGG4OTb4zjswGYyNLFKYQmcukSigaMIcDiLzc0q+ydMylBN1eH+tabibsoirxorrAnv q9xw3GWbLLHyuwbU8z74zL37KBObCTVa2CQ3KwlIYY9MzxH6hmtH/N5YCYhYz6TRlD7z /30hfbZc0Z7K/J1EtqLIM4zUs1mDYATFiSGgjvijEyLm2H08XZd1l3PiP2rVCT2WkIa/ jeDkmDTXlyhNn7ehw0GvXMmHUA7O+JVLSuwxEJG5sVn+BmAjqeP9CqS0kZcyivUCihPh yb9A== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=DtD0mEb50I7aZGSUK6tagzUKH6gP/G5BXylrgwHSko8=; b=JEMIoSygaL4wPpevdwh4VCapIVtheGJAGzism5tJtASQCYz/9iAnaNALXDQU4rvAMo O/MHox9e0W74U8R48On/jOK4lkdHBcf2SsVztWn7sfk5KSMZlYZlM1pRcY63dc4IMpDQ Zt5xnPsZhWD1gz81+Y4zJARdsPmrVMA2iielQ1Cp9a0pytB5L8T6T0V+mLK5ogSr8LbT pJdpWHQaHFVRUMad7+IFoIYeFx7DmY+BO0fPyOc7LdlMHwMkT65ZWUM8xvI0aOJnDQRn pJiz5gbQrV+y0P21N3JhVFnuf5mFo3LJoAVx2Mi95Ht03qhldEWqFupONjwGIiGY1hoY /b8A== X-Gm-Message-State: AOAM533C04PBBusfqaY1HHwoeK1gy9dobs66Y7JaS6zdSpLN6UuBs5s+ udicOlSSpe+tZ2/T34kfKU/F5sth9TzcKfMUmI7HeA== X-Google-Smtp-Source: ABdhPJxkOBCIc1higoQpTE6i07rWNH/y+S/K4/kf4WrljzAwludPzIqFp8MZyIOrGJkP4CXhASvTjgWV3hkmk9PVaeI= X-Received: by 2002:a05:6638:144f:: with SMTP id l15mr33654639jad.21.1639411815204; Mon, 13 Dec 2021 08:10:15 -0800 (PST) MIME-Version: 1.0 References: <20211208024607.1784932-1-irogers@google.com> In-Reply-To: From: Ian Rogers Date: Mon, 13 Dec 2021 08:10:03 -0800 Message-ID: Subject: Re: [PATCH 00/22] Refactor perf cpumap To: James Clark Cc: eranian@google.com, Andi Kleen , Jiri Olsa , Namhyung Kim , John Garry , Kajol Jain , "Paul A . Clarke" , Arnaldo Carvalho de Melo , Riccardo Mancini , Kan Liang , Peter Zijlstra , Ingo Molnar , Mark Rutland , Alexander Shishkin , linux-perf-users@vger.kernel.org, linux-kernel@vger.kernel.org, Vineet Singh , Mathieu Poirier , Suzuki K Poulose , Mike Leach , Leo Yan , coresight@lists.linaro.org, linux-arm-kernel@lists.infradead.org Content-Type: text/plain; charset="UTF-8" Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, Dec 13, 2021 at 3:39 AM James Clark wrote: > > > > On 08/12/2021 02:45, Ian Rogers wrote: > > Perf cpu map has various functions where a cpumap and index are passed > > in order to load the cpu. A problem with this is that the wrong index > > may be passed for the cpumap, causing problems like aggregation on the > > wrong CPU: > > https://lore.kernel.org/lkml/20211204023409.969668-1-irogers@google.com/ > > > > This patch set refactors the cpu map API, greatly reducing it and > > explicitly passing the cpu (rather than the pair) to functions that > > need it. Comments are added at the same time. > > > > Ian Rogers (22): > > libperf: Add comments to perf_cpu_map. > > perf stat: Add aggr creators that are passed a cpu. > > perf stat: Switch aggregation to use for_each loop > > perf stat: Switch to cpu version of cpu_map__get > > perf cpumap: Switch cpu_map__build_map to cpu function > > perf cpumap: Remove map+index get_socket > > perf cpumap: Remove map+index get_die > > perf cpumap: Remove map+index get_core > > perf cpumap: Remove map+index get_node > > perf cpumap: Add comments to aggr_cpu_id > > perf cpumap: Remove unused cpu_map__socket > > perf cpumap: Simplify equal function name. > > perf cpumap: Rename empty functions. > > perf cpumap: Document cpu__get_node and remove redundant function > > perf cpumap: Remove map from function names that don't use a map. > > perf cpumap: Remove cpu_map__cpu, use libperf function. > > perf cpumap: Refactor cpu_map__build_map > > perf cpumap: Rename cpu_map__get_X_aggr_by_cpu functions > > perf cpumap: Move 'has' function to libperf > > perf cpumap: Add some comments to cpu_aggr_map > > perf cpumap: Trim the cpu_aggr_map > > perf stat: Fix memory leak in check_per_pkg > > > > tools/lib/perf/cpumap.c | 7 +- > > tools/lib/perf/include/internal/cpumap.h | 9 +- > > tools/lib/perf/include/perf/cpumap.h | 1 + > > tools/perf/arch/arm/util/cs-etm.c | 16 +- > > tools/perf/builtin-ftrace.c | 2 +- > > tools/perf/builtin-sched.c | 6 +- > > tools/perf/builtin-stat.c | 273 ++++++++++++----------- > > tools/perf/tests/topology.c | 10 +- > > tools/perf/util/cpumap.c | 182 ++++++--------- > > tools/perf/util/cpumap.h | 102 ++++++--- > > tools/perf/util/cputopo.c | 2 +- > > tools/perf/util/env.c | 6 +- > > tools/perf/util/stat-display.c | 69 +++--- > > tools/perf/util/stat.c | 9 +- > > tools/perf/util/stat.h | 3 +- > > 15 files changed, 361 insertions(+), 336 deletions(-) > > > > For the whole set: > > Reviewed-by: James Clark > > I didn't see any obvious issues with mixing up aggregation modes or CPU/idx types. Also > gave perf stat a test in the different modes and didn't see an issue. > > But I'm wondering if it's possible to go further and add a struct around the CPU int so that the > compiler checks for correctness instead. It still seems quite easy to mix up index and > CPU, for example these functions are subtly different, but both use int: > > LIBPERF_API int perf_cpu_map__cpu(const struct perf_cpu_map *cpus, int idx); > LIBPERF_API bool perf_cpu_map__has(const struct perf_cpu_map *map, int cpu); > > Something like this would make it impossible to make a mistake: > > struct cpu { int cpu }; > > I mean it's more of a coincidence that CPUs can be identified by an integer, but they are more > of an object than an integer, so it could make sense to wrap it. But maybe it could be quite > cumbersome to use and be overkill. Thanks James! I am working on a v2 patch set and will have a go at adding this to the end. Ian