From: Rob Herring
Date: Thu, 1 Apr 2021 14:45:21 -0500
Subject: Re: [PATCH v6 02/10] arm64: perf: Enable PMU counter direct access for perf event
To: Will Deacon
Cc: Catalin Marinas, Peter Zijlstra, Ingo Molnar,
    Arnaldo Carvalho de Melo, Jiri Olsa, Mark Rutland, Ian Rogers,
    Alexander Shishkin, Honnappa Nagarahalli, Zachary.Leaf@arm.com,
    Raphael Gault, Jonathan Cameron, Namhyung Kim, Itaru Kitayama,
    linux-arm-kernel, linux-kernel@vger.kernel.org
In-Reply-To: <20210331160059.GD7815@willie-the-truck>

On Wed, Mar 31, 2021 at 11:01 AM Will Deacon wrote:
>
> On Tue, Mar 30, 2021 at 12:09:38PM -0500, Rob Herring wrote:
> > On Tue, Mar 30, 2021 at 10:31 AM Will Deacon wrote:
> > >
> > > On Wed, Mar 10, 2021 at 05:08:29PM -0700, Rob Herring wrote:
> > > > From: Raphael Gault
> > > >
> > > > Keep track of events opened with direct access to the hardware
> > > > counters and modify permissions while they are open.
> > > >
> > > > The strategy used here is the same one x86 uses: every time an
> > > > event is mapped, the permissions are set if required. The atomic
> > > > field added in the mm_context helps keep track of the number of
> > > > events opened and de-activates the permissions when all are
> > > > unmapped. We also need to update the permissions in the context
> > > > switch code so that tasks keep the right permissions.
> > > >
> > > > In order to enable 64-bit counters for userspace when available,
> > > > a new config1 bit is added for userspace to indicate it wants
> > > > userspace counter access. This bit allows the kernel to decide if
> > > > chaining should be disabled, as chaining and userspace access are
> > > > incompatible. The modes for config1 are as follows:
> > > >
> > > > config1 = 0 or 2 : user access enabled and always 32-bit
> > > > config1 = 1 : user access disabled and always 64-bit (using chaining if needed)
> > > > config1 = 3 : user access enabled and counter size matches underlying counter.
> > >
> > > In this last case, how does userspace know whether it got a 32-bit
> > > or a 64-bit counter?
> >
> > pmc_width in the user page. If the read loop is correct, then it
> > doesn't matter to the user.
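
To expand on that: the read loop is the standard seqlock pattern
documented for perf_event_mmap_page, and the sign extension by
pmc_width is what makes the counter width transparent. Roughly the
following untested sketch (the PMSELR_EL0/PMXEVCNTR_EL0 helper and the
'index - 1' mapping are illustrative, and the cycle counter would need
PMCCNTR_EL0 instead):

#include <stdint.h>
#include <linux/perf_event.h>

#define barrier() asm volatile("" : : : "memory")

/* Illustrative arm64 helper: select an event counter with PMSELR_EL0,
 * then read it via PMXEVCNTR_EL0. This only works once EL0 access is
 * enabled in PMUSERENR_EL0, which is the point of this patch.
 */
static inline uint64_t read_hw_counter(uint32_t counter)
{
	uint64_t val;

	asm volatile("msr pmselr_el0, %0" : : "r"((uint64_t)counter));
	asm volatile("isb");
	asm volatile("mrs %0, pmxevcntr_el0" : "=r"(val));
	return val;
}

static uint64_t read_user_counter(struct perf_event_mmap_page *pc)
{
	uint32_t seq, idx, width;
	uint64_t count;
	int64_t pmc;

	do {
		seq = pc->lock;
		barrier();
		idx = pc->index;	/* 0 means no counter assigned */
		count = pc->offset;
		if (pc->cap_user_rdpmc && idx) {
			width = pc->pmc_width;
			pmc = read_hw_counter(idx - 1);
			/* Arithmetic shifts sign-extend, so a 32-bit
			 * counter value folds into the 64-bit sum
			 * correctly whatever the width.
			 */
			pmc <<= 64 - width;
			pmc >>= 64 - width;
			count += pmc;
		}
		barrier();
	} while (pc->lock != seq);

	return count;
}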
> Gotcha; please mention that in this comment.
>
> > > > +static void refresh_pmuserenr(void *mm)
> > > > +{
> > > > +	if (mm == current->active_mm)
> > > > +		perf_switch_user_access(mm);
> > > > +}
> > > > +
> > > > +static void armv8pmu_event_mapped(struct perf_event *event, struct mm_struct *mm)
> > > > +{
> > > > +	struct arm_pmu *armpmu = to_arm_pmu(event->pmu);
> > > > +
> > > > +	if (!(event->hw.flags & ARMPMU_EL0_RD_CNTR))
> > > > +		return;
> > > > +
> > > > +	/*
> > > > +	 * This function relies on not being called concurrently in two
> > > > +	 * tasks in the same mm. Otherwise one task could observe
> > > > +	 * pmu_direct_access > 1 and return all the way back to
> > > > +	 * userspace with user access disabled while another task is
> > > > +	 * still doing on_each_cpu_mask() to enable user access.
> > > > +	 *
> > > > +	 * For now, this can't happen because all callers hold mmap_lock
> > > > +	 * for write. If this changes, we'll need a different solution.
> > > > +	 */
> > > > +	lockdep_assert_held_write(&mm->mmap_lock);
> > > > +
> > > > +	if (atomic_inc_return(&mm->context.pmu_direct_access) == 1)
> > > > +		on_each_cpu_mask(&armpmu->supported_cpus, refresh_pmuserenr, mm, 1);
> > > > +}
> > >
> > > Why do we need to cross-call here? Seems like it would be a tonne
> > > simpler to handle the trap. Is there a reason not to do that?
> >
> > Alignment with x86? You didn't like the trap handler either:
> >
> > > Hmm... this feels pretty fragile since, although we may expect
> > > userspace only to trigger this in the context of the specific perf
> > > use-case, we don't have a way to detect that, so the ABI we're
> > > exposing is that EL0 accesses to non-existent counters will return
> > > 0. I don't really think that's something we want to commit to.
> >
> > Full mail: https://lore.kernel.org/r/20200928182601.GA11974@willie-the-truck
> >
> > The handler would be different, but we'd still have the problem of
> > not being able to detect the use case, right?
>
> Well hang on, the thing you link to was about _emulating_ the access.
> I'm not saying we should do that, but rather just enable EL0 PMU
> access lazily instead of using the IPI. We're going to enable it when
> we context-switch anyway.
>
> The alternative is enabling EL0 access unconditionally and leaving it
> enabled.

That BTW is what x86 originally did. This pmu_direct_access stuff came
about as part of locking down access a bit.

In any case, I've modified this to trap. It avoids the mmap_lock
dependency too, so that's nice.
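
What I have now is roughly the following (a hand-wavy sketch; the
function name and the undef-hook plumbing are illustrative, not the
actual patch): the first EL0 counter access traps, and the handler
enables access only if the mm has user-accessible events mapped, so
there's no IPI and no mmap_lock dependency:

/* Illustrative sketch only: assumes a hook registered for EL0 traps
 * on the PMU counter registers.
 */
static int armv8pmu_user_access_trap_handler(struct pt_regs *regs, u32 insn)
{
	struct mm_struct *mm = current->mm;

	/* No user-accessible events mmapped in this mm: a real undef. */
	if (!mm || !atomic_read(&mm->context.pmu_direct_access))
		return -EINVAL;		/* let the core deliver SIGILL */

	/* Grant EL0 counter read access; the faulting instruction is
	 * retried and now succeeds.
	 */
	write_sysreg(ARMV8_PMU_USERENR_ER | ARMV8_PMU_USERENR_CR,
		     pmuserenr_el0);
	return 0;
}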
> > > > +
> > > > +static void armv8pmu_event_unmapped(struct perf_event *event, struct mm_struct *mm)
> > > > +{
> > > > +	struct arm_pmu *armpmu = to_arm_pmu(event->pmu);
> > > > +
> > > > +	if (!(event->hw.flags & ARMPMU_EL0_RD_CNTR))
> > > > +		return;
> > > > +
> > > > +	if (atomic_dec_and_test(&mm->context.pmu_direct_access))
> > > > +		on_each_cpu_mask(&armpmu->supported_cpus, refresh_pmuserenr, mm, 1);
> > >
> > > Given that the pmu_direct_access field is global per-mm, won't this
> > > go wrong if multiple PMUs are opened by the same process but only a
> > > subset are exposed to EL0? Perhaps pmu_direct_access should be
> > > treated as a mask rather than a counter, so that we can 'and' it
> > > with the supported_cpus for the PMU we're dealing with.
> >
> > It needs to be a count to support multiple events on the same PMU.
> > If the event is not enabled for EL0, then we'd exit out on the
> > ARMPMU_EL0_RD_CNTR check. So I think we're fine.
>
> I'm still not convinced; pmu_direct_access is shared between PMUs, so
> testing the result of atomic_dec_and_test() just doesn't make sense to
> me, as another PMU could be playing with the count.

How is that a problem? Let's make a concrete example:

map PMU1:event2   -> pmu_direct_access = 1 -> enable access
map PMU2:event3   -> pmu_direct_access = 2
map PMU1:event4   -> pmu_direct_access = 3
unmap PMU2:event3 -> pmu_direct_access = 2
unmap PMU1:event2 -> pmu_direct_access = 1
unmap PMU1:event4 -> pmu_direct_access = 0 -> disable access

The only issue I can see is that PMU2 remains enabled for user access
until the last unmap. But we're sharing the mm, so who cares? Also, in
this scenario it is the user's problem to pin themselves to cores
sharing a PMU. If the user doesn't do that, they get to keep the
pieces.

We have to have something that says 'we have mmapped user-accessible
events for the mm context'. I don't think there is any way around that
if we gate access on mmapping events.
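
For reference, the user-side sequence this count is tracking is just
perf_event_open() plus mmap() of the event: the mmap is what lands in
armv8pmu_event_mapped(), and munmap()/close() is what lands in the
unmapped hook. A minimal sketch (error handling trimmed; config1 = 2
requests 32-bit user access per the modes above):

#include <string.h>
#include <sys/mman.h>
#include <sys/syscall.h>
#include <unistd.h>
#include <linux/perf_event.h>

int open_and_map(struct perf_event_mmap_page **pc)
{
	struct perf_event_attr attr;
	int fd;

	memset(&attr, 0, sizeof(attr));
	attr.size = sizeof(attr);
	attr.type = PERF_TYPE_HARDWARE;
	attr.config = PERF_COUNT_HW_CPU_CYCLES;
	attr.config1 = 2;		/* user access, 32-bit counter */
	attr.exclude_kernel = 1;

	fd = syscall(SYS_perf_event_open, &attr, 0, -1, -1, 0);
	if (fd < 0)
		return -1;

	/* Mapping the event is what bumps pmu_direct_access. */
	*pc = mmap(NULL, sysconf(_SC_PAGESIZE), PROT_READ, MAP_SHARED,
		   fd, 0);
	if (*pc == MAP_FAILED) {
		close(fd);
		return -1;
	}

	/* The read loop sketched earlier then checks cap_user_rdpmc
	 * before touching the counter registers directly.
	 */
	return fd;
}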
Rob