From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-14.0 required=3.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,HEADER_FROM_DIFFERENT_DOMAINS,INCLUDES_CR_TRAILER, INCLUDES_PATCH,MAILING_LIST_MULTI,SPF_HELO_NONE,SPF_PASS autolearn=ham autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id 40AD7C433ED for ; Mon, 26 Apr 2021 09:23:39 +0000 (UTC) Received: from desiato.infradead.org (desiato.infradead.org [90.155.92.199]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPS id 9B2C06100B for ; Mon, 26 Apr 2021 09:23:38 +0000 (UTC) DMARC-Filter: OpenDMARC Filter v1.3.2 mail.kernel.org 9B2C06100B Authentication-Results: mail.kernel.org; dmarc=fail (p=none dis=none) header.from=arm.com Authentication-Results: mail.kernel.org; spf=none smtp.mailfrom=linux-arm-kernel-bounces+linux-arm-kernel=archiver.kernel.org@lists.infradead.org DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=lists.infradead.org; s=desiato.20200630; h=Sender:Content-Transfer-Encoding :Content-Type:List-Subscribe:List-Help:List-Post:List-Archive: List-Unsubscribe:List-Id:In-Reply-To:MIME-Version:References:Message-ID: Subject:Cc:To:From:Date:Reply-To:Content-ID:Content-Description:Resent-Date: Resent-From:Resent-Sender:Resent-To:Resent-Cc:Resent-Message-ID:List-Owner; bh=tz8Zx1/TJY3QFUuyb7c9oo/F8UBNLwDXB2FVfBSERa8=; b=ToG6IYAe6mfQj807B/Gfrmw1q tm+ycSyRuzEUvlHG5QA5TDJtQDYRXugG00VLPvyyZbr/+YF95vPwEZb6wJAtq7lyC/ZDJDuIFCBF8 EWwjo0yahGNf6foMJ/m5t/SOUnV624p145AVApjd9rtS2jD4MJ5YMjVKjQC4wMEqq+jNbvC7+oRiQ PaCZfbs8nDOXCeLKsNfqZTREEhdBOGTTib3PbEa43fyJzguBRXEQwvQV5JxoN6ZM2OSbki6v31w9r pvu01DZ8joNY0XeJuBWaCeNc65muwcrDd+RM5F0MonT2XOoIr52hU31AcuA64I9bcjQ5uWX3FZZoV tdODHE1nQ==; Received: from localhost ([::1] helo=desiato.infradead.org) by desiato.infradead.org with esmtp (Exim 4.94 #2 (Red Hat Linux)) id 1laxRL-007IPO-P9; Mon, 26 Apr 2021 09:22:03 +0000 Received: from bombadil.infradead.org ([2607:7c80:54:e::133]) by desiato.infradead.org with esmtps (Exim 4.94 #2 (Red Hat Linux)) id 1laxRI-007IOn-08 for linux-arm-kernel@desiato.infradead.org; Mon, 26 Apr 2021 09:22:00 +0000 DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=infradead.org; s=bombadil.20210309; h=In-Reply-To:Content-Type:MIME-Version :References:Message-ID:Subject:Cc:To:From:Date:Sender:Reply-To: Content-Transfer-Encoding:Content-ID:Content-Description; bh=ntvncL+vLwR/feAcmjyFJC4kBq9M4r4lyq/U000BmxI=; b=LsxAMxn5fnLHu+QNuRL4rEiYbD VcUQfzDSZZ9WOEfH9bjxzzgNWyjAiQSLZa83xN0jXCBZMbUi132kJP36I4QnwEZoYYz16YHW9dbgZ 8WiBoWAxsEuUk7LRGKQbEtZyDW+8b/b2JSv8ufEQ77fkN3kpEmkDS99gz9c00uAyYNcrnJbTnHY5C q2WEgn37c52ha8TdtdtKIoxNypeZs9UJXAt7nTzo/fPsKWuiHUziWtwpC4YOzVG2KeJx1ZQHBZ20p +UVrKf/PD8YoskVHWDDkvfK0EnvuCnpTafq3uh/tJeGcQOVlIWXpC4DU40zct65LzG9kA7gdQFVxB ksL+1z/Q==; Received: from foss.arm.com ([217.140.110.172]) by bombadil.infradead.org with esmtp (Exim 4.94 #2 (Red Hat Linux)) id 1laxRD-00Fr7N-Ho for linux-arm-kernel@lists.infradead.org; Mon, 26 Apr 2021 09:21:58 +0000 Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.121.207.14]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id BE4CD1FB; Mon, 26 Apr 2021 02:21:47 -0700 (PDT) Received: from C02TD0UTHF1T.local (unknown [10.57.1.6]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id 923253F73B; Mon, 26 Apr 2021 02:21:45 -0700 (PDT) Date: Mon, 26 Apr 2021 10:21:39 +0100 From: Mark Rutland To: Zenghui Yu Cc: linux-arm-kernel@lists.infradead.org, will@kernel.org, elver@google.com, paulmck@kernel.org, peterz@infradead.org, catalin.marinas@arm.com, james.morse@arm.com, dvyukov@google.com, wanghaibin.wang@huawei.com Subject: Re: [PATCHv2 09/11] arm64: entry: fix non-NMI kernel<->kernel transitions Message-ID: <20210426092139.GA16287@C02TD0UTHF1T.local> References: <20201130115950.22492-1-mark.rutland@arm.com> <20201130115950.22492-10-mark.rutland@arm.com> MIME-Version: 1.0 Content-Disposition: inline In-Reply-To: X-CRM114-Version: 20100106-BlameMichelson ( TRE 0.8.0 (BSD) ) MR-646709E3 X-CRM114-CacheID: sfid-20210426_022155_698508_76C4B882 X-CRM114-Status: GOOD ( 33.38 ) X-BeenThere: linux-arm-kernel@lists.infradead.org X-Mailman-Version: 2.1.34 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Sender: "linux-arm-kernel" Errors-To: linux-arm-kernel-bounces+linux-arm-kernel=archiver.kernel.org@lists.infradead.org On Sun, Apr 25, 2021 at 01:29:31PM +0800, Zenghui Yu wrote: > Hi Mark, Hi Zenghui, > On 2020/11/30 19:59, Mark Rutland wrote: > > There are periods in kernel mode when RCU is not watching and/or the > > scheduler tick is disabled, but we can still take exceptions such as > > interrupts. The arm64 exception handlers do not account for this, and > > it's possible that RCU is not watching while an exception handler runs. > > > > The x86/generic entry code handles this by ensuring that all (non-NMI) > > kernel exception handlers call irqentry_enter() and irqentry_exit(), > > which handle RCU, lockdep, and IRQ flag tracing. We can't yet move to > > the generic entry code, and already hadnle the user<->kernel transitions > > elsewhere, so we add new kernel<->kernel transition helpers alog the > > lines of the generic entry code. > > > > Since we now track interrupts becoming masked when an exception is > > taken, local_daif_inherit() is modified to track interrupts becoming > > re-enabled when the original context is inherited. To balance the > > entry/exit paths, each handler masks all DAIF exceptions before > > exit_to_kernel_mode(). > > > > Signed-off-by: Mark Rutland > > Cc: Catalin Marinas > > Cc: James Morse > > Cc: Will Deacon > > [...] > > > +/* > > + * This is intended to match the logic in irqentry_enter(), handling the kernel > > + * mode transitions only. > > + */ > > +static void noinstr enter_from_kernel_mode(struct pt_regs *regs) > > +{ > > + regs->exit_rcu = false; > > + > > + if (!IS_ENABLED(CONFIG_TINY_RCU) && is_idle_task(current)) { > > + lockdep_hardirqs_off(CALLER_ADDR0); > > + rcu_irq_enter(); > > + trace_hardirqs_off_finish(); > > + > > + regs->exit_rcu = true; > > + return; > > + } > > + > > + lockdep_hardirqs_off(CALLER_ADDR0); > > + rcu_irq_enter_check_tick(); > > + trace_hardirqs_off_finish(); > > +} > > Booting a lockdep-enabled kernel with "irqchip.gicv3_pseudo_nmi=1" would > result in splats as below: > > | DEBUG_LOCKS_WARN_ON(!irqs_disabled()) > | WARNING: CPU: 3 PID: 125 at kernel/locking/lockdep.c:4258 > lockdep_hardirqs_off+0xd4/0xe8 > | Modules linked in: > | CPU: 3 PID: 125 Comm: modprobe Tainted: G W 5.12.0-rc8+ > #463 > | Hardware name: QEMU KVM Virtual Machine, BIOS 0.0.0 02/06/2015 > | pstate: 604003c5 (nZCv DAIF +PAN -UAO -TCO BTYPE=--) > | pc : lockdep_hardirqs_off+0xd4/0xe8 > | lr : lockdep_hardirqs_off+0xd4/0xe8 > | sp : ffff80002a39bad0 > | pmr_save: 000000e0 > | x29: ffff80002a39bad0 x28: ffff0000de214bc0 > | x27: ffff0000de1c0400 x26: 000000000049b328 > | x25: 0000000000406f30 x24: ffff0000de1c00a0 > | x23: 0000000020400005 x22: ffff8000105f747c > | x21: 0000000096000044 x20: 0000000000498ef9 > | x19: ffff80002a39bc88 x18: ffffffffffffffff > | x17: 0000000000000000 x16: ffff800011c61eb0 > | x15: ffff800011700a88 x14: 0720072007200720 > | x13: 0720072007200720 x12: 0720072007200720 > | x11: 0720072007200720 x10: 0720072007200720 > | x9 : ffff80002a39bad0 x8 : ffff80002a39bad0 > | x7 : ffff8000119f0800 x6 : c0000000ffff7fff > | x5 : ffff8000119f07a8 x4 : 0000000000000001 > | x3 : 9bcdab23f2432800 x2 : ffff800011730538 > | x1 : 9bcdab23f2432800 x0 : 0000000000000000 > | Call trace: > | lockdep_hardirqs_off+0xd4/0xe8 > | enter_from_kernel_mode.isra.5+0x7c/0xa8 > | el1_abort+0x24/0x100 > | el1_sync_handler+0x80/0xd0 > | el1_sync+0x6c/0x100 > | __arch_clear_user+0xc/0x90 > | load_elf_binary+0x9fc/0x1450 > | bprm_execve+0x404/0x880 > | kernel_execve+0x180/0x188 > | call_usermodehelper_exec_async+0xdc/0x158 > | ret_from_fork+0x10/0x18 > > The code that triggers the splat is lockdep_hardirqs_off+0xd4/0xe8: > > | /* > | * So we're supposed to get called after you mask local IRQs, but for > | * some reason the hardware doesn't quite think you did a proper job. > | */ > | if (DEBUG_LOCKS_WARN_ON(!irqs_disabled())) > | return; > > which looks like a false positive as DAIF are all masked on taking > an synchronous exception and hardirqs are therefore disabled. With > pseudo NMI used, irqs_disabled() takes the value of ICC_PMR_EL1 as > the interrupt enable state, which is GIC_PRIO_IRQON (0xe0) in this > case and doesn't help much. Not dig further though. Thanks for this report. I think I understand the problem. In some paths (e.g. el1_dbg, el0_svc) we update the PMR with (GIC_PRIO_IRQON | GIC_PRIO_PSR_I_SET) before we notify lockdep, but in others (e.g. el1_abort) we do not. The case where we do not are where lockdep will warn, since IRQs will be masked by DAIF but not the PMR, as you describe above. With the current PMR management scheme, we'll need to consistently update the PMR earlier in the entry code. Does the below diff help? Thanks, Mark. ---->8---- diff --git a/arch/arm64/kernel/entry-common.c b/arch/arm64/kernel/entry-common.c index 9d3588450473..117412bae915 100644 --- a/arch/arm64/kernel/entry-common.c +++ b/arch/arm64/kernel/entry-common.c @@ -226,14 +226,6 @@ static void noinstr el1_dbg(struct pt_regs *regs, unsigned long esr) { unsigned long far = read_sysreg(far_el1); - /* - * The CPU masked interrupts, and we are leaving them masked during - * do_debug_exception(). Update PMR as if we had called - * local_daif_mask(). - */ - if (system_uses_irq_prio_masking()) - gic_write_pmr(GIC_PRIO_IRQON | GIC_PRIO_PSR_I_SET); - arm64_enter_el1_dbg(regs); if (!cortex_a76_erratum_1463225_debug_handler(regs)) do_debug_exception(far, esr, regs); @@ -398,9 +390,6 @@ static void noinstr el0_dbg(struct pt_regs *regs, unsigned long esr) /* Only watchpoints write FAR_EL1, otherwise its UNKNOWN */ unsigned long far = read_sysreg(far_el1); - if (system_uses_irq_prio_masking()) - gic_write_pmr(GIC_PRIO_IRQON | GIC_PRIO_PSR_I_SET); - enter_from_user_mode(); do_debug_exception(far, esr, regs); local_daif_restore(DAIF_PROCCTX_NOIRQ); @@ -408,9 +397,6 @@ static void noinstr el0_dbg(struct pt_regs *regs, unsigned long esr) static void noinstr el0_svc(struct pt_regs *regs) { - if (system_uses_irq_prio_masking()) - gic_write_pmr(GIC_PRIO_IRQON | GIC_PRIO_PSR_I_SET); - enter_from_user_mode(); cortex_a76_erratum_1463225_svc_handler(); do_el0_svc(regs); @@ -486,9 +472,6 @@ static void noinstr el0_cp15(struct pt_regs *regs, unsigned long esr) static void noinstr el0_svc_compat(struct pt_regs *regs) { - if (system_uses_irq_prio_masking()) - gic_write_pmr(GIC_PRIO_IRQON | GIC_PRIO_PSR_I_SET); - enter_from_user_mode(); cortex_a76_erratum_1463225_svc_handler(); do_el0_svc_compat(regs); diff --git a/arch/arm64/kernel/entry.S b/arch/arm64/kernel/entry.S index 6acfc5e6b5e0..7d46c74a8706 100644 --- a/arch/arm64/kernel/entry.S +++ b/arch/arm64/kernel/entry.S @@ -292,6 +292,8 @@ alternative_else_nop_endif alternative_if ARM64_HAS_IRQ_PRIO_MASKING mrs_s x20, SYS_ICC_PMR_EL1 str x20, [sp, #S_PMR_SAVE] + orr x20, x20, #GIC_PRIO_PSR_I_SET + msr_s SYS_ICC_PMR_EL1, x20 alternative_else_nop_endif /* Re-enable tag checking (TCO set on exception entry) */ _______________________________________________ linux-arm-kernel mailing list linux-arm-kernel@lists.infradead.org http://lists.infradead.org/mailman/listinfo/linux-arm-kernel