From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by smtp.lore.kernel.org (Postfix) with ESMTP id 6A3DFC43334 for ; Wed, 22 Jun 2022 14:49:39 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1358693AbiFVOti (ORCPT ); Wed, 22 Jun 2022 10:49:38 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:46752 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1358184AbiFVOtL (ORCPT ); Wed, 22 Jun 2022 10:49:11 -0400 Received: from frasgout.his.huawei.com (frasgout.his.huawei.com [185.176.79.56]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 4A1323DDE5 for ; Wed, 22 Jun 2022 07:49:08 -0700 (PDT) Received: from fraeml738-chm.china.huawei.com (unknown [172.18.147.201]) by frasgout.his.huawei.com (SkyGuard) with ESMTP id 4LSmV00NqVz689nQ; Wed, 22 Jun 2022 22:48:40 +0800 (CST) Received: from lhreml724-chm.china.huawei.com (10.201.108.75) by fraeml738-chm.china.huawei.com (10.206.15.219) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256) id 15.1.2375.24; Wed, 22 Jun 2022 16:49:06 +0200 Received: from [10.202.227.197] (10.202.227.197) by lhreml724-chm.china.huawei.com (10.201.108.75) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256) id 15.1.2375.24; Wed, 22 Jun 2022 15:49:05 +0100 Message-ID: Date: Wed, 22 Jun 2022 15:49:02 +0100 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:91.0) Gecko/20100101 Thunderbird/91.6.1 Subject: Re: [aarch64] INFO: rcu_sched detected expedited stalls on CPUs/tasks To: Bruno Goncalves CC: Pierre Gondois , , LKML , CKI Project , Ionela Voinescu , Dietmar Eggemann References: <99a207dc-93cd-1bea-2ffc-404a9f6587bf@arm.com> <90175f7e-0a2f-c83d-6fb5-916f885bbe81@huawei.com> From: John Garry In-Reply-To: Content-Type: text/plain; charset="UTF-8"; format=flowed Content-Transfer-Encoding: 7bit X-Originating-IP: [10.202.227.197] X-ClientProxiedBy: lhreml734-chm.china.huawei.com (10.201.108.85) To lhreml724-chm.china.huawei.com (10.201.108.75) X-CFilter-Loop: Reflected Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 22/06/2022 15:40, Bruno Goncalves wrote: Hi Bruno, > With the config change that does not set > CONFIG_RCU_EXP_CPU_STALL_TIMEOUT the problem seems to be fixed for us. > The newer Fedora kernels already have the config fixed. > OK, thanks for the info. Well those debug options I enabled didn't cause problems previously. I'll see if it is one in particular and go from there. > >> On v5.19-rc3 I just enabled some debug configs on a vanilla kernel and >> can easily reproduce a RCU stall on boot, as below. >> >> CONFIG_RCU_EXP_CPU_STALL_TIMEOUT=0 for me, that being the default. >> >> Table To iBMC Success. >> GetVariable Status : Not Found. >> [ 0.000000] Booting Linux on physical CPU 0x0000010000 [0x410fd082] >> [ 0.000000] Linux version 5.19.0-rc3-00001-gd8610c1c16e8 >> (john@debian) (gcc (Debian 10.2.1-6) 10.2.1 20210110, GNU ld (GNU >> Binutils for Debian) 2.37) #187 SMP PREEMPT Wed Jun 22 14:08:56 BST 2022 >> [ 0.000000] Machine model: Hisilicon PhosphorV660 Development Board >> [ 0.000000] efi: EFI v2.60 by EDK II >> [ 0.000000] efi: SMBIOS=0x3eff0000 SMBIOS 3.0=0x39aa0000 >> ACPI=0x39b70000 ACPI 2.0=0x39b70014 MEMATTR=0x3b8d0018 >> MEMRESERVE=0x3a002d18 Thanks! From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from bombadil.infradead.org (bombadil.infradead.org [198.137.202.133]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.lore.kernel.org (Postfix) with ESMTPS id B78BEC43334 for ; Wed, 22 Jun 2022 14:50:13 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=lists.infradead.org; s=bombadil.20210309; h=Sender:Content-Type: Content-Transfer-Encoding:List-Subscribe:List-Help:List-Post:List-Archive: List-Unsubscribe:List-Id:In-Reply-To:From:References:CC:To:Subject: MIME-Version:Date:Message-ID:Reply-To:Content-ID:Content-Description: Resent-Date:Resent-From:Resent-Sender:Resent-To:Resent-Cc:Resent-Message-ID: List-Owner; bh=uvUEBcLdN2srWPA+xCwEFk/oVc7+SRO7DfcrFrmGLgI=; b=KRRcb+kA7Sp9uA doSCJvxQMYjWtV/5/soR6J+BpAUkzYnvVUqAWG9MMWVin5PvcUGgtU+oijW41pIY1u2cBsUCdOX9s ctSwgCklImN8Ofz/UQhNA2y7242ecFuCs4UOjpNm+yV+E9MAp+DLh7/J4MBWK7VvPm3KLTgTzczf2 IYV5dVDjb0XprNaLz+F93XHUBiagCFRedLMzRstz819RgFvhrqmvt1CNEwLUeVLLEIo4QZreGvxCT INzFo2HzRUVcJAsKQuvhNmD5luKIgpJ9+KhC0WzF/4hFR3mNlMMJE72k+HLpjsESkFYNxMg/nS0u4 4ibbINo2cA+jAzyLgJng==; Received: from localhost ([::1] helo=bombadil.infradead.org) by bombadil.infradead.org with esmtp (Exim 4.94.2 #2 (Red Hat Linux)) id 1o41fQ-00B0W3-6o; Wed, 22 Jun 2022 14:49:16 +0000 Received: from frasgout.his.huawei.com ([185.176.79.56]) by bombadil.infradead.org with esmtps (Exim 4.94.2 #2 (Red Hat Linux)) id 1o41fM-00B0T7-3n for linux-arm-kernel@lists.infradead.org; Wed, 22 Jun 2022 14:49:14 +0000 Received: from fraeml738-chm.china.huawei.com (unknown [172.18.147.201]) by frasgout.his.huawei.com (SkyGuard) with ESMTP id 4LSmV00NqVz689nQ; Wed, 22 Jun 2022 22:48:40 +0800 (CST) Received: from lhreml724-chm.china.huawei.com (10.201.108.75) by fraeml738-chm.china.huawei.com (10.206.15.219) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256) id 15.1.2375.24; Wed, 22 Jun 2022 16:49:06 +0200 Received: from [10.202.227.197] (10.202.227.197) by lhreml724-chm.china.huawei.com (10.201.108.75) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256) id 15.1.2375.24; Wed, 22 Jun 2022 15:49:05 +0100 Message-ID: Date: Wed, 22 Jun 2022 15:49:02 +0100 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:91.0) Gecko/20100101 Thunderbird/91.6.1 Subject: Re: [aarch64] INFO: rcu_sched detected expedited stalls on CPUs/tasks To: Bruno Goncalves CC: Pierre Gondois , , LKML , CKI Project , Ionela Voinescu , Dietmar Eggemann References: <99a207dc-93cd-1bea-2ffc-404a9f6587bf@arm.com> <90175f7e-0a2f-c83d-6fb5-916f885bbe81@huawei.com> From: John Garry In-Reply-To: X-Originating-IP: [10.202.227.197] X-ClientProxiedBy: lhreml734-chm.china.huawei.com (10.201.108.85) To lhreml724-chm.china.huawei.com (10.201.108.75) X-CFilter-Loop: Reflected X-CRM114-Version: 20100106-BlameMichelson ( TRE 0.8.0 (BSD) ) MR-646709E3 X-CRM114-CacheID: sfid-20220622_074912_384704_4294FADA X-CRM114-Status: UNSURE ( 7.87 ) X-CRM114-Notice: Please train this message. X-BeenThere: linux-arm-kernel@lists.infradead.org X-Mailman-Version: 2.1.34 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Content-Transfer-Encoding: 7bit Content-Type: text/plain; charset="us-ascii"; Format="flowed" Sender: "linux-arm-kernel" Errors-To: linux-arm-kernel-bounces+linux-arm-kernel=archiver.kernel.org@lists.infradead.org On 22/06/2022 15:40, Bruno Goncalves wrote: Hi Bruno, > With the config change that does not set > CONFIG_RCU_EXP_CPU_STALL_TIMEOUT the problem seems to be fixed for us. > The newer Fedora kernels already have the config fixed. > OK, thanks for the info. Well those debug options I enabled didn't cause problems previously. I'll see if it is one in particular and go from there. > >> On v5.19-rc3 I just enabled some debug configs on a vanilla kernel and >> can easily reproduce a RCU stall on boot, as below. >> >> CONFIG_RCU_EXP_CPU_STALL_TIMEOUT=0 for me, that being the default. >> >> Table To iBMC Success. >> GetVariable Status : Not Found. >> [ 0.000000] Booting Linux on physical CPU 0x0000010000 [0x410fd082] >> [ 0.000000] Linux version 5.19.0-rc3-00001-gd8610c1c16e8 >> (john@debian) (gcc (Debian 10.2.1-6) 10.2.1 20210110, GNU ld (GNU >> Binutils for Debian) 2.37) #187 SMP PREEMPT Wed Jun 22 14:08:56 BST 2022 >> [ 0.000000] Machine model: Hisilicon PhosphorV660 Development Board >> [ 0.000000] efi: EFI v2.60 by EDK II >> [ 0.000000] efi: SMBIOS=0x3eff0000 SMBIOS 3.0=0x39aa0000 >> ACPI=0x39b70000 ACPI 2.0=0x39b70014 MEMATTR=0x3b8d0018 >> MEMRESERVE=0x3a002d18 Thanks! _______________________________________________ linux-arm-kernel mailing list linux-arm-kernel@lists.infradead.org http://lists.infradead.org/mailman/listinfo/linux-arm-kernel