From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id B6A79C433F5 for ; Fri, 22 Oct 2021 08:03:00 +0000 (UTC) Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by mail.kernel.org (Postfix) with ESMTP id 9E43E60FF2 for ; Fri, 22 Oct 2021 08:03:00 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S232290AbhJVIFM (ORCPT ); Fri, 22 Oct 2021 04:05:12 -0400 Received: from mga07.intel.com ([134.134.136.100]:29042 "EHLO mga07.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S231984AbhJVIFK (ORCPT ); Fri, 22 Oct 2021 04:05:10 -0400 X-IronPort-AV: E=McAfee;i="6200,9189,10144"; a="292716084" X-IronPort-AV: E=Sophos;i="5.87,172,1631602800"; d="scan'208";a="292716084" Received: from orsmga005.jf.intel.com ([10.7.209.41]) by orsmga105.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 22 Oct 2021 01:02:53 -0700 X-IronPort-AV: E=Sophos;i="5.87,172,1631602800"; d="scan'208";a="663090522" Received: from xsang-optiplex-9020.sh.intel.com (HELO xsang-OptiPlex-9020) ([10.239.159.41]) by orsmga005-auth.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 22 Oct 2021 01:02:51 -0700 Date: Fri, 22 Oct 2021 16:22:43 +0800 From: Oliver Sang To: Nathan Chancellor Cc: Kees Cook , kernel test robot , LKML , lkp@lists.01.org Subject: Re: [vmlinux.lds.h] d4c6399900: BUG:unable_to_handle_page_fault_for_address Message-ID: <20211022082243.GB23206@xsang-OptiPlex-9020> References: <20210903053159.GA29784@xsang-OptiPlex-9020> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.9.4 (2018-02-28) Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org hi Nathan Chancellor, On Fri, Sep 03, 2021 at 11:51:04AM -0700, Nathan Chancellor wrote: > On Fri, Sep 03, 2021 at 01:31:59PM +0800, kernel test robot wrote: > > > > > > Greeting, > > > > FYI, we noticed the following commit (built with gcc-9): > > > > commit: d4c6399900364facd84c9e35ce1540b6046c345f ("vmlinux.lds.h: Avoid orphan section with !SMP") > > https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master > > > > > > in testcase: trinity > > version: trinity-x86_64-03f10b67-1_20210401 > > with following parameters: > > > > runtime: 300s > > > > test-description: Trinity is a linux system call fuzz tester. > > test-url: http://codemonkey.org.uk/projects/trinity/ > > > > > > on test machine: qemu-system-x86_64 -enable-kvm -cpu SandyBridge -smp 2 -m 16G > > > > caused below changes (please refer to attached dmesg/kmsg for entire log/backtrace): > > > > > > > > If you fix the issue, kindly add following tag > > Reported-by: kernel test robot > > > > > > [ 103.254262] BUG: unable to handle page fault for address: ffffffffbb443040 > > [ 103.255486] #PF: supervisor write access in kernel mode > > [ 103.256427] #PF: error_code(0x0002) - not-present page > > [ 103.257362] PGD 2cec37067 P4D 2cec37067 PUD 2cec38063 PMD 100235063 PTE 800ffffd2f9bc062 > > [ 103.258757] Oops: 0002 [#1] KASAN PTI > > [ 103.259355] CPU: 0 PID: 1 Comm: swapper Not tainted 5.13.0-rc2+ #1 > > [ 103.260390] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.12.0-1 04/01/2014 > > [ 103.261811] RIP: 0010:kvm_guest_apic_eoi_write+0x12/0x90 > > [ 103.262740] Code: 00 48 c7 c7 28 8d b1 ba e8 2b b5 60 00 eb cc 66 0f 1f 84 00 00 00 00 00 53 be 08 00 00 00 48 c7 c7 40 30 44 bb e8 ee b8 60 00 <48> 0f ba 35 95 d0 59 05 00 72 4e 48 c7 c0 80 fc 7f b9 48 ba 00 00 > > [ 103.265736] RSP: 0018:ffffc90000007fc8 EFLAGS: 00010046 > > [ 103.266640] RAX: 0000000000000001 RBX: ffffffffb97ffa40 RCX: ffffffffb5ea5fa2 > > [ 103.267869] RDX: 0000000000000001 RSI: 0000000000000008 RDI: ffffffffbb443040 > > [ 103.269087] RBP: 0000000000000000 R08: 0000000000000001 R09: fffffbfff7688609 > > [ 103.270329] R10: ffffffffbb443047 R11: fffffbfff7688608 R12: 0000000000000000 > > [ 103.271490] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000 > > [ 103.272646] FS: 0000000000000000(0000) GS:ffffffffb9a7f000(0000) knlGS:0000000000000000 > > [ 103.273993] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > > [ 103.274963] CR2: ffffffffbb443040 CR3: 00000002cec34000 CR4: 00000000000406b0 > > [ 103.276150] Call Trace: > > [ 103.276564] > > [ 103.276913] __sysvec_apic_timer_interrupt+0x62/0x370 > > [ 103.277781] sysvec_apic_timer_interrupt+0x62/0x80 > > [ 103.278602] > > [ 103.278984] asm_sysvec_apic_timer_interrupt+0x12/0x20 > > [ 103.279890] RIP: 0010:call_rcu+0xc/0x150 > > [ 103.280559] Code: c7 c7 e0 db 05 ba e8 d3 23 33 02 85 c0 75 cc eb 9f 66 66 2e 0f 1f 84 00 00 00 00 00 66 90 48 b8 00 00 00 00 00 fc ff df 55 53 <48> 89 fb 48 83 c7 08 48 89 fa 48 c1 ea 03 48 83 ec 08 80 3c 02 00 > > [ 103.283542] RSP: 0018:ffffc9000001fdb8 EFLAGS: 00000246 > > [ 103.284404] RAX: dffffc0000000000 RBX: fffff52000003fc6 RCX: 1ffffffff75653d5 > > [ 103.285532] RDX: 1ffff92000003fdd RSI: ffffffffb60b1810 RDI: ffffc9000001fe80 > > [ 103.286712] RBP: ffffc9000001fe80 R08: ffffc9000001fe60 R09: 0000000000000000 > > [ 103.287888] R10: 0000000000000001 R11: ffffc9000001fe90 R12: ffffc9000001fe60 > > [ 103.289058] R13: 0000000000000000 R14: dffffc0000000000 R15: 0000000000000000 > > [ 103.290290] ? rcu_tasks_pregp_step+0x10/0x10 > > [ 103.291029] __wait_rcu_gp+0x160/0x440 > > [ 103.291650] rcu_barrier+0x83/0xc0 > > [ 103.292234] ? poll_state_synchronize_rcu+0x10/0x10 > > [ 103.293040] ? synchronize_rcu+0x80/0x80 > > [ 103.293720] ? lockdep_hardirqs_on_prepare+0x26b/0x3e0 > > [ 103.294554] ? trace_hardirqs_on+0x3d/0x1d0 > > [ 103.295279] ? _vdso_data+0xf80/0xf80 > > [ 103.295919] ? _vdso_data+0xf80/0xf80 > > [ 103.296525] ? free_kernel_image_pages+0xd/0x30 > > [ 103.297307] ? rest_init+0x18e/0x18e > > [ 103.297937] kernel_init+0x20/0x112 > > [ 103.298537] ret_from_fork+0x22/0x30 > > [ 103.299197] Modules linked in: > > [ 103.299740] CR2: ffffffffbb443040 > > [ 103.300304] ---[ end trace 733607da50d3f759 ]--- > > > > > > To reproduce: > > > > > > > > git clone https://github.com/intel/lkp-tests.git > > cd lkp-tests > > bin/lkp qemu -k job-script # job-script is attached in this email > > Hi, > > This command mentions a bzImage but there is not one attached here nor > is the configuration file attached or linked anywhere so I am not really > able to investigate this without either of those :) sorry for late. the original report was based on a randconfig. unfortunately, one env issue caused we lost that config and related kernel images. and there is maybe other issues which caused we didn't attach that config in original report as we usually do. in brief, we cannot reproduce this issue now. sorry for inconvenience. and what you mentioned that there is no bzImage should be caused by same issue which caused no config attached. normally we supply the reproducer about how to build bzImage based on attached config. we are updating our code to avoid such issues in the future. > > Cheers, > Nathan From mboxrd@z Thu Jan 1 00:00:00 1970 Content-Type: multipart/mixed; boundary="===============3180042861833074051==" MIME-Version: 1.0 From: Oliver Sang To: lkp@lists.01.org Subject: Re: [vmlinux.lds.h] d4c6399900: BUG:unable_to_handle_page_fault_for_address Date: Fri, 22 Oct 2021 16:22:43 +0800 Message-ID: <20211022082243.GB23206@xsang-OptiPlex-9020> In-Reply-To: List-Id: --===============3180042861833074051== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: quoted-printable hi Nathan Chancellor, On Fri, Sep 03, 2021 at 11:51:04AM -0700, Nathan Chancellor wrote: > On Fri, Sep 03, 2021 at 01:31:59PM +0800, kernel test robot wrote: > > = > > = > > Greeting, > > = > > FYI, we noticed the following commit (built with gcc-9): > > = > > commit: d4c6399900364facd84c9e35ce1540b6046c345f ("vmlinux.lds.h: Avoid= orphan section with !SMP") > > https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master > > = > > = > > in testcase: trinity > > version: trinity-x86_64-03f10b67-1_20210401 > > with following parameters: > > = > > runtime: 300s > > = > > test-description: Trinity is a linux system call fuzz tester. > > test-url: http://codemonkey.org.uk/projects/trinity/ > > = > > = > > on test machine: qemu-system-x86_64 -enable-kvm -cpu SandyBridge -smp 2= -m 16G > > = > > caused below changes (please refer to attached dmesg/kmsg for entire lo= g/backtrace): > > = > > = > > = > > If you fix the issue, kindly add following tag > > Reported-by: kernel test robot > > = > > = > > [ 103.254262] BUG: unable to handle page fault for address: ffffffffbb= 443040 > > [ 103.255486] #PF: supervisor write access in kernel mode > > [ 103.256427] #PF: error_code(0x0002) - not-present page > > [ 103.257362] PGD 2cec37067 P4D 2cec37067 PUD 2cec38063 PMD 100235063 = PTE 800ffffd2f9bc062 > > [ 103.258757] Oops: 0002 [#1] KASAN PTI > > [ 103.259355] CPU: 0 PID: 1 Comm: swapper Not tainted 5.13.0-rc2+ #1 > > [ 103.260390] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), B= IOS 1.12.0-1 04/01/2014 > > [ 103.261811] RIP: 0010:kvm_guest_apic_eoi_write+0x12/0x90 > > [ 103.262740] Code: 00 48 c7 c7 28 8d b1 ba e8 2b b5 60 00 eb cc 66 0f= 1f 84 00 00 00 00 00 53 be 08 00 00 00 48 c7 c7 40 30 44 bb e8 ee b8 60 00= <48> 0f ba 35 95 d0 59 05 00 72 4e 48 c7 c0 80 fc 7f b9 48 ba 00 00 > > [ 103.265736] RSP: 0018:ffffc90000007fc8 EFLAGS: 00010046 > > [ 103.266640] RAX: 0000000000000001 RBX: ffffffffb97ffa40 RCX: fffffff= fb5ea5fa2 > > [ 103.267869] RDX: 0000000000000001 RSI: 0000000000000008 RDI: fffffff= fbb443040 > > [ 103.269087] RBP: 0000000000000000 R08: 0000000000000001 R09: fffffbf= ff7688609 > > [ 103.270329] R10: ffffffffbb443047 R11: fffffbfff7688608 R12: 0000000= 000000000 > > [ 103.271490] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000= 000000000 > > [ 103.272646] FS: 0000000000000000(0000) GS:ffffffffb9a7f000(0000) kn= lGS:0000000000000000 > > [ 103.273993] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > > [ 103.274963] CR2: ffffffffbb443040 CR3: 00000002cec34000 CR4: 0000000= 0000406b0 > > [ 103.276150] Call Trace: > > [ 103.276564] > > [ 103.276913] __sysvec_apic_timer_interrupt+0x62/0x370 > > [ 103.277781] sysvec_apic_timer_interrupt+0x62/0x80 > > [ 103.278602] > > [ 103.278984] asm_sysvec_apic_timer_interrupt+0x12/0x20 > > [ 103.279890] RIP: 0010:call_rcu+0xc/0x150 > > [ 103.280559] Code: c7 c7 e0 db 05 ba e8 d3 23 33 02 85 c0 75 cc eb 9f= 66 66 2e 0f 1f 84 00 00 00 00 00 66 90 48 b8 00 00 00 00 00 fc ff df 55 53= <48> 89 fb 48 83 c7 08 48 89 fa 48 c1 ea 03 48 83 ec 08 80 3c 02 00 > > [ 103.283542] RSP: 0018:ffffc9000001fdb8 EFLAGS: 00000246 > > [ 103.284404] RAX: dffffc0000000000 RBX: fffff52000003fc6 RCX: 1ffffff= ff75653d5 > > [ 103.285532] RDX: 1ffff92000003fdd RSI: ffffffffb60b1810 RDI: ffffc90= 00001fe80 > > [ 103.286712] RBP: ffffc9000001fe80 R08: ffffc9000001fe60 R09: 0000000= 000000000 > > [ 103.287888] R10: 0000000000000001 R11: ffffc9000001fe90 R12: ffffc90= 00001fe60 > > [ 103.289058] R13: 0000000000000000 R14: dffffc0000000000 R15: 0000000= 000000000 > > [ 103.290290] ? rcu_tasks_pregp_step+0x10/0x10 > > [ 103.291029] __wait_rcu_gp+0x160/0x440 > > [ 103.291650] rcu_barrier+0x83/0xc0 > > [ 103.292234] ? poll_state_synchronize_rcu+0x10/0x10 > > [ 103.293040] ? synchronize_rcu+0x80/0x80 > > [ 103.293720] ? lockdep_hardirqs_on_prepare+0x26b/0x3e0 > > [ 103.294554] ? trace_hardirqs_on+0x3d/0x1d0 > > [ 103.295279] ? _vdso_data+0xf80/0xf80 > > [ 103.295919] ? _vdso_data+0xf80/0xf80 > > [ 103.296525] ? free_kernel_image_pages+0xd/0x30 > > [ 103.297307] ? rest_init+0x18e/0x18e > > [ 103.297937] kernel_init+0x20/0x112 > > [ 103.298537] ret_from_fork+0x22/0x30 > > [ 103.299197] Modules linked in: > > [ 103.299740] CR2: ffffffffbb443040 > > [ 103.300304] ---[ end trace 733607da50d3f759 ]--- > > = > > = > > To reproduce: > > = > > = > > = > > git clone https://github.com/intel/lkp-tests.git > > cd lkp-tests > > bin/lkp qemu -k job-script # job-script is attached i= n this email > = > Hi, > = > This command mentions a bzImage but there is not one attached here nor > is the configuration file attached or linked anywhere so I am not really > able to investigate this without either of those :) sorry for late. the original report was based on a randconfig. unfortunately, one env issue caused we lost that config and related kernel images. and there is maybe other issues which caused we didn't attach that config in original report as we usually do. in brief, we cannot reproduce this issue now. sorry for inconvenience. and what you mentioned that there is no bzImage should be caused by same issue which caused no config attached. normally we supply the reproducer about how to build bzImage based on attached config. we are updating our code to avoid such issues in the future. > = > Cheers, > Nathan --===============3180042861833074051==--