From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by smtp.lore.kernel.org (Postfix) with ESMTP id 00AECEB64DD for ; Wed, 5 Jul 2023 06:47:00 +0000 (UTC) Received: by kanga.kvack.org (Postfix) id 846D38D0001; Wed, 5 Jul 2023 02:47:00 -0400 (EDT) Received: by kanga.kvack.org (Postfix, from userid 40) id 7F6856B0072; Wed, 5 Jul 2023 02:47:00 -0400 (EDT) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id 697838D0001; Wed, 5 Jul 2023 02:47:00 -0400 (EDT) X-Delivered-To: linux-mm@kvack.org Received: from relay.hostedemail.com (smtprelay0016.hostedemail.com [216.40.44.16]) by kanga.kvack.org (Postfix) with ESMTP id 56A896B0071 for ; Wed, 5 Jul 2023 02:47:00 -0400 (EDT) Received: from smtpin16.hostedemail.com (a10.router.float.18 [10.200.18.1]) by unirelay10.hostedemail.com (Postfix) with ESMTP id 20B5AC07D3 for ; Wed, 5 Jul 2023 06:47:00 +0000 (UTC) X-FDA: 80976625800.16.4E3B1D0 Received: from mail-yb1-f174.google.com (mail-yb1-f174.google.com [209.85.219.174]) by imf07.hostedemail.com (Postfix) with ESMTP id 2C46640009 for ; Wed, 5 Jul 2023 06:46:57 +0000 (UTC) Authentication-Results: imf07.hostedemail.com; dkim=pass header.d=google.com header.s=20221208 header.b=gA0dXeNN; dmarc=pass (policy=reject) header.from=google.com; spf=pass (imf07.hostedemail.com: domain of surenb@google.com designates 209.85.219.174 as permitted sender) smtp.mailfrom=surenb@google.com ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=hostedemail.com; s=arc-20220608; t=1688539618; h=from:from:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references:dkim-signature; bh=YExDtaOFNAIXQFQh4fxwxBinPAmwx4eECBeCM3cJOGg=; b=jYOza9xTLNqlsm6hBNlhe1AbeEdfb+EMuWBEIozPr8M5JJemGWW3tvg4nNOL+oWXEQogxG otvl5vJzADhyKU9aIBQY3DcYamUanJ/jGc8j5d+QTpJzqrPypy6Lui1kfPZb5/9vvGKqLp OQZAZ66IlOn5JsfQX1NCjuYE2k8vWRg= ARC-Authentication-Results: i=1; imf07.hostedemail.com; dkim=pass header.d=google.com header.s=20221208 header.b=gA0dXeNN; dmarc=pass (policy=reject) header.from=google.com; spf=pass (imf07.hostedemail.com: domain of surenb@google.com designates 209.85.219.174 as permitted sender) smtp.mailfrom=surenb@google.com ARC-Seal: i=1; s=arc-20220608; d=hostedemail.com; t=1688539618; a=rsa-sha256; cv=none; b=YEmo1lHOJ4L1jwlbt3epDBw6Sp0xcyVjAvAorEN3YrJrVC0PjU62S267nJhHRUEiFyTBcC ECorWzTbkjFYU/uEqFzP8MsVsP3tPquN15zlOzU8kmatkymH7DZA15upz+h8zAKXnUIj2S 8gnQQEegM8oVX5cb53mhdHGs3U2O4V4= Received: by mail-yb1-f174.google.com with SMTP id 3f1490d57ef6-c5e76dfcc36so1290482276.2 for ; Tue, 04 Jul 2023 23:46:57 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20221208; t=1688539617; x=1691131617; h=content-transfer-encoding:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:from:to:cc:subject:date :message-id:reply-to; bh=YExDtaOFNAIXQFQh4fxwxBinPAmwx4eECBeCM3cJOGg=; b=gA0dXeNNA3UioszdJf0NYCcktm2CLHrrOzx7SyDNzfk8JC9fnZQriXOPtllEr4Omsu Ege9pavuWpWoAf8rnboR87NmWcQW+SpKnvl0xrQV8sEDPkGFsW/GbzDzWj8cibCFwjO2 A3lYE7QxkcMkyPa6Dhm5N/j93oT6zu6N+ggiQyHhJUo0QNnubCe3cOIvkBZKMgQxrPhh +cdjVWC2yzP9WwwO4CyCsEvTChC/DzXXKtKA1oOPs1gm/lOuqa9OR6qo/h3XFn2ZywGq VKUPu2ia/DrEiKAEdjpjscS6c6vc9ALSEDzF4YfXGyELdt4YpGSpOA6KalqRHZhE8KQH UilA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20221208; t=1688539617; x=1691131617; h=content-transfer-encoding:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:x-gm-message-state:from:to:cc :subject:date:message-id:reply-to; bh=YExDtaOFNAIXQFQh4fxwxBinPAmwx4eECBeCM3cJOGg=; b=dLD7r8old3xZ3Hn8y+b5T27D6v7kIUloQCwAza8zHheK44nkO6kYbVSeA2sTKrq0Ra 2gGNf150nVGO3kNkrDGBxf2yS7nAsvIWf3rQD+jL1EsU9LMsl6fvStGwM/qKM4zp9Z6K GSFZ9zkPGxDHPp4Z8WOND+UAxGkj2ARD8188BnTeXgYFIij0HG068po1ZQ1m2vzVEnWV Zk1C72IPp6KKJu7Rh7u4dlxxv1wtgd9mRP/3s8skRGiOCKMLkJvhDwRX3zAMKKqLUwCI QBaV0LQAHnAHmkM0ze5mEgAGWsBY6hwPs59VuZkgcI6nVZ+gDY6L7ozh51tmy3V88zjJ WKyQ== X-Gm-Message-State: ABy/qLZkP9vQ5iMi8PD3fmdk4Pfb4vp2b8CRlMRyhT54SedWB/eq7tC9 z+bcdYT4s+5ooqrZvnbPTPBXGjQRkSYGjdqPzmCNxg== X-Google-Smtp-Source: APBJJlEV1TlzDehJyHFZyJ2JBAwBmvzuQD5vprQQ8YvU6BXA3qhBZQojImJakdc2dodw19qHVDpM80mR/Z3nM+rEF5c= X-Received: by 2002:a25:b283:0:b0:c63:cc9:43fd with SMTP id k3-20020a25b283000000b00c630cc943fdmr892170ybj.9.1688539616853; Tue, 04 Jul 2023 23:46:56 -0700 (PDT) MIME-Version: 1.0 References: <2efa2c89-3765-721d-2c3c-00590054aa5b@redhat.com> <3c042dcd-192e-7050-07f1-ce891b95dfca@redhat.com> <7d6ba07b-ee60-8920-b91c-04c826eb4690@applied-asynchrony.com> In-Reply-To: From: Suren Baghdasaryan Date: Tue, 4 Jul 2023 23:46:45 -0700 Message-ID: Subject: Re: [PATCH 1/1] mm: disable CONFIG_PER_VMA_LOCK by default until its fixed To: =?UTF-8?Q?Holger_Hoffst=C3=A4tte?= Cc: Matthew Wilcox , David Hildenbrand , akpm@linux-foundation.org, jirislaby@kernel.org, jacobly.alt@gmail.com, michel@lespinasse.org, jglisse@google.com, mhocko@suse.com, vbabka@suse.cz, hannes@cmpxchg.org, mgorman@techsingularity.net, dave@stgolabs.net, liam.howlett@oracle.com, peterz@infradead.org, ldufour@linux.ibm.com, paulmck@kernel.org, mingo@redhat.com, will@kernel.org, luto@kernel.org, songliubraving@fb.com, peterx@redhat.com, dhowells@redhat.com, hughd@google.com, bigeasy@linutronix.de, kent.overstreet@linux.dev, punit.agrawal@bytedance.com, lstoakes@gmail.com, peterjung1337@gmail.com, rientjes@google.com, chriscli@google.com, axelrasmussen@google.com, joelaf@google.com, minchan@google.com, rppt@kernel.org, jannh@google.com, shakeelb@google.com, tatashin@google.com, edumazet@google.com, gthelen@google.com, linux-mm@kvack.org Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Rspam-User: X-Rspamd-Server: rspam12 X-Rspamd-Queue-Id: 2C46640009 X-Stat-Signature: uzmja8fjxe4hs9afj6fsnmof3mcnm6pp X-HE-Tag: 1688539617-728755 X-HE-Meta: U2FsdGVkX1/eJ490cOiFokZQRU8qYXk9ahNGaqXWXbJyMbTgnOQpAPhf3iM1qBvU74yT2/gZreqhVGoUoCGkHMGxOKPablpic7wzv9yxW0TbJ5LtU49HCrOqEkw+0x+L070rrFwVDXwgVgjkorLOhWbNrkflf9Ypf88BjfYNJ8rVj1WBpP+RKEhulwxFNPa4JoH5Siiagk+QyNBHmFyg4qpl76VZ2XakC0US64fApN+C9rSfF/CZ+U6WlrkK9rxKkfxswWWEPGugzc1WrgLsP3pSQwct+SprMzTza40fPP4/dmcrfF3sQH4uCqdMmQCBOGdKtHKeTmt1DkmkcN56waii6LwW7F725IlsywM5cMqJ+KbQGl+VKn0JYg5oCmH1sTLFeSBv+/zjpMbX0OQzmtfmcuqclu2SoGuiGjrncX71+Fadjz76XQYQYV6Es922Z7UUMksBjMdExpGOAbHQ7UZTC9hgjFwCT2FnSb3cBXefpIqVHZ9BBF9OVADT4/4Y6gMKzlkCN1ZcXMccD12f5XCPPFPiMuOvpk1QtxFXMdzeVxp06gQ2NtjMY2bAv3ue0+e92crSArHtdJ9T9PqjECX3SRE65QCk0GCnCaxpCNOIeEgTkA3h0Qw6ZVteI19tMrO8aQoLCFKqkMotdOP3rj1RkIGYWX4hQYARUwk37inkkUzGFqOZ6ohS20R4UZl34AOeYRM6qVCviC81CAu4u5qpXOeHDem4l6W8cq5Mc4faKb7Xr9LOhn2y+XGMOsVKn0CGK6nAIhbkYK2E0icEvsWl9FCPmk4kUTj6SrRl7JI9xHx1yk9asfWj/5c82t7FBbmvTmI+zuOfgwWs8zdh4snitKbeQjxDLZKwl4lI+NHqqdsUtN+7dj4YQ477D7IB/5jBKbVm2pp4CEpx4fbIeaWKTInp5mxBtahUDbmY53r3FFYYgQsR2eVFSuejRnHO3OanmMNTUbYGGJZID6Y fKcVxT5a w1xlW2pTAaNP+nfSUbxEg6SpqUiX1yVHdGzLv/INGPpP3ES4VRjHpRI/yap7rGH7bhwV6+QLmx7y77Z6JVRKLPWPe3yBotg7Opw/1yRlOAwY79Lzxl/1wWb/A8ozDXDAZJUzpY6gcZR2YHmh4DKDtzhfDl2fEMUCRrcDZvRHktx84XfBtLCWgjNsRdwZGxd3sZMMhwTGRfDXaWq4vq7oj0jncaVjNuYnLT6ix6+5a0RRXFMdvId7+IhOq3tljib2WzY3QkOW5MoaUx4S+f4W2GaMiwLgPSV4sl/BnUy/LGrTXzaHfY6VKNoqXYhVIkQo9BkXsAum4sClnTik9liivekg+45h2NRI+WiW5xjrNHE+AmzF7JTHQe9zkRnGQo+qWcSc46ySjEXkqpfNkFbiJXbiu/j7vRT3x4RrBDQ1asU9ippK0fimzjlw77+zdr7vqbR89UVyjz/3f8AkatvP17eZax9kRartNb0fVDHcLvHURj+VGho6jNG6xTXXWOyFKAZaZxnXxNhk2mEoJ5rPLK+PkHIOna7eFF6Deb6yGpwds0lsFSR8po+Le8kNijts9aVWiydqpqTdaagwu00DA5YIUnmlgTo/V7T5hAUlVPiUB4AKHuRGLLgOY6zMn41wyw8AYhWS+DOrcKPM= X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: On Tue, Jul 4, 2023 at 4:59=E2=80=AFPM Holger Hoffst=C3=A4tte wrote: > > On 2023-07-05 00:42, Matthew Wilcox wrote: > > On Tue, Jul 04, 2023 at 11:34:27PM +0200, Holger Hoffst=C3=A4tte wrote: > >> I applied the fix and did a clean rebuild. The first attempt to boot r= esulted in > >> the following oops, though it kind of continued: > > > > It would be helpful to run this through decode_stacktrace.sh > > > >> Jul 4 22:35:22 hho kernel: BUG: kernel NULL pointer dereference, addr= ess: 0000000000000052 > >> Jul 4 22:35:22 hho kernel: #PF: supervisor read access in kernel mode > >> Jul 4 22:35:22 hho kernel: #PF: error_code(0x0000) - not-present page > >> Jul 4 22:35:22 hho kernel: PGD 0 P4D 0 > >> Jul 4 22:35:22 hho kernel: Oops: 0000 [#1] SMP > >> Jul 4 22:35:22 hho kernel: CPU: 10 PID: 1740 Comm: start-stop-daem No= t tainted 6.4.1 #1 > >> Jul 4 22:35:22 hho kernel: Hardware name: LENOVO 20U50001GE/20U50001G= E, BIOS R19ET32W (1.16 ) 01/26/2021 > >> Jul 4 22:35:22 hho kernel: RIP: 0010:wq_worker_comm+0x63/0xc0 > >> Jul 4 22:35:22 hho kernel: Code: 43 2c 20 75 1d 5b 5d 48 c7 c7 e0 a4 = 43 82 41 5c 41 5d 41 5e e9 7e 6b 8b 00 5b 5d 41 5c 41 5d 41 5e c3 48 89 df = e8 ad 35 00 00 <4c> 8b 70 48 48 89 c3 4d 85 f6 74 cf 4c 89 f7 e8 29 b6 8b 0= 0 80 7b > > > > Faulting insn: > > > > 0: 4c 8b 70 48 mov 0x48(%rax),%r14 > > > > and rax is 0xa, which matches up with 0x52 as the faulting address. > > > > I'm not sure this is related to the VMA patches. It might be something > > unrelated that doesn't often come up? > > See below for the reveal! > > >> Jul 4 22:35:22 hho kernel: RSP: 0018:ffffc90000fb7bb8 EFLAGS: 0001020= 2 > >> Jul 4 22:35:22 hho kernel: RAX: 000000000000000a RBX: ffff88810cd4330= 0 RCX: 0001020304050608 > >> Jul 4 22:35:22 hho kernel: RDX: ffff88811395bfc0 RSI: 7ffffffffffffff= f RDI: ffff88810cd43300 > >> Jul 4 22:35:22 hho kernel: RBP: 000000000000000f R08: ffffc90000fb7be= 8 R09: 0000000000000040 > >> Jul 4 22:35:22 hho kernel: R10: fefefefefefefeff R11: 000000000000004= 0 R12: ffffc90000fb7be8 > >> Jul 4 22:35:22 hho kernel: R13: 0000000000000040 R14: 000000000000000= c R15: 0000000000000001 > >> Jul 4 22:35:22 hho kernel: FS: 00007f39dde1c740(0000) GS:ffff8887ef6= 80000(0000) knlGS:0000000000000000 > >> Jul 4 22:35:22 hho kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 000000008= 0050033 > >> Jul 4 22:35:22 hho kernel: CR2: 0000000000000052 CR3: 000000011218800= 0 CR4: 0000000000350ee0 > >> Jul 4 22:35:22 hho kernel: Call Trace: > >> Jul 4 22:35:22 hho kernel: > >> Jul 4 22:35:22 hho kernel: ? __die+0x1f/0x60 > >> Jul 4 22:35:22 hho kernel: ? page_fault_oops+0x14d/0x410 > >> Jul 4 22:35:22 hho kernel: ? xa_load+0x82/0xa0 > >> Jul 4 22:35:22 hho kernel: ? exc_page_fault+0x60/0x100 > >> Jul 4 22:35:22 hho kernel: ? asm_exc_page_fault+0x22/0x30 > >> Jul 4 22:35:22 hho kernel: ? wq_worker_comm+0x63/0xc0 > >> Jul 4 22:35:22 hho last message buffered 1 times > >> Jul 4 22:35:22 hho kernel: proc_task_name+0xa4/0xb0 > >> Jul 4 22:35:22 hho kernel: ? seq_put_decimal_ull_width+0x96/0x100 > >> Jul 4 22:35:22 hho kernel: do_task_stat+0x44b/0xe10 > >> Jul 4 22:35:22 hho kernel: proc_single_show+0x4b/0xa0 > >> Jul 4 22:35:22 hho kernel: seq_read_iter+0xff/0x410 > >> Jul 4 22:35:22 hho kernel: ? generic_fillattr+0x45/0xf0 > >> Jul 4 22:35:22 hho kernel: seq_read+0x93/0xb0 > >> Jul 4 22:35:22 hho kernel: vfs_read+0x9b/0x2c0 > >> Jul 4 22:35:22 hho kernel: ? __do_sys_newfstatat+0x22/0x30 > >> Jul 4 22:35:22 hho kernel: ksys_read+0x53/0xc0 > >> Jul 4 22:35:22 hho kernel: do_syscall_64+0x35/0x80 > >> Jul 4 22:35:22 hho kernel: entry_SYSCALL_64_after_hwframe+0x46/0xb0 > >> Jul 4 22:35:22 hho kernel: RIP: 0033:0x7f39ddf5877d > >> Jul 4 22:35:22 hho kernel: Code: b9 fe ff ff 48 8d 3d 1a 71 0a 00 50 = e8 2c 12 02 00 66 2e 0f 1f 84 00 00 00 00 00 66 90 80 3d 81 4c 0e 00 00 74 = 17 31 c0 0f 05 <48> 3d 00 f0 ff ff 77 5b c3 66 2e 0f 1f 84 00 00 00 00 00 5= 3 48 83 > >> Jul 4 22:35:22 hho kernel: RSP: 002b:00007ffe4b98b6f8 EFLAGS: 0000024= 6 ORIG_RAX: 0000000000000000 > >> Jul 4 22:35:22 hho kernel: RAX: ffffffffffffffda RBX: 00005655194cab4= 0 RCX: 00007f39ddf5877d > >> Jul 4 22:35:22 hho kernel: RDX: 0000000000000400 RSI: 00005655194ccd3= 0 RDI: 0000000000000004 > >> Jul 4 22:35:22 hho kernel: RBP: 00007ffe4b98b760 R08: 00007f39ddff8cb= 2 R09: 0000000000000001 > >> Jul 4 22:35:22 hho kernel: R10: 0000000000001000 R11: 000000000000024= 6 R12: 00007f39de0324a0 > >> Jul 4 22:35:22 hho kernel: R13: 00005655194cd140 R14: 0000000000000a6= 8 R15: 00007f39de031ba0 > >> Jul 4 22:35:22 hho kernel: > >> Jul 4 22:35:22 hho kernel: Modules linked in: mousedev sch_fq_codel b= pf_preload snd_ctl_led amdgpu iwlmvm snd_hda_codec_realtek snd_hda_codec_ge= neric snd_hda_codec_hdmi mac80211 pkcs8_key_parser drm_ttm_helper ttm iommu= _v2 gpu_sched snd_hda_intel libarc4 i2c_algo_bit snd_intel_dspcfg drm_buddy= drm_suballoc_helper uvcvideo snd_hda_codec drm_display_helper edac_mce_amd= videobuf2_vmalloc snd_hwdep crct10dif_pclmul videobuf2_memops uvc crc32_pc= lmul cec snd_hda_core crc32c_intel videobuf2_v4l2 ghash_clmulni_intel lm92 = r8169 sha512_ssse3 snd_pcm videodev psmouse thinkpad_acpi iwlwifi drivetemp= ledtrig_audio drm_kms_helper rapl videobuf2_common realtek snd_timer serio= _raw snd_rn_pci_acp3x wmi_bmof platform_profile cfg80211 mc snd_acp_config = k10temp snd syscopyarea mdio_devres ucsi_acpi snd_soc_acpi sysfillrect drm = snd_pci_acp3x i2c_piix4 sysimgblt soundcore typec_ucsi ipmi_devintf rfkill = roles libphy ipmi_msghandler typec video battery ac wmi i2c_scmi button > >> Jul 4 22:35:22 hho kernel: CR2: 0000000000000052 > >> Jul 4 22:35:22 hho kernel: ---[ end trace 0000000000000000 ]--- > >> Jul 4 22:35:22 hho kernel: RIP: 0010:wq_worker_comm+0x63/0xc0 > >> Jul 4 22:35:22 hho kernel: Code: 43 2c 20 75 1d 5b 5d 48 c7 c7 e0 a4 = 43 82 41 5c 41 5d 41 5e e9 7e 6b 8b 00 5b 5d 41 5c 41 5d 41 5e c3 48 89 df = e8 ad 35 00 00 <4c> 8b 70 48 48 89 c3 4d 85 f6 74 cf 4c 89 f7 e8 29 b6 8b 0= 0 80 7b > >> Jul 4 22:35:22 hho kernel: RSP: 0018:ffffc90000fb7bb8 EFLAGS: 0001020= 2 > >> Jul 4 22:35:22 hho kernel: RAX: 000000000000000a RBX: ffff88810cd4330= 0 RCX: 0001020304050608 > >> Jul 4 22:35:22 hho kernel: RDX: ffff88811395bfc0 RSI: 7ffffffffffffff= f RDI: ffff88810cd43300 > >> Jul 4 22:35:22 hho kernel: RBP: 000000000000000f R08: ffffc90000fb7be= 8 R09: 0000000000000040 > >> Jul 4 22:35:22 hho kernel: R10: fefefefefefefeff R11: 000000000000004= 0 R12: ffffc90000fb7be8 > >> Jul 4 22:35:22 hho kernel: R13: 0000000000000040 R14: 000000000000000= c R15: 0000000000000001 > >> Jul 4 22:35:22 hho kernel: FS: 00007f39dde1c740(0000) GS:ffff8887ef6= 80000(0000) knlGS:0000000000000000 > >> Jul 4 22:35:22 hho kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 000000008= 0050033 > >> Jul 4 22:35:22 hho kernel: CR2: 0000000000000052 CR3: 000000011218800= 0 CR4: 0000000000350ee0 > >> Jul 4 22:35:22 hho kernel: note: start-stop-daem[1740] exited with ir= qs disabled > >> Jul 4 22:35:22 hho kernel: Generic FE-GE Realtek PHY r8169-0-200:00: = attached PHY driver (mii_bus:phy_addr=3Dr8169-0-200:00, irq=3DMAC) > >> Jul 4 22:35:22 hho kernel: r8169 0000:02:00.0 eth0: Link is Down > >> Jul 4 22:35:24 hho kernel: r8169 0000:02:00.0 eth0: Link is Up - 1Gbp= s/Full - flow control rx/tx > >> Jul 4 22:35:24 hho kernel: IPv6: ADDRCONF(NETDEV_CHANGE): eth0: link = becomes ready > >> > >> It then kind of limped along until I rebooted again. This second attem= pt to boot > >> died and locked up completely, again during amdgpu initialization, and= is on display here: > >> https://imgur.com/a/3ZE66kh > > > > refill_obj_stock() is also somewhat unrelated to VMA stuff. This is > > all very bizarre. > > > >> Finally I just edited mm/Kconfig and set config PER_VMA_LOCK to "defbo= ol n" to override > >> any setting in my old config. That made everything work again - it's w= hat I'm using now. > > > > Could I ask you to try a few boots with PER_VMA_LOCK set to "n", just > > to eliminate the possibility that this is a coincidence? > > > > HOLY SMOKES! You are on to something! I wanted to do 10 reboots and didn'= t expect > anything to happen since this has been working fine since forever, and I = don't boot > that often since suspend is quite reliable these days. It did 9 without p= roblems and > then on the 10th reboot it crapped out, again with the xa_load pagefault. Ok, sounds like the results of the fix are inconclusive. I guess we should wait for more testing before concluding whether the fix is valid. In the meantime, per Andrew's request, I posted the patchset that includes both the fix and the proper kill switch of the feature at https://lore.kernel.org/all/20230705063711.2670599-1-surenb@google.com/. Thanks, Suren. > > Here's the first trace: > > holger>/tmp/linux-6.4.1/scripts/decode_stacktrace.sh /boot/kernel-genkern= el-x86_64-6.4.1 < /tmp/kern.log > Jul 4 22:35:22 hho kernel: [drm] Initialized amdgpu 3.52.0 20150101 for = 0000:06:00.0 on minor 0 > Jul 4 22:35:22 hho kernel: fbcon: amdgpudrmfb (fb0) is primary device > Jul 4 22:35:22 hho kernel: [drm] DSC precompute is not needed. > Jul 4 22:35:22 hho kernel: Console: switching to colour frame buffer dev= ice 240x67 > Jul 4 22:35:22 hho kernel: amdgpu 0000:06:00.0: [drm] fb0: amdgpudrmfb f= rame buffer device > Jul 4 22:35:22 hho kernel: BUG: kernel NULL pointer dereference, address= : 0000000000000052 > Jul 4 22:35:22 hho kernel: #PF: supervisor read access in kernel mode > Jul 4 22:35:22 hho kernel: #PF: error_code(0x0000) - not-present page > Jul 4 22:35:22 hho kernel: PGD 0 P4D 0 > Jul 4 22:35:22 hho kernel: Oops: 0000 [#1] SMP > Jul 4 22:35:22 hho kernel: CPU: 10 PID: 1740 Comm: start-stop-daem Not t= ainted 6.4.1 #1 > Jul 4 22:35:22 hho kernel: Hardware name: LENOVO 20U50001GE/20U50001GE, = BIOS R19ET32W (1.16 ) 01/26/2021 > Jul 4 22:35:22 hho kernel: RIP: wq_worker_comm+0x63/0xc0 > Jul 4 22:35:22 hho kernel: Code: 43 2c 20 75 1d 5b 5d 48 c7 c7 e0 a4 43 8= 2 41 5c 41 5d 41 5e e9 7e 6b 8b 00 5b 5d 41 5c 41 5d 41 5e c3 48 89 df e8 a= d 35 00 00 <4c> 8b 70 48 48 89 c3 4d 85 f6 74 cf 4c 89 f7 e8 29 b6 8b 00 80= 7b > All code > =3D=3D=3D=3D=3D=3D=3D=3D > 0: 43 2c 20 rex.XB sub $0x20,%al > 3: 75 1d jne 0x22 > 5: 5b pop %rbx > 6: 5d pop %rbp > 7: 48 c7 c7 e0 a4 43 82 mov $0xffffffff8243a4e0,%rdi > e: 41 5c pop %r12 > 10: 41 5d pop %r13 > 12: 41 5e pop %r14 > 14: e9 7e 6b 8b 00 jmp 0x8b6b97 > 19: 5b pop %rbx > 1a: 5d pop %rbp > 1b: 41 5c pop %r12 > 1d: 41 5d pop %r13 > 1f: 41 5e pop %r14 > 21: c3 ret > 22: 48 89 df mov %rbx,%rdi > 25: e8 ad 35 00 00 call 0x35d7 > 2a:* 4c 8b 70 48 mov 0x48(%rax),%r14 <-- trapp= ing instruction > 2e: 48 89 c3 mov %rax,%rbx > 31: 4d 85 f6 test %r14,%r14 > 34: 74 cf je 0x5 > 36: 4c 89 f7 mov %r14,%rdi > 39: e8 29 b6 8b 00 call 0x8bb667 > 3e: 80 .byte 0x80 > 3f: 7b .byte 0x7b > > Code starting with the faulting instruction > =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D > 0: 4c 8b 70 48 mov 0x48(%rax),%r14 > 4: 48 89 c3 mov %rax,%rbx > 7: 4d 85 f6 test %r14,%r14 > a: 74 cf je 0xffffffffffffffdb > c: 4c 89 f7 mov %r14,%rdi > f: e8 29 b6 8b 00 call 0x8bb63d > 14: 80 .byte 0x80 > 15: 7b .byte 0x7b > Jul 4 22:35:22 hho kernel: RSP: 0018:ffffc90000fb7bb8 EFLAGS: 00010202 > Jul 4 22:35:22 hho kernel: RAX: 000000000000000a RBX: ffff88810cd43300 R= CX: 0001020304050608 > Jul 4 22:35:22 hho kernel: RDX: ffff88811395bfc0 RSI: 7fffffffffffffff R= DI: ffff88810cd43300 > Jul 4 22:35:22 hho kernel: RBP: 000000000000000f R08: ffffc90000fb7be8 R= 09: 0000000000000040 > Jul 4 22:35:22 hho kernel: R10: fefefefefefefeff R11: 0000000000000040 R= 12: ffffc90000fb7be8 > Jul 4 22:35:22 hho kernel: R13: 0000000000000040 R14: 000000000000000c R= 15: 0000000000000001 > Jul 4 22:35:22 hho kernel: FS: 00007f39dde1c740(0000) GS:ffff8887ef6800= 00(0000) knlGS:0000000000000000 > Jul 4 22:35:22 hho kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005= 0033 > Jul 4 22:35:22 hho kernel: CR2: 0000000000000052 CR3: 0000000112188000 C= R4: 0000000000350ee0 > Jul 4 22:35:22 hho kernel: Call Trace: > Jul 4 22:35:22 hho kernel: > Jul 4 22:35:22 hho kernel: ? __die+0x1f/0x60 > Jul 4 22:35:22 hho kernel: ? page_fault_oops+0x14d/0x410 > Jul 4 22:35:22 hho kernel: ? xa_load+0x82/0xa0 > Jul 4 22:35:22 hho kernel: ? exc_page_fault+0x60/0x100 > Jul 4 22:35:22 hho kernel: ? asm_exc_page_fault+0x22/0x30 > Jul 4 22:35:22 hho kernel: ? wq_worker_comm+0x63/0xc0 > Jul 4 22:35:22 hho last message buffered 1 times > Jul 4 22:35:22 hho kernel: proc_task_name+0xa4/0xb0 > Jul 4 22:35:22 hho kernel: ? seq_put_decimal_ull_width+0x96/0x100 > Jul 4 22:35:22 hho kernel: do_task_stat+0x44b/0xe10 > Jul 4 22:35:22 hho kernel: proc_single_show+0x4b/0xa0 > Jul 4 22:35:22 hho kernel: seq_read_iter+0xff/0x410 > Jul 4 22:35:22 hho kernel: ? generic_fillattr+0x45/0xf0 > Jul 4 22:35:22 hho kernel: seq_read+0x93/0xb0 > Jul 4 22:35:22 hho kernel: vfs_read+0x9b/0x2c0 > Jul 4 22:35:22 hho kernel: ? __do_sys_newfstatat+0x22/0x30 > Jul 4 22:35:22 hho kernel: ksys_read+0x53/0xc0 > Jul 4 22:35:22 hho kernel: do_syscall_64+0x35/0x80 > Jul 4 22:35:22 hho kernel: entry_SYSCALL_64_after_hwframe+0x46/0xb0 > Jul 4 22:35:22 hho kernel: RIP: 0033:0x7f39ddf5877d > Jul 4 22:35:22 hho kernel: Code: b9 fe ff ff 48 8d 3d 1a 71 0a 00 50 e8 2= c 12 02 00 66 2e 0f 1f 84 00 00 00 00 00 66 90 80 3d 81 4c 0e 00 00 74 17 3= 1 c0 0f 05 <48> 3d 00 f0 ff ff 77 5b c3 66 2e 0f 1f 84 00 00 00 00 00 53 48= 83 > All code > =3D=3D=3D=3D=3D=3D=3D=3D > 0: b9 fe ff ff 48 mov $0x48fffffe,%ecx > 5: 8d 3d 1a 71 0a 00 lea 0xa711a(%rip),%edi # 0xa712= 5 > b: 50 push %rax > c: e8 2c 12 02 00 call 0x2123d > 11: 66 2e 0f 1f 84 00 00 cs nopw 0x0(%rax,%rax,1) > 18: 00 00 00 > 1b: 66 90 xchg %ax,%ax > 1d: 80 3d 81 4c 0e 00 00 cmpb $0x0,0xe4c81(%rip) # 0xe4ca= 5 > 24: 74 17 je 0x3d > 26: 31 c0 xor %eax,%eax > 28: 0f 05 syscall > 2a:* 48 3d 00 f0 ff ff cmp $0xfffffffffffff000,%rax <= -- trapping instruction > 30: 77 5b ja 0x8d > 32: c3 ret > 33: 66 2e 0f 1f 84 00 00 cs nopw 0x0(%rax,%rax,1) > 3a: 00 00 00 > 3d: 53 push %rbx > 3e: 48 rex.W > 3f: 83 .byte 0x83 > > Code starting with the faulting instruction > =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D > 0: 48 3d 00 f0 ff ff cmp $0xfffffffffffff000,%rax > 6: 77 5b ja 0x63 > 8: c3 ret > 9: 66 2e 0f 1f 84 00 00 cs nopw 0x0(%rax,%rax,1) > 10: 00 00 00 > 13: 53 push %rbx > 14: 48 rex.W > 15: 83 .byte 0x83 > Jul 4 22:35:22 hho kernel: RSP: 002b:00007ffe4b98b6f8 EFLAGS: 00000246 O= RIG_RAX: 0000000000000000 > Jul 4 22:35:22 hho kernel: RAX: ffffffffffffffda RBX: 00005655194cab40 R= CX: 00007f39ddf5877d > Jul 4 22:35:22 hho kernel: RDX: 0000000000000400 RSI: 00005655194ccd30 R= DI: 0000000000000004 > Jul 4 22:35:22 hho kernel: RBP: 00007ffe4b98b760 R08: 00007f39ddff8cb2 R= 09: 0000000000000001 > Jul 4 22:35:22 hho kernel: R10: 0000000000001000 R11: 0000000000000246 R= 12: 00007f39de0324a0 > Jul 4 22:35:22 hho kernel: R13: 00005655194cd140 R14: 0000000000000a68 R= 15: 00007f39de031ba0 > Jul 4 22:35:22 hho kernel: > Jul 4 22:35:22 hho kernel: Modules linked in: mousedev sch_fq_codel bpf_= preload snd_ctl_led amdgpu iwlmvm snd_hda_codec_realtek snd_hda_codec_gener= ic snd_hda_codec_hdmi mac80211 pkcs8_key_parser drm_ttm_helper ttm iommu_v2= gpu_sched snd_hda_intel libarc4 i2c_algo_bit snd_intel_dspcfg drm_buddy dr= m_suballoc_helper uvcvideo snd_hda_codec drm_display_helper edac_mce_amd vi= deobuf2_vmalloc snd_hwdep crct10dif_pclmul videobuf2_memops uvc crc32_pclmu= l cec snd_hda_core crc32c_intel videobuf2_v4l2 ghash_clmulni_intel lm92 r81= 69 sha512_ssse3 snd_pcm videodev psmouse thinkpad_acpi iwlwifi drivetemp le= dtrig_audio drm_kms_helper rapl videobuf2_common realtek snd_timer serio_ra= w snd_rn_pci_acp3x wmi_bmof platform_profile cfg80211 mc snd_acp_config k10= temp snd syscopyarea mdio_devres ucsi_acpi snd_soc_acpi sysfillrect drm snd= _pci_acp3x i2c_piix4 sysimgblt soundcore typec_ucsi ipmi_devintf rfkill rol= es libphy ipmi_msghandler typec video battery ac wmi i2c_scmi button > Jul 4 22:35:22 hho kernel: CR2: 0000000000000052 > Jul 4 22:35:22 hho kernel: ---[ end trace 0000000000000000 ]--- > Jul 4 22:35:22 hho kernel: RIP: wq_worker_comm+0x63/0xc0 > Jul 4 22:35:22 hho kernel: Code: 43 2c 20 75 1d 5b 5d 48 c7 c7 e0 a4 43 8= 2 41 5c 41 5d 41 5e e9 7e 6b 8b 00 5b 5d 41 5c 41 5d 41 5e c3 48 89 df e8 a= d 35 00 00 <4c> 8b 70 48 48 89 c3 4d 85 f6 74 cf 4c 89 f7 e8 29 b6 8b 00 80= 7b > All code > =3D=3D=3D=3D=3D=3D=3D=3D > 0: 43 2c 20 rex.XB sub $0x20,%al > 3: 75 1d jne 0x22 > 5: 5b pop %rbx > 6: 5d pop %rbp > 7: 48 c7 c7 e0 a4 43 82 mov $0xffffffff8243a4e0,%rdi > e: 41 5c pop %r12 > 10: 41 5d pop %r13 > 12: 41 5e pop %r14 > 14: e9 7e 6b 8b 00 jmp 0x8b6b97 > 19: 5b pop %rbx > 1a: 5d pop %rbp > 1b: 41 5c pop %r12 > 1d: 41 5d pop %r13 > 1f: 41 5e pop %r14 > 21: c3 ret > 22: 48 89 df mov %rbx,%rdi > 25: e8 ad 35 00 00 call 0x35d7 > 2a:* 4c 8b 70 48 mov 0x48(%rax),%r14 <-- trapp= ing instruction > 2e: 48 89 c3 mov %rax,%rbx > 31: 4d 85 f6 test %r14,%r14 > 34: 74 cf je 0x5 > 36: 4c 89 f7 mov %r14,%rdi > 39: e8 29 b6 8b 00 call 0x8bb667 > 3e: 80 .byte 0x80 > 3f: 7b .byte 0x7b > > Code starting with the faulting instruction > =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D > 0: 4c 8b 70 48 mov 0x48(%rax),%r14 > 4: 48 89 c3 mov %rax,%rbx > 7: 4d 85 f6 test %r14,%r14 > a: 74 cf je 0xffffffffffffffdb > c: 4c 89 f7 mov %r14,%rdi > f: e8 29 b6 8b 00 call 0x8bb63d > 14: 80 .byte 0x80 > 15: 7b .byte 0x7b > Jul 4 22:35:22 hho kernel: RSP: 0018:ffffc90000fb7bb8 EFLAGS: 00010202 > Jul 4 22:35:22 hho kernel: RAX: 000000000000000a RBX: ffff88810cd43300 R= CX: 0001020304050608 > Jul 4 22:35:22 hho kernel: RDX: ffff88811395bfc0 RSI: 7fffffffffffffff R= DI: ffff88810cd43300 > Jul 4 22:35:22 hho kernel: RBP: 000000000000000f R08: ffffc90000fb7be8 R= 09: 0000000000000040 > Jul 4 22:35:22 hho kernel: R10: fefefefefefefeff R11: 0000000000000040 R= 12: ffffc90000fb7be8 > Jul 4 22:35:22 hho kernel: R13: 0000000000000040 R14: 000000000000000c R= 15: 0000000000000001 > Jul 4 22:35:22 hho kernel: FS: 00007f39dde1c740(0000) GS:ffff8887ef6800= 00(0000) knlGS:0000000000000000 > Jul 4 22:35:22 hho kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005= 0033 > Jul 4 22:35:22 hho kernel: CR2: 0000000000000052 CR3: 0000000112188000 C= R4: 0000000000350ee0 > Jul 4 22:35:22 hho kernel: note: start-stop-daem[1740] exited with irqs = disabled > Jul 4 22:35:22 hho kernel: Generic FE-GE Realtek PHY r8169-0-200:00: att= ached PHY driver (mii_bus:phy_addr=3Dr8169-0-200:00, irq=3DMAC) > Jul 4 22:35:22 hho kernel: r8169 0000:02:00.0 eth0: Link is Down > Jul 4 22:35:24 hho kernel: r8169 0000:02:00.0 eth0: Link is Up - 1Gbps/F= ull - flow control rx/tx > Jul 4 22:35:24 hho kernel: IPv6: ADDRCONF(NETDEV_CHANGE): eth0: link bec= omes ready > > Here is the second one from the reboot bonanza: > > holger>/tmp/linux-6.4.1/scripts/decode_stacktrace.sh /boot/kernel-genkern= el-x86_64-6.4.1 < /tmp/kern.log > Jul 5 01:34:20 hho kernel: [drm] Initialized amdgpu 3.52.0 20150101 for = 0000:06:00.0 on minor 0 > Jul 5 01:34:20 hho kernel: fbcon: amdgpudrmfb (fb0) is primary device > Jul 5 01:34:20 hho kernel: [drm] DSC precompute is not needed. > Jul 5 01:34:20 hho kernel: Console: switching to colour frame buffer dev= ice 240x67 > Jul 5 01:34:20 hho kernel: amdgpu 0000:06:00.0: [drm] fb0: amdgpudrmfb f= rame buffer device > Jul 5 01:34:20 hho kernel: BUG: kernel NULL pointer dereference, address= : 0000000000000052 > Jul 5 01:34:20 hho kernel: #PF: supervisor read access in kernel mode > Jul 5 01:34:20 hho kernel: #PF: error_code(0x0000) - not-present page > Jul 5 01:34:20 hho kernel: PGD 0 P4D 0 > Jul 5 01:34:20 hho kernel: Oops: 0000 [#1] SMP > Jul 5 01:34:20 hho kernel: CPU: 8 PID: 1716 Comm: start-stop-daem Not ta= inted 6.4.1 #1 > Jul 5 01:34:20 hho kernel: Hardware name: LENOVO 20U50001GE/20U50001GE, = BIOS R19ET32W (1.16 ) 01/26/2021 > Jul 5 01:34:20 hho kernel: RIP: wq_worker_comm+0x63/0xc0 > Jul 5 01:34:20 hho kernel: Code: 43 2c 20 75 1d 5b 5d 48 c7 c7 e0 a4 43 8= 2 41 5c 41 5d 41 5e e9 2e 59 8b 00 5b 5d 41 5c 41 5d 41 5e c3 48 89 df e8 a= d 35 00 00 <4c> 8b 70 48 48 89 c3 4d 85 f6 74 cf 4c 89 f7 e8 d9 a3 8b 00 80= 7b > All code > =3D=3D=3D=3D=3D=3D=3D=3D > 0: 43 2c 20 rex.XB sub $0x20,%al > 3: 75 1d jne 0x22 > 5: 5b pop %rbx > 6: 5d pop %rbp > 7: 48 c7 c7 e0 a4 43 82 mov $0xffffffff8243a4e0,%rdi > e: 41 5c pop %r12 > 10: 41 5d pop %r13 > 12: 41 5e pop %r14 > 14: e9 2e 59 8b 00 jmp 0x8b5947 > 19: 5b pop %rbx > 1a: 5d pop %rbp > 1b: 41 5c pop %r12 > 1d: 41 5d pop %r13 > 1f: 41 5e pop %r14 > 21: c3 ret > 22: 48 89 df mov %rbx,%rdi > 25: e8 ad 35 00 00 call 0x35d7 > 2a:* 4c 8b 70 48 mov 0x48(%rax),%r14 <-- trapp= ing instruction > 2e: 48 89 c3 mov %rax,%rbx > 31: 4d 85 f6 test %r14,%r14 > 34: 74 cf je 0x5 > 36: 4c 89 f7 mov %r14,%rdi > 39: e8 d9 a3 8b 00 call 0x8ba417 > 3e: 80 .byte 0x80 > 3f: 7b .byte 0x7b > > Code starting with the faulting instruction > =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D > 0: 4c 8b 70 48 mov 0x48(%rax),%r14 > 4: 48 89 c3 mov %rax,%rbx > 7: 4d 85 f6 test %r14,%r14 > a: 74 cf je 0xffffffffffffffdb > c: 4c 89 f7 mov %r14,%rdi > f: e8 d9 a3 8b 00 call 0x8ba3ed > 14: 80 .byte 0x80 > 15: 7b .byte 0x7b > Jul 5 01:34:20 hho kernel: RSP: 0018:ffffc90001027bb8 EFLAGS: 00010202 > Jul 5 01:34:20 hho kernel: RAX: 000000000000000a RBX: ffff888111052640 R= CX: 0001020304050608 > Jul 5 01:34:20 hho kernel: RDX: ffff88810490b300 RSI: 7fffffffffffffff R= DI: ffff888111052640 > Jul 5 01:34:20 hho kernel: RBP: 000000000000000f R08: ffffc90001027be8 R= 09: 0000000000000040 > Jul 5 01:34:20 hho kernel: R10: fefefefefefefeff R11: 0000000000000040 R= 12: ffffc90001027be8 > Jul 5 01:34:20 hho kernel: R13: 0000000000000040 R14: 000000000000000c R= 15: 0000000000000001 > Jul 5 01:34:20 hho kernel: FS: 00007f917809a740(0000) GS:ffff8887ef6000= 00(0000) knlGS:0000000000000000 > Jul 5 01:34:20 hho kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005= 0033 > Jul 5 01:34:20 hho kernel: CR2: 0000000000000052 CR3: 0000000107562000 C= R4: 0000000000350ee0 > Jul 5 01:34:20 hho kernel: Call Trace: > Jul 5 01:34:20 hho kernel: > Jul 5 01:34:20 hho kernel: ? __die+0x1f/0x60 > Jul 5 01:34:20 hho kernel: ? page_fault_oops+0x14d/0x410 > Jul 5 01:34:20 hho kernel: ? xa_load+0x82/0xa0 > Jul 5 01:34:20 hho last message buffered 1 times > Jul 5 01:34:20 hho kernel: ? exc_page_fault+0x60/0x100 > Jul 5 01:34:20 hho kernel: ? asm_exc_page_fault+0x22/0x30 > Jul 5 01:34:20 hho kernel: ? wq_worker_comm+0x63/0xc0 > Jul 5 01:34:20 hho last message buffered 1 times > Jul 5 01:34:20 hho kernel: proc_task_name+0xa4/0xb0 > Jul 5 01:34:20 hho kernel: ? seq_put_decimal_ull_width+0x96/0x100 > Jul 5 01:34:20 hho kernel: do_task_stat+0x44b/0xe10 > Jul 5 01:34:20 hho kernel: proc_single_show+0x4b/0xa0 > Jul 5 01:34:20 hho kernel: seq_read_iter+0xff/0x410 > Jul 5 01:34:20 hho kernel: ? generic_fillattr+0x45/0xf0 > Jul 5 01:34:20 hho kernel: seq_read+0x93/0xb0 > Jul 5 01:34:20 hho kernel: vfs_read+0x9b/0x2c0 > Jul 5 01:34:20 hho kernel: ? __do_sys_newfstatat+0x22/0x30 > Jul 5 01:34:20 hho kernel: ksys_read+0x53/0xc0 > Jul 5 01:34:20 hho kernel: do_syscall_64+0x35/0x80 > Jul 5 01:34:20 hho kernel: entry_SYSCALL_64_after_hwframe+0x46/0xb0 > Jul 5 01:34:20 hho kernel: RIP: 0033:0x7f91781d677d > Jul 5 01:34:20 hho kernel: Code: b9 fe ff ff 48 8d 3d 1a 71 0a 00 50 e8 2= c 12 02 00 66 2e 0f 1f 84 00 00 00 00 00 66 90 80 3d 81 4c 0e 00 00 74 17 3= 1 c0 0f 05 <48> 3d 00 f0 ff ff 77 5b c3 66 2e 0f 1f 84 00 00 00 00 00 53 48= 83 > All code > =3D=3D=3D=3D=3D=3D=3D=3D > 0: b9 fe ff ff 48 mov $0x48fffffe,%ecx > 5: 8d 3d 1a 71 0a 00 lea 0xa711a(%rip),%edi # 0xa712= 5 > b: 50 push %rax > c: e8 2c 12 02 00 call 0x2123d > 11: 66 2e 0f 1f 84 00 00 cs nopw 0x0(%rax,%rax,1) > 18: 00 00 00 > 1b: 66 90 xchg %ax,%ax > 1d: 80 3d 81 4c 0e 00 00 cmpb $0x0,0xe4c81(%rip) # 0xe4ca= 5 > 24: 74 17 je 0x3d > 26: 31 c0 xor %eax,%eax > 28: 0f 05 syscall > 2a:* 48 3d 00 f0 ff ff cmp $0xfffffffffffff000,%rax <= -- trapping instruction > 30: 77 5b ja 0x8d > 32: c3 ret > 33: 66 2e 0f 1f 84 00 00 cs nopw 0x0(%rax,%rax,1) > 3a: 00 00 00 > 3d: 53 push %rbx > 3e: 48 rex.W > 3f: 83 .byte 0x83 > > Code starting with the faulting instruction > =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D > 0: 48 3d 00 f0 ff ff cmp $0xfffffffffffff000,%rax > 6: 77 5b ja 0x63 > 8: c3 ret > 9: 66 2e 0f 1f 84 00 00 cs nopw 0x0(%rax,%rax,1) > 10: 00 00 00 > 13: 53 push %rbx > 14: 48 rex.W > 15: 83 .byte 0x83 > Jul 5 01:34:20 hho kernel: RSP: 002b:00007ffe56a8adb8 EFLAGS: 00000246 O= RIG_RAX: 0000000000000000 > Jul 5 01:34:20 hho kernel: RAX: ffffffffffffffda RBX: 0000559458207b40 R= CX: 00007f91781d677d > Jul 5 01:34:20 hho kernel: RDX: 0000000000000400 RSI: 0000559458209d30 R= DI: 0000000000000004 > Jul 5 01:34:20 hho kernel: RBP: 00007ffe56a8ae20 R08: 00007f9178276cb2 R= 09: 0000000000000001 > Jul 5 01:34:20 hho kernel: R10: 0000000000001000 R11: 0000000000000246 R= 12: 00007f91782b04a0 > Jul 5 01:34:20 hho kernel: R13: 000055945820a140 R14: 0000000000000a68 R= 15: 00007f91782afba0 > Jul 5 01:34:20 hho kernel: > Jul 5 01:34:20 hho kernel: Modules linked in: sch_fq_codel bpf_preload m= ousedev snd_ctl_led iwlmvm snd_hda_codec_realtek amdgpu pkcs8_key_parser sn= d_hda_codec_generic mac80211 libarc4 drm_ttm_helper snd_hda_codec_hdmi ttm = iommu_v2 uvcvideo gpu_sched videobuf2_vmalloc i2c_algo_bit videobuf2_memops= snd_hda_intel drm_buddy uvc edac_mce_amd snd_intel_dspcfg crct10dif_pclmul= videobuf2_v4l2 drm_suballoc_helper crc32_pclmul lm92 snd_hda_codec drm_dis= play_helper crc32c_intel videodev snd_hwdep ghash_clmulni_intel r8169 drive= temp cec sha512_ssse3 thinkpad_acpi snd_hda_core videobuf2_common psmouse r= ealtek iwlwifi drm_kms_helper rapl ledtrig_audio snd_pcm mc serio_raw snd_r= n_pci_acp3x platform_profile syscopyarea wmi_bmof mdio_devres k10temp ipmi_= devintf snd_timer snd_acp_config sysfillrect cfg80211 drm ucsi_acpi sysimgb= lt snd snd_soc_acpi libphy i2c_piix4 ipmi_msghandler snd_pci_acp3x typec_uc= si soundcore rfkill video roles typec battery ac wmi i2c_scmi button > Jul 5 01:34:20 hho kernel: CR2: 0000000000000052 > Jul 5 01:34:20 hho kernel: ---[ end trace 0000000000000000 ]--- > Jul 5 01:34:20 hho kernel: RIP: wq_worker_comm+0x63/0xc0 > Jul 5 01:34:20 hho kernel: Code: 43 2c 20 75 1d 5b 5d 48 c7 c7 e0 a4 43 8= 2 41 5c 41 5d 41 5e e9 2e 59 8b 00 5b 5d 41 5c 41 5d 41 5e c3 48 89 df e8 a= d 35 00 00 <4c> 8b 70 48 48 89 c3 4d 85 f6 74 cf 4c 89 f7 e8 d9 a3 8b 00 80= 7b > All code > =3D=3D=3D=3D=3D=3D=3D=3D > 0: 43 2c 20 rex.XB sub $0x20,%al > 3: 75 1d jne 0x22 > 5: 5b pop %rbx > 6: 5d pop %rbp > 7: 48 c7 c7 e0 a4 43 82 mov $0xffffffff8243a4e0,%rdi > e: 41 5c pop %r12 > 10: 41 5d pop %r13 > 12: 41 5e pop %r14 > 14: e9 2e 59 8b 00 jmp 0x8b5947 > 19: 5b pop %rbx > 1a: 5d pop %rbp > 1b: 41 5c pop %r12 > 1d: 41 5d pop %r13 > 1f: 41 5e pop %r14 > 21: c3 ret > 22: 48 89 df mov %rbx,%rdi > 25: e8 ad 35 00 00 call 0x35d7 > 2a:* 4c 8b 70 48 mov 0x48(%rax),%r14 <-- trapp= ing instruction > 2e: 48 89 c3 mov %rax,%rbx > 31: 4d 85 f6 test %r14,%r14 > 34: 74 cf je 0x5 > 36: 4c 89 f7 mov %r14,%rdi > 39: e8 d9 a3 8b 00 call 0x8ba417 > 3e: 80 .byte 0x80 > 3f: 7b .byte 0x7b > > Code starting with the faulting instruction > =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D > 0: 4c 8b 70 48 mov 0x48(%rax),%r14 > 4: 48 89 c3 mov %rax,%rbx > 7: 4d 85 f6 test %r14,%r14 > a: 74 cf je 0xffffffffffffffdb > c: 4c 89 f7 mov %r14,%rdi > f: e8 d9 a3 8b 00 call 0x8ba3ed > 14: 80 .byte 0x80 > 15: 7b .byte 0x7b > Jul 5 01:34:20 hho kernel: RSP: 0018:ffffc90001027bb8 EFLAGS: 00010202 > Jul 5 01:34:20 hho kernel: RAX: 000000000000000a RBX: ffff888111052640 R= CX: 0001020304050608 > Jul 5 01:34:20 hho kernel: RDX: ffff88810490b300 RSI: 7fffffffffffffff R= DI: ffff888111052640 > Jul 5 01:34:20 hho kernel: RBP: 000000000000000f R08: ffffc90001027be8 R= 09: 0000000000000040 > Jul 5 01:34:20 hho kernel: R10: fefefefefefefeff R11: 0000000000000040 R= 12: ffffc90001027be8 > Jul 5 01:34:20 hho kernel: R13: 0000000000000040 R14: 000000000000000c R= 15: 0000000000000001 > Jul 5 01:34:20 hho kernel: FS: 00007f917809a740(0000) GS:ffff8887ef6000= 00(0000) knlGS:0000000000000000 > Jul 5 01:34:20 hho kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005= 0033 > Jul 5 01:34:20 hho kernel: CR2: 0000000000000052 CR3: 0000000107562000 C= R4: 0000000000350ee0 > Jul 5 01:34:20 hho kernel: note: start-stop-daem[1716] exited with irqs = disabled > Jul 5 01:34:20 hho kernel: Generic FE-GE Realtek PHY r8169-0-200:00: att= ached PHY driver (mii_bus:phy_addr=3Dr8169-0-200:00, irq=3DMAC) > Jul 5 01:34:21 hho kernel: r8169 0000:02:00.0 eth0: Link is Down > Jul 5 01:34:23 hho kernel: r8169 0000:02:00.0 eth0: Link is Up - 1Gbps/F= ull - flow control rx/tx > Jul 5 01:34:23 hho kernel: IPv6: ADDRCONF(NETDEV_CHANGE): eth0: link bec= omes ready > > The crashing process was openrc's start-stop-daemon starting acpid, thoug= h I think > both are just the victims here. > > Hope this helps! > > cheers > Holger