* kernel BUG at include/linux/swapops.h:204!
@ 2021-07-10  7:33 Igor Raits
  2021-07-10 12:46 ` Hillf Danton
  2021-07-11  4:17 ` Hugh Dickins
From: Igor Raits @ 2021-07-10  7:33 UTC
  To: linux-mm, Andrew Morton

Hello,

I've hit a strange bug on 5.12.14. It has happened a couple of times when I
started a bunch of VMs on a server.

I briefly searched for this problem but could not find any relevant commit
that would fix it.

Do you have any hints on how to debug this further, or do you happen to know
of a fix?

Thanks in advance. The stack trace follows:

[  376.876610] ------------[ cut here ]------------
[  376.881274] kernel BUG at include/linux/swapops.h:204!
[  376.886455] invalid opcode: 0000 [#1] SMP NOPTI
[  376.891014] CPU: 40 PID: 11775 Comm: rpc-worker Tainted: G            E     5.12.14-1.gdc.el8.x86_64 #1
[  376.900464] Hardware name: HPE ProLiant DL380 Gen10/ProLiant DL380 Gen10, BIOS U30 05/24/2021
[  376.909038] RIP: 0010:pmd_migration_entry_wait+0x132/0x140
[  376.914562] Code: 02 00 00 00 5b 4c 89 c7 5d e9 8a e4 f6 ff 48 81 e2 00 f0 ff ff 48 f7 d2 48 21 c2 89 d1 f7 c2 81 01 00 00 75 80 e9 44 ff ff ff <0f> 0b 48 8b 2d 75 bd 30 01 e9 ef fe ff ff 0f 1f 44 00 00 41 55 48
[  376.933443] RSP: 0000:ffffb65a5e1cfdc8 EFLAGS: 00010246
[  376.938701] RAX: 0017ffffc0000000 RBX: ffff908b8ecabaf8 RCX: ffffffffffffffff
[  376.945878] RDX: 0000000000000000 RSI: ffff908b8ecabaf8 RDI: fffff497473b2ae8
[  376.953055] RBP: fffff497473b2ae8 R08: fffff49747fa8080 R09: 0000000000000000
[  376.960230] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000af8
[  376.967407] R13: 0400000000000000 R14: 0400000000000080 R15: ffff908bbef7b6a8
[  376.974582] FS:  00007f5bb1f81700(0000) GS:ffff90e87fd80000(0000) knlGS:0000000000000000
[  376.982718] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  376.988497] CR2: 00007f5b2bfffd98 CR3: 00000001f793e006 CR4: 00000000007726e0
[  376.995673] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[  377.002849] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[  377.010026] PKRU: 55555554
[  377.012745] Call Trace:
[  377.015207]  __handle_mm_fault+0x5ad/0x6e0
[  377.019335]  handle_mm_fault+0xc5/0x290
[  377.023194]  do_user_addr_fault+0x1cd/0x740
[  377.027406]  exc_page_fault+0x54/0x110
[  377.031182]  ? asm_exc_page_fault+0x8/0x30
[  377.035307]  asm_exc_page_fault+0x1e/0x30
[  377.039340] RIP: 0033:0x7f5bb91d6734
[  377.042937] Code: 89 08 48 8b 35 dd 3b 21 00 4c 8d 0d d6 3b 21 00 31 c0 4c 39 ce 74 73 0f 1f 80 00 00 00 00 48 8d 96 40 fd ff ff 49 39 d2 74 22 <48> 8b 96 d8 03 00 00 48 01 15 4e 7c 21 00 80 be 50 03 00 00 00 c7
[  377.061820] RSP: 002b:00007f5bb1f7ff58 EFLAGS: 00010206
[  377.067076] RAX: 0000000000000000 RBX: 0000000000000000 RCX: 00007f5ba0000020
[  377.074255] RDX: 00007f5b2bfff700 RSI: 00007f5b2bfff9c0 RDI: 0000000000000001
[  377.081429] RBP: 0000000000000001 R08: 0000000000000000 R09: 00007f5bb93ea2f0
[  377.088606] R10: 00007f5bb1f81700 R11: 0000000000000202 R12: 0000000000000001
[  377.095782] R13: 0000000000000006 R14: 0000000000000cb4 R15: 00007f5bb1f801f0
[  377.102958] Modules linked in: ebt_arp(E) nft_meta_bridge(E)
ip6_tables(E) xt_CT(E) nf_log_ipv4(E) nf_log_common(E) nft_limit(E)
nft_counter(E) xt_LOG(E) xt_limit(E) xt_mac(E) xt_set(E) xt_multiport(E)
xt_state(E) xt_conntrack(E) xt_comment(E) xt_physdev(E) nft_compat(E)
ip_set_hash_net(E) ip_set(E) vhost_net(E) vhost(E) vhost_iotlb(E) tap(E)
tun(E) tcp_diag(E) udp_diag(E) inet_diag(E) netconsole(E) nf_tables(E)
vxlan(E) ip6_udp_tunnel(E) udp_tunnel(E) nfnetlink(E) binfmt_misc(E)
iscsi_tcp(E) libiscsi_tcp(E) 8021q(E) garp(E) mrp(E) bonding(E) tls(E)
vfat(E) fat(E) dm_service_time(E) dm_multipath(E) rpcrdma(E) sunrpc(E)
rdma_ucm(E) ib_srpt(E) ib_isert(E) iscsi_target_mod(E) target_core_mod(E)
ib_iser(E) rdma_cm(E) iw_cm(E) ib_cm(E) libiscsi(E) scsi_transport_iscsi(E)
intel_rapl_msr(E) qedr(E) intel_rapl_common(E) ib_uverbs(E)
isst_if_common(E) ib_core(E) nfit(E) libnvdimm(E) x86_pkg_temp_thermal(E)
intel_powerclamp(E) coretemp(E) kvm_intel(E) kvm(E) irqbypass(E)
crct10dif_pclmul(E)
[  377.102999]  crc32_pclmul(E) ghash_clmulni_intel(E) rapl(E)
intel_cstate(E) ipmi_ssif(E) acpi_ipmi(E) ipmi_si(E) mei_me(E) ioatdma(E)
ipmi_devintf(E) dm_mod(E) ses(E) intel_uncore(E) pcspkr(E) qede(E)
enclosure(E) tg3(E) mei(E) lpc_ich(E) hpilo(E) hpwdt(E)
intel_pch_thermal(E) dca(E) ipmi_msghandler(E) acpi_power_meter(E) ext4(E)
mbcache(E) jbd2(E) sd_mod(E) t10_pi(E) sg(E) qedf(E) qed(E) crc8(E)
libfcoe(E) libfc(E) smartpqi(E) scsi_transport_fc(E) scsi_transport_sas(E)
wmi(E) nf_conntrack(E) nf_defrag_ipv6(E) libcrc32c(E) crc32c_intel(E)
nf_defrag_ipv4(E) br_netfilter(E) bridge(E) stp(E) llc(E)
[  377.243468] ---[ end trace 04bce3bb051f7620 ]---
[  377.385645] RIP: 0010:pmd_migration_entry_wait+0x132/0x140
[  377.391194] Code: 02 00 00 00 5b 4c 89 c7 5d e9 8a e4 f6 ff 48 81 e2 00 f0 ff ff 48 f7 d2 48 21 c2 89 d1 f7 c2 81 01 00 00 75 80 e9 44 ff ff ff <0f> 0b 48 8b 2d 75 bd 30 01 e9 ef fe ff ff 0f 1f 44 00 00 41 55 48
[  377.410091] RSP: 0000:ffffb65a5e1cfdc8 EFLAGS: 00010246
[  377.415355] RAX: 0017ffffc0000000 RBX: ffff908b8ecabaf8 RCX: ffffffffffffffff
[  377.422540] RDX: 0000000000000000 RSI: ffff908b8ecabaf8 RDI: fffff497473b2ae8
[  377.429721] RBP: fffff497473b2ae8 R08: fffff49747fa8080 R09: 0000000000000000
[  377.436902] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000af8
[  377.444086] R13: 0400000000000000 R14: 0400000000000080 R15: ffff908bbef7b6a8
[  377.451272] FS:  00007f5bb1f81700(0000) GS:ffff90e87fd80000(0000) knlGS:0000000000000000
[  377.459415] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  377.465196] CR2: 00007f5b2bfffd98 CR3: 00000001f793e006 CR4: 00000000007726e0
[  377.472377] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[  377.479556] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[  377.486738] PKRU: 55555554
[  377.489465] Kernel panic - not syncing: Fatal exception
[  377.573911] Kernel Offset: 0xa000000 from 0xffffffff81000000 (relocation range: 0xffffffff80000000-0xffffffffbfffffff)
[  377.716482] ---[ end Kernel panic - not syncing: Fatal exception ]---
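
For triage, the fields that identify a crash like this one (the BUG location, the faulting kernel function, and the task that hit it) can be pulled out of the log mechanically. The following is a minimal sketch in Python, assuming the oops text is available as a string; `summarize_oops` and the regular expressions are illustrative names chosen here, not part of any kernel tooling.

```python
import re

# Illustrative patterns (assumptions, not an official oops grammar).
BUG_RE = re.compile(r"kernel BUG at (?P<file>\S+):(?P<line>\d+)!")
# Matches only symbolized kernel RIP lines such as
# "RIP: 0010:func+0x132/0x140"; a raw user-space RIP has no "+offset/size".
RIP_RE = re.compile(
    r"RIP: [0-9a-f]{4}:(?P<func>\w+)\+(?P<offset>0x[0-9a-f]+)/(?P<size>0x[0-9a-f]+)"
)
TASK_RE = re.compile(r"CPU: (?P<cpu>\d+) PID: (?P<pid>\d+) Comm: (?P<comm>\S+)")


def summarize_oops(log: str) -> dict:
    """Return the first BUG location, kernel RIP and task found in `log`."""
    summary = {}
    for pattern in (BUG_RE, RIP_RE, TASK_RE):
        match = pattern.search(log)
        if match:
            summary.update(match.groupdict())
    return summary


# Three lines copied from the report above.
SAMPLE = """\
[  376.881274] kernel BUG at include/linux/swapops.h:204!
[  376.891014] CPU: 40 PID: 11775 Comm: rpc-worker Tainted: G            E
[  376.909038] RIP: 0010:pmd_migration_entry_wait+0x132/0x140
"""

if __name__ == "__main__":
    for key, value in summarize_oops(SAMPLE).items():
        print(f"{key}: {value}")
```

All values are returned as strings exactly as they appear in the log, so that nothing is lost when comparing reports from different kernels.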

* kernel BUG at include/linux/swapops.h:204!
@ 2022-03-10  4:21 Ryan Tierney
  2022-03-10  7:35 ` Greg KH
From: Ryan Tierney @ 2022-03-10  4:21 UTC
  To: stable; +Cc: regressions

Hi Linux kernel team,

We have had multiple servers lock up over the course of a few months on
kernel version 5.10.92.
We are looking for any information that would help mitigate this.

I've also found a related report from 2021:
https://lore.kernel.org/all/757b684a-67b5-999b-7f2d-b55fb1c61fd8@google.com/T/

Please see the stack trace obtained during this lockup:

  [2136172.975892] ------------[ cut here ]------------
  [2136172.975896] kernel BUG at include/linux/swapops.h:204!
  [2136172.981268] invalid opcode: 0000 [#1] SMP NOPTI
  [2136172.985983] CPU: 49 PID: 1672949 Comm: apt-cache Not tainted 5.10.0-11-amd64 #1 Debian 5.10.92-1
  [2136172.994944] Hardware name: Supermicro AS -1114S-WN10RT/H12SSW-NTR, BIOS 2.3 10/28/2021
  [2136173.003042] RIP: e030:__migration_entry_wait+0xf9/0x100
  [2136173.008444] Code: 0f 45 c2 41 8b 40 34 85 c0 74 9f 8d 50 01 f0 41 0f b1 50 34 75 f1 48 89 ef e8 03 e0 e4 ff 66 90 5b 4c 89 c7 5d e9 27 48 f7 ff <0f> 0b 0f 1f 44 00 00 0f 1f 44 00 00 49 89 f9 48 8b 3e e8 08 39 d8
  [2136173.027394] RSP: e02b:ffffc90075a4fdb8 EFLAGS: 00010246
  [2136173.032795] RAX: 000fffffc0000000 RBX: ffff88807b0e6968 RCX: ffffea00006a6f87
  [2136173.040114] RDX: 0000000000000000 RSI: ffff88807b0e6968 RDI: fffffffffcac8400
  [2136173.047429] RBP: ffffea0001ec39a8 R08: ffffea00006a6f40 R09: ffff88804679f2c0
  [2136173.054745] R10: 000ffffffffff000 R11: 0000000000000000 R12: ffffc90075a4fe40
  [2136173.062060] R13: 7c0000000001a9bd R14: fff0000000000fff R15: 0000000000000000
  [2136173.069378] FS:  00007f2cc7eee800(0000) GS:ffff888117440000(0000) knlGS:0000000000000000
  [2136173.077651] CS:  e030 DS: 0000 ES: 0000 CR0: 0000000080050033
  [2136173.083575] CR2: 00007f2cc4b2df8c CR3: 000000008d396000 CR4: 0000000000050660
  [2136173.090893] Call Trace:
  [2136173.093527]  do_swap_page+0x66f/0x900
  [2136173.097370]  handle_mm_fault+0xd7d/0x1bf0
  [2136173.101584]  ? xfs_file_read_iter+0x6e/0xd0 [xfs]
  [2136173.106468]  do_user_addr_fault+0x1b8/0x3f0
  [2136173.110835]  exc_page_fault+0x78/0x160
  [2136173.114766]  ? asm_exc_page_fault+0x8/0x30
  [2136173.119038]  asm_exc_page_fault+0x1e/0x30
  [2136173.123224] RIP: e033:0x7f2cc8930fcb
  [2136173.126977] Code: 8b 46 08 48 8d 14 80 48 8d 04 50 48 8d 44 85 00 48 39 c3 74 5a 8b 43 10 48 8d 14 80 48 8d 04 50 48 8d 5c 85 00 48 39 eb 74 45 <8b> 33 4c 01 c6 0f b7 46 fe 49 39 c4 75 c7 4c 89 e2 4c 89 44 24 08
  [2136173.145919] RSP: e02b:00007ffd2a3f3e80 EFLAGS: 00010206
  [2136173.151323] RAX: 0000000000000005 RBX: 00007f2cc4b2df8c RCX: 0000000000000006
  [2136173.158632] RDX: 00007f2cc351b2e6 RSI: 00007ffd2a3f3fd0 RDI: 00007f2cc351b2e6
  [2136173.165948] RBP: 00007f2cc339b000 R08: 00007f2cc339b000 R09: 0000000000000de0
  [2136173.173262] R10: 00007f2cc4c8bc7c R11: 00000000206e6f68 R12: 0000000000000005
  [2136173.180580] R13: 00007ffd2a3f3f20 R14: 00007f2cc4b0d410 R15: 000055ab3740af08
  [2136173.187892] Modules linked in: xen_acpi_processor xen_gntdev xen_evtchn binfmt_misc xenfs xen_privcmd nls_ascii nls_cp437 vfat fat ghash_clmulni_intel aesni_intel libaes crypto_simd cryptd glue_helper wmi_bmof efi_pstore pcspkr ast drm_vram_helper drm_ttm_helper ttm ccp drm_kms_helper rng_core rndis_host cdc_ether cec usbnet i2c_algo_bit joydev mii k10temp sp5100_tco watchdog ipmi_ssif evdev bridge acpi_ipmi ipmi_si 8021q ipmi_devintf garp ipmi_msghandler stp mrp llc button bonding loop psmouse uhci_hcd ohci_hcd ehci_hcd drbd lru_cache drm fuse configfs efivarfs ip_tables x_tables autofs4 xfs raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor hid_generic usbhid hid raid6_pq libcrc32c crc32c_generic raid0 multipath linear raid1 raid10 md_mod crc32_pclmul crc32c_intel ahci libahci xhci_pci nvme xhci_hcd libata nvme_core i40e t10_pi usbcore bnxt_en scsi_mod crc_t10dif crct10dif_generic crct10dif_pclmul crct10dif_common ptp usb_common i2c_piix4 pps_core wmi
  [2136173.274260] ---[ end trace 37e8c1af7f6e782a ]---
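
Logs that have passed through a JSON encoder often arrive with forward slashes escaped as `\/`, which makes them harder to compare with other reports. The sketch below, in Python, normalizes such lines and lists the frames of the call trace, marking frames printed with a leading `?` (which the kernel uses for entries it could not verify during unwinding). `call_trace` and `FRAME_RE` are hypothetical names chosen for illustration, and the frame pattern is an assumption based on the x86 trace format shown in this report.

```python
import re

# One call-trace frame: optional "? " marker, symbol+offset/size,
# optional "[module]" suffix. Assumed format, based on the traces above.
FRAME_RE = re.compile(
    r"\]\s+(?P<unreliable>\? )?(?P<func>[\w.]+)\+0x[0-9a-f]+/0x[0-9a-f]+"
    r"(?: \[(?P<module>\w+)\])?\s*$"
)


def call_trace(log: str) -> list:
    """Return (function, reliable, module) for each frame after 'Call Trace:'."""
    frames = []
    in_trace = False
    for raw in log.splitlines():
        line = raw.replace("\\/", "/")  # undo JSON-style slash escaping
        if "Call Trace:" in line:
            in_trace = True
            continue
        if not in_trace:
            continue
        match = FRAME_RE.search(line)
        if match is None:
            break  # the first non-frame line ends the trace
        frames.append(
            (match["func"], match["unreliable"] is None, match["module"])
        )
    return frames


# Lines in the escaped form in which such a log may arrive.
SAMPLE = r"""
  [2136173.090893] Call Trace:
  [2136173.093527]  do_swap_page+0x66f\/0x900
  [2136173.097370]  handle_mm_fault+0xd7d\/0x1bf0
  [2136173.101584]  ? xfs_file_read_iter+0x6e\/0xd0 [xfs]
  [2136173.123224] RIP: e033:0x7f2cc8930fcb
"""

if __name__ == "__main__":
    for func, reliable, module in call_trace(SAMPLE):
        marker = " " if reliable else "?"
        suffix = f" [{module}]" if module else ""
        print(f"{marker} {func}{suffix}")
```

Comparing the reliable frames of two reports is a quick way to see whether they reach the BUG through the same path; here the 2022 trace arrives via `do_swap_page`, while the 2021 trace goes through `__handle_mm_fault` into `pmd_migration_entry_wait`.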



Thread overview: 28+ messages
2021-07-10  7:33 kernel BUG at include/linux/swapops.h:204! Igor Raits
2021-07-10 12:46 ` Hillf Danton
2021-07-11  4:17 ` Hugh Dickins
2021-07-11  6:06   ` Igor Raits
2021-07-15 17:47     ` Igor Raits
2021-07-16 19:45       ` Hugh Dickins
2021-07-19 19:11         ` Hugh Dickins
2021-07-19 22:12           ` Peter Xu
2021-07-19 22:42             ` Hugh Dickins
2021-07-20  0:34               ` Peter Xu
2021-07-20  3:31                 ` Hugh Dickins
2021-07-20  7:47             ` Igor Raits
2021-07-20 16:01               ` Peter Xu
2021-07-20 16:05                 ` Igor Raits
2021-07-20 15:51           ` [PATCH stable 5.13.y/5.12.y 0/2] mm/thp: Fix uffd-wp with fork(); crash on pmd migration entry on fork Peter Xu
2021-07-20 15:51             ` [PATCH stable 5.13.y/5.12.y 1/2] mm/thp: simplify copying of huge zero page pmd when fork Peter Xu
2021-07-20 15:51             ` [PATCH stable 5.13.y/5.12.y 2/2] mm/userfaultfd: fix uffd-wp special cases for fork() Peter Xu
2021-07-20 20:32             ` [PATCH stable 5.13.y/5.12.y 0/2] mm/thp: Fix uffd-wp with fork(); crash on pmd migration entry on fork Hugh Dickins
2021-07-20 20:32               ` Hugh Dickins
2021-07-22 14:02               ` Greg KH
2021-07-20 15:56           ` [PATCH stable 5.10.y " Peter Xu
2021-07-20 15:56             ` [PATCH stable 5.10.y 1/2] mm/thp: simplify copying of huge zero page pmd when fork Peter Xu
2021-07-20 15:56             ` [PATCH stable 5.10.y 2/2] mm/userfaultfd: fix uffd-wp special cases for fork() Peter Xu
2021-07-20 20:38             ` [PATCH stable 5.10.y 0/2] mm/thp: Fix uffd-wp with fork(); crash on pmd migration entry on fork Hugh Dickins
2021-07-20 20:38               ` Hugh Dickins
2021-07-22 14:05               ` Greg KH
2022-03-10  4:21 kernel BUG at include/linux/swapops.h:204! Ryan Tierney
2022-03-10  7:35 ` Greg KH
