Kernel BUG when registering the ring

* Kernel BUG when registering the ring
@ 2020-02-11  1:22 Glauber Costa
  2020-02-11  3:25 ` Jens Axboe
  0 siblings, 1 reply; 10+ messages in thread
From: Glauber Costa @ 2020-02-11  1:22 UTC (permalink / raw)
  To: io-uring, Avi Kivity, Jens Axboe

[-- Attachment #1: Type: text/plain, Size: 430 bytes --]

Hello my dear io_uring friends

Today I tried to run my tests on a different, more powerful hardware
(70+ Hyperthreads) and it crashed on creating the ring.

I don't recall anything fancy in my code for creating the ring -
except maybe that I size the cq ring differently than the sq ring.

The backtrace attached leads me to believe that we just accessed a
null struct somewhere

Hash is ba2db2d4d262f7ccf6fe86b00c3538056d7c5218

[-- Attachment #2: creation.txt --]
[-- Type: text/plain, Size: 5666 bytes --]

[  894.918927] XFS (nvme0n1p1): Mounting V5 Filesystem
[  894.928964] XFS (nvme0n1p1): Ending clean mount
[  894.930111] xfs filesystem being mounted at /var/disk1 supports timestamps until 2038 (0x7fffffff)
[  901.000820] BUG: unable to handle page fault for address: 0000000000002088
[  901.000887] #PF: supervisor read access in kernel mode
[  901.000927] #PF: error_code(0x0000) - not-present page
[  901.000969] PGD 174101b067 P4D 174101b067 PUD 17c30cd067 PMD 0 
[  901.001019] Oops: 0000 [#1] SMP NOPTI
[  901.001052] CPU: 40 PID: 2144 Comm: io_tester Not tainted 5.5.0+ #6
[  901.001101] Hardware name: Intel Corporation S2600WFT/S2600WFT, BIOS SE5C620.86B.02.01.0008.031920191559 03/19/2019
[  901.001187] RIP: 0010:__alloc_pages_nodemask+0x132/0x340
[  901.001231] Code: 18 01 75 04 41 80 ce 80 89 e8 48 8b 54 24 08 8b 74 24 1c c1 e8 0c 48 8b 3c 24 83 e0 01 88 44 24 20 48 85 d2 0f 85 74 01 00 00 <3b> 77 08 0f 82 6b 01 00 00 48 89 7c 24 10 89 ea 48 8b 07 b9 00 02
[  901.001370] RSP: 0018:ffffb8be4d0b7c28 EFLAGS: 00010246
[  901.001413] RAX: 0000000000000000 RBX: 0000000000000000 RCX: 000000000000e8e8
[  901.001466] RDX: 0000000000000000 RSI: 0000000000000002 RDI: 0000000000002080
[  901.001499] RBP: 0000000000012cc0 R08: 0000000000000000 R09: 0000000000000002
[  901.001516] R10: 0000000000000dc0 R11: ffff995c60400100 R12: 0000000000000000
[  901.001534] R13: 0000000000012cc0 R14: 0000000000000001 R15: ffff995c60db00f0
[  901.001552] FS:  00007f4d115ca900(0000) GS:ffff995c60d80000(0000) knlGS:0000000000000000
[  901.001572] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  901.001586] CR2: 0000000000002088 CR3: 00000017cca66002 CR4: 00000000007606e0
[  901.001604] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[  901.001622] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[  901.001640] PKRU: 55555554
[  901.001647] Call Trace:
[  901.001663]  alloc_slab_page+0x46/0x320
[  901.001676]  new_slab+0x9d/0x4e0
[  901.001687]  ___slab_alloc+0x507/0x6a0
[  901.001702]  ? io_wq_create+0xb4/0x2a0
[  901.001713]  __slab_alloc+0x1c/0x30
[  901.001725]  kmem_cache_alloc_node_trace+0xa6/0x260
[  901.001738]  io_wq_create+0xb4/0x2a0
[  901.001750]  io_uring_setup+0x97f/0xaa0
[  901.001762]  ? io_remove_personalities+0x30/0x30
[  901.001776]  ? io_poll_trigger_evfd+0x30/0x30
[  901.001791]  do_syscall_64+0x5b/0x1c0
[  901.001806]  entry_SYSCALL_64_after_hwframe+0x44/0xa9
[  901.001820] RIP: 0033:0x7f4d116cb1ed
[  901.001831] Code: 00 c3 66 2e 0f 1f 84 00 00 00 00 00 90 f3 0f 1e fa 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 6b 5c 0c 00 f7 d8 64 89 01 48
[  901.001875] RSP: 002b:00007fff641ddf58 EFLAGS: 00000202 ORIG_RAX: 00000000000001a9
[  901.001894] RAX: ffffffffffffffda RBX: 00007fff641de0f0 RCX: 00007f4d116cb1ed
[  901.001911] RDX: 0000000000000000 RSI: 00007fff641ddfb0 RDI: 0000000000000080
[  901.001928] RBP: 0000000000000080 R08: 0000000000000000 R09: 0000600000081c20
[  901.002471] R10: 00007f4d115c9800 R11: 0000000000000202 R12: 00007fff641ddfb0
[  901.002971] R13: 00007fff641de0f0 R14: 00007fff641de0c0 R15: 00007fff641de4e8
[  901.003470] Modules linked in: ip6t_REJECT nf_reject_ipv6 ip6t_rpfilter ipt_REJECT nf_reject_ipv4 xt_conntrack ebtable_nat ebtable_broute ip6table_nat ip6table_mangle ip6table_raw ip6table_security iptable_nat nf_nat iptable_mangle iptable_raw iptable_security nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 ip_set nfnetlink ebtable_filter ebtables ip6table_filter ip6_tables iptable_filter ib_isert iscsi_target_mod ib_srpt target_core_mod ib_srp scsi_transport_srp ib_ipoib vfat fat ib_umad intel_rapl_msr intel_rapl_common rpcrdma isst_if_common sunrpc rdma_ucm ib_iser skx_edac rdma_cm x86_pkg_temp_thermal intel_powerclamp iw_cm coretemp kvm_intel ib_cm libiscsi scsi_transport_iscsi kvm irqbypass crct10dif_pclmul crc32_pclmul i40iw ghash_clmulni_intel ib_uverbs iTCO_wdt iTCO_vendor_support intel_cstate ib_core ipmi_ssif intel_uncore joydev intel_rapl_perf mei_me i2c_i801 ioatdma switchtec pcspkr lpc_ich mei ipmi_si dca ipmi_devintf ipmi_msghandler dax_pmem dax_pmem_core acpi_power_meter acpi_pad
[  901.003504]  ip_tables xfs libcrc32c rfkill nd_pmem nd_btt ast i2c_algo_bit drm_vram_helper drm_ttm_helper ttm drm_kms_helper cec drm i40e megaraid_sas crc32c_intel nvme nvme_core nfit libnvdimm wmi pkcs8_key_parser
[  901.009213] CR2: 0000000000002088
[  901.009814] ---[ end trace 2bb8b12f7dc58981 ]---
[  901.109907] RIP: 0010:__alloc_pages_nodemask+0x132/0x340
[  901.110546] Code: 18 01 75 04 41 80 ce 80 89 e8 48 8b 54 24 08 8b 74 24 1c c1 e8 0c 48 8b 3c 24 83 e0 01 88 44 24 20 48 85 d2 0f 85 74 01 00 00 <3b> 77 08 0f 82 6b 01 00 00 48 89 7c 24 10 89 ea 48 8b 07 b9 00 02
[  901.111761] RSP: 0018:ffffb8be4d0b7c28 EFLAGS: 00010246
[  901.112368] RAX: 0000000000000000 RBX: 0000000000000000 RCX: 000000000000e8e8
[  901.112982] RDX: 0000000000000000 RSI: 0000000000000002 RDI: 0000000000002080
[  901.113596] RBP: 0000000000012cc0 R08: 0000000000000000 R09: 0000000000000002
[  901.114206] R10: 0000000000000dc0 R11: ffff995c60400100 R12: 0000000000000000
[  901.114821] R13: 0000000000012cc0 R14: 0000000000000001 R15: ffff995c60db00f0
[  901.115418] FS:  00007f4d115ca900(0000) GS:ffff995c60d80000(0000) knlGS:0000000000000000
[  901.116019] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  901.116619] CR2: 0000000000002088 CR3: 00000017cca66002 CR4: 00000000007606e0
[  901.117233] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[  901.117839] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[  901.118444] PKRU: 55555554

^ permalink raw reply	[flat|nested] 10+ messages in thread