All of lore.kernel.org
 help / color / mirror / Atom feed
From: Ming Lei <ming.lei@redhat.com>
To: Yi Zhang <yi.zhang@redhat.com>
Cc: Ming Lei <minlei@redhat.com>, linux-block@vger.kernel.org
Subject: Re: Fwd: [bug report] WARNING at block/genhd.c:537 device_add_disk+0x1ad/0x390 observed with blktests block/001
Date: Wed, 15 Sep 2021 19:26:14 +0800	[thread overview]
Message-ID: <YUHYVtMXADA3aicd@T590> (raw)
In-Reply-To: <CAHj4cs8EUAET8X2LKHnG2yVwqkGuAV2Jb37W1z1pr7KK64yNfg@mail.gmail.com>

On Wed, Sep 15, 2021 at 07:13:55PM +0800, Yi Zhang wrote:
> Hi Ming
> Just want let you know I found this issue during blktests block/001 on
> storage-qe-36 server, I send this report to linux-block, but the mail was
> blocked. :(

That is sad, :-(

BTW, the following patch can fix the issue:

https://lore.kernel.org/linux-block/20210915092547.990285-1-ming.lei@redhat.com/T/#u


> 
> ---------- Forwarded message ---------
> From: Yi Zhang <yi.zhang@redhat.com>
> Date: Wed, Sep 15, 2021 at 1:28 PM
> Subject: [bug report] WARNING at block/genhd.c:537
> device_add_disk+0x1ad/0x390 observed with blktests block/001
> To: linux-block <linux-block@vger.kernel.org>
> 
> 
> Hello
> 
> The following kernel warnings/panic triggered on 5.15.0-rc1 with blktests
> block/001, feel free to let me know if you need more info.
> 
> [  138.287801] run blktests block/001 at 2021-09-15 01:15:33
> [  138.310004] scsi_debug:sdebug_driver_probe: scsi_debug: trim poll_queues
> to 0. poll_q/nr_hw = (0/1)
> [  138.319283] sd 15:0:0:0: Power-on or device reset occurred
> [  138.319292] scsi_debug:sdebug_driver_probe: scsi_debug: trim poll_queues
> to 0. poll_q/nr_hw = (0/1)
> [  138.333918] scsi_debug:sdebug_driver_probe: scsi_debug: trim poll_queues
> to 0. poll_q/nr_hw = (0/1)
> [  138.334012] sd 16:0:0:0: Power-on or device reset occurred
> [  138.348549] scsi_debug:sdebug_driver_probe: scsi_debug: trim poll_queues
> to 0. poll_q/nr_hw = (0/1)
> [  138.348629] sd 17:0:0:0: Power-on or device reset occurred
> [  138.363189] sd 18:0:0:0: Power-on or device reset occurred
> [  138.461052] ------------[ cut here ]------------
> [  138.465670] WARNING: CPU: 1 PID: 1161 at block/genhd.c:537
> device_add_disk+0x1ad/0x390
> [  138.473591] Modules linked in: dm_service_time scsi_debug
> rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache
> netfs rfkill sunrpc vfat fat dm_multipath intel_rapl_msr intel_rapl_common
> isst_if_common skx_edac x86_pkg_temp_thermal intel_powerclamp coretemp
> kvm_intel kvm mgag200 i2c_algo_bit drm_kms_helper irqbypass iTCO_wdt
> crct10dif_pclmul iTCO_vendor_support crc32_pclmul ipmi_ssif syscopyarea
> sysfillrect sysimgblt ghash_clmulni_intel fb_sys_fops rapl acpi_ipmi drm
> ipmi_si intel_cstate mei_me ipmi_devintf dax_pmem_compat intel_uncore mei
> i2c_i801 pcspkr wmi_bmof nd_pmem device_dax lpc_ich ipmi_msghandler
> intel_pch_thermal i2c_smbus nd_btt dax_pmem_core acpi_power_meter xfs
> libcrc32c sd_mod t10_pi sg ahci libahci libata nfit tg3 megaraid_sas
> crc32c_intel libnvdimm wmi dm_mirror dm_region_hash dm_log dm_mod
> [  138.533963] BUG: kernel NULL pointer dereference, address:
> 0000000000000098
> [  138.546449] CPU: 1 PID: 1161 Comm: multipathd Tainted: G S        I
>   5.15.0-rc1 #1
> [  138.553403] #PF: supervisor read access in kernel mode
> [  138.553404] #PF: error_code(0x0000) - not-present page
> [  138.553405] PGD 0 P4D 0
> [  138.561405] Hardware name: Dell Inc. PowerEdge R640/06NR82, BIOS 2.11.2
> 004/21/2021
> [  138.566542]
> [  138.566543] Oops: 0000 [#1] SMP NOPTI
> [  138.566545] CPU: 2 PID: 0 Comm: swapper/2 Tainted: G S        I
> 5.15.0-rc1 #1
> [  138.571683] RIP: 0010:device_add_disk+0x1ad/0x390
> [  138.574221] Hardware name: Dell Inc. PowerEdge R640/06NR82, BIOS 2.11.2
> 004/21/2021
> [  138.574222] RIP: 0010:wb_timer_fn+0x37/0x320
> [  138.581872] Code: 81 3b 03 01 00 00 0f 85 39 ff ff ff e9 25 ff ff ff 0f
> 0b bd ea ff ff ff e9 30 ff ff ff 4c 89 ef e8 a8 b6 1b 00 e9 f8 fe ff ff
> <0f> 0b bd 01 00 00 00 e9 17 ff ff ff 0f 0b bd ea ff ff ff e9 0b ff
> [  138.583363] Code: 48 8b 5f 60 4c 8b 67 50 8b ab 98 00 00 00 8b 93 b8 00
> 00 00 8b 83 d8 00 00 00 4c 8b 6b 28 01 d5 01 c5 48 8b 43 60 48 8b 40 78
> <4c> 8b b0 98 00 00 00 4d 85 ed 0f 84 c0 00 00 00 48 83 7b 30 00 0f
> [  138.583365] RSP: 0018:ffffb4d7c0300e80 EFLAGS: 00010246
> [  138.587029] RSP: 0018:ffffb4d7c0e2bc40 EFLAGS: 00010282
> [  138.594672] RAX: 0000000000000000 RBX: ffff9fd941d74a00 RCX:
> 0000000000000020
> [  138.594674] RDX: 0000000000000000 RSI: 0000000000000000 RDI:
> ffff9fd9418cfd80
> [  138.594674] RBP: 0000000000000000 R08: 0000000000000000 R09:
> 0000000000000000
> [  138.594675] R10: 000000000000001f R11: 00000000000003a8 R12:
> ffff9fd9420a0cc0
> [  138.599381]
> [  138.607032] R13: 0000000000000000 R14: ffff9fd9418cfd90 R15:
> ffff9fe87fa97f70
> [  138.607033] FS:  0000000000000000(0000) GS:ffff9fe87fa80000(0000)
> knlGS:0000000000000000
> [  138.607035] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> [  138.611306] RAX: 0000000000000000 RBX: ffff9fd9e4187400 RCX:
> dead000000000122
> [  138.630049] CR2: 0000000000000098 CR3: 000000010371e004 CR4:
> 00000000007706e0
> [  138.630051] DR0: 0000000000000000 DR1: 0000000000000000 DR2:
> 0000000000000000
> [  138.630051] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7:
> 0000000000000400
> [  138.630052] PKRU: 55555554
> [  138.648797] RDX: ffff9fd98299bf00 RSI: ffffffffa8eac593 RDI:
> 0000000000000000
> [  138.654021] Call Trace:
> [  138.654023]  <IRQ>
> [  138.654025]  ? blk_stat_free_callback_rcu+0x30/0x30
> [  138.659248] RBP: 00000000fffffffe R08: 0000000000000001 R09:
> ffffffffa8dfdb00
> [  138.666378]  call_timer_fn+0x24/0xf0
> [  138.673512] R10: ffff9fd94f701780 R11: 0000000000000001 R12:
> ffff9fda40024800
> [  138.680643]  run_timer_softirq+0x1c7/0x3d0
> [  138.680646]  ? update_process_times+0xb0/0xc0
> [  138.687777] R13: ffff9fda40024840 R14: ffff9fda40024800 R15:
> 0000000000000000
> [  138.689276]  ? tick_sched_handle.isra.24+0x1f/0x60
> [  138.696408] FS:  00007f473d28d700(0000) GS:ffff9fe87fa40000(0000)
> knlGS:0000000000000000
> [  138.704494]  ? timerqueue_add+0x6f/0x80
> [  138.704496]  ? enqueue_hrtimer+0x2f/0x70
> [  138.710242] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> [  138.717371]  ? ktime_get+0x3b/0xa0
> [  138.717373]  __do_softirq+0xc6/0x285
> [  138.724505] CR2: 00007f347b9e0590 CR3: 000000010c092006 CR4:
> 00000000007706e0
> [  138.731638]  irq_exit_rcu+0xa6/0xc0
> [  138.738771] DR0: 0000000000000000 DR1: 0000000000000000 DR2:
> 0000000000000000
> [  138.741482]  sysvec_apic_timer_interrupt+0x6e/0x90
> [  138.741485]  </IRQ>
> [  138.748614] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7:
> 0000000000000400
> [  138.751056]  asm_sysvec_apic_timer_interrupt+0x12/0x20
> [  138.753076] PKRU: 55555554
> [  138.757948] RIP: 0010:cpuidle_enter_state+0xd6/0x350
> [  138.765080] Call Trace:
> [  138.768652] Code: 49 89 c4 0f 1f 44 00 00 31 ff e8 05 05 98 ff 45 84 ff
> 74 12 9c 58 f6 c4 02 0f 85 32 02 00 00 31 ff e8 ce a4 9e ff fb 45 85 f6
> <0f> 88 e0 00 00 00 49 63 d6 4c 2b 24 24 48 8d 04 52 48 8d 04 82 49
> [  138.768653] RSP: 0018:ffffb4d7c01bbe80 EFLAGS: 00000202
> [  138.768654] RAX: ffff9fe87faa7c00 RBX: 0000000000000002 RCX:
> 000000000000001f
> [  138.775787]  dm_setup_md_queue+0xbe/0xf0 [dm_mod]
> [  138.779883] RDX: 0000002041447a1c RSI: 000000002c3ddb16 RDI:
> 0000000000000000
> [  138.779884] RBP: ffffd4d7bf680588 R08: 0000000000000002 R09:
> 0000000000027440
> [  138.779884] R10: 000000a2559bce5c R11: ffff9fe87faa6944 R12:
> 0000002041447a1c
> [  138.779885] R13: ffffffffaa4ca6a0 R14: 0000000000000002 R15:
> 0000000000000000
> [  138.784243]  table_load+0x1c0/0x2e0 [dm_mod]
> [  138.791376]  cpuidle_enter+0x29/0x40
> [  138.796169]  ? retrieve_status+0x1e0/0x1e0 [dm_mod]
> [  138.804251]  do_idle+0x257/0x2a0
> [  138.808092]  ctl_ioctl+0x1ad/0x420 [dm_mod]
> [  138.812009]  cpu_startup_entry+0x19/0x20
> [  138.812011]  start_secondary+0x116/0x150
> [  138.817758]  dm_ctl_ioctl+0xa/0x10 [dm_mod]
> [  138.821159]  secondary_startup_64_no_verify+0xc2/0xcb
> [  138.821162] Modules linked in:
> [  138.824740]  __x64_sys_ioctl+0x81/0xc0
> [  138.831872]  dm_service_time scsi_debug rpcsec_gss_krb5 auth_rpcgss
> nfsv4 dns_resolver
> [  138.835366]  do_syscall_64+0x37/0x80
> [  138.842496]  nfs lockd grace fscache netfs rfkill sunrpc vfat
> [  138.847290]  entry_SYSCALL_64_after_hwframe+0x44/0xae
> [  138.849386]  fat dm_multipath intel_rapl_msr intel_rapl_common
> isst_if_common skx_edac x86_pkg_temp_thermal
> [  138.856519] RIP: 0033:0x7f473b4dd62b
> [  138.861650]  intel_powerclamp coretemp kvm_intel kvm mgag200
> i2c_algo_bit drm_kms_helper irqbypass
> [  138.864362] Code: 0f 1e fa 48 8b 05 5d b8 2c 00 64 c7 00 26 00 00 00 48
> c7 c0 ff ff ff ff c3 66 0f 1f 44 00 00 f3 0f 1e fa b8 10 00 00 00 0f 05
> <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 2d b8 2c 00 f7 d8 64 89 01 48
> [  138.869319]  iTCO_wdt crct10dif_pclmul iTCO_vendor_support crc32_pclmul
> ipmi_ssif syscopyarea sysfillrect
> [  138.871773] RSP: 002b:00007f473d28b428 EFLAGS: 00000206
> [  138.890517]  sysimgblt ghash_clmulni_intel fb_sys_fops rapl acpi_ipmi
> drm ipmi_si intel_cstate mei_me
> [  138.895745]  ORIG_RAX: 0000000000000010
> [  138.902874]  ipmi_devintf dax_pmem_compat intel_uncore mei i2c_i801
> pcspkr wmi_bmof nd_pmem
> [  138.907582] RAX: ffffffffffffffda RBX: 00007f473bf6b270 RCX:
> 00007f473b4dd62b
> [  138.914713]  device_dax lpc_ich ipmi_msghandler intel_pch_thermal
> i2c_smbus nd_btt dax_pmem_core acpi_power_meter
> [  138.921848] RDX: 00007f472401dbf0 RSI: 00000000c138fd09 RDI:
> 0000000000000005
> [  138.928978]  xfs libcrc32c sd_mod t10_pi sg ahci libahci libata nfit
> [  138.936111] RBP: 00007f473bfa6f83 R08: 00007f473d289180 R09:
> 00007f473c1b5058
> [  138.940373]  tg3 megaraid_sas crc32c_intel libnvdimm wmi dm_mirror
> dm_region_hash
> [  138.943956] R10: 0000000000000000 R11: 0000000000000206 R12:
> 00007f472401dbf0
> [  138.948834]  dm_log dm_mod
> [  138.948836] CR2: 0000000000000098
> [  138.948837] ---[ end trace 3235ef2b3e8a55ec ]---
> [  138.952068] R13: 00007f472401dca0 R14: 0000000000000000 R15:
> 00007f4724007530
> [  138.959447] RIP: 0010:wb_timer_fn+0x37/0x320
> [  138.961114] ---[ end trace 3235ef2b3e8a55ed ]---
> [  138.965030] Code: 48 8b 5f 60 4c 8b 67 50 8b ab 98 00 00 00 8b 93 b8 00
> 00 00 8b 83 d8 00 00 00 4c 8b 6b 28 01 d5 01 c5 48 8b 43 60 48 8b 40 78
> <4c> 8b b0 98 00 00 00 4d 85 ed 0f 84 c0 00 00 00 48 83 7b 30 00 0f
> [  138.965032] RSP: 0018:ffffb4d7c0300e80 EFLAGS: 00010246
> [  138.969219] kobject_add_internal failed for dm (error: -2 parent: dm-3)
> [  138.974269] RAX: 0000000000000000 RBX: ffff9fd941d74a00 RCX:
> 0000000000000020
> [  138.974271] RDX: 0000000000000000 RSI: 0000000000000000 RDI:
> ffff9fd9418cfd80
> [  138.974271] RBP: 0000000000000000 R08: 0000000000000000 R09:
> 0000000000000000
> [  138.974272] R10: 000000000000001f R11: 00000000000003a8 R12:
> ffff9fd9420a0cc0
> [  138.974273] R13: 0000000000000000 R14: ffff9fd9418cfd90 R15:
> ffff9fe87fa97f70
> [  138.975397] scsi 15:0:0:0: Direct-Access     Linux    scsi_debug
> 0190 PQ: 0 ANSI: 7
> [  138.975475] sd 15:0:0:0: Attached scsi generic sg1 type 0
> [  138.975496] sd 15:0:0:0: Power-on or device reset occurred
> [  138.976516] sd 15:0:0:0: [sdc] Read Capacity(16) failed: Result:
> hostbyte=DID_ERROR driverbyte=DRIVER_OK
> [  138.976518] sd 15:0:0:0: [sdc] Sense not available.
> [  138.976520] sd 15:0:0:0: [sdc] Read Capacity(10) failed: Result:
> hostbyte=DID_ERROR driverbyte=DRIVER_OK
> [  138.976521] sd 15:0:0:0: [sdc] Sense not available.
> [  138.976524] sd 15:0:0:0: [sdc] 0 512-byte logical blocks: (0 B/0 B)
> [  138.976525] sd 15:0:0:0: [sdc] 0-byte physical blocks
> [  138.976526] sd 15:0:0:0: [sdc] Write Protect is off
> [  138.976527] sd 15:0:0:0: [sdc] Mode Sense: 00 00 00 00
> [  138.976528] sd 15:0:0:0: [sdc] Asking for cache data failed
> [  138.976529] sd 15:0:0:0: [sdc] Assuming drive cache: write through
> [  138.977337] ------------[ cut here ]------------
> [  138.981081] FS:  0000000000000000(0000) GS:ffff9fe87fa80000(0000)
> knlGS:0000000000000000
> [  138.981082] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> [  138.981083] CR2: 0000000000000098 CR3: 000000010371e004 CR4:
> 00000000007706e0
> [  138.981084] DR0: 0000000000000000 DR1: 0000000000000000 DR2:
> 0000000000000000
> [  138.988993] WARNING: CPU: 1 PID: 1161 at block/genhd.c:564
> del_gendisk+0x186/0x1c0
> [  138.992562] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7:
> 0000000000000400
> [  138.992563] PKRU: 55555554
> [  138.992564] Kernel panic - not syncing: Fatal exception in interrupt
> [  139.493526] Kernel Offset: 0x27a00000 from 0xffffffff81000000
> (relocation range: 0xffffffff80000000-0xffffffffbfffffff)
> [  139.872338] ---[ end Kernel panic - not syncing: Fatal exception in
> interrupt ]---
> 
> 
> 
> -- 
> Best Regards,
>   Yi Zhang
> 
> 
> -- 
> Best Regards,
>   Yi Zhang

-- 
Ming


           reply	other threads:[~2021-09-15 11:26 UTC|newest]

Thread overview: expand[flat|nested]  mbox.gz  Atom feed
 [parent not found: <CAHj4cs8EUAET8X2LKHnG2yVwqkGuAV2Jb37W1z1pr7KK64yNfg@mail.gmail.com>]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=YUHYVtMXADA3aicd@T590 \
    --to=ming.lei@redhat.com \
    --cc=linux-block@vger.kernel.org \
    --cc=minlei@redhat.com \
    --cc=yi.zhang@redhat.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.