linux-rdma.vger.kernel.org archive mirror
* Issus with blktest/srp on 5.15-rc1 and rdma_rxe
@ 2021-09-16 19:36 ` Bernard Metzler
  2021-09-16 22:21   ` Robert Pearson
  2021-09-17  8:15   ` Re: Issus with blktest/srp on 5.15-rc1 and rdma_rxe Bernard Metzler
  0 siblings, 2 replies; 8+ messages in thread
From: Bernard Metzler @ 2021-09-16 19:36 UTC (permalink / raw)
  To: linux-rdma; +Cc: Bob Pearson, Jason Gunthorpe, Bart Van Assche

Hi,

if I run the complete srp test series from the blktests suite,
the dmesg log contains many rdma_rxe messages of type:
 
rdma_rxe: rxe_invalidate_mr: rkey (n) doesn't match mr->ibmr.rkey (n + 1)

where 'n' is the current key. I expect this is not intended
behavior.
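
To check how often the message occurs and whether the two keys really always differ by one, the log can be scanned with a short script. This is a minimal sketch, not part of blktests or rxe: the function names are hypothetical, the sample lines are made up for illustration, and the assumption that the keys are printed as hex or decimal integers is mine.

```python
import re

# Pattern for the message quoted above. How the keys are printed (hex or
# decimal) is an assumption, so int(..., 0) is used to accept both.
MISMATCH = re.compile(
    r"rxe_invalidate_mr: rkey \((\w+)\) doesn't match mr->ibmr\.rkey \((\w+)\)"
)

def rkey_mismatches(dmesg_text):
    """Return (requested_rkey, mr_rkey) pairs found in dmesg_text."""
    return [(int(a, 0), int(b, 0)) for a, b in MISMATCH.findall(dmesg_text)]

def off_by_one(pairs):
    """True if every MR key is exactly one ahead of the requested key."""
    return all(mr == req + 1 for req, mr in pairs)

# Hypothetical sample lines, made up for illustration only.
sample = (
    "rdma_rxe: rxe_invalidate_mr: rkey (0x1a03) doesn't match mr->ibmr.rkey (0x1a04)\n"
    "rdma_rxe: rxe_invalidate_mr: rkey (0x2b10) doesn't match mr->ibmr.rkey (0x2b11)\n"
)
pairs = rkey_mismatches(sample)
print(len(pairs), off_by_one(pairs))
```

In practice the input would be the output of dmesg captured after the srp test run.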

I am at commit 1b789bd4dbd48a92f5427d9c37a72a8f6ca17754



Furthermore, when running ./check -q srp/005, I sometimes get this:

[  308.903330] sd 11:0:0:1: [sde] Attached SCSI disk
[  308.917293] scsi 11:0:0:1: alua: Detached
[  308.918191] BUG: kernel NULL pointer dereference, address: 0000000000000000
[  308.918223] #PF: supervisor instruction fetch in kernel mode
[  308.918242] #PF: error_code(0x0010) - not-present page
[  308.918259] PGD 0 P4D 0 
[  308.918271] Oops: 0010 [#1] SMP PTI
[  308.918285] CPU: 1 PID: 4214 Comm: kworker/1:255 Not tainted 5.15.0-rc1+ #4
[  308.918309] Hardware name: To Be Filled By O.E.M. To Be Filled By O.E.M./Z77 Extreme6, BIOS P2.80 07/01/2013
[  308.918338] Workqueue: srp_remove srp_remove_work [ib_srp]
[  308.918362] RIP: 0010:0x0
[  308.918375] Code: Unable to access opcode bytes at RIP 0xffffffffffffffd6.
[  308.918397] RSP: 0018:ffffb6124b9a3b68 EFLAGS: 00010086
[  308.918414] RAX: 0000000000000001 RBX: ffffb6124b9a3ce0 RCX: 0000000000000000
[  308.918437] RDX: 0000000000000000 RSI: ffffb6124b9a3c50 RDI: ffff966063a27a00
[  308.918459] RBP: ffffb6124b9a3bb0 R08: ffff966067481c00 R09: ffffeb578489b808
[  308.918481] R10: ffff966043c0f200 R11: ffffb6124b9a3d00 R12: ffff966063a27a00
[  308.918503] R13: 0000000000000004 R14: 0000000000000000 R15: ffffb6124b9a3c50
[  308.918524] FS:  0000000000000000(0000) GS:ffff966157680000(0000) knlGS:0000000000000000
[  308.918550] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  308.918568] CR2: ffffffffffffffd6 CR3: 000000005060a004 CR4: 00000000001706e0
[  308.918590] Call Trace:
[  308.918601]  __ib_process_cq+0x89/0x150 [ib_core]
[  308.918640]  ib_process_cq_direct+0x30/0x50 [ib_core]
[  308.918669]  ? xas_store+0x331/0x640
[  308.918684]  ? free_unref_page_commit.isra.135+0x91/0x140
[  308.918705]  ? free_unref_page+0x6e/0xd0
[  308.918719]  ? __free_pages+0xa3/0xc0
[  308.918733]  ? kfree+0x32f/0x3b0
[  308.918748]  srp_destroy_qp+0x24/0x40 [ib_srp]
[  308.918767]  srp_free_ch_ib+0x77/0x180 [ib_srp]
[  308.918784]  srp_remove_work+0xde/0x1a0 [ib_srp]
[  308.918801]  process_one_work+0x1d0/0x380
[  308.918817]  worker_thread+0x37/0x390
[  308.918831]  ? process_one_work+0x380/0x380
[  308.918846]  kthread+0x12f/0x150
[  308.918859]  ? set_kthread_struct+0x40/0x40
[  308.918874]  ret_from_fork+0x22/0x30
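
The trace above reports a supervisor instruction fetch with RIP at 0x0, which is what an indirect call through a NULL function pointer looks like; the first frame not marked with '?' is __ib_process_cq. Which pointer was NULL cannot be told from the log alone. The following is a rough sketch for picking that signature out of a dmesg capture; the helper names are hypothetical and the excerpt is copied from the trace above.

```python
import re

# Leading "[  308.918191] " style timestamp as printed by dmesg.
TS = re.compile(r"^\[\s*\d+\.\d+\]\s?")

def strip_ts(line):
    """Drop the timestamp prefix and surrounding whitespace."""
    return TS.sub("", line).strip()

def looks_like_null_call(oops_text):
    """True if the oops is an instruction fetch with RIP at address 0."""
    lines = [strip_ts(l) for l in oops_text.splitlines()]
    fetch = any(l.startswith("#PF: supervisor instruction fetch") for l in lines)
    rip_zero = any(re.fullmatch(r"RIP: [0-9a-f]{4}:0x0", l) for l in lines)
    return fetch and rip_zero

def first_reliable_frame(oops_text):
    """Return the first call-trace symbol that is not marked with '?'."""
    in_trace = False
    for raw in oops_text.splitlines():
        line = strip_ts(raw)
        if line == "Call Trace:":
            in_trace = True
        elif in_trace and line and not line.startswith("?"):
            return line.split("+")[0]
    return None

# Excerpt copied from the srp/005 trace above.
excerpt = """\
[  308.918223] #PF: supervisor instruction fetch in kernel mode
[  308.918362] RIP: 0010:0x0
[  308.918590] Call Trace:
[  308.918601]  __ib_process_cq+0x89/0x150 [ib_core]
[  308.918640]  ib_process_cq_direct+0x30/0x50 [ib_core]
[  308.918669]  ? xas_store+0x331/0x640
"""
print(looks_like_null_call(excerpt), first_reliable_frame(excerpt))
```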


With ./check -q srp/008 I sometimes get something similar:

[ 1772.149274] sd 11:0:0:1: [sde] Attached SCSI disk
[ 1772.150096] scsi 11:0:0:2: alua: Detached
[ 1772.150184] blk_update_request: I/O error, dev sde, sector 8 op 0x0:(READ) flags 0x80700 phys_seg 1 prio class 0
[ 1772.151653] blk_update_request: I/O error, dev sde, sector 8 op 0x0:(READ) flags 0x0 phys_seg 1 prio class 0
[ 1772.153080] Buffer I/O error on dev sde, logical block 1, async page read
[ 1772.169139] scsi 11:0:0:1: alua: Detached
[ 1772.169446] BUG: kernel NULL pointer dereference, address: 0000000000000000
[ 1772.170881] #PF: supervisor instruction fetch in kernel mode
[ 1772.172297] #PF: error_code(0x0010) - not-present page
[ 1772.173751] PGD 0 P4D 0 
[ 1772.175165] Oops: 0010 [#1] SMP PTI
[ 1772.176575] CPU: 3 PID: 8654 Comm: kworker/3:60 Not tainted 5.15.0-rc1+ #4
[ 1772.177995] Hardware name: To Be Filled By O.E.M. To Be Filled By O.E.M./Z77 Extreme6, BIOS P2.80 07/01/2013
[ 1772.179430] Workqueue: srp_remove srp_remove_work [ib_srp]
[ 1772.180859] RIP: 0010:0x0
[ 1772.182276] Code: Unable to access opcode bytes at RIP 0xffffffffffffffd6.
[ 1772.183705] RSP: 0018:ffffa9710a2f7b68 EFLAGS: 00010086
[ 1772.185129] RAX: 0000000000000001 RBX: ffffa9710a2f7c50 RCX: 0000000000000000
[ 1772.186566] RDX: 0000000000000000 RSI: ffffa9710a2f7c08 RDI: ffff91a703825a00
[ 1772.187994] RBP: ffffa9710a2f7bb0 R08: ffff91a7184f8300 R09: 0000000000000000
[ 1772.189425] R10: ffff91a7869ce000 R11: ffffa9710a2f7d00 R12: ffff91a703825a00
[ 1772.190848] R13: 0000000000000002 R14: 0000000000000000 R15: ffffa9710a2f7c08
[ 1772.192270] FS:  0000000000000000(0000) GS:ffff91a817780000(0000) knlGS:0000000000000000
[ 1772.193689] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 1772.195118] CR2: ffffffffffffffd6 CR3: 0000000111458006 CR4: 00000000001706e0
[ 1772.196558] Call Trace:
[ 1772.197975]  __ib_process_cq+0x89/0x150 [ib_core]
[ 1772.199402]  ib_process_cq_direct+0x30/0x50 [ib_core]
[ 1772.200830]  ? put_cpu_partial+0x98/0xb0
[ 1772.202260]  ? __slab_free+0x226/0x3c0
[ 1772.203664]  ? __slab_free+0x226/0x3c0
[ 1772.205038]  ? xas_store+0x331/0x640
[ 1772.206404]  ? rxe_elem_release+0x4f/0x60 [rdma_rxe]
[ 1772.207763]  ? kfree+0x372/0x3b0
[ 1772.209101]  ? srp_destroy_fr_pool+0x43/0x50 [ib_srp]
[ 1772.210439]  srp_destroy_qp+0x24/0x40 [ib_srp]
[ 1772.211759]  srp_free_ch_ib+0x77/0x180 [ib_srp]
[ 1772.213093]  srp_remove_work+0xde/0x1a0 [ib_srp]
[ 1772.214417]  process_one_work+0x1d0/0x380
[ 1772.215756]  worker_thread+0x37/0x390
[ 1772.217082]  ? process_one_work+0x380/0x380
[ 1772.218422]  kthread+0x12f/0x150
[ 1772.219744]  ? set_kthread_struct+0x40/0x40
[ 1772.221059]  ret_from_fork+0x22/0x30


Many thanks,
Bernard.




* Re: Issus with blktest/srp on 5.15-rc1 and rdma_rxe
  2021-09-16 19:36 ` Issus with blktest/srp on 5.15-rc1 and rdma_rxe Bernard Metzler
@ 2021-09-16 22:21   ` Robert Pearson
  2021-09-17  8:23     ` Bernard Metzler
  2021-09-17  8:15   ` Re: Issus with blktest/srp on 5.15-rc1 and rdma_rxe Bernard Metzler
  1 sibling, 1 reply; 8+ messages in thread
From: Robert Pearson @ 2021-09-16 22:21 UTC (permalink / raw)
  To: Bernard Metzler; +Cc: linux-rdma, Jason Gunthorpe, Bart Van Assche

Bernard,
That would indicate that you have not applied the patch series
"RDMA/rxe: Fix various bugs", which fixes the rkey mismatch bug in rxe.
I do not know how to get blktests to select the siw device instead of the
rxe device, but Bart does.

Bob



* Re: Re: Issus with blktest/srp on 5.15-rc1 and rdma_rxe
  2021-09-16 19:36 ` Issus with blktest/srp on 5.15-rc1 and rdma_rxe Bernard Metzler
  2021-09-16 22:21   ` Robert Pearson
@ 2021-09-17  8:15   ` Bernard Metzler
  1 sibling, 0 replies; 8+ messages in thread
From: Bernard Metzler @ 2021-09-17  8:15 UTC (permalink / raw)
  To: Robert Pearson; +Cc: linux-rdma, Jason Gunthorpe, Bart Van Assche

-----"Robert Pearson" <rpearsonhpe@gmail.com> wrote: -----

>To: "Bernard Metzler" <BMT@zurich.ibm.com>
>From: "Robert Pearson" <rpearsonhpe@gmail.com>
>Date: 09/17/2021 12:21AM
>Cc: "linux-rdma" <linux-rdma@vger.kernel.org>, "Jason Gunthorpe"
><jgg@nvidia.com>, "Bart Van Assche" <bvanassche@acm.org>
>Subject: [EXTERNAL] Re: Issus with blktest/srp on 5.15-rc1 and
>rdma_rxe
>
>Bernard,
>That would indicate that you have not applied the patch series
>RDMA/rxe: Fix various bugs which fixes the rkey not match rxe bug.


Hi Bob, oh, sorry, I obviously missed that one!

So I expect the Oops will disappear with that as well,
will check later.



Many thanks,
Bernard.

>I do not know how to get it to select the siw device instead of the
>rxe
>device but Bart does.
>
>Bob


* Re: Re: Issus with blktest/srp on 5.15-rc1 and rdma_rxe
  2021-09-16 22:21   ` Robert Pearson
@ 2021-09-17  8:23     ` Bernard Metzler
  2021-09-18  2:56       ` Bart Van Assche
  2021-09-21 20:08       ` Still issues with blktest/srp on 5.15-rc1 and software rdma providers Bernard Metzler
  0 siblings, 2 replies; 8+ messages in thread
From: Bernard Metzler @ 2021-09-17  8:23 UTC (permalink / raw)
  To: Yi Zhang; +Cc: Robert Pearson, linux-rdma, Jason Gunthorpe, Bart Van Assche

-----"Yi Zhang" <yi.zhang@redhat.com> wrote: -----

>To: "Robert Pearson" <rpearsonhpe@gmail.com>
>From: "Yi Zhang" <yi.zhang@redhat.com>
>Date: 09/17/2021 09:29AM
>Cc: "Bernard Metzler" <BMT@zurich.ibm.com>, "linux-rdma"
><linux-rdma@vger.kernel.org>, "Jason Gunthorpe" <jgg@nvidia.com>,
>"Bart Van Assche" <bvanassche@acm.org>
>Subject: [EXTERNAL] Re: Issus with blktest/srp on 5.15-rc1 and
>rdma_rxe
>
>On Fri, Sep 17, 2021 at 6:21 AM Robert Pearson
><rpearsonhpe@gmail.com> wrote:
>Bernard,
> That would indicate that you have not applied the patch series
> RDMA/rxe: Fix various bugs which fixes the rkey not match rxe bug.
> I do not know how to get it to select the siw device instead of the
>rxe
> device but Bart does.
> 
>
>Just try use_siw=1 ./check -q srp/005

srp/015 seems to be dedicated to siw testing. It selects siw if available.
I think this is how Bart found it.
Unfortunately, for some reason I am not aware of, the other tests
default to rxe only. Maybe the helper, at least, should
mention this hidden option.
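
For scripted runs, the option can be passed through the environment in the same way as on the command line quoted above. This is only a sketch: the use_siw variable and the ./check -q invocation come from this thread, while the function names and the blktests_dir parameter are hypothetical.

```python
import os
import subprocess

def blktests_invocation(test, use_siw=False, base_env=None):
    """Build the argv and environment for one blktests run.

    With use_siw=True the environment carries use_siw=1; otherwise the
    variable is removed so that the tests fall back to their default (rxe).
    """
    env = dict(os.environ if base_env is None else base_env)
    if use_siw:
        env["use_siw"] = "1"
    else:
        env.pop("use_siw", None)
    return ["./check", "-q", test], env

def run_blktest(test, use_siw=False, blktests_dir="."):
    """Run one test from a blktests checkout (assumed path) and return its exit code."""
    argv, env = blktests_invocation(test, use_siw)
    return subprocess.run(argv, cwd=blktests_dir, env=env).returncode

# Show what would be executed, without actually running the test.
argv, env = blktests_invocation("srp/005", use_siw=True, base_env={})
print(" ".join(f"{k}={v}" for k, v in env.items()), " ".join(argv))
```

Calling run_blktest("srp/005", use_siw=True) from the top of a blktests checkout would then correspond to use_siw=1 ./check -q srp/005.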

Thanks,
Bernard.

>  Bob
> 
>
>-- 
>Best Regards,
>  Yi Zhang  


* Re: Issus with blktest/srp on 5.15-rc1 and rdma_rxe
  2021-09-17  8:23     ` Bernard Metzler
@ 2021-09-18  2:56       ` Bart Van Assche
  2021-09-18 12:15         ` Yi Zhang
  2021-09-21 20:08       ` Still issues with blktest/srp on 5.15-rc1 and software rdma providers Bernard Metzler
  1 sibling, 1 reply; 8+ messages in thread
From: Bart Van Assche @ 2021-09-18  2:56 UTC (permalink / raw)
  To: Bernard Metzler, Yi Zhang; +Cc: Robert Pearson, linux-rdma, Jason Gunthorpe

On 9/17/21 01:23, Bernard Metzler wrote:
> -----"Yi Zhang" <yi.zhang@redhat.com> wrote: -----
>> Just try use_siw=1 ./check -q srp/005
> 
> srp/015 seem to be dedicated to siw testing. It selects siw if available.
> I think this is how Bart found it.
> Unfortunately, for some reason I am not aware of, testing defaults to
> rxe only for the other tests. Maybe at least the helper should
> talk about this hidden option.

Originally only test srp/015 selected siw. Yi Zhang added support for
running all SRP tests with the siw driver. See also blktests commit
d23c3aa0c1c0 ("common/multipath-over-rdma: allow to set use_siw"). How
about submitting a documentation patch to the blktests project?

Thanks,

Bart.


* Re: Issus with blktest/srp on 5.15-rc1 and rdma_rxe
  2021-09-18  2:56       ` Bart Van Assche
@ 2021-09-18 12:15         ` Yi Zhang
  0 siblings, 0 replies; 8+ messages in thread
From: Yi Zhang @ 2021-09-18 12:15 UTC (permalink / raw)
  To: Bart Van Assche
  Cc: Bernard Metzler, Robert Pearson, linux-rdma, Jason Gunthorpe

On Sat, Sep 18, 2021 at 10:56 AM Bart Van Assche <bvanassche@acm.org> wrote:
>
> On 9/17/21 01:23, Bernard Metzler wrote:
> > -----"Yi Zhang" <yi.zhang@redhat.com> wrote: -----
> >> Just try use_siw=1 ./check -q srp/005
> >
> > srp/015 seem to be dedicated to siw testing. It selects siw if available.
> > I think this is how Bart found it.
> > Unfortunately, for some reason I am not aware of, testing defaults to
> > rxe only for the other tests. Maybe at least the helper should
> > talk about this hidden option.
>
> Originally only test srp/015 selected siw. Yi Zhang added support for
> running all SRP tests with the siw driver. See also blktests commit
> d23c3aa0c1c0 ("common/multipath-over-rdma: allow to set use_siw"). How
> about submitting a documentation patch to the blktests project?

Agreed, I just drafted it and created a PR for it.

>
> Thanks,
>
> Bart.
>


-- 
Best Regards,
  Yi Zhang



* Still issues with blktest/srp on 5.15-rc1 and software rdma providers
  2021-09-17  8:23     ` Bernard Metzler
  2021-09-18  2:56       ` Bart Van Assche
@ 2021-09-21 20:08       ` Bernard Metzler
  2021-09-21 20:16         ` Bart Van Assche
  1 sibling, 1 reply; 8+ messages in thread
From: Bernard Metzler @ 2021-09-21 20:08 UTC (permalink / raw)
  To: linux-rdma; +Cc: Yi Zhang, Robert Pearson, Jason Gunthorpe, Bart Van Assche

Hi,
 
I investigated the srp blktests further with the software rdma
drivers and I am still running into issues. These do not seem
to be specific to either the rxe or the siw driver; they happen
occasionally with both. Can the blktests tool also run these
tests on hardware rdma drivers?


First, I see some WARNINGs that relate to resources that were not
created, or that could not be destroyed (maybe because they were
never created):

...

[ 1437.197989] sd 11:0:0:1: [sde] Attached SCSI disk
[ 1437.845266] ------------[ cut here ]------------
[ 1437.845269] WARNING: CPU: 3 PID: 26257 at block/genhd.c:537 device_add_disk+0x1cb/0x3b0
...
[ 1437.845360] Call Trace:
[ 1437.845363]  dm_setup_md_queue+0xc8/0x100
[ 1437.845368]  table_load+0x1be/0x2d0
[ 1437.845371]  ctl_ioctl+0x1d6/0x4c0
[ 1437.845373]  ? retrieve_status+0x1d0/0x1d0
[ 1437.845377]  dm_ctl_ioctl+0xe/0x20
[ 1437.845379]  __x64_sys_ioctl+0x118/0x910
[ 1437.845384]  ? switch_fpu_return+0x56/0xc0
[ 1437.845388]  do_syscall_64+0x3a/0x80
[ 1437.845391]  entry_SYSCALL_64_after_hwframe+0x44/0xae
[ 1437.845395] RIP: 0033:0x7f81419dbb97
[ 1437.845398] Code: 00 00 90 48 8b 05 09 73 2c 00 64 c7 00 26 00 00 00 48 c7 c0 ff ff ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 b8 10 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d d9 72 2c 00 f7 d8 64 89 01 48
[ 1437.845400] RSP: 002b:00007f814363b508 EFLAGS: 00000202 ORIG_RAX: 0000000000000010
[ 1437.845402] RAX: ffffffffffffffda RBX: 00007f81423b8d60 RCX: 00007f81419dbb97
[ 1437.845403] RDX: 00007f812c026c30 RSI: 00000000c138fd09 RDI: 0000000000000009
[ 1437.845403] RBP: 00007f81423f38b3 R08: 00007f8143639260 R09: 00007f81426018f8
[ 1437.845404] R10: 0000000000000000 R11: 0000000000000202 R12: 00007f812c026c30
[ 1437.845405] R13: 0000000000000000 R14: 00007f812c026ce0 R15: 00007f812c00adc0
[ 1437.845407] ---[ end trace c416dea93915334e ]---





...

[ 1437.845411] kobject_add_internal failed for dm (error: -2 parent: dm-2)
[ 1437.845451] ------------[ cut here ]------------
[ 1437.845451] WARNING: CPU: 3 PID: 26257 at block/genhd.c:564 del_gendisk+0x1a4/0x1e0
...
[ 1437.845516] Call Trace:
[ 1437.845517]  dm_setup_md_queue+0xef/0x100
[ 1437.845520]  table_load+0x1be/0x2d0
[ 1437.845522]  ctl_ioctl+0x1d6/0x4c0
[ 1437.845523]  ? retrieve_status+0x1d0/0x1d0
[ 1437.845527]  dm_ctl_ioctl+0xe/0x20
[ 1437.845528]  __x64_sys_ioctl+0x118/0x910
[ 1437.845531]  ? switch_fpu_return+0x56/0xc0
[ 1437.845533]  do_syscall_64+0x3a/0x80
[ 1437.845535]  entry_SYSCALL_64_after_hwframe+0x44/0xae
[ 1437.845537] RIP: 0033:0x7f81419dbb97
[ 1437.845538] Code: 00 00 90 48 8b 05 09 73 2c 00 64 c7 00 26 00 00 00 48 c7 c0 ff ff ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 b8 10 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d d9 72 2c 00 f7 d8 64 89 01 48
[ 1437.845540] RSP: 002b:00007f814363b508 EFLAGS: 00000202 ORIG_RAX: 0000000000000010
[ 1437.845542] RAX: ffffffffffffffda RBX: 00007f81423b8d60 RCX: 00007f81419dbb97
[ 1437.845543] RDX: 00007f812c026c30 RSI: 00000000c138fd09 RDI: 0000000000000009
[ 1437.845544] RBP: 00007f81423f38b3 R08: 00007f8143639260 R09: 00007f81426018f8
[ 1437.845545] R10: 0000000000000000 R11: 0000000000000202 R12: 00007f812c026c30
[ 1437.845546] R13: 0000000000000000 R14: 00007f812c026ce0 R15: 00007f812c00adc0
[ 1437.845547] ---[ end trace c416dea93915334f ]---



...
[ 1437.845552] ------------[ cut here ]------------
[ 1437.845553] kernfs: can not remove 'sdc', no directory
[ 1437.845557] WARNING: CPU: 3 PID: 26257 at fs/kernfs/dir.c:1524 kernfs_remove_by_name_ns+0x88/0xa0
[ 1437.845562] Modules linked in:
...
[ 1437.845619] Call Trace:
[ 1437.845620]  sysfs_remove_link+0x19/0x30
[ 1437.845623]  bd_unlink_disk_holder+0x6d/0xd0
[ 1437.845627]  dm_put_table_device+0x62/0xe0
[ 1437.845629]  dm_put_device+0x88/0xe0
[ 1437.845631]  ? dm_put_path_selector+0x40/0x50 [dm_multipath]
[ 1437.845635]  free_priority_group+0x8e/0xc0 [dm_multipath]
[ 1437.845638]  free_multipath+0x78/0xb0 [dm_multipath]
[ 1437.845640]  multipath_dtr+0x2a/0x30 [dm_multipath]
[ 1437.845642]  dm_table_destroy+0x67/0x130
[ 1437.845645]  table_load+0x110/0x2d0
[ 1437.845647]  ctl_ioctl+0x1d6/0x4c0
[ 1437.845648]  ? retrieve_status+0x1d0/0x1d0
[ 1437.845651]  dm_ctl_ioctl+0xe/0x20
[ 1437.845653]  __x64_sys_ioctl+0x118/0x910
[ 1437.845655]  ? switch_fpu_return+0x56/0xc0
[ 1437.845657]  do_syscall_64+0x3a/0x80
[ 1437.845659]  entry_SYSCALL_64_after_hwframe+0x44/0xae
[ 1437.845662] RIP: 0033:0x7f81419dbb97
[ 1437.845663] Code: 00 00 90 48 8b 05 09 73 2c 00 64 c7 00 26 00 00 00 48 c7 c0 ff ff ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 b8 10 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d d9 72 2c 00 f7 d8 64 89 01 48
[ 1437.845664] RSP: 002b:00007f814363b508 EFLAGS: 00000202 ORIG_RAX: 0000000000000010
[ 1437.845665] RAX: ffffffffffffffda RBX: 00007f81423b8d60 RCX: 00007f81419dbb97
[ 1437.845666] RDX: 00007f812c026c30 RSI: 00000000c138fd09 RDI: 0000000000000009
[ 1437.845667] RBP: 00007f81423f38b3 R08: 00007f8143639260 R09: 00007f81426018f8
[ 1437.845668] R10: 0000000000000000 R11: 0000000000000202 R12: 00007f812c026c30
[ 1437.845669] R13: 0000000000000000 R14: 00007f812c026ce0 R15: 00007f812c00adc0
[ 1437.845670] ---[ end trace c416dea939153350 ]---



and finally an Oops in scsi_mq_exit_request, called from blk_mq_free_rqs:

[ 1438.976875] scsi 11:0:0:1: alua: Detached
[ 1438.980927] BUG: unable to handle page fault for address: ffffffffc0d83160
[ 1438.980960] #PF: supervisor read access in kernel mode
[ 1438.980978] #PF: error_code(0x0000) - not-present page
[ 1438.980995] PGD 15f60e067 P4D 15f60e067 PUD 15f610067 PMD 1bc2e3067 PTE 0
[ 1438.981019] Oops: 0000 [#1] SMP PTI
[ 1438.981033] CPU: 3 PID: 26257 Comm: multipathd Tainted: G        W         5.15.0-rc1+ #1
[ 1438.981059] Hardware name: To Be Filled By O.E.M. To Be Filled By O.E.M./Z77 Extreme6, BIOS P2.80 07/01/2013
[ 1438.981088] RIP: 0010:scsi_mq_exit_request+0x18/0x50
[ 1438.981107] Code: 00 00 e8 5b 14 76 00 5d c3 e8 e4 cb e1 ff 5d c3 66 90 0f 1f 44 00 00 55 48 89 e5 53 48 8b 7f 60 48 89 f3 48 8b 87 98 00 00 00 <48> 8b 40 40 48 85 c0 74 0c 48 8d b6 10 01 00 00 e8 23 14 76 00 48
[ 1438.981160] RSP: 0018:ffffa289c0447b38 EFLAGS: 00010286
[ 1438.981178] RAX: ffffffffc0d83120 RBX: ffff975354360000 RCX: 0000000000000000
[ 1438.981201] RDX: 0000000000000000 RSI: ffff975354360000 RDI: ffff97534cfd1000
[ 1438.981223] RBP: ffffa289c0447b40 R08: 0000000000009c6b R09: 0000000000009c6b
[ 1438.981245] R10: 0000000000000002 R11: 0000000000000000 R12: 0000000000000000
[ 1438.981266] R13: ffff97534a34a240 R14: 0000000000000000 R15: 0000000000000000
[ 1438.981288] FS:  00007f814363d700(0000) GS:ffff975357780000(0000) knlGS:0000000000000000
[ 1438.981313] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 1438.981331] CR2: ffffffffc0d83160 CR3: 00000001b7fcc006 CR4: 00000000001706e0
[ 1438.981354] Call Trace:
[ 1438.981365]  blk_mq_free_rqs+0x5f/0x1b0
[ 1438.981381]  blk_mq_free_map_and_requests+0x37/0x70
[ 1438.981398]  blk_mq_free_tag_set+0x27/0x90
[ 1438.981413]  scsi_mq_destroy_tags+0x15/0x20
[ 1438.981429]  scsi_host_dev_release+0x8b/0xf0
[ 1438.981445]  device_release+0x38/0x90
[ 1438.981459]  kobject_put+0x87/0x190
[ 1438.981475]  put_device+0x13/0x20
[ 1438.981488]  scsi_target_dev_release+0x1f/0x30
[ 1438.981504]  device_release+0x38/0x90
[ 1438.981518]  kobject_put+0x87/0x190
[ 1438.981532]  put_device+0x13/0x20
[ 1438.981544]  scsi_device_dev_release_usercontext+0x2a0/0x2b0
[ 1438.981565]  execute_in_process_context+0x25/0x70
[ 1438.981583]  scsi_device_dev_release+0x1c/0x20
[ 1438.981600]  device_release+0x38/0x90
[ 1438.981613]  kobject_put+0x87/0x190
[ 1438.981627]  put_device+0x13/0x20
[ 1438.981639]  scsi_device_put+0x2c/0x30
[ 1438.981653]  scsi_disk_put+0x30/0x50
[ 1438.981668]  sd_release+0x37/0xb0
[ 1438.981681]  blkdev_put_whole+0x30/0x50
[ 1438.981696]  blkdev_put+0x92/0x150
[ 1438.981710]  blkdev_close+0x27/0x30
[ 1438.981723]  __fput+0x8b/0x240
[ 1438.981736]  ____fput+0xe/0x10
[ 1438.981748]  task_work_run+0x74/0xb0
[ 1438.981762]  exit_to_user_mode_prepare+0x14e/0x150
[ 1438.981782]  syscall_exit_to_user_mode+0x16/0x30
[ 1438.981799]  do_syscall_64+0x46/0x80
[ 1438.981813]  entry_SYSCALL_64_after_hwframe+0x44/0xae
[ 1438.981831] RIP: 0033:0x7f8142613c47
[ 1438.981845] Code: 00 00 0f 05 48 3d 00 f0 ff ff 77 3f c3 66 0f 1f 44 00 00 53 89 fb 48 83 ec 10 e8 c4 fb ff ff 89 df 89 c2 b8 03 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 2b 89 d7 89 44 24 0c e8 06 fc ff ff 8b 44 24
[ 1438.981897] RSP: 002b:00007f814363b840 EFLAGS: 00000293 ORIG_RAX: 0000000000000003
[ 1438.981920] RAX: 0000000000000000 RBX: 000000000000000a RCX: 00007f8142613c47
[ 1438.981942] RDX: 0000000000000000 RSI: 0000000000000001 RDI: 000000000000000a
[ 1438.981964] RBP: 0000000000000008 R08: 0000000000000001 R09: 0000000000000007
[ 1438.981986] R10: 0000000000000000 R11: 0000000000000293 R12: 0000564949b25700
[ 1438.982007] R13: 00007f81432a1ccf R14: 00007f812c02c710 R15: 00007f812c02c710
[ 1438.983180] Modules linked in: ib_srpt target_core_iblock target_core_mod scsi_debug rdma_rxe ip6_udp_tunnel udp_tunnel null_blk dm_service_time configs bridge stp llc nf_nat_ftp nf_conntrack_ftp xt_CT ip6t_rpfilter ip6t_REJECT nf_reject_ipv6 xt_conntrack ib_iser ip_set nfnetlink libiscsi ebtable_nat ebtable_broute scsi_transport_iscsi ip6table_mangle ip6table_raw ip6table_security iptable_mangle iptable_raw iptable_security ebtable_filter ebtables ip6table_filter ip6table_nat ip6_tables iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 rpcrdma sunrpc ib_ipoib rdma_ucm ib_umad dm_multipath scsi_dh_rdac scsi_dh_emc scsi_dh_alua iw_cxgb4 libcxgb intel_rapl_msr intel_rapl_common ib_uverbs x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel rdma_cm iw_cm kvm ib_cm ib_core snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_codec_generic ledtrig_audio snd_hda_intel snd_intel_dspcfg irqbypass snd_hda_codec crc32_pclmul rapl snd_hwdep snd_hda_core intel_cstate intel_uncore
[ 1438.983224]  snd_pcm snd_timer iTCO_wdt mei_me snd iTCO_vendor_support mxm_wmi mei soundcore i2c_i801 i2c_smbus lpc_ich wmi xfs i915 i2c_algo_bit ttm drm_kms_helper firewire_ohci firewire_core syscopyarea sysfillrect cxgb4 crc_itu_t sysimgblt fb_sys_fops tg3 drm ptp crc32c_intel csiostor scsi_transport_fc pps_core video [last unloaded: scsi_transport_srp]
[ 1438.992637] CR2: ffffffffc0d83160
[ 1438.994057] ---[ end trace c416dea939153351 ]---
[ 1438.995476] RIP: 0010:scsi_mq_exit_request+0x18/0x50
[ 1438.996905] Code: 00 00 e8 5b 14 76 00 5d c3 e8 e4 cb e1 ff 5d c3 66 90 0f 1f 44 00 00 55 48 89 e5 53 48 8b 7f 60 48 89 f3 48 8b 87 98 00 00 00 <48> 8b 40 40 48 85 c0 74 0c 48 8d b6 10 01 00 00 e8 23 14 76 00 48
[ 1438.998414] RSP: 0018:ffffa289c0447b38 EFLAGS: 00010286
[ 1438.999954] RAX: ffffffffc0d83120 RBX: ffff975354360000 RCX: 0000000000000000
[ 1439.001513] RDX: 0000000000000000 RSI: ffff975354360000 RDI: ffff97534cfd1000
[ 1439.003079] RBP: ffffa289c0447b40 R08: 0000000000009c6b R09: 0000000000009c6b
[ 1439.004652] R10: 0000000000000002 R11: 0000000000000000 R12: 0000000000000000
[ 1439.006218] R13: ffff97534a34a240 R14: 0000000000000000 R15: 0000000000000000
[ 1439.007777] FS:  00007f814363d700(0000) GS:ffff975357780000(0000) knlGS:0000000000000000
[ 1439.009340] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 1439.010906] CR2: ffffffffc0d83160 CR3: 00000001b7fcc006 CR4: 00000000001706e0



Thanks,
Bernard.

* Re: Still issues with blktest/srp on 5.15-rc1 and software rdma providers
  2021-09-21 20:08       ` Still issues with blktest/srp on 5.15-rc1 and software rdma providers Bernard Metzler
@ 2021-09-21 20:16         ` Bart Van Assche
  0 siblings, 0 replies; 8+ messages in thread
From: Bart Van Assche @ 2021-09-21 20:16 UTC (permalink / raw)
  To: Bernard Metzler, linux-rdma
  Cc: Yi Zhang, Robert Pearson, Jason Gunthorpe, linux-block

On 9/21/21 1:08 PM, Bernard Metzler wrote:
> I investigated the srp blktests further with the software rdma
> drivers and I am still running into issues. These do not seem
> to be specific to the rxe or siw driver; they occur
> occasionally with both. Can the srp tests also be run against
> hardware rdma drivers with the blktests tool?
> 
> [...]

(+linux-block)

Hi Bernard,

If I remember correctly, all tests from the blktests suite pass on my test
setup with kernel v5.13. I think the above call traces are regressions that
were introduced in the block layer during the kernel v5.15 merge window.

Bart.


Thread overview: 8+ messages
     [not found] <CAHj4cs9Rzte5zbgy7o158m7JA8dbSEpxy5oR-+K0NQCK1gxG=Q@mail.gmail.com>
2021-09-16 19:36 ` Issus with blktest/srp on 5.15-rc1 and rdma_rxe Bernard Metzler
2021-09-16 22:21   ` Robert Pearson
2021-09-17  8:23     ` Bernard Metzler
2021-09-18  2:56       ` Bart Van Assche
2021-09-18 12:15         ` Yi Zhang
2021-09-21 20:08       ` Still issues with blktest/srp on 5.15-rc1 and software rdma providers Bernard Metzler
2021-09-21 20:16         ` Bart Van Assche
2021-09-17  8:15   ` Re: Issus with blktest/srp on 5.15-rc1 and rdma_rxe Bernard Metzler
