From: "Pearson, Robert B" <robert.pearson2@hpe.com>
To: Bart Van Assche <bvanassche@acm.org>,
Bob Pearson <rpearsonhpe@gmail.com>,
"jgg@nvidia.com" <jgg@nvidia.com>,
"zyjzyj2000@gmail.com" <zyjzyj2000@gmail.com>,
"linux-rdma@vger.kernel.org" <linux-rdma@vger.kernel.org>,
"mie@igel.co.jp" <mie@igel.co.jp>
Subject: RE: [PATCH for-rc v3 0/6] RDMA/rxe: Various bug fixes.
Date: Fri, 10 Sep 2021 19:38:31 +0000
Message-ID: <CS1PR8401MB10777EEC9CF95C00D1BA62ABBCD69@CS1PR8401MB1077.NAMPRD84.PROD.OUTLOOK.COM>
In-Reply-To: <f0d96a3c-d49d-651d-93e0-a33a5eca9f1b@acm.org>
Bart,
I was able to run this test case, but it does not fail here. On my system it passes in about 1 second.
I have several questions about your system setup.
1. Which rdma-core are you running? The out-of-box version or the GitHub tree?
2. Can you run ib_send_bw? What about the Python test suite in rdma-core?
3. Where did you get the kernel bits? Which git tree? Which branch?
Thanks,
Bob Pearson
-----Original Message-----
From: Bart Van Assche <bvanassche@acm.org>
Sent: Thursday, September 9, 2021 4:52 PM
To: Bob Pearson <rpearsonhpe@gmail.com>; jgg@nvidia.com; zyjzyj2000@gmail.com; linux-rdma@vger.kernel.org; mie@igel.co.jp
Subject: Re: [PATCH for-rc v3 0/6] RDMA/rxe: Various bug fixes.
On 9/9/21 1:44 PM, Bob Pearson wrote:
> This series of patches implements several bug fixes and minor cleanups
> of the rxe driver. Specifically these fix a bug exposed by blktest.
>
> They apply cleanly to both
> commit 2169b908894df2ce83e7eb4a399d3224b2635126 (origin/for-rc, for-rc)
> commit 6a217437f9f5482a3f6f2dc5fcd27cf0f62409ac (HEAD -> for-next,
>   origin/wip/jgg-for-next, origin/for-next, origin/HEAD)
>
> These are being resubmitted to for-rc instead of for-next.
Hi Bob,
Thanks for having rebased and reposted this patch series. I have applied this series on top of commit 2169b908894d ("IB/hfi1: make hist static").
A kernel bug was triggered while running test srp/001. I have attached the kernel configuration used in my test to this email.
Thanks,
Bart.
ib_srpt Received SRP_LOGIN_REQ with i_port_id fe80:0000:0000:0000:5054:00ff:fe86:7464, t_port_id 5054:00ff:fe86:7464:5054:00ff:fe86:7464 and it_iu_len 8260 on port 1 (guid=fe80:0000:0000:0000:5054:00ff:fe86:7464); pkey 0xffff
BUG: unable to handle page fault for address: ffffc900e357d614
#PF: supervisor read access in kernel mode
#PF: error_code(0x0000) - not-present page
PGD 100000067 P4D 100000067 PUD 0
Oops: 0000 [#1] PREEMPT SMP KASAN
CPU: 26 PID: 148 Comm: ksoftirqd/26 Tainted: G E 5.14.0-rc6-dbg+ #2
Hardware name: QEMU Standard PC (Q35 + ICH9, 2009), BIOS 1.14.0-2 04/01/2014
RIP: 0010:rxe_completer+0x96d/0x1050 [rdma_rxe]
Code: e0 49 8b 44 24 08 44 89 e9 41 d3 e6 4e 8d a4 30 80 01 00 00 4d 85 e4 0f 84 f9 00 00 00 49 8d bc 24 94 00 00 00 e8 73 a8 b1 e0 <41> 8b 84 24 94 00 00 00 85 c0 0f 84 df 00 00 00 83 f8 03 0f 84 bf
RSP: 0018:ffff8881014075f8 EFLAGS: 00010246
RAX: 0000000000000000 RBX: ffff88813c67c000 RCX: dffffc0000000000
RDX: 0000000000000007 RSI: ffffffff826920c0 RDI: ffffc900e357d614
RBP: ffff8881014076e8 R08: ffffffffa09b228d R09: ffff88813c67c57b
R10: ffffed10278cf8af R11: 0000000000000000 R12: ffffc900e357d580
R13: 000000000000000a R14: 00000000d9c99400 R15: ffff8881515ddd08
FS: 0000000000000000(0000) GS:ffff88842d100000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: ffffc900e357d614 CR3: 0000000002e29005 CR4: 0000000000770ee0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
PKRU: 55555554
Call Trace:
rxe_do_task+0xdd/0x160 [rdma_rxe]
rxe_run_task+0x67/0x80 [rdma_rxe]
rxe_comp_queue_pkt+0x75/0x80 [rdma_rxe]
rxe_rcv+0x345/0x480 [rdma_rxe]
rxe_xmit_packet+0x1af/0x300 [rdma_rxe]
send_ack.isra.0+0x88/0xd0 [rdma_rxe]
rxe_responder+0xf4c/0x15e0 [rdma_rxe]
rxe_do_task+0xdd/0x160 [rdma_rxe]
rxe_run_task+0x67/0x80 [rdma_rxe]
rxe_resp_queue_pkt+0x5a/0x60 [rdma_rxe]
rxe_rcv+0x370/0x480 [rdma_rxe]
rxe_xmit_packet+0x1af/0x300 [rdma_rxe]
rxe_requester+0x4f4/0xe80 [rdma_rxe]
rxe_do_task+0xdd/0x160 [rdma_rxe]
tasklet_action_common.constprop.0+0x168/0x1b0
tasklet_action+0x44/0x60
__do_softirq+0x1db/0x6ed
run_ksoftirqd+0x37/0x60
smpboot_thread_fn+0x302/0x410
kthread+0x1f6/0x220
ret_from_fork+0x1f/0x30
Modules linked in: ib_srp(E) scsi_transport_srp(E) target_core_user(E) uio(E) target_core_pscsi(E) target_core_file(E) ib_srpt(E) target_core_iblock(E) target_core_mod(E) ib_umad(E) rdma_ucm(E) ib_iser(E) libiscsi(E) scsi_transport_iscsi(E) rdma_cm(E) iw_cm(E)
scsi_debug(E) ib_cm(E) rdma_rxe(E) ip6_udp_tunnel(E) udp_tunnel(E) ib_uverbs(E) null_blk(E) ib_core(E) brd(E) af_packet(E) nft_fib_inet(E) nft_fib_ipv4(E) nft_fib_ipv6(E) nft_fib(E) nft_reject_inet(E) nf_reject_ipv4(E) nf_reject_ipv6(E) nft_reject(E) nft_ct(E)
nft_chain_nat(E) nf_tables(E) ebtable_nat(E) iTCO_wdt(E) watchdog(E) ebtable_broute(E) intel_rapl_msr(E) intel_pmc_bxt(E) ip6table_nat(E) ip6table_mangle(E) ip6table_raw(E) iptable_nat(E) nf_nat(E) nf_conntrack(E) nf_defrag_ipv6(E) nf_defrag_ipv4(E) libcrc32c(E)
iptable_mangle(E) iptable_raw(E) ip_set(E) nfnetlink(E) ebtable_filter(E) ebtables(E) ip6table_filter(E) ip6_tables(E) rfkill(E) iptable_filter(E) ip_tables(E) x_tables(E) bpfilter(E) intel_rapl_common(E)
iosf_mbi(E) isst_if_common(E) i2c_i801(E) pcspkr(E) i2c_smbus(E) virtio_net(E) lpc_ich(E) virtio_balloon(E) net_failover(E) failover(E) tiny_power_button(E) button(E) fuse(E) crct10dif_pclmul(E) crc32_pclmul(E) crc32c_intel(E) ghash_clmulni_intel(E) aesni_intel(E)
crypto_simd(E) cryptd(E) sr_mod(E) serio_raw(E) cdrom(E) virtio_gpu(E) virtio_dma_buf(E) drm_kms_helper(E) syscopyarea(E) sysfillrect(E) sysimgblt(E) fb_sys_fops(E) cec(E) drm(E) qemu_fw_cfg(E) sg(E) nbd(E) dm_multipath(E) dm_mod(E) scsi_dh_rdac(E) scsi_dh_emc(E)
scsi_dh_alua(E) virtio_rng(E)
CR2: ffffc900e357d614
---[ end trace 0667a278da47193a ]---
RIP: 0010:rxe_completer+0x96d/0x1050 [rdma_rxe]
Code: e0 49 8b 44 24 08 44 89 e9 41 d3 e6 4e 8d a4 30 80 01 00 00 4d 85 e4 0f 84 f9 00 00 00 49 8d bc 24 94 00 00 00 e8 73 a8 b1 e0 <41> 8b 84 24 94 00 00 00 85 c0 0f 84 df 00 00 00 83 f8 03 0f 84 bf
RSP: 0018:ffff8881014075f8 EFLAGS: 00010246
RAX: 0000000000000000 RBX: ffff88813c67c000 RCX: dffffc0000000000
RDX: 0000000000000007 RSI: ffffffff826920c0 RDI: ffffc900e357d614
RBP: ffff8881014076e8 R08: ffffffffa09b228d R09: ffff88813c67c57b
R10: ffffed10278cf8af R11: 0000000000000000 R12: ffffc900e357d580
R13: 000000000000000a R14: 00000000d9c99400 R15: ffff8881515ddd08
FS: 0000000000000000(0000) GS:ffff88842d100000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: ffffc900e357d614 CR3: 0000000002e29005 CR4: 0000000000770ee0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
PKRU: 55555554
Kernel panic - not syncing: Fatal exception in interrupt
Kernel Offset: disabled
Rebooting in 90 seconds..
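The register dump above can be cross-checked with a little arithmetic. The sketch below is not part of the original report; it only re-derives the faulting address from values printed in the oops. The decoding of the marked instruction bytes (`<41> 8b 84 24 94 00 00 00` as a 4-byte load from R12 + 0x94, preceded by a 32-bit left shift of R14 by the count in R13 and an address computation that adds 0x180) is a hand reading of the Code: line. Reading R14 as a shifted queue index is an assumption, and the helper names are hypothetical.

```python
# Values copied verbatim from the oops register dump.
R12 = 0xFFFFC900E357D580
R13 = 0x000000000000000A
R14 = 0x00000000D9C99400
CR2 = 0xFFFFC900E357D614

# Displacements read by hand from the Code: bytes (assumption, see lead-in).
LOAD_DISP = 0x94    # displacement of the faulting 4-byte load
DATA_DISP = 0x180   # displacement added when R12 was computed

MASK64 = 0xFFFFFFFFFFFFFFFF


def faulting_address(base: int, disp: int) -> int:
    """Effective address of a base+displacement load, wrapped to 64 bits."""
    return (base + disp) & MASK64


def unshifted(value: int, shift: int) -> int:
    """Undo a left shift. The shift was done on a 32-bit register, so any
    bits that overflowed past bit 31 are lost and cannot be recovered."""
    return value >> shift


def implied_base(elem_addr: int, disp: int, offset: int) -> int:
    """Address left after removing the displacement and the shifted offset."""
    return (elem_addr - disp - offset) & MASK64


if __name__ == "__main__":
    print("R12 + 0x94     =", hex(faulting_address(R12, LOAD_DISP)))
    print("CR2            =", hex(CR2))
    print("R14 >> R13     =", hex(unshifted(R14, R13)))
    print("implied base   =", hex(implied_base(R12, DATA_DISP, R14)))
```

If the hand decoding is right, R12 + 0x94 equals CR2, so the fault is the load itself, and R12 sits about 3.4 GiB (the value of R14) beyond a page-aligned base address. That would be consistent with an out-of-range index being used to locate an element, but the oops alone does not prove which index or structure is involved.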