* "memory management error" with NFS/RDMA on RoCE
@ 2017-06-22 18:28 Chuck Lever
[not found] ` <7F0FCF80-DB7B-46F1-BB9A-0B070603DE61-QHcLZuEGTsvQT0dZR+AlfA@public.gmane.org>
0 siblings, 1 reply; 12+ messages in thread
From: Chuck Lever @ 2017-06-22 18:28 UTC (permalink / raw)
To: linux-rdma
While running xfstests on an NFS/RDMA mount, I see this in
the client's /var/log/messages multiple times:
Jun 22 14:13:45 manet kernel: mlx5_0:dump_cqe:275:(pid 0): dump error cqe
Jun 22 14:13:45 manet kernel: 00000000 00000000 00000000 00000000
Jun 22 14:13:45 manet kernel: 00000000 00000000 00000000 00000000
Jun 22 14:13:45 manet kernel: 00000000 00000000 00000000 00000000
Jun 22 14:13:45 manet kernel: 00000000 08007806 250000cd 024027d3
Jun 22 14:13:45 manet kernel: rpcrdma: fastreg: memory management operation error (6/0x78)
As far as I can tell the client is able to recover and continue
the test. However, this error is not supposed to happen in normal
operation.
This is with a Mellanox CX4 in RoCEv1 mode, v4.12-rc2.
--
Chuck Lever
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
* Re: "memory management error" with NFS/RDMA on RoCE
[not found] ` <7F0FCF80-DB7B-46F1-BB9A-0B070603DE61-QHcLZuEGTsvQT0dZR+AlfA@public.gmane.org>
@ 2017-06-22 20:57 ` Robert LeBlanc
2017-06-27 9:28 ` Sagi Grimberg
1 sibling, 0 replies; 12+ messages in thread
From: Robert LeBlanc @ 2017-06-22 20:57 UTC (permalink / raw)
To: Chuck Lever; +Cc: linux-rdma
On Thu, Jun 22, 2017 at 12:28 PM, Chuck Lever <chuck.lever-QHcLZuEGTsvQT0dZR+AlfA@public.gmane.org> wrote:
> While running xfstests on an NFS/RDMA mount, I see this in
> the client's /var/log/messages multiple times:
>
> Jun 22 14:13:45 manet kernel: mlx5_0:dump_cqe:275:(pid 0): dump error cqe
> Jun 22 14:13:45 manet kernel: 00000000 00000000 00000000 00000000
> Jun 22 14:13:45 manet kernel: 00000000 00000000 00000000 00000000
> Jun 22 14:13:45 manet kernel: 00000000 00000000 00000000 00000000
> Jun 22 14:13:45 manet kernel: 00000000 08007806 250000cd 024027d3
> Jun 22 14:13:45 manet kernel: rpcrdma: fastreg: memory management operation error (6/0x78)
>
> As far as I can tell the client is able to recover and continue
> the test. However, this error is not supposed to happen in normal
> operation.
>
> This is with a Mellanox CX4 in RoCEv1 mode, v4.12-rc2.
>
>
> --
> Chuck Lever
Surprisingly, I hit this today on iSER as well, but on 4.9, using a CX4 in RoCEv2 mode over IPv6:
[Thu Jun 22 14:37:11 2017] mlx5_0:dump_cqe:262:(pid 0): dump error cqe
[Thu Jun 22 14:37:11 2017] 00000000 00000000 00000000 00000000
[Thu Jun 22 14:37:11 2017] 00000000 00000000 00000000 00000000
[Thu Jun 22 14:37:11 2017] 00000000 00000000 00000000 00000000
[Thu Jun 22 14:37:11 2017] 00000000 08007806 2500011f ca4f0fd3
[Thu Jun 22 14:37:11 2017] iser: iser_err_comp: memreg failure: memory
management operation error (6) vend_err 78
[Thu Jun 22 14:37:11 2017] connection4:0: detected conn error (1011)
----------------
Robert LeBlanc
PGP Fingerprint 79A2 9CA4 6CC4 45DD A904 C70E E654 3BB2 FA62 B9F1
* Re: "memory management error" with NFS/RDMA on RoCE
[not found] ` <7F0FCF80-DB7B-46F1-BB9A-0B070603DE61-QHcLZuEGTsvQT0dZR+AlfA@public.gmane.org>
2017-06-22 20:57 ` Robert LeBlanc
@ 2017-06-27 9:28 ` Sagi Grimberg
[not found] ` <797a43c4-f30d-9deb-a332-c62cbd01be7b-NQWnxTmZq1alnMjI0IkVqw@public.gmane.org>
1 sibling, 1 reply; 12+ messages in thread
From: Sagi Grimberg @ 2017-06-27 9:28 UTC (permalink / raw)
To: Chuck Lever, linux-rdma
> While running xfstests on an NFS/RDMA mount, I see this in
> the client's /var/log/messages multiple times:
>
> Jun 22 14:13:45 manet kernel: mlx5_0:dump_cqe:275:(pid 0): dump error cqe
> Jun 22 14:13:45 manet kernel: 00000000 00000000 00000000 00000000
> Jun 22 14:13:45 manet kernel: 00000000 00000000 00000000 00000000
> Jun 22 14:13:45 manet kernel: 00000000 00000000 00000000 00000000
> Jun 22 14:13:45 manet kernel: 00000000 08007806 250000cd 024027d3
> Jun 22 14:13:45 manet kernel: rpcrdma: fastreg: memory management operation error (6/0x78)
>
> As far as I can tell the client is able to recover and continue
> the test. However, this error is not supposed to happen in normal
> operation.
>
> This is with a Mellanox CX4 in RoCEv1 mode, v4.12-rc2.
Is this a regression? What kernel version are you running?
FW revision?
Is the below commit applied?
commit 6e8484c5cf07c7ee632587e98c1a12d319dacb7c
Author: Max Gurtovoy <maxg-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
Date: Sun May 28 10:53:11 2017 +0300
RDMA/mlx5: set UMR wqe fence according to HCA cap
Cache the needed umr_fence and set the wqe ctrl segmennt
accordingly.
Signed-off-by: Max Gurtovoy <maxg-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
Acked-by: Leon Romanovsky <leon-DgEjT+Ai2ygdnm+yROfE0A@public.gmane.org>
Reviewed-by: Sagi Grimberg <sagi-NQWnxTmZq1alnMjI0IkVqw@public.gmane.org>
Signed-off-by: Doug Ledford <dledford-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>
This is the only thing that changed in that area
lately...
Can you try without it?
* Re: "memory management error" with NFS/RDMA on RoCE
[not found] ` <797a43c4-f30d-9deb-a332-c62cbd01be7b-NQWnxTmZq1alnMjI0IkVqw@public.gmane.org>
@ 2017-06-27 14:56 ` Chuck Lever
[not found] ` <2FEEE227-9BCF-4454-A056-3997C1E54686-QHcLZuEGTsvQT0dZR+AlfA@public.gmane.org>
0 siblings, 1 reply; 12+ messages in thread
From: Chuck Lever @ 2017-06-27 14:56 UTC (permalink / raw)
To: Sagi Grimberg; +Cc: linux-rdma
Hi Sagi-
> On Jun 27, 2017, at 5:28 AM, Sagi Grimberg <sagi-NQWnxTmZq1alnMjI0IkVqw@public.gmane.org> wrote:
>
>
>> While running xfstests on an NFS/RDMA mount, I see this in
>> the client's /var/log/messages multiple times:
>> Jun 22 14:13:45 manet kernel: mlx5_0:dump_cqe:275:(pid 0): dump error cqe
>> Jun 22 14:13:45 manet kernel: 00000000 00000000 00000000 00000000
>> Jun 22 14:13:45 manet kernel: 00000000 00000000 00000000 00000000
>> Jun 22 14:13:45 manet kernel: 00000000 00000000 00000000 00000000
>> Jun 22 14:13:45 manet kernel: 00000000 08007806 250000cd 024027d3
>> Jun 22 14:13:45 manet kernel: rpcrdma: fastreg: memory management operation error (6/0x78)
>> As far as I can tell the client is able to recover and continue
>> the test. However, this error is not supposed to happen in normal
>> operation.
>> This is with a Mellanox CX4 in RoCEv1 mode, v4.12-rc2.
>
> Is this a regression?
I can't answer that question with authority, because I just
started trying out NFS/RDMA on RoCE with mlx5. But Robert has
reported very similar symptoms with iSER on v4.9. It appears
to have been around for a while, if these are the same.
> What kernel version are you running?
v4.12-rc2.
> FW revision?
12.18.2000
> Is the below commit applied?
This commit does not appear to be applied to my kernel.
> commit 6e8484c5cf07c7ee632587e98c1a12d319dacb7c
> Author: Max Gurtovoy <maxg-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
> Date: Sun May 28 10:53:11 2017 +0300
>
> RDMA/mlx5: set UMR wqe fence according to HCA cap
>
> Cache the needed umr_fence and set the wqe ctrl segmennt
> accordingly.
>
> Signed-off-by: Max Gurtovoy <maxg-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
> Acked-by: Leon Romanovsky <leon-DgEjT+Ai2ygdnm+yROfE0A@public.gmane.org>
> Reviewed-by: Sagi Grimberg <sagi-NQWnxTmZq1alnMjI0IkVqw@public.gmane.org>
> Signed-off-by: Doug Ledford <dledford-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>
>
> This is the only thing that changed in that area
> lately...
>
> Can you try without it?
I haven't tried with it. I can pull it and see if it helps.
I have tried:
- with and without IOMMU enabled
- with RoCE v1 and v2
- with instrumentation:
This can happen to any MR at any time after any number of
uses. It does not appear to be "sticky" (ie, xprtrdma
recovery from a memory management error clears the problem
successfully by releasing the MR and allocating a new one).
So it feels like a f/w or driver problem to me, at this
point.
--
Chuck Lever
* Re: "memory management error" with NFS/RDMA on RoCE
[not found] ` <2FEEE227-9BCF-4454-A056-3997C1E54686-QHcLZuEGTsvQT0dZR+AlfA@public.gmane.org>
@ 2017-06-27 16:08 ` Sagi Grimberg
[not found] ` <a82056d7-5685-5b85-8226-c54065e729fe-NQWnxTmZq1alnMjI0IkVqw@public.gmane.org>
2017-06-27 17:36 ` Leon Romanovsky
2017-08-08 15:45 ` Max Gurtovoy
2 siblings, 1 reply; 12+ messages in thread
From: Sagi Grimberg @ 2017-06-27 16:08 UTC (permalink / raw)
To: Chuck Lever; +Cc: linux-rdma
> Hi Sagi-
>
>> On Jun 27, 2017, at 5:28 AM, Sagi Grimberg <sagi-NQWnxTmZq1alnMjI0IkVqw@public.gmane.org> wrote:
>>
>>
>>> While running xfstests on an NFS/RDMA mount, I see this in
>>> the client's /var/log/messages multiple times:
>>> Jun 22 14:13:45 manet kernel: mlx5_0:dump_cqe:275:(pid 0): dump error cqe
>>> Jun 22 14:13:45 manet kernel: 00000000 00000000 00000000 00000000
>>> Jun 22 14:13:45 manet kernel: 00000000 00000000 00000000 00000000
>>> Jun 22 14:13:45 manet kernel: 00000000 00000000 00000000 00000000
>>> Jun 22 14:13:45 manet kernel: 00000000 08007806 250000cd 024027d3
>>> Jun 22 14:13:45 manet kernel: rpcrdma: fastreg: memory management operation error (6/0x78)
>>> As far as I can tell the client is able to recover and continue
>>> the test. However, this error is not supposed to happen in normal
>>> operation.
>>> This is with a Mellanox CX4 in RoCEv1 mode, v4.12-rc2.
>>
>> Is this a regression?
>
> I can't answer that question with authority, because I just
> started trying out NFS/RDMA on RoCE with mlx5. But Robert has
> reported very similar symptoms with iSER on v4.9. It appears
> to have been around for a while, if these are the same.
Is Robert running 4.9 on the initiator side?
* Re: "memory management error" with NFS/RDMA on RoCE
[not found] ` <a82056d7-5685-5b85-8226-c54065e729fe-NQWnxTmZq1alnMjI0IkVqw@public.gmane.org>
@ 2017-06-27 17:03 ` Robert LeBlanc
0 siblings, 0 replies; 12+ messages in thread
From: Robert LeBlanc @ 2017-06-27 17:03 UTC (permalink / raw)
To: Sagi Grimberg; +Cc: Chuck Lever, linux-rdma
On Tue, Jun 27, 2017 at 10:08 AM, Sagi Grimberg <sagi-NQWnxTmZq1alnMjI0IkVqw@public.gmane.org> wrote:
>> Hi Sagi-
>>
>>> On Jun 27, 2017, at 5:28 AM, Sagi Grimberg <sagi-NQWnxTmZq1alnMjI0IkVqw@public.gmane.org> wrote:
>>>
>>>
>>>> While running xfstests on an NFS/RDMA mount, I see this in
>>>> the client's /var/log/messages multiple times:
>>>> Jun 22 14:13:45 manet kernel: mlx5_0:dump_cqe:275:(pid 0): dump error
>>>> cqe
>>>> Jun 22 14:13:45 manet kernel: 00000000 00000000 00000000 00000000
>>>> Jun 22 14:13:45 manet kernel: 00000000 00000000 00000000 00000000
>>>> Jun 22 14:13:45 manet kernel: 00000000 00000000 00000000 00000000
>>>> Jun 22 14:13:45 manet kernel: 00000000 08007806 250000cd 024027d3
>>>> Jun 22 14:13:45 manet kernel: rpcrdma: fastreg: memory management
>>>> operation error (6/0x78)
>>>> As far as I can tell the client is able to recover and continue
>>>> the test. However, this error is not supposed to happen in normal
>>>> operation.
>>>> This is with a Mellanox CX4 in RoCEv1 mode, v4.12-rc2.
>>>
>>>
>>> Is this a regression?
>>
>>
>> I can't answer that question with authority, because I just
>> started trying out NFS/RDMA on RoCE with mlx5. But Robert has
>> reported very similar symptoms with iSER on v4.9. It appears
>> to have been around for a while, if these are the same.
>
>
> Is Robert running 4.9 on the initiator side?
Yes, this was 4.9 on the initiator side.
----------------
Robert LeBlanc
PGP Fingerprint 79A2 9CA4 6CC4 45DD A904 C70E E654 3BB2 FA62 B9F1
* Re: "memory management error" with NFS/RDMA on RoCE
[not found] ` <2FEEE227-9BCF-4454-A056-3997C1E54686-QHcLZuEGTsvQT0dZR+AlfA@public.gmane.org>
2017-06-27 16:08 ` Sagi Grimberg
@ 2017-06-27 17:36 ` Leon Romanovsky
[not found] ` <20170627173620.GT1248-U/DQcQFIOTAAJjI8aNfphQ@public.gmane.org>
2017-08-08 15:45 ` Max Gurtovoy
2 siblings, 1 reply; 12+ messages in thread
From: Leon Romanovsky @ 2017-06-27 17:36 UTC (permalink / raw)
To: Chuck Lever; +Cc: Sagi Grimberg, linux-rdma
On Tue, Jun 27, 2017 at 10:56:29AM -0400, Chuck Lever wrote:
> Hi Sagi-
>
> > On Jun 27, 2017, at 5:28 AM, Sagi Grimberg <sagi-NQWnxTmZq1alnMjI0IkVqw@public.gmane.org> wrote:
> >
> >
> >> While running xfstests on an NFS/RDMA mount, I see this in
> >> the client's /var/log/messages multiple times:
> >> Jun 22 14:13:45 manet kernel: mlx5_0:dump_cqe:275:(pid 0): dump error cqe
> >> Jun 22 14:13:45 manet kernel: 00000000 00000000 00000000 00000000
> >> Jun 22 14:13:45 manet kernel: 00000000 00000000 00000000 00000000
> >> Jun 22 14:13:45 manet kernel: 00000000 00000000 00000000 00000000
> >> Jun 22 14:13:45 manet kernel: 00000000 08007806 250000cd 024027d3
> >> Jun 22 14:13:45 manet kernel: rpcrdma: fastreg: memory management operation error (6/0x78)
> >> As far as I can tell the client is able to recover and continue
> >> the test. However, this error is not supposed to happen in normal
> >> operation.
> >> This is with a Mellanox CX4 in RoCEv1 mode, v4.12-rc2.
> >
> > Is this a regression?
>
> I can't answer that question with authority, because I just
> started trying out NFS/RDMA on RoCE with mlx5. But Robert has
> reported very similar symptoms with iSER on v4.9. It appears
> to have been around for a while, if these are the same.
>
>
> > What kernel version are you running?
>
> v4.12-rc2.
>
>
> > FW revision?
>
> 12.18.2000
>
>
> > Is the below commit applied?
>
> This commit does not appear to be applied to my kernel.
>
>
> > commit 6e8484c5cf07c7ee632587e98c1a12d319dacb7c
> > Author: Max Gurtovoy <maxg-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
> > Date: Sun May 28 10:53:11 2017 +0300
> >
> > RDMA/mlx5: set UMR wqe fence according to HCA cap
> >
> > Cache the needed umr_fence and set the wqe ctrl segmennt
> > accordingly.
> >
> > Signed-off-by: Max Gurtovoy <maxg-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
> > Acked-by: Leon Romanovsky <leon-DgEjT+Ai2ygdnm+yROfE0A@public.gmane.org>
> > Reviewed-by: Sagi Grimberg <sagi-NQWnxTmZq1alnMjI0IkVqw@public.gmane.org>
> > Signed-off-by: Doug Ledford <dledford-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>
> >
> > This is the only thing that changed in that area
> > lately...
> >
> > Can you try without it?
>
> I haven't tried with it. I can pull it and see if it helps.
>
> I have tried:
>
> - with and without IOMMU enabled
> - with RoCE v1 and v2
> - with instrumentation:
>
> This can happen to any MR at any time after any number of
> uses. It does not appear to be "sticky" (ie, xprtrdma
> recovery from a memory management error clears the problem
> successfully by releasing the MR and allocating a new one).
>
> So it feels like a f/w or driver problem to me, at this
> point.
Jack and I discussed your issue this morning, and we have a strong
feeling that it is FW.
Thanks
>
> --
> Chuck Lever
>
>
>
* Re: "memory management error" with NFS/RDMA on RoCE
[not found] ` <20170627173620.GT1248-U/DQcQFIOTAAJjI8aNfphQ@public.gmane.org>
@ 2017-06-27 19:30 ` Chuck Lever
2017-07-05 14:40 ` Chuck Lever
1 sibling, 0 replies; 12+ messages in thread
From: Chuck Lever @ 2017-06-27 19:30 UTC (permalink / raw)
To: Leon Romanovsky; +Cc: Sagi Grimberg, linux-rdma
> On Jun 27, 2017, at 1:36 PM, Leon Romanovsky <leon-DgEjT+Ai2ygdnm+yROfE0A@public.gmane.org> wrote:
>
> On Tue, Jun 27, 2017 at 10:56:29AM -0400, Chuck Lever wrote:
>> Hi Sagi-
>>
>>> On Jun 27, 2017, at 5:28 AM, Sagi Grimberg <sagi-NQWnxTmZq1alnMjI0IkVqw@public.gmane.org> wrote:
>>>
>>>
>>>> While running xfstests on an NFS/RDMA mount, I see this in
>>>> the client's /var/log/messages multiple times:
>>>> Jun 22 14:13:45 manet kernel: mlx5_0:dump_cqe:275:(pid 0): dump error cqe
>>>> Jun 22 14:13:45 manet kernel: 00000000 00000000 00000000 00000000
>>>> Jun 22 14:13:45 manet kernel: 00000000 00000000 00000000 00000000
>>>> Jun 22 14:13:45 manet kernel: 00000000 00000000 00000000 00000000
>>>> Jun 22 14:13:45 manet kernel: 00000000 08007806 250000cd 024027d3
>>>> Jun 22 14:13:45 manet kernel: rpcrdma: fastreg: memory management operation error (6/0x78)
>>>> As far as I can tell the client is able to recover and continue
>>>> the test. However, this error is not supposed to happen in normal
>>>> operation.
>>>> This is with a Mellanox CX4 in RoCEv1 mode, v4.12-rc2.
>>>
>>> Is this a regression?
>>
>> I can't answer that question with authority, because I just
>> started trying out NFS/RDMA on RoCE with mlx5. But Robert has
>> reported very similar symptoms with iSER on v4.9. It appears
>> to have been around for a while, if these are the same.
>>
>>
>>> What kernel version are you running?
>>
>> v4.12-rc2.
>>
>>
>>> FW revision?
>>
>> 12.18.2000
>>
>>
>>> Is the below commit applied?
>>
>> This commit does not appear to be applied to my kernel.
>>
>>
>>> commit 6e8484c5cf07c7ee632587e98c1a12d319dacb7c
>>> Author: Max Gurtovoy <maxg-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
>>> Date: Sun May 28 10:53:11 2017 +0300
>>>
>>> RDMA/mlx5: set UMR wqe fence according to HCA cap
>>>
>>> Cache the needed umr_fence and set the wqe ctrl segmennt
>>> accordingly.
>>>
>>> Signed-off-by: Max Gurtovoy <maxg-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
>>> Acked-by: Leon Romanovsky <leon-DgEjT+Ai2ygdnm+yROfE0A@public.gmane.org>
>>> Reviewed-by: Sagi Grimberg <sagi-NQWnxTmZq1alnMjI0IkVqw@public.gmane.org>
>>> Signed-off-by: Doug Ledford <dledford-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>
>>>
>>> This is the only thing that changed in that area
>>> lately...
>>>
>>> Can you try without it?
>>
>> I haven't tried with it. I can pull it and see if it helps.
>>
>> I have tried:
>>
>> - with and without IOMMU enabled
>> - with RoCE v1 and v2
>> - with instrumentation:
>>
>> This can happen to any MR at any time after any number of
>> uses. It does not appear to be "sticky" (ie, xprtrdma
>> recovery from a memory management error clears the problem
>> successfully by releasing the MR and allocating a new one).
>>
>> So it feels like a f/w or driver problem to me, at this
>> point.
>
> Jack and I discussed your issue this morning, and we have a strong
> feeling that it is FW.
A little more, not sure this is helpful:
The flushed FastReg WRs are all for IB_ACCESS_REMOTE_READ.
> Thanks
>
>>
>> --
>> Chuck Lever
>>
>>
>>
--
Chuck Lever
* Re: "memory management error" with NFS/RDMA on RoCE
[not found] ` <20170627173620.GT1248-U/DQcQFIOTAAJjI8aNfphQ@public.gmane.org>
2017-06-27 19:30 ` Chuck Lever
@ 2017-07-05 14:40 ` Chuck Lever
[not found] ` <06510488-DB16-4781-8E5A-FDFFDDD00B4F-QHcLZuEGTsvQT0dZR+AlfA@public.gmane.org>
1 sibling, 1 reply; 12+ messages in thread
From: Chuck Lever @ 2017-07-05 14:40 UTC (permalink / raw)
To: Leon Romanovsky; +Cc: Sagi Grimberg, linux-rdma
> On Jun 27, 2017, at 1:36 PM, Leon Romanovsky <leon-DgEjT+Ai2ygdnm+yROfE0A@public.gmane.org> wrote:
>
> On Tue, Jun 27, 2017 at 10:56:29AM -0400, Chuck Lever wrote:
>> Hi Sagi-
>>
>>> On Jun 27, 2017, at 5:28 AM, Sagi Grimberg <sagi-NQWnxTmZq1alnMjI0IkVqw@public.gmane.org> wrote:
>>>
>>>
>>>> While running xfstests on an NFS/RDMA mount, I see this in
>>>> the client's /var/log/messages multiple times:
>>>> Jun 22 14:13:45 manet kernel: mlx5_0:dump_cqe:275:(pid 0): dump error cqe
>>>> Jun 22 14:13:45 manet kernel: 00000000 00000000 00000000 00000000
>>>> Jun 22 14:13:45 manet kernel: 00000000 00000000 00000000 00000000
>>>> Jun 22 14:13:45 manet kernel: 00000000 00000000 00000000 00000000
>>>> Jun 22 14:13:45 manet kernel: 00000000 08007806 250000cd 024027d3
>>>> Jun 22 14:13:45 manet kernel: rpcrdma: fastreg: memory management operation error (6/0x78)
>>>> As far as I can tell the client is able to recover and continue
>>>> the test. However, this error is not supposed to happen in normal
>>>> operation.
>>>> This is with a Mellanox CX4 in RoCEv1 mode, v4.12-rc2.
>>>
>>> Is this a regression?
>>
>> I can't answer that question with authority, because I just
>> started trying out NFS/RDMA on RoCE with mlx5. But Robert has
>> reported very similar symptoms with iSER on v4.9. It appears
>> to have been around for a while, if these are the same.
>>
>>
>>> What kernel version are you running?
>>
>> v4.12-rc2.
>>
>>
>>> FW revision?
>>
>> 12.18.2000
>>
>>
>>> Is the below commit applied?
>>
>> This commit does not appear to be applied to my kernel.
>>
>>
>>> commit 6e8484c5cf07c7ee632587e98c1a12d319dacb7c
>>> Author: Max Gurtovoy <maxg-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
>>> Date: Sun May 28 10:53:11 2017 +0300
>>>
>>> RDMA/mlx5: set UMR wqe fence according to HCA cap
>>>
>>> Cache the needed umr_fence and set the wqe ctrl segmennt
>>> accordingly.
>>>
>>> Signed-off-by: Max Gurtovoy <maxg-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
>>> Acked-by: Leon Romanovsky <leon-DgEjT+Ai2ygdnm+yROfE0A@public.gmane.org>
>>> Reviewed-by: Sagi Grimberg <sagi-NQWnxTmZq1alnMjI0IkVqw@public.gmane.org>
>>> Signed-off-by: Doug Ledford <dledford-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>
>>>
>>> This is the only thing that changed in that area
>>> lately...
>>>
>>> Can you try without it?
>>
>> I haven't tried with it. I can pull it and see if it helps.
>>
>> I have tried:
>>
>> - with and without IOMMU enabled
>> - with RoCE v1 and v2
>> - with instrumentation:
>>
>> This can happen to any MR at any time after any number of
>> uses. It does not appear to be "sticky" (ie, xprtrdma
>> recovery from a memory management error clears the problem
>> successfully by releasing the MR and allocating a new one).
>>
>> So it feels like a f/w or driver problem to me, at this
>> point.
>
> Jack and I discussed your issue this morning, and we have a strong
> feeling that it is FW.
Hi Leon-
Who is going to drive this issue to resolution? Do you need me
to do something?
> Thanks
>
>>
>> --
>> Chuck Lever
>>
>>
>>
--
Chuck Lever
* Re: "memory management error" with NFS/RDMA on RoCE
[not found] ` <06510488-DB16-4781-8E5A-FDFFDDD00B4F-QHcLZuEGTsvQT0dZR+AlfA@public.gmane.org>
@ 2017-07-05 15:29 ` Leon Romanovsky
0 siblings, 0 replies; 12+ messages in thread
From: Leon Romanovsky @ 2017-07-05 15:29 UTC (permalink / raw)
To: Chuck Lever; +Cc: Sagi Grimberg, linux-rdma
On Wed, Jul 05, 2017 at 10:40:41AM -0400, Chuck Lever wrote:
>
> > On Jun 27, 2017, at 1:36 PM, Leon Romanovsky <leon-DgEjT+Ai2ygdnm+yROfE0A@public.gmane.org> wrote:
> >
> > On Tue, Jun 27, 2017 at 10:56:29AM -0400, Chuck Lever wrote:
> >> Hi Sagi-
> >>
> >>> On Jun 27, 2017, at 5:28 AM, Sagi Grimberg <sagi-NQWnxTmZq1alnMjI0IkVqw@public.gmane.org> wrote:
> >>>
> >>>
> >>>> While running xfstests on an NFS/RDMA mount, I see this in
> >>>> the client's /var/log/messages multiple times:
> >>>> Jun 22 14:13:45 manet kernel: mlx5_0:dump_cqe:275:(pid 0): dump error cqe
> >>>> Jun 22 14:13:45 manet kernel: 00000000 00000000 00000000 00000000
> >>>> Jun 22 14:13:45 manet kernel: 00000000 00000000 00000000 00000000
> >>>> Jun 22 14:13:45 manet kernel: 00000000 00000000 00000000 00000000
> >>>> Jun 22 14:13:45 manet kernel: 00000000 08007806 250000cd 024027d3
> >>>> Jun 22 14:13:45 manet kernel: rpcrdma: fastreg: memory management operation error (6/0x78)
> >>>> As far as I can tell the client is able to recover and continue
> >>>> the test. However, this error is not supposed to happen in normal
> >>>> operation.
> >>>> This is with a Mellanox CX4 in RoCEv1 mode, v4.12-rc2.
> >>>
> >>> Is this a regression?
> >>
> >> I can't answer that question with authority, because I just
> >> started trying out NFS/RDMA on RoCE with mlx5. But Robert has
> >> reported very similar symptoms with iSER on v4.9. It appears
> >> to have been around for a while, if these are the same.
> >>
> >>
> >>> What kernel version are you running?
> >>
> >> v4.12-rc2.
> >>
> >>
> >>> FW revision?
> >>
> >> 12.18.2000
> >>
> >>
> >>> Is the below commit applied?
> >>
> >> This commit does not appear to be applied to my kernel.
> >>
> >>
> >>> commit 6e8484c5cf07c7ee632587e98c1a12d319dacb7c
> >>> Author: Max Gurtovoy <maxg-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
> >>> Date: Sun May 28 10:53:11 2017 +0300
> >>>
> >>> RDMA/mlx5: set UMR wqe fence according to HCA cap
> >>>
> >>> Cache the needed umr_fence and set the wqe ctrl segmennt
> >>> accordingly.
> >>>
> >>> Signed-off-by: Max Gurtovoy <maxg-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
> >>> Acked-by: Leon Romanovsky <leon-DgEjT+Ai2ygdnm+yROfE0A@public.gmane.org>
> >>> Reviewed-by: Sagi Grimberg <sagi-NQWnxTmZq1alnMjI0IkVqw@public.gmane.org>
> >>> Signed-off-by: Doug Ledford <dledford-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>
> >>>
> >>> This is the only thing that changed in that area
> >>> lately...
> >>>
> >>> Can you try without it?
> >>
> >> I haven't tried with it. I can pull it and see if it helps.
> >>
> >> I have tried:
> >>
> >> - with and without IOMMU enabled
> >> - with RoCE v1 and v2
> >> - with instrumentation:
> >>
> >> This can happen to any MR at any time after any number of
> >> uses. It does not appear to be "sticky" (ie, xprtrdma
> >> recovery from a memory management error clears the problem
> >> successfully by releasing the MR and allocating a new one).
> >>
> >> So it feels like a f/w or driver problem to me, at this
> >> point.
> >
> > Jack and I discussed your issue this morning, and we have a strong
> > feeling that it is FW.
>
> Hi Leon-
>
> Who is going to drive this issue to resolution? Do you need me
> to do something?
I don't think so, Jack was supposed to do it.
>
>
> > Thanks
> >
> >>
> >> --
> >> Chuck Lever
> >>
> >>
> >>
>
> --
> Chuck Lever
>
>
>
* Re: "memory management error" with NFS/RDMA on RoCE
[not found] ` <2FEEE227-9BCF-4454-A056-3997C1E54686-QHcLZuEGTsvQT0dZR+AlfA@public.gmane.org>
2017-06-27 16:08 ` Sagi Grimberg
2017-06-27 17:36 ` Leon Romanovsky
@ 2017-08-08 15:45 ` Max Gurtovoy
[not found] ` <7ef3ca44-1253-7aae-1b46-f78cc15e627d-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
2 siblings, 1 reply; 12+ messages in thread
From: Max Gurtovoy @ 2017-08-08 15:45 UTC (permalink / raw)
To: Chuck Lever, Sagi Grimberg; +Cc: linux-rdma
Hi all,
sorry for the late response.
On 6/27/2017 5:56 PM, Chuck Lever wrote:
> Hi Sagi-
>
>> On Jun 27, 2017, at 5:28 AM, Sagi Grimberg <sagi-NQWnxTmZq1alnMjI0IkVqw@public.gmane.org> wrote:
>>
>>
>>> While running xfstests on an NFS/RDMA mount, I see this in
>>> the client's /var/log/messages multiple times:
>>> Jun 22 14:13:45 manet kernel: mlx5_0:dump_cqe:275:(pid 0): dump error cqe
>>> Jun 22 14:13:45 manet kernel: 00000000 00000000 00000000 00000000
>>> Jun 22 14:13:45 manet kernel: 00000000 00000000 00000000 00000000
>>> Jun 22 14:13:45 manet kernel: 00000000 00000000 00000000 00000000
>>> Jun 22 14:13:45 manet kernel: 00000000 08007806 250000cd 024027d3
>>> Jun 22 14:13:45 manet kernel: rpcrdma: fastreg: memory management operation error (6/0x78)
>>> As far as I can tell the client is able to recover and continue
>>> the test. However, this error is not supposed to happen in normal
>>> operation.
>>> This is with a Mellanox CX4 in RoCEv1 mode, v4.12-rc2.
>>
>> Is this a regression?
>
> I can't answer that question with authority, because I just
> started trying out NFS/RDMA on RoCE with mlx5. But Robert has
> reported very similar symptoms with iSER on v4.9. It appears
> to have been around for a while, if these are the same.
>
>
>> What kernel version are you running?
>
> v4.12-rc2.
>
>
>> FW revision?
>
> 12.18.2000
>
>
>> Is the below commit applied?
>
> This commit does not appear to be applied to my kernel.
>
>
>> commit 6e8484c5cf07c7ee632587e98c1a12d319dacb7c
>> Author: Max Gurtovoy <maxg-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
>> Date: Sun May 28 10:53:11 2017 +0300
>>
>> RDMA/mlx5: set UMR wqe fence according to HCA cap
>>
>> Cache the needed umr_fence and set the wqe ctrl segmennt
>> accordingly.
>>
>> Signed-off-by: Max Gurtovoy <maxg-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
>> Acked-by: Leon Romanovsky <leon-DgEjT+Ai2ygdnm+yROfE0A@public.gmane.org>
>> Reviewed-by: Sagi Grimberg <sagi-NQWnxTmZq1alnMjI0IkVqw@public.gmane.org>
>> Signed-off-by: Doug Ledford <dledford-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>
>>
>> This is the only thing that changed in that area
>> lately...
>>
>> Can you try without it?
>
> I haven't tried with it. I can pull it and see if it helps.
Chuck,
any updates using my patch above? (You actually need commit
1410a90ae449061b7e1ae19d275148f36948801b as a precondition.)
>
> I have tried:
>
> - with and without IOMMU enabled
> - with RoCE v1 and v2
> - with instrumentation:
>
> This can happen to any MR at any time after any number of
> uses. It does not appear to be "sticky" (ie, xprtrdma
> recovery from a memory management error clears the problem
> successfully by releasing the MR and allocating a new one).
>
I'm not so familiar with the NFS/RDMA I/O path yet, but are you using
remote invalidation from the server side, or do you run local invalidation?
Which side initiates the RDMA_READ/WRITE operations?
> So it feels like a f/w or driver problem to me, at this
> point.
>
> --
> Chuck Lever
* Re: "memory management error" with NFS/RDMA on RoCE
[not found] ` <7ef3ca44-1253-7aae-1b46-f78cc15e627d-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
@ 2017-08-08 16:14 ` Chuck Lever
0 siblings, 0 replies; 12+ messages in thread
From: Chuck Lever @ 2017-08-08 16:14 UTC (permalink / raw)
To: Max Gurtovoy; +Cc: Sagi Grimberg, linux-rdma
> On Aug 8, 2017, at 11:45 AM, Max Gurtovoy <maxg-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org> wrote:
>
> Hi all,
> Sorry for the late response.
>
> On 6/27/2017 5:56 PM, Chuck Lever wrote:
>> Hi Sagi-
>>
>>> On Jun 27, 2017, at 5:28 AM, Sagi Grimberg <sagi-NQWnxTmZq1alnMjI0IkVqw@public.gmane.org> wrote:
>>>
>>>
>>>> While running xfstests on an NFS/RDMA mount, I see this in
>>>> the client's /var/log/messages multiple times:
>>>> Jun 22 14:13:45 manet kernel: mlx5_0:dump_cqe:275:(pid 0): dump error cqe
>>>> Jun 22 14:13:45 manet kernel: 00000000 00000000 00000000 00000000
>>>> Jun 22 14:13:45 manet kernel: 00000000 00000000 00000000 00000000
>>>> Jun 22 14:13:45 manet kernel: 00000000 00000000 00000000 00000000
>>>> Jun 22 14:13:45 manet kernel: 00000000 08007806 250000cd 024027d3
>>>> Jun 22 14:13:45 manet kernel: rpcrdma: fastreg: memory management operation error (6/0x78)
>>>> As far as I can tell the client is able to recover and continue
>>>> the test. However, this error is not supposed to happen in normal
>>>> operation.
>>>> This is with a Mellanox CX4 in RoCEv1 mode, v4.12-rc2.
>>>
>>> Is this a regression?
>>
>> I can't answer that question with authority, because I just
>> started trying out NFS/RDMA on RoCE with mlx5. But Robert has
>> reported very similar symptoms with iSER on v4.9. It appears
>> to have been around for a while, if these are the same.
>>
>>
>>> What kernel version are you running?
>>
>> v4.12-rc2.
>>
>>
>>> FW revision?
>>
>> 12.18.2000
>>
>>
>>> Is the below commit applied?
>>
>> This commit does not appear to be applied to my kernel.
>>
>>
>>> commit 6e8484c5cf07c7ee632587e98c1a12d319dacb7c
>>> Author: Max Gurtovoy <maxg-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
>>> Date: Sun May 28 10:53:11 2017 +0300
>>>
>>> RDMA/mlx5: set UMR wqe fence according to HCA cap
>>>
>>> Cache the needed umr_fence and set the wqe ctrl segment
>>> accordingly.
>>>
>>> Signed-off-by: Max Gurtovoy <maxg-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
>>> Acked-by: Leon Romanovsky <leon-DgEjT+Ai2ygdnm+yROfE0A@public.gmane.org>
>>> Reviewed-by: Sagi Grimberg <sagi-NQWnxTmZq1alnMjI0IkVqw@public.gmane.org>
>>> Signed-off-by: Doug Ledford <dledford-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>
>>>
>>> This is the only thing that changed in that area
>>> lately...
>>>
>>> Can you try without it?
>>
>> I haven't tried with it. I can pull it and see if it helps.
>
> Chuck,
> any updates using my patch above? (You actually need 1410a90ae449061b7e1ae19d275148f36948801b as a precondition.)
My client is at v4.13-rc3 now, and I haven't seen this issue recur
recently.
>> I have tried:
>>
>> - with and without IOMMU enabled
>> - with RoCE v1 and v2
>> - with instrumentation:
>>
>> This can happen to any MR at any time after any number of
>> uses. It does not appear to be "sticky" (ie, xprtrdma
>> recovery from a memory management error clears the problem
>> successfully by releasing the MR and allocating a new one).
>>
>
> I'm not yet familiar with the NFS/RDMA I/O path, but are you using remote invalidation from the server side, or do you run local invalidation?
> Which side initiates the RDMA_READ/WRITE operations?
Remote Invalidation should be in use, but I haven't confirmed that.
The storage target (the NFS server) issues the RDMA Read and
RDMA Write operations.
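[Editorial note: the distinction above can be sketched as a toy model. This is illustrative pseudologic only, not actual xprtrdma or ib_verbs code; the names `Completion` and `needs_local_invalidate` are hypothetical. The idea: when the server's final Send-with-Invalidate carries the MR's rkey, the HCA has already invalidated the registration, so the client can skip posting its own IB_WR_LOCAL_INV; otherwise the client must invalidate locally before reusing the MR.]

```python
# Toy model of the requester-side invalidation decision. Hypothetical
# names; cf. struct ib_wc, IB_WC_WITH_INVALIDATE, and IB_WR_LOCAL_INV
# in the kernel verbs API.
from dataclasses import dataclass

@dataclass
class Completion:
    """Minimal stand-in for a receive completion (cf. struct ib_wc)."""
    with_invalidate: bool      # cf. the IB_WC_WITH_INVALIDATE flag
    invalidated_rkey: int = 0  # cf. wc->ex.invalidate_rkey

def needs_local_invalidate(wc: Completion, mr_rkey: int) -> bool:
    """True if the client must post its own local-invalidate WR.

    If the server used Send-with-Invalidate on this MR's rkey, the HCA
    already invalidated it and a local invalidate would be redundant.
    """
    return not (wc.with_invalidate and wc.invalidated_rkey == mr_rkey)

# Remotely invalidated with the matching rkey: no local WR needed.
assert needs_local_invalidate(Completion(True, 0x1234), 0x1234) is False
# Plain Send, or the wrong rkey: the client must invalidate locally.
assert needs_local_invalidate(Completion(False), 0x1234) is True
assert needs_local_invalidate(Completion(True, 0x9999), 0x1234) is True
```

Note the rkey comparison: a completion that invalidates some *other* MR's rkey must not be treated as having invalidated this one.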
>> So it feels like a f/w or driver problem to me, at this
>> point.
--
Chuck Lever
end of thread, other threads:[~2017-08-08 16:14 UTC | newest]
Thread overview: 12+ messages
2017-06-22 18:28 "memory management error" with NFS/RDMA on RoCE Chuck Lever
2017-06-22 20:57 ` Robert LeBlanc
2017-06-27 9:28 ` Sagi Grimberg
2017-06-27 14:56 ` Chuck Lever
2017-06-27 16:08 ` Sagi Grimberg
2017-06-27 17:03 ` Robert LeBlanc
2017-06-27 17:36 ` Leon Romanovsky
2017-06-27 19:30 ` Chuck Lever
2017-07-05 14:40 ` Chuck Lever
2017-07-05 15:29 ` Leon Romanovsky
2017-08-08 15:45 ` Max Gurtovoy
2017-08-08 16:14 ` Chuck Lever