From mboxrd@z Thu Jan 1 00:00:00 1970 From: mlin@kernel.org (Ming Lin) Date: Tue, 1 Nov 2016 16:27:51 -0700 Subject: Failure with 8K Write operations In-Reply-To: References: <1473810683.2781.72.camel@linux.intel.com> <1473871944.2781.90.camel@linux.intel.com> Message-ID: On Tue, Nov 1, 2016@4:07 PM, Ming Lin wrote: > On Thu, Sep 15, 2016 at 6:36 AM, Narayan Ayalasomayajula > wrote: >> Hi Jay, >> >> Thanks for pointing out that I was not running the latest version of the kernel. I updated to 4.8rc6 and my FIO test that had previously failed with the Linux NVMeF target (using null_blk device as the target) is now completing successfully. I am still seeing the same NAK (Remote Access Error) failure when I use our target instead. I will debug this further but updating to 4.8rc6 did improve things. > > Hi Narayan, > > I also saw similar error with 8k write when I use my own target implementation. > Did you fix it already? Hi Narayan, With Sagi's great help off-line, I just fixed it. In my code, when I post RDMA_READ, I didn't set rdma_wr.next to NULL. Shame on myself ... example code as below int rw_ctx_post(NvmetRdmaRsp *rsp) { rsp->rdma_wr.next = NULL; return ibv_post_send(cm_id->qp, &rsp->rdma_wr, &bad_wr); } Possibly you may don't have this kind of stupid bug ...