From mboxrd@z Thu Jan  1 00:00:00 1970
From: hch@lst.de ('Christoph Hellwig')
Date: Mon, 26 Sep 2016 17:12:42 +0200
Subject: nvmf/rdma host crash during heavy load and keep alive recovery
In-Reply-To: <cea94562-b0d6-fbc0-998d-9597b8969998@grimberg.me>
References: <011501d20f5f$b94e6c80$2beb4580$@opengridcomputing.com>
 <012001d20f63$5c8f7490$15ae5db0$@opengridcomputing.com>
 <01d201d20f69$449abce0$cdd036a0$@opengridcomputing.com>
 <020001d20f70$9998fde0$cccaf9a0$@opengridcomputing.com>
 <02c001d20f93$e6a88a60$b3f99f20$@opengridcomputing.com>
 <20160916110412.GC5476@lst.de>
 <8fc2cefe-76b6-b0a3-12af-701833c286f7@grimberg.me>
 <02db01d2128b$e9244c70$bb6ce550$@opengridcomputing.com>
 <02c601d2144d$ff453a50$fdcfaef0$@opengridcomputing.com>
 <cea94562-b0d6-fbc0-998d-9597b8969998@grimberg.me>
Message-ID: <20160926151242.GA16873@lst.de>

On Fri, Sep 23, 2016@04:57:21PM -0700, Sagi Grimberg wrote:
> I'm still trying to understand how it is possible to
> get to a point where the request queue is stopped while
> the hardware context is not...

Not sure either, I need to deep dive into the code.  But just a week
or two ago Bart posted some fixes in this area for SRP, so that's where
I will start my deep dive.