All of lore.kernel.org
 help / color / mirror / Atom feed
* Sunrpc transport reconnection...
@ 2010-03-29 21:43 Trond Myklebust
       [not found] ` <1269898987.15895.95.camel-bi+AKbBUZKY6gyzm1THtWbp2dZbC/Bob@public.gmane.org>
  0 siblings, 1 reply; 2+ messages in thread
From: Trond Myklebust @ 2010-03-29 21:43 UTC (permalink / raw)
  To: Mr. Charles Edward Lever; +Cc: linux-nfs

Having looked more carefully at the code...

Is there any reason to keep the xprt->connect_timeout? As far as I can
see, it would appear to be completely redundant.

For UDP there is no disconnect/reconnect. The socket is set up once and
for all, so no need for connect_timeout.

In the case of a TCP reconnection, the xprt->reestablish_timeout will do
exponential back-off to prevent reconnecting unnecessarily. The
XPRT_CONNECTING lock then prevents anyone from trying to interfere with
that connect request until it gets a reply, or the TCP layer decides
that the socket has timed out. Again, it appears that
xprt->connect_timeout is redundant.

RDMA reconnection appears to follow the TCP model. Once again, there is
exponential back-off, enforced by XPRT_CONNECTING.

So why do we have xprt->connect_timeout? What is it enforcing?

Trond

^ permalink raw reply	[flat|nested] 2+ messages in thread

* Re: Sunrpc transport reconnection...
       [not found] ` <1269898987.15895.95.camel-bi+AKbBUZKY6gyzm1THtWbp2dZbC/Bob@public.gmane.org>
@ 2010-03-29 22:25   ` Chuck Lever
  0 siblings, 0 replies; 2+ messages in thread
From: Chuck Lever @ 2010-03-29 22:25 UTC (permalink / raw)
  To: Trond Myklebust; +Cc: linux-nfs

On 03/29/2010 05:43 PM, Trond Myklebust wrote:
> Having looked more carefully at the code...
>
> Is there any reason to keep the xprt->connect_timeout? As far as I can
> see, it would appear to be completely redundant.
>
> For UDP there is no disconnect/reconnect. The socket is set up once and
> for all, so no need for connect_timeout.

I vaguely recall at least one case where UDP connect can hang.  The 
connect_timeout in that case is short, on the order of 5 seconds.

> In the case of a TCP reconnection, the xprt->reestablish_timeout will do
> exponential back-off to prevent reconnecting unnecessarily. The
> XPRT_CONNECTING lock then prevents anyone from trying to interfere with
> that connect request until it gets a reply, or the TCP layer decides
> that the socket has timed out. Again, it appears that
> xprt->connect_timeout is redundant.

It probably is redundant when the host is unreachable.  We already have 
the retransmit timeout, and the socket connect call is always 
nonblocking because we aren't supposed to sleep.

If the server refuses our connection, the connect_timeout timer appears 
to be discarded, which is probably the wrong thing to do.

> RDMA reconnection appears to follow the TCP model. Once again, there is
> exponential back-off, enforced by XPRT_CONNECTING.
>
> So why do we have xprt->connect_timeout? What is it enforcing?

It may be a vestige of the previous connect implementation, where 
call_connect actually could have slept in the RPC scheduler.

We eventually want to invoke call_timeout if the connection never 
completes.  It could be because the server keeps refusing or resetting 
our connection, or because there is no response from the server.  The 
retransmit timeout is probably appropriate for both of those cases.

-- 
chuck[dot]lever[at]oracle[dot]com

^ permalink raw reply	[flat|nested] 2+ messages in thread

end of thread, other threads:[~2010-03-29 22:27 UTC | newest]

Thread overview: 2+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2010-03-29 21:43 Sunrpc transport reconnection Trond Myklebust
     [not found] ` <1269898987.15895.95.camel-bi+AKbBUZKY6gyzm1THtWbp2dZbC/Bob@public.gmane.org>
2010-03-29 22:25   ` Chuck Lever

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.