* NFS/UDP slow read, lost fragments
@ 2003-09-25 17:59 Robert L. Millner
2003-09-25 20:22 ` Brian Mancuso
` (2 more replies)
0 siblings, 3 replies; 12+ messages in thread
From: Robert L. Millner @ 2003-09-25 17:59 UTC (permalink / raw)
To: nfs
Hello,
The problem I am seeing is similar to what was posted by Larry Sendlosky
on Jun 27, "2.4.20-pre3 -> 2.4.21 : nfs client read performance broken"
though I have not done as thorough a drill-down into the nature of the
problem.
Somewhere between 2.4.19 and 2.4.20, NFS/UDP read performance began to
suck because of a large number of request retransmits. From tcpdump, the
retransmits are for read transactions which return data in a reasonable
time frame but are missing one or more fragments of the return packet.
The client is a PIII 550, dual CPU, 1GB RAM, eepro100 (tested with both
the e100 and eepro drivers) running a variety of kernels from the stock
Red Hat 7.3 (2.4.20-20.7 which exhibits the problem), and from stock
2.4.18 through 2.4.22. 2.4.20 and above exhibit this problem.
The server is a Network Appliance F820 running ONTAP 6.2.2. Tests are
conducted when the F820 is not under a noticeable load.
From the difference in behavior between kernel revisions and by tcpdump,
it is believed that the fragments are transmitted by the server.
The timings for different IO sizes for NFSv3 reads of a 70MB file:
Configuration Seconds Real Time
-------------------------------------------
UDP, 32k 1080
UDP, 8k 420
UDP, 4k 210
UDP, 1k 40
TCP, 32k 6.4
The NFSv2/UDP timings for 8k, 4k and 1k are almost identical to the
NFSv3/UDP timings.
The same test with 2.4.18 yields read times of around 6.3 seconds for 32k
and 8k, both NFSv2 and NFSv3 over UDP.
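A back-of-the-envelope sketch (my illustration, not from the thread) of why larger UDP read sizes are hit so much harder: each NFS/UDP read reply is a single IP datagram fragmented at the link MTU, and losing any one fragment costs the whole reply. The MTU value and the 1% per-fragment loss rate are assumptions for illustration only:

```python
MTU = 1500  # assumed Ethernet MTU; each fragment carries a 20-byte IP header

def fragments(read_size):
    # number of IP fragments needed for one read reply of read_size bytes
    return -(-read_size // (MTU - 20))   # ceiling division

def p_reply_lost(read_size, p_frag_loss):
    # probability that at least one fragment is lost, forcing a full
    # retransmit of the RPC request (IP cannot retransmit single fragments)
    return 1 - (1 - p_frag_loss) ** fragments(read_size)

for size in (1024, 4096, 8192, 32768):
    print(size, fragments(size), round(p_reply_lost(size, 0.01), 3))
```

With an assumed 1% fragment loss, a 32k reply (23 fragments) is lost over 20% of the time, while a 1k reply almost always survives, which matches the shape of the timing table above.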
Setting rmem_max, rmem_default, wmem_max and wmem_default to either 524284
or 262142 makes no difference.
Setting netdev_max_backlog to 1200 (from 300) makes no difference.
Setting ipfrag_high_thresh to up to 4194304 makes no difference.
We have a mix of clients and servers, not all of which support NFS/TCP yet,
so we can't globally set tcp in the automounter maps for our netapp mounts
or as "localoptions" in the autofs init script. The mount patch submitted
by Steve Dickson on Aug 6, "NFS Mount Patch: Making NFS over TCP the
default" is probably the immediate workaround that will at least make the
mounts that really matter to us work well again. I'll test that next.
Is this a known problem? Is there a patch already out there or in the
works that fixes this? What other data would help drill into this
problem?
Rob
-------------------------------------------------------
This sf.net email is sponsored by:ThinkGeek
Welcome to geek heaven.
http://thinkgeek.com/sf
_______________________________________________
NFS maillist - NFS@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nfs
^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: NFS/UDP slow read, lost fragments
  2003-09-25 17:59 NFS/UDP slow read, lost fragments Robert L. Millner
@ 2003-09-25 20:22 ` Brian Mancuso
  2003-09-25 20:33 ` brianm
  2003-09-25 21:44 ` Brian Mancuso
  2 siblings, 0 replies; 12+ messages in thread
From: Brian Mancuso @ 2003-09-25 20:22 UTC (permalink / raw)
To: Robert L. Millner; +Cc: nfs

On Thu, Sep 25, 2003 at 10:59:43AM -0700, Robert L. Millner wrote:
: Somewhere between 2.4.19 and 2.4.20, NFS/UDP read performance began to
: suck because of a large number of request retransmits. From tcpdump, the
: retransmits are for read transactions which return data in a reasonable
: time frame but are missing one or more fragments of the return packet.

This is because code that exponentially backs off RTO for UDP RPC was
backported from the 2.[56] series in 2.4.20, and this code is completely
broken. Trond has a patch in his patchset for 2.6 that significantly
fixes these problems; however, this patch still has one problem that can
result in a large number of unnecessary retransmits for RPC sessions
that have low variance in RTT.

RTO is calculated to be the filtered round trip time plus a small
constant times the mean deviation of round trip times. However, because
the RTT calculation code implements Karn's algorithm (from TCP: RTT
calculation isn't done for responses to RPC requests that have been
retransmitted), RTT is never allowed to increase: were a response to
take longer than measured RTT plus the (assumed small) deviation, the
packet would be retransmitted, and the calculation that would increase
measured RTT would never be done.

Thus if a server's real RTT were to increase over time, initial RTO
values would never grow (measured RTT would never grow beyond the
minimum ever measured), and RPC requests would frequently be
retransmitted at least once.

This can easily be remedied by TCP's technique of inheriting RTO backoff
from previous transactions: create a new variable somewhere in the clnt
structure called, say, cl_backoff; each time an RPC transaction
completes, store the number of retransmits for that transaction
(req->rq_nresend) in cl_backoff; calculate RTO to be rpc_calc_rto()
left-shifted by the number of retransmits for this transaction
(initially 0) plus clnt->cl_backoff (the number of retransmits for the
last completed transaction).

The backported code mentioned above will also result in significantly
more EIO events for users with soft UDP mounts. Users seeing lots of
EIOs should see them diminish after these problems are fixed.

: Is this a known problem? Is there a patch already out there or in the
: works that fixes this? What other data would help drill into this
: problem?

I will post a patch tomorrow for 2.4.20.

Brian Mancuso
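The cl_backoff inheritance scheme described above can be sketched in a few lines of userspace arithmetic. This is a hedged illustration, not kernel code: the RpcClient class and its methods are hypothetical; only the names cl_backoff, rq_nresend and rpc_calc_rto come from the mail, and base_rto stands in for whatever rpc_calc_rto() would return.

```python
class RpcClient:
    def __init__(self, base_rto):
        self.base_rto = base_rto  # stand-in for rpc_calc_rto(): srtt + k * mdev
        self.cl_backoff = 0       # retransmit count of the last completed request

    def rto_for(self, rq_nresend):
        # RTO = rpc_calc_rto() << (retransmits of this request + inherited backoff)
        return self.base_rto << (rq_nresend + self.cl_backoff)

    def complete(self, rq_nresend):
        # on completion, the next request inherits this request's backoff
        self.cl_backoff = rq_nresend

clnt = RpcClient(base_rto=4)
print(clnt.rto_for(0))       # fresh request, no inherited backoff: 4
clnt.complete(rq_nresend=3)  # last request needed 3 retransmits
print(clnt.rto_for(0))       # new request starts already backed off: 4 << 3 = 32
print(clnt.rto_for(1))       # its first retransmit: 4 << 4 = 64
```

The point of the design is that a request issued right after a lossy transaction starts with a long timeout instead of re-learning the backoff from scratch on every request.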
* Re: NFS/UDP slow read, lost fragments
  2003-09-25 21:44 ` Brian Mancuso
@ 2003-09-25 23:31 ` Trond Myklebust
  2003-09-26 21:07 ` Brian Mancuso
  0 siblings, 1 reply; 12+ messages in thread
From: Trond Myklebust @ 2003-09-25 23:31 UTC (permalink / raw)
To: Brian Mancuso; +Cc: nfs

>>>>> " " == Brian Mancuso <bmancuso@akamai.com> writes:

> This can be easily remedied however by TCP's technique of
> inheriting backoff of RTO from previous transactions: create a
> new variable somewhere in the clnt structure called, say,
> cl_backoff; Each time an RPC transaction completes, store the
> number of retransmits for that transaction (req->rq_nresend) in
> cl_backoff; calculate RTO to be rpc_calc_rto() left shifted by
> the number of retransmits for this transaction (initially 0)
> plus clnt->cl_backoff (the number of retransmits for the last
> completed transaction).

Right... There is already a variable set aside for this task in the
RTT code in the form of "ntimeouts".

The following patch sets up a scheme of the form described by Brian,
in combination with another fix to lengthen the window of time during
which we accept updates to the RTO estimate (Karn's algorithm states
that the window closes once you retransmit, whereas our current
algorithm closes the window once a request times out).

Could people give it a try?

Cheers,
  Trond

diff -u --recursive --new-file linux-2.4.23-pre5/include/linux/sunrpc/timer.h linux-2.4.23-01-fix_retrans/include/linux/sunrpc/timer.h
--- linux-2.4.23-pre5/include/linux/sunrpc/timer.h	2003-09-19 13:15:31.000000000 -0700
+++ linux-2.4.23-01-fix_retrans/include/linux/sunrpc/timer.h	2003-09-25 16:25:02.000000000 -0700
@@ -23,14 +23,9 @@
 extern void rpc_update_rtt(struct rpc_rtt *rt, int timer, long m);
 extern long rpc_calc_rto(struct rpc_rtt *rt, int timer);
 
-static inline void rpc_inc_timeo(struct rpc_rtt *rt)
+static inline void rpc_set_timeo(struct rpc_rtt *rt, int ntimeo)
 {
-	atomic_inc(&rt->ntimeouts);
-}
-
-static inline void rpc_clear_timeo(struct rpc_rtt *rt)
-{
-	atomic_set(&rt->ntimeouts, 0);
+	atomic_set(&rt->ntimeouts, ntimeo);
 }
 
 static inline int rpc_ntimeo(struct rpc_rtt *rt)
diff -u --recursive --new-file linux-2.4.23-pre5/include/linux/sunrpc/xprt.h linux-2.4.23-01-fix_retrans/include/linux/sunrpc/xprt.h
--- linux-2.4.23-pre5/include/linux/sunrpc/xprt.h	2003-07-29 16:52:26.000000000 -0700
+++ linux-2.4.23-01-fix_retrans/include/linux/sunrpc/xprt.h	2003-09-19 13:15:31.000000000 -0700
@@ -115,7 +115,7 @@
 
 	long			rq_xtime;	/* when transmitted */
 	int			rq_ntimeo;
-	int			rq_nresend;
+	int			rq_ntrans;
 };
 #define rq_svec			rq_snd_buf.head
 #define rq_slen			rq_snd_buf.len
diff -u --recursive --new-file linux-2.4.23-pre5/net/sunrpc/xprt.c linux-2.4.23-01-fix_retrans/net/sunrpc/xprt.c
--- linux-2.4.23-pre5/net/sunrpc/xprt.c	2003-07-29 16:54:19.000000000 -0700
+++ linux-2.4.23-01-fix_retrans/net/sunrpc/xprt.c	2003-09-25 16:25:02.000000000 -0700
@@ -138,18 +138,21 @@
 static int
 __xprt_lock_write(struct rpc_xprt *xprt, struct rpc_task *task)
 {
+	struct rpc_rqst *req = task->tk_rqstp;
 	if (!xprt->snd_task) {
 		if (xprt->nocong || __xprt_get_cong(xprt, task)) {
 			xprt->snd_task = task;
-			if (task->tk_rqstp)
-				task->tk_rqstp->rq_bytes_sent = 0;
+			if (req) {
+				req->rq_bytes_sent = 0;
+				req->rq_ntrans++;
+			}
 		}
 	}
 	if (xprt->snd_task != task) {
 		dprintk("RPC: %4d TCP write queue full\n", task->tk_pid);
 		task->tk_timeout = 0;
 		task->tk_status = -EAGAIN;
-		if (task->tk_rqstp && task->tk_rqstp->rq_nresend)
+		if (req && req->rq_ntrans)
 			rpc_sleep_on(&xprt->resend, task, NULL, NULL);
 		else
 			rpc_sleep_on(&xprt->sending, task, NULL, NULL);
@@ -183,9 +186,12 @@
 		return;
 	}
 	if (xprt->nocong || __xprt_get_cong(xprt, task)) {
+		struct rpc_rqst *req = task->tk_rqstp;
 		xprt->snd_task = task;
-		if (task->tk_rqstp)
-			task->tk_rqstp->rq_bytes_sent = 0;
+		if (req) {
+			req->rq_bytes_sent = 0;
+			req->rq_ntrans++;
+		}
 	}
 }
 
@@ -592,12 +598,12 @@
 	if (!xprt->nocong) {
 		xprt_adjust_cwnd(xprt, copied);
 		__xprt_put_cong(xprt, req);
-		if (!req->rq_nresend) {
+		if (req->rq_ntrans == 1) {
 			int timer = rpcproc_timer(clnt, task->tk_msg.rpc_proc);
 			if (timer)
 				rpc_update_rtt(&clnt->cl_rtt, timer, (long)jiffies - req->rq_xtime);
 		}
-		rpc_clear_timeo(&clnt->cl_rtt);
+		rpc_set_timeo(&clnt->cl_rtt, req->rq_ntrans - 1);
 	}
 
 #ifdef RPC_PROFILE
@@ -1063,7 +1069,7 @@
 		goto out;
 
 	xprt_adjust_cwnd(req->rq_xprt, -ETIMEDOUT);
-	req->rq_nresend++;
+	__xprt_put_cong(xprt, req);
 
 	dprintk("RPC: %4d xprt_timer (%s request)\n",
 		task->tk_pid, req ? "pending" : "backlogged");
@@ -1219,6 +1225,7 @@
 	if (!xprt->nocong) {
 		task->tk_timeout = rpc_calc_rto(&clnt->cl_rtt,
 				rpcproc_timer(clnt, task->tk_msg.rpc_proc));
+		task->tk_timeout <<= rpc_ntimeo(&clnt->cl_rtt);
 		task->tk_timeout <<= clnt->cl_timeout.to_retries
 			- req->rq_timeout.to_retries;
 		if (task->tk_timeout > req->rq_timeout.to_maxval)
* Re: NFS/UDP slow read, lost fragments
  2003-09-25 23:31 ` Trond Myklebust
@ 2003-09-26 21:07 ` Brian Mancuso
  2003-09-27  5:02 ` Robert L. Millner
  2003-10-14  1:20 ` Steve Dickson
  0 siblings, 2 replies; 12+ messages in thread
From: Brian Mancuso @ 2003-09-26 21:07 UTC (permalink / raw)
To: Trond Myklebust; +Cc: nfs

On Thu, Sep 25, 2003 at 04:31:05PM -0700, Trond Myklebust wrote:
: The following patch sets up a scheme of the form described by Brian,
: in combination with another fix to lengthen the window of time during
: which we accept updates to the RTO estimate (Karn's algorithm states
: that the window closes once you retransmit, whereas our current
: algorithm closes the window once a request times out).
:
: Could people give it a try?

Hi Trond,

This patch is great! One thing though: there is one case with this
patch in which a request terminates but the client's ntimeouts value is
not updated: when a request exhausts its retries. I think future
requests should inherit the ntimeouts factor provided by timed-out
requests for their RTO calculations. Here is a patch (against
linux-2.4.23-pre5 with your patch) that implements this (you might know
of a cleaner way of doing this..):

--- clnt.c.1	Fri Sep 26 19:58:30 2003
+++ clnt.c	Fri Sep 26 20:00:38 2003
@@ -699,6 +699,7 @@ call_status(struct rpc_task *task)
 static void
 call_timeout(struct rpc_task *task)
 {
+	struct rpc_rqst *req = task->tk_rqstp;
 	struct rpc_clnt *clnt = task->tk_client;
 	struct rpc_timeout *to = &task->tk_rqstp->rq_timeout;
 
@@ -707,6 +708,7 @@ call_timeout(struct rpc_task *task)
 		goto retry;
 	}
 	to->to_retries = clnt->cl_timeout.to_retries;
+	rpc_set_timeo(&clnt->cl_rtt, req->rq_ntrans - 1);
 
 	dprintk("RPC: %4d call_timeout (major)\n", task->tk_pid);
 	if (clnt->cl_softrtry) {

I tested your patch. Here is some documentation of my testing: I added
a printk to the code, mounted an NFS volume with a retrans=8 option,
and did an ls on a directory with 100,000 entries in it. This takes a
while, and leaves ample time for the system to develop a good RTT
estimate and for one to interfere with client/server connectivity so as
to induce retransmits with exponential backoff:

01 DO_XPRT_TRANSMIT: rtt = 4, ntimeo = 0, ntrans = 0, rto = 4
02 DO_XPRT_TRANSMIT: rtt = 4, ntimeo = 0, ntrans = 0, rto = 4
...
03 DO_XPRT_TRANSMIT: rtt = 4, ntimeo = 0, ntrans = 0, rto = 4
04 DO_XPRT_TRANSMIT: rtt = 4, ntimeo = 0, ntrans = 2, rto = 16
05 DO_XPRT_TRANSMIT: rtt = 4, ntimeo = 0, ntrans = 3, rto = 32
06 DO_XPRT_TRANSMIT: rtt = 4, ntimeo = 0, ntrans = 4, rto = 64
07 DO_XPRT_TRANSMIT: rtt = 4, ntimeo = 0, ntrans = 5, rto = 128
08 DO_XPRT_TRANSMIT: rtt = 4, ntimeo = 0, ntrans = 6, rto = 256
09 DO_XPRT_TRANSMIT: rtt = 4, ntimeo = 0, ntrans = 7, rto = 512
10 DO_XPRT_TRANSMIT: rtt = 4, ntimeo = 0, ntrans = 8, rto = 1024
11 nfs: server 172.18.192.11 not responding, timed out
12 DO_XPRT_TRANSMIT: rtt = 4, ntimeo = 8, ntrans = 0, rto = 1024
13 DO_XPRT_TRANSMIT: rtt = 4, ntimeo = 8, ntrans = 1, rto = 2048
14 DO_XPRT_TRANSMIT: rtt = 4, ntimeo = 8, ntrans = 2, rto = 4096

The numbers on the left are line numbers I manually inserted for
illustration; they were not produced by the printk. The '...' indicates
that lots of lines, identical to the preceding, were removed. There is
one line per RTO calculation (i.e. per transmit).

Lines 1-2 show RTO calculations for RPC transactions that saw no loss.
'rtt' is the value returned by rpc_calc_rto(), and represents the
estimated round trip time plus a constant factor times its mean
deviation. 'ntimeo' is the ntimeouts value in the client structure, set
to the number of transmits minus one for the last terminated
(successful or otherwise) request. 'ntrans' is effectively the number
of retransmits of the request in question. 'rto' is the calculated RTO:
rtt << (ntimeo + ntrans).

The server was disconnected at (before) line 3. Lines 4-10 show the
client exponentially backing off RTO. Line 11 represents an exhaustion
of retransmit attempts and an EIO returned to the application, ls (this
is a soft mount). Lines 12-14 show RTO calculations for a new request.
Note that the ntimeo on line 12 inherited its value from the ntrans
value of the request that terminated on line 10 (this kernel had my
above modification).

Brian
* Re: NFS/UDP slow read, lost fragments
  2003-09-26 21:07 ` Brian Mancuso
@ 2003-09-27  5:02 ` Robert L. Millner
  2003-10-14  1:20 ` Steve Dickson
  1 sibling, 0 replies; 12+ messages in thread
From: Robert L. Millner @ 2003-09-27 5:02 UTC (permalink / raw)
To: nfs

> The following patch sets up a scheme of the form described by Brian,
> in combination with another fix to lengthen the window of time during
> which we accept updates to the RTO estimate (Karn's algorithm states
> that the window closes once you retransmit, whereas our current
> algorithm closes the window once a request times out).

Still seeing an inordinate number of retransmits (25% of the reads are
duplicates) with 2.4.23-pre5 and the patch on my test case (copy a 70MB
file from the server). It's interesting to note that a dual P4 Xeon
with an e1000 attached to the same Foundry switch as the Netapp also
has a similar retransmit rate (20%). If the same test is run against a
different Netapp (running 6.3.1R1), there is still a large number of
retransmits, but down to 17%.

Examining this problem further: if the server is Linux (stock Red Hat
7.3, 2.4.20-18.7smp, dual Xeon 2.4GHz), then there are no retransmits
and the 70MB copy takes its appropriate 6.4 sec with the client running
either the stock Red Hat kernel or a patched 2.4.23-pre5. If the server
is a Sun 880 running Solaris 8, then there are no retransmits and the
70MB copy takes around 6.5 sec.

If the Netapp is placed under a heavy load (CPU at between 65% and
100%, which traditionally kills Netapp performance), the
2.4.23-pre5+patch client has roughly the same percentage of retransmits
as it does against an otherwise quiescent Netapp; however, the test
takes 470 seconds instead of 550 seconds.

As a sanity check against transient network problems, the problem still
consistently goes away if the client boots 2.2.19 and consistently
repeats if the client boots 2.4.23-pre5+patch.

I guess the next step is to go over packet dumps from both the client
and another host on a mirrored network port to see what's interesting
about the fragments being dropped. That may point to why a heavily
loaded Netapp performs this test faster than a quiescent one.

Rob
* Re: NFS/UDP slow read, lost fragments
  2003-09-26 21:07 ` Brian Mancuso
  2003-09-27  5:02 ` Robert L. Millner
@ 2003-10-14  1:20 ` Steve Dickson
  2003-10-14 14:52 ` Trond Myklebust
  1 sibling, 1 reply; 12+ messages in thread
From: Steve Dickson @ 2003-10-14 1:20 UTC (permalink / raw)
To: Brian Mancuso; +Cc: Trond Myklebust, nfs

Brian Mancuso wrote:
> Here is a patch (against linux-2.4.23-pre5 with your patch) that
> implements this (you might know of a cleaner way of doing this..):

I've done some testing with both of these patches and here is what I've
found. The test consisted of 10 client threads simultaneously reading
10 different 100MB files on the same filesystem. The machines were dual
x86 boxes on a private 100bt network. I used the linux-2.4.23-pre7
kernel with both Trond's and Brian's patches. The test was done with
hard and soft mounts using v3 over UDP. Here are the results:

linux-2.4.23-pre7
Soft mount:
  EIO  calls   retrans  rate
  0    128042  2105     6.20Mbps to 6.08Mbps
  0    128040  1916     6.28Mbps to 6.09Mbps
  0    128044  1936     6.37Mbps to 6.12Mbps
Hard mount:
       128048  1995     6.29Mbps to 6.09Mbps
       128033  2075     6.21Mbps to 6.07Mbps
       128037  2015     6.23Mbps to 6.10Mbps

with Trond's patch
Soft mount:
  EIO  calls   retrans  rate
  0    128038  1954     6.25Mbps to 6.12Mbps
  0    128039  1789     6.32Mbps to 6.14Mbps
  0    128042  1853     6.25Mbps to 6.13Mbps
Hard mount:
       128039  1880     6.28Mbps to 6.11Mbps
       128042  2019     6.29Mbps to 6.12Mbps
       128042  1883     6.32Mbps to 6.14Mbps

with Trond's and Brian's patches
Soft mount:
  EIO  calls   retrans  rate
  0    128036  1943     6.31Mbps to 6.14Mbps
  0    128042  1802     6.35Mbps to 6.19Mbps
  0    128047  1782     6.35Mbps to 6.14Mbps
Hard mount:
       128042  1953     6.28Mbps to 6.10Mbps
       128038  1752     6.48Mbps to 6.17Mbps
       128034  1978     6.26Mbps to 6.11Mbps

As you can see, the patches do help a little bit, but not as much as I
would expect. Maybe it was the type of tests run, or maybe I didn't run
the tests long enough (each test ran for about ~2 mins), or maybe my
expectations are too high... but it just doesn't seem that these
patches really help that much in bringing down the retransmissions.

SteveD.
* Re: NFS/UDP slow read, lost fragments
  2003-10-14  1:20 ` Steve Dickson
@ 2003-10-14 14:52 ` Trond Myklebust
  2003-10-15  3:17 ` Steve Dickson
  0 siblings, 1 reply; 12+ messages in thread
From: Trond Myklebust @ 2003-10-14 14:52 UTC (permalink / raw)
To: Steve Dickson; +Cc: Brian Mancuso, nfs

>>>>> " " == Steve Dickson <SteveD@RedHat.com> writes:

> As you can see the patches does help a little bit but but not
> as much as I would expect.... Maybe it was the type of testes
> ran or maybe I didn't run the tests long enough (each test ran
> for about ~2mins) or maybe my expectations are too high... But
> it just doesn't seem that these patches really help that much
> in bringing down the retransmissions...

Thanks for putting numbers to this Steve! Could you try using the
slight variation in

http://www.fys.uio.no/~trondmy/src/2.4.23-pre7/linux-2.4.23-03-fix_retrans.dif

As you can see, that has a slight change to rpc_set_timeo(): if it sees
that several retransmissions have occurred for a given request, then it
keeps the value of rtt->ntimeouts high for a couple of extra iterations
in order to allow the new value of the timeout to converge.

The reason for doing this is that if the round trip time changes
suddenly, then it will take ~8 measurements before you converge on the
new value (due to the weighting done in the algorithm).

Cheers,
  Trond
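A rough illustration of the ~8-measurement convergence claim, assuming the estimator uses a Jacobson-style 1/8 gain (that gain is an assumption for illustration; the actual weighting in rpc_update_rtt() may differ):

```python
def update(srtt, measurement, gain=0.125):
    # exponentially weighted moving average: each new sample moves the
    # smoothed RTT only 1/8 of the way toward the measured value
    return srtt + gain * (measurement - srtt)

srtt = 10.0                     # old steady-state RTT estimate
for i in range(1, 9):
    srtt = update(srtt, 80.0)   # real RTT suddenly jumps to 80
    print(f"after sample {i}: srtt = {srtt:.1f}")
```

After 8 samples the estimate has covered only roughly two thirds of the jump, which is why the variant patch holds ntimeouts high for a few extra iterations rather than dropping it the moment a request completes.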
* Re: NFS/UDP slow read, lost fragments
  2003-10-14 14:52 ` Trond Myklebust
@ 2003-10-15  3:17   ` Steve Dickson
  2003-10-15  3:27     ` Trond Myklebust
  0 siblings, 1 reply; 12+ messages in thread
From: Steve Dickson @ 2003-10-15 3:17 UTC (permalink / raw)
To: trond.myklebust; +Cc: nfs

Trond Myklebust wrote:
> Could you try using the slight variation in

It didn't seem to help much... here are the numbers:

linux-2.4.23-pre7

  Soft mount:
    EIO  calls   retrans  rate
    0    128042  2105     6.20Mbps to 6.08Mbps
    0    128040  1916     6.28Mbps to 6.09Mbps
    0    128044  1936     6.37Mbps to 6.12Mbps
  Hard mount:
         128048  1995     6.29Mbps to 6.09Mbps
         128033  2075     6.21Mbps to 6.07Mbps
         128037  2015     6.23Mbps to 6.10Mbps

with Trond's original patch

  Soft mount:
    EIO  calls   retrans  rate
    0    128043  1817     6.36Mbps to 6.17Mbps
    0    128036  1630     6.38Mbps to 6.21Mbps
    0    128038  1848     6.35Mbps to 6.18Mbps
  Hard mount:
         128043  1706     6.30Mbps to 6.22Mbps
         128041  1778     6.33Mbps to 6.18Mbps
         128041  1891     6.31Mbps to 6.16Mbps

with Trond's original patch w/ Brian's patch

  Soft mount:
    EIO  calls   retrans  rate
    0    128039  1626     6.45Mbps to 6.22Mbps
    0    128043  1896     6.31Mbps to 6.20Mbps
    0    128036  1948     6.32Mbps to 6.15Mbps
  Hard mount:
         128034  1792     6.41Mbps to 6.17Mbps
         128039  1823     6.29Mbps to 6.17Mbps
         128043  1903     6.42Mbps to 6.18Mbps

with Trond's new patch

  Soft mount:
    EIO  calls   retrans  rate
         128041  1954     6.33Mbps to 6.11Mbps
         128040  1897     6.31Mbps to 6.16Mbps
         128039  1758     6.26Mbps to 6.17Mbps
  Hard mount:
         128043  1808     6.40Mbps to 6.19Mbps
         128033  1671     6.37Mbps to 6.20Mbps
         128042  1852     6.37Mbps to 6.17Mbps

Again, this is 10 threads reading ten 100MB files from the same
filesystem.

SteveD.
* Re: NFS/UDP slow read, lost fragments
  2003-10-15  3:17 ` Steve Dickson
@ 2003-10-15  3:27   ` Trond Myklebust
  0 siblings, 0 replies; 12+ messages in thread
From: Trond Myklebust @ 2003-10-15 3:27 UTC (permalink / raw)
To: Steve Dickson; +Cc: NFS maillist

>>>>> " " == Steve Dickson <SteveD@RedHat.com> writes:

    > Trond Myklebust wrote:
    >> Could you try using the slight variation in
    >
    > It didn't seem to help much... here are the numbers...

Oh well, a 1.5% retransmission rate is in any case not a major problem.
It is more than we should see given that we are setting a timeout of
> 4 standard deviations, but it is consistent with what I see in my own
tests...

Cheers,
  Trond
* RE: NFS/UDP slow read, lost fragments
@ 2003-09-25 20:11 Lever, Charles
0 siblings, 0 replies; 12+ messages in thread
From: Lever, Charles @ 2003-09-25 20:11 UTC (permalink / raw)
To: Robert L. Millner; +Cc: nfs
hi robert-
if packets are being lost in server replies, then i
don't think this is strictly a client problem.
the newer clients may be more likely to overload a server
or network by sending more requests than can be buffered.
look into network cleanliness, and contact your local
NetApp SE for help in getting diagnostic info from your
filer.
especially because this problem goes away with TCP, i
would suspect a networking problem first.
> -----Original Message-----
> From: Robert L. Millner [mailto:rmillner@transmeta.com]
> Sent: Thursday, September 25, 2003 2:00 PM
> To: nfs@lists.sourceforge.net
> Subject: [NFS] NFS/UDP slow read, lost fragments
>
> Hello,
>
> The problem I am seeing is similar to what was posted by Larry
> Sendlosky on Jun 27, "2.4.20-pre3 -> 2.4.21 : nfs client read
> performance broken", though I have not done as thorough a drill-down
> into the nature of the problem.
>
> Somewhere between 2.4.19 and 2.4.20, NFS/UDP read performance began
> to suck because of a large number of request retransmits. From
> tcpdump, the retransmits are for read transactions which return data
> in a reasonable time frame but are missing one or more fragments of
> the return packet.
>
> The client is a PIII 550, dual CPU, 1GB RAM, eepro100 (tested with
> both the e100 and eepro drivers) running a variety of kernels from
> the stock Red Hat 7.3 (2.4.20-20.7, which exhibits the problem), and
> from stock 2.4.18 through 2.4.22. 2.4.20 and above exhibit this
> problem.
>
> The server is a Network Appliance F820 running ONTAP 6.2.2. Tests
> are conducted when the F820 is not under a noticeable load.
>
> From the difference in behavior between kernel revisions and by
> tcpdump, it is believed that the fragments are transmitted by the
> server.
>
> The timings for different IO sizes for NFSv3 reads of a 70MB file:
>
>   Configuration    Seconds Real Time
>   -------------------------------------------
>   UDP, 32k         1080
>   UDP, 8k           420
>   UDP, 4k           210
>   UDP, 1k            40
>
>   TCP, 32k            6.4
>
> The NFSv2/UDP timings for 8k, 4k and 1k are almost identical to the
> NFSv3/UDP timings.
>
> The same test with 2.4.18 yields read times of around 6.3 seconds
> for 32k and 8k, NFSv2 and NFSv3 over UDP.
>
> Setting rmem_max, rmem_default, wmem_max and wmem_default to either
> 524284 or 262142 makes no difference.
>
> Setting netdev_max_backlog to 1200 (from 300) makes no difference.
>
> Setting ipfrag_high_thresh to up to 4194304 makes no difference.
>
> We have a mix of clients and servers, not all of which support
> NFS/TCP yet, so we can't globally set tcp in the automounter maps
> for our netapp mounts or as "localoptions" in the autofs init
> script. The mount patch submitted by Steve Dickson on Aug 6, "NFS
> Mount Patch: Making NFS over TCP the default" is probably the
> immediate workaround that will at least make the mounts that really
> matter to us work well again. I'll test that next.
>
> Is this a known problem? Is there a patch already out there or in
> the works that fixes this? What other data would help drill into
> this problem?
>
> Rob
-------------------------------------------------------
This sf.net email is sponsored by:ThinkGeek
Welcome to geek heaven.
http://thinkgeek.com/sf
_______________________________________________
NFS maillist - NFS@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nfs
end of thread, other threads:[~2003-10-15  3:27 UTC | newest]

Thread overview: 12+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2003-09-25 17:59 NFS/UDP slow read, lost fragments Robert L. Millner
2003-09-25 20:22 ` Brian Mancuso
2003-09-25 20:33   ` brianm
2003-09-25 21:44     ` Brian Mancuso
2003-09-25 23:31       ` Trond Myklebust
2003-09-26 21:07         ` Brian Mancuso
2003-09-27  5:02           ` Robert L. Millner
2003-10-14  1:20             ` Steve Dickson
2003-10-14 14:52               ` Trond Myklebust
2003-10-15  3:17                 ` Steve Dickson
2003-10-15  3:27                   ` Trond Myklebust
2003-09-25 20:11 Lever, Charles