All of lore.kernel.org
 help / color / mirror / Atom feed
From: "Atchley, Scott" <atchleyes-1Heg1YXhbW8@public.gmane.org>
To: Christoph Lameter <cl-vYTEC60ixJUAvxtiuMwx3w@public.gmane.org>
Cc: "linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org"
	<linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org>
Subject: Re: IPoIB performance
Date: Wed, 5 Sep 2012 13:09:53 -0400	[thread overview]
Message-ID: <3F476926-8618-4233-A150-C5D487B55C68@ornl.gov> (raw)
In-Reply-To: <0000013997217ec3-f6e2593f-2ff8-408d-814e-0345582b31ca-000000-p/GC64/jrecnJqMo6gzdpkEOCMrvLtNR@public.gmane.org>

On Sep 5, 2012, at 11:51 AM, Christoph Lameter wrote:

> On Wed, 29 Aug 2012, Atchley, Scott wrote:
> 
>> I am benchmarking a sockets based application and I want a sanity check
>> on IPoIB performance expectations when using connected mode (65520 MTU).
>> I am using the tuning tips in Documentation/infiniband/ipoib.txt. The
>> machines have Mellanox QDR cards (see below for the verbose ibv_devinfo
>> output). I am using a 2.6.36 kernel. The hosts have single socket Intel
>> E5520 (4 core with hyper-threading on) at 2.27 GHz.
>> 
>> I am using netperf's TCP_STREAM and binding cores. The best I have seen
>> is ~13 Gbps. Is this the best I can expect from these cards?
> 
> Sounds about right, This is not a hardware limitation but
> a limitation of the socket I/O layer / PCI-E bus. The cards generally can
> process more data than the PCI bus and the OS can handle.
> 
> PCI-E on PCI 2.0 should give you up to about 2.3 Gbytes/sec with these
> nics. So there is like something that the network layer does to you that
> limits the bandwidth.

First, thanks for the reply.

I am not sure where are are getting the 2.3 GB/s value. When using verbs natively, I can get ~3.4 GB/s. I am assuming that these HCAs lack certain TCP offloads that might allow higher Socket performance. Ethtool reports:

# ethtool -k ib0
Offload parameters for ib0:
rx-checksumming: off
tx-checksumming: off
scatter-gather: off
tcp segmentation offload: off
udp fragmentation offload: off
generic segmentation offload: on
generic-receive-offload: off

There is no checksum support which I would expect to lower performance. Since checksums need to be calculated in the host, I would expect faster processors to help performance some.

So basically, am I in the ball park given this hardware?

> 
>> What should I expect as a max for ipoib with FDR cards?
> 
> More of the same. You may want to
> 
> A) increase the block size handled by the socket layer

Do you mean altering sysctl with something like:

# increase TCP max buffer size setable using setsockopt()
net.core.rmem_max = 16777216 
net.core.wmem_max = 16777216 
# increase Linux autotuning TCP buffer limit 
net.ipv4.tcp_rmem = 4096 87380 16777216
net.ipv4.tcp_wmem = 4096 65536 16777216
# increase the length of the processor input queue
net.core.netdev_max_backlog = 30000

or something increasing the SO_SNFBUF and SO_RCVBUF sizes or something else?

> B) Increase the bandwidth by using PCI-E 3 or more PCI-E lanes.
> 
> C) Bypass the socket layer. Look at Sean's rsockets layer f.e.

We actually want to test the socket stack and not bypass it.

Thanks again!

Scott

--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

  parent reply	other threads:[~2012-09-05 17:09 UTC|newest]

Thread overview: 18+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2012-08-29 19:35 IPoIB performance Atchley, Scott
     [not found] ` <FFD82983-4A73-4A52-B6BE-C63DA16A507C-1Heg1YXhbW8@public.gmane.org>
2012-09-05 15:51   ` Christoph Lameter
     [not found]     ` <0000013997217ec3-f6e2593f-2ff8-408d-814e-0345582b31ca-000000-p/GC64/jrecnJqMo6gzdpkEOCMrvLtNR@public.gmane.org>
2012-09-05 17:09       ` Atchley, Scott [this message]
     [not found]         ` <3F476926-8618-4233-A150-C5D487B55C68-1Heg1YXhbW8@public.gmane.org>
2012-09-05 18:20           ` Christoph Lameter
     [not found]             ` <0000013997a928ab-36ad5a02-3c82-4daf-8e8a-a86c65e92376-000000-p/GC64/jrecnJqMo6gzdpkEOCMrvLtNR@public.gmane.org>
2012-09-05 18:30               ` Atchley, Scott
     [not found]                 ` <78D8717C-0505-408B-8625-A9124AB33C9E-1Heg1YXhbW8@public.gmane.org>
2012-09-05 19:06                   ` Christoph Lameter
     [not found]                     ` <0000013997d35848-57319f33-839b-4480-a075-53b36f67bfe2-000000-p/GC64/jrecnJqMo6gzdpkEOCMrvLtNR@public.gmane.org>
2012-09-05 19:48                       ` Atchley, Scott
     [not found]                         ` <E88DEA9F-416B-4663-A292-5780DFF9B641-1Heg1YXhbW8@public.gmane.org>
2012-09-05 19:53                           ` Christoph Lameter
2012-09-05 20:12                           ` Ezra Kissel
     [not found]                             ` <5047B23C.6080604-GZvvpLG7cYSVc3sceRu5cw@public.gmane.org>
2012-09-05 20:32                               ` Atchley, Scott
2012-09-05 19:13                   ` Christoph Lameter
     [not found]                     ` <0000013997da4014-44dda0e8-01f3-48b8-b0cd-fe41164d590c-000000-p/GC64/jrecnJqMo6gzdpkEOCMrvLtNR@public.gmane.org>
2012-09-05 19:52                       ` Atchley, Scott
     [not found]                         ` <F90894CF-B55C-4AF7-845C-279FDF44351E-1Heg1YXhbW8@public.gmane.org>
2012-09-05 20:26                           ` Christoph Lameter
2012-09-05 17:52       ` Reeted
2012-09-05 17:50   ` Reeted
     [not found]     ` <504790D8.7010307-9AbUPqfR1/2XDw4h08c5KA@public.gmane.org>
2012-09-05 17:59       ` Atchley, Scott
     [not found]         ` <64A9A0CD-00E0-4455-A641-324FA9BB8BC2-1Heg1YXhbW8@public.gmane.org>
2012-09-05 19:04           ` Reeted
     [not found]             ` <5047A233.2040602-9AbUPqfR1/2XDw4h08c5KA@public.gmane.org>
2012-09-05 19:46               ` Atchley, Scott

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=3F476926-8618-4233-A150-C5D487B55C68@ornl.gov \
    --to=atchleyes-1heg1yxhbw8@public.gmane.org \
    --cc=cl-vYTEC60ixJUAvxtiuMwx3w@public.gmane.org \
    --cc=linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.