All of lore.kernel.org
 help / color / mirror / Atom feed
* Performance degradation on 4.6
@ 2016-05-31 23:46 Robert LeBlanc
  2016-06-01 21:00 ` Nicholas A. Bellinger
  0 siblings, 1 reply; 4+ messages in thread
From: Robert LeBlanc @ 2016-05-31 23:46 UTC (permalink / raw)
  To: linux-scsi

I've been working with iSER trying to maximize performance. I have two
FDR Infiniband cards (three ports total) and I've created multiple
address on the target and configured 12 paths from the initiator. The
iSER target is served up from a RAM disk. With the 4.4 kernel I'm able
to pretty much saturate the links (4.1 million seq 4K read IOPs 15.7
GB/s) using multipathd. With the same configuration on 4.6, I'm only
able to get 3.3 million IOPs or 12.8 GB/s.

Running perf top on the target and initiator, I'm seeing the target is
dominated by

73.89% [kernel] [k] memcpy_rems
 1.35% [kernel] [k] get_page_from_freelist
...

which is the same with 4.4. Perf top on the target has:

13.01% [kernel] [k] copy_user_enhanced_fast_string
 8.48% [kernel] [K] __blk_recalc_re_segments
 5.35% [kernel] [k] native_queued_spin_lock_slowpath
...

I don't recall what the initiator had on 4.4.

Single path performance has increase by about 5% which is nice, but
two path performance is about 50% that of the 4.4 kernel. Three paths
and greater are around 80% the 4.4 performance.

Any ideas on what could have caused the performance issues on 4.6? Is
there some additional information I can provide that would be useful?

Thanks,

----------------
Robert LeBlanc
PGP Fingerprint 79A2 9CA4 6CC4 45DD A904  C70E E654 3BB2 FA62 B9F1

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: Performance degradation on 4.6
  2016-05-31 23:46 Performance degradation on 4.6 Robert LeBlanc
@ 2016-06-01 21:00 ` Nicholas A. Bellinger
  2016-06-02 11:37   ` Christoph Hellwig
  0 siblings, 1 reply; 4+ messages in thread
From: Nicholas A. Bellinger @ 2016-06-01 21:00 UTC (permalink / raw)
  To: Robert LeBlanc; +Cc: linux-scsi, Christoph Hellwig, Sagi Grimberg

Hi Robert,

Adding HCH + Sagi CC, as there have been significant changes made in
RDMA for v4.6 code.

On Tue, 2016-05-31 at 17:46 -0600, Robert LeBlanc wrote:
> I've been working with iSER trying to maximize performance. I have two
> FDR Infiniband cards (three ports total) and I've created multiple
> address on the target and configured 12 paths from the initiator. The
> iSER target is served up from a RAM disk. With the 4.4 kernel I'm able
> to pretty much saturate the links (4.1 million seq 4K read IOPs 15.7
> GB/s) using multipathd. With the same configuration on 4.6, I'm only
> able to get 3.3 million IOPs or 12.8 GB/s.
> 
> Running perf top on the target and initiator, I'm seeing the target is
> dominated by
> 
> 73.89% [kernel] [k] memcpy_rems
>  1.35% [kernel] [k] get_page_from_freelist
> ...
> 
> which is the same with 4.4. Perf top on the target has:
> 
> 13.01% [kernel] [k] copy_user_enhanced_fast_string
>  8.48% [kernel] [K] __blk_recalc_re_segments
>  5.35% [kernel] [k] native_queued_spin_lock_slowpath
> ...
> 
> I don't recall what the initiator had on 4.4.
> 
> Single path performance has increase by about 5% which is nice, but
> two path performance is about 50% that of the 4.4 kernel. Three paths
> and greater are around 80% the 4.4 performance.
> 
> Any ideas on what could have caused the performance issues on 4.6? Is
> there some additional information I can provide that would be useful?
> 
> Thanks,
> 
> ----------------
> Robert LeBlanc
> PGP Fingerprint 79A2 9CA4 6CC4 45DD A904  C70E E654 3BB2 FA62 B9F1
> --
> To unsubscribe from this list: send the line "unsubscribe linux-scsi" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html



^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: Performance degradation on 4.6
  2016-06-01 21:00 ` Nicholas A. Bellinger
@ 2016-06-02 11:37   ` Christoph Hellwig
  2016-06-02 15:18     ` Robert LeBlanc
  0 siblings, 1 reply; 4+ messages in thread
From: Christoph Hellwig @ 2016-06-02 11:37 UTC (permalink / raw)
  To: Nicholas A. Bellinger
  Cc: Robert LeBlanc, linux-scsi, Christoph Hellwig, Sagi Grimberg

On Wed, Jun 01, 2016 at 02:00:19PM -0700, Nicholas A. Bellinger wrote:
> Hi Robert,
> 
> Adding HCH + Sagi CC, as there have been significant changes made in
> RDMA for v4.6 code.

And 4.5 for that matter.  It would be really useful to figure out
if it's the target or the initiator that regressed, and weather it's in
4.5 or 4.6.

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: Performance degradation on 4.6
  2016-06-02 11:37   ` Christoph Hellwig
@ 2016-06-02 15:18     ` Robert LeBlanc
  0 siblings, 0 replies; 4+ messages in thread
From: Robert LeBlanc @ 2016-06-02 15:18 UTC (permalink / raw)
  To: Christoph Hellwig
  Cc: Nicholas A. Bellinger, linux-scsi, Christoph Hellwig, Sagi Grimberg

I'm trying to bisect this (target,initiator, kernel commit), but I'm
having issues replicating my 4.4 results at the moment. Once I get
that done, hopefully I can get to the bottom of where things are
slowing down.
----------------
Robert LeBlanc
PGP Fingerprint 79A2 9CA4 6CC4 45DD A904  C70E E654 3BB2 FA62 B9F1


On Thu, Jun 2, 2016 at 5:37 AM, Christoph Hellwig <hch@infradead.org> wrote:
> On Wed, Jun 01, 2016 at 02:00:19PM -0700, Nicholas A. Bellinger wrote:
>> Hi Robert,
>>
>> Adding HCH + Sagi CC, as there have been significant changes made in
>> RDMA for v4.6 code.
>
> And 4.5 for that matter.  It would be really useful to figure out
> if it's the target or the initiator that regressed, and weather it's in
> 4.5 or 4.6.

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2016-06-02 15:18 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2016-05-31 23:46 Performance degradation on 4.6 Robert LeBlanc
2016-06-01 21:00 ` Nicholas A. Bellinger
2016-06-02 11:37   ` Christoph Hellwig
2016-06-02 15:18     ` Robert LeBlanc

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.