All of lore.kernel.org
 help / color / mirror / Atom feed
* SMP performance question(Re: USB mass storage and ARM cache coherency)
@ 2010-02-27  7:32 Lin Mac
  2010-02-28 22:27 ` Catalin Marinas
  0 siblings, 1 reply; 2+ messages in thread
From: Lin Mac @ 2010-02-27  7:32 UTC (permalink / raw)
  To: linux-arm-kernel

Hi Catalin,

(I have changed my mail addr from mkl0301 at hotmail.com to mkl0301 at gmail.com)

> My latest solution - http://bit.ly/apJv3O - is to use dummy
> read-for-ownership or write-for-ownership accesses in the DMA cache
> flushing functions to force cache line migration from the other CPUs.
> Our current benchmarks only show around 10% disc throughput penalty
> compared to the normal SMP case (compared to the UP case the penalty is
> bigger but that's due to other things).

So it sounds like the performance of UP > __Normal SMP__ > RFO/WFO + SMP.

Maybe I've got the wrong expection, for I'm not experienced in SMP.
But I do expect the performance of  __Normal SMP__ should at least >=
UP's.

Why the performance of UP would > __Normal SMP__?
And what's the __Normal SMP__ definition?

Best Regard,
Mac Lin

^ permalink raw reply	[flat|nested] 2+ messages in thread

* SMP performance question(Re: USB mass storage and ARM cache coherency)
  2010-02-27  7:32 SMP performance question(Re: USB mass storage and ARM cache coherency) Lin Mac
@ 2010-02-28 22:27 ` Catalin Marinas
  0 siblings, 0 replies; 2+ messages in thread
From: Catalin Marinas @ 2010-02-28 22:27 UTC (permalink / raw)
  To: linux-arm-kernel

On Sat, 2010-02-27 at 07:32 +0000, Lin Mac wrote:
> > My latest solution - http://bit.ly/apJv3O - is to use dummy
> > read-for-ownership or write-for-ownership accesses in the DMA cache
> > flushing functions to force cache line migration from the other CPUs.
> > Our current benchmarks only show around 10% disc throughput penalty
> > compared to the normal SMP case (compared to the UP case the penalty is
> > bigger but that's due to other things).
> 
> So it sounds like the performance of UP > __Normal SMP__ > RFO/WFO + SMP.
> 
> Maybe I've got the wrong expection, for I'm not experienced in SMP.
> But I do expect the performance of  __Normal SMP__ should at least >=
> UP's.
> 
> Why the performance of UP would > __Normal SMP__?
> And what's the __Normal SMP__ definition?

By normal SMP I meant an unpatched kernel but with data corruption for
DMA transfers.

Our tests were for I/O bound operations (DMA transfers) where no matter
how many CPUs you add, the bottleneck is still the DMA engine. Adding
more CPUs could make things slightly worse by introducing extra
contention (and spinlocks, cache line ping-pong'ing).


-- 
Catalin

^ permalink raw reply	[flat|nested] 2+ messages in thread

end of thread, other threads:[~2010-02-28 22:27 UTC | newest]

Thread overview: 2+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2010-02-27  7:32 SMP performance question(Re: USB mass storage and ARM cache coherency) Lin Mac
2010-02-28 22:27 ` Catalin Marinas

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.