All of lore.kernel.org
 help / color / mirror / Atom feed
* [PATCH v2] MCS spinlock: Use smp_cond_load_acquire()
@ 2016-04-13  3:02 Jason Low
  2016-04-13 17:43 ` Will Deacon
  0 siblings, 1 reply; 4+ messages in thread
From: Jason Low @ 2016-04-13  3:02 UTC (permalink / raw)
  To: Peter Zijlstra, Will Deacon, Linus Torvalds
  Cc: linux-kernel, mingo, paulmck, terry.rudd, waiman.long,
	boqun.feng, dave, jason.low2, kbuild test robot, kbuild-all

For qspinlocks on ARM64, we would like to use WFE instead
of purely spinning. Qspinlocks internally have lock
contenders spin on an MCS lock.

Update arch_mcs_spin_lock_contended() such that it uses
the new smp_cond_load_acquire() so that ARM64 can also
override this spin loop with its own implementation using WFE.

On x86, it can also cheaper to use this than spinning on
smp_load_acquire().

Signed-off-by: Jason Low <jason.low2@hp.com>
---
Changes from v1:
- Pass l instead of &l to smp_cond_load_acquire() since
  l is already a pointer to the lock variable.

 kernel/locking/mcs_spinlock.h | 8 ++++----
 1 file changed, 4 insertions(+), 4 deletions(-)

diff --git a/kernel/locking/mcs_spinlock.h b/kernel/locking/mcs_spinlock.h
index c835270..1c55987 100644
--- a/kernel/locking/mcs_spinlock.h
+++ b/kernel/locking/mcs_spinlock.h
@@ -22,13 +22,13 @@ struct mcs_spinlock {
 
 #ifndef arch_mcs_spin_lock_contended
 /*
- * Using smp_load_acquire() provides a memory barrier that ensures
- * subsequent operations happen after the lock is acquired.
+ * Using smp_cond_load_acquire() provides the acquire semantics
+ * required so that subsequent operations happen after the
+ * lock is acquired.
  */
 #define arch_mcs_spin_lock_contended(l)					\
 do {									\
-	while (!(smp_load_acquire(l)))					\
-		cpu_relax_lowlatency();					\
+	smp_cond_load_acquire(l, VAL);					\
 } while (0)
 #endif
 
-- 
2.1.4

^ permalink raw reply related	[flat|nested] 4+ messages in thread

* Re: [PATCH v2] MCS spinlock: Use smp_cond_load_acquire()
  2016-04-13  3:02 [PATCH v2] MCS spinlock: Use smp_cond_load_acquire() Jason Low
@ 2016-04-13 17:43 ` Will Deacon
  2016-04-13 20:49   ` Jason Low
  0 siblings, 1 reply; 4+ messages in thread
From: Will Deacon @ 2016-04-13 17:43 UTC (permalink / raw)
  To: Jason Low
  Cc: Peter Zijlstra, Linus Torvalds, linux-kernel, mingo, paulmck,
	terry.rudd, waiman.long, boqun.feng, dave, kbuild test robot,
	kbuild-all

On Tue, Apr 12, 2016 at 08:02:17PM -0700, Jason Low wrote:
> For qspinlocks on ARM64, we would like to use WFE instead
> of purely spinning. Qspinlocks internally have lock
> contenders spin on an MCS lock.
> 
> Update arch_mcs_spin_lock_contended() such that it uses
> the new smp_cond_load_acquire() so that ARM64 can also
> override this spin loop with its own implementation using WFE.
> 
> On x86, it can also cheaper to use this than spinning on
> smp_load_acquire().
> 
> Signed-off-by: Jason Low <jason.low2@hp.com>

FWIW, we just override arch_mcs_spin_lock_contended entirely for arch/arm/
and use wfe there so we could do the same for arm64 in mainline already.

Will

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [PATCH v2] MCS spinlock: Use smp_cond_load_acquire()
  2016-04-13 17:43 ` Will Deacon
@ 2016-04-13 20:49   ` Jason Low
  2016-04-14  8:53     ` Will Deacon
  0 siblings, 1 reply; 4+ messages in thread
From: Jason Low @ 2016-04-13 20:49 UTC (permalink / raw)
  To: Will Deacon
  Cc: Peter Zijlstra, Linus Torvalds, linux-kernel, mingo, paulmck,
	Rudd, Terry (HP Cloud Systems Linux R&D),
	Long, Wai Man, boqun.feng, dave, kbuild test robot, kbuild-all,
	jason.low2

On Wed, 2016-04-13 at 10:43 -0700, Will Deacon wrote:
> On Tue, Apr 12, 2016 at 08:02:17PM -0700, Jason Low wrote:
> > For qspinlocks on ARM64, we would like to use WFE instead
> > of purely spinning. Qspinlocks internally have lock
> > contenders spin on an MCS lock.
> > 
> > Update arch_mcs_spin_lock_contended() such that it uses
> > the new smp_cond_load_acquire() so that ARM64 can also
> > override this spin loop with its own implementation using WFE.
> > 
> > On x86, it can also cheaper to use this than spinning on
> > smp_load_acquire().
> > 
> > Signed-off-by: Jason Low <jason.low2@hp.com>
> 
> FWIW, we just override arch_mcs_spin_lock_contended entirely for arch/arm/
> and use wfe there so we could do the same for arm64 in mainline already.

Right, I was also thinking about that, although when we use
smp_cond_load_acquire() in the generic implementation, would we just end
up overriding it for the arch/arm64 version with the same thing?  :)

Jason

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [PATCH v2] MCS spinlock: Use smp_cond_load_acquire()
  2016-04-13 20:49   ` Jason Low
@ 2016-04-14  8:53     ` Will Deacon
  0 siblings, 0 replies; 4+ messages in thread
From: Will Deacon @ 2016-04-14  8:53 UTC (permalink / raw)
  To: Jason Low
  Cc: Peter Zijlstra, Linus Torvalds, linux-kernel, mingo, paulmck,
	Rudd, Terry (HP Cloud Systems Linux R&D),
	Long, Wai Man, boqun.feng, dave, kbuild test robot, kbuild-all,
	jason.low2

On Wed, Apr 13, 2016 at 01:49:04PM -0700, Jason Low wrote:
> On Wed, 2016-04-13 at 10:43 -0700, Will Deacon wrote:
> > On Tue, Apr 12, 2016 at 08:02:17PM -0700, Jason Low wrote:
> > > For qspinlocks on ARM64, we would like to use WFE instead
> > > of purely spinning. Qspinlocks internally have lock
> > > contenders spin on an MCS lock.
> > > 
> > > Update arch_mcs_spin_lock_contended() such that it uses
> > > the new smp_cond_load_acquire() so that ARM64 can also
> > > override this spin loop with its own implementation using WFE.
> > > 
> > > On x86, it can also cheaper to use this than spinning on
> > > smp_load_acquire().
> > > 
> > > Signed-off-by: Jason Low <jason.low2@hp.com>
> > 
> > FWIW, we just override arch_mcs_spin_lock_contended entirely for arch/arm/
> > and use wfe there so we could do the same for arm64 in mainline already.
> 
> Right, I was also thinking about that, although when we use
> smp_cond_load_acquire() in the generic implementation, would we just end
> up overriding it for the arch/arm64 version with the same thing?  :)

Sure, and we can convert to smp_cond_load_acquire then. I was just thinking
that you can avoid the immediate dependency on Peter's stuff, that's all.

Will

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2016-04-14  8:53 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2016-04-13  3:02 [PATCH v2] MCS spinlock: Use smp_cond_load_acquire() Jason Low
2016-04-13 17:43 ` Will Deacon
2016-04-13 20:49   ` Jason Low
2016-04-14  8:53     ` Will Deacon

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.