Date: Wed, 1 Apr 2015 08:31:08 -0700
From: "Paul E. McKenney"
To: Oleg Nesterov
Cc: Will Deacon, linux-kernel@vger.kernel.org, Peter Zijlstra
Subject: Re: [RESEND PATCH] documentation: memory-barriers: fix smp_mb__before_spinlock() semantics
Message-ID: <20150401153108.GQ9023@linux.vnet.ibm.com>
Reply-To: paulmck@linux.vnet.ibm.com
References: <1427791181-21952-1-git-send-email-will.deacon@arm.com>
 <20150331175050.GA14778@redhat.com>
In-Reply-To: <20150331175050.GA14778@redhat.com>

On Tue, Mar 31, 2015 at 07:50:50PM +0200, Oleg Nesterov wrote:
> On 03/31, Will Deacon wrote:
> >
> > Could somebody pick this up please? I guess I could route it via the arm64
> > tree with an Ack, but I'd rather it went through Paul or -tip.
>
> I think this would be the best route ;)
>
> > --- a/Documentation/memory-barriers.txt
> > +++ b/Documentation/memory-barriers.txt
> > @@ -1768,10 +1768,9 @@ for each construct. These operations all imply certain barriers:
> >
> >      Memory operations issued before the ACQUIRE may be completed after
> >      the ACQUIRE operation has completed. An smp_mb__before_spinlock(),
> > -    combined with a following ACQUIRE, orders prior loads against
> > -    subsequent loads and stores and also orders prior stores against
> > -    subsequent stores. Note that this is weaker than smp_mb()! The
> > -    smp_mb__before_spinlock() primitive is free on many architectures.
> > +    combined with a following ACQUIRE, orders prior stores against
> > +    subsequent loads and stores. Note that this is weaker than smp_mb()!
> > +    The smp_mb__before_spinlock() primitive is free on many architectures.
>
> I agree, this description was always wrong.
>
> But perhaps you can also update the comment above smp_mb__before_spinlock?
> It only documents the STORE - LOAD serialization, and this was on purpose.
>
> But people started to use this helper assuming that it can also serialize
> the STOREs. Perhaps the changelog could also mention this fact, this is why
> we need to update this comment and fix memory-barriers.txt.

If Will agrees, like the following?

							Thanx, Paul

------------------------------------------------------------------------

documentation: memory-barriers: Fix smp_mb__before_spinlock() semantics

Our current documentation claims that, when followed by an ACQUIRE,
smp_mb__before_spinlock() orders prior loads against subsequent loads
and stores, which isn't the intent.  This commit therefore fixes the
documentation to state that this sequence orders only prior stores
against subsequent loads and stores.

In addition, the original intent of smp_mb__before_spinlock() was to
order only prior stores against subsequent loads; however, people have
started using it as if it ordered prior stores against subsequent loads
and stores.  This commit therefore also updates
smp_mb__before_spinlock()'s header comment to reflect this new reality.
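
For illustration only (not part of the patch; the lock and the
variables below are invented), the guarantee described by the new
wording is roughly the following:

	#include <linux/spinlock.h>

	static DEFINE_SPINLOCK(demo_lock);	/* invented for this sketch */
	static int x, y;

	static void demo(void)
	{
		int r;

		x = 1;				/* prior store */
		smp_mb__before_spinlock();
		spin_lock(&demo_lock);		/* ACQUIRE */
		r = y;				/* subsequent load, ordered
						   after the store to x */
		y = r + 1;			/* subsequent store, likewise
						   ordered after the store to x */
		spin_unlock(&demo_lock);
	}

A load issued before the smp_mb__before_spinlock() gets no such
guarantee from this sequence, which is exactly the over-claim the
documentation change removes.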
Cc: Oleg Nesterov
Cc: "Paul E. McKenney"
Cc: Peter Zijlstra
Signed-off-by: Will Deacon
Signed-off-by: Paul E. McKenney

diff --git a/Documentation/memory-barriers.txt b/Documentation/memory-barriers.txt
index 6974f1c2b4e1..52c320e3f107 100644
--- a/Documentation/memory-barriers.txt
+++ b/Documentation/memory-barriers.txt
@@ -1784,10 +1784,9 @@ for each construct. These operations all imply certain barriers:
 
      Memory operations issued before the ACQUIRE may be completed after
      the ACQUIRE operation has completed. An smp_mb__before_spinlock(),
-     combined with a following ACQUIRE, orders prior loads against
-     subsequent loads and stores and also orders prior stores against
-     subsequent stores. Note that this is weaker than smp_mb()! The
-     smp_mb__before_spinlock() primitive is free on many architectures.
+     combined with a following ACQUIRE, orders prior stores against
+     subsequent loads and stores. Note that this is weaker than smp_mb()!
+     The smp_mb__before_spinlock() primitive is free on many architectures.
 
  (2) RELEASE operation implication:
diff --git a/include/linux/spinlock.h b/include/linux/spinlock.h
index 3e18379dfa6f..0063b24b4f36 100644
--- a/include/linux/spinlock.h
+++ b/include/linux/spinlock.h
@@ -120,7 +120,7 @@ do { \
 /*
  * Despite its name it doesn't necessarily has to be a full barrier.
  * It should only guarantee that a STORE before the critical section
- * can not be reordered with a LOAD inside this section.
+ * can not be reordered with LOADs and STOREs inside this section.
  * spin_lock() is the one-way barrier, this LOAD can not escape out
  * of the region. So the default implementation simply ensures that
 * a STORE can not move into the critical section, smp_wmb() should
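
For context on why the stronger STORE ordering matters in practice,
the sketch below is loosely modelled on the scheduler's wakeup path,
where try_to_wake_up() issues smp_mb__before_spinlock() before taking
the task's pi_lock. The "cond" flag and the surrounding function are
invented for illustration, and the real wakeup work is elided:

	#include <linux/sched.h>
	#include <linux/spinlock.h>

	static int cond;	/* invented: condition the sleeper waits on */

	static void wake_sleeper(struct task_struct *p)
	{
		unsigned long flags;

		cond = 1;			/* prior store */
		smp_mb__before_spinlock();
		raw_spin_lock_irqsave(&p->pi_lock, flags);	/* ACQUIRE */
		if (p->state & TASK_NORMAL) {	/* subsequent load */
			/* ... wakeup work elided ... */
		}
		raw_spin_unlock_irqrestore(&p->pi_lock, flags);
	}

Without that ordering, the waker could read p->state before its store
to cond became visible; paired with a sleeper that sets its state and
then re-checks cond, both sides could conclude that nothing needs to be
done, and the wakeup would be lost.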