From mboxrd@z Thu Jan 1 00:00:00 1970 From: Peter Zijlstra Subject: Re: [RFC 10/12] x86, rwsem: simplify __down_write Date: Thu, 9 Jun 2016 19:36:40 +0200 Message-ID: <20160609173640.GX30154@twins.programming.kicks-ass.net> References: <20160603223417.GB3892@twins.programming.kicks-ass.net> <1454444369-2146-1-git-send-email-mhocko@kernel.org> <1454444369-2146-11-git-send-email-mhocko@kernel.org> <20160203081016.GD32652@gmail.com> <20160603161339.GC3693@twins.programming.kicks-ass.net> <29355.1465483258@warthog.procyon.org.uk> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Return-path: Received: from merlin.infradead.org ([205.233.59.134]:59649 "EHLO merlin.infradead.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S932077AbcFIRg6 (ORCPT ); Thu, 9 Jun 2016 13:36:58 -0400 Content-Disposition: inline In-Reply-To: <29355.1465483258@warthog.procyon.org.uk> Sender: linux-arch-owner@vger.kernel.org List-ID: To: David Howells Cc: Ingo Molnar , Michal Hocko , LKML , Ingo Molnar , Thomas Gleixner , "H. Peter Anvin" , "David S. Miller" , Tony Luck , Andrew Morton , Chris Zankel , Max Filippov , x86@kernel.org, linux-alpha@vger.kernel.org, linux-ia64@vger.kernel.org, linux-s390@vger.kernel.org, linux-sh@vger.kernel.org, sparclinux@vger.kernel.org, linux-xtensa@linux-xtensa.org, linux-arch@vger.kernel.org, Michal Hocko , Linus Torvalds , "Paul E. McKenney" , Jason Low On Thu, Jun 09, 2016 at 03:40:58PM +0100, David Howells wrote: > Peter Zijlstra wrote: > > > Blergh; so looking at more asm there's still a few tricks we cannot do. > > So while overall size is down, some paths do end up more expensive. (It > > typically boils down to creative use of condition flags, which is very > > hard in C) > > It can be done using ISO __atomic_fetch_add() and suchlike. (ISO-C11, ISO as such is a bad abbreviation I think) Maybe, but we're almost there with __GCC_ASM_FLAG_OUTPUTS__. atomic_long_add_negative() can be made to do inc;j(n)s for __down_read. the try_cmpxchg family you wanted to add independent from the ISO-C11 bits can do the cmpxchg-j(n)z for __down_{read,write}_trylock. That only leaves us wanting an atomic_long_fetch_add_negative() for __up_{read,write}(). Although I suppose, for this to be of use for our weakly ordered friends, we need _relaxed versions of all that (so that _acquire and _release variants are generated).