From mboxrd@z Thu Jan 1 00:00:00 1970 From: Gavin Hu Subject: [PATCH v1 0/5] spinlock optimization and test case enhancements Date: Thu, 20 Dec 2018 18:42:41 +0800 Message-ID: <20181220104246.5590-1-gavin.hu@arm.com> Cc: thomas@monjalon.net, jerinj@marvell.com, hemant.agrawal@nxp.com, bruce.richardson@intel.com, chaozhu@linux.vnet.ibm.com, nd@arm.com, Honnappa.Nagarahalli@arm.com, Gavin Hu To: dev@dpdk.org Return-path: Received: from foss.arm.com (foss.arm.com [217.140.101.70]) by dpdk.org (Postfix) with ESMTP id 124AA1B96E for ; Thu, 20 Dec 2018 11:43:06 +0100 (CET) List-Id: DPDK patches and discussions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dev-bounces@dpdk.org Sender: "dev" V1: 1. Remove the 1us delay outside of the locked region to really benchmark the spinlock acquire/release performance, not the delay API. 2. Use the precise version of getting timestamps for more precise benchmarking results. 3. Amortize the overhead of getting the timestamp by 10000 loops 4. Move the arm specific implementation to arm folder to remove the hardcoded implementation. 5. Use atomic primitives, which translate to one-way barriers, instead of two-way sync primitives, to optimize for performance. Gavin Hu (5): test/spinlock: remove 1us delay to create contention test/spinlock: get timestamp more precisely test/spinlock: amortize the cost of getting time spinlock: move the implementation to arm specific file spinlock: reimplement with atomic one-way barrier builtins .../common/include/arch/arm/rte_spinlock.h | 28 +++++++++++++++++ .../common/include/generic/rte_spinlock.h | 28 +---------------- test/test/test_spinlock.c | 35 +++++++++++----------- 3 files changed, 47 insertions(+), 44 deletions(-) -- 2.11.0