From mboxrd@z Thu Jan 1 00:00:00 1970
Return-Path:
Received: from ozlabs.org (ozlabs.org [103.22.144.67])
    (using TLSv1.2 with cipher ADH-AES256-GCM-SHA384 (256/256 bits))
    (No client certificate requested)
    by lists.ozlabs.org (Postfix) with ESMTPS id 3w6bGh4cpxzDq5b
    for ; Tue, 18 Apr 2017 16:45:00 +1000 (AEST)
From: Michael Ellerman
To: "Naveen N. Rao"
Cc: Paul Mackerras, linuxppc-dev@lists.ozlabs.org, Anton Blanchard, Matthew Wilcox
Subject: Re: [PATCH 1/2] powerpc: string: implement optimized memset variants
In-Reply-To: <874ly4v0vs.fsf@concordia.ellerman.id.au>
References: <20170322193030.GA8008@bombadil.infradead.org>
    <87mvc6b575.fsf@concordia.ellerman.id.au>
    <20170328102109.GC4762@naverao1-tp.localdomain>
    <87a884jow3.fsf@concordia.ellerman.id.au>
    <20170330071613.GE4762@naverao1-tp.localdomain>
    <874ly4v0vs.fsf@concordia.ellerman.id.au>
Date: Tue, 18 Apr 2017 16:45:00 +1000
Message-ID: <87k26i6wpf.fsf@concordia.ellerman.id.au>
MIME-Version: 1.0
Content-Type: text/plain
List-Id: Linux on PowerPC Developers Mail List
List-Unsubscribe: ,
List-Archive:
List-Post:
List-Help:
List-Subscribe: ,

Michael Ellerman writes:
> "Naveen N. Rao" writes:
>> (generic) is with Matt's arch-independent patches applied. Profiling
>> indicates that most of the overhead is actually with the lzo
>> decompression...
>>
>> Also, with a simple module to memset64() a 1GB vmalloc'ed buffer, here
>> are the results:
>> generic: 0.245315533 seconds time elapsed ( +- 1.83% )
>> optimized: 0.169282701 seconds time elapsed ( +- 1.96% )
>
> Great, that's pretty conclusive.
>
> I'm pretty sure I can take these 2 patches independently of Matt's
> series, they just won't be used by much until his series goes in, so
> I'll do that unless someone yells.

Hmm, just went to merge these, but I don't see Matt's series in
linux-next, so I'll hold off for now.

cheers