From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752168AbeC0BGi (ORCPT ); Mon, 26 Mar 2018 21:06:38 -0400 Received: from mx3-rdu2.redhat.com ([66.187.233.73]:53072 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1751935AbeC0BGh (ORCPT ); Mon, 26 Mar 2018 21:06:37 -0400 Date: Mon, 26 Mar 2018 21:06:33 -0400 From: Mike Snitzer To: Yael Chemla Cc: Alasdair Kergon , dm-devel@redhat.com, linux-kernel@vger.kernel.org, ofir.drang@gmail.com, Yael Chemla Subject: Re: [PATCH 2/2] md: dm-verity: allow parallel processing of bio blocks Message-ID: <20180327010633.GB23487@redhat.com> References: <1522003290-27243-1-git-send-email-yael.chemla@foss.arm.com> <1522003290-27243-2-git-send-email-yael.chemla@foss.arm.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <1522003290-27243-2-git-send-email-yael.chemla@foss.arm.com> User-Agent: Mutt/1.5.21 (2010-09-15) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Sun, Mar 25 2018 at 2:41pm -0400, Yael Chemla wrote: > Allow parallel processing of bio blocks by moving to async. completion > handling. This allows for better resource utilization of both HW and > software based hash tfm and therefore better performance in many cases, > depending on the specific tfm in use. > > Tested on ARM32 (zynq board) and ARM64 (Juno board). > Time of cat command was measured on a filesystem with various file sizes. > 12% performance improvement when HW based hash was used (ccree driver). > SW based hash showed less than 1% improvement. > CPU utilization when HW based hash was used presented 10% less context > switch, 4% less cycles and 7% less instructions. No difference in > CPU utilization noticed with SW based hash. > > Signed-off-by: Yael Chemla This one had various issues. I've fixed most of what I saw and staged in linux-next (purely for build test coverage purposes). I may drop this patch if others disagree with it (or my sg deallocation in the error path question isn't answered). I've staged the changes here (and in linux-next via 'for-next'): https://git.kernel.org/pub/scm/linux/kernel/git/device-mapper/linux-dm.git/log/?h=dm-4.17 I switched all the new GFP_KERNEL uses to GFP_NOIO. The fact that you're doing allocations at all (per IO) is bad enough. Using GFP_KERNEL is a serious liability (risk of deadlock if dm-verity were to be used for something like.. swap.. weird setup but possible). But the gfp flags aside, the need for additional memory and the expectation of scalable async parallel IO is potentially at odds with changes like this (that I just staged, and had to rebase your 2 patches ontop of): https://git.kernel.org/pub/scm/linux/kernel/git/device-mapper/linux-dm.git/commit/?h=dm-4.17&id=a89f6a2cfec86fba7a115642ff082cb4e9450ea6 So I'm particulalry interested to hear from google folks to understand if they are OK with your proposed verity async crypto API use. Mike