From: Douglas Gilbert
Reply-To: dgilbert@interlog.com
To: Christoph Hellwig, Pavel Begunkov
Cc: James Bottomley, Ming Lei, linux-block@vger.kernel.org, Jens Axboe,
    Matthew Wilcox, Johannes Weiner, Alexander Viro, "Darrick J . Wong",
    "Martin K . Petersen", Jonathan Corbet, linux-xfs@vger.kernel.org,
    linux-fsdevel@vger.kernel.org, io-uring@vger.kernel.org,
    linux-kernel@vger.kernel.org, target-devel@vger.kernel.org,
    linux-scsi@vger.kernel.org, linux-doc@vger.kernel.org
Subject: Re: [PATCH v1 0/6] no-copy bvec
Date: Thu, 24 Dec 2020 11:45:10 -0500
In-Reply-To: <20201224064119.GA3048@infradead.org>

On 2020-12-24 1:41 a.m., Christoph Hellwig wrote:
> On Wed, Dec 23, 2020 at 08:32:45PM +0000, Pavel Begunkov wrote:
>> On 23/12/2020 20:23, Douglas Gilbert wrote:
>>> On 2020-12-23 11:04 a.m., James Bottomley wrote:
>>>> On Wed, 2020-12-23 at 15:51 +0000, Christoph Hellwig wrote:
>>>>> On Wed, Dec 23, 2020 at 12:52:59PM +0000, Pavel Begunkov wrote:
>>>>>> Can scatterlist have 0-len entries? Those are directly translated
>>>>>> into bvecs, e.g. in nvme/target/io-cmd-file.c and
>>>>>> target/target_core_file.c. I've audited most of others by this
>>>>>> moment, they're fine.
>>>>>
>>>>> For block layer SGLs we should never see them, and for nvme neither.
>>>>> I think the same is true for the SCSI target code, but please double
>>>>> check.
>>>>
>>>> Right, no-one ever wants to see a 0-len scatter list entry.  The reason
>>>> is that every driver uses the sgl to program the device DMA engine in
>>>> the way NVME does.  a 0 length sgl would be a dangerous corner case:
>>>> some DMA engines would ignore it and others would go haywire, so if we
>>>> ever let a 0 length list down into the driver, they'd have to
>>>> understand the corner case behaviour of their DMA engine and filter it
>>>> accordingly, which is why we disallow them in the upper levels, since
>>>> they're effective nops anyway.
>>>
>>> When using scatter gather lists at the far end (i.e. on the storage device)
>>> the T10 examples (WRITE SCATTERED and POPULATE TOKEN in SBC-4) explicitly
>>> allow the "number of logical blocks" in their sgl_s to be zero and state
>>> that it is _not_ to be considered an error.
>>
>> It's fine for my case unless it leaks them out of device driver to the
>> net/block layer/etc. Is it?
>
> None of the SCSI Command mentions above are supported by Linux,
> nevermind mapped to struct scatterlist.
>

The POPULATE TOKEN / WRITE USING TOKEN pair can be viewed as a subset
of EXTENDED COPY (SPC-4) which also supports "range descriptors". It is
not clear if target_core_xcopy.c supports these range descriptors but
if it did, it would be trying to map them to struct scatterlist objects.

That said, it would be easy to skip the "number of logical blocks" == 0
case when translating range descriptors to sgl_s.

In my ddpt utility (a dd clone) I have generalized skip= and seek= to
optionally take sgl_s. If the last element in one of those sgl_s is
LBAn,0 then it is interpreted as "until the end of that device" which
is further restricted if the other sgl has a "hard" length or count=
is given. The point being a length of 0 can have meaning, a benefit
lost with NVMe's 0-based counts.

Doug Gilbert
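
To make the two zero-length cases above concrete, here is a minimal sketch in Python. It is not taken from ddpt or from target_core_xcopy.c. The names drop_zero_length, resolve_trailing_zero, device_blocks and limit are hypothetical, and an sgl is modelled simply as a list of (starting LBA, number of logical blocks) pairs. Treating zero-length elements in the middle of a list as no-ops is also an assumption of this sketch.

```python
from typing import List, Optional, Tuple

# Hypothetical model: one sgl element is (starting LBA, number of blocks).
Sgl = List[Tuple[int, int]]


def drop_zero_length(sgl: Sgl) -> Sgl:
    """Skip elements whose block count is 0, as one might when
    translating range descriptors into scatterlist entries."""
    return [(lba, num) for lba, num in sgl if num > 0]


def resolve_trailing_zero(sgl: Sgl, device_blocks: int,
                          limit: Optional[int] = None) -> Sgl:
    """Interpret a final 'LBAn,0' element as 'until the end of the device'.

    device_blocks is the device size in logical blocks (assumed known).
    limit, if given, caps the total block count, standing in for a
    count= value or a "hard" length on the other sgl.
    """
    if not sgl:
        return []
    *head, (last_lba, last_num) = sgl
    # Assumption: zero-length elements before the last one are no-ops.
    out = drop_zero_length(head)
    if last_num == 0:
        last_num = max(device_blocks - last_lba, 0)
    if last_num > 0:
        out.append((last_lba, last_num))
    if limit is None:
        return out
    # Trim the list so the total number of blocks does not exceed limit.
    capped: Sgl = []
    remaining = limit
    for lba, num in out:
        if remaining <= 0:
            break
        take = min(num, remaining)
        capped.append((lba, take))
        remaining -= take
    return capped
```

With a 1000-block device, resolve_trailing_zero([(0, 8), (100, 0)], 1000) expands the last element to cover LBA 100 through the end of the device, and passing limit=20 trims that same element to 12 blocks. The filter in drop_zero_length is the "easy to skip" translation step. The trailing-element rule shows why a count of 0 can carry meaning that a 0-based count cannot express.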