From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-12.3 required=3.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,HEADER_FROM_DIFFERENT_DOMAINS, INCLUDES_PATCH,MAILING_LIST_MULTI,NICE_REPLY_A,SPF_HELO_NONE,SPF_PASS, USER_AGENT_SANE_1 autolearn=ham autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id D3701C433B4 for ; Sat, 10 Apr 2021 00:22:09 +0000 (UTC) Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by mail.kernel.org (Postfix) with ESMTP id 999006113A for ; Sat, 10 Apr 2021 00:22:09 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S235231AbhDJAWW (ORCPT ); Fri, 9 Apr 2021 20:22:22 -0400 Received: from mail-mw2nam12on2042.outbound.protection.outlook.com ([40.107.244.42]:7328 "EHLO NAM12-MW2-obe.outbound.protection.outlook.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S235215AbhDJAWV (ORCPT ); Fri, 9 Apr 2021 20:22:21 -0400 ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=GlxEfLzrhaPdoQosaNtbcTNs59up3rdx52tgXQIloxfIkIZwgPPEIYOTyfvOSikhxfZTHztmJkK7/L+NlOqaqYebkZ8zQrqIaSEwCCo1c0GaH3yhmOiCrSyw4DKcC+2rQyZA26+kiBkVeatQQAxkCf7o8gOHTsn9xoYwUZkaVh+TXkU1nlI5i+/c0iilE4qdKCAINjuTTwLD4B3UeO9PYZab8cmug/9cXDVLlCDZ4UA6uFicOxu0qbkoyoDczqWtE6SP/vKx2XPzIWktHhKKgyHTf4PHLb081e62Wdvqoab+llc1aVthtPKt+fCRmkMGwOH7X6h5Jo1I9X201JUhUQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=qTNVcLQfO3+hLDF1G2Bydtq6ipe4vouwA42qltinQO8=; b=KNYzT2WQEwP3Dwkxiu4PG2gkGHqBTTdO0jtgbfsw1A/qn6UoJI9akXAx500VEr/VxOBM3MbbNYpDhpKnS28d6zrE37OyKHcxjZoLPGieyKXzvHb15FwYNY1fnmmgGz+/5c4id6utuL6023KgszgWqIyU3Wh9mLk8gkssUYqJHvZudtnK7iBPZpnmjP/snSXsJ8jH8Sa6gWlJ7cugal4Ms+M0ODGOhaHQCt854ifGkhlEUIyxGA2eiCi0IVGRguXyeO1lJszAYnTe7uq8ZETSFuMlp9MfNZ88RGXm8VXs5+t9ut0Hi2U+WP/omtuWaDrZgF1GznsNaqqCzC/x085jUA== ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass (sender ip is 216.228.112.34) smtp.rcpttodomain=redhat.com smtp.mailfrom=nvidia.com; dmarc=pass (p=none sp=none pct=100) action=none header.from=nvidia.com; dkim=none (message not signed); arc=none DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=Nvidia.com; s=selector2; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=qTNVcLQfO3+hLDF1G2Bydtq6ipe4vouwA42qltinQO8=; b=NRj2vuU1DS2kU8yTG675GzuEqtx6QWirEZIWvHSJIM43yTPsWAYqq80ea/Rj77Im3FSxGMredLtwvf+xvej9XUI0tV4/w9aYbZ9EnQmMIF0+dx0wxovcRTdb937DMAB5/p9PV1p2qrI1Ll4bhJb/K2EoinPQJfI7MSZ5hCitFVstzMO+9IusJHLEg/tRqQEhGZLd+dYpLjvkXtPeYd/T2bHcqpw+r1109ZvzmoNI7ZRFg6VXJljm1w5LRhnv5KSgNZ4JctohgBvtRJmSTC5YtTFxhvBfwYShNT2OUtus4s9XAxR+A7TMh0D1aomxNfrmj075Qz5aIBLqWRWiYlRo7w== Received: from MW4PR03CA0100.namprd03.prod.outlook.com (2603:10b6:303:b7::15) by MN2PR12MB3760.namprd12.prod.outlook.com (2603:10b6:208:158::33) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.4020.16; Sat, 10 Apr 2021 00:22:05 +0000 Received: from CO1NAM11FT032.eop-nam11.prod.protection.outlook.com (2603:10b6:303:b7::4) by MW4PR03CA0100.outlook.office365.com (2603:10b6:303:b7::15) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.4020.16 via Frontend Transport; Sat, 10 Apr 2021 00:22:05 +0000 X-MS-Exchange-Authentication-Results: spf=pass (sender IP is 216.228.112.34) smtp.mailfrom=nvidia.com; redhat.com; dkim=none (message not signed) header.d=none;redhat.com; dmarc=pass action=none header.from=nvidia.com; Received-SPF: Pass (protection.outlook.com: domain of nvidia.com designates 216.228.112.34 as permitted sender) receiver=protection.outlook.com; client-ip=216.228.112.34; helo=mail.nvidia.com; Received: from mail.nvidia.com (216.228.112.34) by CO1NAM11FT032.mail.protection.outlook.com (10.13.174.218) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_CBC_SHA384) id 15.20.4020.17 via Frontend Transport; Sat, 10 Apr 2021 00:22:05 +0000 Received: from [172.27.0.23] (172.20.145.6) by HQMAIL107.nvidia.com (172.20.187.13) with Microsoft SMTP Server (TLS) id 15.0.1497.2; Sat, 10 Apr 2021 00:22:00 +0000 Subject: Re: [RFC PATCH v5 0/4] add simple copy support To: SelvaKumar S , CC: , , , , , , , , , , , , , , , References: <20210219124517.79359-1-selvakuma.s1@samsung.com> From: Max Gurtovoy Message-ID: Date: Sat, 10 Apr 2021 03:21:57 +0300 User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:78.0) Gecko/20100101 Thunderbird/78.9.0 MIME-Version: 1.0 In-Reply-To: <20210219124517.79359-1-selvakuma.s1@samsung.com> Content-Type: text/plain; charset="utf-8"; format=flowed Content-Transfer-Encoding: 7bit Content-Language: en-US X-Originating-IP: [172.20.145.6] X-ClientProxiedBy: HQMAIL105.nvidia.com (172.20.187.12) To HQMAIL107.nvidia.com (172.20.187.13) X-EOPAttributedMessage: 0 X-MS-PublicTrafficType: Email X-MS-Office365-Filtering-Correlation-Id: 9c6b6fc5-c84c-4c8b-760e-08d8fbb6abc0 X-MS-TrafficTypeDiagnostic: MN2PR12MB3760: X-Microsoft-Antispam-PRVS: X-MS-Oob-TLC-OOBClassifiers: OLM:5516; X-MS-Exchange-SenderADCheck: 1 X-Microsoft-Antispam: BCL:0; X-Microsoft-Antispam-Message-Info: IXIQ1Y9S2hrdwTPcTmnwVJrmz4tqrXFi2Kw9wCdqwQR+01ULjcjNJQ9wc5N3ISPaNPEYKFBVOptWkvQW3jnR42W0pj4+y7KmJQpv7AMNA51brNY7InQKkTugB0lW/Fr/rOJBbwO1RU5tyYTtb5vX/Ywuo8w1/qQZVFK+fb/FtxIqp7KIAoMaLuoHTPpET3O0ghIycGMME0kx5U+JrU8s1NsMVB4a5QsTgMmaxBVcQ+HozMFnGB2Gk4QJZFjh8oS0Yp9HO7MExX/U4xifhgC3p1xeHY6rQTgQkuVnV8sVsOqnns0dRH8hryCMc+dTZ19fu+Z6oSrvwm6i2kuomVgY2p+VdLEula9LCa7ggaFS0RsdjccxG51PsUST24XHgQDnZhB1kmAxjXJNuklX5DHWTzhfQhUlGHgFDswt6bEde72Mp6fKeBQj5L5QzNtdsXMeMFHllf/oZtX0PIKViVwMJAGkQpaqh0DAD2WRsie1yQnmlUGDrjvUCx5k2VmlT9WWQBm02YtZO8GQeK0kJ0x123mJay7SmEUDmj7zw6xrmwJKjfZIEeb1w6mImTWCAG/7bRGHqKKOQrjD5IPzGu/r8M1McglVB0bulE4PiFww9/0Jw366rhCEgTxQ1Bokroc3W6bhDlNH4fd8SmzHoscTDto9eyNi5Pf3eLBO6+zkqvqKM5gmwHXtD1go4eP13uuVkSykqw6t8yQ05hD4NBCWkZkXa4DT/8vzPiWKrYTUOeGtCfy+s1bCPtLTD7Bxvv/QA/skONlPsI+kc1xLeC6mzFIFOVT+YrQ4JG0an2Lf+DcXkUzrMOIPGe7QRN8FNoLbrk44gOa5yE63BVaJj4EOtA== X-Forefront-Antispam-Report: CIP:216.228.112.34;CTRY:US;LANG:en;SCL:1;SRV:;IPV:NLI;SFV:NSPM;H:mail.nvidia.com;PTR:schybrid03.nvidia.com;CAT:NONE;SFS:(4636009)(346002)(136003)(376002)(396003)(39860400002)(46966006)(36840700001)(36756003)(2616005)(82740400003)(16526019)(6666004)(478600001)(82310400003)(966005)(31686004)(186003)(2906002)(26005)(36860700001)(70206006)(110136005)(356005)(70586007)(31696002)(54906003)(53546011)(83380400001)(7416002)(7636003)(5660300002)(16576012)(8676002)(47076005)(336012)(8936002)(4326008)(86362001)(316002)(36906005)(426003)(43740500002)(43620500001)(15398625002);DIR:OUT;SFP:1101; X-OriginatorOrg: Nvidia.com X-MS-Exchange-CrossTenant-OriginalArrivalTime: 10 Apr 2021 00:22:05.6022 (UTC) X-MS-Exchange-CrossTenant-Network-Message-Id: 9c6b6fc5-c84c-4c8b-760e-08d8fbb6abc0 X-MS-Exchange-CrossTenant-Id: 43083d15-7273-40c1-b7db-39efd9ccc17a X-MS-Exchange-CrossTenant-OriginalAttributedTenantConnectingIp: TenantId=43083d15-7273-40c1-b7db-39efd9ccc17a;Ip=[216.228.112.34];Helo=[mail.nvidia.com] X-MS-Exchange-CrossTenant-AuthSource: CO1NAM11FT032.eop-nam11.prod.protection.outlook.com X-MS-Exchange-CrossTenant-AuthAs: Anonymous X-MS-Exchange-CrossTenant-FromEntityHeader: HybridOnPrem X-MS-Exchange-Transport-CrossTenantHeadersStamped: MN2PR12MB3760 Precedence: bulk List-ID: X-Mailing-List: linux-block@vger.kernel.org On 2/19/2021 2:45 PM, SelvaKumar S wrote: > This patchset tries to add support for TP4065a ("Simple Copy Command"), > v2020.05.04 ("Ratified") > > The Specification can be found in following link. > https://nvmexpress.org/wp-content/uploads/NVM-Express-1.4-Ratified-TPs-1.zip > > Simple copy command is a copy offloading operation and is used to copy > multiple contiguous ranges (source_ranges) of LBA's to a single destination > LBA within the device reducing traffic between host and device. > > This implementation doesn't add native copy offload support for stacked > devices rather copy offload is done through emulation. Possible use > cases are F2FS gc and BTRFS relocation/balance. > > *blkdev_issue_copy* takes source bdev, no of sources, array of source > ranges (in sectors), destination bdev and destination offset(in sectors). > If both source and destination block devices are same and copy_offload = 1, > then copy is done through native copy offloading. Copy emulation is used > in other cases. > > As SCSI XCOPY can take two different block devices and no of source range is > equal to 1, this interface can be extended in future to support SCSI XCOPY. Any idea why this TP wasn't designed for copy offload between 2 different namespaces in the same controller ? And a simple copy will be the case where the src_nsid == dst_nsid ? Also why there are multiple source ranges and only one dst range ? We could add a bit to indicate if this range is src or dst.. > > For devices supporting native simple copy, attach the control information > as payload to the bio and submit to the device. For devices without native > copy support, copy emulation is done by reading each source range into memory > and writing it to the destination. Caller can choose not to try > emulation if copy offload is not supported by setting > BLKDEV_COPY_NOEMULATION flag. > > Following limits are added to queue limits and are exposed in sysfs > to userspace > - *copy_offload* controls copy_offload. set 0 to disable copy > offload, 1 to enable native copy offloading support. > - *max_copy_sectors* limits the sum of all source_range length > - *max_copy_nr_ranges* limits the number of source ranges > - *max_copy_range_sectors* limit the maximum number of sectors > that can constitute a single source range. > > max_copy_sectors = 0 indicates the device doesn't support copy > offloading. > > *copy offload* sysfs entry is configurable and can be used toggle > between emulation and native support depending upon the usecase. > > Changes from v4 > > 1. Extend dm-kcopyd to leverage copy-offload, while copying within the > same device. The other approach was to have copy-emulation by moving > dm-kcopyd to block layer. But it also required moving core dm-io infra, > causing a massive churn across multiple dm-targets. > > 2. Remove export in bio_map_kern() > 3. Change copy_offload sysfs to accept 0 or else > 4. Rename copy support flag to QUEUE_FLAG_SIMPLE_COPY > 5. Rename payload entries, add source bdev field to be used while > partition remapping, remove copy_size > 6. Change the blkdev_issue_copy() interface to accept destination and > source values in sector rather in bytes > 7. Add payload to bio using bio_map_kern() for copy_offload case > 8. Add check to return error if one of the source range length is 0 > 9. Add BLKDEV_COPY_NOEMULATION flag to allow user to not try copy > emulation incase of copy offload is not supported. Caller can his use > his existing copying logic to complete the io. > 10. Bug fix copy checks and reduce size of rcu_lock() > > Planned for next: > - adding blktests > - handling larger (than device limits) copy > - decide on ioctl interface (man-page etc.) > > Changes from v3 > > 1. gfp_flag fixes. > 2. Export bio_map_kern() and use it to allocate and add pages to bio. > 3. Move copy offload, reading to buf, writing from buf to separate functions. > 4. Send read bio of copy offload by chaining them and submit asynchronously. > 5. Add gendisk->part0 and part->bd_start_sect changes to blk_check_copy(). > 6. Move single source range limit check to blk_check_copy() > 7. Rename __blkdev_issue_copy() to blkdev_issue_copy and remove old helper. > 8. Change blkdev_issue_copy() interface generic to accepts destination bdev > to support XCOPY as well. > 9. Add invalidate_kernel_vmap_range() after reading data for vmalloc'ed memory. > 10. Fix buf allocoation logic to allocate buffer for the total size of copy. > 11. Reword patch commit description. > > Changes from v2 > > 1. Add emulation support for devices not supporting copy. > 2. Add *copy_offload* sysfs entry to enable and disable copy_offload > in devices supporting simple copy. > 3. Remove simple copy support for stacked devices. > > Changes from v1: > > 1. Fix memory leak in __blkdev_issue_copy > 2. Unmark blk_check_copy inline > 3. Fix line break in blk_check_copy_eod > 4. Remove p checks and made code more readable > 5. Don't use bio_set_op_attrs and remove op and set > bi_opf directly > 6. Use struct_size to calculate total_size > 7. Fix partition remap of copy destination > 8. Remove mcl,mssrl,msrc from nvme_ns > 9. Initialize copy queue limits to 0 in nvme_config_copy > 10. Remove return in QUEUE_FLAG_COPY check > 11. Remove unused OCFS > > SelvaKumar S (4): > block: make bio_map_kern() non static > block: add simple copy support > nvme: add simple copy support > dm kcopyd: add simple copy offload support > > block/blk-core.c | 102 +++++++++++++++-- > block/blk-lib.c | 223 ++++++++++++++++++++++++++++++++++++++ > block/blk-map.c | 2 +- > block/blk-merge.c | 2 + > block/blk-settings.c | 10 ++ > block/blk-sysfs.c | 47 ++++++++ > block/blk-zoned.c | 1 + > block/bounce.c | 1 + > block/ioctl.c | 33 ++++++ > drivers/md/dm-kcopyd.c | 49 ++++++++- > drivers/nvme/host/core.c | 87 +++++++++++++++ > include/linux/bio.h | 1 + > include/linux/blk_types.h | 14 +++ > include/linux/blkdev.h | 17 +++ > include/linux/nvme.h | 43 +++++++- > include/uapi/linux/fs.h | 13 +++ > 16 files changed, 627 insertions(+), 18 deletions(-) >