From: Danil Kipnis
Date: Fri, 27 Sep 2019 11:32:44 +0200
Subject: Re: [PATCH v4 17/25] ibnbd: client: main functionality
To: Roman Penyaev
Cc: Bart Van Assche, Jack Wang, linux-block@vger.kernel.org, linux-rdma@vger.kernel.org, Jens Axboe, Christoph Hellwig, Sagi Grimberg, Jason Gunthorpe, Doug Ledford, rpenyaev@suse.de, Jack Wang
X-Mailing-List: linux-block@vger.kernel.org

On Fri, Sep 27, 2019 at 10:52 AM Roman Penyaev wrote:
>
> No, it seems this thingy is a bit different. According to my
> understanding, patches 3 and 4 from this patchset do the
> following: #1 split the whole queue depth equally over the number
> of hardware queues, and #2 return a tag number which is unique
> host-wide (more or less similar to unique_tag, right?).
>
> #2 is not needed for ibtrs, and #1 can easily be done by dividing
> queue_depth by the number of hw queues at tag set allocation, e.g.
> something like the following:
>
>     ...
>     tags->nr_hw_queues = num_online_cpus();
>     tags->queue_depth  = sess->queue_depth / tags->nr_hw_queues;
>
>     blk_mq_alloc_tag_set(tags);
>
> But this trick won't work out for performance. The ibtrs client
> has a single resource: the set of buffer chunks received from the
> server side. These buffers should be distributed dynamically
> between IO producers according to the load. With a hard split
> of the whole queue depth between hw queues we can forget about
> dynamic load distribution; here is an example:
>
> - say the server shares 1024 buffer chunks for a session (I do not
>   remember the actual number).
>
> - the 1024 buffers are divided equally between the hw queues, let's
>   say 64 (the number of CPUs), so each queue is 16 requests deep.
>
> - only a few CPUs produce IO, and instead of occupying the
>   whole "bandwidth" of the session, i.e. 1024 buffer chunks,
>   we limit ourselves to the small queue depth of each hw
>   queue.
>
> Performance drops significantly when the number of IO producers
> is smaller than the number of hw queues (CPUs), and this can easily
> be tested and proved.
>
> So for this particular ibtrs case the tags should be globally shared,
> and it seems (unfortunately) there is no similar requirement
> for other block devices.

I don't see any difference between what you describe here and 100 dm
volumes sitting on top of a single NVMe device.
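
[Editorial illustration, not part of the original thread: a minimal,
self-contained sketch of the "hard split" tag-set allocation quoted
above. The names example_session, example_mq_ops, example_io_request
and the sess->queue_depth field are assumptions for illustration only,
not taken from the actual ibtrs/ibnbd patches.]

#include <linux/blk-mq.h>
#include <linux/cpumask.h>
#include <linux/numa.h>
#include <linux/string.h>

/* Illustrative stand-ins; not the real ibtrs/ibnbd structures. */
struct example_io_request {
	int dummy;
};

struct example_session {
	struct blk_mq_tag_set tag_set;
	unsigned int queue_depth;	/* buffer chunks granted by the server */
};

static blk_status_t example_queue_rq(struct blk_mq_hw_ctx *hctx,
				     const struct blk_mq_queue_data *bd)
{
	/* A real driver would map the request and post it over RDMA. */
	return BLK_STS_OK;
}

static const struct blk_mq_ops example_mq_ops = {
	.queue_rq = example_queue_rq,
};

/*
 * Hard-split tag-set allocation as in the snippet quoted above: the
 * session-wide queue depth is divided evenly over the hw queues, so a
 * hw queue can never use more than its 1/nr_hw_queues share of tags
 * even while the other queues sit idle -- the load-distribution
 * problem Roman describes.
 */
static int example_setup_tag_set(struct example_session *sess)
{
	struct blk_mq_tag_set *tags = &sess->tag_set;

	memset(tags, 0, sizeof(*tags));
	tags->ops          = &example_mq_ops;
	tags->nr_hw_queues = num_online_cpus();
	tags->queue_depth  = sess->queue_depth / tags->nr_hw_queues;
	tags->numa_node    = NUMA_NO_NODE;
	tags->cmd_size     = sizeof(struct example_io_request);
	tags->flags        = BLK_MQ_F_SHOULD_MERGE;

	return blk_mq_alloc_tag_set(tags);
}

With 1024 chunks and 64 online CPUs this yields a fixed depth of 16 per
hw queue, which is why a single globally shared tag pool is preferred
for ibtrs.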