From: Ming Lei <ming.lei@canonical.com>
To: Dongsu Park <dongsu.park@profitbricks.com>
Cc: Linux Kernel Mailing List <linux-kernel@vger.kernel.org>,
	Jens Axboe <axboe@kernel.dk>, Kent Overstreet <kmo@daterainc.com>,
	Ming Lin <mlin@minggr.net>, Christoph Hellwig <hch@infradead.org>,
	Al Viro <viro@zeniv.linux.org.uk>, Neil Brown <neilb@suse.de>,
	Alasdair Kergon <agk@redhat.com>,
	Mike Snitzer <snitzer@redhat.com>,
	dm-devel@redhat.com, drbd-dev@lists.linbit.com,
	Jiri Kosina <jkosina@suse.cz>, Geoff Levand <geoff@infradead.org>,
	Jim Paris <jim@jtan.com>,
	Joshua Morris <josh.h.morris@us.ibm.com>,
	Philip Kelleher <pjk1939@linux.vnet.ibm.com>,
	Minchan Kim <minchan@kernel.org>, Nitin Gupta <ngupta@vflare.org>,
	Oleg Drokin <oleg.drokin@intel.com>,
	Andreas Dilger <andreas.dilger@intel.com>
Subject: Re: [RFC PATCH 01/17] block: make generic_make_request handle arbitrarily sized bios
Date: Thu, 25 Dec 2014 14:09:25 +0800
Message-ID: <CACVXFVOqaBWNa+zR-YsRQfaeCs7oB03mo2Rze4x+_gbOyYj_mA@mail.gmail.com>
In-Reply-To: <d26a9a0fc9a7d1529c115785ee5935d5750782bd.1419241597.git.dongsu.park@profitbricks.com>

On Mon, Dec 22, 2014 at 7:48 PM, Dongsu Park
<dongsu.park@profitbricks.com> wrote:
> From: Kent Overstreet <kmo@daterainc.com>
>
> The way the block layer is currently written, it goes to great lengths
> to avoid having to split bios; upper layer code (such as bio_add_page())
> checks what the underlying device can handle and tries to always create
> bios that don't need to be split.
>
> But this approach becomes unwieldy and eventually breaks down with
> stacked devices and devices with dynamic limits, and it adds a lot of
> complexity. If the block layer could split bios as needed, we could
> eliminate a lot of complexity elsewhere - particularly in stacked
> drivers. Code that creates bios can then create whatever size bios are
> convenient, and more importantly stacked drivers don't have to deal with
> both their own bio size limitations and the limitations of the
> (potentially multiple) devices underneath them.  In the future this will
> let us delete merge_bvec_fn and a bunch of other code.

Splitting bios in the block layer looks like a very good idea.

>
> We do this by adding calls to blk_queue_split() to the various
> make_request functions that need it - a few can already handle arbitrary

I am wondering why the bio isn't split just before q->make_request_fn
is called in generic_make_request().  That way, drivers wouldn't need
to call blk_queue_split() at all.  Is it for performance reasons, or
something else?
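
To make the question concrete, here is a minimal user-space sketch of
splitting centrally, before the driver callback runs. It is not kernel
code: toy_bio, toy_queue and the toy_* functions are hypothetical
stand-ins invented for illustration, and the only limit modelled is a
single max_sectors value.

```c
#include <assert.h>

/* Toy stand-ins for illustration; not the kernel's real structures. */
struct toy_bio {
	unsigned long sector;		/* start sector */
	unsigned int nr_sectors;	/* length in sectors */
};

struct toy_queue {
	unsigned int max_sectors;	/* largest bio the driver accepts */
	void (*make_request_fn)(struct toy_queue *q, struct toy_bio *bio);
	unsigned int calls;		/* number of bios the driver saw */
	unsigned int largest_seen;	/* largest bio the driver saw */
};

/* Hypothetical driver callback: it only records what it was handed. */
static void toy_driver_make_request(struct toy_queue *q, struct toy_bio *bio)
{
	q->calls++;
	if (bio->nr_sectors > q->largest_seen)
		q->largest_seen = bio->nr_sectors;
}

/*
 * Central split: carve the bio into pieces of at most max_sectors and
 * hand each piece to the driver, so the driver never sees an oversized
 * bio and needs no split call of its own.
 */
static void toy_generic_make_request(struct toy_queue *q, struct toy_bio *bio)
{
	assert(q->max_sectors > 0);

	while (bio->nr_sectors > 0) {
		struct toy_bio piece = *bio;

		if (piece.nr_sectors > q->max_sectors)
			piece.nr_sectors = q->max_sectors;

		/* Advance the remainder past the piece being submitted. */
		bio->sector += piece.nr_sectors;
		bio->nr_sectors -= piece.nr_sectors;

		q->make_request_fn(q, &piece);
	}
}
```

With max_sectors set to 8, a 20-sector toy bio reaches the driver as
three pieces of 8, 8 and 4 sectors. The sketch says nothing about the
cost of doing this for every queue, which is the open question above.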

> size bios. Note that we add the call _after_ any call to
> blk_queue_bounce(); this means that blk_queue_split() and
> blk_recalc_rq_segments() don't need to be concerned with bouncing
> affecting segment merging.
>
> Some make_request_fn() callbacks were simple enough to audit and verify
> they don't need blk_queue_split() calls. The skipped ones are:
>
>  * nfhd_make_request (arch/m68k/emu/nfblock.c)
>  * axon_ram_make_request (arch/powerpc/sysdev/axonram.c)
>  * simdisk_make_request (arch/xtensa/platforms/iss/simdisk.c)
>  * brd_make_request (ramdisk - drivers/block/brd.c)
>  * mtip_submit_request (drivers/block/mtip32xx/mtip32xx.c)
>  * loop_make_request
>  * null_queue_bio
>  * bcache's make_request fns

I guess the above drivers don't have a max_sectors/max_segments
limit.

Thanks,
Ming Lei
