All of lore.kernel.org
 help / color / mirror / Atom feed
From: Ming Lin <mlin@kernel.org>
To: Mike Snitzer <snitzer@redhat.com>
Cc: lkml <linux-kernel@vger.kernel.org>,
	Christoph Hellwig <hch@lst.de>, Jens Axboe <axboe@kernel.dk>,
	Kent Overstreet <kent.overstreet@gmail.com>,
	Dongsu Park <dpark@posteo.net>,
	Christoph Hellwig <hch@infradead.org>,
	Al Viro <viro@zeniv.linux.org.uk>,
	Ming Lei <ming.lei@canonical.com>, Neil Brown <neilb@suse.de>,
	Alasdair Kergon <agk@redhat.com>,
	dm-devel@redhat.com, Lars Ellenberg <drbd-dev@lists.linbit.com>,
	drbd-user@lists.linbit.com, Jiri Kosina <jkosina@suse.cz>,
	Geoff Levand <geoff@infradead.org>, Jim Paris <jim@jtan.com>,
	Joshua Morris <josh.h.morris@us.ibm.com>,
	Philip Kelleher <pjk1939@linux.vnet.ibm.com>,
	Minchan Kim <minchan@kernel.org>, Nitin Gupta <ngupta@vflare.org>,
	Oleg Drokin <oleg.drokin@intel.com>,
	Andreas Dilger <andreas.dilger@intel.com>,
	Ming Lin <ming.l@ssi.samsung.com>
Subject: Re: [PATCH v5 01/11] block: make generic_make_request handle arbitrarily sized bios
Date: Fri, 31 Jul 2015 23:58:10 -0700	[thread overview]
Message-ID: <1438412290.26596.14.camel@hasee> (raw)
In-Reply-To: <20150731213831.GA16464@redhat.com>

On Fri, 2015-07-31 at 17:38 -0400, Mike Snitzer wrote:
> On Fri, Jul 31 2015 at  5:19pm -0400,
> Ming Lin <mlin@kernel.org> wrote:
> 
> > On Fri, Jul 31, 2015 at 12:23 PM, Mike Snitzer <snitzer@redhat.com> wrote:
> > > On Mon, Jul 06 2015 at  3:44P -0400,
> > > Ming Lin <mlin@kernel.org> wrote:
> > >
> > >> From: Kent Overstreet <kent.overstreet@gmail.com>
> > >>
> > >> The way the block layer is currently written, it goes to great lengths
> > >> to avoid having to split bios; upper layer code (such as bio_add_page())
> > >> checks what the underlying device can handle and tries to always create
> > >> bios that don't need to be split.
> > >>
> > >> But this approach becomes unwieldy and eventually breaks down with
> > >> stacked devices and devices with dynamic limits, and it adds a lot of
> > >> complexity. If the block layer could split bios as needed, we could
> > >> eliminate a lot of complexity elsewhere - particularly in stacked
> > >> drivers. Code that creates bios can then create whatever size bios are
> > >> convenient, and more importantly stacked drivers don't have to deal with
> > >> both their own bio size limitations and the limitations of the
> > >> (potentially multiple) devices underneath them.  In the future this will
> > >> let us delete merge_bvec_fn and a bunch of other code.
> > >>
> > >> We do this by adding calls to blk_queue_split() to the various
> > >> make_request functions that need it - a few can already handle arbitrary
> > >> size bios. Note that we add the call _after_ any call to
> > >> blk_queue_bounce(); this means that blk_queue_split() and
> > >> blk_recalc_rq_segments() don't need to be concerned with bouncing
> > >> affecting segment merging.
> > >>
> > >> Some make_request_fn() callbacks were simple enough to audit and verify
> > >> they don't need blk_queue_split() calls. The skipped ones are:
> > >>
> > >>  * nfhd_make_request (arch/m68k/emu/nfblock.c)
> > >>  * axon_ram_make_request (arch/powerpc/sysdev/axonram.c)
> > >>  * simdisk_make_request (arch/xtensa/platforms/iss/simdisk.c)
> > >>  * brd_make_request (ramdisk - drivers/block/brd.c)
> > >>  * mtip_submit_request (drivers/block/mtip32xx/mtip32xx.c)
> > >>  * loop_make_request
> > >>  * null_queue_bio
> > >>  * bcache's make_request fns
> > >>
> > >> Some others are almost certainly safe to remove now, but will be left
> > >> for future patches.
> > >>
> > >> Cc: Jens Axboe <axboe@kernel.dk>
> > >> Cc: Christoph Hellwig <hch@infradead.org>
> > >> Cc: Al Viro <viro@zeniv.linux.org.uk>
> > >> Cc: Ming Lei <ming.lei@canonical.com>
> > >> Cc: Neil Brown <neilb@suse.de>
> > >> Cc: Alasdair Kergon <agk@redhat.com>
> > >> Cc: Mike Snitzer <snitzer@redhat.com>
> > >> Cc: dm-devel@redhat.com
> > >> Cc: Lars Ellenberg <drbd-dev@lists.linbit.com>
> > >> Cc: drbd-user@lists.linbit.com
> > >> Cc: Jiri Kosina <jkosina@suse.cz>
> > >> Cc: Geoff Levand <geoff@infradead.org>
> > >> Cc: Jim Paris <jim@jtan.com>
> > >> Cc: Joshua Morris <josh.h.morris@us.ibm.com>
> > >> Cc: Philip Kelleher <pjk1939@linux.vnet.ibm.com>
> > >> Cc: Minchan Kim <minchan@kernel.org>
> > >> Cc: Nitin Gupta <ngupta@vflare.org>
> > >> Cc: Oleg Drokin <oleg.drokin@intel.com>
> > >> Cc: Andreas Dilger <andreas.dilger@intel.com>
> > >> Acked-by: NeilBrown <neilb@suse.de> (for the 'md/md.c' bits)
> > >> Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
> > >> [dpark: skip more mq-based drivers, resolve merge conflicts, etc.]
> > >> Signed-off-by: Dongsu Park <dpark@posteo.net>
> > >> Signed-off-by: Ming Lin <ming.l@ssi.samsung.com>
> > > ...
> > >> diff --git a/block/blk-merge.c b/block/blk-merge.c
> > >> index 30a0d9f..3707f30 100644
> > >> --- a/block/blk-merge.c
> > >> +++ b/block/blk-merge.c
> > >> @@ -9,12 +9,158 @@
> > >>
> > >>  #include "blk.h"
> > >>
> > >> +static struct bio *blk_bio_discard_split(struct request_queue *q,
> > >> +                                      struct bio *bio,
> > >> +                                      struct bio_set *bs)
> > >> +{
> > >> +     unsigned int max_discard_sectors, granularity;
> > >> +     int alignment;
> > >> +     sector_t tmp;
> > >> +     unsigned split_sectors;
> > >> +
> > >> +     /* Zero-sector (unknown) and one-sector granularities are the same.  */
> > >> +     granularity = max(q->limits.discard_granularity >> 9, 1U);
> > >> +
> > >> +     max_discard_sectors = min(q->limits.max_discard_sectors, UINT_MAX >> 9);
> > >> +     max_discard_sectors -= max_discard_sectors % granularity;
> > >> +
> > >> +     if (unlikely(!max_discard_sectors)) {
> > >> +             /* XXX: warn */
> > >> +             return NULL;
> > >> +     }
> > >> +
> > >> +     if (bio_sectors(bio) <= max_discard_sectors)
> > >> +             return NULL;
> > >> +
> > >> +     split_sectors = max_discard_sectors;
> > >> +
> > >> +     /*
> > >> +      * If the next starting sector would be misaligned, stop the discard at
> > >> +      * the previous aligned sector.
> > >> +      */
> > >> +     alignment = (q->limits.discard_alignment >> 9) % granularity;
> > >> +
> > >> +     tmp = bio->bi_iter.bi_sector + split_sectors - alignment;
> > >> +     tmp = sector_div(tmp, granularity);
> > >> +
> > >> +     if (split_sectors > tmp)
> > >> +             split_sectors -= tmp;
> > >> +
> > >> +     return bio_split(bio, split_sectors, GFP_NOIO, bs);
> > >> +}
> > >
> > > This code to stop the discard at the previous aligned sector could be
> > > the reason why I have 2 device-mapper-test-suite tests in the
> > > 'thin-provisioning' testsuite failing due to this patchset:
> > 
> > I'm setting up the testsuite to debug.
> 
> OK, once setup, to run the 2 tests in question directly you'd do
> something like:
> 
> dmtest run --suite thin-provisioning -n discard_a_fragmented_device
> 
> dmtest run --suite thin-provisioning -n discard_fully_provisioned_device_benchmark
> 
> Again, these tests pass without this patchset.

It's caused by patch 4.
When discard size >=4G, the bio->bi_iter.bi_size overflows.
Below is the new patch.

Christoph,
Could you also help to review it?

Now we still do "misaligned" check in blkdev_issue_discard().
So the same code in blk_bio_discard_split() was removed.
Please see
https://git.kernel.org/cgit/linux/kernel/git/mlin/linux.git/commit/?h=block-generic-req&id=dcc5d9c41

I have updated both patch 1 & 4 on my tree.

commit 9607f737de9c4ca1a81655c320a61c287bf77bf5
Author: Ming Lin <ming.l@ssi.samsung.com>
Date:   Fri May 22 00:46:56 2015 -0700

    block: remove split code in blkdev_issue_discard
    
    The split code in blkdev_issue_discard() can go away now
    that any driver that cares does the split, all we have
    to do is make sure bio size doesn't overflow.
    
    Signed-off-by: Ming Lin <ming.l@ssi.samsung.com>
---
 block/blk-lib.c | 16 +++-------------
 1 file changed, 3 insertions(+), 13 deletions(-)

diff --git a/block/blk-lib.c b/block/blk-lib.c
index 7688ee3..b9e2fca 100644
--- a/block/blk-lib.c
+++ b/block/blk-lib.c
@@ -43,7 +43,7 @@ int blkdev_issue_discard(struct block_device *bdev, sector_t sector,
 	DECLARE_COMPLETION_ONSTACK(wait);
 	struct request_queue *q = bdev_get_queue(bdev);
 	int type = REQ_WRITE | REQ_DISCARD;
-	unsigned int max_discard_sectors, granularity;
+	unsigned int granularity;
 	int alignment;
 	struct bio_batch bb;
 	struct bio *bio;
@@ -60,17 +60,6 @@ int blkdev_issue_discard(struct block_device *bdev, sector_t sector,
 	granularity = max(q->limits.discard_granularity >> 9, 1U);
 	alignment = (bdev_discard_alignment(bdev) >> 9) % granularity;
 
-	/*
-	 * Ensure that max_discard_sectors is of the proper
-	 * granularity, so that requests stay aligned after a split.
-	 */
-	max_discard_sectors = min(q->limits.max_discard_sectors, UINT_MAX >> 9);
-	max_discard_sectors -= max_discard_sectors % granularity;
-	if (unlikely(!max_discard_sectors)) {
-		/* Avoid infinite loop below. Being cautious never hurts. */
-		return -EOPNOTSUPP;
-	}
-
 	if (flags & BLKDEV_DISCARD_SECURE) {
 		if (!blk_queue_secdiscard(q))
 			return -EOPNOTSUPP;
@@ -92,7 +81,8 @@ int blkdev_issue_discard(struct block_device *bdev, sector_t sector,
 			break;
 		}
 
-		req_sects = min_t(sector_t, nr_sects, max_discard_sectors);
+		/* Make sure bi_size doesn't overflow */
+		req_sects = min_t(sector_t, nr_sects, UINT_MAX >> 9);
 
 		/*
 		 * If splitting a request, and the next starting sector would be



WARNING: multiple messages have this Message-ID (diff)
From: Ming Lin <mlin-DgEjT+Ai2ygdnm+yROfE0A@public.gmane.org>
To: Mike Snitzer <snitzer-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>
Cc: Brown <neilb-l3A5Bk7waGM@public.gmane.org>,
	Ming Lei <ming.lei-Z7WLFzj8eWMS+FvcfC7Uqw@public.gmane.org>,
	lkml <linux-kernel-u79uwXL29TY76Z2rM5mHXA@public.gmane.org>,
	dm-devel-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org,
	Christoph Hellwig <hch-jcswGhMUV9g@public.gmane.org>,
	Alasdair Kergon <agk-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>,
	Lars Ellenberg <drbd-dev-cunTk1MwBs8qoQakbn7OcQ@public.gmane.org>,
	Philip Kelleher
	<pjk1939-23VcF4HTsmIX0ybBhKVfKdBPR1lH4CV8@public.gmane.org>,
	Joshua Morris
	<josh.h.morris-r/Jw6+rmf7HQT0dZR+AlfA@public.gmane.org>,
	Christoph Hellwig <hch-wEGCiKHe2LqWVfeAwA7xHQ@public.gmane.org>,
	Kent Overstreet
	<kent.overstreet-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>,
	Nitin Gupta <ngupta-KNmc09w0p+Ednm+yROfE0A@public.gmane.org>,
	Ming Lin <ming.l-Vzezgt5dB6uUEJcrhfAQsw@public.gmane.org>,
	Drokin <oleg.drokin-ral2JQCrhuEAvxtiuMwx3w@public.gmane.org>,
	Oleg-63ez5xqkn6DQT0dZR+AlfA@public.gmane.org,
	Viro <viro-RmSDqhL/yNMiFSDQTTA3OLVCufUGDwFn@public.gmane.org>,
	Andreas Dilger
	<andreas.dilger-ral2JQCrhuEAvxtiuMwx3w@public.gmane.org>,
	Jens Axboe <axboe-tSWWG44O7X1aa/9Udqfwiw@public.gmane.org>,
	Neil-63ez5xqkn6DQT0dZR+AlfA@public.gmane.org,
	Geoff Levand <geoff-wEGCiKHe2LqWVfeAwA7xHQ@public.gmane.org>,
	Jiri Kosina <jkosina-AlSwsSmVLrQ@public.gmane.org>,
	Al-63ez5xqkn6DQT0dZR+AlfA@public.gmane.org,
	Jim Paris <jim-XrPbb/hENzg@public.gmane.org>,
	Minchan Kim <minchan-DgEjT+Ai2ygdnm+yROfE0A@public.gmane.org>,
	Dongsu Park <dpark-VwIFZPTo/vqsTnJN9+BGXg@public.gmane.org>,
	drbd-user-cunTk1MwBs8qoQakbn7OcQ@public.gmane.org
Subject: Re: [PATCH v5 01/11] block: make generic_make_request handle arbitrarily sized bios
Date: Fri, 31 Jul 2015 23:58:10 -0700	[thread overview]
Message-ID: <1438412290.26596.14.camel@hasee> (raw)
In-Reply-To: <20150731213831.GA16464-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>

On Fri, 2015-07-31 at 17:38 -0400, Mike Snitzer wrote:
> On Fri, Jul 31 2015 at  5:19pm -0400,
> Ming Lin <mlin-DgEjT+Ai2ygdnm+yROfE0A@public.gmane.org> wrote:
> 
> > On Fri, Jul 31, 2015 at 12:23 PM, Mike Snitzer <snitzer-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org> wrote:
> > > On Mon, Jul 06 2015 at  3:44P -0400,
> > > Ming Lin <mlin-DgEjT+Ai2ygdnm+yROfE0A@public.gmane.org> wrote:
> > >
> > >> From: Kent Overstreet <kent.overstreet-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
> > >>
> > >> The way the block layer is currently written, it goes to great lengths
> > >> to avoid having to split bios; upper layer code (such as bio_add_page())
> > >> checks what the underlying device can handle and tries to always create
> > >> bios that don't need to be split.
> > >>
> > >> But this approach becomes unwieldy and eventually breaks down with
> > >> stacked devices and devices with dynamic limits, and it adds a lot of
> > >> complexity. If the block layer could split bios as needed, we could
> > >> eliminate a lot of complexity elsewhere - particularly in stacked
> > >> drivers. Code that creates bios can then create whatever size bios are
> > >> convenient, and more importantly stacked drivers don't have to deal with
> > >> both their own bio size limitations and the limitations of the
> > >> (potentially multiple) devices underneath them.  In the future this will
> > >> let us delete merge_bvec_fn and a bunch of other code.
> > >>
> > >> We do this by adding calls to blk_queue_split() to the various
> > >> make_request functions that need it - a few can already handle arbitrary
> > >> size bios. Note that we add the call _after_ any call to
> > >> blk_queue_bounce(); this means that blk_queue_split() and
> > >> blk_recalc_rq_segments() don't need to be concerned with bouncing
> > >> affecting segment merging.
> > >>
> > >> Some make_request_fn() callbacks were simple enough to audit and verify
> > >> they don't need blk_queue_split() calls. The skipped ones are:
> > >>
> > >>  * nfhd_make_request (arch/m68k/emu/nfblock.c)
> > >>  * axon_ram_make_request (arch/powerpc/sysdev/axonram.c)
> > >>  * simdisk_make_request (arch/xtensa/platforms/iss/simdisk.c)
> > >>  * brd_make_request (ramdisk - drivers/block/brd.c)
> > >>  * mtip_submit_request (drivers/block/mtip32xx/mtip32xx.c)
> > >>  * loop_make_request
> > >>  * null_queue_bio
> > >>  * bcache's make_request fns
> > >>
> > >> Some others are almost certainly safe to remove now, but will be left
> > >> for future patches.
> > >>
> > >> Cc: Jens Axboe <axboe-tSWWG44O7X1aa/9Udqfwiw@public.gmane.org>
> > >> Cc: Christoph Hellwig <hch-wEGCiKHe2LqWVfeAwA7xHQ@public.gmane.org>
> > >> Cc: Al Viro <viro-RmSDqhL/yNMiFSDQTTA3OLVCufUGDwFn@public.gmane.org>
> > >> Cc: Ming Lei <ming.lei-Z7WLFzj8eWMS+FvcfC7Uqw@public.gmane.org>
> > >> Cc: Neil Brown <neilb-l3A5Bk7waGM@public.gmane.org>
> > >> Cc: Alasdair Kergon <agk-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>
> > >> Cc: Mike Snitzer <snitzer-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>
> > >> Cc: dm-devel-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org
> > >> Cc: Lars Ellenberg <drbd-dev-cunTk1MwBs8qoQakbn7OcQ@public.gmane.org>
> > >> Cc: drbd-user-cunTk1MwBs8qoQakbn7OcQ@public.gmane.org
> > >> Cc: Jiri Kosina <jkosina-AlSwsSmVLrQ@public.gmane.org>
> > >> Cc: Geoff Levand <geoff-wEGCiKHe2LqWVfeAwA7xHQ@public.gmane.org>
> > >> Cc: Jim Paris <jim-XrPbb/hENzg@public.gmane.org>
> > >> Cc: Joshua Morris <josh.h.morris-r/Jw6+rmf7HQT0dZR+AlfA@public.gmane.org>
> > >> Cc: Philip Kelleher <pjk1939-23VcF4HTsmIX0ybBhKVfKdBPR1lH4CV8@public.gmane.org>
> > >> Cc: Minchan Kim <minchan-DgEjT+Ai2ygdnm+yROfE0A@public.gmane.org>
> > >> Cc: Nitin Gupta <ngupta-KNmc09w0p+Ednm+yROfE0A@public.gmane.org>
> > >> Cc: Oleg Drokin <oleg.drokin-ral2JQCrhuEAvxtiuMwx3w@public.gmane.org>
> > >> Cc: Andreas Dilger <andreas.dilger-ral2JQCrhuEAvxtiuMwx3w@public.gmane.org>
> > >> Acked-by: NeilBrown <neilb-l3A5Bk7waGM@public.gmane.org> (for the 'md/md.c' bits)
> > >> Signed-off-by: Kent Overstreet <kent.overstreet-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
> > >> [dpark: skip more mq-based drivers, resolve merge conflicts, etc.]
> > >> Signed-off-by: Dongsu Park <dpark-VwIFZPTo/vqsTnJN9+BGXg@public.gmane.org>
> > >> Signed-off-by: Ming Lin <ming.l-Vzezgt5dB6uUEJcrhfAQsw@public.gmane.org>
> > > ...
> > >> diff --git a/block/blk-merge.c b/block/blk-merge.c
> > >> index 30a0d9f..3707f30 100644
> > >> --- a/block/blk-merge.c
> > >> +++ b/block/blk-merge.c
> > >> @@ -9,12 +9,158 @@
> > >>
> > >>  #include "blk.h"
> > >>
> > >> +static struct bio *blk_bio_discard_split(struct request_queue *q,
> > >> +                                      struct bio *bio,
> > >> +                                      struct bio_set *bs)
> > >> +{
> > >> +     unsigned int max_discard_sectors, granularity;
> > >> +     int alignment;
> > >> +     sector_t tmp;
> > >> +     unsigned split_sectors;
> > >> +
> > >> +     /* Zero-sector (unknown) and one-sector granularities are the same.  */
> > >> +     granularity = max(q->limits.discard_granularity >> 9, 1U);
> > >> +
> > >> +     max_discard_sectors = min(q->limits.max_discard_sectors, UINT_MAX >> 9);
> > >> +     max_discard_sectors -= max_discard_sectors % granularity;
> > >> +
> > >> +     if (unlikely(!max_discard_sectors)) {
> > >> +             /* XXX: warn */
> > >> +             return NULL;
> > >> +     }
> > >> +
> > >> +     if (bio_sectors(bio) <= max_discard_sectors)
> > >> +             return NULL;
> > >> +
> > >> +     split_sectors = max_discard_sectors;
> > >> +
> > >> +     /*
> > >> +      * If the next starting sector would be misaligned, stop the discard at
> > >> +      * the previous aligned sector.
> > >> +      */
> > >> +     alignment = (q->limits.discard_alignment >> 9) % granularity;
> > >> +
> > >> +     tmp = bio->bi_iter.bi_sector + split_sectors - alignment;
> > >> +     tmp = sector_div(tmp, granularity);
> > >> +
> > >> +     if (split_sectors > tmp)
> > >> +             split_sectors -= tmp;
> > >> +
> > >> +     return bio_split(bio, split_sectors, GFP_NOIO, bs);
> > >> +}
> > >
> > > This code to stop the discard at the previous aligned sector could be
> > > the reason why I have 2 device-mapper-test-suite tests in the
> > > 'thin-provisioning' testsuite failing due to this patchset:
> > 
> > I'm setting up the testsuite to debug.
> 
> OK, once setup, to run the 2 tests in question directly you'd do
> something like:
> 
> dmtest run --suite thin-provisioning -n discard_a_fragmented_device
> 
> dmtest run --suite thin-provisioning -n discard_fully_provisioned_device_benchmark
> 
> Again, these tests pass without this patchset.

It's caused by patch 4.
When discard size >=4G, the bio->bi_iter.bi_size overflows.
Below is the new patch.

Christoph,
Could you also help to review it?

Now we still do "misaligned" check in blkdev_issue_discard().
So the same code in blk_bio_discard_split() was removed.
Please see
https://git.kernel.org/cgit/linux/kernel/git/mlin/linux.git/commit/?h=block-generic-req&id=dcc5d9c41

I have updated both patch 1 & 4 on my tree.

commit 9607f737de9c4ca1a81655c320a61c287bf77bf5
Author: Ming Lin <ming.l-Vzezgt5dB6uUEJcrhfAQsw@public.gmane.org>
Date:   Fri May 22 00:46:56 2015 -0700

    block: remove split code in blkdev_issue_discard
    
    The split code in blkdev_issue_discard() can go away now
    that any driver that cares does the split, all we have
    to do is make sure bio size doesn't overflow.
    
    Signed-off-by: Ming Lin <ming.l-Vzezgt5dB6uUEJcrhfAQsw@public.gmane.org>
---
 block/blk-lib.c | 16 +++-------------
 1 file changed, 3 insertions(+), 13 deletions(-)

diff --git a/block/blk-lib.c b/block/blk-lib.c
index 7688ee3..b9e2fca 100644
--- a/block/blk-lib.c
+++ b/block/blk-lib.c
@@ -43,7 +43,7 @@ int blkdev_issue_discard(struct block_device *bdev, sector_t sector,
 	DECLARE_COMPLETION_ONSTACK(wait);
 	struct request_queue *q = bdev_get_queue(bdev);
 	int type = REQ_WRITE | REQ_DISCARD;
-	unsigned int max_discard_sectors, granularity;
+	unsigned int granularity;
 	int alignment;
 	struct bio_batch bb;
 	struct bio *bio;
@@ -60,17 +60,6 @@ int blkdev_issue_discard(struct block_device *bdev, sector_t sector,
 	granularity = max(q->limits.discard_granularity >> 9, 1U);
 	alignment = (bdev_discard_alignment(bdev) >> 9) % granularity;
 
-	/*
-	 * Ensure that max_discard_sectors is of the proper
-	 * granularity, so that requests stay aligned after a split.
-	 */
-	max_discard_sectors = min(q->limits.max_discard_sectors, UINT_MAX >> 9);
-	max_discard_sectors -= max_discard_sectors % granularity;
-	if (unlikely(!max_discard_sectors)) {
-		/* Avoid infinite loop below. Being cautious never hurts. */
-		return -EOPNOTSUPP;
-	}
-
 	if (flags & BLKDEV_DISCARD_SECURE) {
 		if (!blk_queue_secdiscard(q))
 			return -EOPNOTSUPP;
@@ -92,7 +81,8 @@ int blkdev_issue_discard(struct block_device *bdev, sector_t sector,
 			break;
 		}
 
-		req_sects = min_t(sector_t, nr_sects, max_discard_sectors);
+		/* Make sure bi_size doesn't overflow */
+		req_sects = min_t(sector_t, nr_sects, UINT_MAX >> 9);
 
 		/*
 		 * If splitting a request, and the next starting sector would be

  parent reply	other threads:[~2015-08-01  6:58 UTC|newest]

Thread overview: 103+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2015-07-06  7:44 [PATCH v5 01/11] block: make generic_make_request handle arbitrarily sized bios Ming Lin
2015-07-06  7:44 ` [PATCH v5 02/11] block: simplify bio_add_page() Ming Lin
2015-07-06  7:44 ` [PATCH v5 03/11] bcache: remove driver private bio splitting code Ming Lin
2015-07-06  7:44 ` [PATCH v5 04/11] btrfs: remove bio splitting and merge_bvec_fn() calls Ming Lin
2015-07-06  7:44 ` [PATCH v5 05/11] block: remove split code in blkdev_issue_discard Ming Lin
2015-07-06  7:44 ` [PATCH v5 06/11] md/raid5: split bio for chunk_aligned_read Ming Lin
2015-07-06  7:44 ` [PATCH v5 07/11] md/raid5: get rid of bio_fits_rdev() Ming Lin
2015-07-06  7:44 ` [PATCH v5 08/11] block: kill merge_bvec_fn() completely Ming Lin
2015-07-06  7:44 ` [PATCH v5 09/11] fs: use helper bio_add_page() instead of open coding on bi_io_vec Ming Lin
2015-07-06  7:44 ` [PATCH v5 10/11] block: remove bio_get_nr_vecs() Ming Lin
2015-07-06 10:58   ` Steven Whitehouse
2015-07-06 17:21     ` Ming Lin
2015-07-07  9:04       ` Steven Whitehouse
2015-07-06  7:44 ` [PATCH v5 11/11] Documentation: update notes in biovecs about arbitrarily sized bios Ming Lin
2015-07-31 19:23 ` [PATCH v5 01/11] block: make generic_make_request handle " Mike Snitzer
2015-07-31 21:19   ` Ming Lin
2015-07-31 21:19     ` Ming Lin
2015-07-31 21:38     ` Mike Snitzer
2015-07-31 21:38       ` Mike Snitzer
2015-07-31 22:02       ` Ming Lin
2015-07-31 22:18         ` Ming Lin
2015-08-01  6:58       ` Ming Lin [this message]
2015-08-01  6:58         ` Ming Lin
2015-08-01 16:33         ` Mike Snitzer
2015-08-03  5:58           ` Ming Lin
2015-08-03  5:58             ` Ming Lin
2015-08-04 11:36             ` Christoph Hellwig
2015-08-05  6:03               ` Ming Lin
2015-08-05  6:03                 ` Ming Lin
2015-08-07  7:30                 ` Christoph Hellwig
2015-08-07  7:30                   ` Christoph Hellwig
2015-08-07 23:40                   ` Ming Lin
2015-08-07 23:40                     ` Ming Lin
2015-08-08  0:30                     ` Kent Overstreet
2015-08-08  5:17                       ` Ming Lin
2015-08-08  5:17                         ` Ming Lin
2015-08-08  5:22                         ` Kent Overstreet
2015-08-08 12:35                           ` Christoph Hellwig
2015-08-08 12:35                             ` Christoph Hellwig
2015-08-08  8:52                     ` [dm-devel] " Hannes Reinecke
2015-08-08  9:02                       ` Kent Overstreet
2015-08-13  6:04                         ` Hannes Reinecke
2015-08-07  0:00               ` Kent Overstreet
2015-08-07  7:30                 ` Christoph Hellwig
2015-08-07  7:30                   ` Christoph Hellwig
2015-08-08 16:19           ` [dm-devel] " Martin K. Petersen
2015-08-08 16:19             ` Martin K. Petersen
2015-08-09  5:59             ` Ming Lin
2015-08-09  5:59               ` Ming Lin
2015-08-09  6:41               ` Christoph Hellwig
2015-08-09  6:41                 ` Christoph Hellwig
2015-08-09  6:55                 ` Ming Lin
2015-08-09  6:55                   ` Ming Lin
2015-08-09  7:01                   ` Christoph Hellwig
2015-08-09  7:01                     ` Christoph Hellwig
2015-08-09  7:18                     ` Ming Lin
2015-08-09  7:18                       ` Ming Lin
2015-08-10 15:02                       ` Mike Snitzer
2015-08-10 15:02                         ` Mike Snitzer
2015-08-10 16:14                         ` Ming Lin
2015-08-10 16:14                           ` Ming Lin
2015-08-10 16:18                           ` Ming Lin
2015-08-10 16:18                             ` Ming Lin
2015-08-10 16:40                           ` Martin K. Petersen
2015-08-10 16:40                             ` Martin K. Petersen
2015-08-10 18:13                           ` Mike Snitzer
2015-08-10 22:30                             ` Ming Lin
2015-08-10 22:30                               ` Ming Lin
2015-08-10 16:22                         ` Martin K. Petersen
2015-08-10 16:22                           ` Martin K. Petersen
2015-08-10 18:18                           ` Ming Lin
2015-08-11  2:00                             ` Martin K. Petersen
2015-08-11  2:00                               ` Martin K. Petersen
2015-08-11  2:41                               ` Mike Snitzer
2015-08-11  2:41                                 ` Mike Snitzer
2015-08-11  3:38                                 ` Kent Overstreet
2015-08-11 14:08                                   ` Mike Snitzer
2015-08-11 14:08                                     ` Mike Snitzer
2015-08-11 17:49                                   ` Martin K. Petersen
2015-08-11 17:49                                     ` Martin K. Petersen
2015-08-11 18:05                                     ` Martin K. Petersen
2015-08-11 18:05                                       ` Martin K. Petersen
2015-08-11 20:56                                       ` Ming Lin
2015-08-11 20:56                                         ` Ming Lin
2015-08-12  0:24                                         ` Martin K. Petersen
2015-08-12  0:24                                           ` Martin K. Petersen
2015-08-12  4:41                                           ` Ming Lin
2015-08-12  4:41                                             ` Ming Lin
2015-08-11 17:36                                 ` Martin K. Petersen
2015-08-11 17:36                                   ` Martin K. Petersen
2015-08-11 17:47                                   ` Mike Snitzer
2015-08-11 17:47                                     ` Mike Snitzer
2015-08-11 18:01                                     ` [dm-devel] " Martin K. Petersen
2015-08-11 18:01                                       ` Martin K. Petersen
2015-08-18  5:09                         ` Ming Lin
2015-08-18  7:04                           ` Ming Lin
2015-08-18 14:45                             ` Mike Snitzer
2015-08-18 17:32                               ` Ming Lin
2015-08-18 19:59                                 ` Mike Snitzer
2015-08-18 21:16                                   ` Ming Lin
2015-08-18 21:22                                     ` Mike Snitzer
2015-08-18 22:17                                       ` Ming Lin
  -- strict thread matches above, loose matches on Subject: below --
2015-07-06  7:11 [PATCH v5 00/11] simplify block layer based on immutable biovecs mlin
2015-07-06  7:11 ` [PATCH v5 01/11] block: make generic_make_request handle arbitrarily sized bios mlin

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=1438412290.26596.14.camel@hasee \
    --to=mlin@kernel.org \
    --cc=agk@redhat.com \
    --cc=andreas.dilger@intel.com \
    --cc=axboe@kernel.dk \
    --cc=dm-devel@redhat.com \
    --cc=dpark@posteo.net \
    --cc=drbd-dev@lists.linbit.com \
    --cc=drbd-user@lists.linbit.com \
    --cc=geoff@infradead.org \
    --cc=hch@infradead.org \
    --cc=hch@lst.de \
    --cc=jim@jtan.com \
    --cc=jkosina@suse.cz \
    --cc=josh.h.morris@us.ibm.com \
    --cc=kent.overstreet@gmail.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=minchan@kernel.org \
    --cc=ming.l@ssi.samsung.com \
    --cc=ming.lei@canonical.com \
    --cc=neilb@suse.de \
    --cc=ngupta@vflare.org \
    --cc=oleg.drokin@intel.com \
    --cc=pjk1939@linux.vnet.ibm.com \
    --cc=snitzer@redhat.com \
    --cc=viro@zeniv.linux.org.uk \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.