linux-kernel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* device mapper not reporting no-barrier-support?
@ 2008-02-25 13:26 Anders Henke
  2008-02-25 23:20 ` Andrew Morton
  0 siblings, 1 reply; 9+ messages in thread
From: Anders Henke @ 2008-02-25 13:26 UTC (permalink / raw)
  To: linux-kernel

Hi,

I'm currently stuck between Kernel LVM and DRBD, as I'm using Kernel
2.6.24.2 with DRBD 8.2.5 on top of an LVM2 device (LV).

-LVM2/device mapper doesn't support write barriers
-DRBD uses blkdev_issue_flush() to flush its metadata to disk.
 On a no-barrier-device, DRBD should receive EOPNOTSUPP, but
 it really does receive an EIO. Promptly, DRBD gives the
 error message "drbd0: local disk flush failed with status -5".

The physical disk (in LVM speak) is a RAID1 on a 3ware 9650SE-2LP
controller; the driver 3w-9xxx supports barriers and after moving my D
RBD device from the LV to a single partition on the same RAID1, the 
error messages from DRBD vanished.

I've posted a lengty summary of my findings to

http://lists.linbit.com/pipermail/drbd-user/2008-February/008665.html

... where Lars Ellenberg from DRBD basically responded in

http://lists.linbit.com/pipermail/drbd-user/2008-February/008666.html

... that DRBD does catch the EOPNOTSUPP for blkdev_issue_flush and
BIO_RW_BARRIER, but the lvm implementation of blkdev_issue_flush in
2.6.24.2 aparently does return EIO for blkdev_issue_flush.

So simply the question: how should a top-layer driver check wether a lower
device does support barriers? md-raid does check this way differently than
e.g. XFS does, while DRBD also adds a third way to check this.
Or is this "merely" a bug in drivers/md/dm.c?


Anders
-- 
1&1 Internet AG              System Architect
Brauerstrasse 48             v://49.721.91374.50
D-76135 Karlsruhe            f://49.721.91374.225

Amtsgericht Montabaur HRB 6484
Vorstand: Henning Ahlert, Ralph Dommermuth, Matthias Ehrlich, Andreas Gauger,
Thomas Gottschlich, Matthias Greve, Robert Hoffmann, Markus Huhn, Achim Weiss
Aufsichtsratsvorsitzender: Michael Scheeren

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: device mapper not reporting no-barrier-support?
  2008-02-25 13:26 device mapper not reporting no-barrier-support? Anders Henke
@ 2008-02-25 23:20 ` Andrew Morton
  2008-02-26  1:36   ` [dm-devel] " Alasdair G Kergon
  0 siblings, 1 reply; 9+ messages in thread
From: Andrew Morton @ 2008-02-25 23:20 UTC (permalink / raw)
  To: Anders Henke; +Cc: linux-kernel, dm-devel

On Mon, 25 Feb 2008 14:26:15 +0100 Anders Henke <anders.henke@1und1.de> wrote:

> Hi,
> 
> I'm currently stuck between Kernel LVM and DRBD, as I'm using Kernel
> 2.6.24.2 with DRBD 8.2.5 on top of an LVM2 device (LV).
> 
> -LVM2/device mapper doesn't support write barriers
> -DRBD uses blkdev_issue_flush() to flush its metadata to disk.
>  On a no-barrier-device, DRBD should receive EOPNOTSUPP, but
>  it really does receive an EIO. Promptly, DRBD gives the
>  error message "drbd0: local disk flush failed with status -5".
> 
> The physical disk (in LVM speak) is a RAID1 on a 3ware 9650SE-2LP
> controller; the driver 3w-9xxx supports barriers and after moving my D
> RBD device from the LV to a single partition on the same RAID1, the 
> error messages from DRBD vanished.
> 
> I've posted a lengty summary of my findings to
> 
> http://lists.linbit.com/pipermail/drbd-user/2008-February/008665.html
> 
> ... where Lars Ellenberg from DRBD basically responded in
> 
> http://lists.linbit.com/pipermail/drbd-user/2008-February/008666.html
> 
> ... that DRBD does catch the EOPNOTSUPP for blkdev_issue_flush and
> BIO_RW_BARRIER, but the lvm implementation of blkdev_issue_flush in
> 2.6.24.2 aparently does return EIO for blkdev_issue_flush.
> 
> So simply the question: how should a top-layer driver check wether a lower
> device does support barriers? md-raid does check this way differently than
> e.g. XFS does, while DRBD also adds a third way to check this.
> Or is this "merely" a bug in drivers/md/dm.c?
> 

(cc dm-devel)

I'd say it's a DM bug.  Probably a hard-to-fix one though.

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [dm-devel] Re: device mapper not reporting no-barrier-support?
  2008-02-25 23:20 ` Andrew Morton
@ 2008-02-26  1:36   ` Alasdair G Kergon
  2008-02-26 16:17     ` Jens Axboe
  0 siblings, 1 reply; 9+ messages in thread
From: Alasdair G Kergon @ 2008-02-26  1:36 UTC (permalink / raw)
  To: Andrew Morton, Anders Henke, Jens Axboe
  Cc: device-mapper development, linux-kernel

On Mon, Feb 25, 2008 at 03:20:50PM -0800, Andrew Morton wrote:
> On Mon, 25 Feb 2008 14:26:15 +0100 Anders Henke <anders.henke@1und1.de> wrote:
> > I'm currently stuck between Kernel LVM and DRBD, as I'm using Kernel
> > 2.6.24.2 with DRBD 8.2.5 on top of an LVM2 device (LV).
> > -LVM2/device mapper doesn't support write barriers

That's right.

> > -DRBD uses blkdev_issue_flush() to flush its metadata to disk.

Which won't work if device-mapper is underneath.

> >  On a no-barrier-device, DRBD should receive EOPNOTSUPP, but
> >  it really does receive an EIO. Promptly, DRBD gives the
> >  error message "drbd0: local disk flush failed with status -5".
> > I've posted a lengty summary of my findings to
> > http://lists.linbit.com/pipermail/drbd-user/2008-February/008665.html
> > ... that DRBD does catch the EOPNOTSUPP for blkdev_issue_flush and
> > BIO_RW_BARRIER, but the lvm implementation of blkdev_issue_flush in
> > 2.6.24.2 aparently does return EIO for blkdev_issue_flush.
> I'd say it's a DM bug.

The dm code is unchanged, but look at the limited endio handling in
ll_rw_blk.c:

static void bio_end_empty_barrier(struct bio *bio, int err)
{
        if (err)
                clear_bit(BIO_UPTODATE, &bio->bi_flags);

        complete(bio->bi_private);
}

int blkdev_issue_flush(struct block_device *bdev, sector_t *error_sector)
{
...
        wait_for_completion(&wait);
        if (error_sector)
                *error_sector = bio->bi_sector;
        ret = 0;
        if (!bio_flagged(bio, BIO_UPTODATE))
                ret = -EIO;

Alasdair
-- 
agk@redhat.com

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [dm-devel] Re: device mapper not reporting no-barrier-support?
  2008-02-26  1:36   ` [dm-devel] " Alasdair G Kergon
@ 2008-02-26 16:17     ` Jens Axboe
  2008-02-26 19:33       ` Anders Henke
  0 siblings, 1 reply; 9+ messages in thread
From: Jens Axboe @ 2008-02-26 16:17 UTC (permalink / raw)
  To: Andrew Morton, Anders Henke, device-mapper development, linux-kernel

On Tue, Feb 26 2008, Alasdair G Kergon wrote:
> On Mon, Feb 25, 2008 at 03:20:50PM -0800, Andrew Morton wrote:
> > On Mon, 25 Feb 2008 14:26:15 +0100 Anders Henke <anders.henke@1und1.de> wrote:
> > > I'm currently stuck between Kernel LVM and DRBD, as I'm using Kernel
> > > 2.6.24.2 with DRBD 8.2.5 on top of an LVM2 device (LV).
> > > -LVM2/device mapper doesn't support write barriers
> 
> That's right.
> 
> > > -DRBD uses blkdev_issue_flush() to flush its metadata to disk.
> 
> Which won't work if device-mapper is underneath.
> 
> > >  On a no-barrier-device, DRBD should receive EOPNOTSUPP, but
> > >  it really does receive an EIO. Promptly, DRBD gives the
> > >  error message "drbd0: local disk flush failed with status -5".
> > > I've posted a lengty summary of my findings to
> > > http://lists.linbit.com/pipermail/drbd-user/2008-February/008665.html
> > > ... that DRBD does catch the EOPNOTSUPP for blkdev_issue_flush and
> > > BIO_RW_BARRIER, but the lvm implementation of blkdev_issue_flush in
> > > 2.6.24.2 aparently does return EIO for blkdev_issue_flush.
> > I'd say it's a DM bug.
> 
> The dm code is unchanged, but look at the limited endio handling in
> ll_rw_blk.c:
> 
> static void bio_end_empty_barrier(struct bio *bio, int err)
> {
>         if (err)
>                 clear_bit(BIO_UPTODATE, &bio->bi_flags);
> 
>         complete(bio->bi_private);
> }
> 
> int blkdev_issue_flush(struct block_device *bdev, sector_t *error_sector)
> {
> ...
>         wait_for_completion(&wait);
>         if (error_sector)
>                 *error_sector = bio->bi_sector;
>         ret = 0;
>         if (!bio_flagged(bio, BIO_UPTODATE))
>                 ret = -EIO;

You are right, the return value got broken there. Does this make it
return -EOPNOTSUPP properly for you?

diff --git a/block/blk-barrier.c b/block/blk-barrier.c
index 6901eed..55c5f1f 100644
--- a/block/blk-barrier.c
+++ b/block/blk-barrier.c
@@ -259,8 +259,11 @@ int blk_do_ordered(struct request_queue *q, struct request **rqp)
 
 static void bio_end_empty_barrier(struct bio *bio, int err)
 {
-	if (err)
+	if (err) {
+		if (err == -EOPNOTSUPP)
+			set_bit(BIO_EOPNOTSUPP, &bio->bi_flags);
 		clear_bit(BIO_UPTODATE, &bio->bi_flags);
+	}
 
 	complete(bio->bi_private);
 }
@@ -309,7 +312,9 @@ int blkdev_issue_flush(struct block_device *bdev, sector_t *error_sector)
 		*error_sector = bio->bi_sector;
 
 	ret = 0;
-	if (!bio_flagged(bio, BIO_UPTODATE))
+	if (bio_flagged(bio, BIO_EOPNOTSUPP))
+		ret = -EOPNOTSUPP;
+	else if (!bio_flagged(bio, BIO_UPTODATE))
 		ret = -EIO;
 
 	bio_put(bio);

-- 
Jens Axboe


^ permalink raw reply related	[flat|nested] 9+ messages in thread

* Re: [dm-devel] Re: device mapper not reporting no-barrier-support?
  2008-02-26 16:17     ` Jens Axboe
@ 2008-02-26 19:33       ` Anders Henke
  2008-02-26 19:41         ` Jens Axboe
  0 siblings, 1 reply; 9+ messages in thread
From: Anders Henke @ 2008-02-26 19:33 UTC (permalink / raw)
  To: Jens Axboe; +Cc: Andrew Morton, device-mapper development, linux-kernel

On Tue, Feb 26 2008 Jens Axboe wrote:
> On Tue, Feb 26 2008, Alasdair G Kergon wrote:
> > On Mon, Feb 25, 2008 at 03:20:50PM -0800, Andrew Morton wrote:
> > > On Mon, 25 Feb 2008 14:26:15 +0100 Anders Henke <anders.henke@1und1.de> wrote:
> > > > I'm currently stuck between Kernel LVM and DRBD, as I'm using Kernel
> > > > 2.6.24.2 with DRBD 8.2.5 on top of an LVM2 device (LV).
> > > > -LVM2/device mapper doesn't support write barriers
> > 
> > That's right.
> > 
> > > > -DRBD uses blkdev_issue_flush() to flush its metadata to disk.
> > 
> > Which won't work if device-mapper is underneath.
> > 
> > > >  On a no-barrier-device, DRBD should receive EOPNOTSUPP, but
> > > >  it really does receive an EIO. Promptly, DRBD gives the
> > > >  error message "drbd0: local disk flush failed with status -5".
> > > > I've posted a lengty summary of my findings to
> > > > http://lists.linbit.com/pipermail/drbd-user/2008-February/008665.html
> > > > ... that DRBD does catch the EOPNOTSUPP for blkdev_issue_flush and
> > > > BIO_RW_BARRIER, but the lvm implementation of blkdev_issue_flush in
> > > > 2.6.24.2 aparently does return EIO for blkdev_issue_flush.
> > > I'd say it's a DM bug.
> > 
> > The dm code is unchanged, but look at the limited endio handling in
> > ll_rw_blk.c:
> > 
> > static void bio_end_empty_barrier(struct bio *bio, int err)
> > {
> >         if (err)
> >                 clear_bit(BIO_UPTODATE, &bio->bi_flags);
> > 
> >         complete(bio->bi_private);
> > }
> > 
> > int blkdev_issue_flush(struct block_device *bdev, sector_t *error_sector)
> > {
> > ...
> >         wait_for_completion(&wait);
> >         if (error_sector)
> >                 *error_sector = bio->bi_sector;
> >         ret = 0;
> >         if (!bio_flagged(bio, BIO_UPTODATE))
> >                 ret = -EIO;
> 
> You are right, the return value got broken there. Does this make it
> return -EOPNOTSUPP properly for you?


No, it doesn't.



I've applied your patch manually, as 2.6.24.2. doesn't have a "blk-barrier.c":

---cut
--- linux-2.6.24.2/block/ll_rw_blk.c.prepatch   2008-02-11
06:51:11.000000000 +0100
+++ linux-2.6.24.2/block/ll_rw_blk.c    2008-02-26 20:02:28.514641620
+0100
@@ -2667,8 +2667,11 @@
 
 static void bio_end_empty_barrier(struct bio *bio, int err)
 {
-       if (err)
+       if (err) {
+               if (err == -EOPNOTSUPP)
+                       set_bit(BIO_EOPNOTSUPP, &bio->bi_flags);
                clear_bit(BIO_UPTODATE, &bio->bi_flags);
+       }
 
        complete(bio->bi_private);
 }
---cut

... and the resulting kernel shows exactly the same behaviour than before:

[  752.301388] drbd0: Writing meta data super block now.
[  752.349713] drbd0: local disk flush failed with status -5
[  752.416256] drbd0: local disk flush failed with status -5
[  753.419254] drbd0: local disk flush failed with status -5
[  753.925726] drbd0: local disk flush failed with status -5
[  754.551176] drbd0: local disk flush failed with status -5
[  754.806052] drbd0: local disk flush failed with status -5
[  755.327988] drbd0: local disk flush failed with status -5
[  755.781863] drbd0: local disk flush failed with status -5
[  756.266694] drbd0: local disk flush failed with status -5





Anders

> diff --git a/block/blk-barrier.c b/block/blk-barrier.c
> index 6901eed..55c5f1f 100644
> --- a/block/blk-barrier.c
> +++ b/block/blk-barrier.c
> @@ -259,8 +259,11 @@ int blk_do_ordered(struct request_queue *q, struct request **rqp)
>  
>  static void bio_end_empty_barrier(struct bio *bio, int err)
>  {
> -	if (err)
> +	if (err) {
> +		if (err == -EOPNOTSUPP)
> +			set_bit(BIO_EOPNOTSUPP, &bio->bi_flags);
>  		clear_bit(BIO_UPTODATE, &bio->bi_flags);
> +	}
>  
>  	complete(bio->bi_private);
>  }
> @@ -309,7 +312,9 @@ int blkdev_issue_flush(struct block_device *bdev, sector_t *error_sector)
>  		*error_sector = bio->bi_sector;
>  
>  	ret = 0;
> -	if (!bio_flagged(bio, BIO_UPTODATE))
> +	if (bio_flagged(bio, BIO_EOPNOTSUPP))
> +		ret = -EOPNOTSUPP;
> +	else if (!bio_flagged(bio, BIO_UPTODATE))
>  		ret = -EIO;
>  
>  	bio_put(bio);
> 
> -- 
> Jens Axboe
> 
-- 
1&1 Internet AG              "Use the --force, Luke"
Brauerstrasse 48             v://49.721.91374.50
D-76135 Karlsruhe            f://49.721.91374.225

Amtsgericht Montabaur HRB 6484
Vorstand: Henning Ahlert, Ralph Dommermuth, Matthias Ehrlich, Andreas Gauger,
Thomas Gottschlich, Matthias Greve, Robert Hoffmann, Markus Huhn, Achim Weiss
Aufsichtsratsvorsitzender: Michael Scheeren

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [dm-devel] Re: device mapper not reporting no-barrier-support?
  2008-02-26 19:33       ` Anders Henke
@ 2008-02-26 19:41         ` Jens Axboe
  2008-02-26 20:20           ` Anders Henke
  0 siblings, 1 reply; 9+ messages in thread
From: Jens Axboe @ 2008-02-26 19:41 UTC (permalink / raw)
  To: Anders Henke, Andrew Morton, device-mapper development, linux-kernel

On Tue, Feb 26 2008, Anders Henke wrote:
> On Tue, Feb 26 2008 Jens Axboe wrote:
> > On Tue, Feb 26 2008, Alasdair G Kergon wrote:
> > > On Mon, Feb 25, 2008 at 03:20:50PM -0800, Andrew Morton wrote:
> > > > On Mon, 25 Feb 2008 14:26:15 +0100 Anders Henke <anders.henke@1und1.de> wrote:
> > > > > I'm currently stuck between Kernel LVM and DRBD, as I'm using Kernel
> > > > > 2.6.24.2 with DRBD 8.2.5 on top of an LVM2 device (LV).
> > > > > -LVM2/device mapper doesn't support write barriers
> > > 
> > > That's right.
> > > 
> > > > > -DRBD uses blkdev_issue_flush() to flush its metadata to disk.
> > > 
> > > Which won't work if device-mapper is underneath.
> > > 
> > > > >  On a no-barrier-device, DRBD should receive EOPNOTSUPP, but
> > > > >  it really does receive an EIO. Promptly, DRBD gives the
> > > > >  error message "drbd0: local disk flush failed with status -5".
> > > > > I've posted a lengty summary of my findings to
> > > > > http://lists.linbit.com/pipermail/drbd-user/2008-February/008665.html
> > > > > ... that DRBD does catch the EOPNOTSUPP for blkdev_issue_flush and
> > > > > BIO_RW_BARRIER, but the lvm implementation of blkdev_issue_flush in
> > > > > 2.6.24.2 aparently does return EIO for blkdev_issue_flush.
> > > > I'd say it's a DM bug.
> > > 
> > > The dm code is unchanged, but look at the limited endio handling in
> > > ll_rw_blk.c:
> > > 
> > > static void bio_end_empty_barrier(struct bio *bio, int err)
> > > {
> > >         if (err)
> > >                 clear_bit(BIO_UPTODATE, &bio->bi_flags);
> > > 
> > >         complete(bio->bi_private);
> > > }
> > > 
> > > int blkdev_issue_flush(struct block_device *bdev, sector_t *error_sector)
> > > {
> > > ...
> > >         wait_for_completion(&wait);
> > >         if (error_sector)
> > >                 *error_sector = bio->bi_sector;
> > >         ret = 0;
> > >         if (!bio_flagged(bio, BIO_UPTODATE))
> > >                 ret = -EIO;
> > 
> > You are right, the return value got broken there. Does this make it
> > return -EOPNOTSUPP properly for you?
> 
> 
> No, it doesn't.
> 
> 
> 
> I've applied your patch manually, as 2.6.24.2. doesn't have a "blk-barrier.c":
> 
> ---cut
> --- linux-2.6.24.2/block/ll_rw_blk.c.prepatch   2008-02-11
> 06:51:11.000000000 +0100
> +++ linux-2.6.24.2/block/ll_rw_blk.c    2008-02-26 20:02:28.514641620
> +0100
> @@ -2667,8 +2667,11 @@
>  
>  static void bio_end_empty_barrier(struct bio *bio, int err)
>  {
> -       if (err)
> +       if (err) {
> +               if (err == -EOPNOTSUPP)
> +                       set_bit(BIO_EOPNOTSUPP, &bio->bi_flags);
>                 clear_bit(BIO_UPTODATE, &bio->bi_flags);
> +       }
>  
>         complete(bio->bi_private);
>  }
> ---cut
> 
> ... and the resulting kernel shows exactly the same behaviour than before:

Not surprising, as you missed half of the patch:

> > @@ -309,7 +312,9 @@ int blkdev_issue_flush(struct block_device *bdev, sector_t *error_sector)
> >  		*error_sector = bio->bi_sector;
> >  
> >  	ret = 0;
> > -	if (!bio_flagged(bio, BIO_UPTODATE))
> > +	if (bio_flagged(bio, BIO_EOPNOTSUPP))
> > +		ret = -EOPNOTSUPP;
> > +	else if (!bio_flagged(bio, BIO_UPTODATE))
> >  		ret = -EIO;
> >  
> >  	bio_put(bio);

-- 
Jens Axboe


^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [dm-devel] Re: device mapper not reporting no-barrier-support?
  2008-02-26 19:41         ` Jens Axboe
@ 2008-02-26 20:20           ` Anders Henke
  2008-02-26 22:25             ` Jens Axboe
  0 siblings, 1 reply; 9+ messages in thread
From: Anders Henke @ 2008-02-26 20:20 UTC (permalink / raw)
  To: Jens Axboe; +Cc: Andrew Morton, device-mapper development, linux-kernel

On Tue, Feb 26 2008 schrieb Jens Axboe:
> On Tue, Feb 26 2008, Anders Henke wrote:
> > On Tue, Feb 26 2008 Jens Axboe wrote:
> > > On Tue, Feb 26 2008, Alasdair G Kergon wrote:
> > > > On Mon, Feb 25, 2008 at 03:20:50PM -0800, Andrew Morton wrote:
> > > > > On Mon, 25 Feb 2008 14:26:15 +0100 Anders Henke <anders.henke@1und1.de> wrote:
> > > > > > I'm currently stuck between Kernel LVM and DRBD, as I'm using Kernel
> > > > > > 2.6.24.2 with DRBD 8.2.5 on top of an LVM2 device (LV).
> > > > > > -LVM2/device mapper doesn't support write barriers
> > > > 
> > > > That's right.
> > > > 
> > > > > > -DRBD uses blkdev_issue_flush() to flush its metadata to disk.
> > > > 
> > > > Which won't work if device-mapper is underneath.
> > > > 
> > > > > >  On a no-barrier-device, DRBD should receive EOPNOTSUPP, but
> > > > > >  it really does receive an EIO. Promptly, DRBD gives the
> > > > > >  error message "drbd0: local disk flush failed with status -5".
> > > > > > I've posted a lengty summary of my findings to
> > > > > > http://lists.linbit.com/pipermail/drbd-user/2008-February/008665.html
> > > > > > ... that DRBD does catch the EOPNOTSUPP for blkdev_issue_flush and
> > > > > > BIO_RW_BARRIER, but the lvm implementation of blkdev_issue_flush in
> > > > > > 2.6.24.2 aparently does return EIO for blkdev_issue_flush.
> > > > > I'd say it's a DM bug.
> > > > 
> > > > The dm code is unchanged, but look at the limited endio handling in
> > > > ll_rw_blk.c:
> > > > 
> > > > static void bio_end_empty_barrier(struct bio *bio, int err)
> > > > {
> > > >         if (err)
> > > >                 clear_bit(BIO_UPTODATE, &bio->bi_flags);
> > > > 
> > > >         complete(bio->bi_private);
> > > > }
> > > > 
> > > > int blkdev_issue_flush(struct block_device *bdev, sector_t *error_sector)
> > > > {
> > > > ...
> > > >         wait_for_completion(&wait);
> > > >         if (error_sector)
> > > >                 *error_sector = bio->bi_sector;
> > > >         ret = 0;
> > > >         if (!bio_flagged(bio, BIO_UPTODATE))
> > > >                 ret = -EIO;
> > > 
> > > You are right, the return value got broken there. Does this make it
> > > return -EOPNOTSUPP properly for you?
> > 
> > 
> > No, it doesn't.
> > 
> > 
> > 
> > I've applied your patch manually, as 2.6.24.2. doesn't have a "blk-barrier.c":
> > 
> > ---cut
> > --- linux-2.6.24.2/block/ll_rw_blk.c.prepatch   2008-02-11
> > 06:51:11.000000000 +0100
> > +++ linux-2.6.24.2/block/ll_rw_blk.c    2008-02-26 20:02:28.514641620
> > +0100
> > @@ -2667,8 +2667,11 @@
> >  
> >  static void bio_end_empty_barrier(struct bio *bio, int err)
> >  {
> > -       if (err)
> > +       if (err) {
> > +               if (err == -EOPNOTSUPP)
> > +                       set_bit(BIO_EOPNOTSUPP, &bio->bi_flags);
> >                 clear_bit(BIO_UPTODATE, &bio->bi_flags);
> > +       }
> >  
> >         complete(bio->bi_private);
> >  }
> > ---cut
> > 
> > ... and the resulting kernel shows exactly the same behaviour than before:
> 
> Not surprising, as you missed half of the patch:

Ouch. Thank you for pointing this out.

I've been spending too much time of the day with things who have a negative 
impact on my concentration and I shouldn't manually patch kernels at
this time of the day.

Yes, it's useless to set a bit, but never check it (like in my version of
your patch).

After adding the second part of your patch, the resulting kernel works as
intended:

[  234.946192] drbd0: conn( WFSyncUUID -> SyncTarget ) 
[  234.956176] drbd0: Began resync as SyncTarget (will sync 19542404 KB
[4885601
 bits set]).
[  234.972567] drbd0: Writing meta data super block now.
[  235.018203] drbd0: local disk flush failed with status -95

DRBD sees the EOPNOTSUPP, logs this message only once and doesn't try
any further barrier requests (as intended).

Just for the records, the 2.6.24.2-ready version of your patch:

---cut
--- linux-2.6.24.2/block/ll_rw_blk.c.prepatch   2008-02-11 06:51:11.000000000 +0
100
+++ linux-2.6.24.2/block/ll_rw_blk.c    2008-02-26 20:58:05.552467940 +0100
@@ -2667,8 +2667,11 @@
 
 static void bio_end_empty_barrier(struct bio *bio, int err)
 {
-       if (err)
+       if (err) {
+               if (err == -EOPNOTSUPP)
+                       set_bit(BIO_EOPNOTSUPP, &bio->bi_flags);
                clear_bit(BIO_UPTODATE, &bio->bi_flags);
+       }
 
        complete(bio->bi_private);
 }
@@ -2717,7 +2720,9 @@
                *error_sector = bio->bi_sector;
 
        ret = 0;
-       if (!bio_flagged(bio, BIO_UPTODATE))
+       if (bio_flagged(bio, BIO_EOPNOTSUPP))
+               ret = -EOPNOTSUPP;
+       else if (!bio_flagged(bio, BIO_UPTODATE))
                ret = -EIO;
 
        bio_put(bio);
---cut



Anders
-- 
1&1 Internet AG              better sleep(28800)
Brauerstrasse 48             v://49.721.91374.50
D-76135 Karlsruhe            f://49.721.91374.225

Amtsgericht Montabaur HRB 6484
Vorstand: Henning Ahlert, Ralph Dommermuth, Matthias Ehrlich, Andreas Gauger,
Thomas Gottschlich, Matthias Greve, Robert Hoffmann, Markus Huhn, Achim Weiss
Aufsichtsratsvorsitzender: Michael Scheeren

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [dm-devel] Re: device mapper not reporting no-barrier-support?
  2008-02-26 20:20           ` Anders Henke
@ 2008-02-26 22:25             ` Jens Axboe
  2008-02-28 12:05               ` Anders Henke
  0 siblings, 1 reply; 9+ messages in thread
From: Jens Axboe @ 2008-02-26 22:25 UTC (permalink / raw)
  To: Anders Henke, Andrew Morton, device-mapper development, linux-kernel

On Tue, Feb 26 2008, Anders Henke wrote:
> On Tue, Feb 26 2008 schrieb Jens Axboe:
> > On Tue, Feb 26 2008, Anders Henke wrote:
> > > On Tue, Feb 26 2008 Jens Axboe wrote:
> > > > On Tue, Feb 26 2008, Alasdair G Kergon wrote:
> > > > > On Mon, Feb 25, 2008 at 03:20:50PM -0800, Andrew Morton wrote:
> > > > > > On Mon, 25 Feb 2008 14:26:15 +0100 Anders Henke <anders.henke@1und1.de> wrote:
> > > > > > > I'm currently stuck between Kernel LVM and DRBD, as I'm using Kernel
> > > > > > > 2.6.24.2 with DRBD 8.2.5 on top of an LVM2 device (LV).
> > > > > > > -LVM2/device mapper doesn't support write barriers
> > > > > 
> > > > > That's right.
> > > > > 
> > > > > > > -DRBD uses blkdev_issue_flush() to flush its metadata to disk.
> > > > > 
> > > > > Which won't work if device-mapper is underneath.
> > > > > 
> > > > > > >  On a no-barrier-device, DRBD should receive EOPNOTSUPP, but
> > > > > > >  it really does receive an EIO. Promptly, DRBD gives the
> > > > > > >  error message "drbd0: local disk flush failed with status -5".
> > > > > > > I've posted a lengty summary of my findings to
> > > > > > > http://lists.linbit.com/pipermail/drbd-user/2008-February/008665.html
> > > > > > > ... that DRBD does catch the EOPNOTSUPP for blkdev_issue_flush and
> > > > > > > BIO_RW_BARRIER, but the lvm implementation of blkdev_issue_flush in
> > > > > > > 2.6.24.2 aparently does return EIO for blkdev_issue_flush.
> > > > > > I'd say it's a DM bug.
> > > > > 
> > > > > The dm code is unchanged, but look at the limited endio handling in
> > > > > ll_rw_blk.c:
> > > > > 
> > > > > static void bio_end_empty_barrier(struct bio *bio, int err)
> > > > > {
> > > > >         if (err)
> > > > >                 clear_bit(BIO_UPTODATE, &bio->bi_flags);
> > > > > 
> > > > >         complete(bio->bi_private);
> > > > > }
> > > > > 
> > > > > int blkdev_issue_flush(struct block_device *bdev, sector_t *error_sector)
> > > > > {
> > > > > ...
> > > > >         wait_for_completion(&wait);
> > > > >         if (error_sector)
> > > > >                 *error_sector = bio->bi_sector;
> > > > >         ret = 0;
> > > > >         if (!bio_flagged(bio, BIO_UPTODATE))
> > > > >                 ret = -EIO;
> > > > 
> > > > You are right, the return value got broken there. Does this make it
> > > > return -EOPNOTSUPP properly for you?
> > > 
> > > 
> > > No, it doesn't.
> > > 
> > > 
> > > 
> > > I've applied your patch manually, as 2.6.24.2. doesn't have a "blk-barrier.c":
> > > 
> > > ---cut
> > > --- linux-2.6.24.2/block/ll_rw_blk.c.prepatch   2008-02-11
> > > 06:51:11.000000000 +0100
> > > +++ linux-2.6.24.2/block/ll_rw_blk.c    2008-02-26 20:02:28.514641620
> > > +0100
> > > @@ -2667,8 +2667,11 @@
> > >  
> > >  static void bio_end_empty_barrier(struct bio *bio, int err)
> > >  {
> > > -       if (err)
> > > +       if (err) {
> > > +               if (err == -EOPNOTSUPP)
> > > +                       set_bit(BIO_EOPNOTSUPP, &bio->bi_flags);
> > >                 clear_bit(BIO_UPTODATE, &bio->bi_flags);
> > > +       }
> > >  
> > >         complete(bio->bi_private);
> > >  }
> > > ---cut
> > > 
> > > ... and the resulting kernel shows exactly the same behaviour than before:
> > 
> > Not surprising, as you missed half of the patch:
> 
> Ouch. Thank you for pointing this out.
> 
> I've been spending too much time of the day with things who have a negative 
> impact on my concentration and I shouldn't manually patch kernels at
> this time of the day.
> 
> Yes, it's useless to set a bit, but never check it (like in my version of
> your patch).
> 
> After adding the second part of your patch, the resulting kernel works as
> intended:
> 
> [  234.946192] drbd0: conn( WFSyncUUID -> SyncTarget ) 
> [  234.956176] drbd0: Began resync as SyncTarget (will sync 19542404 KB
> [4885601
>  bits set]).
> [  234.972567] drbd0: Writing meta data super block now.
> [  235.018203] drbd0: local disk flush failed with status -95
> 
> DRBD sees the EOPNOTSUPP, logs this message only once and doesn't try
> any further barrier requests (as intended).

OK good, that's what I expected :-)

I'll queue the patch for 2.6.25, the 2.6.24 should go to stable. Send me
a properly formatted patch and I'll make sure it goes that way.

Thanks for testing!

-- 
Jens Axboe


^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [dm-devel] Re: device mapper not reporting no-barrier-support?
  2008-02-26 22:25             ` Jens Axboe
@ 2008-02-28 12:05               ` Anders Henke
  0 siblings, 0 replies; 9+ messages in thread
From: Anders Henke @ 2008-02-28 12:05 UTC (permalink / raw)
  To: Jens Axboe, Andrew Morton, device-mapper development, linux-kernel

[-- Attachment #1: Type: text/plain, Size: 1077 bytes --]

On Feb 26 2008, Jens Axboe wrote:
> > [  234.946192] drbd0: conn( WFSyncUUID -> SyncTarget ) 
> > [  234.956176] drbd0: Began resync as SyncTarget (will sync 19542404 KB
> > [4885601
> >  bits set]).
> > [  234.972567] drbd0: Writing meta data super block now.
> > [  235.018203] drbd0: local disk flush failed with status -95
> > 
> > DRBD sees the EOPNOTSUPP, logs this message only once and doesn't try
> > any further barrier requests (as intended).
> 
> OK good, that's what I expected :-)
> 
> I'll queue the patch for 2.6.25, the 2.6.24 should go to stable. Send me
> a properly formatted patch and I'll make sure it goes that way.
> 
> Thanks for testing!

'diff -up''d patch is attached.

Anders
-- 
1&1 Internet AG            System Design
Brauerstrasse 48           v://49.721.91374.50
D-76135 Karlsruhe          f://49.721.91374.225

Amtsgericht Montabaur HRB 6484
Vorstand: Henning Ahlert, Ralph Dommermuth, Matthias Ehrlich, Andreas Gauger,
Thomas Gottschlich, Matthias Greve, Robert Hoffmann, Markus Huhn, Achim Weiss
Aufsichtsratsvorsitzender: Michael Scheeren

[-- Attachment #2: ll_rw_blk-eopnotsup-2.6.24.2.patch --]
[-- Type: text/x-diff, Size: 741 bytes --]

--- linux-2.6.24.2-vanilla/block/ll_rw_blk.c	2008-02-11 06:51:11.000000000 +0100
+++ linux-2.6.24.2/block/ll_rw_blk.c	2008-02-28 10:38:22.392987649 +0100
@@ -2667,8 +2667,11 @@ EXPORT_SYMBOL(blk_execute_rq);
 
 static void bio_end_empty_barrier(struct bio *bio, int err)
 {
-	if (err)
+	if (err) {
+		if (err == -EOPNOTSUPP)
+			set_bit(BIO_EOPNOTSUPP, &bio->bi_flags);
 		clear_bit(BIO_UPTODATE, &bio->bi_flags);
+	}
 
 	complete(bio->bi_private);
 }
@@ -2717,7 +2720,9 @@ int blkdev_issue_flush(struct block_devi
 		*error_sector = bio->bi_sector;
 
 	ret = 0;
-	if (!bio_flagged(bio, BIO_UPTODATE))
+	if (bio_flagged(bio, BIO_EOPNOTSUPP))
+		ret = -EOPNOTSUPP;
+	else if (!bio_flagged(bio, BIO_UPTODATE))
 		ret = -EIO;
 
 	bio_put(bio);

^ permalink raw reply	[flat|nested] 9+ messages in thread

end of thread, other threads:[~2008-02-28 12:05 UTC | newest]

Thread overview: 9+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2008-02-25 13:26 device mapper not reporting no-barrier-support? Anders Henke
2008-02-25 23:20 ` Andrew Morton
2008-02-26  1:36   ` [dm-devel] " Alasdair G Kergon
2008-02-26 16:17     ` Jens Axboe
2008-02-26 19:33       ` Anders Henke
2008-02-26 19:41         ` Jens Axboe
2008-02-26 20:20           ` Anders Henke
2008-02-26 22:25             ` Jens Axboe
2008-02-28 12:05               ` Anders Henke

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).