All of lore.kernel.org
 help / color / mirror / Atom feed
* [PATCH V1] hostdisk: Fix linux disk cache workaround on multipath disks
@ 2018-09-13  7:20 Michael Chang
  2018-09-19 13:28 ` Daniel Kiper
  0 siblings, 1 reply; 3+ messages in thread
From: Michael Chang @ 2018-09-13  7:20 UTC (permalink / raw)
  To: The development of GNU GRUB; +Cc: Martin Wilck

In grub-core/osdep/linux/hostdisk.c::grub_util_fd_open_device() there's comment
about linux disk cache issue as below:

  /* Linux has a bug that the disk cache for a whole disk is not consistent
     with the one for a partition of the disk.  */
  {
    ....
  }

As the input argument of grub_util_fd_open_device() is using address in unit of
sector size offset from the "disk", and in a bid to avoid Linux disk cache
inconsistency problem described by comment above, grub translates the address
again into the address offset from partition that has encompassed it, then use
that partition device in place of disk device.

The problem we encountered was that installing grub into multipath disk's
partition didn't work reliably. It boiled down to the disk cache problem
described above as strace result shown it was still using the whole disk
device, not the partition device we would expect.

This patch fixes the problem by adding the missing "/dev/dm-" name scheme
handling in grub_hostdisk_linux_find_partition(). After applying the patch
problem gets solved and we would like to have this fixing patch upstreamed as
it looks good material to be.

v1: Rework commit message.

Signed-off-by: Michael Chang <mchang@suse.com>
---
 grub-core/osdep/linux/hostdisk.c | 6 ++++++
 1 file changed, 6 insertions(+)

diff --git a/grub-core/osdep/linux/hostdisk.c b/grub-core/osdep/linux/hostdisk.c
index 06179fca7..ed530bdc4 100644
--- a/grub-core/osdep/linux/hostdisk.c
+++ b/grub-core/osdep/linux/hostdisk.c
@@ -263,6 +263,12 @@ grub_hostdisk_linux_find_partition (char *dev, grub_disk_addr_t sector)
       p = real_dev + len;
       format = "-part%d";
     }
+  else if (strncmp (real_dev, "/dev/dm-",
+		    sizeof ("/dev/dm-") - 1) == 0)
+    {
+      p = real_dev + len - 1;
+      format = "%d";
+    }
   else if (real_dev[len - 1] >= '0' && real_dev[len - 1] <= '9')
     {
       p = real_dev + len;
-- 
2.13.6



^ permalink raw reply related	[flat|nested] 3+ messages in thread

* Re: [PATCH V1] hostdisk: Fix linux disk cache workaround on multipath disks
  2018-09-13  7:20 [PATCH V1] hostdisk: Fix linux disk cache workaround on multipath disks Michael Chang
@ 2018-09-19 13:28 ` Daniel Kiper
  2018-09-20 10:14   ` Michael Chang
  0 siblings, 1 reply; 3+ messages in thread
From: Daniel Kiper @ 2018-09-19 13:28 UTC (permalink / raw)
  To: Michael Chang; +Cc: The development of GNU GRUB, Martin Wilck

On Thu, Sep 13, 2018 at 03:20:34PM +0800, Michael Chang wrote:
> In grub-core/osdep/linux/hostdisk.c::grub_util_fd_open_device() there's comment
> about linux disk cache issue as below:
>
>   /* Linux has a bug that the disk cache for a whole disk is not consistent
>      with the one for a partition of the disk.  */
>   {
>     ....
>   }
>
> As the input argument of grub_util_fd_open_device() is using address in unit of
> sector size offset from the "disk", and in a bid to avoid Linux disk cache
> inconsistency problem described by comment above, grub translates the address
> again into the address offset from partition that has encompassed it, then use
> that partition device in place of disk device.
>
> The problem we encountered was that installing grub into multipath disk's
> partition didn't work reliably. It boiled down to the disk cache problem
> described above as strace result shown it was still using the whole disk
> device, not the partition device we would expect.
>
> This patch fixes the problem by adding the missing "/dev/dm-" name scheme
> handling in grub_hostdisk_linux_find_partition(). After applying the patch
> problem gets solved and we would like to have this fixing patch upstreamed as
> it looks good material to be.
>
> v1: Rework commit message.

Thanks! Right now it looks much better for me.

> Signed-off-by: Michael Chang <mchang@suse.com>
> ---
>  grub-core/osdep/linux/hostdisk.c | 6 ++++++
>  1 file changed, 6 insertions(+)
>
> diff --git a/grub-core/osdep/linux/hostdisk.c b/grub-core/osdep/linux/hostdisk.c
> index 06179fca7..ed530bdc4 100644
> --- a/grub-core/osdep/linux/hostdisk.c
> +++ b/grub-core/osdep/linux/hostdisk.c
> @@ -263,6 +263,12 @@ grub_hostdisk_linux_find_partition (char *dev, grub_disk_addr_t sector)
>        p = real_dev + len;
>        format = "-part%d";
>      }
> +  else if (strncmp (real_dev, "/dev/dm-",
> +		    sizeof ("/dev/dm-") - 1) == 0)
> +    {
> +      p = real_dev + len - 1;
> +      format = "%d";
> +    }

What will happen if the device path is /dev/dm-10?

>    else if (real_dev[len - 1] >= '0' && real_dev[len - 1] <= '9')
>      {
>        p = real_dev + len;

...and I am afraid that above line is buggy too...
What about the other cases?

Daniel


^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: [PATCH V1] hostdisk: Fix linux disk cache workaround on multipath disks
  2018-09-19 13:28 ` Daniel Kiper
@ 2018-09-20 10:14   ` Michael Chang
  0 siblings, 0 replies; 3+ messages in thread
From: Michael Chang @ 2018-09-20 10:14 UTC (permalink / raw)
  To: Daniel Kiper; +Cc: The development of GNU GRUB, Martin Wilck

On Wed, Sep 19, 2018 at 03:28:34PM +0200, Daniel Kiper wrote:
> On Thu, Sep 13, 2018 at 03:20:34PM +0800, Michael Chang wrote:
> > In grub-core/osdep/linux/hostdisk.c::grub_util_fd_open_device() there's comment
> > about linux disk cache issue as below:
> >
> >   /* Linux has a bug that the disk cache for a whole disk is not consistent
> >      with the one for a partition of the disk.  */
> >   {
> >     ....
> >   }
> >
> > As the input argument of grub_util_fd_open_device() is using address in unit of
> > sector size offset from the "disk", and in a bid to avoid Linux disk cache
> > inconsistency problem described by comment above, grub translates the address
> > again into the address offset from partition that has encompassed it, then use
> > that partition device in place of disk device.
> >
> > The problem we encountered was that installing grub into multipath disk's
> > partition didn't work reliably. It boiled down to the disk cache problem
> > described above as strace result shown it was still using the whole disk
> > device, not the partition device we would expect.
> >
> > This patch fixes the problem by adding the missing "/dev/dm-" name scheme
> > handling in grub_hostdisk_linux_find_partition(). After applying the patch
> > problem gets solved and we would like to have this fixing patch upstreamed as
> > it looks good material to be.
> >
> > v1: Rework commit message.
> 
> Thanks! Right now it looks much better for me.
> 
> > Signed-off-by: Michael Chang <mchang@suse.com>
> > ---
> >  grub-core/osdep/linux/hostdisk.c | 6 ++++++
> >  1 file changed, 6 insertions(+)
> >
> > diff --git a/grub-core/osdep/linux/hostdisk.c b/grub-core/osdep/linux/hostdisk.c
> > index 06179fca7..ed530bdc4 100644
> > --- a/grub-core/osdep/linux/hostdisk.c
> > +++ b/grub-core/osdep/linux/hostdisk.c
> > @@ -263,6 +263,12 @@ grub_hostdisk_linux_find_partition (char *dev, grub_disk_addr_t sector)
> >        p = real_dev + len;
> >        format = "-part%d";
> >      }
> > +  else if (strncmp (real_dev, "/dev/dm-",
> > +		    sizeof ("/dev/dm-") - 1) == 0)
> > +    {
> > +      p = real_dev + len - 1;
> > +      format = "%d";
> > +    }
> 
> What will happen if the device path is /dev/dm-10?

It depends on the number of consective failed attempt of open the device in
sequence order, starting from /dev/dm-1, before reaching /dev/dm-10. The number
is set to 10 and will be reset to 0 for any successful open .. It may or may
not give up /dev/dm-10 depending on how many consective failure encountered.

> 
> >    else if (real_dev[len - 1] >= '0' && real_dev[len - 1] <= '9')
> >      {
> >        p = real_dev + len;
> 
> ...and I am afraid that above line is buggy too...

It seems to guess the partition device name from the disk device name matching
the pattern /dev/.*[0-9] to be /dev/.*[0-9]p[0-9]+, and is wrong with dm
partition name as something like /dev/dm-0p1.

> What about the other cases?

I think the guess work is not good, but that is not the problem the patch
wanted to fix. It is aimed to fix a missing piece in current guess work.

Of course it didn't prevent us from coming up with another patch to improve the
guess work, or even get rid of it by preserving the input partition name
somewhere ...

Thanks,
Michael

> 
> Daniel


^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2018-09-20 10:14 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2018-09-13  7:20 [PATCH V1] hostdisk: Fix linux disk cache workaround on multipath disks Michael Chang
2018-09-19 13:28 ` Daniel Kiper
2018-09-20 10:14   ` Michael Chang

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.