All of lore.kernel.org
 help / color / mirror / Atom feed
From: Mike Snitzer <snitzer@redhat.com>
To: "hch@infradead.org" <hch@infradead.org>
Cc: Keith Busch <keith.busch@intel.com>,
	Bart Van Assche <Bart.VanAssche@sandisk.com>,
	"dm-devel@redhat.com" <dm-devel@redhat.com>,
	"linux-nvme@lists.infradead.org" <linux-nvme@lists.infradead.org>,
	"linux-scsi@vger.kernel.org" <linux-scsi@vger.kernel.org>
Subject: Re: hch's native NVMe multipathing [was: Re: [PATCH 1/2] Don't blacklist nvme]
Date: Fri, 17 Feb 2017 09:43:47 -0500	[thread overview]
Message-ID: <20170217144347.GC19043@redhat.com> (raw)
In-Reply-To: <20170217090420.GA29629@infradead.org>

On Fri, Feb 17 2017 at  4:04am -0500,
hch@infradead.org <hch@infradead.org> wrote:

> On Thu, Feb 16, 2017 at 01:21:29PM -0500, Mike Snitzer wrote:
> > multipath-tools has tables that specify all the defaults for a given
> > target backend.  NVMe will just be yet another.
> 
> No, if we get things right it won't.  ALUA already got rid of most
> of the parameter people would have to set under normal conditions,
> and I plan to make sure the NVMe equivalent will do it for all
> parameters.  I am active in the NVMe working group and will do my
> best to get there.  There's a few others folks here that are more or
> less active there as well (Keith, Martin, Jens for example), so I
> think we have a chance.
> 
> That beeing said Keith is right that we'll always have odd setups
> where we need to overrid things, and we will have to provide tunables
> for that.  It's no different from any other kernel subsystem in that.

Before ALUA fixed all that vendor specific fragmentation there was the
even worse fragmentation where different vendors pushed multipathing
into their FC drivers.  James correctly pushed them toward a generic
solution (and DM multipath was born).  If every transport implements its
own multipathing then it'll be a more generic, yet very similar,
fragmentation problem.

But if your native NVMe multipathing really is factored such that the
actual IO fast path is implemented in block core, and transport specific
hooks are called out to as needed, then you've simply reimplement DM
multipath in block core.

Pretty weird place to invest _so_ much energy before you've fully
explored how unworkable DM multipath support for NVMe is.  But I
digress.

WARNING: multiple messages have this Message-ID (diff)
From: snitzer@redhat.com (Mike Snitzer)
Subject: hch's native NVMe multipathing [was: Re: [PATCH 1/2] Don't blacklist nvme]
Date: Fri, 17 Feb 2017 09:43:47 -0500	[thread overview]
Message-ID: <20170217144347.GC19043@redhat.com> (raw)
In-Reply-To: <20170217090420.GA29629@infradead.org>

On Fri, Feb 17 2017 at  4:04am -0500,
hch@infradead.org <hch@infradead.org> wrote:

> On Thu, Feb 16, 2017@01:21:29PM -0500, Mike Snitzer wrote:
> > multipath-tools has tables that specify all the defaults for a given
> > target backend.  NVMe will just be yet another.
> 
> No, if we get things right it won't.  ALUA already got rid of most
> of the parameter people would have to set under normal conditions,
> and I plan to make sure the NVMe equivalent will do it for all
> parameters.  I am active in the NVMe working group and will do my
> best to get there.  There's a few others folks here that are more or
> less active there as well (Keith, Martin, Jens for example), so I
> think we have a chance.
> 
> That beeing said Keith is right that we'll always have odd setups
> where we need to overrid things, and we will have to provide tunables
> for that.  It's no different from any other kernel subsystem in that.

Before ALUA fixed all that vendor specific fragmentation there was the
even worse fragmentation where different vendors pushed multipathing
into their FC drivers.  James correctly pushed them toward a generic
solution (and DM multipath was born).  If every transport implements its
own multipathing then it'll be a more generic, yet very similar,
fragmentation problem.

But if your native NVMe multipathing really is factored such that the
actual IO fast path is implemented in block core, and transport specific
hooks are called out to as needed, then you've simply reimplement DM
multipath in block core.

Pretty weird place to invest _so_ much energy before you've fully
explored how unworkable DM multipath support for NVMe is.  But I
digress.

  reply	other threads:[~2017-02-17 14:43 UTC|newest]

Thread overview: 55+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2017-02-14 21:19 [PATCH 1/2] Don't blacklist nvme Keith Busch
2017-02-14 21:19 ` [PATCH 2/2] Fill NVMe specific path info Keith Busch
2017-02-20 17:57   ` Benjamin Marzinski
2017-02-21 21:06     ` Keith Busch
2017-02-14 21:35 ` [PATCH 1/2] Don't blacklist nvme Bart Van Assche
2017-02-14 23:00   ` Keith Busch
2017-02-15 14:57     ` Christoph Hellwig
2017-02-15 17:24       ` Keith Busch
2017-02-16  1:58         ` Mike Snitzer
2017-02-16  2:01     ` Mike Snitzer
2017-02-16  2:35       ` Mike Snitzer
2017-02-15 14:56 ` Christoph Hellwig
2017-02-16  2:53   ` hch's native NVMe multipathing [was: Re: [PATCH 1/2] Don't blacklist nvme] Mike Snitzer
2017-02-16  2:53     ` Mike Snitzer
2017-02-16  5:00     ` [dm-devel] " Bart Van Assche
2017-02-16  5:00       ` Bart Van Assche
2017-02-16 12:37       ` Mike Snitzer
2017-02-16 12:37         ` Mike Snitzer
2017-02-16 19:46         ` Bart Van Assche
2017-02-16 19:46           ` Bart Van Assche
2017-02-16 20:23           ` Mike Snitzer
2017-02-16 20:23             ` Mike Snitzer
2017-02-16 20:58             ` Bart Van Assche
2017-02-16 20:58               ` Bart Van Assche
2017-02-16 14:26     ` [dm-devel] " Christoph Hellwig
2017-02-16 14:26       ` Christoph Hellwig
2017-02-16 15:13       ` Mike Snitzer
2017-02-16 15:13         ` Mike Snitzer
2017-02-16 17:38         ` Keith Busch
2017-02-16 17:38           ` Keith Busch
2017-02-16 17:37           ` Bart Van Assche
2017-02-16 17:37             ` Bart Van Assche
2017-02-16 18:07             ` Keith Busch
2017-02-16 18:07               ` Keith Busch
2017-02-16 18:21               ` Mike Snitzer
2017-02-16 18:21                 ` Mike Snitzer
2017-02-16 20:40                 ` Keith Busch
2017-02-16 20:40                   ` Keith Busch
2017-02-17  9:04                 ` [dm-devel] " hch
2017-02-17  9:04                   ` hch
2017-02-17 14:43                   ` Mike Snitzer [this message]
2017-02-17 14:43                     ` Mike Snitzer
2017-02-16 18:05         ` Sagi Grimberg
2017-02-16 18:05           ` Sagi Grimberg
2017-02-17  9:05           ` [dm-devel] " Christoph Hellwig
2017-02-17  9:05             ` Christoph Hellwig
2017-02-17 14:37             ` Mike Snitzer
2017-02-17 14:37               ` Mike Snitzer
2017-02-17  9:33         ` [dm-devel] " Christoph Hellwig
2017-02-17  9:33           ` Christoph Hellwig
2017-02-17 14:32           ` Mike Snitzer
2017-02-17 14:32             ` Mike Snitzer
2017-02-20 18:17   ` [PATCH 1/2] Don't blacklist nvme Benjamin Marzinski
2017-02-20 14:14 ` Mike Snitzer
2017-02-27  5:37 ` Christophe Varoqui

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20170217144347.GC19043@redhat.com \
    --to=snitzer@redhat.com \
    --cc=Bart.VanAssche@sandisk.com \
    --cc=dm-devel@redhat.com \
    --cc=hch@infradead.org \
    --cc=keith.busch@intel.com \
    --cc=linux-nvme@lists.infradead.org \
    --cc=linux-scsi@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.