All of lore.kernel.org
 help / color / mirror / Atom feed
From: Wido den Hollander <wido@42on.com>
To: John Spray <jspray@redhat.com>
Cc: Sage Weil <sage@newdream.net>, Loic Dachary <loic@dachary.org>,
	Ceph Development <ceph-devel@vger.kernel.org>
Subject: Re: crush devices class types
Date: Fri, 3 Feb 2017 13:23:11 +0100 (CET)	[thread overview]
Message-ID: <1112245949.9180.1486124591339@ox.pcextreme.nl> (raw)
In-Reply-To: <CALe9h7essupO4hRdYFnYjO5XhnQk_HGX504K_oZeN-8oVEL65A@mail.gmail.com>


> Op 3 februari 2017 om 11:57 schreef John Spray <jspray@redhat.com>:
> 
> 
> On Fri, Feb 3, 2017 at 10:52 AM, Wido den Hollander <wido@42on.com> wrote:
> >
> >> Op 2 februari 2017 om 21:57 schreef Sage Weil <sage@newdream.net>:
> >>
> >>
> >> Hi everyone,
> >>
> >> I made more updates to http://pad.ceph.com/p/crush-types after the CDM
> >> discussion yesterday:
> >>
> >> - consolidated notes into a single proposal
> >> - use otherwise illegal character (e.g., ~) as separater for generated
> >> buckets.  This avoids ambiguity with user-defined buckets.
> >> - class-id $class $id properties for each bucket.  This allows us to
> >> preserve the derivative bucket ids across a decompile->compile cycle so
> >> that data does not move (the bucket id is one of many inputs into crush's
> >> hash during placement).
> >> - simpler rule syntax:
> >>
> >>     rule ssd {
> >>             ruleset 1
> >>             step take default class ssd
> >>             step chooseleaf firstn 0 type host
> >>             step emit
> >>     }
> >>
> >> My rationale here is that we don't want to make this a separate 'step'
> >> call since steps map to underlying crush rule step ops, and this is a
> >> directive only to the compiler.  Making it an optional step argument seems
> >> like the cleanest way to do that.
> >>
> >> Any other comments before we kick this off?
> >>
> >
> > No, looks good to me! Like combining the class into the 'step'.
> >
> > Would be very nice to have this in L!
> >
> > What would be interesting as well is if OSD daemons could somehow access this while parsing their configuration.
> >
> > Eg
> >
> > [class.ssd]
> >   osd_op_threads = 16
> >
> > [class.hdd]
> >    osd_max_backfills = 1
> >
> > That way you can keep configuration generic and makes config management a lot easier.
> 
> I think there's a general desirable concept of applying configs
> according to a CRUSH selector, that would tie in nicely with hosting
> configs on the mons: rather than the OSD doing the filtering, it would
> just be sent the proper configuration for its location.
> 

I would love to see the configs hosted on the MONs instead of a local config file :)

Just wanted to suggest this as in the current situation it is still difficult to have different configs for SSD and HDD based OSDs.

Wido

> John
> 
> >
> > Wido
> >
> >> Thanks!
> >> sage
> >>
> >>
> >> On Mon, 23 Jan 2017, Loic Dachary wrote:
> >>
> >> > Hi Wido,
> >> >
> >> > Updated http://pad.ceph.com/p/crush-types with your proposal for the rule syntax
> >> >
> >> > Cheers
> >> >
> >> > On 01/23/2017 03:29 PM, Sage Weil wrote:
> >> > > On Mon, 23 Jan 2017, Wido den Hollander wrote:
> >> > >>> Op 22 januari 2017 om 17:44 schreef Loic Dachary <loic@dachary.org>:
> >> > >>>
> >> > >>>
> >> > >>> Hi Sage,
> >> > >>>
> >> > >>> You proposed an improvement to the crush map to address different device types (SSD, HDD, etc.)[1]. When learning how to create a crush map, I was indeed confused by the tricks required to create SSD only pools. After years of practice it feels more natural :-)
> >> > >>>
> >> > >>> The source of my confusion was mostly because I had to use a hierarchical description to describe something that is not organized hierarchically. "The rack contains hosts that contain devices" is intuitive. "The rack contains hosts that contain ssd that contain devices" is counter intuitive. Changing:
> >> > >>>
> >> > >>>     # devices
> >> > >>>     device 0 osd.0
> >> > >>>     device 1 osd.1
> >> > >>>     device 2 osd.2
> >> > >>>     device 3 osd.3
> >> > >>>
> >> > >>> into:
> >> > >>>
> >> > >>>     # devices
> >> > >>>     device 0 osd.0 ssd
> >> > >>>     device 1 osd.1 ssd
> >> > >>>     device 2 osd.2 hdd
> >> > >>>     device 3 osd.3 hdd
> >> > >>>
> >> > >>> where ssd/hdd is the device class would be much better. However, using the device class like so:
> >> > >>>
> >> > >>>     rule ssd {
> >> > >>>             ruleset 1
> >> > >>>             type replicated
> >> > >>>             min_size 1
> >> > >>>             max_size 10
> >> > >>>             step take default:ssd
> >> > >>>             step chooseleaf firstn 0 type host
> >> > >>>             step emit
> >> > >>>     }
> >> > >>>
> >> > >>> looks arcane. Since the goal is to simplify the description for the first time user, maybe we could have something like:
> >> > >>>
> >> > >>>     rule ssd {
> >> > >>>             ruleset 1
> >> > >>>             type replicated
> >> > >>>             min_size 1
> >> > >>>             max_size 10
> >> > >>>             device class = ssd
> >> > >>
> >> > >> Would that be sane?
> >> > >>
> >> > >> Why not:
> >> > >>
> >> > >> step set-class ssd
> >> > >> step take default
> >> > >> step chooseleaf firstn 0 type host
> >> > >> step emit
> >> > >>
> >> > >> Since it's a 'step' you take, am I right?
> >> > >
> >> > > Good idea... a step is a cleaner way to extend the syntax!
> >> > >
> >> > > sage
> >> > > --
> >> > > To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> >> > > the body of a message to majordomo@vger.kernel.org
> >> > > More majordomo info at  http://vger.kernel.org/majordomo-info.html
> >> > >
> >> >
> >> > --
> >> > Loïc Dachary, Artisan Logiciel Libre
> >> > --
> >> > To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> >> > the body of a message to majordomo@vger.kernel.org
> >> > More majordomo info at  http://vger.kernel.org/majordomo-info.html
> >> >
> >> >
> > --
> > To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> > the body of a message to majordomo@vger.kernel.org
> > More majordomo info at  http://vger.kernel.org/majordomo-info.html
> --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html

  reply	other threads:[~2017-02-03 12:23 UTC|newest]

Thread overview: 25+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2017-01-22 16:44 crush devices class types Loic Dachary
2017-01-23 13:38 ` Wido den Hollander
2017-01-23 14:29   ` Sage Weil
2017-01-23 14:41     ` Loic Dachary
2017-02-02 20:57       ` Sage Weil
2017-02-03 10:52         ` Wido den Hollander
2017-02-03 10:57           ` John Spray
2017-02-03 12:23             ` Wido den Hollander [this message]
2017-06-28 17:54           ` Kyle Bader
2017-02-03 11:24         ` John Spray
2017-02-03 12:22         ` Loic Dachary
2017-02-03 12:46           ` John Spray
2017-02-03 12:52             ` Brett Niver
2017-02-03 13:21             ` Loic Dachary
2017-02-15 11:57               ` John Spray
2017-02-15 12:14                 ` Loic Dachary
2017-03-08  9:42                   ` Dan van der Ster
2017-03-08 10:05                     ` Loic Dachary
2017-03-08 14:39                     ` Sage Weil
2017-03-08 15:55                       ` Dan van der Ster
2017-03-08 17:00                         ` Sage Weil
     [not found]                           ` <201706281000476718115@gmail.com>
2017-06-28  4:26                             ` Sage Weil
2017-06-28  5:28                       ` Kyle Bader
2017-06-28 13:46                         ` Sage Weil
2017-01-23 14:12 ` Sage Weil

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=1112245949.9180.1486124591339@ox.pcextreme.nl \
    --to=wido@42on.com \
    --cc=ceph-devel@vger.kernel.org \
    --cc=jspray@redhat.com \
    --cc=loic@dachary.org \
    --cc=sage@newdream.net \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.