From: waxhead <waxhead@dirtcellar.net>
To: dsterba@suse.cz, Roman Mamedov <rm@romanrm.net>,
Qu Wenruo <quwenruo.btrfs@gmx.com>,
David Sterba <dsterba@suse.com>,
linux-btrfs@vger.kernel.org
Subject: Re: [PATCH] btrfs: allow degenerate raid0/raid10
Date: Sat, 24 Jul 2021 13:04:19 +0200 [thread overview]
Message-ID: <18a1bdd5-321e-68d3-517f-84c8d9bacb9c@dirtcellar.net> (raw)
In-Reply-To: <20210723192145.GF19710@suse.cz>
David Sterba wrote:
> Maybe it's still too new so nobody is used to it and we've always had
> problems with the raid naming scheme anyway.
Perhaps slightly off topic , but I see constantly that people do not
understand how BTRFS "RAID" implementation works. They tend to confuse
it with regular RAID and get angry because they run into "issues" simply
because they do not understand the differences.
I have been an enthusiastic BTRFS user for years, and I actually caught
myself incorrectly explaining how regular RAID works to a guy a while
ago. This happened simply because my mind was so used to how BTRFS uses
this terminology that I did not think about it.
As BTRFS is getting used more and more it may be increasingly difficult
(if not impossible) to get rid of the "RAID" terminology, but in my
opinion also increasingly more important as well.
Some years ago (2018) there was some talk about a new naming scheme
https://marc.info/?l=linux-btrfs&m=136286324417767
While technically spot on I found Hugo's naming scheme difficult. It was
based on this idea:
numCOPIESnumSTRIPESnumPARITY
1CmS1P = Raid5 or 1 copy, max stripes, 1 parity.
I also do not agree with the use of 'copy'. The Oxford dictionary
defines 'copy' as "a thing that is made to be the same as something
else, especially a document or a work of art"
And while some might argue that copying something on disk from memory
makes it a copy, it ceases to be a copy once the memory contents is
erased. I therefore think that replicas is a far better terminology.
I earlier suggested Rnum.Snum.Pnum as a naming scheme which I think is
far more readable so if I may dare to be as bold....
SINGLE = R0.S0.P0 (no replicas, no stripes (any device), no parity)
DUP = R1.S1.P0 (1 replica, 1 stripe (one device), no parity)
RAID0 = R0.Sm.P0 (no replicas, max stripes, no parity)
RAID1 = R1.S0.P0 (1 replica, no stripes (any device), no parity)
RAID1c2 = R2.S0.P0 (2 replicas, no stripes (any device), no parity)
RAID1c3 = R3.S0.P0 (3 replicas, no stripes (any device), no parity)
RAID10 = R1.Sm.P0 (1 replica, max stripes, no parity)
RAID5 = R0.Sm.P1 (no replicas, max stripes, 1 parity)
RAID5 = R0.Sm.P2 (no replicas, max stripes, 2 parity)
This (or Hugo's) type of naming scheme would also easily allow add more
exotic configuration such as S5 e.g. stripe over 5 devices in a 10
device storage system which could increase throughput for certain
workloads (because it leaves half the storage devices "free" for other jobs)
A variant of RAID5 behaving like RAID10 would simply be R1.Sm.P1. Easy
peasy... And just for the record , the old RAID terminology should of
course work for compatibility reasons, but ideally should not be
advertised at all.
Sorry for completely derailing the topic, but I felt it was important to
bring up (and I admit to be overenthusiastic about it). :)
next prev parent reply other threads:[~2021-07-24 11:04 UTC|newest]
Thread overview: 16+ messages / expand[flat|nested] mbox.gz Atom feed top
2021-07-22 19:29 [PATCH] btrfs: allow degenerate raid0/raid10 David Sterba
2021-07-22 22:51 ` Qu Wenruo
2021-07-23 14:08 ` David Sterba
2021-07-23 17:27 ` Roman Mamedov
2021-07-23 19:21 ` David Sterba
2021-07-24 11:04 ` waxhead [this message]
2021-07-24 11:24 ` Hugo Mills
2021-07-24 11:49 ` waxhead
2021-07-24 11:55 ` Hugo Mills
2021-07-24 12:07 ` waxhead
2021-07-24 12:30 ` Hugo Mills
2021-07-24 13:54 ` Forza
2021-07-24 21:15 ` Zygo Blaxell
2021-07-24 22:25 ` waxhead
2021-07-23 22:35 ` Qu Wenruo
2021-07-24 21:31 ` Zygo Blaxell
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=18a1bdd5-321e-68d3-517f-84c8d9bacb9c@dirtcellar.net \
--to=waxhead@dirtcellar.net \
--cc=dsterba@suse.com \
--cc=dsterba@suse.cz \
--cc=linux-btrfs@vger.kernel.org \
--cc=quwenruo.btrfs@gmx.com \
--cc=rm@romanrm.net \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.