All of lore.kernel.org
 help / color / mirror / Atom feed
* checksums
@ 2013-05-14  9:14 folkert
  2013-05-14 13:18 ` checksums Theodore Ts'o
  0 siblings, 1 reply; 8+ messages in thread
From: folkert @ 2013-05-14  9:14 UTC (permalink / raw)
  To: linux-ext4

Hi,

Is it possible to "scrub" (check/verify) (the new-) checksums in ext4?

Also: is it planned to add an option to add checksums to the data as
well?


Folkert van Heusden

-- 
www.vanheusden.com/multitail - multitail is tail on steroids. multiple
               windows, filtering, coloring, anything you can think of
----------------------------------------------------------------------
Phone: +31-6-41278122, PGP-key: 1F28D8AE, www.vanheusden.com

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: checksums
  2013-05-14  9:14 checksums folkert
@ 2013-05-14 13:18 ` Theodore Ts'o
  2013-05-14 14:40   ` checksums folkert
  0 siblings, 1 reply; 8+ messages in thread
From: Theodore Ts'o @ 2013-05-14 13:18 UTC (permalink / raw)
  To: folkert; +Cc: linux-ext4

On Tue, May 14, 2013 at 11:14:08AM +0200, folkert wrote:
> 
> Is it possible to "scrub" (check/verify) (the new-) checksums in ext4?

When e2fsck does a full check, it verifies the contents of all of the
metadata blocks, and this includes verifying all of the checksums.

> Also: is it planned to add an option to add checksums to the data as
> well?

That's not currently on the development roadmap; I could imagine
someone deciding to design an extension to ext4 that would do this
probably by storing the checksums in the indirect blocks, but no one
is currently working on it.

Regards,

					- Ted

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: checksums
  2013-05-14 13:18 ` checksums Theodore Ts'o
@ 2013-05-14 14:40   ` folkert
  2013-05-14 18:09     ` checksums Darrick J. Wong
  0 siblings, 1 reply; 8+ messages in thread
From: folkert @ 2013-05-14 14:40 UTC (permalink / raw)
  To: Theodore Ts'o; +Cc: linux-ext4

> > Is it possible to "scrub" (check/verify) (the new-) checksums in ext4?
> 
> When e2fsck does a full check, it verifies the contents of all of the
> metadata blocks, and this includes verifying all of the checksums.

Ok. But that would only when the filesystem is not mounted.
Maybe some on-line functionality for doing so would be nice. I'm not
totally aware of the filesystem structures in memory/on disk, but
reading meta-data from disk which has changes pending in memory/in the
journal would give at worst a verify of old(er) data. I don't think this
(checking occasional old data) is a bad thing - scrubbing a
raid-device/disk doesn't give you the situation for the whole disk(s) in
1 (!) point at time either. If that would be required, then the user
could still unmount the filesystem and do a check.

> > Also: is it planned to add an option to add checksums to the data as
> > well?
> That's not currently on the development roadmap; I could imagine
> someone deciding to design an extension to ext4 that would do this
> probably by storing the checksums in the indirect blocks, but no one
> is currently working on it.

Would crowdfunding such a feature be an option?
E.g. via https://www.catincan.com/ or http://www.kickstarter.com/

On other thing I was thinking of, was a switch that would enable a
read-after-write which would verify that the data is indeed on the
platters/NAND-chips.
(I don't care (too) much about speed - I just want to be really sure my
data doesn't get corrupt/lost without investing huge piles of money on
a san/nas with remote mirroring etc.).


Regards,

Folkert van Heusden

-- 
Winnen in de staatsloterij? Check htp://vanheusden.com/sl.php voor
alle staatsloterij-uitslag statistieken denkbaar en zelfs ook een paar
win-tips!
----------------------------------------------------------------------
Phone: +31-6-41278122, PGP-key: 1F28D8AE, www.vanheusden.com

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: checksums
  2013-05-14 14:40   ` checksums folkert
@ 2013-05-14 18:09     ` Darrick J. Wong
  2013-05-14 18:57       ` checksums folkert
  0 siblings, 1 reply; 8+ messages in thread
From: Darrick J. Wong @ 2013-05-14 18:09 UTC (permalink / raw)
  To: folkert; +Cc: Theodore Ts'o, linux-ext4

On Tue, May 14, 2013 at 04:40:34PM +0200, folkert wrote:
> > > Is it possible to "scrub" (check/verify) (the new-) checksums in ext4?
> > 
> > When e2fsck does a full check, it verifies the contents of all of the
> > metadata blocks, and this includes verifying all of the checksums.
> 
> Ok. But that would only when the filesystem is not mounted.
> Maybe some on-line functionality for doing so would be nice. I'm not
> totally aware of the filesystem structures in memory/on disk, but
> reading meta-data from disk which has changes pending in memory/in the
> journal would give at worst a verify of old(er) data. I don't think this
> (checking occasional old data) is a bad thing - scrubbing a
> raid-device/disk doesn't give you the situation for the whole disk(s) in
> 1 (!) point at time either. If that would be required, then the user
> could still unmount the filesystem and do a check.

Well... if you ran filefrag -v on every file on the disk and read all the
xattrs, you'd scrub nearly all the metadata.  The only things you'd miss are
unallocated parts of the disk, most of which e2fsck also skips.

Ofc e2fsck will also catch incorrect values in the metadata, whereas I think
the kernel only notices /insane/ values.

> > > Also: is it planned to add an option to add checksums to the data as
> > > well?
> > That's not currently on the development roadmap; I could imagine
> > someone deciding to design an extension to ext4 that would do this
> > probably by storing the checksums in the indirect blocks, but no one
> > is currently working on it.

sha256sum < file > file.sha256 ? :D

(If only there was disk space and brain-time to do something where you could
*reconstruct* data.)

--D

> Would crowdfunding such a feature be an option?
> E.g. via https://www.catincan.com/ or http://www.kickstarter.com/
> 
> On other thing I was thinking of, was a switch that would enable a
> read-after-write which would verify that the data is indeed on the
> platters/NAND-chips.
> (I don't care (too) much about speed - I just want to be really sure my
> data doesn't get corrupt/lost without investing huge piles of money on
> a san/nas with remote mirroring etc.).
> 
> 
> Regards,
> 
> Folkert van Heusden
> 
> -- 
> Winnen in de staatsloterij? Check htp://vanheusden.com/sl.php voor
> alle staatsloterij-uitslag statistieken denkbaar en zelfs ook een paar
> win-tips!
> ----------------------------------------------------------------------
> Phone: +31-6-41278122, PGP-key: 1F28D8AE, www.vanheusden.com
> --
> To unsubscribe from this list: send the line "unsubscribe linux-ext4" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: checksums
  2013-05-14 18:09     ` checksums Darrick J. Wong
@ 2013-05-14 18:57       ` folkert
  2013-05-14 19:21         ` checksums Darrick J. Wong
  0 siblings, 1 reply; 8+ messages in thread
From: folkert @ 2013-05-14 18:57 UTC (permalink / raw)
  To: Darrick J. Wong; +Cc: Theodore Ts'o, linux-ext4

> > Ok. But that would only when the filesystem is not mounted.
> > Maybe some on-line functionality for doing so would be nice. I'm not
> > totally aware of the filesystem structures in memory/on disk, but
> > reading meta-data from disk which has changes pending in memory/in the
> > journal would give at worst a verify of old(er) data. I don't think this
> > (checking occasional old data) is a bad thing - scrubbing a
> > raid-device/disk doesn't give you the situation for the whole disk(s) in
> > 1 (!) point at time either. If that would be required, then the user
> > could still unmount the filesystem and do a check.
> 
> Well... if you ran filefrag -v on every file on the disk and read all the
> xattrs, you'd scrub nearly all the metadata.  The only things you'd miss are
> unallocated parts of the disk, most of which e2fsck also skips.

Yes but that is, imho, a bit dirty method.
Because I assume the result will be a message in dmesg and the
filesystem being remounted r/o?
I think it would be better if a nice message on the user's terminal and
an exit code.

> > > That's not currently on the development roadmap; I could imagine
> > > someone deciding to design an extension to ext4 that would do this
> > > probably by storing the checksums in the indirect blocks, but no one
> > > is currently working on it.
> 
> sha256sum < file > file.sha256 ? :D

Then you would need to read the whole file. I think it would be better
to have this on e.g. block-level. 4KB so CRC32 suffices?

> (If only there was disk space and brain-time to do something where you could
> *reconstruct* data.)

ah yes.
These days everything is done by the gpu, maybe it can help with that :)


Folkert van Heusden

-- 
www.vanheusden.com/multitail - multitail is tail on steroids. multiple
               windows, filtering, coloring, anything you can think of
----------------------------------------------------------------------
Phone: +31-6-41278122, PGP-key: 1F28D8AE, www.vanheusden.com

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: checksums
  2013-05-14 18:57       ` checksums folkert
@ 2013-05-14 19:21         ` Darrick J. Wong
  0 siblings, 0 replies; 8+ messages in thread
From: Darrick J. Wong @ 2013-05-14 19:21 UTC (permalink / raw)
  To: folkert; +Cc: Theodore Ts'o, linux-ext4

On Tue, May 14, 2013 at 08:57:54PM +0200, folkert wrote:
> > > Ok. But that would only when the filesystem is not mounted.
> > > Maybe some on-line functionality for doing so would be nice. I'm not
> > > totally aware of the filesystem structures in memory/on disk, but
> > > reading meta-data from disk which has changes pending in memory/in the
> > > journal would give at worst a verify of old(er) data. I don't think this
> > > (checking occasional old data) is a bad thing - scrubbing a
> > > raid-device/disk doesn't give you the situation for the whole disk(s) in
> > > 1 (!) point at time either. If that would be required, then the user
> > > could still unmount the filesystem and do a check.
> > 
> > Well... if you ran filefrag -v on every file on the disk and read all the
> > xattrs, you'd scrub nearly all the metadata.  The only things you'd miss are
> > unallocated parts of the disk, most of which e2fsck also skips.
> 
> Yes but that is, imho, a bit dirty method.
> Because I assume the result will be a message in dmesg and the
> filesystem being remounted r/o?
> I think it would be better if a nice message on the user's terminal and
> an exit code.

You should see "I/O Error" (or whatever -EIO becomes in the message catalog) on
the terminal running filefrag if you hit a checksum error, in addition to a
complaint in dmesg and a ro fs.

> > > > That's not currently on the development roadmap; I could imagine
> > > > someone deciding to design an extension to ext4 that would do this
> > > > probably by storing the checksums in the indirect blocks, but no one
> > > > is currently working on it.
> > 
> > sha256sum < file > file.sha256 ? :D
> 
> Then you would need to read the whole file. I think it would be better
> to have this on e.g. block-level. 4KB so CRC32 suffices?

block or bigalloc-cluster level, I suppose.

--D
> 
> > (If only there was disk space and brain-time to do something where you could
> > *reconstruct* data.)
> 
> ah yes.
> These days everything is done by the gpu, maybe it can help with that :)
> 
> 
> Folkert van Heusden
> 
> -- 
> www.vanheusden.com/multitail - multitail is tail on steroids. multiple
>                windows, filtering, coloring, anything you can think of
> ----------------------------------------------------------------------
> Phone: +31-6-41278122, PGP-key: 1F28D8AE, www.vanheusden.com

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: checksums...
  2009-01-21 18:55 checksums Koen Kooi
@ 2009-01-21 19:15 ` Philip Balister
  0 siblings, 0 replies; 8+ messages in thread
From: Philip Balister @ 2009-01-21 19:15 UTC (permalink / raw)
  To: openembedded-devel

[-- Attachment #1: Type: text/plain, Size: 834 bytes --]

Koen Kooi wrote:
> Hi,
> 
> Could people adding new recipe and people git-am'ing patches adding new 
> recipes please add the corresponding checksums to checksums.ini?
> 
> It's not hard:
> 
> 1) bitbake <recipe>
> 2) watch it fail
> 3) cd $TMPDIR ; cat /OE/org.openembedded.dev/conf/checksums.ini >> 
> checksums.ini ; python 
> /OE/org.openembedded.dev/contrib/source-checker/oe-checksums-sorter.py 
> checksums.ini > /OE/org.openembedded.dev/conf/checksums.ini
> 4) git commit --amend /OE/org.openembedded.dev/conf/checksums.ini

I screwed this up once (and committed a recipe without the checksum) 
when I left ALLOW_INSECURE_DOWNLOADS set in my local.conf. Now I have it 
commented out and am really careful only to use it only when hacking.

I also need to look up what "git commit --amend" does.

Philip

[-- Attachment #2: S/MIME Cryptographic Signature --]
[-- Type: application/x-pkcs7-signature, Size: 3303 bytes --]

^ permalink raw reply	[flat|nested] 8+ messages in thread

* checksums...
@ 2009-01-21 18:55 Koen Kooi
  2009-01-21 19:15 ` checksums Philip Balister
  0 siblings, 1 reply; 8+ messages in thread
From: Koen Kooi @ 2009-01-21 18:55 UTC (permalink / raw)
  To: openembedded-devel

Hi,

Could people adding new recipe and people git-am'ing patches adding new 
recipes please add the corresponding checksums to checksums.ini?

It's not hard:

1) bitbake <recipe>
2) watch it fail
3) cd $TMPDIR ; cat /OE/org.openembedded.dev/conf/checksums.ini >> 
checksums.ini ; python 
/OE/org.openembedded.dev/contrib/source-checker/oe-checksums-sorter.py 
checksums.ini > /OE/org.openembedded.dev/conf/checksums.ini
4) git commit --amend /OE/org.openembedded.dev/conf/checksums.ini

regards,

Koen




^ permalink raw reply	[flat|nested] 8+ messages in thread

end of thread, other threads:[~2013-05-14 19:21 UTC | newest]

Thread overview: 8+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2013-05-14  9:14 checksums folkert
2013-05-14 13:18 ` checksums Theodore Ts'o
2013-05-14 14:40   ` checksums folkert
2013-05-14 18:09     ` checksums Darrick J. Wong
2013-05-14 18:57       ` checksums folkert
2013-05-14 19:21         ` checksums Darrick J. Wong
  -- strict thread matches above, loose matches on Subject: below --
2009-01-21 18:55 checksums Koen Kooi
2009-01-21 19:15 ` checksums Philip Balister

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.