All of lore.kernel.org
 help / color / mirror / Atom feed
* Fwd: Bulk block to filename lookups on an incomplete dd copy
       [not found] <CAL=yMgWXEqTj4ZAb2UdL8PCkeKjy=0mb95eVF+qukkOU=GF2Vw@mail.gmail.com>
@ 2015-08-21  2:02 ` Ki'Sak
  0 siblings, 0 replies; only message in thread
From: Ki'Sak @ 2015-08-21  2:02 UTC (permalink / raw)
  To: linux-ext4

Good day,

About a month ago I did an emergency dd copy of my 2TB home folder
drive, which I was able to salvage about 2/3's of the contents before
the source drive completely failed. In the post-mortum I have a
separate recovery drive with untouched data from the incident and a
replacement which is installed and in use. To mitigate spreading
filesystem damage from the incomplete copy, I have rsync'd the
contents from the recovery drive to the replacement drive.

In the last week I've been looking into what is needed to generate a
manifest of damaged files from a block range. For the most part, I
have intact superblocks on the recovery drive and have been looking
for a reliable method of finding what files were allocated to the
uncopied area. Unfortunately, I am faced with an unreasonable initial
need of scanning about 157 million 4k blocks. I'm looking at up to 40
minutes to icheck 1000 blocks if all the blocks are empty. I have
found that testb and the block bitmap are cheap, but is unreliable in
my scenario, where some random samplings I did revealed the block not
marked in use in the bitmap, but had inodes pointing to it.

>From what I can tell, I'm hitting a slow path (high cpu) when there is
an error with reading extents on empty blocks and was wondering if you
had any insight into reducing the processing cost of icheck with empty
block, which is averaging about 2.5 seconds or if I've missed
something fundimental in my approach.

I've thrown together a script at
https://github.com/Tele42/ext4scan/blob/master/ext4scan.sh for
posterity.

Thank you for your time,
Tele42

PS. Sorry for double sending this to Mr. Ts'o.

^ permalink raw reply	[flat|nested] only message in thread

only message in thread, other threads:[~2015-08-21  2:02 UTC | newest]

Thread overview: (only message) (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
     [not found] <CAL=yMgWXEqTj4ZAb2UdL8PCkeKjy=0mb95eVF+qukkOU=GF2Vw@mail.gmail.com>
2015-08-21  2:02 ` Fwd: Bulk block to filename lookups on an incomplete dd copy Ki'Sak

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.