Unmountable degraded BTRFS RAID6 filesystem

* Unmountable degraded BTRFS RAID6 filesystem
@ 2019-09-03 22:20 Edmund Urbani
  2019-09-03 23:30 ` Chris Murphy
  0 siblings, 1 reply; 12+ messages in thread
From: Edmund Urbani @ 2019-09-03 22:20 UTC (permalink / raw)
  To: linux-btrfs

Hi all,

two days ago my btrfs filesystem became quite slow and the logs showed a
lot of I/O errors on one of the HDDs. I ordered a replacement drive and
tried to remove the failing drive from the filesystem (btrfs device
remove). That removal command did not finish but just sat there without
any output.

Today the new drive arrived. Device removal still had not finished, but
the filesystem had entered read-only mode last night. I shut down the
system to replace the defective drive. However, after the reboot I am no
longer able to mount the filesystem at all or recover any data from it.:(

*****
uname -a

Linux phoenix 4.14.78-gentoo #1 SMP Mon Dec 3 09:25:24 CET 2018 x86_64
AMD Opteron(tm) Processor 6174 AuthenticAMD GNU/Linux

*****
btrfs --version

btrfs-progs v4.19

*****
btrfs fi show

warning, device 8 is missing
warning, device 8 is missing
checksum verify failed on 71133554540544 found B52922D9 wanted C8FB97CF
checksum verify failed on 71133554540544 found 9820D207 wanted 189B50C0
checksum verify failed on 71133554540544 found 9820D207 wanted 189B50C0
bad tree block 71133554540544, bytenr mismatch, want=71133554540544,
have=7227596181724576485
ERROR: cannot read chunk root
Label: none uuid: 108df6ea-2846-4a88-8a50-61aedeef92b4
Total devices 10 FS bytes used 14.71TiB
devid 1 size 2.73TiB used 2.04TiB path /dev/sdg1
devid 2 size 2.73TiB used 2.04TiB path /dev/sdh1
devid 3 size 2.73TiB used 2.04TiB path /dev/sdj1
devid 4 size 2.73TiB used 2.04TiB path /dev/sdi1
devid 5 size 2.73TiB used 2.04TiB path /dev/sde1
devid 6 size 2.73TiB used 2.04TiB path /dev/sdf1
devid 7 size 2.73TiB used 2.04TiB path /dev/sda1
devid 9 size 2.73TiB used 2.04TiB path /dev/sdc1
devid 10 size 2.73TiB used 2.04TiB path /dev/sdd1
*** Some devices missing

*****
dmesg (after attempting mount with -o degraded)

...
[ 8904.358084] BTRFS info (device sda1): turning on discard
[ 8904.358088] BTRFS info (device sda1): allowing degraded mounts
[ 8904.358089] BTRFS info (device sda1): disk space caching is enabled
[ 8904.358091] BTRFS info (device sda1): has skinny extents
[ 8904.361743] BTRFS warning (device sda1): devid 8 uuid
0e8b4aff-6d64-4d31-a135-705421928f94 is missing
[ 8905.705036] BTRFS info (device sda1): bdev (null) errs: wr 0, rd
14809, flush 0, corrupt 4, gen 0
[ 8905.705041] BTRFS info (device sda1): bdev /dev/sda1 errs: wr 0, rd
4, flush 0, corrupt 0, gen 0
[ 8905.705052] BTRFS info (device sda1): bdev /dev/sdf1 errs: wr 0, rd
10543, flush 0, corrupt 0, gen 0
[ 8905.705062] BTRFS info (device sda1): bdev /dev/sdc1 errs: wr 0, rd
8, flush 0, corrupt 0, gen 0
[ 8909.565118] BTRFS error (device sda1): bad tree block start
12170572967447269873 34958581399552
[ 8909.565978] BTRFS error (device sda1): bad tree block start
12170572967447269873 34958581399552
[ 8909.567462] BTRFS error (device sda1): bad tree block start
12170572967447269873 34958581399552
[ 8909.568439] BTRFS error (device sda1): bad tree block start
12170572967447269873 34958581399552
[ 8909.569861] BTRFS error (device sda1): bad tree block start
12170572967447269873 34958581399552
[ 8909.570695] BTRFS error (device sda1): bad tree block start
12170572967447269873 34958581399552
[ 8909.572146] BTRFS error (device sda1): bad tree block start
12170572967447269873 34958581399552
[ 8909.572969] BTRFS error (device sda1): bad tree block start
12170572967447269873 34958581399552
[ 8909.574175] BTRFS error (device sda1): bad tree block start
12170572967447269873 34958581399552
[ 8909.574189] BTRFS error (device sda1): failed to read block groups: -5
[ 8909.635991] BTRFS error (device sda1): open_ctree failed

*****
btrfs check /dev/sda1

Opening filesystem to check...
warning, device 8 is missing
warning, device 8 is missing
checksum verify failed on 71133554540544 found B52922D9 wanted C8FB97CF
checksum verify failed on 71133554540544 found 9820D207 wanted 189B50C0
checksum verify failed on 71133554540544 found 9820D207 wanted 189B50C0
bad tree block 71133554540544, bytenr mismatch, want=71133554540544,
have=7227596181724576485
ERROR: cannot read chunk root
ERROR: cannot open file system

*****

I have tried all the mount / restore options listed here:
https://forums.unraid.net/topic/46802-faq-for-unraid-v6/page/2/?tab=comments#comment-543490

... and all I keep getting is "bad tree block" errors. Superblocks seem
fine (btrfs rescue super-reecover found no problem). I am considering
trying "btrfs rescue chunk-recover" at this point.

Could this help in my situation? What do you think?

Kind regards
 Edmund

-- 
*Liland IT GmbH*

Ferlach ● Wien ● München
Tel: +43 463 220111
Tel: +49 89 
458 15 940
office@Liland.com
https://Liland.com <https://Liland.com> 

Copyright © 2019 Liland IT GmbH 

Diese Mail enthaelt vertrauliche und/oder 
rechtlich geschuetzte Informationen. 
Wenn Sie nicht der richtige Adressat 
sind oder diese Email irrtuemlich erhalten haben, informieren Sie bitte 
sofort den Absender und vernichten Sie diese Mail. Das unerlaubte Kopieren 
sowie die unbefugte Weitergabe dieser Mail ist nicht gestattet. 

This 
email may contain confidential and/or privileged information. 
If you are 
not the intended recipient (or have received this email in error) please 
notify the sender immediately and destroy this email. Any unauthorised 
copying, disclosure or distribution of the material in this email is 
strictly forbidden.

^ permalink raw reply	[flat|nested] 12+ messages in thread