* BTRFS remove device crashing whole system
@ 2020-03-08 21:05 Jason Clara
0 siblings, 0 replies; only message in thread
From: Jason Clara @ 2020-03-08 21:05 UTC (permalink / raw)
To: linux-btrfs
Hi, hoping someone can help with this issue I am having. I have a pool setup as D:RAID6/M:RAID1 which I converted to that from D/M:RAID1 about a month ago.
Since I did that I have been wanting to remove one of my drives from the pool. However, whenever I do it crashes the whole system after running for about 24 hours. This is my third or fourth attempt with it happening and I have updated my kernel and btrfs-progs hoping it would solve the issue
Since the last time it happened I have run a full balance and a scrub and both complete without any issues. I do this between every attempt. But when I try to do the remove it crashes the whole system and I have to reset. Right now a scrub is running since this is RAID6 and suggested to be done after any unclean shutdown. Should be done in 24 hours.
At the moment of the crash the system was being used as a plex server and had three streams going, and some other services. So reads with some writes going on at the same time. Remove had been going for maybe 24 hours before it locked up.
Here is hopefully all the info you need, but please let me know if anything else is required
I am trying to remove one of the 3TB drives (sdb)
My device stats shows 1 corruption_errs for one drive that has been there for a while so no errors have come up with these delete attempts
Pool was created many years ago, and I have been keeping up to date on kernel versions and btrfs-progs manually.
System info
Ubuntu 18.04
Kernel: Linux FileServer 5.5.7-050507-generic #202002281805
btrfs-progs v5.4.1
device usage
/dev/sdd1, ID: 1
Device size: 2.73TiB
Device slack: 0.00B
Data,RAID6: 1.16TiB
Data,RAID6: 810.74GiB
Data,RAID6: 791.52GiB
Unallocated: 1.00MiB
/dev/sdb1, ID: 2
Device size: 2.73TiB
Device slack: 0.00B
Data,RAID6: 1.16TiB
Data,RAID6: 605.74GiB
Data,RAID6: 6.00GiB
Unallocated: 990.52GiB
/dev/sdc1, ID: 3
Device size: 2.73TiB
Device slack: 0.00B
Data,RAID6: 1.16TiB
Data,RAID6: 810.74GiB
Data,RAID6: 791.52GiB
Unallocated: 1.00MiB
/dev/sdi1, ID: 5
Device size: 2.73TiB
Device slack: 1.36TiB
Data,RAID6: 1.16TiB
Data,RAID6: 205.00GiB
Unallocated: 1.00MiB
/dev/sdh1, ID: 6
Device size: 4.55TiB
Device slack: 0.00B
Data,RAID6: 1.16TiB
Data,RAID6: 810.74GiB
Data,RAID6: 581.00GiB
Data,RAID6: 791.52GiB
Data,RAID6: 6.00GiB
Metadata,RAID1: 3.00GiB
System,RAID1: 32.00MiB
Unallocated: 1.24TiB
/dev/sda1, ID: 7
Device size: 7.28TiB
Device slack: 0.00B
Data,RAID6: 1.16TiB
Data,RAID6: 810.74GiB
Data,RAID6: 581.00GiB
Data,RAID6: 791.52GiB
Data,RAID6: 6.00GiB
Unallocated: 3.97TiB
/dev/sdf1, ID: 8
Device size: 7.28TiB
Device slack: 0.00B
Data,RAID6: 1.16TiB
Data,RAID6: 810.74GiB
Data,RAID6: 581.00GiB
Data,RAID6: 791.52GiB
Data,RAID6: 6.00GiB
Metadata,RAID1: 9.00GiB
System,RAID1: 32.00MiB
Unallocated: 3.97TiB
/dev/sdj1, ID: 9
Device size: 7.28TiB
Device slack: 0.00B
Data,RAID6: 1.16TiB
Data,RAID6: 810.74GiB
Data,RAID6: 581.00GiB
Data,RAID6: 791.52GiB
Data,RAID6: 6.00GiB
Metadata,RAID1: 10.00GiB
Unallocated: 3.96TiB
fi show
Label: 'Pool1' uuid: 99935e27-4922-4efa-bf76-5787536dd71f
Total devices 8 FS bytes used 14.91TiB
devid 1 size 2.73TiB used 2.73TiB path /dev/sdd1
devid 2 size 2.73TiB used 1.76TiB path /dev/sdb1
devid 3 size 2.73TiB used 2.73TiB path /dev/sdc1
devid 5 size 1.36TiB used 1.36TiB path /dev/sdi1
devid 6 size 4.55TiB used 3.31TiB path /dev/sdh1
devid 7 size 7.28TiB used 3.30TiB path /dev/sda1
devid 8 size 7.28TiB used 3.31TiB path /dev/sdf1
devid 9 size 7.28TiB used 3.31TiB path /dev/sdj1
fi df
Data, RAID6: total=15.20TiB, used=14.90TiB
System, RAID1: total=32.00MiB, used=1.09MiB
Metadata, RAID1: total=11.00GiB, used=9.58GiB
GlobalReserve, single: total=512.00MiB, used=0.00B
fi usage
WARNING: RAID56 detected, not implemented
Overall:
Device size: 35.93TiB
Device allocated: 22.06GiB
Device unallocated: 35.91TiB
Device missing: 0.00B
Used: 19.17GiB
Free (estimated): 0.00B (min: 8.00EiB)
Data ratio: 0.00
Metadata ratio: 2.00
Global reserve: 512.00MiB (used: 0.00B)
Data,RAID6: Size:15.20TiB, Used:14.90TiB (98.06%)
/dev/sdd1 2.73TiB
/dev/sdb1 1.76TiB
/dev/sdc1 2.73TiB
/dev/sdi1 1.36TiB
/dev/sdh1 3.30TiB
/dev/sda1 3.30TiB
/dev/sdf1 3.30TiB
/dev/sdj1 3.30TiB
Metadata,RAID1: Size:11.00GiB, Used:9.58GiB (87.11%)
/dev/sdh1 3.00GiB
/dev/sdf1 9.00GiB
/dev/sdj1 10.00GiB
System,RAID1: Size:32.00MiB, Used:1.09MiB (3.42%)
/dev/sdh1 32.00MiB
/dev/sdf1 32.00MiB
Unallocated:
/dev/sdd1 1.00MiB
/dev/sdb1 987.52GiB
/dev/sdc1 1.00MiB
/dev/sdi1 1.00MiB
/dev/sdh1 1.24TiB
/dev/sda1 3.97TiB
/dev/sdf1 3.96TiB
/dev/sdj1 3.96TiB
DMESG Log: https://pastebin.com/w09F4nMr
PS. I think my first message was blocked due to being over 100k, so I am resending with pastebin for DMESG. Sorry if there is a duplicate.
^ permalink raw reply [flat|nested] only message in thread
only message in thread, other threads:[~2020-03-08 21:05 UTC | newest]
Thread overview: (only message) (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2020-03-08 21:05 BTRFS remove device crashing whole system Jason Clara
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).