linux-btrfs.vger.kernel.org archive mirror
* BTRFS remove device crashing whole system
@ 2020-03-08 21:05 Jason Clara
From: Jason Clara @ 2020-03-08 21:05 UTC (permalink / raw)
  To: linux-btrfs

Hi, I'm hoping someone can help with an issue I am having. I have a pool set up as D:RAID6/M:RAID1, which I converted from D/M:RAID1 about a month ago.

Since the conversion I have been wanting to remove one of my drives from the pool. However, whenever I try, the whole system crashes after the remove has been running for about 24 hours. This is my third or fourth attempt with the same result, and I have updated my kernel and btrfs-progs hoping it would solve the issue.

Since the last time it happened I have run a full balance and a scrub, and both completed without any issues. I do this between every attempt. But when I try the remove, it crashes the whole system and I have to reset. Right now a scrub is running, since this is RAID6 and a scrub is suggested after any unclean shutdown. It should be done in 24 hours.

At the moment of the crash the system was being used as a Plex server with three streams going, plus some other services, so there were reads with some writes going on at the same time. The remove had been running for maybe 24 hours before the system locked up.

Here is hopefully all the info you need, but please let me know if anything else is required.

I am trying to remove one of the 3TB drives (sdb, shown as /dev/sdb1, devid 2, in the output below).

My device stats show 1 corruption_errs for one drive, but that has been there for a while, so no new errors have come up with these delete attempts.
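
For anyone trying to reproduce this, the cycle described above (full balance, scrub, check device stats, then remove) corresponds roughly to the commands built below. This is only a sketch: the exact invocations are not quoted in this message, the mount point `/mnt/pool1` is a hypothetical placeholder, and the script prints the commands rather than running them.

```python
import shlex

# Assumptions, not taken from the message: the mount point is a placeholder,
# and the exact flags used in the original attempts are not known.
MOUNT_POINT = "/mnt/pool1"
TARGET_DEVICE = "/dev/sdb1"  # devid 2 in the listings


def removal_cycle(mount_point, device):
    """Return the btrfs commands for one balance/scrub/stats/remove cycle."""
    return [
        ["btrfs", "balance", "start", "--full-balance", mount_point],
        ["btrfs", "scrub", "start", "-B", mount_point],  # -B: wait for completion
        ["btrfs", "device", "stats", mount_point],
        ["btrfs", "device", "remove", device, mount_point],
    ]


# Print the commands only; nothing is executed against a filesystem here.
for argv in removal_cycle(MOUNT_POINT, TARGET_DEVICE):
    print(shlex.join(argv))
```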

Pool was created many years ago, and I have been keeping up to date on kernel versions and btrfs-progs manually.

System info
Ubuntu 18.04 
Kernel: Linux FileServer 5.5.7-050507-generic #202002281805 
btrfs-progs v5.4.1

device usage
/dev/sdd1, ID: 1
 Device size:             2.73TiB
 Device slack:              0.00B
 Data,RAID6:              1.16TiB
 Data,RAID6:            810.74GiB
 Data,RAID6:            791.52GiB
 Unallocated:             1.00MiB

/dev/sdb1, ID: 2
 Device size:             2.73TiB
 Device slack:              0.00B
 Data,RAID6:              1.16TiB
 Data,RAID6:            605.74GiB
 Data,RAID6:              6.00GiB
 Unallocated:           990.52GiB

/dev/sdc1, ID: 3
 Device size:             2.73TiB
 Device slack:              0.00B
 Data,RAID6:              1.16TiB
 Data,RAID6:            810.74GiB
 Data,RAID6:            791.52GiB
 Unallocated:             1.00MiB

/dev/sdi1, ID: 5
 Device size:             2.73TiB
 Device slack:            1.36TiB
 Data,RAID6:              1.16TiB
 Data,RAID6:            205.00GiB
 Unallocated:             1.00MiB

/dev/sdh1, ID: 6
 Device size:             4.55TiB
 Device slack:              0.00B
 Data,RAID6:              1.16TiB
 Data,RAID6:            810.74GiB
 Data,RAID6:            581.00GiB
 Data,RAID6:            791.52GiB
 Data,RAID6:              6.00GiB
 Metadata,RAID1:          3.00GiB
 System,RAID1:           32.00MiB
 Unallocated:             1.24TiB

/dev/sda1, ID: 7
 Device size:             7.28TiB
 Device slack:              0.00B
 Data,RAID6:              1.16TiB
 Data,RAID6:            810.74GiB
 Data,RAID6:            581.00GiB
 Data,RAID6:            791.52GiB
 Data,RAID6:              6.00GiB
 Unallocated:             3.97TiB

/dev/sdf1, ID: 8
 Device size:             7.28TiB
 Device slack:              0.00B
 Data,RAID6:              1.16TiB
 Data,RAID6:            810.74GiB
 Data,RAID6:            581.00GiB
 Data,RAID6:            791.52GiB
 Data,RAID6:              6.00GiB
 Metadata,RAID1:          9.00GiB
 System,RAID1:           32.00MiB
 Unallocated:             3.97TiB

/dev/sdj1, ID: 9
 Device size:             7.28TiB
 Device slack:              0.00B
 Data,RAID6:              1.16TiB
 Data,RAID6:            810.74GiB
 Data,RAID6:            581.00GiB
 Data,RAID6:            791.52GiB
 Data,RAID6:              6.00GiB
 Metadata,RAID1:         10.00GiB
 Unallocated:             3.96TiB

fi show
Label: 'Pool1'  uuid: 99935e27-4922-4efa-bf76-5787536dd71f
	Total devices 8 FS bytes used 14.91TiB
	devid    1 size 2.73TiB used 2.73TiB path /dev/sdd1
	devid    2 size 2.73TiB used 1.76TiB path /dev/sdb1
	devid    3 size 2.73TiB used 2.73TiB path /dev/sdc1
	devid    5 size 1.36TiB used 1.36TiB path /dev/sdi1
	devid    6 size 4.55TiB used 3.31TiB path /dev/sdh1
	devid    7 size 7.28TiB used 3.30TiB path /dev/sda1
	devid    8 size 7.28TiB used 3.31TiB path /dev/sdf1
	devid    9 size 7.28TiB used 3.31TiB path /dev/sdj1
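
As a rough cross-check of the listing above, the sketch below adds up the size and used columns and shows which of the remaining devices still have room for whatever is relocated off /dev/sdb1. The figures are copied from the `fi show` output (in TiB, already rounded by btrfs-progs), so the results are approximate; the 0.01 TiB threshold is an arbitrary choice to ignore rounding noise. It says nothing about the crash itself.

```python
# (size, used) in TiB, copied from the `fi show` output above.
devices = {
    "sdd1": (2.73, 2.73),
    "sdb1": (2.73, 1.76),
    "sdc1": (2.73, 2.73),
    "sdi1": (1.36, 1.36),
    "sdh1": (4.55, 3.31),
    "sda1": (7.28, 3.30),
    "sdf1": (7.28, 3.31),
    "sdj1": (7.28, 3.31),
}
target = "sdb1"  # the device being removed

total_size = sum(size for size, _ in devices.values())
total_used = sum(used for _, used in devices.values())
to_move = devices[target][1]  # allocation that has to leave the target

# Free space per remaining device, rounded to the precision of the input.
free_elsewhere = {
    name: round(size - used, 2)
    for name, (size, used) in devices.items()
    if name != target
}
# Devices with more than rounding noise left (threshold is an assumption).
with_room = [name for name, free in free_elsewhere.items() if free > 0.01]

print(f"total size {total_size:.2f} TiB, total used {total_used:.2f} TiB")
print(f"allocated on {target}: {to_move:.2f} TiB")
print("devices with free space:", with_room)
```

Only the four larger devices have unallocated space, so any chunks rewritten during the remove can only grow onto those.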

fi df
Data, RAID6: total=15.20TiB, used=14.90TiB
System, RAID1: total=32.00MiB, used=1.09MiB
Metadata, RAID1: total=11.00GiB, used=9.58GiB
GlobalReserve, single: total=512.00MiB, used=0.00B

fi usage
WARNING: RAID56 detected, not implemented
Overall:
   Device size:		  35.93TiB
   Device allocated:		  22.06GiB
   Device unallocated:		  35.91TiB
   Device missing:		     0.00B
   Used:			  19.17GiB
   Free (estimated):		     0.00B	(min: 8.00EiB)
   Data ratio:			      0.00
   Metadata ratio:		      2.00
   Global reserve:		 512.00MiB	(used: 0.00B)

Data,RAID6: Size:15.20TiB, Used:14.90TiB (98.06%)
  /dev/sdd1	   2.73TiB
  /dev/sdb1	   1.76TiB
  /dev/sdc1	   2.73TiB
  /dev/sdi1	   1.36TiB
  /dev/sdh1	   3.30TiB
  /dev/sda1	   3.30TiB
  /dev/sdf1	   3.30TiB
  /dev/sdj1	   3.30TiB

Metadata,RAID1: Size:11.00GiB, Used:9.58GiB (87.11%)
  /dev/sdh1	   3.00GiB
  /dev/sdf1	   9.00GiB
  /dev/sdj1	  10.00GiB

System,RAID1: Size:32.00MiB, Used:1.09MiB (3.42%)
  /dev/sdh1	  32.00MiB
  /dev/sdf1	  32.00MiB

Unallocated:
  /dev/sdd1	   1.00MiB
  /dev/sdb1	 987.52GiB
  /dev/sdc1	   1.00MiB
  /dev/sdi1	   1.00MiB
  /dev/sdh1	   1.24TiB
  /dev/sda1	   3.97TiB
  /dev/sdf1	   3.96TiB
  /dev/sdj1	   3.96TiB
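
Note that the "Overall" block in this output is not meaningful for the data chunks, as the RAID56 warning indicates. A quick check, sketched below, suggests that "Device allocated" covers only the RAID1 metadata and system chunks: two copies of each come to about 22.06GiB. This is an inference from the numbers in this output, not something confirmed from the btrfs-progs source.

```python
MIB_PER_GIB = 1024

# Totals from the Metadata,RAID1 and System,RAID1 lines of this output.
metadata_total_mib = 11.00 * MIB_PER_GIB
system_total_mib = 32.00
raid1_copies = 2  # RAID1 keeps two copies of every chunk

# Raw space taken by the RAID1 chunks alone, in GiB.
raid1_raw_gib = raid1_copies * (metadata_total_mib + system_total_mib) / MIB_PER_GIB

print(f"RAID1 raw allocation: {raid1_raw_gib:.4f} GiB")
print("reported 'Device allocated': 22.06 GiB")
```

If that reading is right, the per-profile sections and the per-device "Unallocated" list further down are the figures to rely on here, not the "Overall" summary.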


DMESG Log: https://pastebin.com/w09F4nMr


PS.  I think my first message was blocked due to being over 100k, so I am resending with pastebin for DMESG.  Sorry if there is a duplicate.
