kernel BUG at fs/btrfs/relocation.c:437!

* kernel BUG at fs/btrfs/relocation.c:437!
@ 2020-09-01  8:18 Johannes Rohr
  2020-09-03  8:15 ` Johannes Rohr
  2020-09-03 15:56 ` Josef Bacik
  0 siblings, 2 replies; 3+ messages in thread
From: Johannes Rohr @ 2020-09-01  8:18 UTC (permalink / raw)
  To: linux-btrfs

Dear devs,

I tried to replace an SSD with bad S.M.A.R.T. status and since I don't
have physical access to the server, I first wanted to remove it from the
RAID 1 (which has 4 SSDs) and then erase it.

I ran "btrfs device delete /dev/sda2 /". After a while, the command
terminated with a segfault and the system hung. I waited for 30 minutes.
Fortunately, it could be resurrected with a hard reset.

dmesg, as this happened, reports that a block on a different SSD, on
/dev/sdc can't be found.

See full backtrace here:
https://gist.github.com/vasyugan/340d9cd2292e3122c1d7773df718a234

Now I am afraid that if sda is just removed physically, then marked as
degraded and swapped for a new SSD using the btrfs replace command, this
might also go bad  because of the block that can't be found.

Does any of you have advice on what to do? From the backtrace I don't
even understand if the issue is a physical problem with sdc (whose
S.M.A.R.T. values are just fine) or whether this is another btrfs bug
and if you, if there is any way to work around it.

We are running Ubuntu 20.04, the kernel is 5.4.0-45-generic, Ubuntu's
version number is: 5.4.0-45.49. It was released yesterday and was
supposed to have a relocation relate bug fixed, see
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1889669

I suppose, this is a separate issue. Should I report a bug? If so, where?

Thanks a lot in advance for your support!!!

Johannes

^ permalink raw reply	[flat|nested] 3+ messages in thread