All of lore.kernel.org
 help / color / mirror / Atom feed
* Need Help with crashed RAID5 (that was rebuilding and then had SATA error on another drive)
@ 2016-08-22 21:51 Ben Kamen
  2016-08-22 23:06 ` Adam Goryachev
  0 siblings, 1 reply; 25+ messages in thread
From: Ben Kamen @ 2016-08-22 21:51 UTC (permalink / raw)
  To: linux-raid

Hey all. I'm looking at the RAID Wiki and need some help.

First Info:

I have a RAID5 with 4 members /dev/sd[cdef]1 where last night, sdc1
reported a smart error recommended drive replacement (after watching
sector errors pile up for about a week.)

no problem. shut down the drive, pulled it, replace it with a cold
spare. Started the rebuild (around midnight CDT).

At 5:43am, I got this message:

This is an automatically generated mail message from mdadm
running on quantum

A Fail event had been detected on md device /dev/md127.

It could be related to component device /dev/sde1.

Faithfully yours, etc.

P.S. The /proc/mdstat file currently contains the following:

Personalities : [raid1] [raid6] [raid5] [raid4]
md0 : active raid1 sda2[0] sdb2[2]
      511988 blocks super 1.0 [2/2] [UU]

md127 : active raid5 sdc1[4] sdf1[6] sde1[1](F) sdd1[5]
      2930276352 blocks super 1.2 level 5, 512k chunk, algorithm 2 [4/2] [U_U_]
      [===========>.........]  recovery = 55.9% (546131076/976758784)
finish=381.6min speed=18805K/sec
      bitmap: 4/8 pages [16KB], 65536KB chunk

md1 : active raid1 sda3[0] sdb3[2]
      239489916 blocks super 1.1 [2/2] [UU]
      bitmap: 2/2 pages [8KB], 65536KB chunk

md10 : active raid1 sda1[0] sdb1[2]
      4193272 blocks super 1.1 [2/2] [UU]

unused devices: <none>

/dev/md127  is the one with issues.

It looks like the SATA controller had issues. I couldn't see sde - so
I rebooted. (scold me later.)

All the drives are available. SMARTCTL tells me /dev/sde is happy as
can be (has a few bad sectors and is slated for replacement next, but
smart says drive is healthy).

I looked at the raid Wiki - and saved the mdadm --examine info. Of the
active members, the event count is off by 25 for happy vs unhappy
members.

But forcing the assembly claims

mdadm --assemble --force /dev/md127 /dev/sd[cdef]1
mdadm: /dev/sdc1 is busy - skipping
mdadm: /dev/sdd1 is busy - skipping
mdadm: /dev/sdf1 is busy - skipping
mdadm: Found some drive for an array that is already active: /dev/md/:BigRAID
mdadm: giving up.

So before I mess up ANYTHING else...

What should I be doing?

(should I be stopping the RAID as right now it's seems like it's running)

Thanks,

   -Ben

^ permalink raw reply	[flat|nested] 25+ messages in thread

end of thread, other threads:[~2016-09-19 16:17 UTC | newest]

Thread overview: 25+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2016-08-22 21:51 Need Help with crashed RAID5 (that was rebuilding and then had SATA error on another drive) Ben Kamen
2016-08-22 23:06 ` Adam Goryachev
2016-08-23 11:36   ` Wols Lists
2016-08-23 15:44     ` Ben
     [not found]   ` <CADDTLRBf9NPO6OuF4a3b+xffZgeZRqHRG+pJdPmbc9-Jat0HVQ@mail.gmail.com>
     [not found]     ` <d6d3fe0d-3f9f-985f-9bfb-051428cf221b@websitemanagers.com.au>
     [not found]       ` <57BBDA5B.3020706@gmail.com>
     [not found]         ` <57BBDC15.5030301@gmail.com>
     [not found]           ` <b8c6a380-7e6a-fda9-5834-b85271b26892@websitemanagers.com.au>
     [not found]             ` <57BC61F7.8070102@gmail.com>
     [not found]               ` <aca4e83f-9a3f-c200-7c16-3b5d9df52c1e@websitemanagers.com.au>
     [not found]                 ` <57BE450B.4030700@gmail.com>
     [not found]                   ` <56e86db5-456d-e9c1-339d-ba8903fe5dde@websitemanagers.com.au>
     [not found]                     ` <57BE52BC.6040908@gmail.com>
     [not found]                       ` <933228e0-bce4-ffad-f48d-034bf89bc07f@websitemanagers.com.au>
2016-08-26  1:20                         ` Ben
2016-08-26  2:22                           ` Phil Turmel
2016-08-26  2:54                             ` Benjammin2068
2016-08-26 12:38                               ` Phil Turmel
2016-08-26 18:07                           ` Wols Lists
2016-08-28 18:29                             ` Benjammin2068
2016-08-28 19:20                               ` Anthony Youngman
2016-08-29  1:23                                 ` Benjammin2068
2016-08-28 23:54                               ` Adam Goryachev
2016-08-29  1:25                                 ` Benjammin2068
2016-08-29 11:19                                   ` Wols Lists
2016-09-18 17:13                                     ` Best tool to partition Drives with new sector geometry - (WAS: Need Help with crashed RAID5 (that was rebuilding and then had SATA error on another drive)) Benjammin2068
2016-09-18 17:50                                       ` Chris Murphy
2016-09-18 18:41                                         ` Benjammin2068
2016-09-18 19:17                                           ` Wols Lists
2016-09-18 19:58                                             ` Benjammin2068
2016-09-18 21:21                                               ` Wols Lists
2016-09-18 21:29                                                 ` Benjammin2068
2016-09-19  6:25                                                   ` Wols Lists
2016-09-19 16:17                                                     ` Benjammin2068
2016-09-18 18:08                                       ` Benjammin2068

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.