All of lore.kernel.org
 help / color / mirror / Atom feed
* Software RAID6 broke after power outage
       [not found] <CA+CBf3QZP4Yss0U=6Aa_5a+3D2Yy-WT545VazHiFWCZsreNOEg@mail.gmail.com>
@ 2020-07-22  7:41 ` Cory Derenburger
  2020-07-22  9:14   ` Wols Lists
  2020-07-22 19:03   ` Ian Pilcher
  0 siblings, 2 replies; 6+ messages in thread
From: Cory Derenburger @ 2020-07-22  7:41 UTC (permalink / raw)
  To: linux-raid

My server lost power this morning. The server is running Linux Mint
(14?) on a battery backup and I believe it shutdown before losing
power. Upon restarting the server the computer hung for a while, and
after resetting and booting up in recovery mode my RAID is now
nonfunctional.

The server was set up years ago with a RAID 6 array built with mdadm.
To be honest I don't really know what is wrong with the array, it
seems to be an issue with disk sdc. I wanted to reach out for help to
confirm the issue and get some guidance before proceeding (or making
things worse).

Any assistance that can help me determine what steps to take to get
this server back up and running would be greatly appreciated. It's
been 4+ since I have touched RAID, and only attempted a recovery once.
If anyone can help I would be super appreciative.

Below I'm including outputs from various commands for the 3rd disk
which seems to be the culprit

dmesg - boot section section where first errors begin occurring
[    2.637856] md: bind<sdd1>
[    2.646987] random: nonblocking pool is initialized
[    2.647432] md: bind<sde1>
[    2.651429] md: bind<sdb1>
[    2.863538] ata3.00: exception Emask 0x0 SAct 0x10 SErr 0x0 action 0x0
[    2.863594] ata3.00: irq_stat 0x40000008
[    2.863643] ata3.00: failed command: READ FPDMA QUEUED
[    2.863695] ata3.00: cmd 60/08:20:08:08:00/00:00:00:00:00/40 tag 4
ncq 4096 in
[    2.863695]          res 41/40:00:09:08:00/00:00:00:00:00/40 Emask
0x409 (media error) <F>
[    2.863775] ata3.00: status: { DRDY ERR }
[    2.863822] ata3.00: error: { UNC }
[    2.873407] ata3.00: configured for UDMA/133
[    2.873476] sd 2:0:0:0: [sdc] Unhandled sense code
[    2.873525] sd 2:0:0:0: [sdc]
[    2.873571] Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE
[    2.873619] sd 2:0:0:0: [sdc]
[    2.873665] Sense Key : Medium Error [current] [descriptor]
[    2.873819] Descriptor sense data with sense descriptors (in hex):
[    2.873901]         72 03 11 04 00 00 00 0c 00 0a 80 00 00 00 00 00
[    2.874544]         00 00 08 09
[    2.874764] sd 2:0:0:0: [sdc]
[    2.874811] Add. Sense: Unrecovered read error - auto reallocate failed
[    2.874895] sd 2:0:0:0: [sdc] CDB:
[    2.874941] Read(10): 28 00 00 00 08 08 00 00 08 00
[    2.875428] end_request: I/O error, dev sdc, sector 2057
[    2.875478] Buffer I/O error on device sdc1, logical block 1

cat /proc/mdstat
Personalities : [linear] [multipath] [raid0] [raid1] [raid6] [raid5]
[raid4] [raid10]
md0 : inactive sdb1[0](S) sde1[3](S) sdd1[2](S)
      5860147464 blocks super 1.2

{not sure why these drives are now showing as spares}

Below running mdstat for sdc.  Checking sdb, sdd, sde appear fine.

mdadm --examine /dev/sdc
/dev/sdc:   MBR Magic : aa55
Partition[0] :   3907027120 sectors at         2048 (type fd)

mdadm --examine /dev/sdc1
mdadm: No md superblock detected on /dev/sdc1.

fdisk -l
Disk /dev/sdb: 2000.4 GB, 2000398934016 bytes
81 heads, 63 sectors/track, 765633 cylinders, total 3907029168 sectors
Units = sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 512 bytes
I/O size (minimum/optimal): 512 bytes / 512 bytes
Disk identifier: 0x38389fdc

   Device Boot      Start         End      Blocks   Id  System
/dev/sdb1            2048  3907029167  1953513560   fd  Linux raid autodetect

Disk /dev/sdc: 2000.4 GB, 2000398934016 bytes
81 heads, 63 sectors/track, 765633 cylinders, total 3907029168 sectors
Units = sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 512 bytes
I/O size (minimum/optimal): 512 bytes / 512 bytes
Disk identifier: 0xd108824d

   Device Boot      Start         End      Blocks   Id  System
/dev/sdc1            2048  3907029167  1953513560   fd  Linux raid autodetect

Disk /dev/sdd: 2000.4 GB, 2000398934016 bytes
81 heads, 63 sectors/track, 765633 cylinders, total 3907029168 sectors
Units = sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 512 bytes
I/O size (minimum/optimal): 512 bytes / 512 bytes
Disk identifier: 0x6207659a

   Device Boot      Start         End      Blocks   Id  System
/dev/sdd1            2048  3907029167  1953513560   fd  Linux raid autodetect

Disk /dev/sde: 2000.4 GB, 2000398934016 bytes
81 heads, 63 sectors/track, 765633 cylinders, total 3907029168 sectors
Units = sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 512 bytes
I/O size (minimum/optimal): 512 bytes / 512 bytes
Disk identifier: 0xd9a4afcf

   Device Boot      Start         End      Blocks   Id  System
/dev/sde1            2048  3907029167  1953513560   fd  Linux raid autodetect


Is there other information needed to determine the issue?  Where do I
go from here?



Thank you for your help,
Cory

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: Software RAID6 broke after power outage
  2020-07-22  7:41 ` Software RAID6 broke after power outage Cory Derenburger
@ 2020-07-22  9:14   ` Wols Lists
  2020-07-22 16:29     ` Cory Derenburger
  2020-07-22 19:03   ` Ian Pilcher
  1 sibling, 1 reply; 6+ messages in thread
From: Wols Lists @ 2020-07-22  9:14 UTC (permalink / raw)
  To: Cory Derenburger, linux-raid

On 22/07/20 08:41, Cory Derenburger wrote:
> My server lost power this morning. The server is running Linux Mint
> (14?) on a battery backup and I believe it shutdown before losing
> power. Upon restarting the server the computer hung for a while, and
> after resetting and booting up in recovery mode my RAID is now
> nonfunctional.
> 
> The server was set up years ago with a RAID 6 array built with mdadm.
> To be honest I don't really know what is wrong with the array, it
> seems to be an issue with disk sdc. I wanted to reach out for help to
> confirm the issue and get some guidance before proceeding (or making
> things worse).
> 
> Any assistance that can help me determine what steps to take to get
> this server back up and running would be greatly appreciated. It's
> been 4+ since I have touched RAID, and only attempted a recovery once.
> If anyone can help I would be super appreciative.

https://raid.wiki.kernel.org/index.php/Linux_Raid#When_Things_Go_Wrogn
https://raid.wiki.kernel.org/index.php/Asking_for_help

I see you've included some stuff which is helpful, but can you do
everything that last page asks for. In particular, lsdrv.
> 
> Below I'm including outputs from various commands for the 3rd disk
> which seems to be the culprit
> 
> dmesg - boot section section where first errors begin occurring
> [    2.637856] md: bind<sdd1>
> [    2.646987] random: nonblocking pool is initialized
> [    2.647432] md: bind<sde1>
> [    2.651429] md: bind<sdb1>
> [    2.863538] ata3.00: exception Emask 0x0 SAct 0x10 SErr 0x0 action 0x0
> [    2.863594] ata3.00: irq_stat 0x40000008
> [    2.863643] ata3.00: failed command: READ FPDMA QUEUED
> [    2.863695] ata3.00: cmd 60/08:20:08:08:00/00:00:00:00:00/40 tag 4
> ncq 4096 in
> [    2.863695]          res 41/40:00:09:08:00/00:00:00:00:00/40 Emask
> 0x409 (media error) <F>
> [    2.863775] ata3.00: status: { DRDY ERR }
> [    2.863822] ata3.00: error: { UNC }
> [    2.873407] ata3.00: configured for UDMA/133
> [    2.873476] sd 2:0:0:0: [sdc] Unhandled sense code
> [    2.873525] sd 2:0:0:0: [sdc]
> [    2.873571] Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE
> [    2.873619] sd 2:0:0:0: [sdc]
> [    2.873665] Sense Key : Medium Error [current] [descriptor]
> [    2.873819] Descriptor sense data with sense descriptors (in hex):
> [    2.873901]         72 03 11 04 00 00 00 0c 00 0a 80 00 00 00 00 00
> [    2.874544]         00 00 08 09
> [    2.874764] sd 2:0:0:0: [sdc]
> [    2.874811] Add. Sense: Unrecovered read error - auto reallocate failed
> [    2.874895] sd 2:0:0:0: [sdc] CDB:
> [    2.874941] Read(10): 28 00 00 00 08 08 00 00 08 00
> [    2.875428] end_request: I/O error, dev sdc, sector 2057
> [    2.875478] Buffer I/O error on device sdc1, logical block 1
> 
> cat /proc/mdstat
> Personalities : [linear] [multipath] [raid0] [raid1] [raid6] [raid5]
> [raid4] [raid10]
> md0 : inactive sdb1[0](S) sde1[3](S) sdd1[2](S)
>       5860147464 blocks super 1.2
> 
> {not sure why these drives are now showing as spares}

This is very common when an array fails to assemble properly.
Unfortunately, when there's one error, it often triggers a cascade of
fake errors, and this is probably the case here.
> 
> Below running mdstat for sdc.  Checking sdb, sdd, sde appear fine.
> 
> mdadm --examine /dev/sdc
> /dev/sdc:   MBR Magic : aa55
> Partition[0] :   3907027120 sectors at         2048 (type fd)
> 
> mdadm --examine /dev/sdc1
> mdadm: No md superblock detected on /dev/sdc1.
> 
> fdisk -l
> Disk /dev/sdb: 2000.4 GB, 2000398934016 bytes
> 81 heads, 63 sectors/track, 765633 cylinders, total 3907029168 sectors
> Units = sectors of 1 * 512 = 512 bytes
> Sector size (logical/physical): 512 bytes / 512 bytes
> I/O size (minimum/optimal): 512 bytes / 512 bytes
> Disk identifier: 0x38389fdc
> 
>    Device Boot      Start         End      Blocks   Id  System
> /dev/sdb1            2048  3907029167  1953513560   fd  Linux raid autodetect
> 
> Disk /dev/sdc: 2000.4 GB, 2000398934016 bytes
> 81 heads, 63 sectors/track, 765633 cylinders, total 3907029168 sectors
> Units = sectors of 1 * 512 = 512 bytes
> Sector size (logical/physical): 512 bytes / 512 bytes
> I/O size (minimum/optimal): 512 bytes / 512 bytes
> Disk identifier: 0xd108824d
> 
>    Device Boot      Start         End      Blocks   Id  System
> /dev/sdc1            2048  3907029167  1953513560   fd  Linux raid autodetect
> 
> Disk /dev/sdd: 2000.4 GB, 2000398934016 bytes
> 81 heads, 63 sectors/track, 765633 cylinders, total 3907029168 sectors
> Units = sectors of 1 * 512 = 512 bytes
> Sector size (logical/physical): 512 bytes / 512 bytes
> I/O size (minimum/optimal): 512 bytes / 512 bytes
> Disk identifier: 0x6207659a
> 
>    Device Boot      Start         End      Blocks   Id  System
> /dev/sdd1            2048  3907029167  1953513560   fd  Linux raid autodetect
> 
> Disk /dev/sde: 2000.4 GB, 2000398934016 bytes
> 81 heads, 63 sectors/track, 765633 cylinders, total 3907029168 sectors
> Units = sectors of 1 * 512 = 512 bytes
> Sector size (logical/physical): 512 bytes / 512 bytes
> I/O size (minimum/optimal): 512 bytes / 512 bytes
> Disk identifier: 0xd9a4afcf
> 
>    Device Boot      Start         End      Blocks   Id  System
> /dev/sde1            2048  3907029167  1953513560   fd  Linux raid autodetect
> 
> 
> Is there other information needed to determine the issue?  Where do I
> go from here?
> 
How old is linux mint? Have you kept it up-to-date? Unfortunately, it
seems a lot of older systems suffer issues when the kernel is heavily
patched and mdadm is not updated, and this regularly surfaces on this
list where Ubuntu is concerned ...

mdadm --version
uname -a

Make sure you have a "latest and greatest" rescue disk to hand, and
we'll see what the others say.

Cheers,
Wol

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: Software RAID6 broke after power outage
  2020-07-22  9:14   ` Wols Lists
@ 2020-07-22 16:29     ` Cory Derenburger
  2020-07-22 19:47       ` antlists
  0 siblings, 1 reply; 6+ messages in thread
From: Cory Derenburger @ 2020-07-22 16:29 UTC (permalink / raw)
  To: Wols Lists; +Cc: linux-raid

Thanks Wols,

The version on Linux Mint I've been running is quite old.  Once the
server was last configured it did not have updates.  It was put on a
shelf and (mostly) left alone to serve files reliably for years.

$ mdadm --version
mdadm - v3.2.5 - 18th May 2012

uname -a
Linux LIZZY 3.16.0-38-generic #52~14.04.1-Ubuntu SMP Fri May 8
09:43:57 UTC 2015 x86_64 x86_64 x86_64 GNU/Linux

Here is the lsdrv information
./lsdrv
**Warning** The following utility(ies) failed to execute:
  sginfo
Some information may be missing.

Controller platform [None]
└platform floppy.0
 └fd0 0.00k [2:0] Empty/Unknown
PCI [ahci] 00:11.0 SATA controller: Advanced Micro Devices, Inc.
[AMD/ATI] SB7x0/SB8x0/SB9x0 SATA Controller [AHCI mode]
├scsi 0:0:0:0 ATA      MKNSSDEC60GB     {ME150901AS2073580}
│└sda 55.90g [8:0] Partitioned (dos)
│ ├sda1 7.92g [8:1] ext4 {ef60a590-af5c-41f6-9166-3988d6646092}
│ │└Mounted as /dev/disk/by-uuid/ef60a590-af5c-41f6-9166-3988d6646092 @ /
│ ├sda2 1.00k [8:2] Partitioned (dos)
│ ├sda5 36.76g [8:5] ext4 {22fbf184-d791-45c9-8de9-62ee4f0a1776}
│ │└Mounted as /dev/sda5 @ /home
│ └sda6 1.91g [8:6] swap {4326b017-dea7-489d-850a-29c814ea6a99}
├scsi 1:0:0:0 ATA      Hitachi HUA72302 {YFGK3VXD}
│└sdb 1.82t [8:16] Partitioned (dos)
│ └sdb1 1.82t [8:17] MD  (none/) (w/ sdd1,sde1) spare 'LIZZY:0'
{605a2a08-65dc-e76a-967b-4f9e8fc79011}
│  └md0 0.00k [9:0] MD v1.2  () inactive, None (None) None {None}
│                   Empty/Unknown
├scsi 2:0:0:0 ATA      WDC WD20EARS-00M {WD-WCAZA1597296}
│└sdc 1.82t [8:32] Partitioned (dos)
│ └sdc1 1.82t [8:33] Empty/Unknown
├scsi 3:0:0:0 ATA      Hitachi HUA72302 {YFHK9JAA}
│└sdd 1.82t [8:48] Partitioned (dos)
│ └sdd1 1.82t [8:49] MD  (none/) (w/ sdb1,sde1) spare 'LIZZY:0'
{605a2a08-65dc-e76a-967b-4f9e8fc79011}
│  └md0 0.00k [9:0] MD v1.2  () inactive, None (None) None {None}
│                   Empty/Unknown
└scsi 5:0:0:0 ATA      Hitachi HUA72302 {YFG7LWBA}
 └sde 1.82t [8:64] Partitioned (dos)
  └sde1 1.82t [8:65] MD  (none/) (w/ sdb1,sdd1) spare 'LIZZY:0'
{605a2a08-65dc-e76a-967b-4f9e8fc79011}
   └md0 0.00k [9:0] MD v1.2  () inactive, None (None) None {None}
                    Empty/Unknown
PCI [pata_atiixp] 00:14.1 IDE interface: Advanced Micro Devices, Inc.
[AMD/ATI] SB7x0/SB8x0/SB9x0 IDE Controller
└scsi 6:0:0:0 PIONEER  DVD-RW  DVR-116D {PIONEER_DVD-RW_DVR-116D}
 └sr0 1.00g [11:0] Empty/Unknown
PCI [ahci] 02:00.0 SATA controller: Marvell Technology Group Ltd.
Device 9215 (rev 11)
└scsi 8:x:x:x [Empty]
Other Block Devices
├loop0 0.00k [7:0] Empty/Unknown
├loop1 0.00k [7:1] Empty/Unknown
├loop2 0.00k [7:2] Empty/Unknown
├loop3 0.00k [7:3] Empty/Unknown
├loop4 0.00k [7:4] Empty/Unknown
├loop5 0.00k [7:5] Empty/Unknown
├loop6 0.00k [7:6] Empty/Unknown
├loop7 0.00k [7:7] Empty/Unknown
├ram0 64.00m [1:0] Empty/Unknown
├ram1 64.00m [1:1] Empty/Unknown
├ram2 64.00m [1:2] Empty/Unknown
├ram3 64.00m [1:3] Empty/Unknown
├ram4 64.00m [1:4] Empty/Unknown
├ram5 64.00m [1:5] Empty/Unknown
├ram6 64.00m [1:6] Empty/Unknown
├ram7 64.00m [1:7] Empty/Unknown
├ram8 64.00m [1:8] Empty/Unknown
├ram9 64.00m [1:9] Empty/Unknown
├ram10 64.00m [1:10] Empty/Unknown
├ram11 64.00m [1:11] Empty/Unknown
├ram12 64.00m [1:12] Empty/Unknown
├ram13 64.00m [1:13] Empty/Unknown
├ram14 64.00m [1:14] Empty/Unknown
└ram15 64.00m [1:15] Empty/Unknown


smartctrl for the drives
# smartctl --xall /dev/sdb
smartctl 6.2 2013-07-26 r3841 [x86_64-linux-3.16.0-38-generic] (local build)
Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Device Model:     Hitachi HUA723020ALA641
Serial Number:    YFGK3VXD
LU WWN Device Id: 5 000cca 223c7c8d4
Firmware Version: MK7OA840
User Capacity:    2,000,398,934,016 bytes [2.00 TB]
Sector Size:      512 bytes logical/physical
Rotation Rate:    7200 rpm
Device is:        Not in smartctl database [for details use: -P showall]
ATA Version is:   ATA8-ACS T13/1699-D revision 4
SATA Version is:  SATA 2.6, 6.0 Gb/s (current: 3.0 Gb/s)
Local Time is:    Tue Jul 21 12:43:42 2020 PDT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
AAM feature is:   Unavailable
APM feature is:   Disabled
Rd look-ahead is: Enabled
Write cache is:   Enabled
ATA Security is:  Disabled, NOT FROZEN [SEC1]
Wt Cache Reorder: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x84) Offline data collection activity
                                        was suspended by an
interrupting command from host.
                                        Auto Offline Data Collection: Enabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                (20116) seconds.
Offline data collection
capabilities:                    (0x5b) SMART execute Offline immediate.
                                        Auto Offline data collection
on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        No Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   1) minutes.
Extended self-test routine
recommended polling time:        ( 336) minutes.
SCT capabilities:              (0x003d) SCT Status supported.
                                        SCT Error Recovery Control supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAGS    VALUE WORST THRESH FAIL RAW_VALUE
  1 Raw_Read_Error_Rate     PO-R--   100   100   016    -    0
  2 Throughput_Performance  P-S---   133   133   054    -    90
  3 Spin_Up_Time            POS---   100   100   024    -    492
  4 Start_Stop_Count        -O--C-   100   100   000    -    7
  5 Reallocated_Sector_Ct   PO--CK   100   100   005    -    0
  7 Seek_Error_Rate         PO-R--   100   100   067    -    0
  8 Seek_Time_Performance   P-S---   123   123   020    -    31
  9 Power_On_Hours          -O--C-   096   096   000    -    32011
 10 Spin_Retry_Count        PO--C-   100   100   060    -    0
 12 Power_Cycle_Count       -O--CK   100   100   000    -    7
192 Power-Off_Retract_Count -O--CK   100   100   000    -    641
193 Load_Cycle_Count        -O--C-   100   100   000    -    641
194 Temperature_Celsius     -O----   176   176   000    -    34 (Min/Max 23/39)
196 Reallocated_Event_Count -O--CK   100   100   000    -    0
197 Current_Pending_Sector  -O---K   100   100   000    -    0
198 Offline_Uncorrectable   ---R--   100   100   000    -    0
199 UDMA_CRC_Error_Count    -O-R--   200   200   000    -    0
                            ||||||_ K auto-keep
                            |||||__ C event count
                            ||||___ R error rate
                            |||____ S speed/performance
                            ||_____ O updated online
                            |______ P prefailure warning

General Purpose Log Directory Version 1
SMART           Log Directory Version 1 [multi-sector log support]
Address    Access  R/W   Size  Description
0x00       GPL,SL  R/O      1  Log Directory
0x01           SL  R/O      1  Summary SMART error log
0x03       GPL     R/O      1  Ext. Comprehensive SMART error log
0x04       GPL     R/O      7  Device Statistics log
0x06           SL  R/O      1  SMART self-test log
0x07       GPL     R/O      1  Extended self-test log
0x08       GPL     R/O      2  Power Conditions log
0x09           SL  R/W      1  Selective self-test log
0x10       GPL     R/O      1  NCQ Command Error log
0x11       GPL     R/O      1  SATA Phy Event Counters
0x20       GPL     R/O      1  Streaming performance log [OBS-8]
0x21       GPL     R/O      1  Write stream error log
0x22       GPL     R/O      1  Read stream error log
0x24       GPL     R/O     63  Current Device Internal Status Data log
0x80       GPL     R/W     63  Host vendor specific log
0x81-0x9f  GPL,SL  R/W     16  Host vendor specific log
0xe0       GPL,SL  R/W      1  SCT Command/Status
0xe1       GPL,SL  R/W      1  SCT Data Transfer

SMART Extended Comprehensive Error Log Version: 1 (1 sectors)
No Errors Logged

SMART Extended Self-test Log Version: 1 (1 sectors)
Num  Test_Description    Status                  Remaining
LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed without error       00%     31921         -
# 2  Extended offline    Completed without error       00%     31753         -
# 3  Extended offline    Completed without error       00%     31585         -
# 4  Extended offline    Completed without error       00%     31417         -
# 5  Extended offline    Completed without error       00%     31249         -
# 6  Extended offline    Completed without error       00%     31081         -
# 7  Extended offline    Completed without error       00%     30913         -
# 8  Extended offline    Completed without error       00%     30745         -
# 9  Extended offline    Completed without error       00%     30577         -
#10  Extended offline    Completed without error       00%     30409         -
#11  Extended offline    Completed without error       00%     30241         -
#12  Extended offline    Completed without error       00%     30073         -
#13  Extended offline    Completed without error       00%     29905         -
#14  Extended offline    Completed without error       00%     29737         -
#15  Extended offline    Completed without error       00%     29569         -
#16  Extended offline    Completed without error       00%     29401         -
#17  Extended offline    Completed without error       00%     29233         -
#18  Extended offline    Completed without error       00%     29065         -
#19  Extended offline    Completed without error       00%     28897         -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

SCT Status Version:                  3
SCT Version (vendor specific):       256 (0x0100)
SCT Support Level:                   1
Device State:                        SMART Off-line Data Collection
executing in background (4)
Current Temperature:                    34 Celsius
Power Cycle Min/Max Temperature:     27/34 Celsius
Lifetime    Min/Max Temperature:     23/39 Celsius
Under/Over Temperature Limit Count:   0/0
SCT Temperature History Version:     2
Temperature Sampling Period:         1 minute
Temperature Logging Interval:        1 minute
Min/Max recommended Temperature:      0/60 Celsius
Min/Max Temperature Limit:           -40/70 Celsius
Temperature History Size (Index):    128 (37)

Index    Estimated Time   Temperature Celsius
  38    2020-07-21 10:36    33  **************
 ...    ..(  3 skipped).    ..  **************
  42    2020-07-21 10:40    33  **************
  43    2020-07-21 10:41    34  ***************
  44    2020-07-21 10:42    33  **************
 ...    ..( 11 skipped).    ..  **************
  56    2020-07-21 10:54    33  **************
  57    2020-07-21 10:55    34  ***************
  58    2020-07-21 10:56    33  **************
 ...    ..(  5 skipped).    ..  **************
  64    2020-07-21 11:02    33  **************
  65    2020-07-21 11:03    34  ***************
  66    2020-07-21 11:04    33  **************
  67    2020-07-21 11:05    33  **************
  68    2020-07-21 11:06    34  ***************
  69    2020-07-21 11:07    33  **************
 ...    ..(  8 skipped).    ..  **************
  78    2020-07-21 11:16    33  **************
  79    2020-07-21 11:17    34  ***************
  80    2020-07-21 11:18    33  **************
  81    2020-07-21 11:19    33  **************
  82    2020-07-21 11:20    34  ***************
  83    2020-07-21 11:21    33  **************
 ...    ..( 11 skipped).    ..  **************
  95    2020-07-21 11:33    33  **************
  96    2020-07-21 11:34    34  ***************
  97    2020-07-21 11:35    33  **************
 ...    ..( 11 skipped).    ..  **************
 109    2020-07-21 11:47    33  **************
 110    2020-07-21 11:48    34  ***************
 111    2020-07-21 11:49    33  **************
 ...    ..( 10 skipped).    ..  **************
 122    2020-07-21 12:00    33  **************
 123    2020-07-21 12:01    34  ***************
 124    2020-07-21 12:02    33  **************
 125    2020-07-21 12:03    33  **************
 126    2020-07-21 12:04    34  ***************
 127    2020-07-21 12:05    33  **************
 ...    ..(  9 skipped).    ..  **************
   9    2020-07-21 12:15    33  **************
  10    2020-07-21 12:16    34  ***************
  11    2020-07-21 12:17    33  **************
 ...    ..(  2 skipped).    ..  **************
  14    2020-07-21 12:20    33  **************
  15    2020-07-21 12:21    34  ***************
  16    2020-07-21 12:22    33  **************
  17    2020-07-21 12:23    33  **************
  18    2020-07-21 12:24    34  ***************
  19    2020-07-21 12:25    33  **************
  20    2020-07-21 12:26    33  **************
  21    2020-07-21 12:27    34  ***************
  22    2020-07-21 12:28    33  **************
 ...    ..(  3 skipped).    ..  **************
  26    2020-07-21 12:32    33  **************
  27    2020-07-21 12:33    34  ***************
 ...    ..(  9 skipped).    ..  ***************
  37    2020-07-21 12:43    34  ***************

SCT Error Recovery Control:
           Read: Disabled
          Write: Disabled

Device Statistics (GP Log 0x04)
Page Offset Size         Value  Description
  1  =====  =                =  == General Statistics (rev 1) ==
  1  0x008  4                7  Lifetime Power-On Resets
  1  0x010  4            32011  Power-on Hours
  1  0x018  6       7094397844  Logical Sectors Written
  1  0x020  6         32420890  Number of Write Commands
  1  0x028  6     183722166461  Logical Sectors Read
  1  0x030  6        194678316  Number of Read Commands
  3  =====  =                =  == Rotating Media Statistics (rev 1) ==
  3  0x008  4            32006  Spindle Motor Power-on Hours
  3  0x010  4            32006  Head Flying Hours
  3  0x018  4              641  Head Load Events
  3  0x020  4                0  Number of Reallocated Logical Sectors
  3  0x028  4               12  Read Recovery Attempts
  3  0x030  4                0  Number of Mechanical Start Failures
  4  =====  =                =  == General Errors Statistics (rev 1) ==
  4  0x008  4                0  Number of Reported Uncorrectable Errors
  4  0x010  4                0  Resets Between Cmd Acceptance and Completion
  5  =====  =                =  == Temperature Statistics (rev 1) ==
  5  0x008  1               34  Current Temperature
  5  0x010  1               33~ Average Short Term Temperature
  5  0x018  1               31~ Average Long Term Temperature
  5  0x020  1               39  Highest Temperature
  5  0x028  1               23  Lowest Temperature
  5  0x030  1               37~ Highest Average Short Term Temperature
  5  0x038  1               25~ Lowest Average Short Term Temperature
  5  0x040  1               35~ Highest Average Long Term Temperature
  5  0x048  1               25~ Lowest Average Long Term Temperature
  5  0x050  4                0  Time in Over-Temperature
  5  0x058  1               60  Specified Maximum Operating Temperature
  5  0x060  4                0  Time in Under-Temperature
  5  0x068  1                0  Specified Minimum Operating Temperature
  6  =====  =                =  == Transport Statistics (rev 1) ==
  6  0x008  4              169  Number of Hardware Resets
  6  0x010  4              129  Number of ASR Events
  6  0x018  4                0  Number of Interface CRC Errors
                              |_ ~ normalized value

SATA Phy Event Counters (GP Log 0x11)
ID      Size     Value  Description
0x0001  2            0  Command failed due to ICRC error
0x0002  2            0  R_ERR response for data FIS
0x0003  2            0  R_ERR response for device-to-host data FIS
0x0004  2            0  R_ERR response for host-to-device data FIS
0x0005  2            0  R_ERR response for non-data FIS
0x0006  2            0  R_ERR response for device-to-host non-data FIS
0x0007  2            0  R_ERR response for host-to-device non-data FIS
0x0009  2           25  Transition from drive PhyRdy to drive PhyNRdy
0x000a  2           22  Device-to-host register FISes sent due to a COMRESET
0x000b  2            0  CRC errors within host-to-device FIS
0x000d  2            0  Non-CRC errors within host-to-device FIS

# smartctl --xall /dev/sdc
smartctl 6.2 2013-07-26 r3841 [x86_64-linux-3.16.0-38-generic] (local build)
Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Family:     Western Digital Caviar Green (AF)
Device Model:     WDC WD20EARS-00MVWB0
Serial Number:    WD-WCAZA1597296
LU WWN Device Id: 5 0014ee 25a653961
Firmware Version: 51.0AB51
User Capacity:    2,000,398,934,016 bytes [2.00 TB]
Sector Size:      512 bytes logical/physical
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ATA8-ACS (minor revision not indicated)
SATA Version is:  SATA 2.6, 3.0 Gb/s
Local Time is:    Tue Jul 21 12:45:57 2020 PDT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
AAM feature is:   Disabled
APM feature is:   Unavailable
Rd look-ahead is: Enabled
Write cache is:   Enabled
ATA Security is:  Disabled, NOT FROZEN [SEC1]
Wt Cache Reorder: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x82) Offline data collection activity
                                        was completed without error.
                                        Auto Offline Data Collection: Enabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                (38460) seconds.
Offline data collection
capabilities:                    (0x7b) SMART execute Offline immediate.
                                        Auto Offline data collection
on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   2) minutes.
Extended self-test routine
recommended polling time:        ( 371) minutes.
Conveyance self-test routine
recommended polling time:        (   5) minutes.
SCT capabilities:              (0x3035) SCT Status supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAGS    VALUE WORST THRESH FAIL RAW_VALUE
  1 Raw_Read_Error_Rate     POSR-K   199   199   051    -    2027
  3 Spin_Up_Time            POS--K   167   167   021    -    6641
  4 Start_Stop_Count        -O--CK   100   100   000    -    16
  5 Reallocated_Sector_Ct   PO--CK   200   200   140    -    0
  7 Seek_Error_Rate         -OSR-K   200   200   000    -    0
  9 Power_On_Hours          -O--CK   057   057   000    -    31954
 10 Spin_Retry_Count        -O--CK   100   253   000    -    0
 11 Calibration_Retry_Count -O--CK   100   253   000    -    0
 12 Power_Cycle_Count       -O--CK   100   100   000    -    14
192 Power-Off_Retract_Count -O--CK   200   200   000    -    11
193 Load_Cycle_Count        -O--CK   001   001   000    -    1121775
194 Temperature_Celsius     -O---K   121   115   000    -    29
196 Reallocated_Event_Count -O--CK   200   200   000    -    0
197 Current_Pending_Sector  -O--CK   200   200   000    -    0
198 Offline_Uncorrectable   ----CK   200   200   000    -    0
199 UDMA_CRC_Error_Count    -O--CK   200   200   000    -    0
200 Multi_Zone_Error_Rate   ---R--   199   198   000    -    371
                            ||||||_ K auto-keep
                            |||||__ C event count
                            ||||___ R error rate
                            |||____ S speed/performance
                            ||_____ O updated online
                            |______ P prefailure warning

General Purpose Log Directory Version 1
SMART           Log Directory Version 1 [multi-sector log support]
Address    Access  R/W   Size  Description
0x00       GPL,SL  R/O      1  Log Directory
0x01           SL  R/O      1  Summary SMART error log
0x02           SL  R/O      5  Comprehensive SMART error log
0x03       GPL     R/O      6  Ext. Comprehensive SMART error log
0x06           SL  R/O      1  SMART self-test log
0x07       GPL     R/O      1  Extended self-test log
0x09           SL  R/W      1  Selective self-test log
0x10       GPL     R/O      1  NCQ Command Error log
0x11       GPL     R/O      1  SATA Phy Event Counters
0x80-0x9f  GPL,SL  R/W     16  Host vendor specific log
0xa0-0xa7  GPL,SL  VS      16  Device vendor specific log
0xa8-0xb7  GPL,SL  VS       1  Device vendor specific log
0xc0       GPL,SL  VS       1  Device vendor specific log
0xc1       GPL     VS      93  Device vendor specific log
0xe0       GPL,SL  R/W      1  SCT Command/Status
0xe1       GPL,SL  R/W      1  SCT Data Transfer

SMART Extended Comprehensive Error Log Version: 1 (6 sectors)
Device Error Count: 89 (device log contains only the most recent 24 errors)
        CR     = Command Register
        FEATR  = Features Register
        COUNT  = Count (was: Sector Count) Register
        LBA_48 = Upper bytes of LBA High/Mid/Low Registers ]  ATA-8
        LH     = LBA High (was: Cylinder High) Register    ]   LBA
        LM     = LBA Mid (was: Cylinder Low) Register      ] Register
        LL     = LBA Low (was: Sector Number) Register     ]
        DV     = Device (was: Device/Head) Register
        DC     = Device Control Register
        ER     = Error register
        ST     = Status register
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.

Error 89 [16] occurred at disk power-on lifetime: 31954 hours (1331
days + 10 hours)
  When the command that caused the error occurred, the device was
active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 51 00 00 00 00 00 00 08 09 40 00  Error: UNC at LBA = 0x00000809 = 2057

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --  ---------------  --------------------
  60 00 08 00 c8 00 00 00 00 08 08 40 08     04:18:54.610  READ FPDMA QUEUED
  60 00 08 00 c0 00 00 00 00 08 00 40 08     04:18:54.610  READ FPDMA QUEUED
  60 00 08 00 b8 00 00 e8 e0 88 a0 40 08     04:18:54.609  READ FPDMA QUEUED
  60 00 08 00 b0 00 00 e8 e0 88 00 40 08     04:18:54.204  READ FPDMA QUEUED
  b0 00 da 00 00 00 00 00 c2 4f 00 00 08     04:16:10.310  SMART RETURN STATUS

Error 88 [15] occurred at disk power-on lifetime: 31953 hours (1331
days + 9 hours)
  When the command that caused the error occurred, the device was
active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 51 00 00 00 00 00 00 08 09 40 00  Error: UNC at LBA = 0x00000809 = 2057

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --  ---------------  --------------------
  60 00 08 00 30 00 00 00 00 08 08 40 08     03:55:50.295  READ FPDMA QUEUED
  ef 00 10 00 02 00 00 00 00 00 00 a0 08     03:55:50.295  SET
FEATURES [Enable SATA feature]
  27 00 00 00 00 00 00 00 00 00 00 e0 08     03:55:50.293  READ NATIVE
MAX ADDRESS EXT [OBS-ACS-3]
  ec 00 00 00 00 00 00 00 00 00 00 a0 08     03:55:50.290  IDENTIFY DEVICE
  ef 00 03 00 46 00 00 00 00 00 00 a0 08     03:55:50.290  SET
FEATURES [Set transfer mode]

Error 87 [14] occurred at disk power-on lifetime: 31953 hours (1331
days + 9 hours)
  When the command that caused the error occurred, the device was
active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 51 00 00 00 00 00 00 08 09 40 00  Error: UNC at LBA = 0x00000809 = 2057

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --  ---------------  --------------------
  60 00 08 00 28 00 00 00 00 08 08 40 08     03:55:50.136  READ FPDMA QUEUED
  60 00 08 00 20 00 00 00 00 08 20 40 08     03:55:50.136  READ FPDMA QUEUED
  60 00 08 00 18 00 00 00 00 0a 00 40 08     03:55:50.135  READ FPDMA QUEUED
  60 00 08 00 10 00 00 00 00 0b f8 40 08     03:55:50.135  READ FPDMA QUEUED
  60 00 08 00 08 00 00 00 00 0b f0 40 08     03:55:50.135  READ FPDMA QUEUED

Error 86 [13] occurred at disk power-on lifetime: 31953 hours (1331
days + 9 hours)
  When the command that caused the error occurred, the device was
active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 51 00 00 00 00 00 00 08 09 40 00  Error: UNC at LBA = 0x00000809 = 2057

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --  ---------------  --------------------
  60 00 08 00 e0 00 00 00 00 08 08 40 08     03:55:49.960  READ FPDMA QUEUED
  ef 00 10 00 02 00 00 00 00 00 00 a0 08     03:55:49.960  SET
FEATURES [Enable SATA feature]
  27 00 00 00 00 00 00 00 00 00 00 e0 08     03:55:49.958  READ NATIVE
MAX ADDRESS EXT [OBS-ACS-3]
  ec 00 00 00 00 00 00 00 00 00 00 a0 08     03:55:49.955  IDENTIFY DEVICE
  ef 00 03 00 46 00 00 00 00 00 00 a0 08     03:55:49.955  SET
FEATURES [Set transfer mode]

Error 85 [12] occurred at disk power-on lifetime: 31953 hours (1331
days + 9 hours)
  When the command that caused the error occurred, the device was
active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 51 00 00 00 00 00 00 08 09 40 00  Error: UNC at LBA = 0x00000809 = 2057

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --  ---------------  --------------------
  60 00 08 00 d8 00 00 00 00 08 08 40 08     03:55:49.798  READ FPDMA QUEUED
  60 00 08 00 d0 00 00 00 00 08 78 40 08     03:55:49.798  READ FPDMA QUEUED
  60 00 08 00 c8 00 00 00 00 08 38 40 08     03:55:49.798  READ FPDMA QUEUED
  60 00 08 00 c0 00 00 00 00 08 18 40 08     03:55:49.780  READ FPDMA QUEUED
  ef 00 10 00 02 00 00 00 00 00 00 a0 08     03:55:49.780  SET
FEATURES [Enable SATA feature]

Error 84 [11] occurred at disk power-on lifetime: 31953 hours (1331
days + 9 hours)
  When the command that caused the error occurred, the device was
active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 51 00 00 00 00 00 00 08 09 40 00  Error: UNC at LBA = 0x00000809 = 2057

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --  ---------------  --------------------
  60 00 08 00 b8 00 00 00 00 08 08 40 08     03:55:49.624  READ FPDMA QUEUED
  ef 00 10 00 02 00 00 00 00 00 00 a0 08     03:55:49.624  SET
FEATURES [Enable SATA feature]
  27 00 00 00 00 00 00 00 00 00 00 e0 08     03:55:49.622  READ NATIVE
MAX ADDRESS EXT [OBS-ACS-3]
  ec 00 00 00 00 00 00 00 00 00 00 a0 08     03:55:49.619  IDENTIFY DEVICE
  ef 00 03 00 46 00 00 00 00 00 00 a0 08     03:55:49.619  SET
FEATURES [Set transfer mode]

Error 83 [10] occurred at disk power-on lifetime: 31953 hours (1331
days + 9 hours)
  When the command that caused the error occurred, the device was
active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 51 00 00 00 00 00 00 08 09 40 00  Error: UNC at LBA = 0x00000809 = 2057

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --  ---------------  --------------------
  60 00 08 00 b0 00 00 00 00 08 08 40 08     03:55:49.468  READ FPDMA QUEUED
  ef 00 10 00 02 00 00 00 00 00 00 a0 08     03:55:49.468  SET
FEATURES [Enable SATA feature]
  27 00 00 00 00 00 00 00 00 00 00 e0 08     03:55:49.466  READ NATIVE
MAX ADDRESS EXT [OBS-ACS-3]
  ec 00 00 00 00 00 00 00 00 00 00 a0 08     03:55:49.463  IDENTIFY DEVICE
  ef 00 03 00 46 00 00 00 00 00 00 a0 08     03:55:49.463  SET
FEATURES [Set transfer mode]

Error 82 [9] occurred at disk power-on lifetime: 31953 hours (1331
days + 9 hours)
  When the command that caused the error occurred, the device was
active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 51 00 00 00 00 00 00 08 09 40 00  Error: UNC at LBA = 0x00000809 = 2057

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --  ---------------  --------------------
  60 00 08 00 a8 00 00 00 00 08 08 40 08     03:55:49.312  READ FPDMA QUEUED
  ef 00 10 00 02 00 00 00 00 00 00 a0 08     03:55:49.312  SET
FEATURES [Enable SATA feature]
  27 00 00 00 00 00 00 00 00 00 00 e0 08     03:55:49.310  READ NATIVE
MAX ADDRESS EXT [OBS-ACS-3]
  ec 00 00 00 00 00 00 00 00 00 00 a0 08     03:55:49.307  IDENTIFY DEVICE
  ef 00 03 00 46 00 00 00 00 00 00 a0 08     03:55:49.307  SET
FEATURES [Set transfer mode]

SMART Extended Self-test Log Version: 1 (1 sectors)
Num  Test_Description    Status                  Remaining
LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed without error       00%     31866         -
# 2  Extended offline    Completed without error       00%     31698         -
# 3  Extended offline    Completed without error       00%     31530         -
# 4  Extended offline    Completed without error       00%     31363         -
# 5  Extended offline    Completed without error       00%     31195         -
# 6  Extended offline    Completed without error       00%     31027         -
# 7  Extended offline    Completed without error       00%     30859         -
# 8  Extended offline    Completed without error       00%     30691         -
# 9  Extended offline    Completed without error       00%     30523         -
#10  Extended offline    Completed without error       00%     30356         -
#11  Extended offline    Completed without error       00%     30188         -
#12  Extended offline    Completed without error       00%     30020         -
#13  Extended offline    Completed without error       00%     29852         -
#14  Extended offline    Completed without error       00%     29685         -
#15  Extended offline    Completed without error       00%     29517         -
#16  Extended offline    Completed without error       00%     29349         -
#17  Extended offline    Completed without error       00%     29182         -
#18  Extended offline    Completed without error       00%     29014         -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

SCT Status Version:                  3
SCT Version (vendor specific):       258 (0x0102)
SCT Support Level:                   1
Device State:                        Active (0)
Current Temperature:                    29 Celsius
Power Cycle Min/Max Temperature:     26/29 Celsius
Lifetime    Min/Max Temperature:     26/35 Celsius
Under/Over Temperature Limit Count:   0/0
SCT Temperature History Version:     2
Temperature Sampling Period:         1 minute
Temperature Logging Interval:        1 minute
Min/Max recommended Temperature:      0/60 Celsius
Min/Max Temperature Limit:           -41/85 Celsius
Temperature History Size (Index):    478 (465)

Index    Estimated Time   Temperature Celsius
 466    2020-07-21 04:48    29  **********
 ...    ..( 14 skipped).    ..  **********
   3    2020-07-21 05:03    29  **********
   4    2020-07-21 05:04    31  ************
 ...    ..( 21 skipped).    ..  ************
  26    2020-07-21 05:26    31  ************
  27    2020-07-21 05:27    32  *************
 ...    ..(  3 skipped).    ..  *************
  31    2020-07-21 05:31    32  *************
  32    2020-07-21 05:32    31  ************
  33    2020-07-21 05:33    32  *************
  34    2020-07-21 05:34    31  ************
  35    2020-07-21 05:35    31  ************
  36    2020-07-21 05:36    32  *************
  37    2020-07-21 05:37    32  *************
  38    2020-07-21 05:38    31  ************
 ...    ..( 72 skipped).    ..  ************
 111    2020-07-21 06:51    31  ************
 112    2020-07-21 06:52    30  ***********
 113    2020-07-21 06:53    31  ************
 ...    ..( 18 skipped).    ..  ************
 132    2020-07-21 07:12    31  ************
 133    2020-07-21 07:13    30  ***********
 134    2020-07-21 07:14    31  ************
 ...    ..( 56 skipped).    ..  ************
 191    2020-07-21 08:11    31  ************
 192    2020-07-21 08:12    32  *************
 193    2020-07-21 08:13    31  ************
 194    2020-07-21 08:14    32  *************
 195    2020-07-21 08:15    31  ************
 196    2020-07-21 08:16    32  *************
 197    2020-07-21 08:17    32  *************
 198    2020-07-21 08:18    32  *************
 199    2020-07-21 08:19    31  ************
 200    2020-07-21 08:20    31  ************
 201    2020-07-21 08:21    32  *************
 202    2020-07-21 08:22    31  ************
 ...    ..(  2 skipped).    ..  ************
 205    2020-07-21 08:25    31  ************
 206    2020-07-21 08:26    32  *************
 207    2020-07-21 08:27    32  *************
 208    2020-07-21 08:28    31  ************
 209    2020-07-21 08:29    31  ************
 210    2020-07-21 08:30    31  ************
 211    2020-07-21 08:31    30  ***********
 212    2020-07-21 08:32    31  ************
 ...    ..(  6 skipped).    ..  ************
 219    2020-07-21 08:39    31  ************
 220    2020-07-21 08:40     ?  -
 221    2020-07-21 08:41    26  *******
 ...    ..( 13 skipped).    ..  *******
 235    2020-07-21 08:55    26  *******
 236    2020-07-21 08:56    27  ********
 ...    ..(  9 skipped).    ..  ********
 246    2020-07-21 09:06    27  ********
 247    2020-07-21 09:07    28  *********
 ...    ..( 29 skipped).    ..  *********
 277    2020-07-21 09:37    28  *********
 278    2020-07-21 09:38    29  **********
 279    2020-07-21 09:39    28  *********
 ...    ..( 37 skipped).    ..  *********
 317    2020-07-21 10:17    28  *********
 318    2020-07-21 10:18    29  **********
 319    2020-07-21 10:19    28  *********
 ...    ..(  7 skipped).    ..  *********
 327    2020-07-21 10:27    28  *********
 328    2020-07-21 10:28    29  **********
 329    2020-07-21 10:29    29  **********
 330    2020-07-21 10:30    29  **********
 331    2020-07-21 10:31    28  *********
 332    2020-07-21 10:32    28  *********
 333    2020-07-21 10:33    29  **********
 334    2020-07-21 10:34    29  **********
 335    2020-07-21 10:35    28  *********
 ...    ..(  2 skipped).    ..  *********
 338    2020-07-21 10:38    28  *********
 339    2020-07-21 10:39    29  **********
 340    2020-07-21 10:40    28  *********
 ...    ..( 58 skipped).    ..  *********
 399    2020-07-21 11:39    28  *********
 400    2020-07-21 11:40    29  **********
 401    2020-07-21 11:41    29  **********
 402    2020-07-21 11:42    28  *********
 ...    ..( 25 skipped).    ..  *********
 428    2020-07-21 12:08    28  *********
 429    2020-07-21 12:09    29  **********
 430    2020-07-21 12:10    28  *********
 431    2020-07-21 12:11    28  *********
 432    2020-07-21 12:12    29  **********
 433    2020-07-21 12:13    29  **********
 434    2020-07-21 12:14    29  **********
 435    2020-07-21 12:15    28  *********
 ...    ..(  3 skipped).    ..  *********
 439    2020-07-21 12:19    28  *********
 440    2020-07-21 12:20    29  **********
 441    2020-07-21 12:21    28  *********
 442    2020-07-21 12:22    29  **********
 ...    ..( 22 skipped).    ..  **********
 465    2020-07-21 12:45    29  **********

SCT Error Recovery Control command not supported

Device Statistics (GP Log 0x04) not supported

SATA Phy Event Counters (GP Log 0x11)
ID      Size     Value  Description
0x0001  2            0  Command failed due to ICRC error
0x0002  2            0  R_ERR response for data FIS
0x0003  2            0  R_ERR response for device-to-host data FIS
0x0004  2            0  R_ERR response for host-to-device data FIS
0x0005  2            0  R_ERR response for non-data FIS
0x0006  2            0  R_ERR response for device-to-host non-data FIS
0x0007  2            0  R_ERR response for host-to-device non-data FIS
0x000a  2           23  Device-to-host register FISes sent due to a COMRESET
0x000b  2            0  CRC errors within host-to-device FIS
0x8000  4        15858  Vendor specific

# smartctl --xall /dev/sdd
smartctl 6.2 2013-07-26 r3841 [x86_64-linux-3.16.0-38-generic] (local build)
Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Device Model:     Hitachi HUA723020ALA641
Serial Number:    YFHK9JAA
LU WWN Device Id: 5 000cca 223d5f593
Firmware Version: MK7OA840
User Capacity:    2,000,398,934,016 bytes [2.00 TB]
Sector Size:      512 bytes logical/physical
Rotation Rate:    7200 rpm
Device is:        Not in smartctl database [for details use: -P showall]
ATA Version is:   ATA8-ACS T13/1699-D revision 4
SATA Version is:  SATA 2.6, 6.0 Gb/s (current: 3.0 Gb/s)
Local Time is:    Tue Jul 21 12:47:13 2020 PDT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
AAM feature is:   Unavailable
APM feature is:   Disabled
Rd look-ahead is: Enabled
Write cache is:   Enabled
ATA Security is:  Disabled, NOT FROZEN [SEC1]
Wt Cache Reorder: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x84) Offline data collection activity
                                        was suspended by an
interrupting command from host.
                                        Auto Offline Data Collection: Enabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                (19618) seconds.
Offline data collection
capabilities:                    (0x5b) SMART execute Offline immediate.
                                        Auto Offline data collection
on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        No Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   1) minutes.
Extended self-test routine
recommended polling time:        ( 327) minutes.
SCT capabilities:              (0x003d) SCT Status supported.
                                        SCT Error Recovery Control supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAGS    VALUE WORST THRESH FAIL RAW_VALUE
  1 Raw_Read_Error_Rate     PO-R--   100   100   016    -    0
  2 Throughput_Performance  P-S---   134   134   054    -    88
  3 Spin_Up_Time            POS---   100   100   024    -    498
  4 Start_Stop_Count        -O--C-   100   100   000    -    7
  5 Reallocated_Sector_Ct   PO--CK   100   100   005    -    0
  7 Seek_Error_Rate         PO-R--   100   100   067    -    0
  8 Seek_Time_Performance   P-S---   125   125   020    -    30
  9 Power_On_Hours          -O--C-   096   096   000    -    32010
 10 Spin_Retry_Count        PO--C-   100   100   060    -    0
 12 Power_Cycle_Count       -O--CK   100   100   000    -    7
192 Power-Off_Retract_Count -O--CK   100   100   000    -    648
193 Load_Cycle_Count        -O--C-   100   100   000    -    648
194 Temperature_Celsius     -O----   181   181   000    -    33 (Min/Max 23/37)
196 Reallocated_Event_Count -O--CK   100   100   000    -    0
197 Current_Pending_Sector  -O---K   100   100   000    -    0
198 Offline_Uncorrectable   ---R--   100   100   000    -    0
199 UDMA_CRC_Error_Count    -O-R--   200   200   000    -    0
                            ||||||_ K auto-keep
                            |||||__ C event count
                            ||||___ R error rate
                            |||____ S speed/performance
                            ||_____ O updated online
                            |______ P prefailure warning

General Purpose Log Directory Version 1
SMART           Log Directory Version 1 [multi-sector log support]
Address    Access  R/W   Size  Description
0x00       GPL,SL  R/O      1  Log Directory
0x01           SL  R/O      1  Summary SMART error log
0x03       GPL     R/O      1  Ext. Comprehensive SMART error log
0x04       GPL     R/O      7  Device Statistics log
0x06           SL  R/O      1  SMART self-test log
0x07       GPL     R/O      1  Extended self-test log
0x08       GPL     R/O      2  Power Conditions log
0x09           SL  R/W      1  Selective self-test log
0x10       GPL     R/O      1  NCQ Command Error log
0x11       GPL     R/O      1  SATA Phy Event Counters
0x20       GPL     R/O      1  Streaming performance log [OBS-8]
0x21       GPL     R/O      1  Write stream error log
0x22       GPL     R/O      1  Read stream error log
0x24       GPL     R/O     63  Current Device Internal Status Data log
0x80       GPL     R/W     63  Host vendor specific log
0x81-0x9f  GPL,SL  R/W     16  Host vendor specific log
0xe0       GPL,SL  R/W      1  SCT Command/Status
0xe1       GPL,SL  R/W      1  SCT Data Transfer

SMART Extended Comprehensive Error Log Version: 1 (1 sectors)
No Errors Logged

SMART Extended Self-test Log Version: 1 (1 sectors)
Num  Test_Description    Status                  Remaining
LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed without error       00%     31920         -
# 2  Extended offline    Completed without error       00%     31752         -
# 3  Extended offline    Completed without error       00%     31584         -
# 4  Extended offline    Completed without error       00%     31416         -
# 5  Extended offline    Completed without error       00%     31248         -
# 6  Extended offline    Completed without error       00%     31080         -
# 7  Extended offline    Completed without error       00%     30912         -
# 8  Extended offline    Completed without error       00%     30744         -
# 9  Extended offline    Completed without error       00%     30576         -
#10  Extended offline    Completed without error       00%     30408         -
#11  Extended offline    Completed without error       00%     30240         -
#12  Extended offline    Completed without error       00%     30072         -
#13  Extended offline    Completed without error       00%     29904         -
#14  Extended offline    Completed without error       00%     29736         -
#15  Extended offline    Completed without error       00%     29568         -
#16  Extended offline    Completed without error       00%     29400         -
#17  Extended offline    Completed without error       00%     29232         -
#18  Extended offline    Completed without error       00%     29064         -
#19  Extended offline    Completed without error       00%     28896         -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

SCT Status Version:                  3
SCT Version (vendor specific):       256 (0x0100)
SCT Support Level:                   1
Device State:                        SMART Off-line Data Collection
executing in background (4)
Current Temperature:                    33 Celsius
Power Cycle Min/Max Temperature:     27/34 Celsius
Lifetime    Min/Max Temperature:     23/37 Celsius
Under/Over Temperature Limit Count:   0/0
SCT Temperature History Version:     2
Temperature Sampling Period:         1 minute
Temperature Logging Interval:        1 minute
Min/Max recommended Temperature:      0/60 Celsius
Min/Max Temperature Limit:           -40/70 Celsius
Temperature History Size (Index):    128 (6)

Index    Estimated Time   Temperature Celsius
   7    2020-07-21 10:40    33  **************
   8    2020-07-21 10:41    33  **************
   9    2020-07-21 10:42    32  *************
  10    2020-07-21 10:43    33  **************
  11    2020-07-21 10:44    33  **************
  12    2020-07-21 10:45    32  *************
  13    2020-07-21 10:46    33  **************
  14    2020-07-21 10:47    32  *************
  15    2020-07-21 10:48    32  *************
  16    2020-07-21 10:49    33  **************
  17    2020-07-21 10:50    32  *************
  18    2020-07-21 10:51    33  **************
 ...    ..( 11 skipped).    ..  **************
  30    2020-07-21 11:03    33  **************
  31    2020-07-21 11:04    32  *************
  32    2020-07-21 11:05    33  **************
 ...    ..( 15 skipped).    ..  **************
  48    2020-07-21 11:21    33  **************
  49    2020-07-21 11:22    32  *************
  50    2020-07-21 11:23    33  **************
  51    2020-07-21 11:24    33  **************
  52    2020-07-21 11:25    33  **************
  53    2020-07-21 11:26    32  *************
  54    2020-07-21 11:27    32  *************
  55    2020-07-21 11:28    33  **************
 ...    ..(  2 skipped).    ..  **************
  58    2020-07-21 11:31    33  **************
  59    2020-07-21 11:32    32  *************
  60    2020-07-21 11:33    32  *************
  61    2020-07-21 11:34    33  **************
  62    2020-07-21 11:35    32  *************
  63    2020-07-21 11:36    32  *************
  64    2020-07-21 11:37    33  **************
  65    2020-07-21 11:38    32  *************
  66    2020-07-21 11:39    33  **************
  67    2020-07-21 11:40    32  *************
  68    2020-07-21 11:41    32  *************
  69    2020-07-21 11:42    33  **************
  70    2020-07-21 11:43    32  *************
  71    2020-07-21 11:44    32  *************
  72    2020-07-21 11:45    33  **************
  73    2020-07-21 11:46    33  **************
  74    2020-07-21 11:47    32  *************
  75    2020-07-21 11:48    33  **************
  76    2020-07-21 11:49    32  *************
  77    2020-07-21 11:50    33  **************
 ...    ..( 45 skipped).    ..  **************
 123    2020-07-21 12:36    33  **************
 124    2020-07-21 12:37    34  ***************
 125    2020-07-21 12:38    34  ***************
 126    2020-07-21 12:39    33  **************
 ...    ..(  7 skipped).    ..  **************
   6    2020-07-21 12:47    33  **************

SCT Error Recovery Control:
           Read: Disabled
          Write: Disabled

Device Statistics (GP Log 0x04)
Page Offset Size         Value  Description
  1  =====  =                =  == General Statistics (rev 1) ==
  1  0x008  4                7  Lifetime Power-On Resets
  1  0x010  4            32010  Power-on Hours
  1  0x018  6       7053496316  Logical Sectors Written
  1  0x020  6         30154975  Number of Write Commands
  1  0x028  6     183776882028  Logical Sectors Read
  1  0x030  6        197249758  Number of Read Commands
  3  =====  =                =  == Rotating Media Statistics (rev 1) ==
  3  0x008  4            32005  Spindle Motor Power-on Hours
  3  0x010  4            32005  Head Flying Hours
  3  0x018  4              648  Head Load Events
  3  0x020  4                0  Number of Reallocated Logical Sectors
  3  0x028  4                0  Read Recovery Attempts
  3  0x030  4                0  Number of Mechanical Start Failures
  4  =====  =                =  == General Errors Statistics (rev 1) ==
  4  0x008  4                0  Number of Reported Uncorrectable Errors
  4  0x010  4                0  Resets Between Cmd Acceptance and Completion
  5  =====  =                =  == Temperature Statistics (rev 1) ==
  5  0x008  1               33  Current Temperature
  5  0x010  1               33~ Average Short Term Temperature
  5  0x018  1               30~ Average Long Term Temperature
  5  0x020  1               37  Highest Temperature
  5  0x028  1               23  Lowest Temperature
  5  0x030  1               34~ Highest Average Short Term Temperature
  5  0x038  1               25~ Lowest Average Short Term Temperature
  5  0x040  1               33~ Highest Average Long Term Temperature
  5  0x048  1               25~ Lowest Average Long Term Temperature
  5  0x050  4                0  Time in Over-Temperature
  5  0x058  1               60  Specified Maximum Operating Temperature
  5  0x060  4                0  Time in Under-Temperature
  5  0x068  1                0  Specified Minimum Operating Temperature
  6  =====  =                =  == Transport Statistics (rev 1) ==
  6  0x008  4              175  Number of Hardware Resets
  6  0x010  4              130  Number of ASR Events
  6  0x018  4                0  Number of Interface CRC Errors
                              |_ ~ normalized value

SATA Phy Event Counters (GP Log 0x11)
ID      Size     Value  Description
0x0001  2            0  Command failed due to ICRC error
0x0002  2            0  R_ERR response for data FIS
0x0003  2            0  R_ERR response for device-to-host data FIS
0x0004  2            0  R_ERR response for host-to-device data FIS
0x0005  2            0  R_ERR response for non-data FIS
0x0006  2            0  R_ERR response for device-to-host non-data FIS
0x0007  2            0  R_ERR response for host-to-device non-data FIS
0x0009  2           25  Transition from drive PhyRdy to drive PhyNRdy
0x000a  2           22  Device-to-host register FISes sent due to a COMRESET
0x000b  2            0  CRC errors within host-to-device FIS
0x000d  2            0  Non-CRC errors within host-to-device FIS



# smartctl --xall /dev/sde
smartctl 6.2 2013-07-26 r3841 [x86_64-linux-3.16.0-38-generic] (local build)
Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Device Model:     Hitachi HUA723020ALA641
Serial Number:    YFG7LWBA
LU WWN Device Id: 5 000cca 223c3757b
Firmware Version: MK7OA840
User Capacity:    2,000,398,934,016 bytes [2.00 TB]
Sector Size:      512 bytes logical/physical
Rotation Rate:    7200 rpm
Device is:        Not in smartctl database [for details use: -P showall]
ATA Version is:   ATA8-ACS T13/1699-D revision 4
SATA Version is:  SATA 2.6, 6.0 Gb/s (current: 3.0 Gb/s)
Local Time is:    Tue Jul 21 12:47:56 2020 PDT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
AAM feature is:   Unavailable
APM feature is:   Disabled
Rd look-ahead is: Enabled
Write cache is:   Enabled
ATA Security is:  Disabled, NOT FROZEN [SEC1]
Wt Cache Reorder: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x84) Offline data collection activity
                                        was suspended by an
interrupting command from host.
                                        Auto Offline Data Collection: Enabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                (20614) seconds.
Offline data collection
capabilities:                    (0x5b) SMART execute Offline immediate.
                                        Auto Offline data collection
on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        No Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   1) minutes.
Extended self-test routine
recommended polling time:        ( 344) minutes.
SCT capabilities:              (0x003d) SCT Status supported.
                                        SCT Error Recovery Control supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAGS    VALUE WORST THRESH FAIL RAW_VALUE
  1 Raw_Read_Error_Rate     PO-R--   100   100   016    -    0
  2 Throughput_Performance  P-S---   134   134   054    -    87
  3 Spin_Up_Time            POS---   100   100   024    -    493
  4 Start_Stop_Count        -O--C-   100   100   000    -    7
  5 Reallocated_Sector_Ct   PO--CK   100   100   005    -    0
  7 Seek_Error_Rate         PO-R--   100   100   067    -    0
  8 Seek_Time_Performance   P-S---   133   133   020    -    27
  9 Power_On_Hours          -O--C-   096   096   000    -    32010
 10 Spin_Retry_Count        PO--C-   100   100   060    -    0
 12 Power_Cycle_Count       -O--CK   100   100   000    -    7
192 Power-Off_Retract_Count -O--CK   100   100   000    -    647
193 Load_Cycle_Count        -O--C-   100   100   000    -    647
194 Temperature_Celsius     -O----   181   181   000    -    33 (Min/Max 23/37)
196 Reallocated_Event_Count -O--CK   100   100   000    -    0
197 Current_Pending_Sector  -O---K   100   100   000    -    0
198 Offline_Uncorrectable   ---R--   100   100   000    -    0
199 UDMA_CRC_Error_Count    -O-R--   200   200   000    -    0
                            ||||||_ K auto-keep
                            |||||__ C event count
                            ||||___ R error rate
                            |||____ S speed/performance
                            ||_____ O updated online
                            |______ P prefailure warning

General Purpose Log Directory Version 1
SMART           Log Directory Version 1 [multi-sector log support]
Address    Access  R/W   Size  Description
0x00       GPL,SL  R/O      1  Log Directory
0x01           SL  R/O      1  Summary SMART error log
0x03       GPL     R/O      1  Ext. Comprehensive SMART error log
0x04       GPL     R/O      7  Device Statistics log
0x06           SL  R/O      1  SMART self-test log
0x07       GPL     R/O      1  Extended self-test log
0x08       GPL     R/O      2  Power Conditions log
0x09           SL  R/W      1  Selective self-test log
0x10       GPL     R/O      1  NCQ Command Error log
0x11       GPL     R/O      1  SATA Phy Event Counters
0x20       GPL     R/O      1  Streaming performance log [OBS-8]
0x21       GPL     R/O      1  Write stream error log
0x22       GPL     R/O      1  Read stream error log
0x24       GPL     R/O     63  Current Device Internal Status Data log
0x80       GPL     R/W     63  Host vendor specific log
0x81-0x9f  GPL,SL  R/W     16  Host vendor specific log
0xe0       GPL,SL  R/W      1  SCT Command/Status
0xe1       GPL,SL  R/W      1  SCT Data Transfer

SMART Extended Comprehensive Error Log Version: 1 (1 sectors)
Device Error Count: 12 (device log contains only the most recent 4 errors)
        CR     = Command Register
        FEATR  = Features Register
        COUNT  = Count (was: Sector Count) Register
        LBA_48 = Upper bytes of LBA High/Mid/Low Registers ]  ATA-8
        LH     = LBA High (was: Cylinder High) Register    ]   LBA
        LM     = LBA Mid (was: Cylinder Low) Register      ] Register
        LL     = LBA Low (was: Sector Number) Register     ]
        DV     = Device (was: Device/Head) Register
        DC     = Device Control Register
        ER     = Error register
        ST     = Status register
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.

Error 12 [3] occurred at disk power-on lifetime: 14988 hours (624 days
+ 12 hours)
  When the command that caused the error occurred, the device was
active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 51 03 ba 00 00 15 53 13 7d 05 00  Error: UNC 954 sectors at
LBA = 0x1553137d = 357766013

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --  ---------------  --------------------
  25 00 00 04 00 00 00 15 53 13 37 e0 08  2d+22:13:36.175  READ DMA EXT
  27 00 00 00 00 00 00 00 00 00 00 e0 08  2d+22:13:36.171  READ NATIVE
MAX ADDRESS EXT [OBS-ACS-3]
  ec 03 00 00 00 00 00 00 00 00 00 a0 08  2d+22:13:36.168  IDENTIFY DEVICE
  ef 00 03 00 46 e0 88 af 00 00 00 a0 08  2d+22:13:36.165  SET
FEATURES [Set transfer mode]
  27 00 00 00 00 00 00 00 00 00 00 e0 08  2d+22:13:36.162  READ NATIVE
MAX ADDRESS EXT [OBS-ACS-3]

Error 11 [2] occurred at disk power-on lifetime: 14988 hours (624 days
+ 12 hours)
  When the command that caused the error occurred, the device was
active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 51 03 ba 00 00 15 53 13 7d 05 00  Error: UNC 954 sectors at
LBA = 0x1553137d = 357766013

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --  ---------------  --------------------
  25 00 00 04 00 00 00 15 53 13 37 e0 08  2d+22:13:32.148  READ DMA EXT
  27 00 00 00 00 00 00 00 00 00 00 e0 08  2d+22:13:32.143  READ NATIVE
MAX ADDRESS EXT [OBS-ACS-3]
  ec 03 00 00 00 00 00 00 00 00 00 a0 08  2d+22:13:32.140  IDENTIFY DEVICE
  ef 00 03 00 46 e0 88 af 00 00 00 a0 08  2d+22:13:32.137  SET
FEATURES [Set transfer mode]
  27 00 00 00 00 00 00 00 00 00 00 e0 08  2d+22:13:32.133  READ NATIVE
MAX ADDRESS EXT [OBS-ACS-3]

Error 10 [1] occurred at disk power-on lifetime: 14988 hours (624 days
+ 12 hours)
  When the command that caused the error occurred, the device was
active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 51 03 ba 00 00 15 53 13 7d 05 00  Error: UNC 954 sectors at
LBA = 0x1553137d = 357766013

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --  ---------------  --------------------
  25 00 00 04 00 00 00 15 53 13 37 e0 08  2d+22:13:28.564  READ DMA EXT
  27 00 00 00 00 00 00 00 00 00 00 e0 08  2d+22:13:28.560  READ NATIVE
MAX ADDRESS EXT [OBS-ACS-3]
  ec 03 00 00 00 00 00 00 00 00 00 a0 08  2d+22:13:28.556  IDENTIFY DEVICE
  ef 00 03 00 46 e0 88 af 00 00 00 a0 08  2d+22:13:28.553  SET
FEATURES [Set transfer mode]
  27 00 00 00 00 00 00 00 00 00 00 e0 08  2d+22:13:28.550  READ NATIVE
MAX ADDRESS EXT [OBS-ACS-3]

Error 9 [0] occurred at disk power-on lifetime: 14988 hours (624 days
+ 12 hours)
  When the command that caused the error occurred, the device was
active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 51 03 ba 00 00 15 53 13 7d 05 00  Error: UNC 954 sectors at
LBA = 0x1553137d = 357766013

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --  ---------------  --------------------
  25 00 00 04 00 00 00 15 53 13 37 e0 08  2d+22:13:24.974  READ DMA EXT
  27 00 00 00 00 00 00 00 00 00 00 e0 08  2d+22:13:24.971  READ NATIVE
MAX ADDRESS EXT [OBS-ACS-3]
  ec 03 00 00 00 00 00 00 00 00 00 a0 08  2d+22:13:24.967  IDENTIFY DEVICE
  ef 00 03 00 46 e0 88 af 00 00 00 a0 08  2d+22:13:24.967  SET
FEATURES [Set transfer mode]
  27 00 00 00 00 00 00 00 00 00 00 e0 08  2d+22:13:24.963  READ NATIVE
MAX ADDRESS EXT [OBS-ACS-3]

SMART Extended Self-test Log Version: 1 (1 sectors)
Num  Test_Description    Status                  Remaining
LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed without error       00%     31921         -
# 2  Extended offline    Completed without error       00%     31753         -
# 3  Extended offline    Completed without error       00%     31585         -
# 4  Extended offline    Completed without error       00%     31417         -
# 5  Extended offline    Completed without error       00%     31249         -
# 6  Extended offline    Completed without error       00%     31081         -
# 7  Extended offline    Completed without error       00%     30913         -
# 8  Extended offline    Completed without error       00%     30745         -
# 9  Extended offline    Completed without error       00%     30576         -
#10  Extended offline    Completed without error       00%     30409         -
#11  Extended offline    Completed without error       00%     30241         -
#12  Extended offline    Completed without error       00%     30073         -
#13  Extended offline    Completed without error       00%     29905         -
#14  Extended offline    Completed without error       00%     29737         -
#15  Extended offline    Completed without error       00%     29569         -
#16  Extended offline    Completed without error       00%     29401         -
#17  Extended offline    Completed without error       00%     29233         -
#18  Extended offline    Completed without error       00%     29065         -
#19  Extended offline    Completed without error       00%     28897         -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

SCT Status Version:                  3
SCT Version (vendor specific):       256 (0x0100)
SCT Support Level:                   1
Device State:                        SMART Off-line Data Collection
executing in background (4)
Current Temperature:                    33 Celsius
Power Cycle Min/Max Temperature:     27/34 Celsius
Lifetime    Min/Max Temperature:     23/37 Celsius
Under/Over Temperature Limit Count:   0/0
SCT Temperature History Version:     2
Temperature Sampling Period:         1 minute
Temperature Logging Interval:        1 minute
Min/Max recommended Temperature:      0/60 Celsius
Min/Max Temperature Limit:           -40/70 Celsius
Temperature History Size (Index):    128 (111)

Index    Estimated Time   Temperature Celsius
 112    2020-07-21 10:40    33  **************
 ...    ..(111 skipped).    ..  **************
  96    2020-07-21 12:32    33  **************
  97    2020-07-21 12:33    34  ***************
 ...    ..(  5 skipped).    ..  ***************
 103    2020-07-21 12:39    34  ***************
 104    2020-07-21 12:40    33  **************
 ...    ..(  6 skipped).    ..  **************
 111    2020-07-21 12:47    33  **************

SCT Error Recovery Control:
           Read: Disabled
          Write: Disabled

Device Statistics (GP Log 0x04)
Page Offset Size         Value  Description
  1  =====  =                =  == General Statistics (rev 1) ==
  1  0x008  4                7  Lifetime Power-On Resets
  1  0x010  4            32010  Power-on Hours
  1  0x018  6       7079444176  Logical Sectors Written
  1  0x020  6         32145267  Number of Write Commands
  1  0x028  6     183726144100  Logical Sectors Read
  1  0x030  6        193643146  Number of Read Commands
  3  =====  =                =  == Rotating Media Statistics (rev 1) ==
  3  0x008  4            32005  Spindle Motor Power-on Hours
  3  0x010  4            32005  Head Flying Hours
  3  0x018  4              647  Head Load Events
  3  0x020  4                0  Number of Reallocated Logical Sectors
  3  0x028  4              176  Read Recovery Attempts
  3  0x030  4                0  Number of Mechanical Start Failures
  4  =====  =                =  == General Errors Statistics (rev 1) ==
  4  0x008  4                0  Number of Reported Uncorrectable Errors
  4  0x010  4                0  Resets Between Cmd Acceptance and Completion
  5  =====  =                =  == Temperature Statistics (rev 1) ==
  5  0x008  1               33  Current Temperature
  5  0x010  1               33~ Average Short Term Temperature
  5  0x018  1               31~ Average Long Term Temperature
  5  0x020  1               37  Highest Temperature
  5  0x028  1               23  Lowest Temperature
  5  0x030  1               35~ Highest Average Short Term Temperature
  5  0x038  1               25~ Lowest Average Short Term Temperature
  5  0x040  1               33~ Highest Average Long Term Temperature
  5  0x048  1               25~ Lowest Average Long Term Temperature
  5  0x050  4                0  Time in Over-Temperature
  5  0x058  1               60  Specified Maximum Operating Temperature
  5  0x060  4                0  Time in Under-Temperature
  5  0x068  1                0  Specified Minimum Operating Temperature
  6  =====  =                =  == Transport Statistics (rev 1) ==
  6  0x008  4              184  Number of Hardware Resets
  6  0x010  4              129  Number of ASR Events
  6  0x018  4                0  Number of Interface CRC Errors
                              |_ ~ normalized value

SATA Phy Event Counters (GP Log 0x11)
ID      Size     Value  Description
0x0001  2            0  Command failed due to ICRC error
0x0002  2            0  R_ERR response for data FIS
0x0003  2            0  R_ERR response for device-to-host data FIS
0x0004  2            0  R_ERR response for host-to-device data FIS
0x0005  2            0  R_ERR response for non-data FIS
0x0006  2            0  R_ERR response for device-to-host non-data FIS
0x0007  2            0  R_ERR response for host-to-device non-data FIS
0x0009  2           25  Transition from drive PhyRdy to drive PhyNRdy
0x000a  2           22  Device-to-host register FISes sent due to a COMRESET
0x000b  2            0  CRC errors within host-to-device FIS
0x000d  2            0  Non-CRC errors within host-to-device FIS






On Wed, Jul 22, 2020 at 2:14 AM Wols Lists <antlists@youngman.org.uk> wrote:
>
> On 22/07/20 08:41, Cory Derenburger wrote:
> > My server lost power this morning. The server is running Linux Mint
> > (14?) on a battery backup and I believe it shutdown before losing
> > power. Upon restarting the server the computer hung for a while, and
> > after resetting and booting up in recovery mode my RAID is now
> > nonfunctional.
> >
> > The server was set up years ago with a RAID 6 array built with mdadm.
> > To be honest I don't really know what is wrong with the array, it
> > seems to be an issue with disk sdc. I wanted to reach out for help to
> > confirm the issue and get some guidance before proceeding (or making
> > things worse).
> >
> > Any assistance that can help me determine what steps to take to get
> > this server back up and running would be greatly appreciated. It's
> > been 4+ since I have touched RAID, and only attempted a recovery once.
> > If anyone can help I would be super appreciative.
>
> https://raid.wiki.kernel.org/index.php/Linux_Raid#When_Things_Go_Wrogn
> https://raid.wiki.kernel.org/index.php/Asking_for_help
>
> I see you've included some stuff which is helpful, but can you do
> everything that last page asks for. In particular, lsdrv.
> >
> > Below I'm including outputs from various commands for the 3rd disk
> > which seems to be the culprit
> >
> > dmesg - boot section section where first errors begin occurring
> > [    2.637856] md: bind<sdd1>
> > [    2.646987] random: nonblocking pool is initialized
> > [    2.647432] md: bind<sde1>
> > [    2.651429] md: bind<sdb1>
> > [    2.863538] ata3.00: exception Emask 0x0 SAct 0x10 SErr 0x0 action 0x0
> > [    2.863594] ata3.00: irq_stat 0x40000008
> > [    2.863643] ata3.00: failed command: READ FPDMA QUEUED
> > [    2.863695] ata3.00: cmd 60/08:20:08:08:00/00:00:00:00:00/40 tag 4
> > ncq 4096 in
> > [    2.863695]          res 41/40:00:09:08:00/00:00:00:00:00/40 Emask
> > 0x409 (media error) <F>
> > [    2.863775] ata3.00: status: { DRDY ERR }
> > [    2.863822] ata3.00: error: { UNC }
> > [    2.873407] ata3.00: configured for UDMA/133
> > [    2.873476] sd 2:0:0:0: [sdc] Unhandled sense code
> > [    2.873525] sd 2:0:0:0: [sdc]
> > [    2.873571] Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE
> > [    2.873619] sd 2:0:0:0: [sdc]
> > [    2.873665] Sense Key : Medium Error [current] [descriptor]
> > [    2.873819] Descriptor sense data with sense descriptors (in hex):
> > [    2.873901]         72 03 11 04 00 00 00 0c 00 0a 80 00 00 00 00 00
> > [    2.874544]         00 00 08 09
> > [    2.874764] sd 2:0:0:0: [sdc]
> > [    2.874811] Add. Sense: Unrecovered read error - auto reallocate failed
> > [    2.874895] sd 2:0:0:0: [sdc] CDB:
> > [    2.874941] Read(10): 28 00 00 00 08 08 00 00 08 00
> > [    2.875428] end_request: I/O error, dev sdc, sector 2057
> > [    2.875478] Buffer I/O error on device sdc1, logical block 1
> >
> > cat /proc/mdstat
> > Personalities : [linear] [multipath] [raid0] [raid1] [raid6] [raid5]
> > [raid4] [raid10]
> > md0 : inactive sdb1[0](S) sde1[3](S) sdd1[2](S)
> >       5860147464 blocks super 1.2
> >
> > {not sure why these drives are now showing as spares}
>
> This is very common when an array fails to assemble properly.
> Unfortunately, when there's one error, it often triggers a cascade of
> fake errors, and this is probably the case here.
> >
> > Below running mdstat for sdc.  Checking sdb, sdd, sde appear fine.
> >
> > mdadm --examine /dev/sdc
> > /dev/sdc:   MBR Magic : aa55
> > Partition[0] :   3907027120 sectors at         2048 (type fd)
> >
> > mdadm --examine /dev/sdc1
> > mdadm: No md superblock detected on /dev/sdc1.
> >
> > fdisk -l
> > Disk /dev/sdb: 2000.4 GB, 2000398934016 bytes
> > 81 heads, 63 sectors/track, 765633 cylinders, total 3907029168 sectors
> > Units = sectors of 1 * 512 = 512 bytes
> > Sector size (logical/physical): 512 bytes / 512 bytes
> > I/O size (minimum/optimal): 512 bytes / 512 bytes
> > Disk identifier: 0x38389fdc
> >
> >    Device Boot      Start         End      Blocks   Id  System
> > /dev/sdb1            2048  3907029167  1953513560   fd  Linux raid autodetect
> >
> > Disk /dev/sdc: 2000.4 GB, 2000398934016 bytes
> > 81 heads, 63 sectors/track, 765633 cylinders, total 3907029168 sectors
> > Units = sectors of 1 * 512 = 512 bytes
> > Sector size (logical/physical): 512 bytes / 512 bytes
> > I/O size (minimum/optimal): 512 bytes / 512 bytes
> > Disk identifier: 0xd108824d
> >
> >    Device Boot      Start         End      Blocks   Id  System
> > /dev/sdc1            2048  3907029167  1953513560   fd  Linux raid autodetect
> >
> > Disk /dev/sdd: 2000.4 GB, 2000398934016 bytes
> > 81 heads, 63 sectors/track, 765633 cylinders, total 3907029168 sectors
> > Units = sectors of 1 * 512 = 512 bytes
> > Sector size (logical/physical): 512 bytes / 512 bytes
> > I/O size (minimum/optimal): 512 bytes / 512 bytes
> > Disk identifier: 0x6207659a
> >
> >    Device Boot      Start         End      Blocks   Id  System
> > /dev/sdd1            2048  3907029167  1953513560   fd  Linux raid autodetect
> >
> > Disk /dev/sde: 2000.4 GB, 2000398934016 bytes
> > 81 heads, 63 sectors/track, 765633 cylinders, total 3907029168 sectors
> > Units = sectors of 1 * 512 = 512 bytes
> > Sector size (logical/physical): 512 bytes / 512 bytes
> > I/O size (minimum/optimal): 512 bytes / 512 bytes
> > Disk identifier: 0xd9a4afcf
> >
> >    Device Boot      Start         End      Blocks   Id  System
> > /dev/sde1            2048  3907029167  1953513560   fd  Linux raid autodetect
> >
> >
> > Is there other information needed to determine the issue?  Where do I
> > go from here?
> >
> How old is linux mint? Have you kept it up-to-date? Unfortunately, it
> seems a lot of older systems suffer issues when the kernel is heavily
> patched and mdadm is not updated, and this regularly surfaces on this
> list where Ubuntu is concerned ...
>
> mdadm --version
> uname -a
>
> Make sure you have a "latest and greatest" rescue disk to hand, and
> we'll see what the others say.
>
> Cheers,
> Wol
>

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: Software RAID6 broke after power outage
  2020-07-22  7:41 ` Software RAID6 broke after power outage Cory Derenburger
  2020-07-22  9:14   ` Wols Lists
@ 2020-07-22 19:03   ` Ian Pilcher
  1 sibling, 0 replies; 6+ messages in thread
From: Ian Pilcher @ 2020-07-22 19:03 UTC (permalink / raw)
  To: linux-raid

On 7/22/20 2:41 AM, Cory Derenburger wrote:
> [    2.863538] ata3.00: exception Emask 0x0 SAct 0x10 SErr 0x0 action 0x0
> [    2.863594] ata3.00: irq_stat 0x40000008
> [    2.863643] ata3.00: failed command: READ FPDMA QUEUED
> [    2.863695] ata3.00: cmd 60/08:20:08:08:00/00:00:00:00:00/40 tag 4
> ncq 4096 in
> [    2.863695]          res 41/40:00:09:08:00/00:00:00:00:00/40 Emask
> 0x409 (media error) <F>
> [    2.863775] ata3.00: status: { DRDY ERR }
> [    2.863822] ata3.00: error: { UNC }
> [    2.873407] ata3.00: configured for UDMA/133
> [    2.873476] sd 2:0:0:0: [sdc] Unhandled sense code
> [    2.873525] sd 2:0:0:0: [sdc]
> [    2.873571] Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE
> [    2.873619] sd 2:0:0:0: [sdc]
> [    2.873665] Sense Key : Medium Error [current] [descriptor]
> [    2.873819] Descriptor sense data with sense descriptors (in hex):
> [    2.873901]         72 03 11 04 00 00 00 0c 00 0a 80 00 00 00 00 00
> [    2.874544]         00 00 08 09
> [    2.874764] sd 2:0:0:0: [sdc]
> [    2.874811] Add. Sense: Unrecovered read error - auto reallocate failed
> [    2.874895] sd 2:0:0:0: [sdc] CDB:
> [    2.874941] Read(10): 28 00 00 00 08 08 00 00 08 00
> [    2.875428] end_request: I/O error, dev sdc, sector 2057

That's an error on sdc all right.

> [    2.875478] Buffer I/O error on device sdc1, logical block 1

Note that this refers to sdc1, the first partition on sdc.

> cat /proc/mdstat
> Personalities : [linear] [multipath] [raid0] [raid1] [raid6] [raid5]
> [raid4] [raid10]
> md0 : inactive sdb1[0](S) sde1[3](S) sdd1[2](S)
>        5860147464 blocks super 1.2

Note that this is again referring to *partitions* (sdb1, sde1, sdd1),
not the whole disks.  So your faulty RAID member is *sdc1* (not sdc).

> Below running mdstat for sdc.  Checking sdb, sdd, sde appear fine.
> 
> mdadm --examine /dev/sdc
> /dev/sdc:   MBR Magic : aa55
> Partition[0] :   3907027120 sectors at         2048 (type fd)

That's expected.  sdc isn't a RAID device.

> mdadm --examine /dev/sdc1
> mdadm: No md superblock detected on /dev/sdc1.

That's not good (but we already knew that the drive is unhappy).

> Is there other information needed to determine the issue?  Where do I
> go from here?

What does mdadm --detail /dev/md0 say?

-- 
========================================================================
                  In Soviet Russia, Google searches you!
========================================================================

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: Software RAID6 broke after power outage
  2020-07-22 16:29     ` Cory Derenburger
@ 2020-07-22 19:47       ` antlists
  2020-07-30 18:28         ` Cory Derenburger
  0 siblings, 1 reply; 6+ messages in thread
From: antlists @ 2020-07-22 19:47 UTC (permalink / raw)
  To: Cory Derenburger; +Cc: linux-raid

On 22/07/2020 17:29, Cory Derenburger wrote:
> Thanks Wols,
> 
> The version on Linux Mint I've been running is quite old.  Once the
> server was last configured it did not have updates.  It was put on a
> shelf and (mostly) left alone to serve files reliably for years.

That's good.
> 
> $ mdadm --version
> mdadm - v3.2.5 - 18th May 2012
> 
And that's not so good. If your root is not on the raid and the system 
actually runs, download and run the latest mdadm. That on an old kernel 
shouldn't be a problem. A franken-kernel that's been patched to buggery 
probably is.

> uname -a
> Linux LIZZY 3.16.0-38-generic #52~14.04.1-Ubuntu SMP Fri May 8
> 09:43:57 UTC 2015 x86_64 x86_64 x86_64 GNU/Linux
> 
Do you want the good news or the bad news? The good news is we can 
probably recover your data. The bad news is you're probably looking at 
replacing all your drives :-(

https://raid.wiki.kernel.org/index.php/Timeout_Mismatch

A cursory glance says you have several drives that fall foul of this. 
Again, if your system is bootable, you NEED to configure Brad's script 
to run. I'll go into it a bit deeper as I dig through your reply.
> 
> smartctrl for the drives
> # smartctl --xall /dev/sdb
> smartctl 6.2 2013-07-26 r3841 [x86_64-linux-3.16.0-38-generic] (local build)
> Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org
> 
> === START OF INFORMATION SECTION ===
> Device Model:     Hitachi HUA723020ALA641
> Serial Number:    YFGK3VXD
> LU WWN Device Id: 5 000cca 223c7c8d4
> Firmware Version: MK7OA840
> User Capacity:    2,000,398,934,016 bytes [2.00 TB]
> Sector Size:      512 bytes logical/physical
> Rotation Rate:    7200 rpm
> Device is:        Not in smartctl database [for details use: -P showall]
> ATA Version is:   ATA8-ACS T13/1699-D revision 4
> SATA Version is:  SATA 2.6, 6.0 Gb/s (current: 3.0 Gb/s)
> Local Time is:    Tue Jul 21 12:43:42 2020 PDT
> SMART support is: Available - device has SMART capability.
> SMART support is: Enabled
> AAM feature is:   Unavailable
> APM feature is:   Disabled
> Rd look-ahead is: Enabled
> Write cache is:   Enabled
> ATA Security is:  Disabled, NOT FROZEN [SEC1]
> Wt Cache Reorder: Enabled
> 
> === START OF READ SMART DATA SECTION ===
> SMART overall-health self-assessment test result: PASSED
> 
> General SMART Values:
> Offline data collection status:  (0x84) Offline data collection activity
>                                          was suspended by an
> interrupting command from host.
>                                          Auto Offline Data Collection: Enabled.
> Self-test execution status:      (   0) The previous self-test routine completed
>                                          without error or no self-test has ever
>                                          been run.
> Total time to complete Offline
> data collection:                (20116) seconds.
> Offline data collection
> capabilities:                    (0x5b) SMART execute Offline immediate.
>                                          Auto Offline data collection
> on/off support.
>                                          Suspend Offline collection upon new
>                                          command.
>                                          Offline surface scan supported.
>                                          Self-test supported.
>                                          No Conveyance Self-test supported.
>                                          Selective Self-test supported.
> SMART capabilities:            (0x0003) Saves SMART data before entering
>                                          power-saving mode.
>                                          Supports SMART auto save timer.
> Error logging capability:        (0x01) Error logging supported.
>                                          General Purpose Logging supported.
> Short self-test routine
> recommended polling time:        (   1) minutes.
> Extended self-test routine
> recommended polling time:        ( 336) minutes.
> SCT capabilities:              (0x003d) SCT Status supported.
>                                          SCT Error Recovery Control supported.
>                                          SCT Feature Control supported.
>                                          SCT Data Table supported.

GOOD. This drive is suitable for RAID.
> 
> 
> SCT Error Recovery Control:
>             Read: Disabled
>            Write: Disabled

And BAD. Brad's script should switch this on. Check that it does!
> 

> 
> # smartctl --xall /dev/sdc
> smartctl 6.2 2013-07-26 r3841 [x86_64-linux-3.16.0-38-generic] (local build)
> Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org
> 
> === START OF INFORMATION SECTION ===
> Model Family:     Western Digital Caviar Green (AF)

I don't think green drives are suitable ... but it is a Caviar, which 
have a good rep ...

> Device Model:     WDC WD20EARS-00MVWB0
> Serial Number:    WD-WCAZA1597296
> LU WWN Device Id: 5 0014ee 25a653961
> Firmware Version: 51.0AB51
> User Capacity:    2,000,398,934,016 bytes [2.00 TB]
> Sector Size:      512 bytes logical/physical
> Device is:        In smartctl database [for details use: -P show]
> ATA Version is:   ATA8-ACS (minor revision not indicated)
> SATA Version is:  SATA 2.6, 3.0 Gb/s
> Local Time is:    Tue Jul 21 12:45:57 2020 PDT
> SMART support is: Available - device has SMART capability.
> SMART support is: Enabled
> AAM feature is:   Disabled
> APM feature is:   Unavailable
> Rd look-ahead is: Enabled
> Write cache is:   Enabled
> ATA Security is:  Disabled, NOT FROZEN [SEC1]
> Wt Cache Reorder: Enabled
> 
> === START OF READ SMART DATA SECTION ===
> SMART overall-health self-assessment test result: PASSED
> 
> General SMART Values:
> Offline data collection status:  (0x82) Offline data collection activity
>                                          was completed without error.
>                                          Auto Offline Data Collection: Enabled.
> Self-test execution status:      (   0) The previous self-test routine completed
>                                          without error or no self-test has ever
>                                          been run.
> Total time to complete Offline
> data collection:                (38460) seconds.
> Offline data collection
> capabilities:                    (0x7b) SMART execute Offline immediate.
>                                          Auto Offline data collection
> on/off support.
>                                          Suspend Offline collection upon new
>                                          command.
>                                          Offline surface scan supported.
>                                          Self-test supported.
>                                          Conveyance Self-test supported.
>                                          Selective Self-test supported.
> SMART capabilities:            (0x0003) Saves SMART data before entering
>                                          power-saving mode.
>                                          Supports SMART auto save timer.
> Error logging capability:        (0x01) Error logging supported.
>                                          General Purpose Logging supported.
> Short self-test routine
> recommended polling time:        (   2) minutes.
> Extended self-test routine
> recommended polling time:        ( 371) minutes.
> Conveyance self-test routine
> recommended polling time:        (   5) minutes.
> SCT capabilities:              (0x3035) SCT Status supported.
>                                          SCT Feature Control supported.
>                                          SCT Data Table supported.

No mention of Error Recovery ... BAAAADDDDD!!!
> 
> 
> SCT Error Recovery Control command not supported

BAAAAADDDDDD!!!!

Greens aren't suitable - and this is sdc, the dodgy drive, so I suspect 
using it in a raid array has knackered it.
> 
> # smartctl --xall /dev/sdd
> smartctl 6.2 2013-07-26 r3841 [x86_64-linux-3.16.0-38-generic] (local build)
> Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org
> 
> === START OF INFORMATION SECTION ===
> Device Model:     Hitachi HUA723020ALA641
> Serial Number:    YFHK9JAA
> LU WWN Device Id: 5 000cca 223d5f593
> Firmware Version: MK7OA840
> User Capacity:    2,000,398,934,016 bytes [2.00 TB]
> Sector Size:      512 bytes logical/physical
> Rotation Rate:    7200 rpm
> Device is:        Not in smartctl database [for details use: -P showall]
> ATA Version is:   ATA8-ACS T13/1699-D revision 4
> SATA Version is:  SATA 2.6, 6.0 Gb/s (current: 3.0 Gb/s)
> Local Time is:    Tue Jul 21 12:47:13 2020 PDT
> SMART support is: Available - device has SMART capability.
> SMART support is: Enabled
> AAM feature is:   Unavailable
> APM feature is:   Disabled
> Rd look-ahead is: Enabled
> Write cache is:   Enabled
> ATA Security is:  Disabled, NOT FROZEN [SEC1]
> Wt Cache Reorder: Enabled
> 
> === START OF READ SMART DATA SECTION ===
> SMART overall-health self-assessment test result: PASSED
> 
> General SMART Values:
> Offline data collection status:  (0x84) Offline data collection activity
>                                          was suspended by an
> interrupting command from host.
>                                          Auto Offline Data Collection: Enabled.
> Self-test execution status:      (   0) The previous self-test routine completed
>                                          without error or no self-test has ever
>                                          been run.
> Total time to complete Offline
> data collection:                (19618) seconds.
> Offline data collection
> capabilities:                    (0x5b) SMART execute Offline immediate.
>                                          Auto Offline data collection
> on/off support.
>                                          Suspend Offline collection upon new
>                                          command.
>                                          Offline surface scan supported.
>                                          Self-test supported.
>                                          No Conveyance Self-test supported.
>                                          Selective Self-test supported.
> SMART capabilities:            (0x0003) Saves SMART data before entering
>                                          power-saving mode.
>                                          Supports SMART auto save timer.
> Error logging capability:        (0x01) Error logging supported.
>                                          General Purpose Logging supported.
> Short self-test routine
> recommended polling time:        (   1) minutes.
> Extended self-test routine
> recommended polling time:        ( 327) minutes.
> SCT capabilities:              (0x003d) SCT Status supported.
>                                          SCT Error Recovery Control supported.

Good...

>                                          SCT Feature Control supported.
>                                          SCT Data Table supported.
> 
> 
> SCT Error Recovery Control:
>             Read: Disabled
>            Write: Disabled

And bad, but at least it's got it ... as above make sure it's enabled.
> 

> 
> # smartctl --xall /dev/sde
> smartctl 6.2 2013-07-26 r3841 [x86_64-linux-3.16.0-38-generic] (local build)
> Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org
> 
> === START OF INFORMATION SECTION ===
> Device Model:     Hitachi HUA723020ALA641

Okay, I assume this is the same as the previous drive ...

> Serial Number:    YFG7LWBA
> LU WWN Device Id: 5 000cca 223c3757b
> Firmware Version: MK7OA840
> User Capacity:    2,000,398,934,016 bytes [2.00 TB]
> Sector Size:      512 bytes logical/physical
> Rotation Rate:    7200 rpm
> Device is:        Not in smartctl database [for details use: -P showall]
> ATA Version is:   ATA8-ACS T13/1699-D revision 4
> SATA Version is:  SATA 2.6, 6.0 Gb/s (current: 3.0 Gb/s)
> Local Time is:    Tue Jul 21 12:47:56 2020 PDT
> SMART support is: Available - device has SMART capability.
> SMART support is: Enabled
> AAM feature is:   Unavailable
> APM feature is:   Disabled
> Rd look-ahead is: Enabled
> Write cache is:   Enabled
> ATA Security is:  Disabled, NOT FROZEN [SEC1]
> Wt Cache Reorder: Enabled
> 

Okay, we'll assume sdc is dead. The first thing is to try to assemble 
the remaining disks without it. Boot from a rescue disk so you've got 
the latest and greatest available. And don't forget, if you get "device 
busy", you've probably got the remains of a previous assemble messing 
things up, so you need to do a --stop. Just DON'T do a --force, not yet.

Next thing we need is the event count - I think that's mdadm --examine 
over each partition that makes the array.

And make sure you've got a replacement for that Green. You NEED to get 
rid of it.

Let's see how it goes ... if the array assembles fine with the rescue 
disk, just add your new disk and replace sdc, but make sure Brad's 
script has enabled ERC!

Cheers,
Wol

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: Software RAID6 broke after power outage
  2020-07-22 19:47       ` antlists
@ 2020-07-30 18:28         ` Cory Derenburger
  0 siblings, 0 replies; 6+ messages in thread
From: Cory Derenburger @ 2020-07-30 18:28 UTC (permalink / raw)
  To: antlists; +Cc: linux-raid

Thank you all so much for your guidance and help!  I was able to
reassemble my RAID with the use of a bootable rescue disk.  The --stop
command is what I was missing.

I ended up purchasing a Synology NAS and some Iron Wolf NAS drives to
replace this old file server.  While I could have just replaced the
drives, the file server is quite old and there are other components
that could eventually fail.  The computer was previously a desktop
twice over and is now 15 years old and deserves a retirement.

Thanks again for your help Wol!
Cory

On Wed, Jul 22, 2020 at 12:47 PM antlists <antlists@youngman.org.uk> wrote:
>
> On 22/07/2020 17:29, Cory Derenburger wrote:
> > Thanks Wols,
> >
> > The version on Linux Mint I've been running is quite old.  Once the
> > server was last configured it did not have updates.  It was put on a
> > shelf and (mostly) left alone to serve files reliably for years.
>
> That's good.
> >
> > $ mdadm --version
> > mdadm - v3.2.5 - 18th May 2012
> >
> And that's not so good. If your root is not on the raid and the system
> actually runs, download and run the latest mdadm. That on an old kernel
> shouldn't be a problem. A franken-kernel that's been patched to buggery
> probably is.
>
> > uname -a
> > Linux LIZZY 3.16.0-38-generic #52~14.04.1-Ubuntu SMP Fri May 8
> > 09:43:57 UTC 2015 x86_64 x86_64 x86_64 GNU/Linux
> >
> Do you want the good news or the bad news? The good news is we can
> probably recover your data. The bad news is you're probably looking at
> replacing all your drives :-(
>
> https://raid.wiki.kernel.org/index.php/Timeout_Mismatch
>
> A cursory glance says you have several drives that fall foul of this.
> Again, if your system is bootable, you NEED to configure Brad's script
> to run. I'll go into it a bit deeper as I dig through your reply.
> >
> > smartctrl for the drives
> > # smartctl --xall /dev/sdb
> > smartctl 6.2 2013-07-26 r3841 [x86_64-linux-3.16.0-38-generic] (local build)
> > Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org
> >
> > === START OF INFORMATION SECTION ===
> > Device Model:     Hitachi HUA723020ALA641
> > Serial Number:    YFGK3VXD
> > LU WWN Device Id: 5 000cca 223c7c8d4
> > Firmware Version: MK7OA840
> > User Capacity:    2,000,398,934,016 bytes [2.00 TB]
> > Sector Size:      512 bytes logical/physical
> > Rotation Rate:    7200 rpm
> > Device is:        Not in smartctl database [for details use: -P showall]
> > ATA Version is:   ATA8-ACS T13/1699-D revision 4
> > SATA Version is:  SATA 2.6, 6.0 Gb/s (current: 3.0 Gb/s)
> > Local Time is:    Tue Jul 21 12:43:42 2020 PDT
> > SMART support is: Available - device has SMART capability.
> > SMART support is: Enabled
> > AAM feature is:   Unavailable
> > APM feature is:   Disabled
> > Rd look-ahead is: Enabled
> > Write cache is:   Enabled
> > ATA Security is:  Disabled, NOT FROZEN [SEC1]
> > Wt Cache Reorder: Enabled
> >
> > === START OF READ SMART DATA SECTION ===
> > SMART overall-health self-assessment test result: PASSED
> >
> > General SMART Values:
> > Offline data collection status:  (0x84) Offline data collection activity
> >                                          was suspended by an
> > interrupting command from host.
> >                                          Auto Offline Data Collection: Enabled.
> > Self-test execution status:      (   0) The previous self-test routine completed
> >                                          without error or no self-test has ever
> >                                          been run.
> > Total time to complete Offline
> > data collection:                (20116) seconds.
> > Offline data collection
> > capabilities:                    (0x5b) SMART execute Offline immediate.
> >                                          Auto Offline data collection
> > on/off support.
> >                                          Suspend Offline collection upon new
> >                                          command.
> >                                          Offline surface scan supported.
> >                                          Self-test supported.
> >                                          No Conveyance Self-test supported.
> >                                          Selective Self-test supported.
> > SMART capabilities:            (0x0003) Saves SMART data before entering
> >                                          power-saving mode.
> >                                          Supports SMART auto save timer.
> > Error logging capability:        (0x01) Error logging supported.
> >                                          General Purpose Logging supported.
> > Short self-test routine
> > recommended polling time:        (   1) minutes.
> > Extended self-test routine
> > recommended polling time:        ( 336) minutes.
> > SCT capabilities:              (0x003d) SCT Status supported.
> >                                          SCT Error Recovery Control supported.
> >                                          SCT Feature Control supported.
> >                                          SCT Data Table supported.
>
> GOOD. This drive is suitable for RAID.
> >
> >
> > SCT Error Recovery Control:
> >             Read: Disabled
> >            Write: Disabled
>
> And BAD. Brad's script should switch this on. Check that it does!
> >
>
> >
> > # smartctl --xall /dev/sdc
> > smartctl 6.2 2013-07-26 r3841 [x86_64-linux-3.16.0-38-generic] (local build)
> > Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org
> >
> > === START OF INFORMATION SECTION ===
> > Model Family:     Western Digital Caviar Green (AF)
>
> I don't think green drives are suitable ... but it is a Caviar, which
> have a good rep ...
>
> > Device Model:     WDC WD20EARS-00MVWB0
> > Serial Number:    WD-WCAZA1597296
> > LU WWN Device Id: 5 0014ee 25a653961
> > Firmware Version: 51.0AB51
> > User Capacity:    2,000,398,934,016 bytes [2.00 TB]
> > Sector Size:      512 bytes logical/physical
> > Device is:        In smartctl database [for details use: -P show]
> > ATA Version is:   ATA8-ACS (minor revision not indicated)
> > SATA Version is:  SATA 2.6, 3.0 Gb/s
> > Local Time is:    Tue Jul 21 12:45:57 2020 PDT
> > SMART support is: Available - device has SMART capability.
> > SMART support is: Enabled
> > AAM feature is:   Disabled
> > APM feature is:   Unavailable
> > Rd look-ahead is: Enabled
> > Write cache is:   Enabled
> > ATA Security is:  Disabled, NOT FROZEN [SEC1]
> > Wt Cache Reorder: Enabled
> >
> > === START OF READ SMART DATA SECTION ===
> > SMART overall-health self-assessment test result: PASSED
> >
> > General SMART Values:
> > Offline data collection status:  (0x82) Offline data collection activity
> >                                          was completed without error.
> >                                          Auto Offline Data Collection: Enabled.
> > Self-test execution status:      (   0) The previous self-test routine completed
> >                                          without error or no self-test has ever
> >                                          been run.
> > Total time to complete Offline
> > data collection:                (38460) seconds.
> > Offline data collection
> > capabilities:                    (0x7b) SMART execute Offline immediate.
> >                                          Auto Offline data collection
> > on/off support.
> >                                          Suspend Offline collection upon new
> >                                          command.
> >                                          Offline surface scan supported.
> >                                          Self-test supported.
> >                                          Conveyance Self-test supported.
> >                                          Selective Self-test supported.
> > SMART capabilities:            (0x0003) Saves SMART data before entering
> >                                          power-saving mode.
> >                                          Supports SMART auto save timer.
> > Error logging capability:        (0x01) Error logging supported.
> >                                          General Purpose Logging supported.
> > Short self-test routine
> > recommended polling time:        (   2) minutes.
> > Extended self-test routine
> > recommended polling time:        ( 371) minutes.
> > Conveyance self-test routine
> > recommended polling time:        (   5) minutes.
> > SCT capabilities:              (0x3035) SCT Status supported.
> >                                          SCT Feature Control supported.
> >                                          SCT Data Table supported.
>
> No mention of Error Recovery ... BAAAADDDDD!!!
> >
> >
> > SCT Error Recovery Control command not supported
>
> BAAAAADDDDDD!!!!
>
> Greens aren't suitable - and this is sdc, the dodgy drive, so I suspect
> using it in a raid array has knackered it.
> >
> > # smartctl --xall /dev/sdd
> > smartctl 6.2 2013-07-26 r3841 [x86_64-linux-3.16.0-38-generic] (local build)
> > Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org
> >
> > === START OF INFORMATION SECTION ===
> > Device Model:     Hitachi HUA723020ALA641
> > Serial Number:    YFHK9JAA
> > LU WWN Device Id: 5 000cca 223d5f593
> > Firmware Version: MK7OA840
> > User Capacity:    2,000,398,934,016 bytes [2.00 TB]
> > Sector Size:      512 bytes logical/physical
> > Rotation Rate:    7200 rpm
> > Device is:        Not in smartctl database [for details use: -P showall]
> > ATA Version is:   ATA8-ACS T13/1699-D revision 4
> > SATA Version is:  SATA 2.6, 6.0 Gb/s (current: 3.0 Gb/s)
> > Local Time is:    Tue Jul 21 12:47:13 2020 PDT
> > SMART support is: Available - device has SMART capability.
> > SMART support is: Enabled
> > AAM feature is:   Unavailable
> > APM feature is:   Disabled
> > Rd look-ahead is: Enabled
> > Write cache is:   Enabled
> > ATA Security is:  Disabled, NOT FROZEN [SEC1]
> > Wt Cache Reorder: Enabled
> >
> > === START OF READ SMART DATA SECTION ===
> > SMART overall-health self-assessment test result: PASSED
> >
> > General SMART Values:
> > Offline data collection status:  (0x84) Offline data collection activity
> >                                          was suspended by an
> > interrupting command from host.
> >                                          Auto Offline Data Collection: Enabled.
> > Self-test execution status:      (   0) The previous self-test routine completed
> >                                          without error or no self-test has ever
> >                                          been run.
> > Total time to complete Offline
> > data collection:                (19618) seconds.
> > Offline data collection
> > capabilities:                    (0x5b) SMART execute Offline immediate.
> >                                          Auto Offline data collection
> > on/off support.
> >                                          Suspend Offline collection upon new
> >                                          command.
> >                                          Offline surface scan supported.
> >                                          Self-test supported.
> >                                          No Conveyance Self-test supported.
> >                                          Selective Self-test supported.
> > SMART capabilities:            (0x0003) Saves SMART data before entering
> >                                          power-saving mode.
> >                                          Supports SMART auto save timer.
> > Error logging capability:        (0x01) Error logging supported.
> >                                          General Purpose Logging supported.
> > Short self-test routine
> > recommended polling time:        (   1) minutes.
> > Extended self-test routine
> > recommended polling time:        ( 327) minutes.
> > SCT capabilities:              (0x003d) SCT Status supported.
> >                                          SCT Error Recovery Control supported.
>
> Good...
>
> >                                          SCT Feature Control supported.
> >                                          SCT Data Table supported.
> >
> >
> > SCT Error Recovery Control:
> >             Read: Disabled
> >            Write: Disabled
>
> And bad, but at least it's got it ... as above make sure it's enabled.
> >
>
> >
> > # smartctl --xall /dev/sde
> > smartctl 6.2 2013-07-26 r3841 [x86_64-linux-3.16.0-38-generic] (local build)
> > Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org
> >
> > === START OF INFORMATION SECTION ===
> > Device Model:     Hitachi HUA723020ALA641
>
> Okay, I assume this is the same as the previous drive ...
>
> > Serial Number:    YFG7LWBA
> > LU WWN Device Id: 5 000cca 223c3757b
> > Firmware Version: MK7OA840
> > User Capacity:    2,000,398,934,016 bytes [2.00 TB]
> > Sector Size:      512 bytes logical/physical
> > Rotation Rate:    7200 rpm
> > Device is:        Not in smartctl database [for details use: -P showall]
> > ATA Version is:   ATA8-ACS T13/1699-D revision 4
> > SATA Version is:  SATA 2.6, 6.0 Gb/s (current: 3.0 Gb/s)
> > Local Time is:    Tue Jul 21 12:47:56 2020 PDT
> > SMART support is: Available - device has SMART capability.
> > SMART support is: Enabled
> > AAM feature is:   Unavailable
> > APM feature is:   Disabled
> > Rd look-ahead is: Enabled
> > Write cache is:   Enabled
> > ATA Security is:  Disabled, NOT FROZEN [SEC1]
> > Wt Cache Reorder: Enabled
> >
>
> Okay, we'll assume sdc is dead. The first thing is to try to assemble
> the remaining disks without it. Boot from a rescue disk so you've got
> the latest and greatest available. And don't forget, if you get "device
> busy", you've probably got the remains of a previous assemble messing
> things up, so you need to do a --stop. Just DON'T do a --force, not yet.
>
> Next thing we need is the event count - I think that's mdadm --examine
> over each partition that makes the array.
>
> And make sure you've got a replacement for that Green. You NEED to get
> rid of it.
>
> Let's see how it goes ... if the array assembles fine with the rescue
> disk, just add your new disk and replace sdc, but make sure Brad's
> script has enabled ERC!
>
> Cheers,
> Wol

^ permalink raw reply	[flat|nested] 6+ messages in thread

end of thread, other threads:[~2020-07-30 18:28 UTC | newest]

Thread overview: 6+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
     [not found] <CA+CBf3QZP4Yss0U=6Aa_5a+3D2Yy-WT545VazHiFWCZsreNOEg@mail.gmail.com>
2020-07-22  7:41 ` Software RAID6 broke after power outage Cory Derenburger
2020-07-22  9:14   ` Wols Lists
2020-07-22 16:29     ` Cory Derenburger
2020-07-22 19:47       ` antlists
2020-07-30 18:28         ` Cory Derenburger
2020-07-22 19:03   ` Ian Pilcher

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.