* Seagate hard disk firmware issue
From: BU66ER BAD6ER @ 2011-01-23 11:05 UTC (permalink / raw)
  To: linux-ide

Hi,

Four weeks ago I bought a new 2TB Seagate Barracuda internal SATA
drive. That drive has two 667GB ext4 partitions (the remaining 667GB
is unused) and is used for storage. My main system (Debian Sid 64-bit
with KDE) resides on a 40GB SSD, also using ext4.

Two weeks ago I noticed a severe performance drop: no file manager
could list directories on the 2TB disk without a delay of one or two
minutes. Since then I have had three hard freezes of that disk and
the entire system. Before each freeze there is a lot of disk
activity, and in the end I have to power the machine off. I have now
also made a backup of /dev/sdb1 in case the problem turns out to be
fatal.

Someone on the #debian IRC channel recommended changing the
spindown_time setting, but that only helped for a few days. Yesterday
the third freeze came, and the system would not even recognize the
disk after a reboot; the drive just kept 'clicking' while waiting for
a response. I showed the kern.log to someone on the same channel, who
concluded that this is probably a firmware issue.

Below is the latest kern.log, which may help identify the issue; I
hope it contains the relevant details. But first, the output of
smartctl -a /dev/sdb.

I have now set the hdparm spindown_time to 0, disabling disk sleep,
which seems to have been the culprit, judging by messages in the
Dolphin file manager etc.
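
As a rough cross-check of the disk-sleep suspicion, two SMART counters
from the smartctl output in this message can be turned into a spin-cycle
rate. This is only a minimal Python sketch: it assumes Start_Stop_Count
(1160) and Power_On_Hours (307) mean what their names suggest on this
drive, and the function name is purely illustrative.

```python
def cycles_per_hour(start_stop_count: int, power_on_hours: int) -> float:
    """Average number of start/stop cycles per powered-on hour."""
    if power_on_hours <= 0:
        raise ValueError("power_on_hours must be positive")
    return start_stop_count / power_on_hours


# Values copied from the smartctl -a /dev/sdb output in this message.
rate = cycles_per_hour(start_stop_count=1160, power_on_hours=307)
print(f"{rate:.1f} start/stop cycles per power-on hour")
```

That works out to roughly 3.8 cycles per power-on hour, which would be
consistent with frequent spin-downs but does not by itself show that
spin-down causes the freezes.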

Thanks for any help!


> # smartctl -a /dev/sdb
> smartctl 5.40 2010-07-12 r3124 [x86_64-unknown-linux-gnu] (local build)
> Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net
>
> === START OF INFORMATION SECTION ===
> Model Family:     Seagate Barracuda LP
> Device Model:     ST32000542AS
> Serial Number:    5XW20H7P
> Firmware Version: CC34
> User Capacity:    2,000,398,934,016 bytes
> Device is:        In smartctl database [for details use: -P show]
> ATA Version is:   8
> ATA Standard is:  ATA-8-ACS revision 4
> Local Time is:    Sun Jan 23 11:50:56 2011 CET
> SMART support is: Available - device has SMART capability.
> SMART support is: Enabled
>
> === START OF READ SMART DATA SECTION ===
> SMART overall-health self-assessment test result: PASSED
>
> General SMART Values:
> Offline data collection status:  (0x00) Offline data collection activity
>                                         was never started.
>                                         Auto Offline Data Collection: Disabled.
> Self-test execution status:      (   0) The previous self-test routine completed
>                                         without error or no self-test has ever
>                                         been run.
> Total time to complete Offline
> data collection:                 ( 633) seconds.
> Offline data collection
> capabilities:                    (0x73) SMART execute Offline immediate.
>                                         Auto Offline data collection on/off support.
>                                         Suspend Offline collection upon new
>                                         command.
>                                         No Offline surface scan supported.
>                                         Self-test supported.
>                                         Conveyance Self-test supported.
>                                         Selective Self-test supported.
> SMART capabilities:            (0x0003) Saves SMART data before entering
>                                         power-saving mode.
>                                         Supports SMART auto save timer.
> Error logging capability:        (0x01) Error logging supported.
>                                         General Purpose Logging supported.
> Short self-test routine
> recommended polling time:        (   1) minutes.
> Extended self-test routine
> recommended polling time:        ( 255) minutes.
> Conveyance self-test routine
> recommended polling time:        (   2) minutes.
> SCT capabilities:              (0x103f) SCT Status supported.
>                                         SCT Error Recovery Control supported.
>                                         SCT Feature Control supported.
>                                         SCT Data Table supported.
>
> SMART Attributes Data Structure revision number: 10
> Vendor Specific SMART Attributes with Thresholds:
> ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
>   1 Raw_Read_Error_Rate     0x000f   100   089   006    Pre-fail  Always       -       184733939
>   3 Spin_Up_Time            0x0003   100   100   000    Pre-fail  Always       -       0
>   4 Start_Stop_Count        0x0032   099   099   020    Old_age   Always       -       1160
>   5 Reallocated_Sector_Ct   0x0033   100   100   036    Pre-fail  Always       -       0
>   7 Seek_Error_Rate         0x000f   100   253   030    Pre-fail  Always       -       296094
>   9 Power_On_Hours          0x0032   100   100   000    Old_age   Always       -       307
>  10 Spin_Retry_Count        0x0013   100   100   097    Pre-fail  Always       -       0
>  12 Power_Cycle_Count       0x0032   099   099   020    Old_age   Always       -       1186
> 183 Runtime_Bad_Block       0x0032   100   100   000    Old_age   Always       -       0
> 184 End-to-End_Error        0x0032   100   100   099    Old_age   Always       -       0
> 187 Reported_Uncorrect      0x0032   001   001   000    Old_age   Always       -       225
> 188 Command_Timeout         0x0032   100   099   000    Old_age   Always       -       4295032833
> 189 High_Fly_Writes         0x003a   100   100   000    Old_age   Always       -       0
> 190 Airflow_Temperature_Cel 0x0022   063   063   045    Old_age   Always       -       37 (Lifetime Min/Max 19/37)
> 194 Temperature_Celsius     0x0022   037   040   000    Old_age   Always       -       37 (0 16 0 0)
> 195 Hardware_ECC_Recovered  0x001a   052   033   000    Old_age   Always       -       184733939
> 197 Current_Pending_Sector  0x0012   100   099   000    Old_age   Always       -       40
> 198 Offline_Uncorrectable   0x0010   100   099   000    Old_age   Offline      -       40
> 199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age   Always       -       0
> 240 Head_Flying_Hours       0x0000   100   253   000    Old_age   Offline      -       210019605808507
> 241 Total_LBAs_Written      0x0000   100   253   000    Old_age   Offline      -       546635184
> 242 Total_LBAs_Read         0x0000   100   253   000    Old_age   Offline      -       2195715347
>
> SMART Error Log Version: 1
> ATA Error Count: 303 (device log contains only the most recent five errors)
>         CR = Command Register [HEX]
>         FR = Features Register [HEX]
>         SC = Sector Count Register [HEX]
>         SN = Sector Number Register [HEX]
>         CL = Cylinder Low Register [HEX]
>         CH = Cylinder High Register [HEX]
>         DH = Device/Head Register [HEX]
>         DC = Device Command Register [HEX]
>         ER = Error register [HEX]
>         ST = Status register [HEX]
> Powered_Up_Time is measured from power on, and printed as
> DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
> SS=sec, and sss=millisec. It "wraps" after 49.710 days.
>
> Error 303 occurred at disk power-on lifetime: 305 hours (12 days + 17 hours)
>   When the command that caused the error occurred, the device was active or idle.
>
>   After command completion occurred, registers were:
>   ER ST SC SN CL CH DH
>   -- -- -- -- -- -- --
>   40 51 00 58 e9 a8 00  Error: UNC at LBA = 0x00a8e958 = 11069784
>
>   Commands leading to the command that caused the error were:
>   CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
>   -- -- -- -- -- -- -- --  ----------------  --------------------
>   c8 00 08 57 e9 a8 e0 00      02:02:27.917  READ DMA
>   27 00 00 00 00 00 e0 00      02:02:27.916  READ NATIVE MAX ADDRESS EXT
>   ec 00 00 00 00 00 a0 00      02:02:27.908  IDENTIFY DEVICE
>   ef 03 46 00 00 00 a0 00      02:02:27.877  SET FEATURES [Set transfer mode]
>   27 00 00 00 00 00 e0 00      02:02:27.788  READ NATIVE MAX ADDRESS EXT
>
> Error 302 occurred at disk power-on lifetime: 305 hours (12 days + 17 hours)
>   When the command that caused the error occurred, the device was active or idle.
>
>   After command completion occurred, registers were:
>   ER ST SC SN CL CH DH
>   -- -- -- -- -- -- --
>   40 51 00 58 e9 a8 00  Error: UNC at LBA = 0x00a8e958 = 11069784
>
>   Commands leading to the command that caused the error were:
>   CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
>   -- -- -- -- -- -- -- --  ----------------  --------------------
>   c8 00 08 57 e9 a8 e0 00      02:02:24.081  READ DMA
>   27 00 00 00 00 00 e0 00      02:02:24.080  READ NATIVE MAX ADDRESS EXT
>   ec 00 00 00 00 00 a0 00      02:02:24.071  IDENTIFY DEVICE
>   ef 03 46 00 00 00 a0 00      02:02:24.066  SET FEATURES [Set transfer mode]
>   27 00 00 00 00 00 e0 00      02:02:24.044  READ NATIVE MAX ADDRESS EXT
>
> Error 301 occurred at disk power-on lifetime: 305 hours (12 days + 17 hours)
>   When the command that caused the error occurred, the device was active or idle.
>
>   After command completion occurred, registers were:
>   ER ST SC SN CL CH DH
>   -- -- -- -- -- -- --
>   40 51 00 58 e9 a8 00  Error: UNC at LBA = 0x00a8e958 = 11069784
>
>   Commands leading to the command that caused the error were:
>   CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
>   -- -- -- -- -- -- -- --  ----------------  --------------------
>   c8 00 08 57 e9 a8 e0 00      02:02:20.320  READ DMA
>   27 00 00 00 00 00 e0 00      02:02:20.319  READ NATIVE MAX ADDRESS EXT
>   ec 00 00 00 00 00 a0 00      02:02:20.310  IDENTIFY DEVICE
>   ef 03 46 00 00 00 a0 00      02:02:20.302  SET FEATURES [Set transfer mode]
>   27 00 00 00 00 00 e0 00      02:02:20.279  READ NATIVE MAX ADDRESS EXT
>
> Error 300 occurred at disk power-on lifetime: 305 hours (12 days + 17 hours)
>   When the command that caused the error occurred, the device was active or idle.
>
>   After command completion occurred, registers were:
>   ER ST SC SN CL CH DH
>   -- -- -- -- -- -- --
>   40 51 00 58 e9 a8 00  Error: UNC at LBA = 0x00a8e958 = 11069784
>
>   Commands leading to the command that caused the error were:
>   CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
>   -- -- -- -- -- -- -- --  ----------------  --------------------
>   c8 00 08 57 e9 a8 e0 00      02:02:16.559  READ DMA
>   27 00 00 00 00 00 e0 00      02:02:16.558  READ NATIVE MAX ADDRESS EXT
>   ec 00 00 00 00 00 a0 00      02:02:16.546  IDENTIFY DEVICE
>   ef 03 46 00 00 00 a0 00      02:02:16.538  SET FEATURES [Set transfer mode]
>   27 00 00 00 00 00 e0 00      02:02:16.506  READ NATIVE MAX ADDRESS EXT
>
> Error 299 occurred at disk power-on lifetime: 305 hours (12 days + 17 hours)
>   When the command that caused the error occurred, the device was active or idle.
>
>   After command completion occurred, registers were:
>   ER ST SC SN CL CH DH
>   -- -- -- -- -- -- --
>   40 51 00 58 e9 a8 00  Error: UNC at LBA = 0x00a8e958 = 11069784
>
>   Commands leading to the command that caused the error were:
>   CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
>   -- -- -- -- -- -- -- --  ----------------  --------------------
>   c8 00 08 57 e9 a8 e0 00      02:02:12.770  READ DMA
>   27 00 00 00 00 00 e0 00      02:02:12.769  READ NATIVE MAX ADDRESS EXT
>   ec 00 00 00 00 00 a0 00      02:02:12.761  IDENTIFY DEVICE
>   ef 03 46 00 00 00 a0 00      02:02:12.754  SET FEATURES [Set transfer mode]
>   27 00 00 00 00 00 e0 00      02:02:12.725  READ NATIVE MAX ADDRESS EXT
>
> SMART Self-test log structure revision number 1
> No self-tests have been logged.  [To run self-tests, use: smartctl -t]
>
>
> SMART Selective self-test log data structure revision number 1
>  SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
>     1        0        0  Not_testing
>     2        0        0  Not_testing
>     3        0        0  Not_testing
>     4        0        0  Not_testing
>     5        0        0  Not_testing
> Selective self-test flags (0x0):
>   After scanning selected spans, do NOT read-scan remainder of disk.
> If Selective self-test is pending on power-up, resume after 0 minute delay.
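
The register dump in each error entry above can be decoded by hand. The
following Python sketch shows, as an illustration only, how the
"ER ST SC SN CL CH DH" bytes map to the LBA and to the error/status
flags. It assumes the classic ATA LBA28 register layout and 512-byte
logical sectors (the size kern.log reports for sdb); the function and
table names are made up for this example.

```python
# Bit names for the ATA Error and Status registers (classic layout).
ERROR_BITS = {0x80: "ICRC", 0x40: "UNC", 0x10: "IDNF", 0x04: "ABRT"}
STATUS_BITS = {0x80: "BSY", 0x40: "DRDY", 0x20: "DF",
               0x10: "DSC", 0x08: "DRQ", 0x01: "ERR"}


def decode_smart_error(regs: str) -> dict:
    """Decode the 'ER ST SC SN CL CH DH' hex bytes of one error entry."""
    er, st, sc, sn, cl, ch, dh = (int(b, 16) for b in regs.split())
    # LBA28: low nibble of DH holds bits 27..24, followed by CH, CL, SN.
    lba = ((dh & 0x0F) << 24) | (ch << 16) | (cl << 8) | sn
    return {
        "lba": lba,
        "byte_offset": lba * 512,  # assumes 512-byte logical sectors
        "error": [name for bit, name in ERROR_BITS.items() if er & bit],
        "status": [name for bit, name in STATUS_BITS.items() if st & bit],
    }


info = decode_smart_error("40 51 00 58 e9 a8 00")
print(info)
```

For the entries above this yields LBA 11069784 (0x00a8e958) with the UNC
flag set, matching the text smartctl prints. All five logged errors point
at the same sector, and the preceding READ DMA (c8 00 08 57 e9 a8 e0)
starts one sector earlier with a count of 8.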

And here is the output of kern.log:

Jan 22 07:26:59 my kernel: [    0.439387] SCSI subsystem initialized
Jan 22 07:26:59 my kernel: [    0.443964] ehci_hcd 0000:00:1d.7: USB
2.0 started, EHCI 1.00
Jan 22 07:26:59 my kernel: [    0.443981] usb usb1: New USB device
found, idVendor=1d6b, idProduct=0002
Jan 22 07:26:59 my kernel: [    0.443983] usb usb1: New USB device
strings: Mfr=3, Product=2, SerialNumber=1
Jan 22 07:26:59 my kernel: [    0.443984] usb usb1: Product: EHCI Host
Controller
Jan 22 07:26:59 my kernel: [    0.443986] usb usb1: Manufacturer:
Linux 2.6.32-5-amd64 ehci_hcd
Jan 22 07:26:59 my kernel: [    0.443987] usb usb1: SerialNumber: 0000:00:1d.7
Jan 22 07:26:59 my kernel: [    0.444064] usb usb1: configuration #1
chosen from 1 choice
Jan 22 07:26:59 my kernel: [    0.444105] hub 1-0:1.0: USB hub found
Jan 22 07:26:59 my kernel: [    0.444110] hub 1-0:1.0: 8 ports detected
Jan 22 07:26:59 my kernel: [    0.444170] uhci_hcd 0000:00:1d.0: PCI
INT A -> GSI 23 (level, low) -> IRQ 23
Jan 22 07:26:59 my kernel: [    0.444176] uhci_hcd 0000:00:1d.0:
setting latency timer to 64
Jan 22 07:26:59 my kernel: [    0.444178] uhci_hcd 0000:00:1d.0: UHCI
Host Controller
Jan 22 07:26:59 my kernel: [    0.444184] uhci_hcd 0000:00:1d.0: new
USB bus registered, assigned bus number 2
Jan 22 07:26:59 my kernel: [    0.444203] uhci_hcd 0000:00:1d.0: irq
23, io base 0x0000b880
Jan 22 07:26:59 my kernel: [    0.444225] usb usb2: New USB device
found, idVendor=1d6b, idProduct=0001
Jan 22 07:26:59 my kernel: [    0.444227] usb usb2: New USB device
strings: Mfr=3, Product=2, SerialNumber=1
Jan 22 07:26:59 my kernel: [    0.444229] usb usb2: Product: UHCI Host
Controller
Jan 22 07:26:59 my kernel: [    0.444230] usb usb2: Manufacturer:
Linux 2.6.32-5-amd64 uhci_hcd
Jan 22 07:26:59 my kernel: [    0.444231] usb usb2: SerialNumber: 0000:00:1d.0
Jan 22 07:26:59 my kernel: [    0.444268] usb usb2: configuration #1
chosen from 1 choice
Jan 22 07:26:59 my kernel: [    0.444285] hub 2-0:1.0: USB hub found
Jan 22 07:26:59 my kernel: [    0.444289] hub 2-0:1.0: 2 ports detected
Jan 22 07:26:59 my kernel: [    0.444323]   alloc irq_desc for 19 on node -1
Jan 22 07:26:59 my kernel: [    0.444325]   alloc kstat_irqs on node -1
Jan 22 07:26:59 my kernel: [    0.444329] uhci_hcd 0000:00:1d.1: PCI
INT B -> GSI 19 (level, low) -> IRQ 19
Jan 22 07:26:59 my kernel: [    0.444333] uhci_hcd 0000:00:1d.1:
setting latency timer to 64
Jan 22 07:26:59 my kernel: [    0.444335] uhci_hcd 0000:00:1d.1: UHCI
Host Controller
Jan 22 07:26:59 my kernel: [    0.444340] uhci_hcd 0000:00:1d.1: new
USB bus registered, assigned bus number 3
Jan 22 07:26:59 my kernel: [    0.444364] uhci_hcd 0000:00:1d.1: irq
19, io base 0x0000bc00
Jan 22 07:26:59 my kernel: [    0.444387] usb usb3: New USB device
found, idVendor=1d6b, idProduct=0001
Jan 22 07:26:59 my kernel: [    0.444388] usb usb3: New USB device
strings: Mfr=3, Product=2, SerialNumber=1
Jan 22 07:26:59 my kernel: [    0.444390] usb usb3: Product: UHCI Host
Controller
Jan 22 07:26:59 my kernel: [    0.444391] usb usb3: Manufacturer:
Linux 2.6.32-5-amd64 uhci_hcd
Jan 22 07:26:59 my kernel: [    0.444392] usb usb3: SerialNumber: 0000:00:1d.1
Jan 22 07:26:59 my kernel: [    0.444429] usb usb3: configuration #1
chosen from 1 choice
Jan 22 07:26:59 my kernel: [    0.444447] hub 3-0:1.0: USB hub found
Jan 22 07:26:59 my kernel: [    0.444451] hub 3-0:1.0: 2 ports detected
Jan 22 07:26:59 my kernel: [    0.444485]   alloc irq_desc for 18 on node -1
Jan 22 07:26:59 my kernel: [    0.444486]   alloc kstat_irqs on node -1
Jan 22 07:26:59 my kernel: [    0.444489] uhci_hcd 0000:00:1d.2: PCI
INT C -> GSI 18 (level, low) -> IRQ 18
Jan 22 07:26:59 my kernel: [    0.444493] uhci_hcd 0000:00:1d.2:
setting latency timer to 64
Jan 22 07:26:59 my kernel: [    0.444495] uhci_hcd 0000:00:1d.2: UHCI
Host Controller
Jan 22 07:26:59 my kernel: [    0.444500] uhci_hcd 0000:00:1d.2: new
USB bus registered, assigned bus number 4
Jan 22 07:26:59 my kernel: [    0.444523] uhci_hcd 0000:00:1d.2: irq
18, io base 0x0000c000
Jan 22 07:26:59 my kernel: [    0.444546] usb usb4: New USB device
found, idVendor=1d6b, idProduct=0001
Jan 22 07:26:59 my kernel: [    0.444547] usb usb4: New USB device
strings: Mfr=3, Product=2, SerialNumber=1
Jan 22 07:26:59 my kernel: [    0.444549] usb usb4: Product: UHCI Host
Controller
Jan 22 07:26:59 my kernel: [    0.444550] usb usb4: Manufacturer:
Linux 2.6.32-5-amd64 uhci_hcd
Jan 22 07:26:59 my kernel: [    0.444551] usb usb4: SerialNumber: 0000:00:1d.2
Jan 22 07:26:59 my kernel: [    0.444586] usb usb4: configuration #1
chosen from 1 choice
Jan 22 07:26:59 my kernel: [    0.444606] hub 4-0:1.0: USB hub found
Jan 22 07:26:59 my kernel: [    0.444610] hub 4-0:1.0: 2 ports detected
Jan 22 07:26:59 my kernel: [    0.444641] uhci_hcd 0000:00:1d.3: PCI
INT D -> GSI 16 (level, low) -> IRQ 16
Jan 22 07:26:59 my kernel: [    0.444645] uhci_hcd 0000:00:1d.3:
setting latency timer to 64
Jan 22 07:26:59 my kernel: [    0.444648] uhci_hcd 0000:00:1d.3: UHCI
Host Controller
Jan 22 07:26:59 my kernel: [    0.444652] uhci_hcd 0000:00:1d.3: new
USB bus registered, assigned bus number 5
Jan 22 07:26:59 my kernel: [    0.444676] uhci_hcd 0000:00:1d.3: irq
16, io base 0x0000c080
Jan 22 07:26:59 my kernel: [    0.444699] usb usb5: New USB device
found, idVendor=1d6b, idProduct=0001
Jan 22 07:26:59 my kernel: [    0.444701] usb usb5: New USB device
strings: Mfr=3, Product=2, SerialNumber=1
Jan 22 07:26:59 my kernel: [    0.444702] usb usb5: Product: UHCI Host
Controller
Jan 22 07:26:59 my kernel: [    0.444704] usb usb5: Manufacturer:
Linux 2.6.32-5-amd64 uhci_hcd
Jan 22 07:26:59 my kernel: [    0.444705] usb usb5: SerialNumber: 0000:00:1d.3
Jan 22 07:26:59 my kernel: [    0.444747] usb usb5: configuration #1
chosen from 1 choice
Jan 22 07:26:59 my kernel: [    0.444765] hub 5-0:1.0: USB hub found
Jan 22 07:26:59 my kernel: [    0.444769] hub 5-0:1.0: 2 ports detected
Jan 22 07:26:59 my kernel: [    0.449570] Floppy drive(s): fd0 is 1.44M
Jan 22 07:26:59 my kernel: [    0.449771] libata version 3.00 loaded.
Jan 22 07:26:59 my kernel: [    0.451665] via-rhine.c:v1.10-LK1.4.3
2007-03-06 Written by Donald Becker
Jan 22 07:26:59 my kernel: [    0.451684]   alloc irq_desc for 22 on node -1
Jan 22 07:26:59 my kernel: [    0.451685]   alloc kstat_irqs on node -1
Jan 22 07:26:59 my kernel: [    0.451689] via-rhine 0000:01:02.0: PCI
INT A -> GSI 22 (level, low) -> IRQ 22
Jan 22 07:26:59 my kernel: [    0.456630] ata_piix 0000:00:1f.1: version 2.13
Jan 22 07:26:59 my kernel: [    0.456912] ata_piix 0000:00:1f.1: PCI
INT A -> GSI 18 (level, low) -> IRQ 18
Jan 22 07:26:59 my kernel: [    0.456937] ata_piix 0000:00:1f.1:
setting latency timer to 64
Jan 22 07:26:59 my kernel: [    0.457559] scsi0 : ata_piix
Jan 22 07:26:59 my kernel: [    0.457622] eth0: VIA Rhine III at
0x1d800, 00:11:95:86:8d:f8, IRQ 22.
Jan 22 07:26:59 my kernel: [    0.458339] eth0: MII PHY found at
address 1, status 0x786d advertising 05e1 Link 45e1.
Jan 22 07:26:59 my kernel: [    0.458343] scsi1 : ata_piix
Jan 22 07:26:59 my kernel: [    0.459329] ata1: PATA max UDMA/100 cmd
0x1f0 ctl 0x3f6 bmdma 0xffa0 irq 14
Jan 22 07:26:59 my kernel: [    0.459331] ata2: PATA max UDMA/100 cmd
0x170 ctl 0x376 bmdma 0xffa8 irq 15
Jan 22 07:26:59 my kernel: [    0.459348] ata_piix 0000:00:1f.2: PCI
INT B -> GSI 19 (level, low) -> IRQ 19
Jan 22 07:26:59 my kernel: [    0.459351] ata_piix 0000:00:1f.2: MAP [
P0 P2 P1 P3 ]
Jan 22 07:26:59 my kernel: [    0.459376] ata_piix 0000:00:1f.2:
setting latency timer to 64
Jan 22 07:26:59 my kernel: [    0.459416]   alloc irq_desc for 21 on node -1
Jan 22 07:26:59 my kernel: [    0.459418]   alloc kstat_irqs on node -1
Jan 22 07:26:59 my kernel: [    0.459422] firewire_ohci 0000:01:01.0:
PCI INT A -> GSI 21 (level, low) -> IRQ 21
Jan 22 07:26:59 my kernel: [    0.459499] scsi2 : ata_piix
Jan 22 07:26:59 my kernel: [    0.459561] scsi3 : ata_piix
Jan 22 07:26:59 my kernel: [    0.461184] ata3: SATA max UDMA/133 cmd
0xcc00 ctl 0xc880 bmdma 0xc400 irq 19
Jan 22 07:26:59 my kernel: [    0.461185] ata4: SATA max UDMA/133 cmd
0xc800 ctl 0xc480 bmdma 0xc408 irq 19
Jan 22 07:26:59 my kernel: [    0.471598] FDC 0 is a post-1991 82077
Jan 22 07:26:59 my kernel: [    0.535975] firewire_ohci: Added fw-ohci
device 0000:01:01.0, OHCI version 1.0
Jan 22 07:26:59 my kernel: [    0.628338] ata4.00: ATA-8:
ST32000542AS, CC34, max UDMA/133
Jan 22 07:26:59 my kernel: [    0.628341] ata4.00: 3907029168 sectors,
multi 16: LBA48 NCQ (depth 0/32)
Jan 22 07:26:59 my kernel: [    0.644250] ata4.00: configured for UDMA/133
Jan 22 07:26:59 my kernel: [    0.844252] ata3.00: ATA-7: INTEL
SSDSA2M040G2GC, 2CV102HD, max UDMA/133
Jan 22 07:26:59 my kernel: [    0.844255] ata3.00: 78165360 sectors,
multi 16: LBA48 NCQ (depth 0/32)
Jan 22 07:26:59 my kernel: [    0.844296] ata3.01: ATAPI: TSSTcorp
CDDVDW SH-S223C, SB04, max UDMA/100
Jan 22 07:26:59 my kernel: [    0.852166] ata3.00: configured for UDMA/133
Jan 22 07:26:59 my kernel: [    0.908202] ata3.01: configured for UDMA/100
Jan 22 07:26:59 my kernel: [    0.931374] scsi 2:0:0:0: Direct-Access
   ATA      INTEL SSDSA2M040 2CV1 PQ: 0 ANSI: 5
Jan 22 07:26:59 my kernel: [    0.932025] scsi 2:0:1:0: CD-ROM
   TSSTcorp CDDVDW SH-S223C  SB04 PQ: 0 ANSI: 5
Jan 22 07:26:59 my kernel: [    0.932170] scsi 3:0:0:0: Direct-Access
   ATA      ST32000542AS     CC34 PQ: 0 ANSI: 5
Jan 22 07:26:59 my kernel: [    0.936896] sd 2:0:0:0: [sda] 78165360
512-byte logical blocks: (40.0 GB/37.2 GiB)
Jan 22 07:26:59 my kernel: [    0.936928] sd 3:0:0:0: [sdb] 3907029168
512-byte logical blocks: (2.00 TB/1.81 TiB)
Jan 22 07:26:59 my kernel: [    0.936936] sd 2:0:0:0: [sda] Write Protect is off
Jan 22 07:26:59 my kernel: [    0.936938] sd 2:0:0:0: [sda] Mode
Sense: 00 3a 00 00
Jan 22 07:26:59 my kernel: [    0.936955] sd 2:0:0:0: [sda] Write
cache: enabled, read cache: enabled, doesn't support DPO or FUA
Jan 22 07:26:59 my kernel: [    0.936965] sd 3:0:0:0: [sdb] Write Protect is off
Jan 22 07:26:59 my kernel: [    0.936967] sd 3:0:0:0: [sdb] Mode
Sense: 00 3a 00 00
Jan 22 07:26:59 my kernel: [    0.936982] sd 3:0:0:0: [sdb] Write
cache: enabled, read cache: enabled, doesn't support DPO or FUA
Jan 22 07:26:59 my kernel: [    0.937065]  sda:
Jan 22 07:26:59 my kernel: [    0.937146]  sdb: sda1 sda2 sda3 sda4
Jan 22 07:26:59 my kernel: [    0.937822] sd 2:0:0:0: [sda] Attached SCSI disk
Jan 22 07:26:59 my kernel: [    0.964915]  sdb1 sdb2 sdb3
Jan 22 07:26:59 my kernel: [    0.965125] sd 3:0:0:0: [sdb] Attached SCSI disk
Jan 22 07:26:59 my kernel: [    1.034615] sr0: scsi3-mmc drive:
52x/52x writer dvd-ram cd/rw xa/form2 cdda tray
Jan 22 07:26:59 my kernel: [    1.034618] Uniform CD-ROM driver Revision: 3.20
Jan 22 07:26:59 my kernel: [    1.034693] sr 2:0:1:0: Attached scsi CD-ROM sr0
Jan 22 07:26:59 my kernel: [    1.037201] sd 2:0:0:0: Attached scsi
generic sg0 type 0
Jan 22 07:26:59 my kernel: [    1.037275] sr 2:0:1:0: Attached scsi
generic sg1 type 5
Jan 22 07:26:59 my kernel: [    1.037314] sd 3:0:0:0: Attached scsi
generic sg2 type 0
Jan 22 07:26:59 my kernel: [    1.039983] firewire_core: created
device fw0: GUID 0011060001112679, S400
Jan 22 07:26:59 my kernel: [    1.347010] PM: Starting manual resume from disk
Jan 22 07:26:59 my kernel: [    1.347013] PM: Resume from partition 8:2
Jan 22 07:26:59 my kernel: [    1.347014] PM: Checking hibernation image.
Jan 22 07:26:59 my kernel: [    1.347244] PM: Error -22 checking image file
Jan 22 07:26:59 my kernel: [    1.347246] PM: Resume from disk failed.
Jan 22 07:26:59 my kernel: [    1.361769] EXT4-fs (sda3): mounted
filesystem with ordered data mode
Jan 22 07:26:59 my kernel: [    1.505412] udev[360]: starting version 164
Jan 22 07:26:59 my kernel: [    1.555655] ACPI: SSDT 000000007ff9e0b0
001D2 (v01    AMI   CPU1PM 00000001 INTL 20060113)
Jan 22 07:26:59 my kernel: [    1.555852] processor LNXCPU:00:
registered as cooling_device0
Jan 22 07:26:59 my kernel: [    1.556183] ACPI: SSDT 000000007ff9e290
00143 (v01    AMI   CPU2PM 00000001 INTL 20060113)
Jan 22 07:26:59 my kernel: [    1.556330] processor LNXCPU:01:
registered as cooling_device1
Jan 22 07:26:59 my kernel: [    1.559003] input: Power Button as
/devices/LNXSYSTM:00/LNXSYBUS:00/PNP0C0C:00/input/input2
Jan 22 07:26:59 my kernel: [    1.559011] ACPI: Power Button [PWRB]
Jan 22 07:26:59 my kernel: [    1.559057] input: Power Button as
/devices/LNXSYSTM:00/LNXPWRBN:00/input/input3
Jan 22 07:26:59 my kernel: [    1.559059] ACPI: Power Button [PWRF]
Jan 22 07:26:59 my kernel: [    1.568373] input: PC Speaker as
/devices/platform/pcspkr/input/input4
Jan 22 07:26:59 my kernel: [    1.598983] intel_rng: FWH not detected
Jan 22 07:26:59 my kernel: [    2.026042] nvidia: module license
'NVIDIA' taints kernel.
Jan 22 07:26:59 my kernel: [    2.026046] Disabling lock debugging due
to kernel taint
Jan 22 07:26:59 my kernel: [    2.239933] input: ImPS/2 Generic Wheel
Mouse as /devices/platform/i8042/serio1/input/input5
Jan 22 07:26:59 my kernel: [    2.247908] i801_smbus 0000:00:1f.3: PCI
INT B -> GSI 19 (level, low) -> IRQ 19
Jan 22 07:26:59 my kernel: [    2.250796] parport_pc 00:07: reported
by Plug and Play ACPI
Jan 22 07:26:59 my kernel: [    2.250890] parport0: PC-style at 0x378
(0x778), irq 7 [PCSPP,TRISTATE,EPP]
Jan 22 07:26:59 my kernel: [    2.276519] parport0: Printer,
Hewlett-Packard HP LaserJet 1100
Jan 22 07:26:59 my kernel: [    2.495774] nvidia 0000:04:00.0: PCI INT
A -> GSI 16 (level, low) -> IRQ 16
Jan 22 07:26:59 my kernel: [    2.495780] nvidia 0000:04:00.0: setting
latency timer to 64
Jan 22 07:26:59 my kernel: [    2.495784] vgaarb: device changed
decodes: PCI:0000:04:00.0,olddecodes=io+mem,decodes=none:owns=io+mem
Jan 22 07:26:59 my kernel: [    2.495862] NVRM: loading NVIDIA UNIX
x86_64 Kernel Module  195.36.31  Thu Jun  3 08:19:50 PDT 2010
Jan 22 07:26:59 my kernel: [    2.549543] HDA Intel 0000:00:1b.0: PCI
INT A -> GSI 16 (level, low) -> IRQ 16
Jan 22 07:26:59 my kernel: [    2.549567] HDA Intel 0000:00:1b.0:
setting latency timer to 64
Jan 22 07:26:59 my kernel: [    2.619123] input: HDA Digital PCBeep as
/devices/pci0000:00/0000:00:1b.0/input/input6
Jan 22 07:26:59 my kernel: [    2.709931] Adding 249848k swap on
/dev/sda2.  Priority:-1 extents:1 across:249848k SS
Jan 22 07:26:59 my kernel: [    2.784474] loop: module loaded
Jan 22 07:26:59 my kernel: [    3.118288] EXT4-fs (sda4): mounted
filesystem with ordered data mode
Jan 22 07:26:59 my kernel: [    3.174861] EXT4-fs (sdb1): mounted
filesystem with ordered data mode
Jan 22 07:26:59 my kernel: [    3.212696] EXT4-fs (sdb2): mounted
filesystem with ordered data mode
Jan 22 07:26:59 my kernel: [    3.331711] ip_tables: (C) 2000-2006
Netfilter Core Team
Jan 22 07:26:59 my kernel: [    3.341759] nf_conntrack version 0.5.0
(16384 buckets, 65536 max)
Jan 22 07:26:59 my kernel: [    3.341943] CONFIG_NF_CT_ACCT is
deprecated and will be removed soon. Please use
Jan 22 07:26:59 my kernel: [    3.341945] nf_conntrack.acct=1 kernel
parameter, acct=1 nf_conntrack module option or
Jan 22 07:26:59 my kernel: [    3.341946] sysctl
net.netfilter.nf_conntrack_acct=1 to enable it.
Jan 22 07:26:59 my kernel: [    3.365328] ip6_tables: (C) 2000-2006
Netfilter Core Team
Jan 22 07:26:59 my kernel: [    8.449954] fuse init (API version 7.13)
Jan 22 07:27:00 my kernel: [    9.362940]   alloc irq_desc for 27 on node -1
Jan 22 07:27:00 my kernel: [    9.362942]   alloc kstat_irqs on node -1
Jan 22 07:27:00 my kernel: [    9.362957] atl1 0000:02:00.0: irq 27
for MSI/MSI-X
Jan 22 07:27:00 my kernel: [    9.363242] ADDRCONF(NETDEV_UP): eth1:
link is not ready
Jan 22 07:27:00 my kernel: [    9.365260] eth0: link up, 100Mbps,
full-duplex, lpa 0x45E1
Jan 22 07:27:01 my kernel: [    9.861987] lp0: using parport0
(interrupt-driven).
Jan 22 07:27:01 my kernel: [    9.864872] ppdev: user-space parallel port driver
Jan 22 07:27:11 my kernel: [   19.896007] eth0: no IPv6 routers present
Jan 22 07:28:33 my kernel: [  102.645811] CPU0 attaching NULL sched-domain.
Jan 22 07:28:33 my kernel: [  102.645816] CPU1 attaching NULL sched-domain.
Jan 22 07:28:33 my kernel: [  102.668591] CPU0 attaching sched-domain:
Jan 22 07:28:33 my kernel: [  102.668594]  domain 0: span 0-1 level MC
Jan 22 07:28:33 my kernel: [  102.668596]   groups: 0 1
Jan 22 07:28:33 my kernel: [  102.668601] CPU1 attaching sched-domain:
Jan 22 07:28:33 my kernel: [  102.668603]  domain 0: span 0-1 level MC
Jan 22 07:28:33 my kernel: [  102.668606]   groups: 1 0
Jan 22 15:07:58 my kernel: [27667.804537] ata4: lost interrupt (Status 0x50)
Jan 22 15:07:58 my kernel: [27667.804557] ata4.00: exception Emask 0x0
SAct 0x0 SErr 0x0 action 0x6 frozen
Jan 22 15:07:58 my kernel: [27667.804562] ata4.00: failed command: READ DMA EXT
Jan 22 15:07:58 my kernel: [27667.804568] ata4.00: cmd
25/00:38:cf:12:8a/00:00:4d:00:00/e0 tag 0 dma 28672 in
Jan 22 15:07:58 my kernel: [27667.804569]          res
40/00:f0:00:00:00/00:00:00:00:00/40 Emask 0x4 (timeout)
Jan 22 15:07:58 my kernel: [27667.804572] ata4.00: status: { DRDY }
Jan 22 15:07:58 my kernel: [27667.804582] ata4: soft resetting link
Jan 22 15:07:59 my kernel: [27667.987633] ata4.00: configured for UDMA/133
Jan 22 15:07:59 my kernel: [27667.987648] ata4.00: device reported
invalid CHS sector 0
Jan 22 15:07:59 my kernel: [27667.987680] ata4: EH complete
Jan 22 15:08:29 my kernel: [27698.804569] ata4: lost interrupt (Status 0x50)
Jan 22 15:08:29 my kernel: [27698.804592] ata4.00: exception Emask 0x0
SAct 0x0 SErr 0x0 action 0x6 frozen
Jan 22 15:08:29 my kernel: [27698.804597] ata4.00: failed command: READ DMA
Jan 22 15:08:29 my kernel: [27698.804602] ata4.00: cmd
c8/00:80:7f:74:08/00:00:00:00:00/e0 tag 0 dma 65536 in
Jan 22 15:08:29 my kernel: [27698.804603]          res
40/00:f0:00:00:00/00:00:00:00:00/40 Emask 0x4 (timeout)
Jan 22 15:08:29 my kernel: [27698.804605] ata4.00: status: { DRDY }
Jan 22 15:08:29 my kernel: [27698.804616] ata4: soft resetting link
Jan 22 15:08:30 my kernel: [27698.988495] ata4.00: configured for UDMA/133
Jan 22 15:08:30 my kernel: [27698.988505] ata4.00: device reported
invalid CHS sector 0
Jan 22 15:08:30 my kernel: [27698.988530] ata4: EH complete
Jan 22 15:09:00 my kernel: [27729.804523] ata4: lost interrupt (Status 0x50)
Jan 22 15:09:00 my kernel: [27729.804542] ata4.00: exception Emask 0x0
SAct 0x0 SErr 0x0 action 0x6 frozen
Jan 22 15:09:00 my kernel: [27729.804547] ata4.00: failed command: READ DMA
Jan 22 15:09:00 my kernel: [27729.804553] ata4.00: cmd
c8/00:40:3f:21:04/00:00:00:00:00/e0 tag 0 dma 32768 in
Jan 22 15:09:00 my kernel: [27729.804554]          res
40/00:f0:00:00:00/00:00:00:00:00/40 Emask 0x4 (timeout)
Jan 22 15:09:00 my kernel: [27729.804557] ata4.00: status: { DRDY }
Jan 22 15:09:00 my kernel: [27729.804567] ata4: soft resetting link
Jan 22 15:09:01 my kernel: [27729.985869] ata4.00: configured for UDMA/133
Jan 22 15:09:01 my kernel: [27729.985876] ata4.00: device reported
invalid CHS sector 0
Jan 22 15:09:01 my kernel: [27729.985889] ata4: EH complete
Jan 22 15:09:31 my kernel: [27760.816048] ata4: lost interrupt (Status 0x50)
Jan 22 15:09:31 my kernel: [27760.816069] ata4.00: limiting speed to
UDMA/100:PIO4
Jan 22 15:09:31 my kernel: [27760.816073] ata4.00: exception Emask 0x0
SAct 0x0 SErr 0x0 action 0x6 frozen
Jan 22 15:09:31 my kernel: [27760.816077] ata4.00: failed command: READ DMA
Jan 22 15:09:31 my kernel: [27760.816083] ata4.00: cmd
c8/00:48:7f:21:04/00:00:00:00:00/e0 tag 0 dma 36864 in
Jan 22 15:09:31 my kernel: [27760.816085]          res
40/00:f0:00:00:00/00:00:00:00:00/40 Emask 0x4 (timeout)
Jan 22 15:09:31 my kernel: [27760.816087] ata4.00: status: { DRDY }
Jan 22 15:09:31 my kernel: [27760.816098] ata4: soft resetting link
Jan 22 15:09:32 my kernel: [27760.996711] ata4.00: configured for UDMA/100
Jan 22 15:09:32 my kernel: [27760.996719] ata4.00: device reported
invalid CHS sector 0
Jan 22 15:09:32 my kernel: [27760.996740] ata4: EH complete
Jan 22 15:10:02 my kernel: [27791.804530] ata4: lost interrupt (Status 0x50)
Jan 22 15:10:02 my kernel: [27791.804553] ata4.00: exception Emask 0x0
SAct 0x0 SErr 0x0 action 0x6 frozen
Jan 22 15:10:02 my kernel: [27791.804560] ata4.00: failed command: READ DMA
Jan 22 15:10:02 my kernel: [27791.804566] ata4.00: cmd
c8/00:50:c7:7c:08/00:00:00:00:00/e0 tag 0 dma 40960 in
Jan 22 15:10:02 my kernel: [27791.804567]          res
40/00:f0:00:00:00/00:00:00:00:00/40 Emask 0x4 (timeout)
Jan 22 15:10:02 my kernel: [27791.804570] ata4.00: status: { DRDY }
Jan 22 15:10:02 my kernel: [27791.804582] ata4: soft resetting link
Jan 22 15:10:03 my kernel: [27791.988297] ata4.00: configured for UDMA/100
Jan 22 15:10:03 my kernel: [27791.988308] ata4.00: device reported
invalid CHS sector 0
Jan 22 15:10:03 my kernel: [27791.988332] ata4: EH complete
Jan 22 15:10:33 my kernel: [27822.804521] ata4: lost interrupt (Status 0x50)
Jan 22 15:10:33 my kernel: [27822.804539] ata4.00: exception Emask 0x0
SAct 0x0 SErr 0x0 action 0x6 frozen
Jan 22 15:10:33 my kernel: [27822.804544] ata4.00: failed command: READ DMA EXT
Jan 22 15:10:33 my kernel: [27822.804550] ata4.00: cmd
25/00:38:7f:20:8a/00:00:4d:00:00/e0 tag 0 dma 28672 in
Jan 22 15:10:33 my kernel: [27822.804551]          res
40/00:f0:00:00:00/00:00:00:00:00/40 Emask 0x4 (timeout)
Jan 22 15:10:33 my kernel: [27822.804554] ata4.00: status: { DRDY }
Jan 22 15:10:33 my kernel: [27822.804565] ata4: soft resetting link
Jan 22 15:10:34 my kernel: [27822.989794] ata4.00: configured for UDMA/100
Jan 22 15:10:34 my kernel: [27822.989800] ata4.00: device reported
invalid CHS sector 0
Jan 22 15:10:34 my kernel: [27822.989814] ata4: EH complete
Jan 22 15:11:04 my kernel: [27853.820104] ata4: lost interrupt (Status 0x50)
Jan 22 15:11:04 my kernel: [27853.820132] ata4.00: exception Emask 0x0
SAct 0x0 SErr 0x0 action 0x6 frozen
Jan 22 15:11:04 my kernel: [27853.820137] ata4.00: failed command: READ DMA
Jan 22 15:11:04 my kernel: [27853.820142] ata4.00: cmd
c8/00:40:3f:28:04/00:00:00:00:00/e0 tag 0 dma 32768 in
Jan 22 15:11:04 my kernel: [27853.820143]          res
40/00:f0:00:00:00/00:00:00:00:00/40 Emask 0x4 (timeout)
Jan 22 15:11:04 my kernel: [27853.820145] ata4.00: status: { DRDY }
Jan 22 15:11:04 my kernel: [27853.820156] ata4: soft resetting link
Jan 22 15:11:05 my kernel: [27854.004474] ata4.00: configured for UDMA/100
Jan 22 15:11:05 my kernel: [27854.004482] ata4.00: device reported
invalid CHS sector 0
Jan 22 15:11:05 my kernel: [27854.004503] ata4: EH complete
Jan 22 15:11:35 my kernel: [27884.804540] ata4: lost interrupt (Status 0x50)
Jan 22 15:11:35 my kernel: [27884.804566] ata4.00: limiting speed to
UDMA/33:PIO4
Jan 22 15:11:35 my kernel: [27884.804570] ata4.00: exception Emask 0x0
SAct 0x0 SErr 0x0 action 0x6 frozen
Jan 22 15:11:35 my kernel: [27884.804576] ata4.00: failed command: READ DMA
Jan 22 15:11:35 my kernel: [27884.804582] ata4.00: cmd
c8/00:80:7f:28:04/00:00:00:00:00/e0 tag 0 dma 65536 in
Jan 22 15:11:35 my kernel: [27884.804583]          res
40/00:f0:00:00:00/00:00:00:00:00/40 Emask 0x4 (timeout)
Jan 22 15:11:35 my kernel: [27884.804586] ata4.00: status: { DRDY }
Jan 22 15:11:35 my kernel: [27884.804598] ata4: soft resetting link
Jan 22 15:11:36 my kernel: [27884.984946] ata4.00: configured for UDMA/33
Jan 22 15:11:36 my kernel: [27884.984953] ata4.00: device reported
invalid CHS sector 0
Jan 22 15:11:36 my kernel: [27884.984971] ata4: EH complete
Jan 22 15:12:06 my kernel: [27915.804529] ata4: lost interrupt (Status 0x50)
Jan 22 15:12:06 my kernel: [27915.804548] ata4.00: exception Emask 0x0
SAct 0x0 SErr 0x0 action 0x6 frozen
Jan 22 15:12:06 my kernel: [27915.804554] ata4.00: failed command: READ DMA
Jan 22 15:12:06 my kernel: [27915.804559] ata4.00: cmd
c8/00:80:7f:28:04/00:00:00:00:00/e0 tag 0 dma 65536 in
Jan 22 15:12:06 my kernel: [27915.804561]          res
40/00:f0:00:00:00/00:00:00:00:00/40 Emask 0x4 (timeout)
Jan 22 15:12:06 my kernel: [27915.804564] ata4.00: status: { DRDY }
Jan 22 15:12:06 my kernel: [27915.804575] ata4: soft resetting link
Jan 22 15:12:07 my kernel: [27915.984902] ata4.00: configured for UDMA/33
Jan 22 15:12:07 my kernel: [27915.984909] ata4.00: device reported
invalid CHS sector 0
Jan 22 15:12:07 my kernel: [27915.984927] ata4: EH complete
Jan 22 15:12:52 my kernel: [27961.780543] cupsd[1898]: segfault at
3cf3b ip 000000000003cf3b sp 00007fff28af19d8 error 14 in
libnss_files-2.11.2.so[7ffb50975000+b000]
Jan 22 15:13:45 my kernel: [28014.828243] [UFW BLOCK] IN=eth0 OUT=
MAC=00:11:92:86:8d:f8:00:1c:f0:4c:c2:b9:08:00 SRC=92.33.33.179
DST=192.168.0.198 LEN=40 TOS=0x00 PREC=0x00 TTL=58 ID=0 DF PROTO=TCP
SPT=80 DPT=39734 WINDOW=0 RES=0x00 RST URGP=0
Jan 22 15:13:46 my kernel: [28014.868831] [UFW BLOCK] IN=eth0 OUT=
MAC=00:11:92:86:8d:f8:00:1c:f0:4c:c2:b9:08:00 SRC=195.54.111.91
DST=192.168.0.198 LEN=40 TOS=0x00 PREC=0x00 TTL=59 ID=0 DF PROTO=TCP
SPT=80 DPT=36214 WINDOW=0 RES=0x00 RST URGP=0
Jan 22 15:13:46 my kernel: [28014.881609] [UFW BLOCK] IN=eth0 OUT=
MAC=00:11:92:86:8d:f8:00:1c:f0:4c:c2:b9:08:00 SRC=74.125.79.99
DST=192.168.0.198 LEN=40 TOS=0x00 PREC=0x00 TTL=55 ID=50371 PROTO=TCP
SPT=80 DPT=37642 WINDOW=0 RES=0x00 RST URGP=0
Jan 22 15:13:46 my kernel: [28014.892944] [UFW BLOCK] IN=eth0 OUT=
MAC=00:11:92:86:8d:f8:00:1c:f0:4c:c2:b9:08:00 SRC=74.125.79.99
DST=192.168.0.198 LEN=40 TOS=0x00 PREC=0x00 TTL=55 ID=37335 PROTO=TCP
SPT=80 DPT=37625 WINDOW=0 RES=0x00 RST URGP=0
Jan 22 15:13:46 my kernel: [28014.897322] [UFW BLOCK] IN=eth0 OUT=
MAC=00:11:92:86:8d:f8:00:1c:f0:4c:c2:b9:08:00 SRC=74.125.79.99
DST=192.168.0.198 LEN=40 TOS=0x00 PREC=0x00 TTL=55 ID=34042 PROTO=TCP
SPT=80 DPT=37626 WINDOW=0 RES=0x00 RST URGP=0
Jan 22 15:13:46 my kernel: [28014.925305] [UFW BLOCK] IN=eth0 OUT=
MAC=00:11:92:86:8d:f8:00:1c:f0:4c:c2:b9:08:00 SRC=74.125.79.118
DST=192.168.0.198 LEN=40 TOS=0x00 PREC=0x00 TTL=55 ID=23674 PROTO=TCP
SPT=80 DPT=36969 WINDOW=0 RES=0x00 RST URGP=0
Jan 22 15:13:46 my kernel: [28014.933365] [UFW BLOCK] IN=eth0 OUT=
MAC=00:11:92:86:8d:f8:00:1c:f0:4c:c2:b9:08:00 SRC=74.125.79.99
DST=192.168.0.198 LEN=40 TOS=0x00 PREC=0x00 TTL=55 ID=39469 PROTO=TCP
SPT=80 DPT=37640 WINDOW=0 RES=0x00 RST URGP=0
Jan 22 15:13:46 my kernel: [28014.934029] [UFW BLOCK] IN=eth0 OUT=
MAC=00:11:92:86:8d:f8:00:1c:f0:4c:c2:b9:08:00 SRC=74.125.79.99
DST=192.168.0.198 LEN=40 TOS=0x00 PREC=0x00 TTL=55 ID=34532 PROTO=TCP
SPT=80 DPT=37641 WINDOW=0 RES=0x00 RST URGP=0
Jan 22 15:13:46 my kernel: [28014.941446] [UFW BLOCK] IN=eth0 OUT=
MAC=00:11:92:86:8d:f8:00:1c:f0:4c:c2:b9:08:00 SRC=74.125.79.99
DST=192.168.0.198 LEN=40 TOS=0x00 PREC=0x00 TTL=55 ID=54104 PROTO=TCP
SPT=80 DPT=37639 WINDOW=0 RES=0x00 RST URGP=0
Jan 22 15:13:46 my kernel: [28015.284273] [UFW BLOCK] IN=eth0 OUT=
MAC=00:11:92:86:8d:f8:00:1c:f0:4c:c2:b9:08:00 SRC=92.33.33.179
DST=192.168.0.198 LEN=40 TOS=0x00 PREC=0x00 TTL=58 ID=0 DF PROTO=TCP
SPT=80 DPT=39734 WINDOW=0 RES=0x00 RST URGP=0
Jan 22 15:14:14 my kernel: [28043.604663] [UFW BLOCK] IN=eth0 OUT=
MAC=00:11:92:86:8d:f8:00:1c:f0:4c:c2:b9:08:00 SRC=195.54.111.91
DST=192.168.0.198 LEN=40 TOS=0x00 PREC=0x00 TTL=59 ID=0 DF PROTO=TCP
SPT=80 DPT=36214 WINDOW=0 RES=0x00 RST URGP=0
Jan 22 15:14:43 my kernel: [28072.785223] [UFW BLOCK] IN=eth0 OUT=
MAC=00:11:92:86:8d:f8:00:1c:f0:4c:c2:b9:08:00 SRC=195.54.111.91
DST=192.168.0.198 LEN=40 TOS=0x00 PREC=0x00 TTL=59 ID=0 DF PROTO=TCP
SPT=80 DPT=36214 WINDOW=0 RES=0x00 RST URGP=0
Jan 22 15:14:51 my kernel: [28079.949488] [UFW BLOCK] IN=eth0 OUT=
MAC=00:11:92:86:8d:f8:00:1c:f0:4c:c2:b9:08:00 SRC=74.125.79.118
DST=192.168.0.198 LEN=40 TOS=0x00 PREC=0x00 TTL=55 ID=23681 PROTO=TCP
SPT=80 DPT=36969 WINDOW=0 RES=0x00 RST URGP=0
Jan 22 16:56:43 my kernel: [34192.816023] ata4: lost interrupt (Status 0x50)
Jan 22 16:56:43 my kernel: [34192.816040] ata4.00: exception Emask 0x0
SAct 0x0 SErr 0x0 action 0x6 frozen
Jan 22 16:56:43 my kernel: [34192.816044] ata4.00: failed command: WRITE DMA EXT
Jan 22 16:56:43 my kernel: [34192.816048] ata4.00: cmd
35/00:08:bf:00:80/00:00:4d:00:00/e0 tag 0 dma 4096 out
Jan 22 16:56:43 my kernel: [34192.816049]          res
40/00:00:00:4f:c2/00:00:00:00:00/40 Emask 0x4 (timeout)
Jan 22 16:56:43 my kernel: [34192.816051] ata4.00: status: { DRDY }
Jan 22 16:56:43 my kernel: [34192.816060] ata4: soft resetting link
Jan 22 16:56:44 my kernel: [34192.996387] ata4.00: configured for UDMA/33
Jan 22 16:56:44 my kernel: [34192.996392] ata4.00: device reported
invalid CHS sector 0
Jan 22 16:56:44 my kernel: [34192.996402] ata4: EH complete
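
The "cmd" fields in the log above can be decoded to see which sectors each timed-out command was addressing. The following is only a minimal Python sketch: it assumes the field order libata prints (command/feature:nsect:lbal:lbam:lbah/hob_feature:hob_nsect:hob_lbal:hob_lbam:hob_lbah/device) and 512-byte logical sectors, and decode_cmd is a made-up helper name. Under those assumptions the computed byte counts should match the "dma NNNN in/out" sizes in the log.

```python
# Sketch: decode the "cmd" taskfile field from a libata error report.
# Assumed field order (not taken from this thread):
#   command/feature:nsect:lbal:lbam:lbah/
#   hob_feature:hob_nsect:hob_lbal:hob_lbam:hob_lbah/device
# decode_cmd is a hypothetical helper; 512-byte sectors are assumed.

SECTOR_BYTES = 512
EXT_COMMANDS = {0x25, 0x35}  # READ DMA EXT, WRITE DMA EXT (48-bit LBA)


def decode_cmd(field: str) -> dict:
    parts = field.split("/")
    command = int(parts[0], 16)
    _feature, nsect, lbal, lbam, lbah = (int(x, 16) for x in parts[1].split(":"))
    _hob_feature, hob_nsect, hob_lbal, hob_lbam, hob_lbah = (
        int(x, 16) for x in parts[2].split(":")
    )
    device = int(parts[3], 16)

    if command in EXT_COMMANDS:
        # 48-bit LBA: the "hob" (high order byte) registers carry bits 24..47.
        lba = (hob_lbah << 40) | (hob_lbam << 32) | (hob_lbal << 24)
        lba |= (lbah << 16) | (lbam << 8) | lbal
        sectors = ((hob_nsect << 8) | nsect) or 65536
    else:
        # 28-bit LBA: bits 24..27 live in the low nibble of the device register.
        lba = ((device & 0x0F) << 24) | (lbah << 16) | (lbam << 8) | lbal
        sectors = nsect or 256

    return {"command": command, "lba": lba, "sectors": sectors,
            "bytes": sectors * SECTOR_BYTES}


for field in ("c8/00:80:7f:74:08/00:00:00:00:00/e0",
              "25/00:38:7f:20:8a/00:00:4d:00:00/e0"):
    d = decode_cmd(field)
    print(field, "-> LBA", hex(d["lba"]), "bytes", d["bytes"])
```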

* Re: Seagate hard disk firmware issue
  2011-01-23 11:05 Seagate hard disk firmware issue BU66ER BAD6ER
@ 2011-01-23 13:42 ` Alan Cox
  2011-01-25  1:21 ` Robert Hancock
  1 sibling, 0 replies; 10+ messages in thread
From: Alan Cox @ 2011-01-23 13:42 UTC (permalink / raw)
  To: BU66ER BAD6ER; +Cc: linux-ide

> >   After command completion occurred, registers were:
> >   ER ST SC SN CL CH DH
> >   -- -- -- -- -- -- --
> >   40 51 00 58 e9 a8 00  Error: UNC at LBA = 0x00a8e958 = 11069784

Uncorrectable media error (bad sectors, etc.).

I would suggest you instead talk to the drive manufacturer or whoever is
responsible for the warranty.
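
As a rough illustration of how that LBA falls out of the quoted register dump, here is a minimal Python sketch. It assumes 28-bit LBA addressing (SN, CL, CH, plus the low nibble of DH) and treats bit 0x40 of ER as UNC and bit 0x01 of ST as ERR; the function names are invented for this example and are not part of smartmontools.

```python
# Sketch: rebuild the 28-bit LBA from the ER ST SC SN CL CH DH dump that
# smartctl prints for each logged error. Helper names are hypothetical.

def lba28_from_registers(sn: int, cl: int, ch: int, dh: int) -> int:
    """Combine SN, CL, CH and the low nibble of DH into a 28-bit LBA."""
    return ((dh & 0x0F) << 24) | (ch << 16) | (cl << 8) | sn


def decode_error_line(line: str) -> dict:
    """Parse a line that starts with seven hex bytes: ER ST SC SN CL CH DH."""
    er, st, _sc, sn, cl, ch, dh = (int(tok, 16) for tok in line.split()[:7])
    return {
        "uncorrectable": bool(er & 0x40),  # UNC bit in the error register
        "error_flag": bool(st & 0x01),     # ERR bit in the status register
        "lba": lba28_from_registers(sn, cl, ch, dh),
    }


info = decode_error_line("40 51 00 58 e9 a8 00")
print(hex(info["lba"]), info["lba"], info["uncorrectable"])
# prints: 0xa8e958 11069784 True
```

The result agrees with the "LBA = 0x00a8e958 = 11069784" that smartctl reports for this entry.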

* Re: Seagate hard disk firmware issue
  2011-01-23 11:05 Seagate hard disk firmware issue BU66ER BAD6ER
  2011-01-23 13:42 ` Alan Cox
@ 2011-01-25  1:21 ` Robert Hancock
  2011-01-25  6:06   ` BU66ER BAD6ER
  1 sibling, 1 reply; 10+ messages in thread
From: Robert Hancock @ 2011-01-25  1:21 UTC (permalink / raw)
  To: BU66ER BAD6ER; +Cc: linux-ide

On 01/23/2011 05:05 AM, BU66ER BAD6ER wrote:
> Hi,
>
> Four weeks ago I bought a new 2TB Seagate Barracuda internal SATA
> drive. That drive has two 667GB ext4 partions (667GB unused) and it is
> used for storage. My main system (Debian Sid 64-bit and KDE) resides
> on a 40GB SSD, also using ext4.
>
> Two weeks ago I noticed a severe performance drop, where any file
> manager couldn't view directories on the 2TB disk without a one or two
> minute penalty. After that I have had three hard freezes of that disk
> and the entire system. Before the freeze there is very much hd
> activity and finally I need to turn the power off. I have now also
> made a backup of /dev/sdb1 should it be fatally serious.
>
> I was recommended by someone at the #debian irc to make changes to the
> spindown_time but that only helped for a few days. Yesterday, the 3rd
> freeze came and the system wouldn't even recognize the disk after
> reboot; just 'clicking' waiting for a response. I showed the kern.log
> to someone at the same channel who concluded that this should be a
> firmware issue.
>
> Here is the latest kern.log which may identify the issue: I hope it
> contains the relevant details. But first the output of smartctl -a
> /dev/sdb.
>
> I have now set the hdparm spindown_time to 0, disabling disk sleep
> which seems to have been the culprit as judged on messages in the
> Dolphin file manager etc.
>
> Thanks for any help!
>
>
>> # smartctl -a /dev/sdb
>> smartctl 5.40 2010-07-12 r3124 [x86_64-unknown-linux-gnu] (local build)
>> Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net
>>
>> === START OF INFORMATION SECTION ===
>> Model Family:     Seagate Barracuda LP
>> Device Model:     ST32000542AS
>> Serial Number:    5XW20H7P
>> Firmware Version: CC34
>> User Capacity:    2,000,398,934,016 bytes
>> Device is:        In smartctl database [for details use: -P show]
>> ATA Version is:   8
>> ATA Standard is:  ATA-8-ACS revision 4
>> Local Time is:    Sun Jan 23 11:50:56 2011 CET
>> SMART support is: Available - device has SMART capability.
>> SMART support is: Enabled
>>
>> === START OF READ SMART DATA SECTION ===
>> SMART overall-health self-assessment test result: PASSED
>>
>> General SMART Values:
>> Offline data collection status:  (0x00) Offline data collection activity
>>                                          was never started.
>>                                          Auto Offline Data Collection: Disabled.
>> Self-test execution status:      (   0) The previous self-test routine completed
>>                                          without error or no self-test has ever
>>                                          been run.
>> Total time to complete Offline
>> data collection:                 ( 633) seconds.
>> Offline data collection
>> capabilities:                    (0x73) SMART execute Offline immediate.
>>                                          Auto Offline data collection on/off support.
>>                                          Suspend Offline collection upon new
>>                                          command.
>>                                          No Offline surface scan supported.
>>                                          Self-test supported.
>>                                          Conveyance Self-test supported.
>>                                          Selective Self-test supported.
>> SMART capabilities:            (0x0003) Saves SMART data before entering
>>                                          power-saving mode.
>>                                          Supports SMART auto save timer.
>> Error logging capability:        (0x01) Error logging supported.
>>                                          General Purpose Logging supported.
>> Short self-test routine
>> recommended polling time:        (   1) minutes.
>> Extended self-test routine
>> recommended polling time:        ( 255) minutes.
>> Conveyance self-test routine
>> recommended polling time:        (   2) minutes.
>> SCT capabilities:              (0x103f) SCT Status supported.
>>                                          SCT Error Recovery Control supported.
>>                                          SCT Feature Control supported.
>>                                          SCT Data Table supported.
>>
>> SMART Attributes Data Structure revision number: 10
>> Vendor Specific SMART Attributes with Thresholds:
>> ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
>>    1 Raw_Read_Error_Rate     0x000f   100   089   006    Pre-fail  Always       -       184733939
>>    3 Spin_Up_Time            0x0003   100   100   000    Pre-fail  Always       -       0
>>    4 Start_Stop_Count        0x0032   099   099   020    Old_age   Always       -       1160
>>    5 Reallocated_Sector_Ct   0x0033   100   100   036    Pre-fail  Always       -       0
>>    7 Seek_Error_Rate         0x000f   100   253   030    Pre-fail  Always       -       296094
>>    9 Power_On_Hours          0x0032   100   100   000    Old_age   Always       -       307
>>   10 Spin_Retry_Count        0x0013   100   100   097    Pre-fail  Always       -       0
>>   12 Power_Cycle_Count       0x0032   099   099   020    Old_age   Always       -       1186
>> 183 Runtime_Bad_Block       0x0032   100   100   000    Old_age   Always       -       0
>> 184 End-to-End_Error        0x0032   100   100   099    Old_age   Always       -       0
>> 187 Reported_Uncorrect      0x0032   001   001   000    Old_age   Always       -       225
>> 188 Command_Timeout         0x0032   100   099   000    Old_age   Always       -       4295032833
>> 189 High_Fly_Writes         0x003a   100   100   000    Old_age   Always       -       0
>> 190 Airflow_Temperature_Cel 0x0022   063   063   045    Old_age   Always       -       37 (Lifetime Min/Max 19/37)
>> 194 Temperature_Celsius     0x0022   037   040   000    Old_age   Always       -       37 (0 16 0 0)
>> 195 Hardware_ECC_Recovered  0x001a   052   033   000    Old_age   Always       -       184733939
>> 197 Current_Pending_Sector  0x0012   100   099   000    Old_age   Always       -       40
>> 198 Offline_Uncorrectable   0x0010   100   099   000    Old_age   Offline      -       40
>> 199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age   Always       -       0
>> 240 Head_Flying_Hours       0x0000   100   253   000    Old_age   Offline      -       210019605808507
>> 241 Total_LBAs_Written      0x0000   100   253   000    Old_age   Offline      -       546635184
>> 242 Total_LBAs_Read         0x0000   100   253   000    Old_age   Offline      -       2195715347

The SMART data shows few start/stops beyond those caused by power
cycles (Start_Stop_Count 1160 vs. Power_Cycle_Count 1186), so I don't
think spindown is related here. The error log entries and the nonzero
Reported_Uncorrect (225) and Offline_Uncorrectable (40) raw values
indicate that your drive is having read errors. I think you likely
need a new drive.
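
To make those two checks concrete, here is a small Python sketch. The attribute rows are copied from the report quoted above; the helper names (raw_values, summarize) are invented for illustration, and the parsing assumes the raw value is the tenth whitespace-separated column, as in this smartctl 5.40 output.

```python
# Sketch: extract RAW_VALUE from smartctl attribute rows and apply the two
# checks discussed above. Helper names are hypothetical.

ROWS = """\
  4 Start_Stop_Count        0x0032   099   099   020    Old_age   Always       -       1160
 12 Power_Cycle_Count       0x0032   099   099   020    Old_age   Always       -       1186
187 Reported_Uncorrect      0x0032   001   001   000    Old_age   Always       -       225
197 Current_Pending_Sector  0x0012   100   099   000    Old_age   Always       -       40
198 Offline_Uncorrectable   0x0010   100   099   000    Old_age   Offline      -       40
"""

MEDIA_ERROR_ATTRS = (
    "Reported_Uncorrect",
    "Current_Pending_Sector",
    "Offline_Uncorrectable",
)


def raw_values(text: str) -> dict:
    """Map attribute name -> integer raw value (tenth column)."""
    values = {}
    for line in text.splitlines():
        fields = line.split()
        if len(fields) >= 10 and fields[0].isdigit():
            values[fields[1]] = int(fields[9])
    return values


def summarize(values: dict) -> dict:
    return {
        # Start/stops beyond what power cycles explain; a value at or below
        # zero suggests idle spindown is not what is cycling the drive.
        "extra_start_stops": values["Start_Stop_Count"] - values["Power_Cycle_Count"],
        # Any nonzero count here points at unreadable sectors.
        "media_errors": {
            name: values[name] for name in MEDIA_ERROR_ATTRS if values.get(name, 0) > 0
        },
    }


print(summarize(raw_values(ROWS)))
```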

* Re: Seagate hard disk firmware issue
  2011-01-25  1:21 ` Robert Hancock
@ 2011-01-25  6:06   ` BU66ER BAD6ER
  2011-04-30 11:32     ` BU66ER BAD6ER
  0 siblings, 1 reply; 10+ messages in thread
From: BU66ER BAD6ER @ 2011-01-25  6:06 UTC (permalink / raw)
  To: linux-ide

Dear both,

Thanks for the input! I will return this hard disk as soon as possible.

The latest output is below.

Best regards!

# smartctl -a /dev/sdb
smartctl 5.40 2010-07-12 r3124 [x86_64-unknown-linux-gnu] (local build)
Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net

=== START OF INFORMATION SECTION ===
Model Family:     Seagate Barracuda LP
Device Model:     ST32000542AS
Serial Number:    5XW20H7P
Firmware Version: CC34
User Capacity:    2,000,398,934,016 bytes
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   8
ATA Standard is:  ATA-8-ACS revision 4
Local Time is:    Tue Jan 25 06:57:27 2011 CET
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00) Offline data collection activity
                                        was never started.
                                        Auto Offline Data Collection: Disabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                 ( 633) seconds.
Offline data collection
capabilities:                    (0x73) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        No Offline surface scan supported.
                                        Self-test supported.
                                        Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   1) minutes.
Extended self-test routine
recommended polling time:        ( 255) minutes.
Conveyance self-test routine
recommended polling time:        (   2) minutes.
SCT capabilities:              (0x103f) SCT Status supported.
                                        SCT Error Recovery Control supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000f   096   085   006    Pre-fail  Always       -       166323635
  3 Spin_Up_Time            0x0003   100   100   000    Pre-fail  Always       -       0
  4 Start_Stop_Count        0x0032   099   099   020    Old_age   Always       -       1164
  5 Reallocated_Sector_Ct   0x0033   100   100   036    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x000f   100   253   030    Pre-fail  Always       -       499927
  9 Power_On_Hours          0x0032   100   100   000    Old_age   Always       -       334
 10 Spin_Retry_Count        0x0013   100   100   097    Pre-fail  Always       -       0
 12 Power_Cycle_Count       0x0032   099   099   020    Old_age   Always       -       1190
183 Runtime_Bad_Block       0x0032   100   100   000    Old_age   Always       -       0
184 End-to-End_Error        0x0032   100   100   099    Old_age   Always       -       0
187 Reported_Uncorrect      0x0032   001   001   000    Old_age   Always       -       495
188 Command_Timeout         0x0032   100   099   000    Old_age   Always       -       4295032833
189 High_Fly_Writes         0x003a   100   100   000    Old_age   Always       -       0
190 Airflow_Temperature_Cel 0x0022   081   063   045    Old_age   Always       -       19 (Lifetime Min/Max 19/19)
194 Temperature_Celsius     0x0022   019   040   000    Old_age   Always       -       19 (0 16 0 0)
195 Hardware_ECC_Recovered  0x001a   048   033   000    Old_age   Always       -       166323635
197 Current_Pending_Sector  0x0012   100   099   000    Old_age   Always       -       24
198 Offline_Uncorrectable   0x0010   100   099   000    Old_age   Offline      -       24
199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age   Always       -       0
240 Head_Flying_Hours       0x0000   100   253   000    Old_age   Offline      -       65979287602589
241 Total_LBAs_Written      0x0000   100   253   000    Old_age   Offline      -       1931349171
242 Total_LBAs_Read         0x0000   100   253   000    Old_age   Offline      -       2326768434

SMART Error Log Version: 1
ATA Error Count: 573 (device log contains only the most recent five errors)
        CR = Command Register [HEX]
        FR = Features Register [HEX]
        SC = Sector Count Register [HEX]
        SN = Sector Number Register [HEX]
        CL = Cylinder Low Register [HEX]
        CH = Cylinder High Register [HEX]
        DH = Device/Head Register [HEX]
        DC = Device Command Register [HEX]
        ER = Error register [HEX]
        ST = Status register [HEX]
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.

Error 573 occurred at disk power-on lifetime: 317 hours (13 days + 5 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 00 83 74 1b 07  Error: UNC at LBA = 0x071b7483 = 119239811

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  c8 00 08 7f 74 1b e7 00      13:20:31.001  READ DMA
  27 00 00 00 00 00 e0 00      13:20:31.000  READ NATIVE MAX ADDRESS EXT
  ec 00 00 00 00 00 a0 00      13:20:30.992  IDENTIFY DEVICE
  ef 03 46 00 00 00 a0 00      13:20:30.986  SET FEATURES [Set transfer mode]
  27 00 00 00 00 00 e0 00      13:20:30.964  READ NATIVE MAX ADDRESS EXT

Error 572 occurred at disk power-on lifetime: 317 hours (13 days + 5 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 00 83 74 1b 07  Error: UNC at LBA = 0x071b7483 = 119239811

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  c8 00 08 7f 74 1b e7 00      13:20:27.236  READ DMA
  27 00 00 00 00 00 e0 00      13:20:27.235  READ NATIVE MAX ADDRESS EXT
  ec 00 00 00 00 00 a0 00      13:20:27.227  IDENTIFY DEVICE
  ef 03 46 00 00 00 a0 00      13:20:27.223  SET FEATURES [Set transfer mode]
  27 00 00 00 00 00 e0 00      13:20:27.199  READ NATIVE MAX ADDRESS EXT

Error 571 occurred at disk power-on lifetime: 317 hours (13 days + 5 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 00 83 74 1b 07  Error: UNC at LBA = 0x071b7483 = 119239811

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  c8 00 08 7f 74 1b e7 00      13:20:23.451  READ DMA
  27 00 00 00 00 00 e0 00      13:20:23.451  READ NATIVE MAX ADDRESS EXT
  ec 00 00 00 00 00 a0 00      13:20:23.442  IDENTIFY DEVICE
  ef 03 46 00 00 00 a0 00      13:20:23.438  SET FEATURES [Set transfer mode]
  27 00 00 00 00 00 e0 00      13:20:23.407  READ NATIVE MAX ADDRESS EXT

Error 570 occurred at disk power-on lifetime: 317 hours (13 days + 5 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 00 83 74 1b 07  Error: UNC at LBA = 0x071b7483 = 119239811

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  c8 00 08 7f 74 1b e7 00      13:20:19.659  READ DMA
  27 00 00 00 00 00 e0 00      13:20:19.658  READ NATIVE MAX ADDRESS EXT
  ec 00 00 00 00 00 a0 00      13:20:19.650  IDENTIFY DEVICE
  ef 03 46 00 00 00 a0 00      13:20:19.619  SET FEATURES [Set transfer mode]
  27 00 00 00 00 00 e0 00      13:20:19.534  READ NATIVE MAX ADDRESS EXT

Error 569 occurred at disk power-on lifetime: 317 hours (13 days + 5 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 00 83 74 1b 07  Error: UNC at LBA = 0x071b7483 = 119239811

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  c8 00 08 7f 74 1b e7 00      13:20:15.894  READ DMA
  27 00 00 00 00 00 e0 00      13:20:15.893  READ NATIVE MAX ADDRESS EXT
  ec 00 00 00 00 00 a0 00      13:20:15.885  IDENTIFY DEVICE
  ef 03 46 00 00 00 a0 00      13:20:15.880  SET FEATURES [Set transfer mode]
  27 00 00 00 00 00 e0 00      13:20:15.857  READ NATIVE MAX ADDRESS EXT

SMART Self-test log structure revision number 1
No self-tests have been logged.  [To run self-tests, use: smartctl -t]


SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

On Tue, Jan 25, 2011 at 2:21 AM, Robert Hancock <hancockrwd@gmail.com> wrote:
> On 01/23/2011 05:05 AM, BU66ER BAD6ER wrote:
>>
>> Hi,
>>
>> Four weeks ago I bought a new 2TB Seagate Barracuda internal SATA
>> drive. That drive has two 667GB ext4 partions (667GB unused) and it is
>> used for storage. My main system (Debian Sid 64-bit and KDE) resides
>> on a 40GB SSD, also using ext4.
>>
>> Two weeks ago I noticed a severe performance drop, where any file
>> manager couldn't view directories on the 2TB disk without a one or two
>> minute penalty. After that I have had three hard freezes of that disk
>> and the entire system. Before the freeze there is very much hd
>> activity and finally I need to turn the power off. I have now also
>> made a backup of /dev/sdb1 should it be fatally serious.
>>
>> I was recommended by someone at the #debian irc to make changes to the
>> spindown_time but that only helped for a few days. Yesterday, the 3rd
>> freeze came and the system wouldn't even recognize the disk after
>> reboot; just 'clicking' waiting for a response. I showed the kern.log
>> to someone at the same channel who concluded that this should be a
>> firmware issue.
>>
>> Here is the latest kern.log which may identify the issue: I hope it
>> contains the relevant details. But first the output of smartctl -a
>> /dev/sdb.
>>
>> I have now set the hdparm spindown_time to 0, disabling disk sleep
>> which seems to have been the culprit as judged on messages in the
>> Dolphin file manager etc.
>>
>> Thanks for any help!
>>
>>
>>> # smartctl -a /dev/sdb
>>> smartctl 5.40 2010-07-12 r3124 [x86_64-unknown-linux-gnu] (local build)
>>> Copyright (C) 2002-10 by Bruce Allen,
>>> http://smartmontools.sourceforge.net
>>>
>>> === START OF INFORMATION SECTION ===
>>> Model Family:     Seagate Barracuda LP
>>> Device Model:     ST32000542AS
>>> Serial Number:    5XW20H7P
>>> Firmware Version: CC34
>>> User Capacity:    2,000,398,934,016 bytes
>>> Device is:        In smartctl database [for details use: -P show]
>>> ATA Version is:   8
>>> ATA Standard is:  ATA-8-ACS revision 4
>>> Local Time is:    Sun Jan 23 11:50:56 2011 CET
>>> SMART support is: Available - device has SMART capability.
>>> SMART support is: Enabled
>>>
>>> === START OF READ SMART DATA SECTION ===
>>> SMART overall-health self-assessment test result: PASSED
>>>
>>> General SMART Values:
>>> Offline data collection status:  (0x00) Offline data collection activity
>>>                                         was never started.
>>>                                         Auto Offline Data Collection: Disabled.
>>> Self-test execution status:      (   0) The previous self-test routine completed
>>>                                         without error or no self-test has ever
>>>                                         been run.
>>> Total time to complete Offline
>>> data collection:                 ( 633) seconds.
>>> Offline data collection
>>> capabilities:                    (0x73) SMART execute Offline immediate.
>>>                                         Auto Offline data collection on/off support.
>>>                                         Suspend Offline collection upon new
>>>                                         command.
>>>                                         No Offline surface scan supported.
>>>                                         Self-test supported.
>>>                                         Conveyance Self-test supported.
>>>                                         Selective Self-test supported.
>>> SMART capabilities:            (0x0003) Saves SMART data before entering
>>>                                         power-saving mode.
>>>                                         Supports SMART auto save timer.
>>> Error logging capability:        (0x01) Error logging supported.
>>>                                         General Purpose Logging supported.
>>> Short self-test routine
>>> recommended polling time:        (   1) minutes.
>>> Extended self-test routine
>>> recommended polling time:        ( 255) minutes.
>>> Conveyance self-test routine
>>> recommended polling time:        (   2) minutes.
>>> SCT capabilities:              (0x103f) SCT Status supported.
>>>                                         SCT Error Recovery Control supported.
>>>                                         SCT Feature Control supported.
>>>                                         SCT Data Table supported.
>>>
>>> SMART Attributes Data Structure revision number: 10
>>> Vendor Specific SMART Attributes with Thresholds:
>>> ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
>>>   1 Raw_Read_Error_Rate     0x000f   100   089   006    Pre-fail  Always       -       184733939
>>>   3 Spin_Up_Time            0x0003   100   100   000    Pre-fail  Always       -       0
>>>   4 Start_Stop_Count        0x0032   099   099   020    Old_age   Always       -       1160
>>>   5 Reallocated_Sector_Ct   0x0033   100   100   036    Pre-fail  Always       -       0
>>>   7 Seek_Error_Rate         0x000f   100   253   030    Pre-fail  Always       -       296094
>>>   9 Power_On_Hours          0x0032   100   100   000    Old_age   Always       -       307
>>>  10 Spin_Retry_Count        0x0013   100   100   097    Pre-fail  Always       -       0
>>>  12 Power_Cycle_Count       0x0032   099   099   020    Old_age   Always       -       1186
>>> 183 Runtime_Bad_Block       0x0032   100   100   000    Old_age   Always       -       0
>>> 184 End-to-End_Error        0x0032   100   100   099    Old_age   Always       -       0
>>> 187 Reported_Uncorrect      0x0032   001   001   000    Old_age   Always       -       225
>>> 188 Command_Timeout         0x0032   100   099   000    Old_age   Always       -       4295032833
>>> 189 High_Fly_Writes         0x003a   100   100   000    Old_age   Always       -       0
>>> 190 Airflow_Temperature_Cel 0x0022   063   063   045    Old_age   Always       -       37 (Lifetime Min/Max 19/37)
>>> 194 Temperature_Celsius     0x0022   037   040   000    Old_age   Always       -       37 (0 16 0 0)
>>> 195 Hardware_ECC_Recovered  0x001a   052   033   000    Old_age   Always       -       184733939
>>> 197 Current_Pending_Sector  0x0012   100   099   000    Old_age   Always       -       40
>>> 198 Offline_Uncorrectable   0x0010   100   099   000    Old_age   Offline      -       40
>>> 199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age   Always       -       0
>>> 240 Head_Flying_Hours       0x0000   100   253   000    Old_age   Offline      -       210019605808507
>>> 241 Total_LBAs_Written      0x0000   100   253   000    Old_age   Offline      -       546635184
>>> 242 Total_LBAs_Read         0x0000   100   253   000    Old_age   Offline      -       2195715347
>
> The SMART data shows there haven't been many start/stops other than from
> power cycles, so I don't think spindown is related here. The error log
> entries and the Offline_Uncorrectable and Reported_Uncorrect attributes
> would indicate that your drive is having read errors. Think you likely need
> a new drive.
>
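The reasoning quoted above can be checked against the raw values from the January 23 report. This is only a minimal sketch: the `assess` helper and its `spindown_margin` threshold are illustrative assumptions, not anything smartctl provides, and the numbers are copied by hand from the attribute table.

```python
# Raw values copied from the quoted `smartctl -a /dev/sdb` output (Jan 23).
raw = {
    "Start_Stop_Count": 1160,
    "Power_Cycle_Count": 1186,
    "Reported_Uncorrect": 225,
    "Current_Pending_Sector": 40,
    "Offline_Uncorrectable": 40,
}


def assess(raw, spindown_margin=50):
    """Return a list of findings.

    spindown_margin is an arbitrary, illustrative threshold for how many
    start/stops beyond the power-cycle count would hint at idle spindowns.
    """
    findings = []
    extra_stops = raw["Start_Stop_Count"] - raw["Power_Cycle_Count"]
    if extra_stops > spindown_margin:
        findings.append("many start/stops beyond power cycles: spindown may be involved")
    else:
        findings.append("start/stops roughly match power cycles: spindown unlikely")
    # Any non-zero count here means the drive has reported unreadable sectors.
    for name in ("Reported_Uncorrect", "Current_Pending_Sector", "Offline_Uncorrectable"):
        if raw[name] > 0:
            findings.append(f"{name} = {raw[name]}: unreadable sectors reported")
    return findings


findings = assess(raw)
for line in findings:
    print(line)
```

With these numbers the start/stop count is actually below the power-cycle count, which is why spindown looks unrelated, while all three uncorrectable-sector counters are non-zero.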

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: Seagate hard disk firmware issue
  2011-01-25  6:06   ` BU66ER BAD6ER
@ 2011-04-30 11:32     ` BU66ER BAD6ER
  2011-04-30 12:08       ` gene heskett
  0 siblings, 1 reply; 10+ messages in thread
From: BU66ER BAD6ER @ 2011-04-30 11:32 UTC (permalink / raw)
  To: linux-ide

Hi,

Some time ago, I returned my hard disk and got a new one. Lately, I'm
having performance issues again and I suspect there is a hardware
error again like last time. If you could confirm this I would be most
grateful.

Thanks in advance!

# smartctl -a /dev/sdb
smartctl 5.41 2011-03-16 r3296 [x86_64-unknown-linux-gnu-2.6.38-2-amd64] (local build)
Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net

=== START OF INFORMATION SECTION ===
Device Model:     ST2000DL003-9VT166
Serial Number:    5YD1YD5P
Firmware Version: CC32
User Capacity:    2,000,398,934,016 bytes
Device is:        Not in smartctl database [for details use: -P showall]
ATA Version is:   8
ATA Standard is:  ATA-8-ACS revision 4
Local Time is:    Sat Apr 30 12:59:32 2011 CEST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x82) Offline data collection activity
                                        was completed without error.
                                        Auto Offline Data Collection: Enabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                (  623) seconds.
Offline data collection
capabilities:                    (0x7b) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   1) minutes.
Extended self-test routine
recommended polling time:        ( 255) minutes.
Conveyance self-test routine
recommended polling time:        (   2) minutes.
SCT capabilities:              (0x30b7) SCT Status supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000f   096   082   006    Pre-fail  Always       -       2447656
  3 Spin_Up_Time            0x0003   097   092   000    Pre-fail  Always       -       0
  4 Start_Stop_Count        0x0032   100   100   020    Old_age   Always       -       265
  5 Reallocated_Sector_Ct   0x0033   100   100   036    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x000f   060   060   030    Pre-fail  Always       -       1108085
  9 Power_On_Hours          0x0032   100   100   000    Old_age   Always       -       717
 10 Spin_Retry_Count        0x0013   100   100   097    Pre-fail  Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   020    Old_age   Always       -       74
183 Runtime_Bad_Block       0x0032   100   100   000    Old_age   Always       -       0
184 End-to-End_Error        0x0032   100   100   099    Old_age   Always       -       0
187 Reported_Uncorrect      0x0032   001   001   000    Old_age   Always       -       792
188 Command_Timeout         0x0032   100   100   000    Old_age   Always       -       0
189 High_Fly_Writes         0x003a   100   100   000    Old_age   Always       -       0
190 Airflow_Temperature_Cel 0x0022   065   062   045    Old_age   Always       -       35 (Min/Max 35/35)
191 G-Sense_Error_Rate      0x0032   100   100   000    Old_age   Always       -       0
192 Power-Off_Retract_Count 0x0032   100   100   000    Old_age   Always       -       202
193 Load_Cycle_Count        0x0032   100   100   000    Old_age   Always       -       265
194 Temperature_Celsius     0x0022   035   040   000    Old_age   Always       -       35 (0 19 0 0)
195 Hardware_ECC_Recovered  0x001a   015   009   000    Old_age   Always       -       2447656
197 Current_Pending_Sector  0x0012   096   096   000    Old_age   Always       -       368
198 Offline_Uncorrectable   0x0010   096   096   000    Old_age   Offline      -       368
199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age   Always       -       0
240 Head_Flying_Hours       0x0000   100   253   000    Old_age   Offline      -       9354438771404
241 Total_LBAs_Written      0x0000   100   253   000    Old_age   Offline      -       793142133
242 Total_LBAs_Read         0x0000   100   253   000    Old_age   Offline      -       307847059

SMART Error Log Version: 1
ATA Error Count: 828 (device log contains only the most recent five errors)
        CR = Command Register [HEX]
        FR = Features Register [HEX]
        SC = Sector Count Register [HEX]
        SN = Sector Number Register [HEX]
        CL = Cylinder Low Register [HEX]
        CH = Cylinder High Register [HEX]
        DH = Device/Head Register [HEX]
        DC = Device Command Register [HEX]
        ER = Error register [HEX]
        ST = Status register [HEX]
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.

Error 828 occurred at disk power-on lifetime: 705 hours (29 days + 9 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 00 ff ff ff 0f  Error: UNC at LBA = 0x0fffffff = 268435455

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  25 00 08 ff ff ff ef 00      03:44:18.432  READ DMA EXT
  27 00 00 00 00 00 e0 00      03:44:18.431  READ NATIVE MAX ADDRESS EXT
  ec 00 00 00 00 00 a0 00      03:44:18.423  IDENTIFY DEVICE
  ef 03 46 00 00 00 a0 00      03:44:18.391  SET FEATURES [Set transfer mode]
  27 00 00 00 00 00 e0 00      03:44:18.391  READ NATIVE MAX ADDRESS EXT

Error 827 occurred at disk power-on lifetime: 705 hours (29 days + 9 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 00 ff ff ff 0f  Error: UNC at LBA = 0x0fffffff = 268435455

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  25 00 08 ff ff ff ef 00      03:44:15.060  READ DMA EXT
  27 00 00 00 00 00 e0 00      03:44:15.059  READ NATIVE MAX ADDRESS EXT
  ec 00 00 00 00 00 a0 00      03:44:15.051  IDENTIFY DEVICE
  ef 03 46 00 00 00 a0 00      03:44:15.019  SET FEATURES [Set transfer mode]
  27 00 00 00 00 00 e0 00      03:44:15.019  READ NATIVE MAX ADDRESS EXT

Error 826 occurred at disk power-on lifetime: 705 hours (29 days + 9 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 00 ff ff ff 0f  Error: UNC at LBA = 0x0fffffff = 268435455

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  25 00 08 ff ff ff ef 00      03:44:11.687  READ DMA EXT
  27 00 00 00 00 00 e0 00      03:44:11.686  READ NATIVE MAX ADDRESS EXT
  ec 00 00 00 00 00 a0 00      03:44:11.662  IDENTIFY DEVICE
  ef 03 46 00 00 00 a0 00      03:44:11.566  SET FEATURES [Set transfer mode]
  27 00 00 00 00 00 e0 00      03:44:11.566  READ NATIVE MAX ADDRESS EXT

Error 825 occurred at disk power-on lifetime: 705 hours (29 days + 9 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 00 ff ff ff 0f  Error: UNC at LBA = 0x0fffffff = 268435455

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  25 00 08 ff ff ff ef 00      03:44:08.323  READ DMA EXT
  27 00 00 00 00 00 e0 00      03:44:08.322  READ NATIVE MAX ADDRESS EXT
  ec 00 00 00 00 00 a0 00      03:44:08.314  IDENTIFY DEVICE
  ef 03 46 00 00 00 a0 00      03:44:08.282  SET FEATURES [Set transfer mode]
  27 00 00 00 00 00 e0 00      03:44:08.282  READ NATIVE MAX ADDRESS EXT

Error 824 occurred at disk power-on lifetime: 705 hours (29 days + 9 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 00 ff ff ff 0f  Error: UNC at LBA = 0x0fffffff = 268435455

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  25 00 08 ff ff ff ef 00      03:44:04.950  READ DMA EXT
  27 00 00 00 00 00 e0 00      03:44:04.949  READ NATIVE MAX ADDRESS EXT
  ec 00 00 00 00 00 a0 00      03:44:04.941  IDENTIFY DEVICE
  ef 03 46 00 00 00 a0 00      03:44:04.910  SET FEATURES [Set transfer mode]
  27 00 00 00 00 00 e0 00      03:44:04.909  READ NATIVE MAX ADDRESS EXT

SMART Self-test log structure revision number 1
No self-tests have been logged.  [To run self-tests, use: smartctl -t]


SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
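
For reference, the "UNC at LBA" figure in each error entry is just the SN, CL, CH and DH register bytes put together as a 28-bit address. A minimal sketch of that arithmetic follows; the function name `lba28_from_registers` is made up for illustration, and the register bytes are copied from the entries above and from the quoted January report below.

```python
def lba28_from_registers(sn, cl, ch, dh):
    """Combine task-file registers into a 28-bit LBA.

    SN holds bits 0-7, CL bits 8-15, CH bits 16-23, and the low nibble
    of DH holds bits 24-27.
    """
    return ((dh & 0x0F) << 24) | (ch << 16) | (cl << 8) | sn


# Register bytes "SN CL CH DH" as printed in the error log entries.
new_drive = lba28_from_registers(0xFF, 0xFF, 0xFF, 0x0F)  # ST2000DL003, April
old_drive = lba28_from_registers(0x83, 0x74, 0x1B, 0x07)  # ST32000542AS, January

print(f"0x{new_drive:08x} = {new_drive}")
print(f"0x{old_drive:08x} = {old_drive}")
```

Note that on the new drive the failing command is READ DMA EXT, a 48-bit command, and every address register reads as all ones. The value 0x0fffffff is therefore most likely a saturated 28-bit field rather than the real failing sector; the extended error log (`smartctl -l xerror`) should show the full 48-bit address if the drive supports it.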



On Tue, Jan 25, 2011 at 7:06 AM, BU66ER BAD6ER <debu66er@gmail.com> wrote:
> Dear both,
>
> Thanks for the input! I will return this hard disk as soon as possible.
>
> Here below is the latest output.
>
> Best regards!
>
> # smartctl -a /dev/sdb
> smartctl 5.40 2010-07-12 r3124 [x86_64-unknown-linux-gnu] (local build)
> Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net
>
> === START OF INFORMATION SECTION ===
> Model Family:     Seagate Barracuda LP
> Device Model:     ST32000542AS
> Serial Number:    5XW20H7P
> Firmware Version: CC34
> User Capacity:    2,000,398,934,016 bytes
> Device is:        In smartctl database [for details use: -P show]
> ATA Version is:   8
> ATA Standard is:  ATA-8-ACS revision 4
> Local Time is:    Tue Jan 25 06:57:27 2011 CET
> SMART support is: Available - device has SMART capability.
> SMART support is: Enabled
>
> === START OF READ SMART DATA SECTION ===
> SMART overall-health self-assessment test result: PASSED
>
> General SMART Values:
> Offline data collection status:  (0x00) Offline data collection activity
>                                        was never started.
>                                        Auto Offline Data Collection: Disabled.
> Self-test execution status:      (   0) The previous self-test routine completed
>                                        without error or no self-test has ever
>                                        been run.
> Total time to complete Offline
> data collection:                 ( 633) seconds.
> Offline data collection
> capabilities:                    (0x73) SMART execute Offline immediate.
>                                        Auto Offline data collection on/off support.
>                                        Suspend Offline collection upon new
>                                        command.
>                                        No Offline surface scan supported.
>                                        Self-test supported.
>                                        Conveyance Self-test supported.
>                                        Selective Self-test supported.
> SMART capabilities:            (0x0003) Saves SMART data before entering
>                                        power-saving mode.
>                                        Supports SMART auto save timer.
> Error logging capability:        (0x01) Error logging supported.
>                                        General Purpose Logging supported.
> Short self-test routine
> recommended polling time:        (   1) minutes.
> Extended self-test routine
> recommended polling time:        ( 255) minutes.
> Conveyance self-test routine
> recommended polling time:        (   2) minutes.
> SCT capabilities:              (0x103f) SCT Status supported.
>                                        SCT Error Recovery Control supported.
>                                        SCT Feature Control supported.
>                                        SCT Data Table supported.
>
> SMART Attributes Data Structure revision number: 10
> Vendor Specific SMART Attributes with Thresholds:
> ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
>   1 Raw_Read_Error_Rate     0x000f   096   085   006    Pre-fail  Always       -       166323635
>   3 Spin_Up_Time            0x0003   100   100   000    Pre-fail  Always       -       0
>   4 Start_Stop_Count        0x0032   099   099   020    Old_age   Always       -       1164
>   5 Reallocated_Sector_Ct   0x0033   100   100   036    Pre-fail  Always       -       0
>   7 Seek_Error_Rate         0x000f   100   253   030    Pre-fail  Always       -       499927
>   9 Power_On_Hours          0x0032   100   100   000    Old_age   Always       -       334
>  10 Spin_Retry_Count        0x0013   100   100   097    Pre-fail  Always       -       0
>  12 Power_Cycle_Count       0x0032   099   099   020    Old_age   Always       -       1190
> 183 Runtime_Bad_Block       0x0032   100   100   000    Old_age   Always       -       0
> 184 End-to-End_Error        0x0032   100   100   099    Old_age   Always       -       0
> 187 Reported_Uncorrect      0x0032   001   001   000    Old_age   Always       -       495
> 188 Command_Timeout         0x0032   100   099   000    Old_age   Always       -       4295032833
> 189 High_Fly_Writes         0x003a   100   100   000    Old_age   Always       -       0
> 190 Airflow_Temperature_Cel 0x0022   081   063   045    Old_age   Always       -       19 (Lifetime Min/Max 19/19)
> 194 Temperature_Celsius     0x0022   019   040   000    Old_age   Always       -       19 (0 16 0 0)
> 195 Hardware_ECC_Recovered  0x001a   048   033   000    Old_age   Always       -       166323635
> 197 Current_Pending_Sector  0x0012   100   099   000    Old_age   Always       -       24
> 198 Offline_Uncorrectable   0x0010   100   099   000    Old_age   Offline      -       24
> 199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age   Always       -       0
> 240 Head_Flying_Hours       0x0000   100   253   000    Old_age   Offline      -       65979287602589
> 241 Total_LBAs_Written      0x0000   100   253   000    Old_age   Offline      -       1931349171
> 242 Total_LBAs_Read         0x0000   100   253   000    Old_age   Offline      -       2326768434
>
> SMART Error Log Version: 1
> ATA Error Count: 573 (device log contains only the most recent five errors)
>        CR = Command Register [HEX]
>        FR = Features Register [HEX]
>        SC = Sector Count Register [HEX]
>        SN = Sector Number Register [HEX]
>        CL = Cylinder Low Register [HEX]
>        CH = Cylinder High Register [HEX]
>        DH = Device/Head Register [HEX]
>        DC = Device Command Register [HEX]
>        ER = Error register [HEX]
>        ST = Status register [HEX]
> Powered_Up_Time is measured from power on, and printed as
> DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
> SS=sec, and sss=millisec. It "wraps" after 49.710 days.
>
> Error 573 occurred at disk power-on lifetime: 317 hours (13 days + 5 hours)
>  When the command that caused the error occurred, the device was active or idle.
>
>  After command completion occurred, registers were:
>  ER ST SC SN CL CH DH
>  -- -- -- -- -- -- --
>  40 51 00 83 74 1b 07  Error: UNC at LBA = 0x071b7483 = 119239811
>
>  Commands leading to the command that caused the error were:
>  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
>  -- -- -- -- -- -- -- --  ----------------  --------------------
>  c8 00 08 7f 74 1b e7 00      13:20:31.001  READ DMA
>  27 00 00 00 00 00 e0 00      13:20:31.000  READ NATIVE MAX ADDRESS EXT
>  ec 00 00 00 00 00 a0 00      13:20:30.992  IDENTIFY DEVICE
>  ef 03 46 00 00 00 a0 00      13:20:30.986  SET FEATURES [Set transfer mode]
>  27 00 00 00 00 00 e0 00      13:20:30.964  READ NATIVE MAX ADDRESS EXT
>
> Error 572 occurred at disk power-on lifetime: 317 hours (13 days + 5 hours)
>  When the command that caused the error occurred, the device was active or idle.
>
>  After command completion occurred, registers were:
>  ER ST SC SN CL CH DH
>  -- -- -- -- -- -- --
>  40 51 00 83 74 1b 07  Error: UNC at LBA = 0x071b7483 = 119239811
>
>  Commands leading to the command that caused the error were:
>  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
>  -- -- -- -- -- -- -- --  ----------------  --------------------
>  c8 00 08 7f 74 1b e7 00      13:20:27.236  READ DMA
>  27 00 00 00 00 00 e0 00      13:20:27.235  READ NATIVE MAX ADDRESS EXT
>  ec 00 00 00 00 00 a0 00      13:20:27.227  IDENTIFY DEVICE
>  ef 03 46 00 00 00 a0 00      13:20:27.223  SET FEATURES [Set transfer mode]
>  27 00 00 00 00 00 e0 00      13:20:27.199  READ NATIVE MAX ADDRESS EXT
>
> Error 571 occurred at disk power-on lifetime: 317 hours (13 days + 5 hours)
>  When the command that caused the error occurred, the device was active or idle.
>
>  After command completion occurred, registers were:
>  ER ST SC SN CL CH DH
>  -- -- -- -- -- -- --
>  40 51 00 83 74 1b 07  Error: UNC at LBA = 0x071b7483 = 119239811
>
>  Commands leading to the command that caused the error were:
>  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
>  -- -- -- -- -- -- -- --  ----------------  --------------------
>  c8 00 08 7f 74 1b e7 00      13:20:23.451  READ DMA
>  27 00 00 00 00 00 e0 00      13:20:23.451  READ NATIVE MAX ADDRESS EXT
>  ec 00 00 00 00 00 a0 00      13:20:23.442  IDENTIFY DEVICE
>  ef 03 46 00 00 00 a0 00      13:20:23.438  SET FEATURES [Set transfer mode]
>  27 00 00 00 00 00 e0 00      13:20:23.407  READ NATIVE MAX ADDRESS EXT
>
> Error 570 occurred at disk power-on lifetime: 317 hours (13 days + 5 hours)
>  When the command that caused the error occurred, the device was active or idle.
>
>  After command completion occurred, registers were:
>  ER ST SC SN CL CH DH
>  -- -- -- -- -- -- --
>  40 51 00 83 74 1b 07  Error: UNC at LBA = 0x071b7483 = 119239811
>
>  Commands leading to the command that caused the error were:
>  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
>  -- -- -- -- -- -- -- --  ----------------  --------------------
>  c8 00 08 7f 74 1b e7 00      13:20:19.659  READ DMA
>  27 00 00 00 00 00 e0 00      13:20:19.658  READ NATIVE MAX ADDRESS EXT
>  ec 00 00 00 00 00 a0 00      13:20:19.650  IDENTIFY DEVICE
>  ef 03 46 00 00 00 a0 00      13:20:19.619  SET FEATURES [Set transfer mode]
>  27 00 00 00 00 00 e0 00      13:20:19.534  READ NATIVE MAX ADDRESS EXT
>
> Error 569 occurred at disk power-on lifetime: 317 hours (13 days + 5 hours)
>  When the command that caused the error occurred, the device was active or idle.
>
>  After command completion occurred, registers were:
>  ER ST SC SN CL CH DH
>  -- -- -- -- -- -- --
>  40 51 00 83 74 1b 07  Error: UNC at LBA = 0x071b7483 = 119239811
>
>  Commands leading to the command that caused the error were:
>  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
>  -- -- -- -- -- -- -- --  ----------------  --------------------
>  c8 00 08 7f 74 1b e7 00      13:20:15.894  READ DMA
>  27 00 00 00 00 00 e0 00      13:20:15.893  READ NATIVE MAX ADDRESS EXT
>  ec 00 00 00 00 00 a0 00      13:20:15.885  IDENTIFY DEVICE
>  ef 03 46 00 00 00 a0 00      13:20:15.880  SET FEATURES [Set transfer mode]
>  27 00 00 00 00 00 e0 00      13:20:15.857  READ NATIVE MAX ADDRESS EXT
>
> SMART Self-test log structure revision number 1
> No self-tests have been logged.  [To run self-tests, use: smartctl -t]
>
>
> SMART Selective self-test log data structure revision number 1
>  SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
>    1        0        0  Not_testing
>    2        0        0  Not_testing
>    3        0        0  Not_testing
>    4        0        0  Not_testing
>    5        0        0  Not_testing
> Selective self-test flags (0x0):
>  After scanning selected spans, do NOT read-scan remainder of disk.
> If Selective self-test is pending on power-up, resume after 0 minute delay.
>
> On Tue, Jan 25, 2011 at 2:21 AM, Robert Hancock <hancockrwd@gmail.com> wrote:
>> On 01/23/2011 05:05 AM, BU66ER BAD6ER wrote:
>>>
>>> Hi,
>>>
>>> Four weeks ago I bought a new 2TB Seagate Barracuda internal SATA
>>> drive. That drive has two 667GB ext4 partitions (667GB unused) and it is
>>> used for storage. My main system (Debian Sid 64-bit and KDE) resides
>>> on a 40GB SSD, also using ext4.
>>>
>>> Two weeks ago I noticed a severe performance drop, where any file
>>> manager couldn't view directories on the 2TB disk without a one or two
>>> minute penalty. After that I have had three hard freezes of that disk
>>> and the entire system. Before the freeze there is very much hd
>>> activity and finally I need to turn the power off. I have now also
>>> made a backup of /dev/sdb1 should it be fatally serious.
>>>
>>> I was recommended by someone at the #debian irc to make changes to the
>>> spindown_time but that only helped for a few days. Yesterday, the 3rd
>>> freeze came and the system wouldn't even recognize the disk after
>>> reboot; just 'clicking' waiting for a response. I showed the kern.log
>>> to someone at the same channel who concluded that this should be a
>>> firmware issue.
>>>
>>> Here is the latest kern.log which may identify the issue: I hope it
>>> contains the relevant details. But first the output of smartctl -a
>>> /dev/sdb.
>>>
>>> I have now set the hdparm spindown_time to 0, disabling disk sleep
>>> which seems to have been the culprit as judged on messages in the
>>> Dolphin file manager etc.
>>>
>>> Thanks for any help!
>>>
>>>
>>>> # smartctl -a /dev/sdb
>>>> smartctl 5.40 2010-07-12 r3124 [x86_64-unknown-linux-gnu] (local build)
>>>> Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net
>>>>
>>>> === START OF INFORMATION SECTION ===
>>>> Model Family:     Seagate Barracuda LP
>>>> Device Model:     ST32000542AS
>>>> Serial Number:    5XW20H7P
>>>> Firmware Version: CC34
>>>> User Capacity:    2,000,398,934,016 bytes
>>>> Device is:        In smartctl database [for details use: -P show]
>>>> ATA Version is:   8
>>>> ATA Standard is:  ATA-8-ACS revision 4
>>>> Local Time is:    Sun Jan 23 11:50:56 2011 CET
>>>> SMART support is: Available - device has SMART capability.
>>>> SMART support is: Enabled
>>>>
>>>> === START OF READ SMART DATA SECTION ===
>>>> SMART overall-health self-assessment test result: PASSED
>>>>
>>>> General SMART Values:
>>>> Offline data collection status:  (0x00) Offline data collection activity
>>>>                                         was never started.
>>>>                                         Auto Offline Data Collection: Disabled.
>>>> Self-test execution status:      (   0) The previous self-test routine completed
>>>>                                         without error or no self-test has ever
>>>>                                         been run.
>>>> Total time to complete Offline
>>>> data collection:                 ( 633) seconds.
>>>> Offline data collection
>>>> capabilities:                    (0x73) SMART execute Offline immediate.
>>>>                                         Auto Offline data collection on/off support.
>>>>                                         Suspend Offline collection upon new
>>>>                                         command.
>>>>                                         No Offline surface scan supported.
>>>>                                         Self-test supported.
>>>>                                         Conveyance Self-test supported.
>>>>                                         Selective Self-test supported.
>>>> SMART capabilities:            (0x0003) Saves SMART data before entering
>>>>                                         power-saving mode.
>>>>                                         Supports SMART auto save timer.
>>>> Error logging capability:        (0x01) Error logging supported.
>>>>                                         General Purpose Logging supported.
>>>> Short self-test routine
>>>> recommended polling time:        (   1) minutes.
>>>> Extended self-test routine
>>>> recommended polling time:        ( 255) minutes.
>>>> Conveyance self-test routine
>>>> recommended polling time:        (   2) minutes.
>>>> SCT capabilities:              (0x103f) SCT Status supported.
>>>>                                         SCT Error Recovery Control supported.
>>>>                                         SCT Feature Control supported.
>>>>                                         SCT Data Table supported.
>>>>
>>>> SMART Attributes Data Structure revision number: 10
>>>> Vendor Specific SMART Attributes with Thresholds:
>>>> ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
>>>>   1 Raw_Read_Error_Rate     0x000f   100   089   006    Pre-fail  Always       -       184733939
>>>>   3 Spin_Up_Time            0x0003   100   100   000    Pre-fail  Always       -       0
>>>>   4 Start_Stop_Count        0x0032   099   099   020    Old_age   Always       -       1160
>>>>   5 Reallocated_Sector_Ct   0x0033   100   100   036    Pre-fail  Always       -       0
>>>>   7 Seek_Error_Rate         0x000f   100   253   030    Pre-fail  Always       -       296094
>>>>   9 Power_On_Hours          0x0032   100   100   000    Old_age   Always       -       307
>>>>  10 Spin_Retry_Count        0x0013   100   100   097    Pre-fail  Always       -       0
>>>>  12 Power_Cycle_Count       0x0032   099   099   020    Old_age   Always       -       1186
>>>> 183 Runtime_Bad_Block       0x0032   100   100   000    Old_age   Always       -       0
>>>> 184 End-to-End_Error        0x0032   100   100   099    Old_age   Always       -       0
>>>> 187 Reported_Uncorrect      0x0032   001   001   000    Old_age   Always       -       225
>>>> 188 Command_Timeout         0x0032   100   099   000    Old_age   Always       -       4295032833
>>>> 189 High_Fly_Writes         0x003a   100   100   000    Old_age   Always       -       0
>>>> 190 Airflow_Temperature_Cel 0x0022   063   063   045    Old_age   Always       -       37 (Lifetime Min/Max 19/37)
>>>> 194 Temperature_Celsius     0x0022   037   040   000    Old_age   Always       -       37 (0 16 0 0)
>>>> 195 Hardware_ECC_Recovered  0x001a   052   033   000    Old_age   Always       -       184733939
>>>> 197 Current_Pending_Sector  0x0012   100   099   000    Old_age   Always       -       40
>>>> 198 Offline_Uncorrectable   0x0010   100   099   000    Old_age   Offline      -       40
>>>> 199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age   Always       -       0
>>>> 240 Head_Flying_Hours       0x0000   100   253   000    Old_age   Offline      -       210019605808507
>>>> 241 Total_LBAs_Written      0x0000   100   253   000    Old_age   Offline      -       546635184
>>>> 242 Total_LBAs_Read         0x0000   100   253   000    Old_age   Offline      -       2195715347
>>
>> The SMART data shows there haven't been many start/stops other than from
>> power cycles, so I don't think spindown is related here. The error log
>> entries and the Offline_Uncorrectable and Reported_Uncorrect attributes
>> would indicate that your drive is having read errors. Think you likely need
>> a new drive.
>>
>


* Re: Seagate hard disk firmware issue
  2011-04-30 11:32     ` BU66ER BAD6ER
@ 2011-04-30 12:08       ` gene heskett
  2011-04-30 22:39         ` BU66ER BAD6ER
  0 siblings, 1 reply; 10+ messages in thread
From: gene heskett @ 2011-04-30 12:08 UTC (permalink / raw)
  To: linux-ide; +Cc: BU66ER BAD6ER

On Saturday, April 30, 2011 07:56:21 AM BU66ER BAD6ER did opine:
This <linux-ide@vger.kernel.org> is a mailing list.  WTH when I click on 
reply-to-list, do I always have to copy/paste the lists address in the To: 
line?  fscking PIMA!

More below, where it should be.

> Hi,
> 
> Some time ago, I returned my hard disk and got a new one. Lately, I'm
> having performance issues again and I suspect there is a hardware
> error again like last time. If you could confirm this I would be most
> grateful.
> 
> Thanks in advance!
> 
> # smartctl -a /dev/sdb
> smartctl 5.41 2011-03-16 r3296
> [x86_64-unknown-linux-gnu-2.6.38-2-amd64] (local build)
> Copyright (C) 2002-11 by Bruce Allen,
> http://smartmontools.sourceforge.net
> 
> === START OF INFORMATION SECTION ===
> Device Model:     ST2000DL003-9VT166
> Serial Number:    5YD1YD5P
> Firmware Version: CC32
> User Capacity:    2,000,398,934,016 bytes
> Device is:        Not in smartctl database [for details use: -P showall]
> ATA Version is:   8
> ATA Standard is:  ATA-8-ACS revision 4
> Local Time is:    Sat Apr 30 12:59:32 2011 CEST
> SMART support is: Available - device has SMART capability.
> SMART support is: Enabled
> 
> === START OF READ SMART DATA SECTION ===
> SMART overall-health self-assessment test result: PASSED
> 
> General SMART Values:
> Offline data collection status:  (0x82) Offline data collection activity
>                                         was completed without error.
>                                         Auto Offline Data Collection:
> Enabled. Self-test execution status:      (   0) The previous self-test
> routine completed without error or no self-test has ever been run.
> Total time to complete Offline
> data collection:                (  623) seconds.
> Offline data collection
> capabilities:                    (0x7b) SMART execute Offline immediate.
>                                         Auto Offline data collection
> on/off support.
>                                         Suspend Offline collection upon
> new command.
>                                         Offline surface scan supported.
>                                         Self-test supported.
>                                         Conveyance Self-test supported.
>                                         Selective Self-test supported.
> SMART capabilities:            (0x0003) Saves SMART data before entering
>                                         power-saving mode.
>                                         Supports SMART auto save timer.
> Error logging capability:        (0x01) Error logging supported.
>                                         General Purpose Logging
> supported. Short self-test routine
> recommended polling time:        (   1) minutes.
> Extended self-test routine
> recommended polling time:        ( 255) minutes.
> Conveyance self-test routine
> recommended polling time:        (   2) minutes.
> SCT capabilities:              (0x30b7) SCT Status supported.
>                                         SCT Feature Control supported.
>                                         SCT Data Table supported.
> 
> SMART Attributes Data Structure revision number: 10
> Vendor Specific SMART Attributes with Thresholds:
> ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE
> UPDATED  WHEN_FAILED RAW_VALUE
>   1 Raw_Read_Error_Rate     0x000f   096   082   006    Pre-fail
> Always       -       2447656
>   3 Spin_Up_Time            0x0003   097   092   000    Pre-fail
> Always       -       0
>   4 Start_Stop_Count        0x0032   100   100   020    Old_age
> Always       -       265
>   5 Reallocated_Sector_Ct   0x0033   100   100   036    Pre-fail
> Always       -       0
>   7 Seek_Error_Rate         0x000f   060   060   030    Pre-fail
> Always       -       1108085
>   9 Power_On_Hours          0x0032   100   100   000    Old_age
> Always       -       717
>  10 Spin_Retry_Count        0x0013   100   100   097    Pre-fail
> Always       -       0
>  12 Power_Cycle_Count       0x0032   100   100   020    Old_age
> Always       -       74
> 183 Runtime_Bad_Block       0x0032   100   100   000    Old_age
> Always       -       0
> 184 End-to-End_Error        0x0032   100   100   099    Old_age
> Always       -       0
> 187 Reported_Uncorrect      0x0032   001   001   000    Old_age
> Always       -       792
> 188 Command_Timeout         0x0032   100   100   000    Old_age
> Always       -       0
> 189 High_Fly_Writes         0x003a   100   100   000    Old_age
> Always       -       0
> 190 Airflow_Temperature_Cel 0x0022   065   062   045    Old_age
> Always       -       35 (Min/Max 35/35)
> 191 G-Sense_Error_Rate      0x0032   100   100   000    Old_age
> Always       -       0
> 192 Power-Off_Retract_Count 0x0032   100   100   000    Old_age
> Always       -       202
> 193 Load_Cycle_Count        0x0032   100   100   000    Old_age
> Always       -       265
> 194 Temperature_Celsius     0x0022   035   040   000    Old_age
> Always       -       35 (0 19 0 0)
> 195 Hardware_ECC_Recovered  0x001a   015   009   000    Old_age
> Always       -       2447656
> 197 Current_Pending_Sector  0x0012   096   096   000    Old_age
> Always       -       368
> 198 Offline_Uncorrectable   0x0010   096   096   000    Old_age
> Offline      -       368
> 199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age
> Always       -       0
> 240 Head_Flying_Hours       0x0000   100   253   000    Old_age
> Offline      -       9354438771404
> 241 Total_LBAs_Written      0x0000   100   253   000    Old_age
> Offline      -       793142133
> 242 Total_LBAs_Read         0x0000   100   253   000    Old_age
> Offline      -       307847059
> 
> SMART Error Log Version: 1
> ATA Error Count: 828 (device log contains only the most recent five
> errors) CR = Command Register [HEX]
>         FR = Features Register [HEX]
>         SC = Sector Count Register [HEX]
>         SN = Sector Number Register [HEX]
>         CL = Cylinder Low Register [HEX]
>         CH = Cylinder High Register [HEX]
>         DH = Device/Head Register [HEX]
>         DC = Device Command Register [HEX]
>         ER = Error register [HEX]
>         ST = Status register [HEX]
> Powered_Up_Time is measured from power on, and printed as
> DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
> SS=sec, and sss=millisec. It "wraps" after 49.710 days.
> 
> Error 828 occurred at disk power-on lifetime: 705 hours (29 days + 9
> hours) When the command that caused the error occurred, the device was
> active or idle.
> 
>   After command completion occurred, registers were:
>   ER ST SC SN CL CH DH
>   -- -- -- -- -- -- --
>   40 51 00 ff ff ff 0f  Error: UNC at LBA = 0x0fffffff = 268435455
> 
>   Commands leading to the command that caused the error were:
>   CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
>   -- -- -- -- -- -- -- --  ----------------  --------------------
>   25 00 08 ff ff ff ef 00      03:44:18.432  READ DMA EXT
>   27 00 00 00 00 00 e0 00      03:44:18.431  READ NATIVE MAX ADDRESS EXT
>   ec 00 00 00 00 00 a0 00      03:44:18.423  IDENTIFY DEVICE
>   ef 03 46 00 00 00 a0 00      03:44:18.391  SET FEATURES [Set transfer
> mode] 27 00 00 00 00 00 e0 00      03:44:18.391  READ NATIVE MAX
> ADDRESS EXT
> 
> Error 827 occurred at disk power-on lifetime: 705 hours (29 days + 9
> hours) When the command that caused the error occurred, the device was
> active or idle.
> 
>   After command completion occurred, registers were:
>   ER ST SC SN CL CH DH
>   -- -- -- -- -- -- --
>   40 51 00 ff ff ff 0f  Error: UNC at LBA = 0x0fffffff = 268435455
> 
>   Commands leading to the command that caused the error were:
>   CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
>   -- -- -- -- -- -- -- --  ----------------  --------------------
>   25 00 08 ff ff ff ef 00      03:44:15.060  READ DMA EXT
>   27 00 00 00 00 00 e0 00      03:44:15.059  READ NATIVE MAX ADDRESS EXT
>   ec 00 00 00 00 00 a0 00      03:44:15.051  IDENTIFY DEVICE
>   ef 03 46 00 00 00 a0 00      03:44:15.019  SET FEATURES [Set transfer
> mode] 27 00 00 00 00 00 e0 00      03:44:15.019  READ NATIVE MAX
> ADDRESS EXT
> 
> Error 826 occurred at disk power-on lifetime: 705 hours (29 days + 9
> hours) When the command that caused the error occurred, the device was
> active or idle.
> 
>   After command completion occurred, registers were:
>   ER ST SC SN CL CH DH
>   -- -- -- -- -- -- --
>   40 51 00 ff ff ff 0f  Error: UNC at LBA = 0x0fffffff = 268435455
> 
>   Commands leading to the command that caused the error were:
>   CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
>   -- -- -- -- -- -- -- --  ----------------  --------------------
>   25 00 08 ff ff ff ef 00      03:44:11.687  READ DMA EXT
>   27 00 00 00 00 00 e0 00      03:44:11.686  READ NATIVE MAX ADDRESS EXT
>   ec 00 00 00 00 00 a0 00      03:44:11.662  IDENTIFY DEVICE
>   ef 03 46 00 00 00 a0 00      03:44:11.566  SET FEATURES [Set transfer
> mode] 27 00 00 00 00 00 e0 00      03:44:11.566  READ NATIVE MAX
> ADDRESS EXT
> 
> Error 825 occurred at disk power-on lifetime: 705 hours (29 days + 9
> hours) When the command that caused the error occurred, the device was
> active or idle.
> 
>   After command completion occurred, registers were:
>   ER ST SC SN CL CH DH
>   -- -- -- -- -- -- --
>   40 51 00 ff ff ff 0f  Error: UNC at LBA = 0x0fffffff = 268435455
> 
>   Commands leading to the command that caused the error were:
>   CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
>   -- -- -- -- -- -- -- --  ----------------  --------------------
>   25 00 08 ff ff ff ef 00      03:44:08.323  READ DMA EXT
>   27 00 00 00 00 00 e0 00      03:44:08.322  READ NATIVE MAX ADDRESS EXT
>   ec 00 00 00 00 00 a0 00      03:44:08.314  IDENTIFY DEVICE
>   ef 03 46 00 00 00 a0 00      03:44:08.282  SET FEATURES [Set transfer
> mode] 27 00 00 00 00 00 e0 00      03:44:08.282  READ NATIVE MAX
> ADDRESS EXT
> 
> Error 824 occurred at disk power-on lifetime: 705 hours (29 days + 9
> hours) When the command that caused the error occurred, the device was
> active or idle.
> 
>   After command completion occurred, registers were:
>   ER ST SC SN CL CH DH
>   -- -- -- -- -- -- --
>   40 51 00 ff ff ff 0f  Error: UNC at LBA = 0x0fffffff = 268435455
> 
>   Commands leading to the command that caused the error were:
>   CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
>   -- -- -- -- -- -- -- --  ----------------  --------------------
>   25 00 08 ff ff ff ef 00      03:44:04.950  READ DMA EXT
>   27 00 00 00 00 00 e0 00      03:44:04.949  READ NATIVE MAX ADDRESS EXT
>   ec 00 00 00 00 00 a0 00      03:44:04.941  IDENTIFY DEVICE
>   ef 03 46 00 00 00 a0 00      03:44:04.910  SET FEATURES [Set transfer
> mode] 27 00 00 00 00 00 e0 00      03:44:04.909  READ NATIVE MAX
> ADDRESS EXT
> 
> SMART Self-test log structure revision number 1
> No self-tests have been logged.  [To run self-tests, use: smartctl -t]
> 
> 
> SMART Selective self-test log data structure revision number 1
>  SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
>     1        0        0  Not_testing
>     2        0        0  Not_testing
>     3        0        0  Not_testing
>     4        0        0  Not_testing
>     5        0        0  Not_testing
> Selective self-test flags (0x0):
>   After scanning selected spans, do NOT read-scan remainder of disk.
> If Selective self-test is pending on power-up, resume after 0 minute
> delay.
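
To help read the register dumps in the error log quoted above, here is a minimal Python sketch that decodes the ER and ST bytes and rebuilds the 28-bit LBA from SN/CL/CH/DH. The bit-name tables and the function names (decode, lba28) are this sketch's own assumptions based on the classic ATA task-file layout; they are not taken from smartmontools.

```python
# Classic ATA task-file bit names. Several are marked obsolete in newer
# ATA revisions; the tables are an assumption of this sketch.
ERROR_BITS = {0x80: "ICRC", 0x40: "UNC", 0x20: "MC", 0x10: "IDNF",
              0x08: "MCR", 0x04: "ABRT", 0x02: "NM", 0x01: "AMNF"}
STATUS_BITS = {0x80: "BSY", 0x40: "DRDY", 0x20: "DF", 0x10: "DSC",
               0x08: "DRQ", 0x04: "CORR", 0x02: "IDX", 0x01: "ERR"}


def decode(value, table):
    """Return the names of the bits set in value, highest bit first."""
    return [name for mask, name in sorted(table.items(), reverse=True)
            if value & mask]


def lba28(sn, cl, ch, dh):
    """Combine SN, CL, CH and the low nibble of DH into a 28-bit LBA."""
    return ((dh & 0x0F) << 24) | (ch << 16) | (cl << 8) | sn


if __name__ == "__main__":
    # Values from the line "40 51 00 ff ff ff 0f" in the quoted log.
    print(decode(0x40, ERROR_BITS))
    print(decode(0x51, STATUS_BITS))
    print(hex(lba28(0xFF, 0xFF, 0xFF, 0x0F)))
```

Note that 0x0fffffff is the largest value 28 bits can hold, and the failing command was READ DMA EXT (a 48-bit command), so the LBA shown in this summary log is likely saturated rather than the real location of the bad sector.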

I might run the selftest (long) to get a better idea, but if there is not a 
firmware update for this drive on the Seagate site, I believe I'd be asking 
for an RA forthwith.
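
As a rough aid for reading the attribute table, here is a minimal Python sketch that pulls the raw values out of unwrapped `smartctl -A` style rows and reports the ones related to unreadable sectors when they are non-zero. Which attribute IDs to watch (5, 187, 197, 198) is an assumption of this sketch, not a smartmontools rule, and the names WATCHED, ROW and suspicious_attributes are made up for illustration. The sample rows are taken from the output quoted above, joined back onto one line each.

```python
import re

# Attribute IDs this sketch treats as worth flagging when the raw value
# is non-zero. The selection is an assumption, not an official list.
WATCHED = {5, 187, 197, 198}

# ID, name, flag, value, worst, thresh, type, updated, when_failed, raw
ROW = re.compile(
    r"^\s*(\d+)\s+(\S+)\s+0x[0-9a-fA-F]{4}\s+(\d+)\s+(\d+)\s+(\d+)"
    r"\s+(\S+)\s+(\S+)\s+(\S+)\s+(\d+)"
)

SAMPLE = """\
  5 Reallocated_Sector_Ct   0x0033   100   100   036    Pre-fail  Always       -       0
187 Reported_Uncorrect      0x0032   001   001   000    Old_age   Always       -       792
188 Command_Timeout         0x0032   100   100   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0012   096   096   000    Old_age   Always       -       368
198 Offline_Uncorrectable   0x0010   096   096   000    Old_age   Offline      -       368
"""


def suspicious_attributes(text):
    """Map attribute name to raw value for watched IDs with raw > 0."""
    found = {}
    for line in text.splitlines():
        match = ROW.match(line)
        if not match:
            continue  # header, blank or wrapped line
        attr_id = int(match.group(1))
        raw = int(match.group(9))
        if attr_id in WATCHED and raw > 0:
            found[match.group(2)] = raw
    return found


if __name__ == "__main__":
    for name, raw in suspicious_attributes(SAMPLE).items():
        print(f"{name}: raw value {raw}")
```

The sketch expects each attribute on a single line, so output that a mail client has re-wrapped (as in this thread) would need to be joined first.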

Be aware that I just updated 2 identical 1 terabyte Seagate drives about 3
weeks ago, and the firmware update, while it did not scramble the partition
table data, did scramble the partition labels AND the blkids of the boot
drive only.  The second drive is no longer hanging the system with bus
resets, but it still has a write speed of about 3.5 megs/second.  I had to
re-install.  Fortunately I had data backups from the night before, courtesy
of amanda.

-- 
Cheers, Gene
"There are four boxes to be used in defense of liberty:
 soap, ballot, jury, and ammo. Please use in that order."
-Ed Howdershelt (Author)
<http://tinyurl.com/ddg5bz>
<http://www.cantrip.org/gatto.html>
Row, row, row your bits, gently down the stream...


* Re: Seagate hard disk firmware issue
  2011-04-30 12:08       ` gene heskett
@ 2011-04-30 22:39         ` BU66ER BAD6ER
  2011-05-01 17:18           ` gene heskett
  0 siblings, 1 reply; 10+ messages in thread
From: BU66ER BAD6ER @ 2011-04-30 22:39 UTC (permalink / raw)
  To: linux-ide

On Sat, Apr 30, 2011 at 2:08 PM, gene heskett <gheskett@wdtv.com> wrote:
> On Saturday, April 30, 2011 07:56:21 AM BU66ER BAD6ER did opine:
> This <linux-ide@vger.kernel.org> is a mailing list.  WTH when I click on
> reply-to-list, do I always have to copy/paste the lists address in the To:
> line?  fscking PIMA!
>
> More below, where it should be.
>
>> Hi,
>>
>> Some time ago, I returned my hard disk and got a new one. Lately, I'm
>> having performance issues again and I suspect there is a hardware
>> error again like last time. If you could confirm this I would be most
>> grateful.
>>
>> Thanks in advance!
>>
>> # smartctl -a /dev/sdb
>> smartctl 5.41 2011-03-16 r3296
>> [x86_64-unknown-linux-gnu-2.6.38-2-amd64] (local build)
>> Copyright (C) 2002-11 by Bruce Allen,
>> http://smartmontools.sourceforge.net
>>
>> === START OF INFORMATION SECTION ===
>> Device Model:     ST2000DL003-9VT166
>> Serial Number:    5YD1YD5P
>> Firmware Version: CC32
>> User Capacity:    2,000,398,934,016 bytes
>> Device is:        Not in smartctl database [for details use: -P showall]
>> ATA Version is:   8
>> ATA Standard is:  ATA-8-ACS revision 4
>> Local Time is:    Sat Apr 30 12:59:32 2011 CEST
>> SMART support is: Available - device has SMART capability.
>> SMART support is: Enabled
>>
>> === START OF READ SMART DATA SECTION ===
>> SMART overall-health self-assessment test result: PASSED
>>
>> General SMART Values:
>> Offline data collection status:  (0x82) Offline data collection activity
>>                                         was completed without error.
>>                                         Auto Offline Data Collection:
>> Enabled. Self-test execution status:      (   0) The previous self-test
>> routine completed without error or no self-test has ever been run.
>> Total time to complete Offline
>> data collection:                (  623) seconds.
>> Offline data collection
>> capabilities:                    (0x7b) SMART execute Offline immediate.
>>                                         Auto Offline data collection
>> on/off support.
>>                                         Suspend Offline collection upon
>> new command.
>>                                         Offline surface scan supported.
>>                                         Self-test supported.
>>                                         Conveyance Self-test supported.
>>                                         Selective Self-test supported.
>> SMART capabilities:            (0x0003) Saves SMART data before entering
>>                                         power-saving mode.
>>                                         Supports SMART auto save timer.
>> Error logging capability:        (0x01) Error logging supported.
>>                                         General Purpose Logging
>> supported. Short self-test routine
>> recommended polling time:        (   1) minutes.
>> Extended self-test routine
>> recommended polling time:        ( 255) minutes.
>> Conveyance self-test routine
>> recommended polling time:        (   2) minutes.
>> SCT capabilities:              (0x30b7) SCT Status supported.
>>                                         SCT Feature Control supported.
>>                                         SCT Data Table supported.
>>
>> SMART Attributes Data Structure revision number: 10
>> Vendor Specific SMART Attributes with Thresholds:
>> ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE
>> UPDATED  WHEN_FAILED RAW_VALUE
>>   1 Raw_Read_Error_Rate     0x000f   096   082   006    Pre-fail
>> Always       -       2447656
>>   3 Spin_Up_Time            0x0003   097   092   000    Pre-fail
>> Always       -       0
>>   4 Start_Stop_Count        0x0032   100   100   020    Old_age
>> Always       -       265
>>   5 Reallocated_Sector_Ct   0x0033   100   100   036    Pre-fail
>> Always       -       0
>>   7 Seek_Error_Rate         0x000f   060   060   030    Pre-fail
>> Always       -       1108085
>>   9 Power_On_Hours          0x0032   100   100   000    Old_age
>> Always       -       717
>>  10 Spin_Retry_Count        0x0013   100   100   097    Pre-fail
>> Always       -       0
>>  12 Power_Cycle_Count       0x0032   100   100   020    Old_age
>> Always       -       74
>> 183 Runtime_Bad_Block       0x0032   100   100   000    Old_age
>> Always       -       0
>> 184 End-to-End_Error        0x0032   100   100   099    Old_age
>> Always       -       0
>> 187 Reported_Uncorrect      0x0032   001   001   000    Old_age
>> Always       -       792
>> 188 Command_Timeout         0x0032   100   100   000    Old_age
>> Always       -       0
>> 189 High_Fly_Writes         0x003a   100   100   000    Old_age
>> Always       -       0
>> 190 Airflow_Temperature_Cel 0x0022   065   062   045    Old_age
>> Always       -       35 (Min/Max 35/35)
>> 191 G-Sense_Error_Rate      0x0032   100   100   000    Old_age
>> Always       -       0
>> 192 Power-Off_Retract_Count 0x0032   100   100   000    Old_age
>> Always       -       202
>> 193 Load_Cycle_Count        0x0032   100   100   000    Old_age
>> Always       -       265
>> 194 Temperature_Celsius     0x0022   035   040   000    Old_age
>> Always       -       35 (0 19 0 0)
>> 195 Hardware_ECC_Recovered  0x001a   015   009   000    Old_age
>> Always       -       2447656
>> 197 Current_Pending_Sector  0x0012   096   096   000    Old_age
>> Always       -       368
>> 198 Offline_Uncorrectable   0x0010   096   096   000    Old_age
>> Offline      -       368
>> 199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age
>> Always       -       0
>> 240 Head_Flying_Hours       0x0000   100   253   000    Old_age
>> Offline      -       9354438771404
>> 241 Total_LBAs_Written      0x0000   100   253   000    Old_age
>> Offline      -       793142133
>> 242 Total_LBAs_Read         0x0000   100   253   000    Old_age
>> Offline      -       307847059
>>
>> SMART Error Log Version: 1
>> ATA Error Count: 828 (device log contains only the most recent five
>> errors) CR = Command Register [HEX]
>>         FR = Features Register [HEX]
>>         SC = Sector Count Register [HEX]
>>         SN = Sector Number Register [HEX]
>>         CL = Cylinder Low Register [HEX]
>>         CH = Cylinder High Register [HEX]
>>         DH = Device/Head Register [HEX]
>>         DC = Device Command Register [HEX]
>>         ER = Error register [HEX]
>>         ST = Status register [HEX]
>> Powered_Up_Time is measured from power on, and printed as
>> DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
>> SS=sec, and sss=millisec. It "wraps" after 49.710 days.
>>
>> Error 828 occurred at disk power-on lifetime: 705 hours (29 days + 9
>> hours) When the command that caused the error occurred, the device was
>> active or idle.
>>
>>   After command completion occurred, registers were:
>>   ER ST SC SN CL CH DH
>>   -- -- -- -- -- -- --
>>   40 51 00 ff ff ff 0f  Error: UNC at LBA = 0x0fffffff = 268435455
>>
>>   Commands leading to the command that caused the error were:
>>   CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
>>   -- -- -- -- -- -- -- --  ----------------  --------------------
>>   25 00 08 ff ff ff ef 00      03:44:18.432  READ DMA EXT
>>   27 00 00 00 00 00 e0 00      03:44:18.431  READ NATIVE MAX ADDRESS EXT
>>   ec 00 00 00 00 00 a0 00      03:44:18.423  IDENTIFY DEVICE
>>   ef 03 46 00 00 00 a0 00      03:44:18.391  SET FEATURES [Set transfer
>> mode] 27 00 00 00 00 00 e0 00      03:44:18.391  READ NATIVE MAX
>> ADDRESS EXT
>>
>> Error 827 occurred at disk power-on lifetime: 705 hours (29 days + 9
>> hours) When the command that caused the error occurred, the device was
>> active or idle.
>>
>>   After command completion occurred, registers were:
>>   ER ST SC SN CL CH DH
>>   -- -- -- -- -- -- --
>>   40 51 00 ff ff ff 0f  Error: UNC at LBA = 0x0fffffff = 268435455
>>
>>   Commands leading to the command that caused the error were:
>>   CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
>>   -- -- -- -- -- -- -- --  ----------------  --------------------
>>   25 00 08 ff ff ff ef 00      03:44:15.060  READ DMA EXT
>>   27 00 00 00 00 00 e0 00      03:44:15.059  READ NATIVE MAX ADDRESS EXT
>>   ec 00 00 00 00 00 a0 00      03:44:15.051  IDENTIFY DEVICE
>>   ef 03 46 00 00 00 a0 00      03:44:15.019  SET FEATURES [Set transfer
>> mode] 27 00 00 00 00 00 e0 00      03:44:15.019  READ NATIVE MAX
>> ADDRESS EXT
>>
>> Error 826 occurred at disk power-on lifetime: 705 hours (29 days + 9
>> hours) When the command that caused the error occurred, the device was
>> active or idle.
>>
>>   After command completion occurred, registers were:
>>   ER ST SC SN CL CH DH
>>   -- -- -- -- -- -- --
>>   40 51 00 ff ff ff 0f  Error: UNC at LBA = 0x0fffffff = 268435455
>>
>>   Commands leading to the command that caused the error were:
>>   CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
>>   -- -- -- -- -- -- -- --  ----------------  --------------------
>>   25 00 08 ff ff ff ef 00      03:44:11.687  READ DMA EXT
>>   27 00 00 00 00 00 e0 00      03:44:11.686  READ NATIVE MAX ADDRESS EXT
>>   ec 00 00 00 00 00 a0 00      03:44:11.662  IDENTIFY DEVICE
>>   ef 03 46 00 00 00 a0 00      03:44:11.566  SET FEATURES [Set transfer
>> mode] 27 00 00 00 00 00 e0 00      03:44:11.566  READ NATIVE MAX
>> ADDRESS EXT
>>
>> Error 825 occurred at disk power-on lifetime: 705 hours (29 days + 9
>> hours) When the command that caused the error occurred, the device was
>> active or idle.
>>
>>   After command completion occurred, registers were:
>>   ER ST SC SN CL CH DH
>>   -- -- -- -- -- -- --
>>   40 51 00 ff ff ff 0f  Error: UNC at LBA = 0x0fffffff = 268435455
>>
>>   Commands leading to the command that caused the error were:
>>   CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
>>   -- -- -- -- -- -- -- --  ----------------  --------------------
>>   25 00 08 ff ff ff ef 00      03:44:08.323  READ DMA EXT
>>   27 00 00 00 00 00 e0 00      03:44:08.322  READ NATIVE MAX ADDRESS EXT
>>   ec 00 00 00 00 00 a0 00      03:44:08.314  IDENTIFY DEVICE
>>   ef 03 46 00 00 00 a0 00      03:44:08.282  SET FEATURES [Set transfer
>> mode] 27 00 00 00 00 00 e0 00      03:44:08.282  READ NATIVE MAX
>> ADDRESS EXT
>>
>> Error 824 occurred at disk power-on lifetime: 705 hours (29 days + 9
>> hours) When the command that caused the error occurred, the device was
>> active or idle.
>>
>>   After command completion occurred, registers were:
>>   ER ST SC SN CL CH DH
>>   -- -- -- -- -- -- --
>>   40 51 00 ff ff ff 0f  Error: UNC at LBA = 0x0fffffff = 268435455
>>
>>   Commands leading to the command that caused the error were:
>>   CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
>>   -- -- -- -- -- -- -- --  ----------------  --------------------
>>   25 00 08 ff ff ff ef 00      03:44:04.950  READ DMA EXT
>>   27 00 00 00 00 00 e0 00      03:44:04.949  READ NATIVE MAX ADDRESS EXT
>>   ec 00 00 00 00 00 a0 00      03:44:04.941  IDENTIFY DEVICE
>>   ef 03 46 00 00 00 a0 00      03:44:04.910  SET FEATURES [Set transfer
>> mode] 27 00 00 00 00 00 e0 00      03:44:04.909  READ NATIVE MAX
>> ADDRESS EXT
>>
>> SMART Self-test log structure revision number 1
>> No self-tests have been logged.  [To run self-tests, use: smartctl -t]
>>
>>
>> SMART Selective self-test log data structure revision number 1
>>  SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
>>     1        0        0  Not_testing
>>     2        0        0  Not_testing
>>     3        0        0  Not_testing
>>     4        0        0  Not_testing
>>     5        0        0  Not_testing
>> Selective self-test flags (0x0):
>>   After scanning selected spans, do NOT read-scan remainder of disk.
>> If Selective self-test is pending on power-up, resume after 0 minute
>> delay.
>
> I might run the selftest (long) to get a better idea, but if there is not a
> firmware update for this drive on the Seagate site, I believe I'd be asking
> for an RA forthwith.
>
> Be aware that I just updated 2 identical 1 terabyte Seagate drives about 3
> weeks ago, and the firmware update, while it did not scramble the partition
> table data, did scramble the partition labels AND the blkids of the boot
> drive only.  The second drive is no longer hanging the system with bus
> resets, but it still has a write speed of about 3.5 megs/second.  I had to
> re-install.  Fortunately I had data backups from the night before, courtesy
> of amanda.
>
> --
> Cheers, Gene
> "There are four boxes to be used in defense of liberty:
>  soap, ballot, jury, and ammo. Please use in that order."
> -Ed Howdershelt (Author)
> <http://tinyurl.com/ddg5bz>
> <http://www.cantrip.org/gatto.html>
> Row, row, row your bits, gently down the stream...
>

Hi, thanks for the reply.

Here is my reply, below :)

This is the output of smartctl -t long /dev/sdb and smartctl -l
selftest /dev/sdb. I hope that was the correct procedure.

# smartctl -t long /dev/sdb
smartctl 5.41 2011-03-16 r3296
[x86_64-unknown-linux-gnu-2.6.38-2-amd64] (local build)
Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net

=== START OF OFFLINE IMMEDIATE AND SELF-TEST SECTION ===
Sending command: "Execute SMART Extended self-test routine immediately
in off-line mode".
Drive command "Execute SMART Extended self-test routine immediately in
off-line mode" successful.
Testing has begun.
Please wait 255 minutes for test to complete.
Test will complete after Sat Apr 30 23:06:43 2011

Use smartctl -X to abort test.

# smartctl -l selftest /dev/sdb
smartctl 5.41 2011-03-16 r3296
[x86_64-unknown-linux-gnu-2.6.38-2-amd64] (local build)
Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net

=== START OF READ SMART DATA SECTION ===
SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining
LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed: read failure       90%       723
      469696848



Thanks for any interpretation of this.
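
For reference, a minimal Python sketch that turns the LBA_of_first_error from the self-test log into a byte offset and a rough position on the disk. It assumes 512-byte logical sectors, which is not confirmed anywhere in this output, and the function names are made up for illustration. The capacity is the User Capacity reported by smartctl above.

```python
SECTOR_BYTES = 512  # assumed logical sector size, not confirmed by the output


def lba_to_offset(lba, sector_bytes=SECTOR_BYTES):
    """Byte offset of the start of the given LBA."""
    return lba * sector_bytes


def position_percent(lba, capacity_bytes, sector_bytes=SECTOR_BYTES):
    """How far into the disk the LBA lies, as a percentage."""
    total_sectors = capacity_bytes // sector_bytes
    return 100.0 * lba / total_sectors


if __name__ == "__main__":
    first_error = 469696848      # LBA_of_first_error from the self-test log
    capacity = 2000398934016     # User Capacity in bytes
    print(lba_to_offset(first_error))
    print(round(position_percent(first_error, capacity), 1))
```

Under that assumption the first read failure sits roughly 12% into the disk, which is in line with the test stopping with 90% remaining.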


* Re: Seagate hard disk firmware issue
  2011-04-30 22:39         ` BU66ER BAD6ER
@ 2011-05-01 17:18           ` gene heskett
  2011-06-06 10:09             ` BU66ER BAD6ER
  0 siblings, 1 reply; 10+ messages in thread
From: gene heskett @ 2011-05-01 17:18 UTC (permalink / raw)
  To: BU66ER BAD6ER; +Cc: linux-ide

On Sunday, May 01, 2011 01:16:34 PM BU66ER BAD6ER did opine:

> On Sat, Apr 30, 2011 at 2:08 PM, gene heskett <gheskett@wdtv.com> wrote:
> > On Saturday, April 30, 2011 07:56:21 AM BU66ER BAD6ER did opine:
> > This <linux-ide@vger.kernel.org> is a mailing list.  WTH when I click
> > on reply-to-list, do I always have to copy/paste the lists address in
> > the To: line?  fscking PIMA!
> > 
> > More below, where it should be.
> > 
> >> Hi,
> >> 
> >> Some time ago, I returned my hard disk and got a new one. Lately, I'm
> >> having performance issues again and I suspect there is a hardware
> >> error again like last time. If you could confirm this I would be most
> >> grateful.
> >> 
> >> Thanks in advance!
> >> 
> >> # smartctl -a /dev/sdb
> >> smartctl 5.41 2011-03-16 r3296
> >> [x86_64-unknown-linux-gnu-2.6.38-2-amd64] (local build)
> >> Copyright (C) 2002-11 by Bruce Allen,
> >> http://smartmontools.sourceforge.net
> >> 
> >> === START OF INFORMATION SECTION ===
> >> Device Model:     ST2000DL003-9VT166
> >> Serial Number:    5YD1YD5P
> >> Firmware Version: CC32
> >> User Capacity:    2,000,398,934,016 bytes
> >> Device is:        Not in smartctl database [for details use: -P showall]
> >> ATA Version is:   8
> >> ATA Standard is:  ATA-8-ACS revision 4
> >> Local Time is:    Sat Apr 30 12:59:32 2011 CEST
> >> SMART support is: Available - device has SMART capability.
> >> SMART support is: Enabled
> >>
> >> === START OF READ SMART DATA SECTION ===
> >> SMART overall-health self-assessment test result: PASSED
> >>
> >> General SMART Values:
> >> Offline data collection status:  (0x82) Offline data collection activity
> >>                                         was completed without error.
> >>                                         Auto Offline Data Collection: Enabled.
> >> Self-test execution status:      (   0) The previous self-test routine
> >>                                         completed without error or no
> >>                                         self-test has ever been run.
> >> Total time to complete Offline
> >> data collection:                (  623) seconds.
> >> Offline data collection
> >> capabilities:                    (0x7b) SMART execute Offline immediate.
> >>                                         Auto Offline data collection
> >>                                         on/off support.
> >>                                         Suspend Offline collection upon
> >>                                         new command.
> >>                                         Offline surface scan supported.
> >>                                         Self-test supported.
> >>                                         Conveyance Self-test supported.
> >>                                         Selective Self-test supported.
> >> SMART capabilities:            (0x0003) Saves SMART data before entering
> >>                                         power-saving mode.
> >>                                         Supports SMART auto save timer.
> >> Error logging capability:        (0x01) Error logging supported.
> >>                                         General Purpose Logging supported.
> >> Short self-test routine
> >> recommended polling time:        (   1) minutes.
> >> Extended self-test routine
> >> recommended polling time:        ( 255) minutes.
> >> Conveyance self-test routine
> >> recommended polling time:        (   2) minutes.
> >> SCT capabilities:              (0x30b7) SCT Status supported.
> >>                                         SCT Feature Control supported.
> >>                                         SCT Data Table supported.
> >> 
> >> SMART Attributes Data Structure revision number: 10
> >> Vendor Specific SMART Attributes with Thresholds:
> >> ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
> >>   1 Raw_Read_Error_Rate     0x000f   096   082   006    Pre-fail  Always       -       2447656
> >>   3 Spin_Up_Time            0x0003   097   092   000    Pre-fail  Always       -       0
> >>   4 Start_Stop_Count        0x0032   100   100   020    Old_age   Always       -       265
> >>   5 Reallocated_Sector_Ct   0x0033   100   100   036    Pre-fail  Always       -       0
> >>   7 Seek_Error_Rate         0x000f   060   060   030    Pre-fail  Always       -       1108085
> >>   9 Power_On_Hours          0x0032   100   100   000    Old_age   Always       -       717
> >>  10 Spin_Retry_Count        0x0013   100   100   097    Pre-fail  Always       -       0
> >>  12 Power_Cycle_Count       0x0032   100   100   020    Old_age   Always       -       74
> >> 183 Runtime_Bad_Block       0x0032   100   100   000    Old_age   Always       -       0
> >> 184 End-to-End_Error        0x0032   100   100   099    Old_age   Always       -       0
> >> 187 Reported_Uncorrect      0x0032   001   001   000    Old_age   Always       -       792
> >> 188 Command_Timeout         0x0032   100   100   000    Old_age   Always       -       0
> >> 189 High_Fly_Writes         0x003a   100   100   000    Old_age   Always       -       0
> >> 190 Airflow_Temperature_Cel 0x0022   065   062   045    Old_age   Always       -       35 (Min/Max 35/35)
> >> 191 G-Sense_Error_Rate      0x0032   100   100   000    Old_age   Always       -       0
> >> 192 Power-Off_Retract_Count 0x0032   100   100   000    Old_age   Always       -       202
> >> 193 Load_Cycle_Count        0x0032   100   100   000    Old_age   Always       -       265
> >> 194 Temperature_Celsius     0x0022   035   040   000    Old_age   Always       -       35 (0 19 0 0)
> >> 195 Hardware_ECC_Recovered  0x001a   015   009   000    Old_age   Always       -       2447656
> >> 197 Current_Pending_Sector  0x0012   096   096   000    Old_age   Always       -       368
> >> 198 Offline_Uncorrectable   0x0010   096   096   000    Old_age   Offline      -       368
> >> 199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age   Always       -       0
> >> 240 Head_Flying_Hours       0x0000   100   253   000    Old_age   Offline      -       9354438771404
> >> 241 Total_LBAs_Written      0x0000   100   253   000    Old_age   Offline      -       793142133
> >> 242 Total_LBAs_Read         0x0000   100   253   000    Old_age   Offline      -       307847059
> >> 
> >> SMART Error Log Version: 1
> >> ATA Error Count: 828 (device log contains only the most recent five errors)
> >>         CR = Command Register [HEX]
> >>         FR = Features Register [HEX]
> >>         SC = Sector Count Register [HEX]
> >>         SN = Sector Number Register [HEX]
> >>         CL = Cylinder Low Register [HEX]
> >>         CH = Cylinder High Register [HEX]
> >>         DH = Device/Head Register [HEX]
> >>         DC = Device Command Register [HEX]
> >>         ER = Error register [HEX]
> >>         ST = Status register [HEX]
> >> Powered_Up_Time is measured from power on, and printed as
> >> DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
> >> SS=sec, and sss=millisec. It "wraps" after 49.710 days.
> >> 
> >> Error 828 occurred at disk power-on lifetime: 705 hours (29 days + 9 hours)
> >>   When the command that caused the error occurred, the device was active or idle.
> >>
> >>   After command completion occurred, registers were:
> >>   ER ST SC SN CL CH DH
> >>   -- -- -- -- -- -- --
> >>   40 51 00 ff ff ff 0f  Error: UNC at LBA = 0x0fffffff = 268435455
> >>
> >>   Commands leading to the command that caused the error were:
> >>   CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
> >>   -- -- -- -- -- -- -- --  ----------------  --------------------
> >>   25 00 08 ff ff ff ef 00      03:44:18.432  READ DMA EXT
> >>   27 00 00 00 00 00 e0 00      03:44:18.431  READ NATIVE MAX ADDRESS EXT
> >>   ec 00 00 00 00 00 a0 00      03:44:18.423  IDENTIFY DEVICE
> >>   ef 03 46 00 00 00 a0 00      03:44:18.391  SET FEATURES [Set transfer mode]
> >>   27 00 00 00 00 00 e0 00      03:44:18.391  READ NATIVE MAX ADDRESS EXT
> >>
> >> Error 827 occurred at disk power-on lifetime: 705 hours (29 days + 9 hours)
> >>   When the command that caused the error occurred, the device was active or idle.
> >>
> >>   After command completion occurred, registers were:
> >>   ER ST SC SN CL CH DH
> >>   -- -- -- -- -- -- --
> >>   40 51 00 ff ff ff 0f  Error: UNC at LBA = 0x0fffffff = 268435455
> >>
> >>   Commands leading to the command that caused the error were:
> >>   CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
> >>   -- -- -- -- -- -- -- --  ----------------  --------------------
> >>   25 00 08 ff ff ff ef 00      03:44:15.060  READ DMA EXT
> >>   27 00 00 00 00 00 e0 00      03:44:15.059  READ NATIVE MAX ADDRESS EXT
> >>   ec 00 00 00 00 00 a0 00      03:44:15.051  IDENTIFY DEVICE
> >>   ef 03 46 00 00 00 a0 00      03:44:15.019  SET FEATURES [Set transfer mode]
> >>   27 00 00 00 00 00 e0 00      03:44:15.019  READ NATIVE MAX ADDRESS EXT
> >>
> >> Error 826 occurred at disk power-on lifetime: 705 hours (29 days + 9 hours)
> >>   When the command that caused the error occurred, the device was active or idle.
> >>
> >>   After command completion occurred, registers were:
> >>   ER ST SC SN CL CH DH
> >>   -- -- -- -- -- -- --
> >>   40 51 00 ff ff ff 0f  Error: UNC at LBA = 0x0fffffff = 268435455
> >>
> >>   Commands leading to the command that caused the error were:
> >>   CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
> >>   -- -- -- -- -- -- -- --  ----------------  --------------------
> >>   25 00 08 ff ff ff ef 00      03:44:11.687  READ DMA EXT
> >>   27 00 00 00 00 00 e0 00      03:44:11.686  READ NATIVE MAX ADDRESS EXT
> >>   ec 00 00 00 00 00 a0 00      03:44:11.662  IDENTIFY DEVICE
> >>   ef 03 46 00 00 00 a0 00      03:44:11.566  SET FEATURES [Set transfer mode]
> >>   27 00 00 00 00 00 e0 00      03:44:11.566  READ NATIVE MAX ADDRESS EXT
> >>
> >> Error 825 occurred at disk power-on lifetime: 705 hours (29 days + 9 hours)
> >>   When the command that caused the error occurred, the device was active or idle.
> >>
> >>   After command completion occurred, registers were:
> >>   ER ST SC SN CL CH DH
> >>   -- -- -- -- -- -- --
> >>   40 51 00 ff ff ff 0f  Error: UNC at LBA = 0x0fffffff = 268435455
> >>
> >>   Commands leading to the command that caused the error were:
> >>   CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
> >>   -- -- -- -- -- -- -- --  ----------------  --------------------
> >>   25 00 08 ff ff ff ef 00      03:44:08.323  READ DMA EXT
> >>   27 00 00 00 00 00 e0 00      03:44:08.322  READ NATIVE MAX ADDRESS EXT
> >>   ec 00 00 00 00 00 a0 00      03:44:08.314  IDENTIFY DEVICE
> >>   ef 03 46 00 00 00 a0 00      03:44:08.282  SET FEATURES [Set transfer mode]
> >>   27 00 00 00 00 00 e0 00      03:44:08.282  READ NATIVE MAX ADDRESS EXT
> >>
> >> Error 824 occurred at disk power-on lifetime: 705 hours (29 days + 9 hours)
> >>   When the command that caused the error occurred, the device was active or idle.
> >>
> >>   After command completion occurred, registers were:
> >>   ER ST SC SN CL CH DH
> >>   -- -- -- -- -- -- --
> >>   40 51 00 ff ff ff 0f  Error: UNC at LBA = 0x0fffffff = 268435455
> >>
> >>   Commands leading to the command that caused the error were:
> >>   CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
> >>   -- -- -- -- -- -- -- --  ----------------  --------------------
> >>   25 00 08 ff ff ff ef 00      03:44:04.950  READ DMA EXT
> >>   27 00 00 00 00 00 e0 00      03:44:04.949  READ NATIVE MAX ADDRESS EXT
> >>   ec 00 00 00 00 00 a0 00      03:44:04.941  IDENTIFY DEVICE
> >>   ef 03 46 00 00 00 a0 00      03:44:04.910  SET FEATURES [Set transfer mode]
> >>   27 00 00 00 00 00 e0 00      03:44:04.909  READ NATIVE MAX ADDRESS EXT
> >> 
> >> SMART Self-test log structure revision number 1
> >> No self-tests have been logged.  [To run self-tests, use: smartctl -t]
> >>
> >>
> >> SMART Selective self-test log data structure revision number 1
> >>  SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
> >>     1        0        0  Not_testing
> >>     2        0        0  Not_testing
> >>     3        0        0  Not_testing
> >>     4        0        0  Not_testing
> >>     5        0        0  Not_testing
> >> Selective self-test flags (0x0):
> >>   After scanning selected spans, do NOT read-scan remainder of disk.
> >> If Selective self-test is pending on power-up, resume after 0 minute delay.
> > 
> > I might run the selftest (long) to get a better idea, but if there is
> > not a firmware update for this drive on the Seagate site, I believe
> > I'd be asking for an RA forthwith.
> > 
> > Be aware that I just updated 2 identical 1 terabyte Seagate drives
> > about 3 weeks ago, and the firmware update, while it did not scramble
> > the partition table data, did scramble the partition labels AND the
> > blkid's of the boot drive only.  The second drive is no longer
> > hanging the system with bus resets, but it still has a write speed of
> > about 3.5 megs/second.  I had to re-install.  Fortunately I had data
> > backups from the night before, courtesy of amanda.
> > 
> > --
> > Cheers, Gene
> > "There are four boxes to be used in defense of liberty:
> >  soap, ballot, jury, and ammo. Please use in that order."
> > -Ed Howdershelt (Author)
> > <http://tinyurl.com/ddg5bz>
> > <http://www.cantrip.org/gatto.html>
> > Row, row, row your bits, gently down the stream...
> 
> Hi, thanks for the reply.
> 
> Here is my reply, below :)
> 
> This is the output of smartctl -t long /dev/sdb and smartctl -l
> selftest /dev/sdb. I hope that was the correct procedure.
> 
> # smartctl -t long /dev/sdb
> smartctl 5.41 2011-03-16 r3296
> [x86_64-unknown-linux-gnu-2.6.38-2-amd64] (local build)
> Copyright (C) 2002-11 by Bruce Allen,
> http://smartmontools.sourceforge.net
> 
> === START OF OFFLINE IMMEDIATE AND SELF-TEST SECTION ===
> Sending command: "Execute SMART Extended self-test routine immediately
> in off-line mode".
> Drive command "Execute SMART Extended self-test routine immediately in
> off-line mode" successful.
> Testing has begun.
> Please wait 255 minutes for test to complete.
> Test will complete after Sat Apr 30 23:06:43 2011
> 
> Use smartctl -X to abort test.
> 
> # smartctl -l selftest /dev/sdb
> smartctl 5.41 2011-03-16 r3296
> [x86_64-unknown-linux-gnu-2.6.38-2-amd64] (local build)
> Copyright (C) 2002-11 by Bruce Allen,
> http://smartmontools.sourceforge.net
> 
> === START OF READ SMART DATA SECTION ===
> SMART Self-test log structure revision number 1
> Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
> # 1  Extended offline    Completed: read failure       90%       723         469696848
> 
> 
> 
> Thanks for any interpretation of this.

Get the RA, a 2TB drive should still be well in warranty.
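
For anyone who wants to put numbers on that self-test result, here is a
minimal sketch. It is not part of smartctl: the helper names lba_to_bytes
and worrying_attributes are made up for illustration, the choice of which
attributes to flag is a common rule of thumb rather than anything the
vendor documents, and the 512-byte logical sector size is an assumption
that should be checked against what the drive itself reports. The raw
values are copied from the smartctl output quoted above.

```python
# Hypothetical helpers for reading the SMART report quoted in this thread.
# Assumption: the drive exposes 512-byte logical sectors.
LOGICAL_SECTOR_BYTES = 512

# Rule-of-thumb list (an assumption, not a vendor specification):
# a non-zero raw value here usually means sectors could not be read.
MEDIA_ERROR_ATTRIBUTES = {
    "Reallocated_Sector_Ct",
    "Reported_Uncorrect",
    "Current_Pending_Sector",
    "Offline_Uncorrectable",
}


def lba_to_bytes(lba, sector_bytes=LOGICAL_SECTOR_BYTES):
    """Convert a logical block address to a byte offset from the disk start."""
    return lba * sector_bytes


def worrying_attributes(raw_values):
    """Return the media-error attributes whose raw value is non-zero."""
    return {
        name: raw
        for name, raw in raw_values.items()
        if name in MEDIA_ERROR_ATTRIBUTES and raw > 0
    }


# Raw values copied from the quoted "smartctl -a /dev/sdb" output.
raw_values = {
    "Reallocated_Sector_Ct": 0,
    "Reported_Uncorrect": 792,
    "Current_Pending_Sector": 368,
    "Offline_Uncorrectable": 368,
    "UDMA_CRC_Error_Count": 0,
}

# LBA_of_first_error from the quoted "smartctl -l selftest" output.
first_error_lba = 469696848

if __name__ == "__main__":
    offset = lba_to_bytes(first_error_lba)
    print("first failing LBA is at byte offset", offset)
    for name, raw in sorted(worrying_attributes(raw_values).items()):
        print(name, "raw value", raw)
```

Pending and uncorrectable sectors at roughly 700 power-on hours, together
with a long self-test that stops on a read failure, are consistent with
asking for a replacement rather than waiting for a firmware fix.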

-- 
Cheers, Gene
"There are four boxes to be used in defense of liberty:
 soap, ballot, jury, and ammo. Please use in that order."
-Ed Howdershelt (Author)
<http://tinyurl.com/ddg5bz>
<http://www.cantrip.org/gatto.html>
<Wordplay> You measure your vibrators in "characters per second"?  I have
	   bad news for you, c90, you've been masturbating with a
	   dot-matrix printer.

* Re: Seagate hard disk firmware issue
  2011-05-01 17:18           ` gene heskett
@ 2011-06-06 10:09             ` BU66ER BAD6ER
  2011-06-06 10:36               ` gene heskett
  0 siblings, 1 reply; 10+ messages in thread
From: BU66ER BAD6ER @ 2011-06-06 10:09 UTC (permalink / raw)
  To: gene heskett; +Cc: linux-ide

I replaced that one, too, and got a Western Digital "Green" 2TB.

No funny noises, and it works excellently so far!

Thanks for your input!

On Sun, May 1, 2011 at 7:18 PM, gene heskett <gheskett@wdtv.com> wrote:
> [...]
>
> Get the RA, a 2TB drive should still be well in warranty.

* Re: Seagate hard disk firmware issue
  2011-06-06 10:09             ` BU66ER BAD6ER
@ 2011-06-06 10:36               ` gene heskett
  0 siblings, 0 replies; 10+ messages in thread
From: gene heskett @ 2011-06-06 10:36 UTC (permalink / raw)
  To: BU66ER BAD6ER; +Cc: linux-ide

On Monday, June 06, 2011 06:34:05 AM BU66ER BAD6ER did opine:

> I replaced that one, too, and got a Western Digital "Green" 2TB.
> 
> No funny noises, and it works excellently so far!
> 
> Thanks for your input!
> 
I appreciate being told that my advice was good, but please clip your 
replies.

[...]

Cheers, gene
-- 
"There are four boxes to be used in defense of liberty:
 soap, ballot, jury, and ammo. Please use in that order."
-Ed Howdershelt (Author)
Maybe Computer Science should be in the College of Theology.
		-- R. S. Barton

Thread overview: 10+ messages
2011-01-23 11:05 Seagate hard disk firmware issue BU66ER BAD6ER
2011-01-23 13:42 ` Alan Cox
2011-01-25  1:21 ` Robert Hancock
2011-01-25  6:06   ` BU66ER BAD6ER
2011-04-30 11:32     ` BU66ER BAD6ER
2011-04-30 12:08       ` gene heskett
2011-04-30 22:39         ` BU66ER BAD6ER
2011-05-01 17:18           ` gene heskett
2011-06-06 10:09             ` BU66ER BAD6ER
2011-06-06 10:36               ` gene heskett
