linux-kernel.vger.kernel.org archive mirror
* SATA disk dead? ATA: abnormal status 0x59 on port 0xE407
@ 2005-01-15 20:25 Erik Steffl
  2005-01-16  0:58 ` Alan Cox
  0 siblings, 1 reply; 9+ messages in thread
From: Erik Steffl @ 2005-01-15 20:25 UTC (permalink / raw)
  To: linux-kernel

   I got these errors when accessing a SATA disk (via SCSI):

Jan 15 11:56:50 jojda kernel: ata2: command 0x25 timeout, stat 0x59 host_stat 0x21
Jan 15 11:56:50 jojda kernel: ata2: status=0x59 { DriveReady SeekComplete DataRequest Error }
Jan 15 11:56:50 jojda kernel: ata2: error=0x40 { UncorrectableError }
Jan 15 11:56:50 jojda kernel: scsi1: ERROR on channel 0, id 0, lun 0, CDB: Read (10) 00 00 00 01 26 00 00 29 00
Jan 15 11:56:50 jojda kernel: Current sda: sense key Medium Error
Jan 15 11:56:50 jojda kernel: Additional sense: Unrecovered read error - auto reallocate failed
Jan 15 11:56:50 jojda kernel: end_request: I/O error, dev sda, sector 294
Jan 15 11:56:50 jojda kernel: Buffer I/O error on device sda1, logical block 57
Jan 15 11:56:50 jojda kernel: ATA: abnormal status 0x59 on port 0xE407
Jan 15 11:56:50 jojda last message repeated 2 times

   When the disk was mounted I got these errors only when accessing 
certain directories, but now any disk access generates them, and 
processes that touch the disk end up in disk-wait state (I tried fsck, 
mount and dd_rescue). Some of them seem to get out of it after a very 
long time (1h+).

   I have another SATA drive (pretty much the same, both are Maxtor 
DiamondMax 9, 250GB) and that one works when I connect it to the same 
SATA and power cables, so I think the problem is with the disk (not my 
setup or cables etc.).

   Since I didn't see any read errors before, I think it might be the 
electronics that are dead, not the disk itself. Considering that I have 
another disk of the same model, is it possible to swap the electronics 
between the disks? (Right now I can't try it because I don't have the 
funny screwdriver that fits the screws on the disk.)

   My system: kernel 2.6.9, Debian unstable, SATA disks seen as SCSI 
disks (CONFIG_SCSI_SATA=y).

   Is there anything I can do to rescue (some of) the data on the disk?

   TIA,

	erik


* Re: SATA disk dead? ATA: abnormal status 0x59 on port 0xE407
  2005-01-15 20:25 SATA disk dead? ATA: abnormal status 0x59 on port 0xE407 Erik Steffl
@ 2005-01-16  0:58 ` Alan Cox
  2005-01-16  2:33   ` Erik Steffl
  0 siblings, 1 reply; 9+ messages in thread
From: Alan Cox @ 2005-01-16  0:58 UTC (permalink / raw)
  To: Erik Steffl; +Cc: Linux Kernel Mailing List

On Sad, 2005-01-15 at 20:25, Erik Steffl wrote:
>    I got these errors when accessing SATA disk (via scsi):
> 
> Jan 15 11:56:50 jojda kernel: ata2: command 0x25 timeout, stat 0x59 
> host_stat 0x21
> Jan 15 11:56:50 jojda kernel: ata2: status=0x59 { DriveReady 
> SeekComplete DataRequest Error }
> Jan 15 11:56:50 jojda kernel: ata2: error=0x40 { UncorrectableError }

Bad sector - the disk has lost the data on some blocks. That's a physical
disk failure.
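
For anyone reading the log by hand, here is a minimal sketch of how the
status and error bytes break down into the flags the kernel printed. Only
the names that appear in the log above (DriveReady, SeekComplete,
DataRequest, Error, UncorrectableError) come from this thread; the other
bit names and the command names are the usual ATA mnemonics, filled in as
an assumption for completeness.

```python
# Bit tables for the ATA status and error registers, highest bit first.
# Names not seen in the log above are conventional ATA labels (assumption).
STATUS_BITS = {
    0x80: "Busy",
    0x40: "DriveReady",
    0x20: "DeviceFault",
    0x10: "SeekComplete",
    0x08: "DataRequest",
    0x04: "CorrectedError",
    0x02: "Index",
    0x01: "Error",
}

ERROR_BITS = {
    0x80: "BadCRC",
    0x40: "UncorrectableError",
    0x20: "MediaChanged",
    0x10: "SectorIdNotFound",
    0x08: "MediaChangeRequested",
    0x04: "DriveStatusError",  # command aborted
    0x02: "TrackZeroNotFound",
    0x01: "AddrMarkNotFound",
}

# ATA opcodes for the two commands that time out in this thread.
COMMANDS = {
    0x25: "READ DMA EXT",
    0x35: "WRITE DMA EXT",
}


def decode(value, table):
    """Return the names of all bits in `table` that are set in `value`."""
    return [name for bit, name in table.items() if value & bit]


if __name__ == "__main__":
    print("status 0x59:", decode(0x59, STATUS_BITS))
    print("error  0x40:", decode(0x40, ERROR_BITS))
    print("command 0x25:", COMMANDS[0x25])
```

The Error bit in the status byte says the command failed, and the error
byte then gives the reason, which here is an uncorrectable data error.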



* Re: SATA disk dead? ATA: abnormal status 0x59 on port 0xE407
  2005-01-16  0:58 ` Alan Cox
@ 2005-01-16  2:33   ` Erik Steffl
  2005-01-17  4:30     ` Bill Davidsen
  0 siblings, 1 reply; 9+ messages in thread
From: Erik Steffl @ 2005-01-16  2:33 UTC (permalink / raw)
  To: Linux Kernel Mailing List

Alan Cox wrote:
> On Sad, 2005-01-15 at 20:25, Erik Steffl wrote:
> 
>>   I got these errors when accessing SATA disk (via scsi):
>>
>>Jan 15 11:56:50 jojda kernel: ata2: command 0x25 timeout, stat 0x59 
>>host_stat 0x21
>>Jan 15 11:56:50 jojda kernel: ata2: status=0x59 { DriveReady 
>>SeekComplete DataRequest Error }
>>Jan 15 11:56:50 jojda kernel: ata2: error=0x40 { UncorrectableError }
> 
> 
> Bad sector - the disk has lost the data on some blocks. Thats a physical
> disk failure.

   What's somewhat weird is that the disk _seemed_ OK (i.e. no errors 
that I noticed, nothing in the syslog) and then suddenly it does not 
respond at all. I tried dd_rescue; it ran for hours (more than a day) 
and rescued absolutely nothing. Is it possible that the disk surface is 
OK but the electronics went bad? Is there anything that can be done if 
that's the case? (I have another disk, same model.)
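
For reference, this is roughly the idea behind a rescue copy, as a small
sketch and not dd_rescue's actual implementation: read the source in
chunks, and when a chunk fails with an I/O error, zero-fill it in the 
output, record the range and carry on. The names rescue_copy and 
FlakySource are made up for illustration; FlakySource only simulates a 
failing disk in memory.

```python
import io


def rescue_copy(read_at, size, out, chunk=64 * 1024):
    """Copy `size` bytes using read_at(offset, length) into file object `out`.

    Chunks that raise OSError are written as zeros and reported in the
    returned list of (offset, length) ranges.
    """
    bad = []
    offset = 0
    while offset < size:
        length = min(chunk, size - offset)
        try:
            data = read_at(offset, length)
        except OSError:
            data = b"\0" * length
            bad.append((offset, length))
        out.seek(offset)
        out.write(data)
        offset += length
    return bad


class FlakySource:
    """Hypothetical stand-in for a failing disk: reads touching a bad byte fail."""

    def __init__(self, data, bad_offsets):
        self.data = data
        self.bad = set(bad_offsets)

    def read_at(self, offset, length):
        if any(offset <= b < offset + length for b in self.bad):
            raise OSError(5, "Input/output error")
        return self.data[offset:offset + length]


if __name__ == "__main__":
    src = FlakySource(bytes(range(16)), bad_offsets={5})
    image = io.BytesIO()
    print("unreadable ranges:", rescue_copy(src.read_at, 16, image, chunk=4))
```

Against a real device, read_at could be something like 
lambda off, n: os.pread(fd, n, off). When every single chunk fails, as 
seems to be the case here, the result is an image full of zeros, which 
matches "rescued absolutely nothing".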

	erik


* Re: SATA disk dead? ATA: abnormal status 0x59 on port 0xE407
  2005-01-16  2:33   ` Erik Steffl
@ 2005-01-17  4:30     ` Bill Davidsen
  2005-01-17  6:44       ` Erik Steffl
  0 siblings, 1 reply; 9+ messages in thread
From: Bill Davidsen @ 2005-01-17  4:30 UTC (permalink / raw)
  To: Erik Steffl; +Cc: Linux Kernel Mailing List

Erik Steffl wrote:
> Alan Cox wrote:
> 
>> On Sad, 2005-01-15 at 20:25, Erik Steffl wrote:
>>
>>>   I got these errors when accessing SATA disk (via scsi):
>>>
>>> Jan 15 11:56:50 jojda kernel: ata2: command 0x25 timeout, stat 0x59 
>>> host_stat 0x21
>>> Jan 15 11:56:50 jojda kernel: ata2: status=0x59 { DriveReady 
>>> SeekComplete DataRequest Error }
>>> Jan 15 11:56:50 jojda kernel: ata2: error=0x40 { UncorrectableError }
>>
>>
>>
>> Bad sector - the disk has lost the data on some blocks. Thats a physical
>> disk failure.
> 
> 
>   what's somewhat weird is that the disk _seemed_ OK (i.e. no errors 
> that I would notice, nothing in the syslog) and then suddenly the disk 
> does not respond at all, I tried dd_rescue and it ran for hours (more 
> than a day) and it rescued absolutely nothing. Is it possible that the 
> disk surface is OK but the electronics went bad? Is there anything that 
> can be done if that's the case? (I have another disk, same model).

You probably void your warranty on both drives if you swap the control 
board, and it may require special tools you don't have, though I have 
done it in the past. Can you get to the point where it fails and cool it 
with a shot of freon (or whatever is politically correct these days)? It 
may be thermal, in which case you run it until you have backed it up, 
then return it under warranty.

-- 
bill davidsen <davidsen@tmr.com>
   CTO TMR Associates, Inc
   Doing interesting things with small computers since 1979


* Re: SATA disk dead? ATA: abnormal status 0x59 on port 0xE407
  2005-01-17  4:30     ` Bill Davidsen
@ 2005-01-17  6:44       ` Erik Steffl
  2005-01-17  9:14         ` Mark Watts
  0 siblings, 1 reply; 9+ messages in thread
From: Erik Steffl @ 2005-01-17  6:44 UTC (permalink / raw)
  To: Linux Kernel Mailing List

Bill Davidsen wrote:
> Erik Steffl wrote:
> 
>> Alan Cox wrote:
>>
>>> On Sad, 2005-01-15 at 20:25, Erik Steffl wrote:
>>>
>>>>   I got these errors when accessing SATA disk (via scsi):
>>>>
>>>> Jan 15 11:56:50 jojda kernel: ata2: command 0x25 timeout, stat 0x59 
>>>> host_stat 0x21
>>>> Jan 15 11:56:50 jojda kernel: ata2: status=0x59 { DriveReady 
>>>> SeekComplete DataRequest Error }
>>>> Jan 15 11:56:50 jojda kernel: ata2: error=0x40 { UncorrectableError }
>>>
>>>
>>>
>>>
>>> Bad sector - the disk has lost the data on some blocks. Thats a physical
>>> disk failure.
>>
>>
>>
>>   what's somewhat weird is that the disk _seemed_ OK (i.e. no errors 
>> that I would notice, nothing in the syslog) and then suddenly the disk 
>> does not respond at all, I tried dd_rescue and it ran for hours (more 
>> than a day) and it rescued absolutely nothing. Is it possible that the 
>> disk surface is OK but the electronics went bad? Is there anything 
>> that can be done if that's the case? (I have another disk, same model).
> 
> 
> You probably void your waranty on both drives if you swap the control 
> board, it may require special tools you don't have, and I have done it 
> in the past. Can you get to the point where it fails and cool it with a 
> shot of freon (or whatever is politically correct these days)? May be 
> thermal, in which case you run it until you back it up, then waranty it.

   It does not respond at all (right after I boot up the computer), so 
it doesn't seem to be heat related. It is completely unreadable: I ran 
dd_rescue on it for a long time and it didn't read anything at all. It 
requires a star-shaped screwdriver; are those available somewhere?

	erik


* Re: SATA disk dead? ATA: abnormal status 0x59 on port 0xE407
  2005-01-17  6:44       ` Erik Steffl
@ 2005-01-17  9:14         ` Mark Watts
  2005-01-18  1:01           ` Eric Mudama
  0 siblings, 1 reply; 9+ messages in thread
From: Mark Watts @ 2005-01-17  9:14 UTC (permalink / raw)
  To: linux-kernel; +Cc: Erik Steffl

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1


> Bill Davidsen wrote:
> > Erik Steffl wrote:
> >> Alan Cox wrote:
> >>> On Sad, 2005-01-15 at 20:25, Erik Steffl wrote:
> >>>>   I got these errors when accessing SATA disk (via scsi):
> >>>>
> >>>> Jan 15 11:56:50 jojda kernel: ata2: command 0x25 timeout, stat 0x59
> >>>> host_stat 0x21
> >>>> Jan 15 11:56:50 jojda kernel: ata2: status=0x59 { DriveReady
> >>>> SeekComplete DataRequest Error }
> >>>> Jan 15 11:56:50 jojda kernel: ata2: error=0x40 { UncorrectableError }
> >>>
> >>> Bad sector - the disk has lost the data on some blocks. Thats a
> >>> physical disk failure.
> >>
> >>   what's somewhat weird is that the disk _seemed_ OK (i.e. no errors
> >> that I would notice, nothing in the syslog) and then suddenly the disk
> >> does not respond at all, I tried dd_rescue and it ran for hours (more
> >> than a day) and it rescued absolutely nothing. Is it possible that the
> >> disk surface is OK but the electronics went bad? Is there anything
> >> that can be done if that's the case? (I have another disk, same model).
> >
> > You probably void your waranty on both drives if you swap the control
> > board, it may require special tools you don't have, and I have done it
> > in the past. Can you get to the point where it fails and cool it with a
> > shot of freon (or whatever is politically correct these days)? May be
> > thermal, in which case you run it until you back it up, then waranty it.
>
>    it does not respond at all (right after I boot up the computer),
> doesn't seem to be heat related. It is completely unreadable, I ran
> rr_rescue on it for a long time, it didn't read absolutely anything. It
> requires a star-shaped screwdriver, are those available somewhere?

Those are Torx drivers. You may need the 'security' version if the screws have 
a pin in the middle (utterly pointless since both types of driver are 
publicly available).

Mark.

- -- 
Mark Watts
Senior Systems Engineer
QinetiQ Trusted Information Management
Trusted Solutions and Services group
GPG Public Key ID: 455420ED

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.4 (GNU/Linux)

iD8DBQFB64IGBn4EFUVUIO0RAugkAJ4kmCDOsILhZLISR75ml2gch528AQCbB56r
UJWFiujxQxI95TZEhIOKoWc=
=7AkY
-----END PGP SIGNATURE-----


* Re: SATA disk dead? ATA: abnormal status 0x59 on port 0xE407
  2005-01-17  9:14         ` Mark Watts
@ 2005-01-18  1:01           ` Eric Mudama
  2005-01-18  6:26             ` Erik Steffl
  0 siblings, 1 reply; 9+ messages in thread
From: Eric Mudama @ 2005-01-18  1:01 UTC (permalink / raw)
  To: Mark Watts; +Cc: linux-kernel, Erik Steffl

We don't use security Torx screws; we use normal ones on our boards.

I wouldn't recommend swapping boards, since the code stored on the
physical media, the opti tables, and the ASIC on the board were all
processed together at one point and are specific to each other.  The
new board may not work properly with the heads in the other drive, and
could even cause damage if the two drives sat several sigma out on
opposite sides of the spectrum of passing drives, or had a
different head vendor, etc.

If the data already appears lost and you've run out of other options,
it may prove useful to attempt writing to the entire device without
attempting reads.  If the drive then reads normally after that, the
damage was probably incurred in some transient fashion (excessive
vibration or heat, etc) and the replacement data may eliminate the
failures.
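
As a rough sketch of the "write everything, read nothing" idea (this
destroys whatever is on the target, so it only makes sense once the data
is already written off): open the device write-only and push zeros
through it in large aligned chunks, noting any chunk whose write fails.
The function name and the chunk size are arbitrary choices for
illustration, and whether the kernel really avoids all reads on this
path is an assumption, not something verified here.

```python
import os


def overwrite_with_zeros(path, total_bytes, chunk=1024 * 1024):
    """Write zeros over the first `total_bytes` of `path` without reading it.

    Returns a list of (offset, length) chunks whose write raised OSError.
    WARNING: destroys the existing contents of `path`.
    """
    failed = []
    zeros = bytes(chunk)
    fd = os.open(path, os.O_WRONLY)
    try:
        offset = 0
        while offset < total_bytes:
            length = min(chunk, total_bytes - offset)
            try:
                os.lseek(fd, offset, os.SEEK_SET)
                written = os.write(fd, zeros[:length])
            except OSError:
                # Record the failing chunk and skip past it.
                failed.append((offset, length))
                written = length
            if written == 0:
                break  # nothing more can be written (e.g. end of device)
            offset += written
        # Buffered writes may only report errors here; this can raise OSError.
        os.fsync(fd)
    finally:
        os.close(fd)
    return failed
```

Usage would be along the lines of overwrite_with_zeros("/dev/sdX", size)
with the real device name and its size in bytes filled in by hand; a
following read pass then shows whether the drive reads normally again.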

Either way, however, I would probably recommend just RMA'ing the
drives.  We should be able to get you a replacement in a few days from
the time you fill out the form.

--eric


On Mon, 17 Jan 2005 09:14:46 +0000, Mark Watts <m.watts@eris.qinetiq.com> wrote:
> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
> 
> 
> > Bill Davidsen wrote:
> > > Erik Steffl wrote:
> > >> Alan Cox wrote:
> > >>> On Sad, 2005-01-15 at 20:25, Erik Steffl wrote:
> > >>>>   I got these errors when accessing SATA disk (via scsi):
> > >>>>
> > >>>> Jan 15 11:56:50 jojda kernel: ata2: command 0x25 timeout, stat 0x59
> > >>>> host_stat 0x21
> > >>>> Jan 15 11:56:50 jojda kernel: ata2: status=0x59 { DriveReady
> > >>>> SeekComplete DataRequest Error }
> > >>>> Jan 15 11:56:50 jojda kernel: ata2: error=0x40 { UncorrectableError }
> > >>>
> > >>> Bad sector - the disk has lost the data on some blocks. Thats a
> > >>> physical disk failure.
> > >>
> > >>   what's somewhat weird is that the disk _seemed_ OK (i.e. no errors
> > >> that I would notice, nothing in the syslog) and then suddenly the disk
> > >> does not respond at all, I tried dd_rescue and it ran for hours (more
> > >> than a day) and it rescued absolutely nothing. Is it possible that the
> > >> disk surface is OK but the electronics went bad? Is there anything
> > >> that can be done if that's the case? (I have another disk, same model).
> > >
> > > You probably void your waranty on both drives if you swap the control
> > > board, it may require special tools you don't have, and I have done it
> > > in the past. Can you get to the point where it fails and cool it with a
> > > shot of freon (or whatever is politically correct these days)? May be
> > > thermal, in which case you run it until you back it up, then waranty it.
> >
> >    it does not respond at all (right after I boot up the computer),
> > doesn't seem to be heat related. It is completely unreadable, I ran
> > rr_rescue on it for a long time, it didn't read absolutely anything. It
> > requires a star-shaped screwdriver, are those available somewhere?
> 
> Those are Torx drivers. You may need the 'security' version if the screws have
> a pin in the middle (utterly pointless since both types of driver are
> publicly available).
> 
> Mark.
> 
> - --
> Mark Watts
> Senior Systems Engineer
> QinetiQ Trusted Information Management
> Trusted Solutions and Services group
> GPG Public Key ID: 455420ED
> 
> -----BEGIN PGP SIGNATURE-----
> Version: GnuPG v1.2.4 (GNU/Linux)
> 
> iD8DBQFB64IGBn4EFUVUIO0RAugkAJ4kmCDOsILhZLISR75ml2gch528AQCbB56r
> UJWFiujxQxI95TZEhIOKoWc=
> =7AkY
> -----END PGP SIGNATURE-----


* Re: SATA disk dead? ATA: abnormal status 0x59 on port 0xE407
  2005-01-18  1:01           ` Eric Mudama
@ 2005-01-18  6:26             ` Erik Steffl
  2005-01-19  0:17               ` James Colannino
  0 siblings, 1 reply; 9+ messages in thread
From: Erik Steffl @ 2005-01-18  6:26 UTC (permalink / raw)
  To: Eric Mudama; +Cc: linux-kernel

Eric Mudama wrote:
> we don't use security torx screws, we use normal ones on our boards.
> 
> I wouldn't recommend swapping boards, since the code stored on the
> physical media, the opti tables, and the asic on the board were all
> processed together at one point and are specific to each other.  The
> new board may not work properly with the heads in the other drive, and
> could even cause damage, if both drives were several sigma to opposite
> sides of each other in the spectrum of passing drives, or had a
> different head vendor, etc.
> 
> If the data already appears lost and you've run out of other options,
> it may prove useful to attempt writing to the entire device without
> attempting reads.  If the drive then reads normally after that, the
> damage was probably incurred in some transient fashion (excessive
> vibration or heat, etc) and the replacement data may eliminate the
> failures.
> 
> Either way, however, I would probably recommend just RMA'ing the
> drives.  We should be able to get you a replacement in a few days from
> the time you fill out the form.

   It's a DiamondMax 9 (manufactured June 13, 2003); those had only a 
one-year warranty, so unfortunately I can't return it (just checked on 
maxtor.com).

   I'm trying to write to it (cat /dev/hdb6 > /dev/sda) but getting 
exactly the same messages (ATA: abnormal status 0x59 on port 0xE407). It 
looks like the drive does not respond to anything at all (I tried 
turning the computer off completely, even disconnecting it while 
powered off).

Here's the full set of messages (the same set repeats every 30s or so):

Jan 17 22:22:48 jojda kernel: ata2: command 0x35 timeout, stat 0x59 host_stat 0x21
Jan 17 22:22:48 jojda kernel: ata2: status=0x59 { DriveReady SeekComplete DataRequest Error }
Jan 17 22:22:48 jojda kernel: ata2: error=0x40 { UncorrectableError }
Jan 17 22:22:48 jojda kernel: scsi1: ERROR on channel 0, id 0, lun 0, CDB: Write (10) 00 00 00 00 15 00 03 eb 00
Jan 17 22:22:48 jojda kernel: Current sda: sense key Medium Error
Jan 17 22:22:48 jojda kernel: Additional sense: Unrecovered read error - auto reallocate failed
Jan 17 22:22:48 jojda kernel: end_request: I/O error, dev sda, sector 21
Jan 17 22:22:48 jojda kernel: ATA: abnormal status 0x59 on port 0xE407
Jan 17 22:22:48 jojda last message repeated 2 times
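
In case it helps to cross-check the numbers, here is a small sketch that
pulls the starting block and length out of the bytes printed after
"Read (10)" / "Write (10)". It assumes the standard 10-byte READ/WRITE
layout (flags, 4-byte big-endian LBA, group, 2-byte big-endian transfer
length, control) and that the kernel prints the nine bytes following the
opcode; the function name is made up.

```python
def parse_rw10_tail(hex_bytes):
    """Parse the nine bytes logged after 'Read (10)' or 'Write (10)'.

    Returns (lba, blocks): the starting logical block address and the
    number of blocks requested.
    """
    tail = bytes.fromhex(hex_bytes)
    if len(tail) != 9:
        raise ValueError("expected 9 bytes after the opcode")
    lba = int.from_bytes(tail[1:5], "big")     # bytes 2-5 of the CDB
    blocks = int.from_bytes(tail[6:8], "big")  # bytes 7-8 of the CDB
    return lba, blocks


if __name__ == "__main__":
    # CDB tails copied from the two sets of messages in this thread.
    print("read :", parse_rw10_tail("00 00 00 01 26 00 00 29 00"))
    print("write:", parse_rw10_tail("00 00 00 00 15 00 03 eb 00"))
```

The decoded starting blocks, 294 for the read and 21 for the write, line
up with the "sector 294" and "sector 21" in the end_request lines, so the
very first sector of each request is the one being reported as failed.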

	erik


* Re: SATA disk dead? ATA: abnormal status 0x59 on port 0xE407
  2005-01-18  6:26             ` Erik Steffl
@ 2005-01-19  0:17               ` James Colannino
  0 siblings, 0 replies; 9+ messages in thread
From: James Colannino @ 2005-01-19  0:17 UTC (permalink / raw)
  To: linux-kernel

Erik Steffl wrote:

> Eric Mudama wrote:
>
>> we don't use security torx screws, we use normal ones on our boards.
>>
>> I wouldn't recommend swapping boards, since the code stored on the
>> physical media, the opti tables, and the asic on the board were all
>> processed together at one point and are specific to each other.  The
>> new board may not work properly with the heads in the other drive, and
>> could even cause damage, if both drives were several sigma to opposite
>> sides of each other in the spectrum of passing drives, or had a
>> different head vendor, etc.
>>
>> If the data already appears lost and you've run out of other options,
>> it may prove useful to attempt writing to the entire device without
>> attempting reads.  If the drive then reads normally after that, the
>> damage was probably incurred in some transient fashion (excessive
>> vibration or heat, etc) and the replacement data may eliminate the
>> failures.
>>
>> Either way, however, I would probably recommend just RMA'ing the
>> drives.  We should be able to get you a replacement in a few days from
>> the time you fill out the form.
>
>
>   it's DiamondMax 9 (manufactured june 13 2003), those had only one 
> year warranty so unfortunately I can't return it (just checked it on 
> maxtor.com).
>

Sometimes, if you get a nice person from Maxtor on the phone, you can 
get it RMA'd anyway.  You just have to talk to the right person.  If you 
don't get someone willing to help out, try calling back until you get 
someone else.  I was able to return a drive that was 3 months out of 
warranty.  Yours is a bit more out of date, but you might as well give 
it a shot ;)

James



end of thread, other threads:[~2005-01-19  0:19 UTC | newest]

Thread overview: 9+ messages
2005-01-15 20:25 SATA disk dead? ATA: abnormal status 0x59 on port 0xE407 Erik Steffl
2005-01-16  0:58 ` Alan Cox
2005-01-16  2:33   ` Erik Steffl
2005-01-17  4:30     ` Bill Davidsen
2005-01-17  6:44       ` Erik Steffl
2005-01-17  9:14         ` Mark Watts
2005-01-18  1:01           ` Eric Mudama
2005-01-18  6:26             ` Erik Steffl
2005-01-19  0:17               ` James Colannino
