All of lore.kernel.org
 help / color / mirror / Atom feed
* RecovData Handshk error
@ 2017-07-30  1:54 Alex
  2017-08-05 19:43 ` Alex
  0 siblings, 1 reply; 4+ messages in thread
From: Alex @ 2017-07-30  1:54 UTC (permalink / raw)
  To: Linux RAID

Hi,

I have a fedora25 system with a MegaRAID SAS 9260-8i. I believe
there's 8 500GB SSDs connected. It appears one (or more?) may be
having a problem. I'm now noticing this message periodically in the
kernel logs:

[Wed Jun 28 01:40:06 2017] ata1.00: exception Emask 0x0 SAct 0x0 SErr
0x400001 action 0x6 frozen
[Wed Jun 28 01:40:06 2017] ata1: SError: { RecovData Handshk }
[Wed Jun 28 01:40:06 2017] ata1.00: cmd
a0/00:00:00:08:00/00:00:00:00:00/a0 tag 29 pio 16392 in
                                    Get event status notification 4a
01 00 00 10 00 00 00 08 00res 40/00:00:00:00:00/00:00:00:00:00/00
Emask 0x4 (timeout)
[Wed Jun 28 01:40:06 2017] ata1.00: status: { DRDY }
[Wed Jun 28 01:40:06 2017] ata1: hard resetting link
[Wed Jun 28 01:40:06 2017] ata1: SATA link up 1.5 Gbps (SStatus 113
SControl 300)
[Wed Jun 28 01:40:06 2017] ata1.00: configured for UDMA/133
[Wed Jun 28 01:40:06 2017] ata1: EH complete

Is it something to be concerned with? What level of panic? :-)

It appears to recover normally? Is it a device or disk issue? How can
I identify which disk is involved?

This is the information on the control from lspci:

05:00.0 RAID bus controller: LSI Logic / Symbios Logic MegaRAID SAS
2108 [Liberator] (rev 05)
        Subsystem: LSI Logic / Symbios Logic MegaRAID SAS 9260-8i
        Flags: bus master, fast devsel, latency 0, IRQ 29, NUMA node 0
        I/O ports at 7000 [size=256]
        Memory at df960000 (64-bit, non-prefetchable) [size=16K]
        Memory at df900000 (64-bit, non-prefetchable) [size=256K]
        Expansion ROM at df940000 [disabled] [size=128K]
        Capabilities: [50] Power Management version 3
        Capabilities: [68] Express Endpoint, MSI 00
        Capabilities: [d0] Vital Product Data
        Capabilities: [a8] MSI: Enable- Count=1/1 Maskable- 64bit+
        Capabilities: [c0] MSI-X: Enable+ Count=15 Masked-
        Capabilities: [100] Advanced Error Reporting
        Capabilities: [138] Power Budgeting <?>
        Kernel driver in use: megaraid_sas
        Kernel modules: megaraid_sas

Thanks for any ideas.

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: RecovData Handshk error
  2017-07-30  1:54 RecovData Handshk error Alex
@ 2017-08-05 19:43 ` Alex
  2017-08-07 13:19   ` Drew
  2017-08-08 20:08   ` David C. Rankin
  0 siblings, 2 replies; 4+ messages in thread
From: Alex @ 2017-08-05 19:43 UTC (permalink / raw)
  To: Linux RAID

Hi all, I sent the message below last week and haven't received any
response. Is there a place that might be more appropriate for help
with my MegaRAID and possible disk failure?

On Sat, Jul 29, 2017 at 9:54 PM, Alex <mysqlstudent@gmail.com> wrote:
> Hi,
>
> I have a fedora25 system with a MegaRAID SAS 9260-8i. I believe
> there's 8 500GB SSDs connected. It appears one (or more?) may be
> having a problem. I'm now noticing this message periodically in the
> kernel logs:
>
> [Wed Jun 28 01:40:06 2017] ata1.00: exception Emask 0x0 SAct 0x0 SErr
> 0x400001 action 0x6 frozen
> [Wed Jun 28 01:40:06 2017] ata1: SError: { RecovData Handshk }
> [Wed Jun 28 01:40:06 2017] ata1.00: cmd
> a0/00:00:00:08:00/00:00:00:00:00/a0 tag 29 pio 16392 in
>                                     Get event status notification 4a
> 01 00 00 10 00 00 00 08 00res 40/00:00:00:00:00/00:00:00:00:00/00
> Emask 0x4 (timeout)
> [Wed Jun 28 01:40:06 2017] ata1.00: status: { DRDY }
> [Wed Jun 28 01:40:06 2017] ata1: hard resetting link
> [Wed Jun 28 01:40:06 2017] ata1: SATA link up 1.5 Gbps (SStatus 113
> SControl 300)
> [Wed Jun 28 01:40:06 2017] ata1.00: configured for UDMA/133
> [Wed Jun 28 01:40:06 2017] ata1: EH complete
>
> Is it something to be concerned with? What level of panic? :-)
>
> It appears to recover normally? Is it a device or disk issue? How can
> I identify which disk is involved?
>
> This is the information on the control from lspci:
>
> 05:00.0 RAID bus controller: LSI Logic / Symbios Logic MegaRAID SAS
> 2108 [Liberator] (rev 05)
>         Subsystem: LSI Logic / Symbios Logic MegaRAID SAS 9260-8i
>         Flags: bus master, fast devsel, latency 0, IRQ 29, NUMA node 0
>         I/O ports at 7000 [size=256]
>         Memory at df960000 (64-bit, non-prefetchable) [size=16K]
>         Memory at df900000 (64-bit, non-prefetchable) [size=256K]
>         Expansion ROM at df940000 [disabled] [size=128K]
>         Capabilities: [50] Power Management version 3
>         Capabilities: [68] Express Endpoint, MSI 00
>         Capabilities: [d0] Vital Product Data
>         Capabilities: [a8] MSI: Enable- Count=1/1 Maskable- 64bit+
>         Capabilities: [c0] MSI-X: Enable+ Count=15 Masked-
>         Capabilities: [100] Advanced Error Reporting
>         Capabilities: [138] Power Budgeting <?>
>         Kernel driver in use: megaraid_sas
>         Kernel modules: megaraid_sas
>
> Thanks for any ideas.

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: RecovData Handshk error
  2017-08-05 19:43 ` Alex
@ 2017-08-07 13:19   ` Drew
  2017-08-08 20:08   ` David C. Rankin
  1 sibling, 0 replies; 4+ messages in thread
From: Drew @ 2017-08-07 13:19 UTC (permalink / raw)
  To: Alex; +Cc: Linux RAID

Hi Alex,

This particular mailing list is dedicated to the linux software raid
subsystem md, not raid in general. Your best bet is to ask in one of
the SAS/SATA mailing lists, or a vendor specific forum for further
info.


-- 
Drew

"Nothing in life is to be feared. It is only to be understood."
--Marie Curie

"This started out as a hobby and spun horribly out of control."
-Unknown

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: RecovData Handshk error
  2017-08-05 19:43 ` Alex
  2017-08-07 13:19   ` Drew
@ 2017-08-08 20:08   ` David C. Rankin
  1 sibling, 0 replies; 4+ messages in thread
From: David C. Rankin @ 2017-08-08 20:08 UTC (permalink / raw)
  To: Alex; +Cc: mdraid

On 08/05/2017 02:43 PM, Alex wrote:
> Hi all, I sent the message below last week and haven't received any
> response. Is there a place that might be more appropriate for help
> with my MegaRAID and possible disk failure?

Alex,

  Sorry, the reason you got no response is because the LSI (or whatever
vendor) MegaRAID software is a for a proprietary HARDWARE raid implementation.
This list is for opensource Linux SOFTWARE raid. The MegaRAID software used to
manage the proprietary hardware RAID card and drives (while opensourced a
while back) has nothing at all to do with Linux software RAID discussed here.

  Once you get your problem sorted out, (or if you can't), you might consider
just using your raid card as a simple disk controller and creating a Linux
software RAID array on that -- then your questions would be relevant here.

(and unless you are saturating your hardware raid setup relying on a
battery-powered write-back cache, then good old free software raid here will
likely match or exceed the I/O performance of your current hardware card :)

-- 
David C. Rankin, J.D.,P.E.

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2017-08-08 20:08 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2017-07-30  1:54 RecovData Handshk error Alex
2017-08-05 19:43 ` Alex
2017-08-07 13:19   ` Drew
2017-08-08 20:08   ` David C. Rankin

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.