linux-kernel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: David Greaves <david@dgreaves.com>
To: Mark Hahn <hahn@physics.mcmaster.ca>
Cc: Mitchell Laks <mlaks@verizon.net>,
	linux-raid@vger.kernel.org, linux-kernel@vger.kernel.org,
	IDE Linux <linux-ide@vger.kernel.org>
Subject: Possible libata/sata/Asus problem (was Re: Need to upgrade to latest stable mdadm version?)
Date: Mon, 23 Jan 2006 12:31:24 +0000	[thread overview]
Message-ID: <43D4CC9C.9060500@dgreaves.com> (raw)
In-Reply-To: <Pine.LNX.4.44.0601221719020.24057-100000@coffee.psychology.mcmaster.ca>

Mark Hahn wrote:

>>>and with the ETCH testing 2.6.12:  the sata_via module fails with
>>>      
>>>
>I'm sure you know that no kernel developer really cares about distro-hacked 
>kernels.  why not test a real (kernel.org) kernel?
>  
>
Only because if the problem exists on the stock kernel and not on the
distro kernel then there could be assistance in determining which patch
solves (or hides!) the problem. This may or may not actually be helpful.


>>>ata1: status=0x51 { DriveReady SeekComplete Error }
>>>ata1: error=0x84 { DriveStatusError BadCRC }
>>>      
>>>
>badcrc's are a sign that the link is failing - bad cable, bad power,
>overclocking,
>
OK

>possibly an error in the driver's timing config.
>  
>
A-ha!

>it cannot possibly be an mdadm problem, and cannot be related to 
>other software (kernel memory management, say.)
>  
>
Agreed.

> I don't see how driver interaction could cause the BadCRC, unless one
> driver
>
>is screwing with the timing registers of the other's hardware.
>  
>
And maybe, on a lightly loaded system, RAID causes concurrent access
(and potentially triggers problems) more often than a non-RAID solution?

> mdadm cannot possibly have anything to do with causing BadCRC's.
> upgrade if
>
>you feel like it, but not because of this problem.
>  
>
Completely agree.

>>I am running the *stock* 2.6.15 and get the same problems (ata timeouts etc)
>>    
>>
>is there a reason you call this a timeout, rather than a BadCRC?
>  
>
I had:
  ata2: command 0x25 timeout, stat 0x51 host_stat 0x0

It stuck in my head. It's not that representative. My bad.

most errors (for me) were:

Jan 19 15:23:05 haze kernel: ata1: PIO error
Jan 19 15:23:05 haze kernel: ata1: status=0x50 { DriveReady SeekComplete }
Jan 19 15:23:05 haze kernel: ata1: PIO error
Jan 19 15:23:05 haze kernel: ata1: status=0x50 { DriveReady SeekComplete }
Jan 19 15:23:05 haze kernel: ata1: PIO error


But if you look at:
http://marc.theaimsgroup.com/?l=linux-kernel&m=113769509617034&w=2

you'll see that I had what looked like a 'spurious' bad-block error -
and I have an Asus motherboard, am using the via_sata driver, have
another sata driver loaded and am using md - all under 2.6.15 .... hence
the tentative association of the problems :)

Oh, and libata's error handling is embryonic - maybe it should be
retrying. I dunno.

I've seen other potentially related problems in the
sata/motherboard/raid area.
Personally I suspect buggy Asus motherboards.
I wonder if the bug is triggered by multiple drivers or some concurrency
- hence raid's involvment...
(Since I suspect md is actually tickling it, not causing it moving to
lkml and linux-ide too)

Of course I plan to do some tests - but mentioning it may give others
ideas too... And maybe I/we'll get suggestions as to what to try next...

David

-- 


           reply	other threads:[~2006-01-23 12:31 UTC|newest]

Thread overview: expand[flat|nested]  mbox.gz  Atom feed
 [parent not found: <Pine.LNX.4.44.0601221719020.24057-100000@coffee.psychology.mcmaster.ca>]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=43D4CC9C.9060500@dgreaves.com \
    --to=david@dgreaves.com \
    --cc=hahn@physics.mcmaster.ca \
    --cc=linux-ide@vger.kernel.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-raid@vger.kernel.org \
    --cc=mlaks@verizon.net \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).