* wierd raid 1 problem
@ 2003-09-09 17:59 Ying-Hung Chen
2003-09-09 20:53 ` Alan Cox
0 siblings, 1 reply; 3+ messages in thread
From: Ying-Hung Chen @ 2003-09-09 17:59 UTC
To: linux-kernel
Hi all,
I have recently encountered a weird RAID 1 problem on my system. Here is
my setup:
I have two IDE hard disks on a Promise IDE controller card (PDC20262),
and here is my /etc/raidtab file:
raiddev /dev/md0
    raid-level              1
    chunk-size              64k
    persistent-superblock   1
    nr-raid-disks           2
    device                  /dev/hdd1
    raid-disk               0
    device                  /dev/hdb1
    raid-disk               1
The file system is XFS. Everything works most of the time, except that
from time to time files seem to get corrupted. I test integrity by running
rpm --checks *.rpm continuously to verify the signatures of the files.
The corrupted files seem to 'recover' by themselves if I leave the machine
alone for a while or unmount and remount the filesystem.
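
For anyone who wants to reproduce this kind of check without rpm, here is a
minimal sketch in Python. It is not the test used above, only an
illustration of the same idea: hash each file several times and report the
files whose digest changes between reads. The names file_digest and
watch_files are made up for this example, and MD5 is chosen only because it
is fast and commonly used for this sort of comparison.

```python
import hashlib
import time


def file_digest(path, chunk_size=1 << 20):
    """Return the MD5 hex digest of a file, reading it in fixed-size chunks."""
    md5 = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            md5.update(chunk)
    return md5.hexdigest()


def watch_files(paths, passes=3, delay=0.0):
    """Hash every file `passes` times.

    Returns a dict mapping each path whose digest was not identical on
    every pass to the list of digests seen, in order. An empty dict means
    every read of every file agreed.
    """
    history = {str(p): [] for p in paths}
    for _ in range(passes):
        for p in paths:
            history[str(p)].append(file_digest(p))
        time.sleep(delay)  # optional pause between passes
    return {p: seen for p, seen in history.items() if len(set(seen)) > 1}
```

Calling watch_files(list_of_paths, passes=10, delay=60) would hash each file
ten times, one minute apart. A file that shows up in the result read
differently at least once even though nothing wrote to it. Note that repeated
reads may be served from the page cache rather than from the disks, so a
stable result does not by itself prove the on-disk data is good.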
Has anyone else seen this type of temporary file corruption? I
tested against 2.4.2x kernels, including the latest vanilla 2.4.22 + xfs
patches, and they all seem to have the same problem.
Thanks,
-Ying
* Re: wierd raid 1 problem
2003-09-09 17:59 wierd raid 1 problem Ying-Hung Chen
@ 2003-09-09 20:53 ` Alan Cox
2003-09-10 18:59 ` Tomi Orava
0 siblings, 1 reply; 3+ messages in thread
From: Alan Cox @ 2003-09-09 20:53 UTC
To: Ying-Hung Chen; +Cc: Linux Kernel Mailing List
On Maw, 2003-09-09 at 18:59, Ying-Hung Chen wrote:
> The corrupted files seem to 'recover' by themselves if I leave the machine
> alone for a while or unmount and remount the filesystem.
>
> Has anyone else seen this type of temporary file corruption? I
> tested against 2.4.2x kernels, including the latest vanilla 2.4.22 + xfs
> patches, and they all seem to have the same problem.
Classic symptoms of bad memory or a kernel bug corrupting data. See if
the box passes memtest86 as a starter.
* Re: wierd raid 1 problem
2003-09-09 20:53 ` Alan Cox
@ 2003-09-10 18:59 ` Tomi Orava
0 siblings, 0 replies; 3+ messages in thread
From: Tomi Orava @ 2003-09-10 18:59 UTC
To: linux-kernel; +Cc: Ying-Hung Chen
> On Maw, 2003-09-09 at 18:59, Ying-Hung Chen wrote:
>> The corrupted files seem to 'recover' by themselves if I leave the machine
>> alone for a while or unmount and remount the filesystem.
>>
>> Has anyone else seen this type of temporary file corruption? I
>> tested against 2.4.2x kernels, including the latest vanilla 2.4.22 + xfs
>> patches, and they all seem to have the same problem.
>
> Classic symptoms of bad memory or a kernel bug corrupting data. See if
> the box passes memtest86 as a starter.
I actually saw the same thing happen a week ago with
linux-2.4.23-pre1+xfs (xfs bk).
The really weird thing was that I copied several big files from one
disk array to another and ran md5sum before and after the copy. The files
were intact before copying, but the checksums failed right after the copy.
When I started investigating what the hell was wrong, the next md5sum on the
destination filesystem was successful ... and that was it (no more errors).
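
The copy-and-checksum procedure can be sketched roughly as follows. This is
an illustration only, not the commands that were actually run: the helper
names md5_of and copy_and_verify are hypothetical, and Python's hashlib and
shutil stand in for md5sum and cp.

```python
import hashlib
import shutil


def md5_of(path, chunk_size=1 << 20):
    """Return the MD5 hex digest of a file, reading it in fixed-size chunks."""
    md5 = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            md5.update(chunk)
    return md5.hexdigest()


def copy_and_verify(src, dst, rechecks=2):
    """Checksum src, copy it to dst, then re-check both sides.

    Returns a list of (label, matches) tuples, where `matches` is True if
    that read produced the same digest as the source had before the copy.
    """
    before = md5_of(src)
    shutil.copyfile(src, dst)
    results = [("source after copy", md5_of(src) == before)]
    # Read the destination more than once: in the case described here the
    # first check failed and a later one passed.
    for i in range(rechecks):
        results.append((f"destination read {i + 1}", md5_of(dst) == before))
    return results
```

A result such as a False for "destination read 1" followed by a True for
"destination read 2" would match the behaviour described above, where the
mismatch disappeared on the next check.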
The source filesystem was a two-disk RAID1 array with XFS (Sil680).
The destination filesystem was a four-disk RAID1+0 array, also with XFS
(HPT374 with the HighPoint binary driver 1.10, as 2.4.22 still doesn't work
at all with the Epox 8K9A3+ motherboard's integrated HPT374 with DMA).
I ran the latest memtest86 for 24h (29 passes) without any errors.
Regards,
Tomi Orava