All of lore.kernel.org
 help / color / mirror / Atom feed
* [PATCH 0/2] Modify read error handle for RAID-4,5,6.
@ 2012-05-26  2:52 majianpeng
  2012-06-27  4:32 ` NeilBrown
  0 siblings, 1 reply; 4+ messages in thread
From: majianpeng @ 2012-05-26  2:52 UTC (permalink / raw)
  To: Neil Brown; +Cc: linux-raid

When RAID-4,5,6 degraded and met read-error, it will eject the rdev.And then
the RAID will fail and lost data.Because the function of set-badsector,when 
this occur,it will set-badsector,not ejecting the rdev.
When RAID-4,5,6 met read-error, it will re-write if RAID was not degrade.But if 
re-write error,it will eject the rdev and RAID will degrade and it will take too
long time for recoverying.So I add judgement for controling how may re-write-error
can eject the rdev.

I do those for flexible controling the read-error for different situation.
	

majianpeng (2):
  md/raid456: When readed error and raid was degraded,it try to
    set badsector, not ejecting the rdev.
  md/raid456:Add interface for contorling eject rdev when re-write
    failed.

 drivers/md/md.c    |   35 +++++++++++++++++++++++++++++++++++
 drivers/md/md.h    |    2 ++
 drivers/md/raid5.c |   28 +++++++++++++++++++---------
 3 files changed, 56 insertions(+), 9 deletions(-)

-- 
1.7.5.4
 				
--------------
majianpeng
2012-05-26


^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [PATCH 0/2] Modify read error handle for RAID-4,5,6.
  2012-05-26  2:52 [PATCH 0/2] Modify read error handle for RAID-4,5,6 majianpeng
@ 2012-06-27  4:32 ` NeilBrown
  2012-06-27  6:03   ` majianpeng
  0 siblings, 1 reply; 4+ messages in thread
From: NeilBrown @ 2012-06-27  4:32 UTC (permalink / raw)
  To: majianpeng; +Cc: linux-raid

[-- Attachment #1: Type: text/plain, Size: 1370 bytes --]

On Sat, 26 May 2012 10:52:50 +0800 "majianpeng" <majianpeng@gmail.com> wrote:

> When RAID-4,5,6 degraded and met read-error, it will eject the rdev.And then
> the RAID will fail and lost data.Because the function of set-badsector,when 
> this occur,it will set-badsector,not ejecting the rdev.
> When RAID-4,5,6 met read-error, it will re-write if RAID was not degrade.But if 
> re-write error,it will eject the rdev and RAID will degrade and it will take too
> long time for recoverying.So I add judgement for controling how may re-write-error
> can eject the rdev.
> 
> I do those for flexible controling the read-error for different situation.
> 	

Thanks.

> 
> majianpeng (2):
>   md/raid456: When readed error and raid was degraded,it try to
>     set badsector, not ejecting the rdev.

I've applied this one.  I also added 'set_bad = 1' in the case where
the re-write failed.

>   md/raid456:Add interface for contorling eject rdev when re-write
>     failed.

I haven't applied this.  I'm not entirely sure what the point of counting
the errors was, but I don't think it is necessary.

Thanks,
NeilBrown


> 
>  drivers/md/md.c    |   35 +++++++++++++++++++++++++++++++++++
>  drivers/md/md.h    |    2 ++
>  drivers/md/raid5.c |   28 +++++++++++++++++++---------
>  3 files changed, 56 insertions(+), 9 deletions(-)
> 


[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 828 bytes --]

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: Re: [PATCH 0/2] Modify read error handle for RAID-4,5,6.
  2012-06-27  4:32 ` NeilBrown
@ 2012-06-27  6:03   ` majianpeng
  2012-06-28  0:04     ` NeilBrown
  0 siblings, 1 reply; 4+ messages in thread
From: majianpeng @ 2012-06-27  6:03 UTC (permalink / raw)
  To: Neil Brown; +Cc: linux-raid

On 2012-06-27 12:32 NeilBrown <neilb@suse.de> Wrote:
>On Sat, 26 May 2012 10:52:50 +0800 "majianpeng" <majianpeng@gmail.com> wrote:
>
>> When RAID-4,5,6 degraded and met read-error, it will eject the rdev.And then
>> the RAID will fail and lost data.Because the function of set-badsector,when 
>> this occur,it will set-badsector,not ejecting the rdev.
>> When RAID-4,5,6 met read-error, it will re-write if RAID was not degrade.But if 
>> re-write error,it will eject the rdev and RAID will degrade and it will take too
>> long time for recoverying.So I add judgement for controling how may re-write-error
>> can eject the rdev.
>> 
>> I do those for flexible controling the read-error for different situation.
>> 	
>
>Thanks.
>
>> 
>> majianpeng (2):
>>   md/raid456: When readed error and raid was degraded,it try to
>>     set badsector, not ejecting the rdev.
>
>I've applied this one.  I also added 'set_bad = 1' in the case where
>the re-write failed.
>
>>   md/raid456:Add interface for contorling eject rdev when re-write
>>     failed.
>
>I haven't applied this.  I'm not entirely sure what the point of counting
>the errors was, but I don't think it is necessary.
Using raid456,the first object is to protect data.But in some situation, the user
can endure lost some data instead of raid degraed or failed.
After introduce the badblocks, I think md-driver should do flexible controling for
error.The controling can control by different user for different requirment.
>
>Thanks,
>NeilBrown
>
>
>> 
>>  drivers/md/md.c    |   35 +++++++++++++++++++++++++++++++++++
>>  drivers/md/md.h    |    2 ++
>>  drivers/md/raid5.c |   28 +++++++++++++++++++---------
>>  3 files changed, 56 insertions(+), 9 deletions(-)
>> 
>
>

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [PATCH 0/2] Modify read error handle for RAID-4,5,6.
  2012-06-27  6:03   ` majianpeng
@ 2012-06-28  0:04     ` NeilBrown
  0 siblings, 0 replies; 4+ messages in thread
From: NeilBrown @ 2012-06-28  0:04 UTC (permalink / raw)
  To: majianpeng; +Cc: linux-raid

[-- Attachment #1: Type: text/plain, Size: 2240 bytes --]

On Wed, 27 Jun 2012 14:03:55 +0800 majianpeng <majianpeng@gmail.com> wrote:

> On 2012-06-27 12:32 NeilBrown <neilb@suse.de> Wrote:
> >On Sat, 26 May 2012 10:52:50 +0800 "majianpeng" <majianpeng@gmail.com> wrote:
> >
> >> When RAID-4,5,6 degraded and met read-error, it will eject the rdev.And then
> >> the RAID will fail and lost data.Because the function of set-badsector,when 
> >> this occur,it will set-badsector,not ejecting the rdev.
> >> When RAID-4,5,6 met read-error, it will re-write if RAID was not degrade.But if 
> >> re-write error,it will eject the rdev and RAID will degrade and it will take too
> >> long time for recoverying.So I add judgement for controling how may re-write-error
> >> can eject the rdev.
> >> 
> >> I do those for flexible controling the read-error for different situation.
> >> 	
> >
> >Thanks.
> >
> >> 
> >> majianpeng (2):
> >>   md/raid456: When readed error and raid was degraded,it try to
> >>     set badsector, not ejecting the rdev.
> >
> >I've applied this one.  I also added 'set_bad = 1' in the case where
> >the re-write failed.
> >
> >>   md/raid456:Add interface for contorling eject rdev when re-write
> >>     failed.
> >
> >I haven't applied this.  I'm not entirely sure what the point of counting
> >the errors was, but I don't think it is necessary.
> Using raid456,the first object is to protect data.But in some situation, the user
> can endure lost some data instead of raid degraed or failed.
> After introduce the badblocks, I think md-driver should do flexible controling for
> error.The controling can control by different user for different requirment.

I cannot see the point of that control though.

Sure you *always* want to record a bad block if possible, if the alternative
is ejecting the whole device?
I don't see where the choice would be between "lost data" or "degraded array".

Maybe if the failing device caused large delays then you want to eject it
soon rather than struggling on with it.  However my belief is that if you
don't want long delays, then you should tell the device to fail rather than
impose long delays.  It is not something that md should care about.

So: still a little confused.

NeilBrown

[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 828 bytes --]

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2012-06-28  0:04 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2012-05-26  2:52 [PATCH 0/2] Modify read error handle for RAID-4,5,6 majianpeng
2012-06-27  4:32 ` NeilBrown
2012-06-27  6:03   ` majianpeng
2012-06-28  0:04     ` NeilBrown

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.