From mboxrd@z Thu Jan 1 00:00:00 1970 From: Ric Wheeler Subject: Re: end to end error recovery musings Date: Mon, 26 Feb 2007 10:18:22 -0500 Message-ID: <45E2FA3E.3040201@emc.com> References: <45DEF6EF.3020509@emc.com> <45DF80C9.5080606@zytor.com> <20070224003723.GS10715@schatzie.adilger.int> <20070224023229.GB4380@thunk.org> <17890.28977.989203.938339@notabene.brown> <20070226132511.GB8154@thunk.org> <20070226151507.13a1701e@lxorguk.ukuu.org.uk> Reply-To: ric@emc.com Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: <20070226151507.13a1701e@lxorguk.ukuu.org.uk> Sender: linux-raid-owner@vger.kernel.org To: Alan Cc: Theodore Tso , Neil Brown , "H. Peter Anvin" , Linux-ide , linux-scsi , linux-raid@vger.kernel.org, Tejun Heo , James Bottomley , Mark Lord , Jens Axboe , "Clark, Nathan" , "Singh, Arvinder" , "De Smet, Jochen" , "Farmer, Matt" , linux-fsdevel@vger.kernel.org, "Mizar, Sunita" List-Id: linux-raid.ids Alan wrote: >> the new location. I believe this should be always true, so presumably >> with all modern disk drives a write error should mean something very >> serious has happend. > > Not quite that simple. I think that write errors are normally quite serious, but there are exceptions which might be able to be worked around with retries. To Ted's point, in general, a write to a bad spot on the media will cause a remapping which should be transparent (if a bit slow) to us. > > If you write a block aligned size the same size as the physical media > block size maybe this is true. If you write a sector on a device with > physical sector size larger than logical block size (as allowed by say > ATA7) then it's less clear what happens. I don't know if the drive > firmware implements multiple "tails" in this case. > > On a read error it is worth trying the other parts of the I/O. > I think that this is mostly true, but we also need to balance this against the need for higher levels to get a timely response. In a really large IO, a naive retry of a very large write could lead to a non-responsive system for a very large time... ric