From mboxrd@z Thu Jan 1 00:00:00 1970 From: Goswin von Brederlow Subject: Re: mismatch_cnt again Date: Sat, 14 Nov 2009 20:04:37 +0100 Message-ID: <87lji92aii.fsf@frosties.localdomain> References: <4AF4C247.6050303@eyal.emu.id.au> <4AF4D323.6020108@panix.com> <4AF5268D.60900@eyal.emu.id.au> <4877c76c0911070008m789507f8h799d419287740ca5@mail.gmail.com> <87tyx6tpcb.fsf@frosties.localdomain> <4AF58B20.3000409@redhat.com> <87iqdlaujb.fsf@frosties.localdomain> <4AF74B61.6000102@rabbit.us> <20091109185632.GA2723@lazy.lzy> <73ebdcee169f46611d411755f9aaca5b.squirrel@neil.brown.name> <20091109215443.GA4143@lazy.lzy> <4AF92DBD.5010102@rabbit.us> <4AFC8EC5.6060400@tmr.com> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Return-path: In-Reply-To: (Martin K. Petersen's message of "Fri, 13 Nov 2009 12:12:45 -0500") Sender: linux-raid-owner@vger.kernel.org To: "Martin K. Petersen" Cc: Bill Davidsen , Peter Rabbitson , NeilBrown , Piergiorgio Sartor , Goswin von Brederlow , Doug Ledford , Michael Evans , Eyal Lebedinsky , linux-raid list List-Id: linux-raid.ids "Martin K. Petersen" writes: >>>>>> "Bill" == Bill Davidsen writes: > >>> FWIW, XFS and btrfs both use the page writeback bit correctly and >>> never change a page while it is undergoing I/O. >>> >>> > Bill> That's necessary but not sufficient. To be done correctly it must > Bill> be protected by md as well. This is because arrays are used > Bill> without a filesystem by some applications, such as swap and > Bill> database, to name the most common cases. > > I agree that making MD RAID1 do a copy would be a quick fix. But I > don't see any reason to encourage what is essentially sloppy behavior at > the top of the stack. And then what if you stack MD/DM devices? Do > each layer do a copy? I think that gets murky pretty quickly. Maybe as a quick debug the raid layer should make the page read-only and then watch what fails to write to it. > I'd much rather fix the cases where the top layers are broken. And as I > said there are several people working on this spurred by my work on the > data integrity extensions. > > FWIW, databases on raw disk have gone out of fashion. But it is true > that applications that do direct I/O need to avoid updating buffers in > flight. Maybe a flag somewhere saying if the data is safe from writes or not. Default would be unsafe and md copies. A filesystem that works "right" sets the safe flag as would md after copying. That way anything lower in the stack (like another md) has the flag set. MfG Goswin