From mboxrd@z Thu Jan 1 00:00:00 1970 From: Kiyoshi Ueda Subject: Re: [PATCHSET block#for-2.6.36-post] block: replace barrier with sequenced flush Date: Fri, 20 Aug 2010 17:26:02 +0900 Message-ID: <4C6E3C1A.50205@ct.jp.nec.com> References: <1281616891-5691-1-git-send-email-tj@kernel.org> <20100813114858.GA31937@lst.de> <4C654D4B.1030507@kernel.org> <20100813143820.GA7931@lst.de> <4C655BE5.4070809@kernel.org> <20100814103654.GA13292@lst.de> <4C6A5D8A.4010205@kernel.org> <20100817131915.GB2963@lst.de> <4C6ABBCB.9030306@kernel.org> <20100817165929.GB13800@lst.de> Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: <20100817165929.GB13800@lst.de> Sender: linux-ide-owner@vger.kernel.org To: Christoph Hellwig , Tejun Heo Cc: jaxboe@fusionio.com, linux-fsdevel@vger.kernel.org, linux-scsi@vger.kernel.org, linux-ide@vger.kernel.org, linux-kernel@vger.kernel.org, linux-raid@vger.kernel.org, James.Bottomley@suse.de, tytso@mit.edu, chris.mason@oracle.com, swhiteho@redhat.com, konishi.ryusuke@lab.ntt.co.jp, dm-devel@redhat.com, vst@vlnb.net, jack@suse.cz, rwheeler@redhat.com, hare@suse.de List-Id: linux-raid.ids Hi Tejun, Christoph, On Tue, Aug 17, 2010 at 06:41:47PM +0200, Tejun Heo wrote: >>> I wasn't sure about that part. You removed store_flush_error(), but >>> DM_ENDIO_REQUEUE should still have higher priority than other >>> failures, no? >> >> Which priority? > > IIUC, when any of flushes get DM_ENDIO_REQUEUE (which tells the dm > core layer to retry the whole bio later), it trumps all other failures > and the bio is retried later. That was why DM_ENDIO_REQUEUE was > prioritized over other error codes, which actually is sort of > incorrect in that once a FLUSH fails, it _MUST_ be reported to upper > layers as FLUSH failure implies data already lost. So, > DM_ENDIO_REQUEUE actually should have lower priority than other > failures. But, then again, the error codes still need to be > prioritized. I think that's correct and changing the priority of DM_ENDIO_REQUEUE for REQ_FLUSH down to the lowest should be fine. (I didn't know that FLUSH failure implies data loss possibility.) But the patch is not enough, you have to change target drivers, too. E.g. As for multipath, you need to change drivers/md/dm-mpath.c:do_end_io() to return error for REQ_FLUSH like the REQ_DISCARD support included in 2.6.36-rc1. By the way, if these patch-set with the change above are included, even one path failure for REQ_FLUSH on multipath configuration will be reported to upper layer as error, although it's retried using other paths currently. Then, if an upper layer won't take correct recovery action for the error, it would be seen as a regression for users. (e.g. Frequent EXT3-error resulting in read-only mount on multipath configuration.) Although I think the explicit error is fine rather than implicit data corruption, please check upper layers carefully so that users won't see such errors as much as possible. Thanks, Kiyoshi Ueda