From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from aserp1040.oracle.com ([141.146.126.69]:33867 "EHLO aserp1040.oracle.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753130AbdC1SlJ (ORCPT ); Tue, 28 Mar 2017 14:41:09 -0400 Date: Tue, 28 Mar 2017 11:40:10 -0700 From: Liu Bo To: dsterba@suse.cz, linux-btrfs@vger.kernel.org, Qu Wenruo Subject: Re: [PATCH v2] Btrfs: fix unexpected file hole after disk errors Message-ID: <20170328184010.GB4385@lim.localdomain> Reply-To: bo.li.liu@oracle.com References: <1488330280-22678-1-git-send-email-bo.li.liu@oracle.com> <1488831810-26684-1-git-send-email-bo.li.liu@oracle.com> <20170328125006.GY4781@twin.jikos.cz> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii In-Reply-To: <20170328125006.GY4781@twin.jikos.cz> Sender: linux-btrfs-owner@vger.kernel.org List-ID: On Tue, Mar 28, 2017 at 02:50:06PM +0200, David Sterba wrote: > On Mon, Mar 06, 2017 at 12:23:30PM -0800, Liu Bo wrote: > > Btrfs creates hole extents to cover any unwritten section right before > > doing buffer writes after commit 3ac0d7b96a26 ("btrfs: Change the expanding > > write sequence to fix snapshot related bug."). > > > > However, that takes the start position of the buffered write to compare > > against the current EOF, hole extents would be created only if (EOF < > > start). > > > > If the EOF is at the middle of the buffered write, no hole extents will be > > created and a file hole without a hole extent is left in this file. > > > > This bug was revealed by generic/019 in fstests. 'fsstress' in this test > > may create the above situation and the test then fails all requests > > including writes, so the buffer write which is supposed to cover the > > hole (without the hole extent) couldn't make it on disk. Running fsck > > against such btrfs ends up with detecting file extent holes. > > > > Things could be more serious, some stale data would be exposed to > > userspace if files with this kind of hole are truncated to a position of > > the hole, because the on-disk inode size is beyond the last extent in the > > file. > > > > This fixes the bug by comparing the end position against the EOF. > > Is the test reliable? As I read it, it should be possible to craft the > file extents and trigger the bug. And verify the fix. It's not, running generic/019 by 10 times could produces a btrfsck error once on my box. Actually I think we don't need this patch any more since we're going to remove hole extents. I made a mistake when writing the above 'more serious' part, in fact truncating to a hole doesn't end up stale data as we don't have any file extent that points to the hole and reading the hole part should get all zero, thus there's no serious problem. It was another bug about setting disk isize to zero accidentally and data was lost, which has been fixed. Thanks, -liubo