From: Eric Blake <eblake@redhat.com>
To: Vladimir Sementsov-Ogievskiy <vsementsov@virtuozzo.com>,
Kevin Wolf <kwolf@redhat.com>, Max Reitz <mreitz@redhat.com>
Cc: Andrey Shinkevich <andrey.shinkevich@virtuozzo.com>,
qemu-devel <qemu-devel@nongnu.org>,
qemu block <qemu-block@nongnu.org>
Subject: Re: backing chain & block status & filters
Date: Tue, 28 Apr 2020 11:18:48 -0500 [thread overview]
Message-ID: <25f6278c-466d-c663-73e0-ef2d256b326d@redhat.com> (raw)
In-Reply-To: <2e3eab55-4a1d-f1a9-ab28-3f9399c57bfe@virtuozzo.com>
On 4/28/20 10:13 AM, Vladimir Sementsov-Ogievskiy wrote:
>>> Hm. I could imagine that there are formats that have non-zero holes
>>> (e.g. 0xff or just garbage). It would be a bit wrong for them to return
>>> ZERO or DATA then.
>>>
>>> But OTOH we don’t care about such cases, do we? We need to know whether
>>> ranges are zero, data, or unallocated. If they aren’t zero, we only
>>> care about whether reading from it will return data from this layer
>>> or not.
>>>
>>> So I suppose that anything that doesn’t support backing files (or
>>> filtered children) should always return ZERO and/or DATA.
>>
>> I'm not sure I agree with the notion that everything should be
>> BDRV_BLOCK_ALLOCATED at the lowest layer. It's not what it means today
>> at least. If we want to change this, we will have to check all callers
>> of bdrv_is_allocated() and friends who might use this to find holes in
>> the file.
>
> Yes. Because they are doing incorrect (or at least undocumented and
> unreliable) thing.
Here's some previous mails discussing the same question about what
block_status should actually mean. At the time, I was so scared of the
prospect of something breaking if I changed things that I ended up
keeping status quo, so here we are revisiting the topic several years
later, still asking the same questions.
https://lists.gnu.org/archive/html/qemu-devel/2017-12/msg00069.html
https://lists.gnu.org/archive/html/qemu-devel/2018-02/msg03757.html
>
>>
>> Basically, the way bdrv_is_allocated() works today is that we assume an
>> implicit zeroed backing layer even for block drivers that don't support
>> backing files.
>
> But read doesn't work so: it will read data from the bottom layer, not from
> this implicit zeroed backing layer. And it is inconsistent. On read data
> comes exactly from this layer, not from its implicit backing. So it should
> return BDRV_BLOCK_ALLOCATED, accordingly to its definition..
>
> Or, we should at least document current behavior:
>
> BDRV_BLOCK_ALLOCATED: the content of the block is determined by this
> layer rather than any backing, set by block. Attention: it may not be
> set
> for drivers without backing support, still data is of course read from
> this layer. Note, that for such drivers BDRV_BLOCK_ALLOCATED may mean
> allocation on fs level, which occupies real space on disk.. So, for
> such drivers
>
> ZERO | ALLOCATED means that, read as zero, data may be allocated on
> fs, or
> (most probably) not,
> don't look at ALLOCATED flag, as it is added by generic layer for
> another logic,
> not related to fs-allocation.
>
> 0 means that, most probably, data doesn't occupy space on fs,
> zero-status is
> unknown (most probably non-zero)
>
That may be right in describing the current situation, but again, needs
a GOOD audit of what we are actually using it for, and whether it is
what we really WANT to be using it for. If we're going to
audit/refactor the code, we might as well get semantics that are
actually useful, rather than painfully contorted to documentation that
happens to match our current contorted code.
--
Eric Blake, Principal Software Engineer
Red Hat, Inc. +1-919-301-3226
Virtualization: qemu.org | libvirt.org
next prev parent reply other threads:[~2020-04-28 16:22 UTC|newest]
Thread overview: 16+ messages / expand[flat|nested] mbox.gz Atom feed top
2020-04-28 8:55 backing chain & block status & filters Vladimir Sementsov-Ogievskiy
2020-04-28 11:08 ` Max Reitz
2020-04-28 11:28 ` Kevin Wolf
2020-04-28 15:13 ` Vladimir Sementsov-Ogievskiy
2020-04-28 16:18 ` Eric Blake [this message]
2020-04-28 16:46 ` Vladimir Sementsov-Ogievskiy
2020-04-28 18:37 ` Kevin Wolf
2020-04-28 19:44 ` Vladimir Sementsov-Ogievskiy
2020-04-29 9:15 ` Vladimir Sementsov-Ogievskiy
2020-04-29 10:50 ` Vladimir Sementsov-Ogievskiy
2020-04-28 14:51 ` Vladimir Sementsov-Ogievskiy
2020-04-30 19:12 ` Vladimir Sementsov-Ogievskiy
2020-05-01 3:04 ` Andrey Shinkevich
2020-05-06 5:56 ` Vladimir Sementsov-Ogievskiy
2020-05-07 12:58 ` Max Reitz
2020-05-07 19:34 ` Vladimir Sementsov-Ogievskiy
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=25f6278c-466d-c663-73e0-ef2d256b326d@redhat.com \
--to=eblake@redhat.com \
--cc=andrey.shinkevich@virtuozzo.com \
--cc=kwolf@redhat.com \
--cc=mreitz@redhat.com \
--cc=qemu-block@nongnu.org \
--cc=qemu-devel@nongnu.org \
--cc=vsementsov@virtuozzo.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).