From: Eric Blake <eblake@redhat.com>
To: Max Reitz <mreitz@redhat.com>, Qemu-block <qemu-block@nongnu.org>,
	Kevin Wolf <kwolf@redhat.com>
Cc: "qemu-devel@nongnu.org" <qemu-devel@nongnu.org>
Subject: Re: qcow2: Zero-initialization of external data files
Date: Mon, 6 Apr 2020 17:22:46 -0500
Message-ID: <50080252-ff22-78ed-0002-1742c694471b@redhat.com>
In-Reply-To: <db0b4b5c-d564-f003-bbbc-9afdcac07cc4@redhat.com>

On 2/17/20 10:56 AM, Max Reitz wrote:
> Hi,
> 
> AFAIU, external data files with data_file_raw=on are supposed to return
> the same data as the qcow2 file when read.  But we still use the qcow2
> metadata structures (which are by default initialized to “everything
> unallocated”), even though we never ensure that the external data file
> is zero, too, so this can happen:
> 
> $ dd if=/dev/urandom of=foo.raw bs=1M count=64
> [...]
> 
> $ sudo losetup -f --show foo.raw
> /dev/loop0
> 
> $ sudo ./qemu-img create -f qcow2 -o \
>      data_file=/dev/loop0,data_file_raw=on foo.qcow2 64M
> [...]
> 
> $ sudo ./qemu-io -c 'read -P 0 0 64M' foo.qcow2
> read 67108864/67108864 bytes at offset 0
> 64 MiB, 1 ops; 00.00 sec (25.036 GiB/sec and 400.5751 ops/sec)

This looks like a bug (and we should fix it for 5.0 if possible): a read
from an image with data_file_raw=on should not treat unallocated clusters
as reading zeroes, but should return whatever the raw data file contains.
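
To make the expected semantics concrete, here is a toy model in Python. It is
not QEMU code: the function name, the 4-byte cluster size, and the list used
as an allocation map are all made up for illustration. It only contrasts
"unallocated reads as zero" with "the raw file is authoritative".

```python
# Toy model only; every name here is hypothetical, not a QEMU API.
CLUSTER_SIZE = 4  # tiny cluster size, chosen just to keep the example readable


def read_cluster(allocated, raw_data, index, data_file_raw):
    """Return the bytes a reader would see for one cluster."""
    start = index * CLUSTER_SIZE
    raw = raw_data[start:start + CLUSTER_SIZE]
    if allocated[index] or data_file_raw:
        # With data_file_raw=on the raw file is authoritative,
        # whatever the qcow2 metadata says about allocation.
        return raw
    # Plain qcow2 without a backing file: unallocated reads as zeroes.
    return bytes(CLUSTER_SIZE)


raw_data = bytes([0xAA] * 8)   # stale, non-zero contents of the data file
allocated = [False, False]     # freshly created image: nothing allocated

# Behaviour reported in the thread: metadata wins, reader sees zeroes.
as_reported = read_cluster(allocated, raw_data, 0, data_file_raw=False)
# Behaviour argued for here: the reader sees what the raw file holds.
as_expected = read_cluster(allocated, raw_data, 0, data_file_raw=True)
```

In this model the only change is that the data_file_raw flag short-circuits
the allocation check; how that maps onto the real read path is a separate
question.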

> 
> $ sudo ./qemu-io -c 'read -P 0 0 64M' -f raw foo.raw
> Pattern verification failed at offset 0, 67108864 bytes
> read 67108864/67108864 bytes at offset 0
> 64 MiB, 1 ops; 00.01 sec (5.547 GiB/sec and 88.7484 ops/sec)
> 
> I suppose this behavior is fine for blockdev-create because I guess it’s
> the user’s responsibility to ensure that the external data file is zero.
>   But maybe it isn’t, so that’s my first question: Is it really the
> user’s responsibility or should we always ensure it’s zero?

I'd argue that requiring the user to pre-zero the raw data file is 
undesirable; and that we should instead fix our code to not report the 
image as reading all zeroes when creating with data_file_raw=on.

> 
> My second question is: If we decide that this is fine for
> blockdev-create, should at least qcow2_co_create_opts() ensure the data
> file it just created is zero?

Having an option to make qemu force-zero the raw data file during
qcow2_co_create_opts() seems reasonable, but for performance reasons, I
don't think the option should be on by default.
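
As a rough sketch of what such an opt-in could look like, again as a Python
toy rather than QEMU code: create_data_file() and its zero_init parameter are
hypothetical names, not an existing option. The point is only that zeroing
costs a full pass of writes over the file, so it is skipped unless requested.

```python
import os
import tempfile


def create_data_file(path, size, zero_init=False, chunk=1 << 20):
    """Hypothetical helper: size the data file, zeroing it only on request."""
    # Open without O_TRUNC so existing contents survive by default.
    fd = os.open(path, os.O_RDWR | os.O_CREAT, 0o600)
    try:
        if os.fstat(fd).st_size < size:
            os.ftruncate(fd, size)  # grow the file; never shrink it here
        if zero_init:
            zeros = bytes(chunk)
            offset = 0
            while offset < size:
                n = min(chunk, size - offset)
                # pwrite may write fewer bytes than asked; advance by its result.
                offset += os.pwrite(fd, zeros[:n], offset)
    finally:
        os.close(fd)


def demo():
    with tempfile.TemporaryDirectory() as d:
        path = os.path.join(d, "foo.raw")
        with open(path, "wb") as f:
            f.write(b"\xaa" * 4096)          # pre-existing, non-zero data
        create_data_file(path, 4096)         # default: contents untouched
        with open(path, "rb") as f:
            kept = f.read()
        create_data_file(path, 4096, zero_init=True)  # opt-in: full zero pass
        with open(path, "rb") as f:
            zeroed = f.read()
    return kept, zeroed


kept, zeroed = demo()
```

With the default, the stale contents are left in place, which is why the read
path has to return them rather than zeroes.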

-- 
Eric Blake, Principal Software Engineer
Red Hat, Inc.           +1-919-301-3226
Virtualization:  qemu.org | libvirt.org



