* [Qemu-devel] Cluster_size parameter issue on qcow2 image format
@ 2012-02-23  5:31 PANKAJ RAWAT
  2012-02-23  8:32 ` Stefan Hajnoczi
  0 siblings, 1 reply; 5+ messages in thread
From: PANKAJ RAWAT @ 2012-02-23  5:31 UTC (permalink / raw)
  To: qemu-devel


Hi all

I have been working on the qcow2 image format.

In theory, regarding cluster size, it is written that performance
should increase as the cluster size increases.

But something surprising happened: performance degrades as the
cluster size is increased from 64K to 1M (during expansion of the
qcow2 image).

Can anyone tell me why?



* Re: [Qemu-devel] Cluster_size parameter issue on qcow2 image format
  2012-02-23  5:31 [Qemu-devel] Cluster_size parameter issue on qcow2 image format PANKAJ RAWAT
@ 2012-02-23  8:32 ` Stefan Hajnoczi
  2012-02-23 10:02   ` PANKAJ RAWAT
  0 siblings, 1 reply; 5+ messages in thread
From: Stefan Hajnoczi @ 2012-02-23  8:32 UTC (permalink / raw)
  To: PANKAJ RAWAT; +Cc: qemu-devel

On Thu, Feb 23, 2012 at 11:01:46AM +0530, PANKAJ RAWAT wrote:
> In theory, regarding cluster size, it is written that performance
> should increase as the cluster size increases.
>
> But something surprising happened: performance degrades as the
> cluster size is increased from 64K to 1M (during expansion of the
> qcow2 image).

It's not true that raising the cluster size should always increase
performance; otherwise the default would be infinity.  It's an
algorithms/data-structures trade-off.

What matters most is the relative latency of a small guest I/O
request (e.g. 4 KB) versus a cluster-sized operation (e.g. 64 KB).
If a cluster-sized operation takes orders of magnitude longer than a
small guest I/O request, be prepared to see the extreme effects
described below:

 * Bigger clusters decrease the frequency of metadata operations and
   increase metadata cache hit rates.  Bigger clusters mean less
   metadata, so qcow2 performs fewer metadata operations overall.

   Performance boost.

 * Bigger clusters increase the cost of allocating a new cluster.  For
   example, an 8 KB write to a new cluster incurs a 1 MB write to the
   image file (the untouched regions are filled with zeros).  This can
   be optimized in some cases but not everywhere (e.g. reallocating a
   data cluster versus extending the image file size and relying on the
   file system to provide zeroed space).  It is especially expensive
   when a backing file is in use, because up to 1 MB of the backing
   file needs to be read to populate the newly allocated cluster!  (A
   back-of-the-envelope sketch follows this list.)

   Performance loss.

 * Bigger clusters can reduce fragmentation of data on the physical
   disk.  The file system sees fewer, bigger allocating writes and is
   therefore able to allocate more contiguous data - less fragmentation.

   Performance boost.

 * Bigger clusters reduce the compactness of sparse files.  You use
   more image file space on the host file system when the cluster size
   is large.

   Space efficiency loss.
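
To put rough numbers on the allocation trade-off, here is a small
back-of-the-envelope sketch in Python (illustrative only: the 4 KB
request size and the cluster sizes are assumptions, not measurements,
and metadata I/O is ignored):

    # Write amplification of a single allocating write in qcow2: the
    # first write into an untouched cluster forces the whole cluster
    # to be written out (data plus zero fill for the untouched region).

    def allocation_amplification(request_bytes: int, cluster_bytes: int) -> float:
        """Bytes written to the image file per byte of guest data."""
        return cluster_bytes / request_bytes

    for cluster_kb in (64, 1024):  # 64 KB vs 1 MB (1024 KB) clusters
        amp = allocation_amplification(4 * 1024, cluster_kb * 1024)
        print(f"4 KB allocating write, {cluster_kb} KB cluster: {amp:.0f}x")

    # Output:
    # 4 KB allocating write, 64 KB cluster: 16x
    # 4 KB allocating write, 1024 KB cluster: 256x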

Here's a scenario where a 1 MB cluster size is great compared to a
small cluster size:

You have a fully allocated qcow2 image, so you will never need to do
any allocating writes.

Here's a scenario where a 1 MB cluster size is terrible compared to a
small cluster size:

You have an empty qcow2 file and perform 4 KB writes to the first sector
of each 1 MB chunk, and there is a backing file.
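
To put numbers on that worst case, here is a rough sketch (it assumes
the untouched part of each cluster is read from the backing file and
the full cluster is then written to the image file; real qcow2 may
optimize parts of this):

    # Worst case: empty qcow2 with a backing file, 1 MB clusters, and
    # one 4 KB write landing in each 1 MB chunk.  Every write
    # allocates a cluster: qcow2 populates the untouched region from
    # the backing file and writes the full cluster out.

    REQ = 4 * 1024            # guest write size
    CLUSTER = 1024 * 1024     # qcow2 cluster size

    backing_read = CLUSTER - REQ   # bytes read from the backing file
    image_write = CLUSTER          # bytes written to the image file
    amplification = (backing_read + image_write) / REQ

    print(f"{amplification:.0f}x host I/O per 4 KB guest write")
    # -> 511x host I/O per 4 KB guest write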

So it depends on the application.

Stefan


* Re: [Qemu-devel] Cluster_size parameter issue on qcow2 image format
  2012-02-23  8:32 ` Stefan Hajnoczi
@ 2012-02-23 10:02   ` PANKAJ RAWAT
  2012-02-23 10:16     ` Stefan Hajnoczi
  0 siblings, 1 reply; 5+ messages in thread
From: PANKAJ RAWAT @ 2012-02-23 10:02 UTC (permalink / raw)
  To: Stefan Hajnoczi; +Cc: qemu-devel


Thanks for the reply.

I am not using a backing file.  My concern is the growing image file:
performance with a 64K cluster size is better than with 32K, 1M, or 2M.

Is the degradation in performance only due to the allocation of large
clusters during expansion of the qcow2 image?

The trend is the same for both sequential writes and random writes of
1 GB of data.  For random writes I can understand it, given the
sparseness of the data, but for sequential writes I don't understand
it, since the writes are performed sequentially.

Is there a reason behind this, or am I not getting it right?

-- 
Pankaj Rawat



* Re: [Qemu-devel] Cluster_size parameter issue on qcow2 image format
  2012-02-23 10:02   ` PANKAJ RAWAT
@ 2012-02-23 10:16     ` Stefan Hajnoczi
  2012-02-23 11:08       ` PANKAJ RAWAT
  0 siblings, 1 reply; 5+ messages in thread
From: Stefan Hajnoczi @ 2012-02-23 10:16 UTC (permalink / raw)
  To: PANKAJ RAWAT; +Cc: qemu-devel

On Thu, Feb 23, 2012 at 10:02 AM, PANKAJ RAWAT <pankajr141@gmail.com> wrote:
> Is the degradation in performance only due to the allocation of large
> clusters during expansion of the qcow2 image?
>
> The trend is the same for both sequential writes and random writes of
> 1 GB of data.  For random writes I can understand it, given the
> sparseness of the data, but for sequential writes I don't understand
> it, since the writes are performed sequentially.
>
> Is there a reason behind this, or am I not getting it right?

Sequential writes still require qcow2 to allocate clusters.

The first write request that touches a new cluster causes qcow2 to
allocate the full 1 MB.  Then the next few sequential write requests
overwrite in-place (these requests do not suffer allocation overhead).

Now if you imagine doing n sequential 4 KB requests in the guest with
a 1 MB cluster size, you should find that the host is doing roughly
(n * 4 KB / 1 MB) * (1 MB - 4 KB), i.e. almost an extra n * 4 KB, of
I/O to the image file because it is zeroing each allocated cluster!
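
As a rough illustration (a sketch under the simplifying assumption
that allocation writes the full cluster, every later write in the
same cluster goes in place, and metadata I/O is ignored):

    # Host I/O for 1 GB of sequential 4 KB guest writes into an empty
    # qcow2 image with 1 MB clusters: the first write per cluster
    # triggers a full 1 MB allocation write; the remaining 255 writes
    # per cluster are plain 4 KB in-place overwrites.

    REQ = 4 * 1024           # guest request size
    CLUSTER = 1024 * 1024    # qcow2 cluster size
    TOTAL = 1 << 30          # 1 GB of sequential guest data

    clusters = TOTAL // CLUSTER              # allocating writes
    in_place = TOTAL - clusters * REQ        # bytes written in place
    host_io = clusters * CLUSTER + in_place  # bytes hitting the image

    print(f"guest data: {TOTAL >> 20} MB, host I/O: {host_io >> 20} MB, "
          f"{host_io / TOTAL:.2f}x amplification")
    # -> guest data: 1024 MB, host I/O: 2044 MB, 2.00x amplification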

Linux I/O requests tend to be 128 or 256 KB maximum with virtio-blk.
So even if your request size in guest userspace is 1 MB you're
probably doing multiple virtio-blk requests underneath.  Therefore you
are hitting the sequential allocating write pattern I described above.
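
For instance (the 256 KB cap here is one of the illustrative values
above, not a measured limit):

    # A 1 MB userspace write split at a 256 KB virtio-blk request cap:
    # only the first request touching a given 1 MB cluster pays the
    # allocation cost; the rest overwrite in place.

    USERSPACE_REQ = 1024 * 1024
    VIRTIO_CAP = 256 * 1024

    print(USERSPACE_REQ // VIRTIO_CAP, "virtio-blk requests per 1 MB write")
    # -> 4 virtio-blk requests per 1 MB write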

The exact overhead depends on your application's I/O request pattern,
but it's unsurprising that you see a performance impact.

Stefan


* Re: [Qemu-devel] Cluster_size parameter issue on qcow2 image format
  2012-02-23 10:16     ` Stefan Hajnoczi
@ 2012-02-23 11:08       ` PANKAJ RAWAT
  0 siblings, 0 replies; 5+ messages in thread
From: PANKAJ RAWAT @ 2012-02-23 11:08 UTC (permalink / raw)
  To: Stefan Hajnoczi; +Cc: qemu-devel


Thanks for the reply.

-- 
Pankaj Rawat


