linux-xfs.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: "Darrick J. Wong" <darrick.wong@oracle.com>
To: Gionatan Danti <g.danti@assyoma.it>
Cc: linux-xfs@vger.kernel.org
Subject: Re: XFS reflink vs ThinLVM
Date: Fri, 17 Jan 2020 15:42:19 -0800	[thread overview]
Message-ID: <20200117234219.GM8257@magnolia> (raw)
In-Reply-To: <e3dd598260d9f92c3b2c91cb81540e37@assyoma.it>

On Fri, Jan 17, 2020 at 10:58:15PM +0100, Gionatan Danti wrote:
> Il 15-01-2020 18:45 Gionatan Danti ha scritto:
> > Let me briefly describe the expected workload: thinly provisioned
> > virtual image storage. The problem with "plain" sparse file (ie:
> > without extsize hint) is that, after some time, the underlying vdisk
> > file will be very fragmented: consecutive physical blocks will be
> > assigned to very different logical blocks, leading to sub-par
> > performance when reading back the whole file (eg: for backup purpose).
> > 
> > I can easily simulate a worst-case scenario with fio, issuing random
> > write to a pre-created sparse file. While the random writes complete
> > very fast (because they are more-or-less sequentially written inside
> > the sparse file), reading back that file will have very low
> > performance: 10 MB/s vs 600+ MB/s for a preallocated file.
> 
> I would like to share some other observation/results, which I hope can be
> useful for other peoples.
> 
> Further testing shows that "cp --reflink" an highly fragmented files is a
> relatively long operation, easily in the range of 30s or more, during which
> the guest virtual machine is basically denied any access to the underlying
> virtual disk file.

How many fragments, and how big of a sparse file?

--D

> While the number of fragments required to reach reflink time of 30+ seconds
> is very high, this would be a quite common case when using thinly
> provisioned virtual disk files. With sparse file, any write done at guest OS
> level has a very good chance to create its own fragment (ie: allocating a
> discontiguous chunk as seen by logical/physical block mapping), leading to
> very fragmented files.
> 
> So, back to main topic: reflink is an invaluable tool, to be used *with*
> (rather than instead of) thin lvm:
> - thinlvm is the right tool for taking rolling volume snapshot;
> - reflink is extremely useful for "on-demand" snapshot of key files.
> 
> Thank you all for the very detailed and useful information you provided.
> Regards.
> 
> -- 
> Danti Gionatan
> Supporto Tecnico
> Assyoma S.r.l. - www.assyoma.it
> email: g.danti@assyoma.it - info@assyoma.it
> GPG public key ID: FF5F32A8

  reply	other threads:[~2020-01-17 23:44 UTC|newest]

Thread overview: 20+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2020-01-13 10:22 XFS reflink vs ThinLVM Gionatan Danti
2020-01-13 11:10 ` Carlos Maiolino
2020-01-13 11:25   ` Gionatan Danti
2020-01-13 11:43     ` Carlos Maiolino
2020-01-13 12:21       ` Gionatan Danti
2020-01-13 15:34         ` Gionatan Danti
2020-01-13 16:53           ` Darrick J. Wong
2020-01-13 17:00             ` Gionatan Danti
2020-01-13 18:09               ` Darrick J. Wong
2020-01-14  8:45                 ` Gionatan Danti
2020-01-15 11:37                   ` Gionatan Danti
2020-01-15 16:39                     ` Darrick J. Wong
2020-01-15 17:45                       ` Gionatan Danti
2020-01-17 21:58                         ` Gionatan Danti
2020-01-17 23:42                           ` Darrick J. Wong [this message]
2020-01-18 11:08                             ` Gionatan Danti
2020-01-18 23:06                               ` Darrick J. Wong
2020-01-19  8:45                                 ` Gionatan Danti
2020-01-13 16:14 ` Chris Murphy
2020-01-13 16:25   ` Gionatan Danti

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20200117234219.GM8257@magnolia \
    --to=darrick.wong@oracle.com \
    --cc=g.danti@assyoma.it \
    --cc=linux-xfs@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).