All of lore.kernel.org
 help / color / mirror / Atom feed
From: Austin S Hemmelgarn <ahferroin7@gmail.com>
To: Christoph Hellwig <hch@infradead.org>
Cc: Andy Lutomirski <luto@amacapital.net>,
	Anna Schumaker <Anna.Schumaker@netapp.com>,
	"Darrick J. Wong" <darrick.wong@oracle.com>,
	linux-nfs@vger.kernel.org,
	Linux btrfs Developers List <linux-btrfs@vger.kernel.org>,
	Linux FS Devel <linux-fsdevel@vger.kernel.org>,
	Linux API <linux-api@vger.kernel.org>, Zach Brown <zab@zabbo.net>,
	Al Viro <viro@zeniv.linux.org.uk>, Chris Mason <clm@fb.com>,
	Michael Kerrisk-manpages <mtk.manpages@gmail.com>,
	andros@netapp.com
Subject: Re: [PATCH v5 8/9] vfs: Add vfs_copy_file_range() support for pagecache copies
Date: Fri, 16 Oct 2015 10:11:23 -0400	[thread overview]
Message-ID: <5621058B.1010704@gmail.com> (raw)
In-Reply-To: <20151016131250.GA15345@infradead.org>

[-- Attachment #1: Type: text/plain, Size: 3446 bytes --]

On 2015-10-16 09:12, Christoph Hellwig wrote:
> On Fri, Oct 16, 2015 at 08:50:41AM -0400, Austin S Hemmelgarn wrote:
>> Certain parts of userspace do try to reflink things instead of copying (for
>> example, coreutils recently started doing so in mv and has had the option to
>> do so with cp for a while now), but a properly designed general purpose
>> filesystem does not and should not do this without the user telling it to do
>> so.
>
> But they do.  Get out of your narrow local Linux file system view.
> Every all flash array or hyperconverge hypervisor will dedeup the hell
> out of your data, heck some SSDs even do it on the device.  Your NFS or
> CIFS server already does or soon will do dedup and reflinks behind the
> scenes, that's the whole point of adding these features to the protocol.
Unless things have significantly changed on Windows and OS X, NTFS and 
HFS+ do not do automatic data deduplication (I'm not sure whether either 
even supports reflinks, although NTFS is at least partly COW), and I 
know for certain that FAT, UDF, Minix, BeFS, and Venti do not do so. 
NFS and CIFS/SMB both have support in the protocol, but unless either 
the client asks for it specifically, or the server is manually 
configured to do it automatically (although current versions of Windows 
server might do it by default, but if they do it is not documented 
anywhere I've seen), they don't do it.  9P has no provisions for 
reflinks/deduplication.  AFS/Coda/Ceph/Lustre/GFS2 might do 
deduplication, but I'm pretty certain that they do not do so by default, 
and even then they really don't fit the 'general purpose' bit in my 
statement above.  So, overall, my statement still holds for any widely 
used filesystem technology that is actually 'general purpose'.

Furthermore, if you actually read my statement, you will notice that I 
only said that _filesystems_ should not do it without being told to do 
so, and (intentionally) said absolutely nothing about any kind of 
storage devices or virtualization.  Ideally, SSD's really shouldn't do 
it either unless they have a 100% guarantee that the entire block going 
bad will not render the data unrecoverable (most do in fact use ECC 
internally, but they typically only handle two or three bad bits out of 
a full byte).  And as far as hypervisors go, a good storage hypervisor 
should be providing some guarantee of reliability, which means either it 
is already storing multiple copies of _everything_ or using some form of 
erasure coding so that it can recover from issues with the underlying 
storage devices without causing issues for higher levels, thus meaning 
that deduplication in that context is safe for all intents and purposes.
> And except for the odd fear or COW or dedup, and the ENOSPC issue for
> which we have a flag with a very well defined meaning I've still not
> heard any good arguments against it.
Most people who I know who demonstrate this fear are just fine with COW, 
it's the deduplication that they're terrified of, and TBH that's largely 
because they've only ever seen it used in unsafe ways.  My main argument 
(which I admittedly have not really stated properly at all during this 
discussion) is that almost everyone is likely to jump on this, which 
_will_ change long established semantics in many things that switch to 
this, and there will almost certainly be serious backlash from that.


[-- Attachment #2: S/MIME Cryptographic Signature --]
[-- Type: application/pkcs7-signature, Size: 3019 bytes --]

  reply	other threads:[~2015-10-16 14:12 UTC|newest]

Thread overview: 129+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2015-09-30 17:26 [PATCH v5 0/9] VFS: In-kernel copy system call Anna Schumaker
2015-09-30 17:26 ` Anna Schumaker
2015-09-30 17:26 ` Anna Schumaker
2015-09-30 17:26 ` [PATCH v5 1/9] vfs: add copy_file_range syscall and vfs helper Anna Schumaker
2015-09-30 17:26   ` Anna Schumaker
2015-09-30 17:26 ` [PATCH v5 2/9] x86: add sys_copy_file_range to syscall tables Anna Schumaker
2015-09-30 17:26   ` Anna Schumaker
2015-09-30 17:26 ` [PATCH v5 3/9] btrfs: add .copy_file_range file operation Anna Schumaker
2015-09-30 17:26   ` Anna Schumaker
2015-09-30 17:26 ` [PATCH v5 4/9] vfs: Copy should check len after file open mode Anna Schumaker
2015-09-30 17:26   ` Anna Schumaker
2015-09-30 17:26   ` Anna Schumaker
2015-10-11 14:22   ` Christoph Hellwig
2015-10-11 14:22     ` Christoph Hellwig
2015-09-30 17:26 ` [PATCH v5 5/9] vfs: Copy shouldn't forbid ranges inside the same file Anna Schumaker
2015-09-30 17:26   ` Anna Schumaker
2015-10-11 14:22   ` Christoph Hellwig
2015-10-14 17:37     ` Anna Schumaker
2015-10-14 17:37       ` Anna Schumaker
2015-10-14 17:37       ` Anna Schumaker
2015-10-14 18:25       ` Christoph Hellwig
2015-10-14 18:27         ` Anna Schumaker
2015-10-14 18:27           ` Anna Schumaker
2015-10-14 18:27           ` Anna Schumaker
2015-09-30 17:26 ` [PATCH v5 6/9] vfs: Copy should use file_out rather than file_in Anna Schumaker
2015-09-30 17:26   ` Anna Schumaker
2015-10-11 14:24   ` Christoph Hellwig
2015-09-30 17:26 ` [PATCH v5 7/9] vfs: Remove copy_file_range mountpoint checks Anna Schumaker
2015-09-30 17:26   ` Anna Schumaker
2015-10-11 14:23   ` Christoph Hellwig
2015-10-14 17:41     ` Anna Schumaker
2015-10-14 17:41       ` Anna Schumaker
2015-10-14 18:25       ` Christoph Hellwig
2015-10-14 18:25         ` Christoph Hellwig
2015-09-30 17:26 ` [PATCH v5 8/9] vfs: Add vfs_copy_file_range() support for pagecache copies Anna Schumaker
2015-09-30 17:26   ` Anna Schumaker
2015-09-30 17:26   ` Anna Schumaker
2015-10-08  1:40   ` Neil Brown
2015-10-09 11:15     ` Pádraig Brady
2015-10-09 11:15       ` Pádraig Brady
2015-10-13 20:25       ` Anna Schumaker
2015-10-13 20:25         ` Anna Schumaker
2015-10-14  7:41         ` Christoph Hellwig
2015-10-14  7:41           ` Christoph Hellwig
2015-10-13 19:45     ` Anna Schumaker
2015-10-13 19:45       ` Anna Schumaker
2015-10-13 19:45       ` Anna Schumaker
2015-10-11 14:22   ` Christoph Hellwig
2015-10-11 14:22     ` Christoph Hellwig
2015-10-12 23:17     ` Darrick J. Wong
2015-10-12 23:17       ` Darrick J. Wong
2015-10-13  3:36       ` Trond Myklebust
2015-10-13  7:19         ` Darrick J. Wong
2015-10-13  7:19           ` Darrick J. Wong
2015-10-13  7:30         ` Christoph Hellwig
2015-10-13  7:30           ` Christoph Hellwig
2015-10-13  7:27       ` Christoph Hellwig
2015-10-13  7:27         ` Christoph Hellwig
2015-11-10  6:24         ` Darrick J. Wong
2015-10-14 17:59       ` Anna Schumaker
2015-10-14 17:59         ` Anna Schumaker
2015-10-14 17:59         ` Anna Schumaker
2015-10-14 18:08         ` Andy Lutomirski
2015-10-14 18:27           ` Christoph Hellwig
2015-10-14 18:38             ` Andy Lutomirski
2015-10-14 18:38               ` Andy Lutomirski
2015-10-14 18:49               ` Christoph Hellwig
2015-10-14 18:49                 ` Christoph Hellwig
2015-10-14 18:53                 ` Andy Lutomirski
2015-10-14 19:14                   ` Austin S Hemmelgarn
2015-10-14 19:14                     ` Austin S Hemmelgarn
2015-10-14 19:39                     ` Pádraig Brady
2015-10-14 19:39                       ` Pádraig Brady
2015-10-15  5:56                   ` Christoph Hellwig
2015-10-15  5:56                     ` Christoph Hellwig
2015-10-14 19:08             ` Austin S Hemmelgarn
2015-10-14 19:08               ` Austin S Hemmelgarn
2015-10-15  6:36               ` Christoph Hellwig
2015-10-15  6:36                 ` Christoph Hellwig
2015-10-15 12:24                 ` Austin S Hemmelgarn
2015-10-15 12:24                   ` Austin S Hemmelgarn
2015-10-16  5:38                   ` Christoph Hellwig
2015-10-16  5:38                     ` Christoph Hellwig
2015-10-16 11:46                     ` Austin S Hemmelgarn
2015-10-16 12:02                       ` Pádraig Brady
2015-10-16 12:02                         ` Pádraig Brady
2015-10-16 12:24                         ` Christoph Hellwig
2015-10-16 12:24                           ` Christoph Hellwig
2015-10-16 12:46                           ` Austin S Hemmelgarn
2015-10-16 12:46                             ` Austin S Hemmelgarn
2015-10-16 12:21                       ` Christoph Hellwig
2015-10-16 12:21                         ` Christoph Hellwig
2015-10-16 12:50                         ` Austin S Hemmelgarn
2015-10-16 13:12                           ` Christoph Hellwig
2015-10-16 14:11                             ` Austin S Hemmelgarn [this message]
2015-10-14 18:11         ` Darrick J. Wong
2015-10-14 18:11           ` Darrick J. Wong
2015-10-14 18:26           ` Andy Lutomirski
2015-09-30 17:26 ` [PATCH v5 9/9] btrfs: btrfs_copy_file_range() only supports reflinks Anna Schumaker
2015-09-30 17:26   ` Anna Schumaker
2015-10-11 14:29   ` Christoph Hellwig
2015-10-11 14:29     ` Christoph Hellwig
2015-10-12 10:23     ` Pádraig Brady
2015-10-12 10:23       ` Pádraig Brady
2015-10-12 14:34       ` Christoph Hellwig
2015-10-12 23:41         ` Darrick J. Wong
2015-10-12 23:41           ` Darrick J. Wong
2015-10-13  7:29           ` Christoph Hellwig
2015-10-13  7:29             ` Christoph Hellwig
2015-10-14 18:46             ` Darrick J. Wong
2015-10-14 18:46               ` Darrick J. Wong
2015-10-15  6:00               ` Christoph Hellwig
2015-10-16 11:49                 ` Chris Mason
2015-10-16 11:49                   ` Chris Mason
2015-10-16 11:49                   ` Chris Mason
2015-10-16 12:25                   ` Christoph Hellwig
2015-10-16 12:25                     ` Christoph Hellwig
2015-10-16 13:19                     ` Chris Mason
2015-10-16 13:19                       ` Chris Mason
2015-10-16 13:19                       ` Chris Mason
2015-10-16 21:44                       ` Dave Chinner
2015-10-17 13:44                         ` Chris Mason
2015-10-17 13:44                           ` Chris Mason
2015-10-17 13:44                           ` Chris Mason
2015-10-15  8:35               ` Dave Chinner
2015-10-15  8:35                 ` Dave Chinner
2015-09-30 17:26 ` [PATCH v5 10/9] copy_file_range.2: New page documenting copy_file_range() Anna Schumaker
2015-09-30 17:26   ` Anna Schumaker
2015-09-30 17:26   ` Anna Schumaker

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=5621058B.1010704@gmail.com \
    --to=ahferroin7@gmail.com \
    --cc=Anna.Schumaker@netapp.com \
    --cc=andros@netapp.com \
    --cc=clm@fb.com \
    --cc=darrick.wong@oracle.com \
    --cc=hch@infradead.org \
    --cc=linux-api@vger.kernel.org \
    --cc=linux-btrfs@vger.kernel.org \
    --cc=linux-fsdevel@vger.kernel.org \
    --cc=linux-nfs@vger.kernel.org \
    --cc=luto@amacapital.net \
    --cc=mtk.manpages@gmail.com \
    --cc=viro@zeniv.linux.org.uk \
    --cc=zab@zabbo.net \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.