linux-kernel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Andy Isaacson <adi@hexapodia.org>
To: "Jörn Engel" <joern@wohnheim.fh-wedel.de>
Cc: linux-kernel@vger.kernel.org
Subject: Re: Is there a "make hole" (truncate in middle) syscall?
Date: Fri, 12 Dec 2003 09:40:31 -0600	[thread overview]
Message-ID: <20031212094031.B1303@hexapodia.org> (raw)
In-Reply-To: <20031212121824.GA6112@wohnheim.fh-wedel.de>; from joern@wohnheim.fh-wedel.de on Fri, Dec 12, 2003 at 01:18:24PM +0100

On Fri, Dec 12, 2003 at 01:18:24PM +0100, Jörn Engel wrote:
> On Thu, 11 December 2003 13:58:54 -0600, Andy Isaacson wrote:
> > On Thu, Dec 11, 2003 at 08:48:15PM +0100, Jörn Engel wrote:
> > > If you really do it, please don't add a syscall for it.  Simply check
> > > each written page if it is completely filled with zero.  (This will be
> > > a very quick check for most pages, as they will contain something
> > > nonzero in the first couple of words)
> > 
> > Um, no.
> > 
> > That is a very bad idea.  Your suggestion would make it impossible to
> > actually write a block of all-zeros to the disk.  That makes it
> > impossible to pre-allocate disk space.
> 
> How about implementing it inside the individual filesystems?  Then
> each filesystem can figure out a logic that suits it's special needs.
> 
> What I would sometimes like to have goes even beyond this.  Create a
> simple hash for each size-x chunk of a file, and check against those
> hashes whenever writing.  If hashes are identical, compare the chunks
> and if those are identical as well, just create another link to that
> chunk.  Kinda like rsync on a filesystem level, only without the
> rolling checksum thing.

A related idea was reportedly used in the Venti filesystem, which was
discussed on linux-kernel back in October; look for the thread
"Transparent compression in the FS".

The downsides are pretty substantial (but the upsides are too).  You
don't know how many blocks are available on the filesystem for you to
write, because when you write you might not allocate blocks.  And you
lose disk-streaming-perfomance, because you're going to be seeking all
over the disk picking up the blocks for your files.

> > Another syscall is precisely the correct thing to do.  (I don't think
> > make_hole() is a special case of any extant syscall.)
> 
> Depends.  My proposal has a bunch of problems.  We won't have it
> implemented by next year.  I buy all that.  Maybe we can do it with
> 10% kernel code and 90% userspace, maybe not.  Most likely the first
> couple of implementations create more problems than they solve, yes.
> 
> But we should get there some day.  Having 15 nearly identical copies
> of the kernel on my notebook is a pain and hard links simply have the
> wrong semantics.

I don't know about you, but I don't have 15 nearly identical copies of
the kernel; I have 30 copies that have almost no text in common, and
certainly have no blocks in common -- they result from independent
compilations, and the resulting bzImage files are not duplicates.

-andy

  reply	other threads:[~2003-12-12 15:40 UTC|newest]

Thread overview: 59+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2003-12-04 20:32 Is there a "make hole" (truncate in middle) syscall? Rob Landley
2003-12-04 20:55 ` Måns Rullgård
2003-12-04 21:10 ` Szakacsits Szabolcs
2003-12-05  0:02   ` Rob Landley
2003-12-04 22:33     ` Szakacsits Szabolcs
2003-12-05 11:22     ` Helge Hafting
2003-12-05 12:11   ` Måns Rullgård
2003-12-05 22:41     ` Mike Fedyk
2003-12-05 23:25       ` Måns Rullgård
2003-12-05 23:33       ` Szakacsits Szabolcs
2003-12-05 23:25     ` Szakacsits Szabolcs
2003-12-04 21:48 ` Mike Fedyk
2003-12-04 23:59   ` Rob Landley
2003-12-05 22:42     ` Olaf Titz
2003-12-04 22:53 ` Peter Chubb
2003-12-05  1:04   ` Philippe Troin
2003-12-05  2:39     ` Peter Chubb
2003-12-08  4:03     ` bill davidsen
2003-12-04 23:23 ` Andy Isaacson
2003-12-04 23:42   ` Szakacsits Szabolcs
2003-12-05  2:03     ` Mike Fedyk
2003-12-05  7:09       ` Ville Herva
2003-12-05 11:22   ` Anton Altaparmakov
2003-12-05 11:44     ` viro
2003-12-05 14:27       ` Anton Altaparmakov
2003-12-05 21:00   ` sparse file performance (was Re: Is there a "make hole" (truncate in middle) syscall?) Andy Isaacson
2003-12-05 21:12     ` Linus Torvalds
2003-12-08 20:43       ` Andy Isaacson
2003-12-11  5:13 ` Is there a "make hole" (truncate in middle) syscall? Hua Zhong
2003-12-11  6:19   ` Rob Landley
2003-12-11 18:58   ` Andy Isaacson
2003-12-11 19:15     ` Hua Zhong
2003-12-11 19:43       ` Andreas Dilger
2003-12-12 21:37         ` Daniel Phillips
2003-12-11 19:48       ` Jörn Engel
2003-12-11 19:55         ` Hua Zhong
2003-12-11 19:58         ` Andy Isaacson
2003-12-12 12:18           ` Jörn Engel
2003-12-12 15:40             ` Andy Isaacson [this message]
2003-12-12 16:03               ` Jörn Engel
2003-12-11 20:32         ` Rob Landley
2003-12-12 12:55           ` Jörn Engel
2003-12-12 13:28             ` Vladimir Saveliev
2003-12-12 13:43               ` Jörn Engel
2003-12-12 13:52                 ` Vladimir Saveliev
2003-12-12 14:04                   ` Jörn Engel
2003-12-12 13:53               ` Rob Landley
2003-12-12 14:01                 ` Vladimir Saveliev
2003-12-12 21:35                   ` Rob Landley
2003-12-15 10:00                     ` Vladimir Saveliev
2003-12-15 11:52                       ` Rob Landley
2003-12-15 13:26                         ` Jörn Engel
2003-12-12 13:39             ` Rob Landley
2003-12-12 13:56               ` Jörn Engel
2003-12-12 14:24                 ` Jörn Engel
2003-12-12 21:37                   ` Rob Landley
2003-12-15 12:47                     ` Jörn Engel
2003-12-16  5:43                       ` Rob Landley
2003-12-16 11:05                         ` Jörn Engel

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20031212094031.B1303@hexapodia.org \
    --to=adi@hexapodia.org \
    --cc=joern@wohnheim.fh-wedel.de \
    --cc=linux-kernel@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).