linux-api.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Jeff Layton <jlayton@kernel.org>
To: John Stoffel <john@stoffel.org>
Cc: tytso@mit.edu, adilger.kernel@dilger.ca, djwong@kernel.org,
	david@fromorbit.com, trondmy@hammerspace.com, neilb@suse.de,
	viro@zeniv.linux.org.uk, zohar@linux.ibm.com, xiubli@redhat.com,
	chuck.lever@oracle.com, lczerner@redhat.com, jack@suse.cz,
	brauner@kernel.org, linux-man@vger.kernel.org,
	linux-api@vger.kernel.org, linux-btrfs@vger.kernel.org,
	linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org,
	linux-ceph@vger.kernel.org, linux-ext4@vger.kernel.org,
	linux-nfs@vger.kernel.org, linux-xfs@vger.kernel.org
Subject: Re: [man-pages PATCH] statx, inode: document the new STATX_INO_VERSION field
Date: Mon, 29 Aug 2022 08:37:33 -0400	[thread overview]
Message-ID: <bffe16482bcb3e6a69378e821e76182be21c7d1b.camel@kernel.org> (raw)
In-Reply-To: <25355.34889.890961.350510@quad.stoffel.home>

On Sun, 2022-08-28 at 11:22 -0400, John Stoffel wrote:
> > > > > > "Jeff" == Jeff Layton <jlayton@kernel.org> writes:
> 
> Jeff> We're planning to expose the inode change attribute via statx.
> Document
> Jeff> what this value means and what an observer can infer from a
> change in
> Jeff> its value.
> 
> It might be nice to put in some more example verbiage of how this
> would be used by userland.  For example, if you do a statx() call and
> notice that the ino_version has changed... what would you do next to
> find out what changed?  
> 
> Would you have to keep around an old copy of the statx() results and
> then compare them to find the changes?  When talking to userland
> people, don't assume they know anything about the kernel internals
> here.  
> 

How you'd use this really depends on the application, but yes, you'd
need to at least know what an old stx_ino_version was in order to detect
that there has been a change to it.

Today, i_version is mostly of use for knowing when you should invalidate
cached file info. Think of it as something like ctime, but with infinite
granularity. If it changes then something substantive _might_ have
changed in the inode. It's possible it's nothing your application cares
about, so you'd likely have to deal with "false" changes to this anyway.

In the case of NFS, it will invalidate its data and metadata caches when
this value changes. This is why this why false iversion bumps are so
expensive for NFS, particularly if you're dealing with large files.
Everything has to be re-fetched from the server when it changes.

In the case of IMA, it will re-measure the data in the inode to make
sure that it hasn't changed. That involves reading the whole file in and
running a checksum over it.

You could envision a backup tool using this to do incremental backups,
for instance. Keep a db of stx_ino:stx_ino_version for the files and you
could scan a filesystem and see which files need their backups updated.

Or, use it as a way to do stable file copies: Get the stx_ino_version
for a file, copy it somewhere and then get the stx_ino_version again.
Did it change? Redo the copy. That might be a nice option to add to
rsync, actually.

> 
> Jeff> Signed-off-by: Jeff Layton <jlayton@kernel.org>
> Jeff> ---
> Jeff>  man2/statx.2 | 13 +++++++++++++
> Jeff>  man7/inode.7 | 10 ++++++++++
> Jeff>  2 files changed, 23 insertions(+)
> 
> Jeff> diff --git a/man2/statx.2 b/man2/statx.2
> Jeff> index 0d1b4591f74c..644fb251f114 100644
> Jeff> --- a/man2/statx.2
> Jeff> +++ b/man2/statx.2
> Jeff> @@ -62,6 +62,7 @@ struct statx {
> Jeff>      __u32 stx_dev_major;   /* Major ID */
> Jeff>      __u32 stx_dev_minor;   /* Minor ID */
> Jeff>      __u64 stx_mnt_id;      /* Mount ID */
> Jeff> +    __u64 stx_ino_version; /* Inode change attribute */
> Jeff>  };
> Jeff>  .EE
> Jeff>  .in
> Jeff> @@ -247,6 +248,7 @@ STATX_BTIME	Want stx_btime
> Jeff>  STATX_ALL	The same as STATX_BASIC_STATS | STATX_BTIME.
> Jeff>  	It is deprecated and should not be used.
> Jeff>  STATX_MNT_ID	Want stx_mnt_id (since Linux 5.8)
> Jeff> +STATX_INO_VERSION	Want stx_ino_version (since Linux
> 6.1)
> Jeff>  .TE
> Jeff>  .in
> Jeff>  .PP
> Jeff> @@ -411,6 +413,17 @@ and corresponds to the number in the first
> field in one of the records in
> Jeff>  For further information on the above fields, see
> Jeff>  .BR inode (7).
> Jeff>  .\"
> Jeff> +.TP
> Jeff> +.I stx_ino_version
> Jeff> +The inode version, also known as the inode change attribute.
> This
> Jeff> +value is intended to change any time there is an inode status
> change. Any
> Jeff> +operation that would cause the stx_ctime to change should also
> cause
> Jeff> +stx_ino_version to change, even when there is no apparent
> change to the
> Jeff> +stx_ctime due to timestamp granularity.
> Jeff> +.IP
> Jeff> +Note that an observer cannot infer anything about the nature or
> Jeff> +magnitude of the change from the value of this field. A change
> in this value
> Jeff> +only indicates that there may have been an explicit change in
> the inode.
> Jeff>  .SS File attributes
> Jeff>  The
> Jeff>  .I stx_attributes
> Jeff> diff --git a/man7/inode.7 b/man7/inode.7
> Jeff> index 9b255a890720..d296bb6df70c 100644
> Jeff> --- a/man7/inode.7
> Jeff> +++ b/man7/inode.7
> Jeff> @@ -184,6 +184,16 @@ Last status change timestamp (ctime)
> Jeff>  This is the file's last status change timestamp.
> Jeff>  It is changed by writing or by setting inode information
> Jeff>  (i.e., owner, group, link count, mode, etc.).
> Jeff> +.TP
> Jeff> +Inode version (i_version)
> Jeff> +(not returned in the \fIstat\fP structure);
> \fIstatx.stx_ino_version\fP
> Jeff> +.IP
> Jeff> +This is the inode change attribute. Any operation that would
> result in a ctime
> Jeff> +change should also result in a change to this value. The value
> must change even
> Jeff> +in the case where the ctime change is not evident due to
> timestamp granularity.
> Jeff> +An observer cannot infer anything from the actual value about
> the nature or
> Jeff> +magnitude of the change. If it is different from the last time
> it was checked,
> Jeff> +then something may have made an explicit change to the inode.
> Jeff>  .PP
> Jeff>  The timestamp fields report time measured with a zero point at
> the
> Jeff>  .IR Epoch ,
> Jeff> -- 
> Jeff> 2.37.2
> 

-- 
Jeff Layton <jlayton@kernel.org>

      reply	other threads:[~2022-08-29 12:51 UTC|newest]

Thread overview: 3+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2022-08-26 21:47 [man-pages PATCH] statx, inode: document the new STATX_INO_VERSION field Jeff Layton
2022-08-28 15:22 ` John Stoffel
2022-08-29 12:37   ` Jeff Layton [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=bffe16482bcb3e6a69378e821e76182be21c7d1b.camel@kernel.org \
    --to=jlayton@kernel.org \
    --cc=adilger.kernel@dilger.ca \
    --cc=brauner@kernel.org \
    --cc=chuck.lever@oracle.com \
    --cc=david@fromorbit.com \
    --cc=djwong@kernel.org \
    --cc=jack@suse.cz \
    --cc=john@stoffel.org \
    --cc=lczerner@redhat.com \
    --cc=linux-api@vger.kernel.org \
    --cc=linux-btrfs@vger.kernel.org \
    --cc=linux-ceph@vger.kernel.org \
    --cc=linux-ext4@vger.kernel.org \
    --cc=linux-fsdevel@vger.kernel.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-man@vger.kernel.org \
    --cc=linux-nfs@vger.kernel.org \
    --cc=linux-xfs@vger.kernel.org \
    --cc=neilb@suse.de \
    --cc=trondmy@hammerspace.com \
    --cc=tytso@mit.edu \
    --cc=viro@zeniv.linux.org.uk \
    --cc=xiubli@redhat.com \
    --cc=zohar@linux.ibm.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).