From: Jeff Layton <jlayton@redhat.com>
To: linux-fsdevel@vger.kernel.org
Cc: linux-kernel@vger.kernel.org, linux-nfs@vger.kernel.org,
	linux-ext4@vger.kernel.org, linux-btrfs@vger.kernel.org,
	linux-xfs@vger.kernel.org
Subject: [RFC PATCH v1 29/30] fs: track whether the i_version has been queried with an i_state flag
Date: Wed, 21 Dec 2016 12:03:46 -0500
Message-ID: <1482339827-7882-30-git-send-email-jlayton@redhat.com>
In-Reply-To: <1482339827-7882-1-git-send-email-jlayton@redhat.com>

NFSv4 has some pretty relaxed rules for the i_version counter that we
can exploit for our own (performance) gain. The rules basically boil
down to:

1) it must steadily increase so that a client can discard change
   attributes that are older than ones it has already seen.

2) the value must be different from the last time we checked it if
   there was a data or metadata change.

This last bit is important, as we don't necessarily need to bump the
counter when no one is querying for it. On a write-intensive workload
this can mean the metadata is written far less often.

Add a new I_VERS_BUMP i_state flag that we can use to track when the
i_version has been queried. When it's queried, we take the i_lock, set
the flag, fetch the value, then drop the lock and return the value.

When we would go to bump it, we check the flag and only bump the
counter if the flag is set or we were requested to forcibly bump it.
A bump clears the flag again.

Signed-off-by: Jeff Layton <jlayton@redhat.com>
---
 include/linux/fs.h | 66 +++++++++++++++++++++++++++++++++++++-----------------
 1 file changed, 46 insertions(+), 20 deletions(-)

diff --git a/include/linux/fs.h b/include/linux/fs.h
index 75323e7b6954..917557faa8e8 100644
--- a/include/linux/fs.h
+++ b/include/linux/fs.h
@@ -1909,6 +1909,9 @@ static inline bool HAS_UNMAPPED_ID(struct inode *inode)
  *			wb stat updates to grab mapping->tree_lock.  See
  *			inode_switch_wb_work_fn() for details.
  *
+ * I_VERS_BUMP		inode->i_version counter must be bumped on the next
+ *			change. See the inode_*_iversion functions.
+ *
  * Q: What is the difference between I_WILL_FREE and I_FREEING?
  */
 #define I_DIRTY_SYNC		(1 << 0)
@@ -1929,6 +1932,7 @@ static inline bool HAS_UNMAPPED_ID(struct inode *inode)
 #define __I_DIRTY_TIME_EXPIRED	12
 #define I_DIRTY_TIME_EXPIRED	(1 << __I_DIRTY_TIME_EXPIRED)
 #define I_WB_SWITCH		(1 << 13)
+#define I_VERS_BUMP		(1 << 14)
 
 #define I_DIRTY (I_DIRTY_SYNC | I_DIRTY_DATASYNC | I_DIRTY_PAGES)
 #define I_DIRTY_ALL (I_DIRTY | I_DIRTY_TIME)
@@ -1976,20 +1980,6 @@ inode_set_iversion(struct inode *inode, const u64 new)
 }
 
 /**
- * inode_inc_iversion_locked - increment i_version while protected
- * @inode: inode to be updated
- *
- * Increment the i_version field in the inode. This version is usable
- * when there is some other sort of lock in play that would prevent
- * concurrent accessors.
- */
-static inline void
-inode_inc_iversion_locked(struct inode *inode)
-{
-	inode->i_version++;
-}
-
-/**
  * inode_set_iversion_read - set i_version to a particular value and flag
  * 			     set flag to indicate that it has been viewed
  * @inode: inode to set
@@ -2002,7 +1992,10 @@ inode_inc_iversion_locked(struct inode *inode)
 static inline void
 inode_set_iversion_read(struct inode *inode, const u64 new)
 {
+	spin_lock(&inode->i_lock);
 	inode_set_iversion(inode, new);
+	inode->i_state |= I_VERS_BUMP;
+	spin_unlock(&inode->i_lock);
 }
 
 /**
@@ -2011,14 +2004,36 @@ inode_set_iversion_read(struct inode *inode, const u64 new)
  *
  * Every time the inode is modified, the i_version field will be incremented.
  * The filesystem has to be mounted with MS_I_VERSION flag.
+ *
+ * Returns true if counter was bumped, and false if it wasn't necessary.
  */
 static inline bool
 inode_inc_iversion(struct inode *inode, bool force)
 {
+	bool ret = false;
+
 	spin_lock(&inode->i_lock);
-	inode_inc_iversion_locked(inode);
+	if (force || (inode->i_state & I_VERS_BUMP)) {
+		inode->i_version++;
+		inode->i_state &= ~I_VERS_BUMP;
+		ret = true;
+	}
 	spin_unlock(&inode->i_lock);
-	return true;
+	return ret;
+}
+
+/**
+ * inode_inc_iversion_locked - increment i_version while protected
+ * @inode: inode to be updated
+ *
+ * Increment the i_version field in the inode. This version is usable
+ * when there is some other sort of lock in play that would prevent
+ * concurrent increments (typically inode->i_rwsem for write).
+ */
+static inline void
+inode_inc_iversion_locked(struct inode *inode)
+{
+	inode_inc_iversion(inode, true);
 }
 
 /**
@@ -2043,9 +2058,15 @@ inode_get_iversion_raw(const struct inode *inode)
  * to store the returned i_version for later comparison.
  */
 static inline u64
-inode_get_iversion(const struct inode *inode)
+inode_get_iversion(struct inode *inode)
 {
-	return inode_get_iversion_raw(inode);
+	u64 ret;
+
+	spin_lock(&inode->i_lock);
+	inode->i_state |= I_VERS_BUMP;
+	ret = inode->i_version;
+	spin_unlock(&inode->i_lock);
+	return ret;
 }
 
 /**
@@ -2054,7 +2075,7 @@ inode_get_iversion(const struct inode *inode)
  * @old: old value to check against its i_version
  *
  * Compare an i_version counter with a previous one. Returns 0 if they are
- * the same or non-zero if they are different.
+ * the same, greater than zero if the inode's is "later" than the old value.
  */
 static inline s64
 inode_cmp_iversion(const struct inode *inode, const u64 old)
@@ -2072,7 +2093,12 @@ inode_cmp_iversion(const struct inode *inode, const u64 old)
 static inline bool
 inode_iversion_need_inc(struct inode *inode)
 {
-	return true;
+	bool ret;
+
+	spin_lock(&inode->i_lock);
+	ret = inode->i_state & I_VERS_BUMP;
+	spin_unlock(&inode->i_lock);
+	return ret;
 }
 
 enum file_time_flags {
-- 
2.7.4
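
The query-flag scheme in the patch above can be tried out in userspace. What follows is only a minimal sketch under stated assumptions, not the kernel code: struct model_inode and every model_* function are hypothetical names invented for this illustration, a pthread mutex stands in for inode->i_lock, and a plain bool stands in for the I_VERS_BUMP bit in i_state.

```c
#include <assert.h>
#include <pthread.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical userspace stand-in for the relevant fields of struct inode. */
struct model_inode {
	pthread_mutex_t lock;	/* plays the role of inode->i_lock */
	uint64_t version;	/* plays the role of inode->i_version */
	bool queried;		/* plays the role of I_VERS_BUMP in i_state */
};

static void model_init(struct model_inode *mi)
{
	pthread_mutex_init(&mi->lock, NULL);
	mi->version = 0;
	mi->queried = false;
}

/* Query path: return the counter and remember that somebody has seen it. */
static uint64_t model_get_iversion(struct model_inode *mi)
{
	uint64_t ret;

	pthread_mutex_lock(&mi->lock);
	mi->queried = true;
	ret = mi->version;
	pthread_mutex_unlock(&mi->lock);
	return ret;
}

/*
 * Change path: bump only if forced, or if the value has been queried
 * since the last bump. Returns true if the counter was bumped.
 */
static bool model_inc_iversion(struct model_inode *mi, bool force)
{
	bool bumped = false;

	pthread_mutex_lock(&mi->lock);
	if (force || mi->queried) {
		mi->version++;
		mi->queried = false;
		bumped = true;
	}
	pthread_mutex_unlock(&mi->lock);
	return bumped;
}

/* Walk through a write-heavy sequence with a single reader in the middle. */
static uint64_t model_demo(void)
{
	struct model_inode mi;

	model_init(&mi);
	assert(!model_inc_iversion(&mi, false));	/* nobody looked: skipped */
	assert(!model_inc_iversion(&mi, false));	/* still skipped */
	assert(model_get_iversion(&mi) == 0);		/* query arms the flag */
	assert(model_inc_iversion(&mi, false));		/* first change after a query bumps */
	assert(!model_inc_iversion(&mi, false));	/* later changes are skipped again */
	assert(model_inc_iversion(&mi, true));		/* a forced bump always happens */
	return model_get_iversion(&mi);
}
```

Calling model_demo() walks through six changes of which only two bump the counter. In this model a reader still never sees the same value on both sides of a change, because its own query is what arms the next bump.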