All of lore.kernel.org
 help / color / mirror / Atom feed
From: Dave Chinner <david@fromorbit.com>
To: "Darrick J. Wong" <djwong@kernel.org>
Cc: linux-xfs@vger.kernel.org
Subject: Re: [PATCH 21/22] xfs: clean up and simplify xfs_dialloc()
Date: Tue, 18 May 2021 11:46:39 +1000	[thread overview]
Message-ID: <20210518014639.GI2893@dread.disaster.area> (raw)
In-Reply-To: <20210512214930.GZ8582@magnolia>

On Wed, May 12, 2021 at 02:49:30PM -0700, Darrick J. Wong wrote:
> On Thu, May 06, 2021 at 05:20:53PM +1000, Dave Chinner wrote:
> > From: Dave Chinner <dchinner@redhat.com>
> > 
> > Because it's a mess.
> > 
> > Signed-off-by: Dave Chinner <dchinner@redhat.com>
.....
> > +	/*
> > +	 * Check that there is enough free space for the file plus a chunk of
> > +	 * inodes if we need to allocate some. If this is the first pass across
> > +	 * the AGs, take into account the potential space needed for alignment
> > +	 * of inode chunks when checking the longest contiguous free space in
> > +	 * the AG - this prevents us from getting ENOSPC because we have free
> > +	 * space larger than ialloc_blks but alignment constraints prevent us
> > +	 * from using it.
> > +	 *
> > +	 * If we can't find an AG with space for full alignment slack to be
> > +	 * taken into account, we must be near ENOSPC in all AGs.  Hence we
> > +	 * don't include alignment for the second pass and so if we fail
> > +	 * allocation due to alignment issues then it is most likely a real
> > +	 * ENOSPC condition.
> > +	 *
> > +	 * XXX(dgc): this calculation is now bogus thanks to the per-ag
> > +	 * reservations that xfs_alloc_fix_freelist() now does via
> > +	 * xfs_alloc_space_available(). When the AG fills up, pagf_freeblks will
> > +	 * be more than large enough for the check below to succeed, but
> > +	 * xfs_alloc_space_available() will fail because of the non-zero
> > +	 * metadata reservation and hence we won't actually be able to allocate
> > +	 * more inodes in this AG. We do soooo much unnecessary work near ENOSPC
> > +	 * because of this.
> 
> Yuck.  Can this be fixed by doing:
> 
> 	really_free = pag->pagf_freeblks -
> 				xfs_ag_resv_needed(pag, XFS_AG_RESV_NONE);
> 	return really_free >= needspace + ineed && longest >= ineed)
> 
> to account for those reservations, perhaps?

Something like that, though I'd much prefer to factor the freelist
fixup calculations and use them here so we have a single set of
"is there enough space in this AG for allocating X blocks"
functions.

It's somewhat outside the scope of this patchset, so I wrote the
comment rather than trying to address it here and complicate this
patchset....

> > @@ -1624,7 +1746,6 @@ xfs_dialloc(
> >  	 * Files of these types need at least one block if length > 0
> >  	 * (and they won't fit in the inode, but that's hard to figure out).
> >  	 */
> > -	needspace = S_ISDIR(mode) || S_ISREG(mode) || S_ISLNK(mode);
> >  	if (S_ISDIR(mode))
> >  		start_agno = xfs_ialloc_next_ag(mp);
> >  	else {
> > @@ -1635,7 +1756,7 @@ xfs_dialloc(
> >  
> >  	/*
> >  	 * If we have already hit the ceiling of inode blocks then clear
> > -	 * okalloc so we scan all available agi structures for a free
> > +	 * ok_alloc so we scan all available agi structures for a free
> >  	 * inode.
> >  	 *
> >  	 * Read rough value of mp->m_icount by percpu_counter_read_positive,
> > @@ -1644,7 +1765,7 @@ xfs_dialloc(
> 
> Er... didn't this logic get split into xfs_dialloc_select_ag in 5.11?

Yeah, but that was more about cleaning up the code in xfs_inode.c
by separating out the inode initialisation from the physical inode
allocation. That cleanup and separation is what allows this series
to simplify and clean up the inode allocation because it is no
longer commingled with the inode initialisation after allocation...

> Nice cleanup, at least...
>
>  :)

Yup, along with the 5.11 mods, we've chopped a lot of unnecessary
code out of the inode allocation path... :)

-Dave.
-- 
Dave Chinner
david@fromorbit.com

  reply	other threads:[~2021-05-18  1:46 UTC|newest]

Thread overview: 87+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2021-05-06  7:20 [RFC 00/22] xfs: initial agnumber -> perag conversions for shrink Dave Chinner
2021-05-06  7:20 ` [PATCH 01/22] xfs: move xfs_perag_get/put to xfs_ag.[ch] Dave Chinner
2021-05-10 12:52   ` Brian Foster
2021-05-11  7:18     ` Dave Chinner
2021-05-10 22:28   ` Darrick J. Wong
2021-05-06  7:20 ` [PATCH 02/22] xfs: prepare for moving perag definitions and support to libxfs Dave Chinner
2021-05-10 12:53   ` Brian Foster
2021-05-11  7:19     ` Dave Chinner
2021-05-06  7:20 ` [PATCH 03/22] xfs: move perag structure and setup to libxfs/xfs_ag.[ch] Dave Chinner
2021-05-10 22:26   ` Darrick J. Wong
2021-05-10 23:38     ` Dave Chinner
2021-05-06  7:20 ` [PATCH 04/22] xfs: make for_each_perag... a first class citizen Dave Chinner
2021-05-10 12:53   ` Brian Foster
2021-05-11  7:35     ` Dave Chinner
2021-05-11 12:29       ` Brian Foster
2021-05-11 21:33         ` Dave Chinner
2021-05-12 21:58           ` Darrick J. Wong
2021-05-06  7:20 ` [PATCH 05/22] xfs: convert raw ag walks to use for_each_perag Dave Chinner
2021-05-10 12:54   ` Brian Foster
2021-05-06  7:20 ` [PATCH 06/22] xfs: convert xfs_iwalk to use perag references Dave Chinner
2021-05-10 13:41   ` Brian Foster
2021-05-12 22:08   ` Darrick J. Wong
2021-05-06  7:20 ` [PATCH 07/22] xfs: convert secondary superblock walk to use perags Dave Chinner
2021-05-10 13:41   ` Brian Foster
2021-05-12 22:09   ` Darrick J. Wong
2021-05-06  7:20 ` [PATCH 08/22] xfs: pass perags through to the busy extent code Dave Chinner
2021-05-11 12:29   ` Brian Foster
2021-05-12 22:13   ` Darrick J. Wong
2021-05-06  7:20 ` [PATCH 09/22] xfs: push perags through the ag reservation callouts Dave Chinner
2021-05-11 12:29   ` Brian Foster
2021-05-13  0:29     ` Dave Chinner
2021-05-12 22:16   ` Darrick J. Wong
2021-05-06  7:20 ` [PATCH 10/22] xfs: pass perags around in fsmap data dev functions Dave Chinner
2021-05-11 12:30   ` Brian Foster
2021-05-12 22:23   ` Darrick J. Wong
2021-05-18  1:00     ` Dave Chinner
2021-05-06  7:20 ` [PATCH 11/22] xfs: add a perag to the btree cursor Dave Chinner
2021-05-11 12:30   ` Brian Foster
2021-05-11 20:51     ` Darrick J. Wong
2021-05-11 21:52       ` Dave Chinner
2021-05-12 12:49         ` Brian Foster
2021-05-12 22:41           ` Darrick J. Wong
2021-05-12 22:40   ` Darrick J. Wong
2021-05-13  0:12     ` Dave Chinner
2021-05-13  0:55       ` Darrick J. Wong
2021-05-13  1:07         ` Dave Chinner
2021-05-13  3:49           ` Darrick J. Wong
2021-05-06  7:20 ` [PATCH 12/22] xfs: convert rmap btree cursor to using a perag Dave Chinner
2021-05-11 12:30   ` Brian Foster
2021-05-12 22:45   ` Darrick J. Wong
2021-05-13  3:54     ` Darrick J. Wong
2021-05-06  7:20 ` [PATCH 13/22] xfs: convert refcount btree cursor to use perags Dave Chinner
2021-05-11 12:30   ` Brian Foster
2021-05-12 22:47   ` Darrick J. Wong
2021-05-06  7:20 ` [PATCH 14/22] xfs: convert allocbt cursors " Dave Chinner
2021-05-11 12:30   ` Brian Foster
2021-05-13  3:55   ` Darrick J. Wong
2021-05-06  7:20 ` [PATCH 15/22] xfs: use perag for ialloc btree cursors Dave Chinner
2021-05-11 12:30   ` Brian Foster
2021-05-13  3:55   ` Darrick J. Wong
2021-05-06  7:20 ` [PATCH 16/22] xfs: remove agno from btree cursor Dave Chinner
2021-05-11 12:34   ` Brian Foster
2021-05-11 22:02     ` Dave Chinner
2021-05-12 22:52   ` Darrick J. Wong
2021-05-06  7:20 ` [PATCH 17/22] xfs: simplify xfs_dialloc_select_ag() return values Dave Chinner
2021-05-12 12:49   ` Brian Foster
2021-05-12 22:55   ` Darrick J. Wong
2021-05-06  7:20 ` [PATCH 18/22] xfs: collapse AG selection for inode allocation Dave Chinner
2021-05-12 12:52   ` Brian Foster
2021-05-18  1:21     ` Dave Chinner
2021-05-12 23:11   ` Darrick J. Wong
2021-05-18  1:29     ` Dave Chinner
2021-05-06  7:20 ` [PATCH 19/22] xfs: get rid of xfs_dir_ialloc() Dave Chinner
2021-05-06 11:26   ` kernel test robot
2021-05-06 11:26     ` kernel test robot
2021-05-06 11:26   ` [RFC PATCH] xfs: xfs_dialloc_ag can be static kernel test robot
2021-05-06 11:26     ` kernel test robot
2021-05-12 12:52   ` [PATCH 19/22] xfs: get rid of xfs_dir_ialloc() Brian Foster
2021-05-12 23:19   ` Darrick J. Wong
2021-05-06  7:20 ` [PATCH 20/22] xfs: inode allocation can use a single perag instance Dave Chinner
2021-05-12 12:52   ` Brian Foster
2021-05-12 23:19   ` Darrick J. Wong
2021-05-06  7:20 ` [PATCH 21/22] xfs: clean up and simplify xfs_dialloc() Dave Chinner
2021-05-12 21:49   ` Darrick J. Wong
2021-05-18  1:46     ` Dave Chinner [this message]
2021-05-06  7:20 ` [PATCH 22/22] xfs: use perag through unlink processing Dave Chinner
2021-05-12 21:37   ` Darrick J. Wong

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20210518014639.GI2893@dread.disaster.area \
    --to=david@fromorbit.com \
    --cc=djwong@kernel.org \
    --cc=linux-xfs@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.