All of lore.kernel.org
 help / color / mirror / Atom feed
* [PATCH] xfs: cancel intents immediately if process_intents fails
@ 2020-10-19 16:29 Darrick J. Wong
  2020-10-20 10:38 ` Brian Foster
  0 siblings, 1 reply; 2+ messages in thread
From: Darrick J. Wong @ 2020-10-19 16:29 UTC (permalink / raw)
  To: xfs

From: Darrick J. Wong <darrick.wong@oracle.com>

If processing recovered log intent items fails, we need to cancel all
the unprocessed recovered items immediately so that a subsequent AIL
push in the bail out path won't get wedged on the pinned intent items
that didn't get processed.

This can happen if the log contains (1) an intent that gets and releases
an inode, (2) an intent that cannot be recovered successfully, and (3)
some third intent item.  When recovery of (2) fails, we leave (3) pinned
in memory.  Inode reclamation is called in the error-out path of
xfs_mountfs before xfs_log_cancel_mount.  Reclamation calls
xfs_ail_push_all_sync, which gets stuck waiting for (3).

Therefore, call xlog_recover_cancel_intents if _process_intents fails.

Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
---
 fs/xfs/xfs_log_recover.c |    8 ++++++++
 1 file changed, 8 insertions(+)

diff --git a/fs/xfs/xfs_log_recover.c b/fs/xfs/xfs_log_recover.c
index a8289adc1b29..87886b7f77da 100644
--- a/fs/xfs/xfs_log_recover.c
+++ b/fs/xfs/xfs_log_recover.c
@@ -3446,6 +3446,14 @@ xlog_recover_finish(
 		int	error;
 		error = xlog_recover_process_intents(log);
 		if (error) {
+			/*
+			 * Cancel all the unprocessed intent items now so that
+			 * we don't leave them pinned in the AIL.  This can
+			 * cause the AIL to livelock on the pinned item if
+			 * anyone tries to push the AIL (inode reclaim does
+			 * this) before we get around to xfs_log_mount_cancel.
+			 */
+			xlog_recover_cancel_intents(log);
 			xfs_alert(log->l_mp, "Failed to recover intents");
 			return error;
 		}

^ permalink raw reply related	[flat|nested] 2+ messages in thread

* Re: [PATCH] xfs: cancel intents immediately if process_intents fails
  2020-10-19 16:29 [PATCH] xfs: cancel intents immediately if process_intents fails Darrick J. Wong
@ 2020-10-20 10:38 ` Brian Foster
  0 siblings, 0 replies; 2+ messages in thread
From: Brian Foster @ 2020-10-20 10:38 UTC (permalink / raw)
  To: Darrick J. Wong; +Cc: xfs

On Mon, Oct 19, 2020 at 09:29:17AM -0700, Darrick J. Wong wrote:
> From: Darrick J. Wong <darrick.wong@oracle.com>
> 
> If processing recovered log intent items fails, we need to cancel all
> the unprocessed recovered items immediately so that a subsequent AIL
> push in the bail out path won't get wedged on the pinned intent items
> that didn't get processed.
> 
> This can happen if the log contains (1) an intent that gets and releases
> an inode, (2) an intent that cannot be recovered successfully, and (3)
> some third intent item.  When recovery of (2) fails, we leave (3) pinned
> in memory.  Inode reclamation is called in the error-out path of
> xfs_mountfs before xfs_log_cancel_mount.  Reclamation calls
> xfs_ail_push_all_sync, which gets stuck waiting for (3).
> 
> Therefore, call xlog_recover_cancel_intents if _process_intents fails.
> 
> Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
> ---

Reviewed-by: Brian Foster <bfoster@redhat.com>

>  fs/xfs/xfs_log_recover.c |    8 ++++++++
>  1 file changed, 8 insertions(+)
> 
> diff --git a/fs/xfs/xfs_log_recover.c b/fs/xfs/xfs_log_recover.c
> index a8289adc1b29..87886b7f77da 100644
> --- a/fs/xfs/xfs_log_recover.c
> +++ b/fs/xfs/xfs_log_recover.c
> @@ -3446,6 +3446,14 @@ xlog_recover_finish(
>  		int	error;
>  		error = xlog_recover_process_intents(log);
>  		if (error) {
> +			/*
> +			 * Cancel all the unprocessed intent items now so that
> +			 * we don't leave them pinned in the AIL.  This can
> +			 * cause the AIL to livelock on the pinned item if
> +			 * anyone tries to push the AIL (inode reclaim does
> +			 * this) before we get around to xfs_log_mount_cancel.
> +			 */
> +			xlog_recover_cancel_intents(log);
>  			xfs_alert(log->l_mp, "Failed to recover intents");
>  			return error;
>  		}
> 


^ permalink raw reply	[flat|nested] 2+ messages in thread

end of thread, other threads:[~2020-10-20 10:38 UTC | newest]

Thread overview: 2+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2020-10-19 16:29 [PATCH] xfs: cancel intents immediately if process_intents fails Darrick J. Wong
2020-10-20 10:38 ` Brian Foster

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.