From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-8.2 required=3.0 tests=HEADER_FROM_DIFFERENT_DOMAINS, INCLUDES_PATCH,MAILING_LIST_MULTI,SIGNED_OFF_BY,SPF_HELO_NONE,SPF_PASS, UNPARSEABLE_RELAY,URIBL_BLOCKED,USER_AGENT_SANE_1 autolearn=ham autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id 0F982C4CEC4 for ; Thu, 19 Sep 2019 02:37:37 +0000 (UTC) Received: from vger.kernel.org (vger.kernel.org [209.132.180.67]) by mail.kernel.org (Postfix) with ESMTP id D1C2E218AE for ; Thu, 19 Sep 2019 02:37:36 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2387726AbfISChg (ORCPT ); Wed, 18 Sep 2019 22:37:36 -0400 Received: from out30-130.freemail.mail.aliyun.com ([115.124.30.130]:44913 "EHLO out30-130.freemail.mail.aliyun.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1727324AbfISChf (ORCPT ); Wed, 18 Sep 2019 22:37:35 -0400 X-Alimail-AntiSpam: AC=PASS;BC=-1|-1;BR=01201311R211e4;CH=green;DM=||false|;FP=0|-1|-1|-1|0|-1|-1|-1;HT=e01e04423;MF=eguan@linux.alibaba.com;NM=1;PH=DS;RN=9;SR=0;TI=SMTPD_---0TckPGgq_1568860649; Received: from localhost(mailfrom:eguan@linux.alibaba.com fp:SMTPD_---0TckPGgq_1568860649) by smtp.aliyun-inc.com(127.0.0.1); Thu, 19 Sep 2019 10:37:29 +0800 Date: Thu, 19 Sep 2019 10:37:29 +0800 From: Eryu Guan To: Brian Foster Cc: kaixuxia , fstests@vger.kernel.org, linux-xfs@vger.kernel.org, Eryu Guan , "Darrick J. Wong" , newtongao@tencent.com, jasperwang@tencent.com Subject: Re: [PATCH v3 2/2] xfs: test the deadlock between the AGI and AGF with RENAME_WHITEOUT Message-ID: <20190919023729.GB52397@e18g06458.et15sqa> References: <20190918135947.GD29377@bfoster> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20190918135947.GD29377@bfoster> User-Agent: Mutt/1.5.21 (2010-09-15) Sender: linux-xfs-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-xfs@vger.kernel.org On Wed, Sep 18, 2019 at 09:59:47AM -0400, Brian Foster wrote: > On Wed, Sep 18, 2019 at 07:49:22PM +0800, kaixuxia wrote: > > There is ABBA deadlock bug between the AGI and AGF when performing > > rename() with RENAME_WHITEOUT flag, and add this testcase to make > > sure the rename() call works well. > > > > Signed-off-by: kaixuxia > > --- > > FYI, for some reason your patch series isn't threaded on the mailing > list. I thought git send-email did this by default. Assuming you're not > explicitly using --no-thread, you might have to use the --thread option > so this gets posted as a proper series. > > > tests/xfs/512 | 96 +++++++++++++++++++++++++++++++++++++++++++++++++++++++ > > tests/xfs/512.out | 2 ++ > > tests/xfs/group | 1 + > > 3 files changed, 99 insertions(+) > > create mode 100755 tests/xfs/512 > > create mode 100644 tests/xfs/512.out > > > > diff --git a/tests/xfs/512 b/tests/xfs/512 > > new file mode 100755 > > index 0000000..a2089f0 > > --- /dev/null > > +++ b/tests/xfs/512 > > @@ -0,0 +1,96 @@ > > +#! /bin/bash > > +# SPDX-License-Identifier: GPL-2.0 > > +# Copyright (c) 2019 Tencent. All Rights Reserved. > > +# > > +# FS QA Test 512 > > +# > > +# Test the ABBA deadlock case between the AGI and AGF When performing > > +# rename operation with RENAME_WHITEOUT flag. > > +# > > +seq=`basename $0` > > +seqres=$RESULT_DIR/$seq > > +echo "QA output created by $seq" > > + > > +here=`pwd` > > +tmp=/tmp/$$ > > +status=1 # failure is the default! > > +trap "_cleanup; exit \$status" 0 1 2 3 15 > > + > > +_cleanup() > > +{ > > + cd / > > + rm -f $tmp.* > > +} > > + > > +# get standard environment, filters and checks > > +. ./common/rc > > +. ./common/filter > > +. ./common/renameat2 > > + > > +rm -f $seqres.full > > + > > +# real QA test starts here > > +_supported_fs xfs > > +_supported_os Linux > > +# single AG will cause default xfs_repair to fail. This test need a > > +# single AG fs, so ignore the check. > > +_require_scratch_nocheck > > +_requires_renameat2 whiteout > > + > > +filter_enospc() { > > + sed -e '/^.*No space left on device.*/d' > > +} > > + > > +create_file() > > +{ > > + local target_dir=$1 > > + local files_count=$2 > > + > > + for i in $(seq 1 $files_count); do > > + echo > $target_dir/f$i >/dev/null 2>&1 | filter_enospc > > + done > > +} > > + > > +rename_whiteout() > > +{ > > + local target_dir=$1 > > + local files_count=$2 > > + > > + # a long filename could increase the possibility that target_dp > > + # allocate new blocks(acquire the AGF lock) to store the filename > > + longnamepre=`$PERL_PROG -e 'print "a"x200;'` > > + > > + # now try to do rename with RENAME_WHITEOUT flag > > + for i in $(seq 1 $files_count); do > > + src/renameat2 -w $SCRATCH_MNT/f$i $target_dir/$longnamepre$i >/dev/null 2>&1 > > + done > > +} > > + > > +_scratch_mkfs_xfs -d agcount=1 >> $seqres.full 2>&1 || > > + _fail "mkfs failed" > > This appears to be the only XFS specific bit. Could it be > conditionalized using FSTYP such that this test could go under > tests/generic? Looks like so. I'm wondering if the deadlock is only possible with single ag xfs (case 1) or single ag is just more easier to hit the deadlock (case 2). If it's case 1, I think we could make the test generic by only adding extra xfs-specific mkfs option when FSTYP is xfs, e.g. mkfs_opts="" # here goes comments explaining why xfs needs single ag if [ "$FSTYP" == "xfs" ]; then mkfs_opts="-d agcount=1" fi If it's case 2, that depends on what's the posibility to reproduce without single ag. I can afford 5%-10% or even lower reproduce posibility (given that the deadlock is a corner case and isn't likely to happen with normally formatted xfs) to drop "-d agcount=1". So the test has nothing xfs-specific. If the reproduce posibility is near 0, then we could go to case 1. > > > +_scratch_mount > > + > > +# set the rename and create file counts > > +file_count=50000 > > + > > +# create the necessary directory for create and rename operations > > +createdir=$SCRATCH_MNT/createdir > > +mkdir $createdir > > +renamedir=$SCRATCH_MNT/renamedir > > +mkdir $renamedir > > + > > +# create many small files for the rename with RENAME_WHITEOUT > > +create_file $SCRATCH_MNT $file_count > > + > > +# try to create files at the same time to hit the deadlock > > +rename_whiteout $renamedir $file_count & > > +create_file $createdir $file_count & > > + > > When I ran this test I noticed that the rename_whiteout task completed > renaming the 50k files before the create_file task created even 30k of > the 50k files. There's no risk of deadlock once one of these tasks > completes, right? If so, that seems like something that could be fixed > up. > > Beyond that though, the test itself ran for almost 19 minutes on a vm > with the deadlock fix. That seems like overkill to me for a test that's > so narrowly focused on a particular bug that it's unlikely to fail in > the future. If we can't find a way to get this down to a reasonable time > while still reproducing the deadlock, I'm kind of wondering if there's a > better approach to get more rename coverage from existing tests. For > example, could we add this support to fsstress and see if any of the > existing stress tests might trigger the original problem? Even if we > needed to add a new rename/create focused fsstress test, that might at > least be general enough to provide broader coverage. yeah, adding renameat2 operations to fsstress looks like a good idea to extend the test coverage. But that belongs to another patch. > > Alternatively, what if this test ran a create/rename workload (on a > smaller fileset) for a fixed time of a minute or two and then exited? I > think it would be a reasonable compromise if the test still reproduced > on some smaller frequency, it's just not clear to me how effective such > a test would be without actually trying it. Maybe Eryu has additional > thoughts.. I agreed, unless the frequency is zero :) As mentioned above, I can afford smaller frequency if test runs faster. Also, we could scale the test time based on TIME_FACTOR and add 'stress' group too. (You could grep for TIME_FACTOR in tests/ dir for examples.) Thanks, Eryu > > Brian > > > +wait > > +echo Silence is golden > > + > > +# Failure comes in the form of a deadlock. > > + > > +# success, all done > > +status=0 > > +exit > > diff --git a/tests/xfs/512.out b/tests/xfs/512.out > > new file mode 100644 > > index 0000000..0aabdef > > --- /dev/null > > +++ b/tests/xfs/512.out > > @@ -0,0 +1,2 @@ > > +QA output created by 512 > > +Silence is golden > > diff --git a/tests/xfs/group b/tests/xfs/group > > index a7ad300..ed250d6 100644 > > --- a/tests/xfs/group > > +++ b/tests/xfs/group > > @@ -509,3 +509,4 @@ > > 509 auto ioctl > > 510 auto ioctl quick > > 511 auto quick quota > > +512 auto rename > > -- > > 1.8.3.1 > > > > -- > > kaixuxia