From: Xinliang Liu via lustre-devel
Reply-To: Xinliang Liu
To: Peter Jones
Cc: Jian Yu, lixi@ddn.com, cloud-dev-request@op-lists.linaro.org, lustre-devel@lists.lustre.org
Date: Thu, 16 Dec 2021 16:23:52 +0800
Subject: [lustre-devel] Lustre Arm stuff status and work plan

Hi Peter and all,

Kevin (on cc) and I have been working on Lustre on Arm for some time. We would like to give the community a status and progress report and list our work plan for next year. Please review the work plan and send us any comments and suggestions.
Thanks.

Status and Progress
====================

Release
- No Arm packages have been built for an official community release yet.

Build
- Verified Lustre and openZFS builds and a multi-node setup on Arm64 CentOS 8; all are OK.
- Lbuild script support for Arm is under review: LU-15293.

CI
- No Arm server-side CI support yet.
- Arm client testing against an x86_64 server is already in the CI gate.
  - Only a few ldiskfs test suites are run (sanity, sanity-sec, sanity-lnet, etc.), not a full test.
  - A full test (with empty GRANT_CHECK_LIST) shows several failing test cases related to the Arm client; see the test results page:
    - sanity test 317: LU-11667 (workaround fix landed)
    - sanityn test 16a: LU-11597; test 71a: LU-11787
    - conf-sanity test 98: LU-11785; test 112: LU-13813
    - sanity-flr test 50a: LU-14970
    - sanity-pcc test 7a: LU-14346

Arm server-side test on local setup
- Ran a full ldiskfs test with all test suites.
  - Due to the multi-MDT crash issue, some multi-MDT tests were not run.
  - Many new test failures appeared; see the Google sheet for details.
- The full openZFS test has not been run, but we have heard that it should be more stable than ldiskfs.

Bugfix
- Old Arm always_except bugs (https://jira.whamcloud.com/issues/?filter=15555): the Arm-related ones are almost all addressed.
  - LU-11596, LU-11597, LU-14067, LU-11787: addressed; patches sent and waiting for Arm client CI recovery before they can land.
  - LU-10073, LU-11671: either cannot be reproduced on Arm or also happen on x86_64.
  - Other old Arm bugs still to be fixed: LU-11785, LU-13813, LU-14970, LU-14346.
- Newly created server-side bugs:
  - LU-15122: ASSERTION( iobuf->dr_rw == 0 ) crash; the fix has landed.
  - LU-15364: multi-MDT kernel oops, related to atomic unaligned memory access; work in progress.
  - LU-15223: 64K page size read/write improvement; long-term work, in progress.
- Full list of Arm-related bugs (label "arm"): https://jira.whamcloud.com/issues/?filter=16710
- ... is not that ready for production.

Reference to:
James Simmons' Lustre Arm update: https://connect.linaro.org/resources/san19/san19-224/

Work Plan
==========

- Lustre server-side critical bug fixes, target 2022-06
  - Kernel oops with multiple MDTs when striping: LU-15364
  - Hang at sanity test 807
  - conf-sanity test 44 kernel crash
  - conf-sanity test 58 kernel crash
  - conf-sanity test 78 kernel crash
  - conf-sanity test 79 crash
  - sanity-pcc test 7a hangs the cluster
- Lustre server-side non-critical bug fixes, target 2022-12
  - sanity failures: 33 cases
  - replay-single (server): 1 case
  - sanity-flr 200: 1 case
  - sanity-hsm failures: 25 cases
  - lustre-rsync-test failures: 3
  - recovery-small/sanity-scrub: 2
  - sanityn failures: 12
  - sanity-lfsck failures: 3
  - sanity-sec failures: 7
  - sanity-lnet failures: 2
- Continuously add more test suites to the Arm client CI ??
  - Once a test suite passes completely on Arm, add it to CI.
- Server CI support for Arm on CentOS 8 ??
  - Ideally, Arm server CI would arrive together with the Arm server-side fix patches and ensure that future merged patches don't cause regressions on Arm.
  - As the test infra is not open source and is maintained by Whamcloud, it might need Whamcloud to set this up ??
- Other future work
  - Test other distros such as Ubuntu, SUSE, etc.
  - Test x86 client with Arm64 server
  - Basic optimisation: CRC/AES
  - All-flash optimization

Best Regards,
Xinliang
