From: Tianjia Zhang <tianjia.zhang@linux.alibaba.com> To: Herbert Xu <herbert@gondor.apana.org.au>, "David S. Miller" <davem@davemloft.net>, Jussi Kivilinna <jussi.kivilinna@iki.fi>, Ard Biesheuvel <ardb@kernel.org>, Catalin Marinas <catalin.marinas@arm.com>, Will Deacon <will@kernel.org>, Maxime Coquelin <mcoquelin.stm32@gmail.com>, Alexandre Torgue <alexandre.torgue@foss.st.com>, Eric Biggers <ebiggers@kernel.org>, linux-crypto@vger.kernel.org, linux-arm-kernel@lists.infradead.org, linux-kernel@vger.kernel.org, linux-stm32@st-md-mailman.stormreply.com Subject: [PATCH 00/16] Optimizing SM3 and SM4 algorithms using NEON/CE/SVE instructions Date: Mon, 26 Sep 2022 17:36:04 +0800 [thread overview] Message-ID: <20220926093620.99898-1-tianjia.zhang@linux.alibaba.com> (raw) This series of patches uses different instruction sets to optimize the SM3 and SM4 algorithms, as well as the optimization of different modes of SM4. patch 1-2: NEON instruction set optimization for SM3 patch 3: Refactored and streamlined SM4 NEON instruction implementation patch 4-5: support test for new SM4 mode patch 6-8: Refactored and streamlined SM4 CE instruction implementation patch 9-12: CE accelerated implementation of SM4 CTS/XTS/ESSIV patch 13: CE accelerated implementation of SM4 CMAC/XCBC/CBCMAC patch 14-15: CE accelerated implementation of SM4 CCM/GCM patch 16: SM4 ARMv9 SVE cryptography acceleration implementation Tianjia Zhang (16): crypto: arm64/sm3 - raise the priority of the CE implementation crypto: arm64/sm3 - add NEON assembly implementation crypto: arm64/sm4 - refactor and simplify NEON implementation crypto: testmgr - add SM4 cts-cbc/essiv/xts/xcbc test vectors crypto: tcrypt - add SM4 cts-cbc/essiv/xts/xcbc test crypto: arm64/sm4 - refactor and simplify CE implementation crypto: arm64/sm4 - simplify sm4_ce_expand_key() of CE implementation crypto: arm64/sm4 - export reusable CE acceleration functions crypto: arm64/sm4 - add CE implementation for CTS-CBC mode crypto: arm64/sm4 - add CE implementation for XTS mode crypto: essiv - allow digestsize to be greater than keysize crypto: arm64/sm4 - add CE implementation for ESSIV mode crypto: arm64/sm4 - add CE implementation for cmac/xcbc/cbcmac crypto: arm64/sm4 - add CE implementation for CCM mode crypto: arm64/sm4 - add CE implementation for GCM mode crypto: arm64/sm4 - add ARMv9 SVE cryptography acceleration implementation arch/arm64/crypto/Kconfig | 66 +- arch/arm64/crypto/Makefile | 12 + arch/arm64/crypto/sm3-ce-glue.c | 2 +- arch/arm64/crypto/sm3-neon-core.S | 600 +++++++++++++ arch/arm64/crypto/sm3-neon-glue.c | 103 +++ arch/arm64/crypto/sm4-ce-asm.h | 209 +++++ arch/arm64/crypto/sm4-ce-ccm-core.S | 328 +++++++ arch/arm64/crypto/sm4-ce-ccm-glue.c | 303 +++++++ arch/arm64/crypto/sm4-ce-core.S | 1247 ++++++++++++++++++--------- arch/arm64/crypto/sm4-ce-gcm-core.S | 741 ++++++++++++++++ arch/arm64/crypto/sm4-ce-gcm-glue.c | 286 ++++++ arch/arm64/crypto/sm4-ce-glue.c | 703 ++++++++++++++- arch/arm64/crypto/sm4-ce.h | 16 + arch/arm64/crypto/sm4-neon-core.S | 630 +++++++++----- arch/arm64/crypto/sm4-neon-glue.c | 172 +--- arch/arm64/crypto/sm4-sve-ce-core.S | 1028 ++++++++++++++++++++++ arch/arm64/crypto/sm4-sve-ce-glue.c | 332 +++++++ crypto/essiv.c | 11 +- crypto/tcrypt.c | 28 + crypto/testmgr.c | 25 + crypto/testmgr.h | 1161 +++++++++++++++++++++++++ 21 files changed, 7234 insertions(+), 769 deletions(-) create mode 100644 arch/arm64/crypto/sm3-neon-core.S create mode 100644 arch/arm64/crypto/sm3-neon-glue.c create mode 100644 arch/arm64/crypto/sm4-ce-asm.h create mode 100644 arch/arm64/crypto/sm4-ce-ccm-core.S create mode 100644 arch/arm64/crypto/sm4-ce-ccm-glue.c create mode 100644 arch/arm64/crypto/sm4-ce-gcm-core.S create mode 100644 arch/arm64/crypto/sm4-ce-gcm-glue.c create mode 100644 arch/arm64/crypto/sm4-ce.h create mode 100644 arch/arm64/crypto/sm4-sve-ce-core.S create mode 100644 arch/arm64/crypto/sm4-sve-ce-glue.c -- 2.24.3 (Apple Git-128)
WARNING: multiple messages have this Message-ID (diff)
From: Tianjia Zhang <tianjia.zhang@linux.alibaba.com> To: Herbert Xu <herbert@gondor.apana.org.au>, "David S. Miller" <davem@davemloft.net>, Jussi Kivilinna <jussi.kivilinna@iki.fi>, Ard Biesheuvel <ardb@kernel.org>, Catalin Marinas <catalin.marinas@arm.com>, Will Deacon <will@kernel.org>, Maxime Coquelin <mcoquelin.stm32@gmail.com>, Alexandre Torgue <alexandre.torgue@foss.st.com>, Eric Biggers <ebiggers@kernel.org>, linux-crypto@vger.kernel.org, linux-arm-kernel@lists.infradead.org, linux-kernel@vger.kernel.org, linux-stm32@st-md-mailman.stormreply.com Subject: [PATCH 00/16] Optimizing SM3 and SM4 algorithms using NEON/CE/SVE instructions Date: Mon, 26 Sep 2022 17:36:04 +0800 [thread overview] Message-ID: <20220926093620.99898-1-tianjia.zhang@linux.alibaba.com> (raw) This series of patches uses different instruction sets to optimize the SM3 and SM4 algorithms, as well as the optimization of different modes of SM4. patch 1-2: NEON instruction set optimization for SM3 patch 3: Refactored and streamlined SM4 NEON instruction implementation patch 4-5: support test for new SM4 mode patch 6-8: Refactored and streamlined SM4 CE instruction implementation patch 9-12: CE accelerated implementation of SM4 CTS/XTS/ESSIV patch 13: CE accelerated implementation of SM4 CMAC/XCBC/CBCMAC patch 14-15: CE accelerated implementation of SM4 CCM/GCM patch 16: SM4 ARMv9 SVE cryptography acceleration implementation Tianjia Zhang (16): crypto: arm64/sm3 - raise the priority of the CE implementation crypto: arm64/sm3 - add NEON assembly implementation crypto: arm64/sm4 - refactor and simplify NEON implementation crypto: testmgr - add SM4 cts-cbc/essiv/xts/xcbc test vectors crypto: tcrypt - add SM4 cts-cbc/essiv/xts/xcbc test crypto: arm64/sm4 - refactor and simplify CE implementation crypto: arm64/sm4 - simplify sm4_ce_expand_key() of CE implementation crypto: arm64/sm4 - export reusable CE acceleration functions crypto: arm64/sm4 - add CE implementation for CTS-CBC mode crypto: arm64/sm4 - add CE implementation for XTS mode crypto: essiv - allow digestsize to be greater than keysize crypto: arm64/sm4 - add CE implementation for ESSIV mode crypto: arm64/sm4 - add CE implementation for cmac/xcbc/cbcmac crypto: arm64/sm4 - add CE implementation for CCM mode crypto: arm64/sm4 - add CE implementation for GCM mode crypto: arm64/sm4 - add ARMv9 SVE cryptography acceleration implementation arch/arm64/crypto/Kconfig | 66 +- arch/arm64/crypto/Makefile | 12 + arch/arm64/crypto/sm3-ce-glue.c | 2 +- arch/arm64/crypto/sm3-neon-core.S | 600 +++++++++++++ arch/arm64/crypto/sm3-neon-glue.c | 103 +++ arch/arm64/crypto/sm4-ce-asm.h | 209 +++++ arch/arm64/crypto/sm4-ce-ccm-core.S | 328 +++++++ arch/arm64/crypto/sm4-ce-ccm-glue.c | 303 +++++++ arch/arm64/crypto/sm4-ce-core.S | 1247 ++++++++++++++++++--------- arch/arm64/crypto/sm4-ce-gcm-core.S | 741 ++++++++++++++++ arch/arm64/crypto/sm4-ce-gcm-glue.c | 286 ++++++ arch/arm64/crypto/sm4-ce-glue.c | 703 ++++++++++++++- arch/arm64/crypto/sm4-ce.h | 16 + arch/arm64/crypto/sm4-neon-core.S | 630 +++++++++----- arch/arm64/crypto/sm4-neon-glue.c | 172 +--- arch/arm64/crypto/sm4-sve-ce-core.S | 1028 ++++++++++++++++++++++ arch/arm64/crypto/sm4-sve-ce-glue.c | 332 +++++++ crypto/essiv.c | 11 +- crypto/tcrypt.c | 28 + crypto/testmgr.c | 25 + crypto/testmgr.h | 1161 +++++++++++++++++++++++++ 21 files changed, 7234 insertions(+), 769 deletions(-) create mode 100644 arch/arm64/crypto/sm3-neon-core.S create mode 100644 arch/arm64/crypto/sm3-neon-glue.c create mode 100644 arch/arm64/crypto/sm4-ce-asm.h create mode 100644 arch/arm64/crypto/sm4-ce-ccm-core.S create mode 100644 arch/arm64/crypto/sm4-ce-ccm-glue.c create mode 100644 arch/arm64/crypto/sm4-ce-gcm-core.S create mode 100644 arch/arm64/crypto/sm4-ce-gcm-glue.c create mode 100644 arch/arm64/crypto/sm4-ce.h create mode 100644 arch/arm64/crypto/sm4-sve-ce-core.S create mode 100644 arch/arm64/crypto/sm4-sve-ce-glue.c -- 2.24.3 (Apple Git-128) _______________________________________________ linux-arm-kernel mailing list linux-arm-kernel@lists.infradead.org http://lists.infradead.org/mailman/listinfo/linux-arm-kernel
next reply other threads:[~2022-09-26 9:36 UTC|newest] Thread overview: 42+ messages / expand[flat|nested] mbox.gz Atom feed top 2022-09-26 9:36 Tianjia Zhang [this message] 2022-09-26 9:36 ` [PATCH 00/16] Optimizing SM3 and SM4 algorithms using NEON/CE/SVE instructions Tianjia Zhang 2022-09-26 9:36 ` [PATCH 01/16] crypto: arm64/sm3 - raise the priority of the CE implementation Tianjia Zhang 2022-09-26 9:36 ` Tianjia Zhang 2022-09-26 9:36 ` [PATCH 02/16] crypto: arm64/sm3 - add NEON assembly implementation Tianjia Zhang 2022-09-26 9:36 ` Tianjia Zhang 2022-09-26 9:36 ` [PATCH 03/16] crypto: arm64/sm4 - refactor and simplify NEON implementation Tianjia Zhang 2022-09-26 9:36 ` Tianjia Zhang 2022-09-26 9:36 ` [PATCH 04/16] crypto: testmgr - add SM4 cts-cbc/essiv/xts/xcbc test vectors Tianjia Zhang 2022-09-26 9:36 ` Tianjia Zhang 2022-09-26 9:36 ` [PATCH 05/16] crypto: tcrypt - add SM4 cts-cbc/essiv/xts/xcbc test Tianjia Zhang 2022-09-26 9:36 ` Tianjia Zhang 2022-09-26 9:36 ` [PATCH 06/16] crypto: arm64/sm4 - refactor and simplify CE implementation Tianjia Zhang 2022-09-26 9:36 ` Tianjia Zhang 2022-09-26 9:36 ` [PATCH 07/16] crypto: arm64/sm4 - simplify sm4_ce_expand_key() of " Tianjia Zhang 2022-09-26 9:36 ` Tianjia Zhang 2022-09-26 9:36 ` [PATCH 08/16] crypto: arm64/sm4 - export reusable CE acceleration functions Tianjia Zhang 2022-09-26 9:36 ` Tianjia Zhang 2022-09-26 9:36 ` [PATCH 09/16] crypto: arm64/sm4 - add CE implementation for CTS-CBC mode Tianjia Zhang 2022-09-26 9:36 ` Tianjia Zhang 2022-09-26 9:36 ` [PATCH 10/16] crypto: arm64/sm4 - add CE implementation for XTS mode Tianjia Zhang 2022-09-26 9:36 ` Tianjia Zhang 2022-09-26 9:36 ` [PATCH 11/16] crypto: essiv - allow digestsize to be greater than keysize Tianjia Zhang 2022-09-26 9:36 ` Tianjia Zhang 2022-09-26 9:36 ` [PATCH 12/16] crypto: arm64/sm4 - add CE implementation for ESSIV mode Tianjia Zhang 2022-09-26 9:36 ` Tianjia Zhang 2022-09-26 9:36 ` [PATCH 13/16] crypto: arm64/sm4 - add CE implementation for cmac/xcbc/cbcmac Tianjia Zhang 2022-09-26 9:36 ` Tianjia Zhang 2022-09-26 9:36 ` [PATCH 14/16] crypto: arm64/sm4 - add CE implementation for CCM mode Tianjia Zhang 2022-09-26 9:36 ` Tianjia Zhang 2022-09-26 9:36 ` [PATCH 15/16] crypto: arm64/sm4 - add CE implementation for GCM mode Tianjia Zhang 2022-09-26 9:36 ` Tianjia Zhang 2022-09-26 9:36 ` [PATCH 16/16] crypto: arm64/sm4 - add ARMv9 SVE cryptography acceleration implementation Tianjia Zhang 2022-09-26 9:36 ` Tianjia Zhang 2022-09-26 10:02 ` Ard Biesheuvel 2022-09-26 10:02 ` Ard Biesheuvel 2022-09-26 17:14 ` Mark Brown 2022-09-26 17:14 ` Mark Brown 2022-09-27 4:30 ` Tianjia Zhang 2022-09-27 4:30 ` Tianjia Zhang 2022-09-27 4:26 ` Tianjia Zhang 2022-09-27 4:26 ` Tianjia Zhang
Reply instructions: You may reply publicly to this message via plain-text email using any one of the following methods: * Save the following mbox file, import it into your mail client, and reply-to-all from there: mbox Avoid top-posting and favor interleaved quoting: https://en.wikipedia.org/wiki/Posting_style#Interleaved_style * Reply using the --to, --cc, and --in-reply-to switches of git-send-email(1): git send-email \ --in-reply-to=20220926093620.99898-1-tianjia.zhang@linux.alibaba.com \ --to=tianjia.zhang@linux.alibaba.com \ --cc=alexandre.torgue@foss.st.com \ --cc=ardb@kernel.org \ --cc=catalin.marinas@arm.com \ --cc=davem@davemloft.net \ --cc=ebiggers@kernel.org \ --cc=herbert@gondor.apana.org.au \ --cc=jussi.kivilinna@iki.fi \ --cc=linux-arm-kernel@lists.infradead.org \ --cc=linux-crypto@vger.kernel.org \ --cc=linux-kernel@vger.kernel.org \ --cc=linux-stm32@st-md-mailman.stormreply.com \ --cc=mcoquelin.stm32@gmail.com \ --cc=will@kernel.org \ /path/to/YOUR_REPLY https://kernel.org/pub/software/scm/git/docs/git-send-email.html * If your mail client supports setting the In-Reply-To header via mailto: links, try the mailto: linkBe sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes, see mirroring instructions on how to clone and mirror all data and code used by this external index.