All of lore.kernel.org
 help / color / mirror / Atom feed
* [PATCH] crypto: arm64/neonbs - fix out-of-bounds access on short input
@ 2024-02-17 16:11 Ard Biesheuvel
  2024-02-21 23:37 ` Ard Biesheuvel
  2024-02-22  6:34 ` Eric Biggers
  0 siblings, 2 replies; 5+ messages in thread
From: Ard Biesheuvel @ 2024-02-17 16:11 UTC (permalink / raw)
  To: linux-crypto; +Cc: herbert, Ard Biesheuvel, stable, syzbot+f1ceaa1a09ab891e1934

From: Ard Biesheuvel <ardb@kernel.org>

The bit-sliced implementation of AES-CTR operates on blocks of 128
bytes, and will fall back to the plain NEON version for tail blocks or
inputs that are shorter than 128 bytes to begin with.

It will call straight into the plain NEON asm helper, which performs all
memory accesses in granules of 16 bytes (the size of a NEON register).
For this reason, the associated plain NEON glue code will copy inputs
shorter than 16 bytes into a temporary buffer, given that this is a rare
occurrence and it is not worth the effort to work around this in the asm
code.

The fallback from the bit-sliced NEON version fails to take this into
account, potentially resulting in out-of-bounds accesses. So clone the
same workaround, and use a temp buffer for short in/outputs.

Cc: <stable@vger.kernel.org>
Reported-by: syzbot+f1ceaa1a09ab891e1934@syzkaller.appspotmail.com
Tested-by: syzbot+f1ceaa1a09ab891e1934@syzkaller.appspotmail.com
Signed-off-by: Ard Biesheuvel <ardb@kernel.org>
---
 arch/arm64/crypto/aes-neonbs-glue.c | 11 +++++++++++
 1 file changed, 11 insertions(+)

diff --git a/arch/arm64/crypto/aes-neonbs-glue.c b/arch/arm64/crypto/aes-neonbs-glue.c
index bac4cabef607..849dc41320db 100644
--- a/arch/arm64/crypto/aes-neonbs-glue.c
+++ b/arch/arm64/crypto/aes-neonbs-glue.c
@@ -227,8 +227,19 @@ static int ctr_encrypt(struct skcipher_request *req)
 			src += blocks * AES_BLOCK_SIZE;
 		}
 		if (nbytes && walk.nbytes == walk.total) {
+			u8 buf[AES_BLOCK_SIZE];
+			u8 *d = dst;
+
+			if (unlikely(nbytes < AES_BLOCK_SIZE))
+				src = dst = memcpy(buf + sizeof(buf) - nbytes,
+						   src, nbytes);
+
 			neon_aes_ctr_encrypt(dst, src, ctx->enc, ctx->key.rounds,
 					     nbytes, walk.iv);
+
+			if (unlikely(nbytes < AES_BLOCK_SIZE))
+				memcpy(d, buf + sizeof(buf) - nbytes, nbytes);
+
 			nbytes = 0;
 		}
 		kernel_neon_end();
-- 
2.44.0.rc0.258.g7320e95886-goog


^ permalink raw reply related	[flat|nested] 5+ messages in thread

* Re: [PATCH] crypto: arm64/neonbs - fix out-of-bounds access on short input
  2024-02-17 16:11 [PATCH] crypto: arm64/neonbs - fix out-of-bounds access on short input Ard Biesheuvel
@ 2024-02-21 23:37 ` Ard Biesheuvel
  2024-02-22  6:08   ` Herbert Xu
  2024-02-22  6:34 ` Eric Biggers
  1 sibling, 1 reply; 5+ messages in thread
From: Ard Biesheuvel @ 2024-02-21 23:37 UTC (permalink / raw)
  To: Ard Biesheuvel; +Cc: linux-crypto, herbert, stable, syzbot+f1ceaa1a09ab891e1934

On Sat, 17 Feb 2024 at 17:12, Ard Biesheuvel <ardb+git@google.com> wrote:
>
> From: Ard Biesheuvel <ardb@kernel.org>
>
> The bit-sliced implementation of AES-CTR operates on blocks of 128
> bytes, and will fall back to the plain NEON version for tail blocks or
> inputs that are shorter than 128 bytes to begin with.
>
> It will call straight into the plain NEON asm helper, which performs all
> memory accesses in granules of 16 bytes (the size of a NEON register).
> For this reason, the associated plain NEON glue code will copy inputs
> shorter than 16 bytes into a temporary buffer, given that this is a rare
> occurrence and it is not worth the effort to work around this in the asm
> code.
>
> The fallback from the bit-sliced NEON version fails to take this into
> account, potentially resulting in out-of-bounds accesses. So clone the
> same workaround, and use a temp buffer for short in/outputs.
>
> Cc: <stable@vger.kernel.org>
> Reported-by: syzbot+f1ceaa1a09ab891e1934@syzkaller.appspotmail.com
> Tested-by: syzbot+f1ceaa1a09ab891e1934@syzkaller.appspotmail.com
> Signed-off-by: Ard Biesheuvel <ardb@kernel.org>

Ping?

> ---
>  arch/arm64/crypto/aes-neonbs-glue.c | 11 +++++++++++
>  1 file changed, 11 insertions(+)
>
> diff --git a/arch/arm64/crypto/aes-neonbs-glue.c b/arch/arm64/crypto/aes-neonbs-glue.c
> index bac4cabef607..849dc41320db 100644
> --- a/arch/arm64/crypto/aes-neonbs-glue.c
> +++ b/arch/arm64/crypto/aes-neonbs-glue.c
> @@ -227,8 +227,19 @@ static int ctr_encrypt(struct skcipher_request *req)
>                         src += blocks * AES_BLOCK_SIZE;
>                 }
>                 if (nbytes && walk.nbytes == walk.total) {
> +                       u8 buf[AES_BLOCK_SIZE];
> +                       u8 *d = dst;
> +
> +                       if (unlikely(nbytes < AES_BLOCK_SIZE))
> +                               src = dst = memcpy(buf + sizeof(buf) - nbytes,
> +                                                  src, nbytes);
> +
>                         neon_aes_ctr_encrypt(dst, src, ctx->enc, ctx->key.rounds,
>                                              nbytes, walk.iv);
> +
> +                       if (unlikely(nbytes < AES_BLOCK_SIZE))
> +                               memcpy(d, buf + sizeof(buf) - nbytes, nbytes);
> +
>                         nbytes = 0;
>                 }
>                 kernel_neon_end();
> --
> 2.44.0.rc0.258.g7320e95886-goog
>

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: [PATCH] crypto: arm64/neonbs - fix out-of-bounds access on short input
  2024-02-21 23:37 ` Ard Biesheuvel
@ 2024-02-22  6:08   ` Herbert Xu
  0 siblings, 0 replies; 5+ messages in thread
From: Herbert Xu @ 2024-02-22  6:08 UTC (permalink / raw)
  To: Ard Biesheuvel
  Cc: Ard Biesheuvel, linux-crypto, stable, syzbot+f1ceaa1a09ab891e1934

On Thu, Feb 22, 2024 at 12:37:45AM +0100, Ard Biesheuvel wrote:
> On Sat, 17 Feb 2024 at 17:12, Ard Biesheuvel <ardb+git@google.com> wrote:
> >
> > From: Ard Biesheuvel <ardb@kernel.org>
> >
> > The bit-sliced implementation of AES-CTR operates on blocks of 128
> > bytes, and will fall back to the plain NEON version for tail blocks or
> > inputs that are shorter than 128 bytes to begin with.
> >
> > It will call straight into the plain NEON asm helper, which performs all
> > memory accesses in granules of 16 bytes (the size of a NEON register).
> > For this reason, the associated plain NEON glue code will copy inputs
> > shorter than 16 bytes into a temporary buffer, given that this is a rare
> > occurrence and it is not worth the effort to work around this in the asm
> > code.
> >
> > The fallback from the bit-sliced NEON version fails to take this into
> > account, potentially resulting in out-of-bounds accesses. So clone the
> > same workaround, and use a temp buffer for short in/outputs.
> >
> > Cc: <stable@vger.kernel.org>
> > Reported-by: syzbot+f1ceaa1a09ab891e1934@syzkaller.appspotmail.com
> > Tested-by: syzbot+f1ceaa1a09ab891e1934@syzkaller.appspotmail.com
> > Signed-off-by: Ard Biesheuvel <ardb@kernel.org>
> 
> Ping?

It's in my queue.  Thanks.
-- 
Email: Herbert Xu <herbert@gondor.apana.org.au>
Home Page: http://gondor.apana.org.au/~herbert/
PGP Key: http://gondor.apana.org.au/~herbert/pubkey.txt

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: [PATCH] crypto: arm64/neonbs - fix out-of-bounds access on short input
  2024-02-17 16:11 [PATCH] crypto: arm64/neonbs - fix out-of-bounds access on short input Ard Biesheuvel
  2024-02-21 23:37 ` Ard Biesheuvel
@ 2024-02-22  6:34 ` Eric Biggers
  2024-02-22  8:18   ` Ard Biesheuvel
  1 sibling, 1 reply; 5+ messages in thread
From: Eric Biggers @ 2024-02-22  6:34 UTC (permalink / raw)
  To: Ard Biesheuvel
  Cc: linux-crypto, herbert, Ard Biesheuvel, stable,
	syzbot+f1ceaa1a09ab891e1934

On Sat, Feb 17, 2024 at 05:11:52PM +0100, Ard Biesheuvel wrote:
> From: Ard Biesheuvel <ardb@kernel.org>
> 
> The bit-sliced implementation of AES-CTR operates on blocks of 128
> bytes, and will fall back to the plain NEON version for tail blocks or
> inputs that are shorter than 128 bytes to begin with.
> 
> It will call straight into the plain NEON asm helper, which performs all
> memory accesses in granules of 16 bytes (the size of a NEON register).
> For this reason, the associated plain NEON glue code will copy inputs
> shorter than 16 bytes into a temporary buffer, given that this is a rare
> occurrence and it is not worth the effort to work around this in the asm
> code.
> 
> The fallback from the bit-sliced NEON version fails to take this into
> account, potentially resulting in out-of-bounds accesses. So clone the
> same workaround, and use a temp buffer for short in/outputs.
> 
> Cc: <stable@vger.kernel.org>
> Reported-by: syzbot+f1ceaa1a09ab891e1934@syzkaller.appspotmail.com
> Tested-by: syzbot+f1ceaa1a09ab891e1934@syzkaller.appspotmail.com
> Signed-off-by: Ard Biesheuvel <ardb@kernel.org>

Looks like this could use:

Fixes: fc074e130051 ("crypto: arm64/aes-neonbs-ctr - fallback to plain NEON for final chunk")

> +			if (unlikely(nbytes < AES_BLOCK_SIZE))
> +				src = dst = memcpy(buf + sizeof(buf) - nbytes,
> +						   src, nbytes);
> +
>  			neon_aes_ctr_encrypt(dst, src, ctx->enc, ctx->key.rounds,
>  					     nbytes, walk.iv);
> +
> +			if (unlikely(nbytes < AES_BLOCK_SIZE))
> +				memcpy(d, buf + sizeof(buf) - nbytes, nbytes);

The second one could use 'dst' instead of 'buf + sizeof(buf) - nbytes', right?

Otherwise this looks good.

Reviewed-by: Eric Biggers <ebiggers@google.com>

- Eric

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: [PATCH] crypto: arm64/neonbs - fix out-of-bounds access on short input
  2024-02-22  6:34 ` Eric Biggers
@ 2024-02-22  8:18   ` Ard Biesheuvel
  0 siblings, 0 replies; 5+ messages in thread
From: Ard Biesheuvel @ 2024-02-22  8:18 UTC (permalink / raw)
  To: Eric Biggers
  Cc: Ard Biesheuvel, linux-crypto, herbert, stable,
	syzbot+f1ceaa1a09ab891e1934

On Thu, 22 Feb 2024 at 07:34, Eric Biggers <ebiggers@kernel.org> wrote:
>
> On Sat, Feb 17, 2024 at 05:11:52PM +0100, Ard Biesheuvel wrote:
> > From: Ard Biesheuvel <ardb@kernel.org>
> >
> > The bit-sliced implementation of AES-CTR operates on blocks of 128
> > bytes, and will fall back to the plain NEON version for tail blocks or
> > inputs that are shorter than 128 bytes to begin with.
> >
> > It will call straight into the plain NEON asm helper, which performs all
> > memory accesses in granules of 16 bytes (the size of a NEON register).
> > For this reason, the associated plain NEON glue code will copy inputs
> > shorter than 16 bytes into a temporary buffer, given that this is a rare
> > occurrence and it is not worth the effort to work around this in the asm
> > code.
> >
> > The fallback from the bit-sliced NEON version fails to take this into
> > account, potentially resulting in out-of-bounds accesses. So clone the
> > same workaround, and use a temp buffer for short in/outputs.
> >
> > Cc: <stable@vger.kernel.org>
> > Reported-by: syzbot+f1ceaa1a09ab891e1934@syzkaller.appspotmail.com
> > Tested-by: syzbot+f1ceaa1a09ab891e1934@syzkaller.appspotmail.com
> > Signed-off-by: Ard Biesheuvel <ardb@kernel.org>
>
> Looks like this could use:
>
> Fixes: fc074e130051 ("crypto: arm64/aes-neonbs-ctr - fallback to plain NEON for final chunk")
>

Indeed.

> > +                     if (unlikely(nbytes < AES_BLOCK_SIZE))
> > +                             src = dst = memcpy(buf + sizeof(buf) - nbytes,
> > +                                                src, nbytes);
> > +
> >                       neon_aes_ctr_encrypt(dst, src, ctx->enc, ctx->key.rounds,
> >                                            nbytes, walk.iv);
> > +
> > +                     if (unlikely(nbytes < AES_BLOCK_SIZE))
> > +                             memcpy(d, buf + sizeof(buf) - nbytes, nbytes);
>
> The second one could use 'dst' instead of 'buf + sizeof(buf) - nbytes', right?
>

Correct.

> Otherwise this looks good.
>
> Reviewed-by: Eric Biggers <ebiggers@google.com>
>

I'll respin with these changes. Thanks.

^ permalink raw reply	[flat|nested] 5+ messages in thread

end of thread, other threads:[~2024-02-22  8:19 UTC | newest]

Thread overview: 5+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2024-02-17 16:11 [PATCH] crypto: arm64/neonbs - fix out-of-bounds access on short input Ard Biesheuvel
2024-02-21 23:37 ` Ard Biesheuvel
2024-02-22  6:08   ` Herbert Xu
2024-02-22  6:34 ` Eric Biggers
2024-02-22  8:18   ` Ard Biesheuvel

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.