From: Ard Biesheuvel <ard.biesheuvel@linaro.org>
To: Arnd Bergmann <arnd@arndb.de>
Cc: Russell King <linux@armlinux.org.uk>,
Romain Izard <romain.izard.pro@gmail.com>,
Sven Schmidt <4sschmid@informatik.uni-hamburg.de>,
LKML <linux-kernel@vger.kernel.org>,
"linux-arm-kernel@lists.infradead.org"
<linux-arm-kernel@lists.infradead.org>,
Gregory CLEMENT <gregory.clement@free-electrons.com>,
Petr Cvek <petrcvekcz@gmail.com>,
Aaro Koskinen <aaro.koskinen@iki.fi>,
Andrea Adami <andrea.adami@gmail.com>,
Robert Jarzmik <robert.jarzmik@free.fr>
Subject: Re: [PATCH] ARM: add a private asm/unaligned.h
Date: Fri, 20 Oct 2017 21:22:11 +0100 [thread overview]
Message-ID: <CAKv+Gu-opWFqrPxEK2zSUp4swBLtPPECecMuSDbRvhQDGvWfWg@mail.gmail.com> (raw)
In-Reply-To: <20171020200231.1355569-1-arnd@arndb.de>
On 20 October 2017 at 21:01, Arnd Bergmann <arnd@arndb.de> wrote:
> The asm-generic/unaligned.h header provides two different implementations
> for accessing unaligned variables: the access_ok.h version used when
> CONFIG_HAVE_EFFICIENT_UNALIGNED_ACCESS is set pretends that all pointers
> are in fact aligned, while the le_struct.h version convinces gcc that the
> alignment of a pointer is '1', to make it issue the correct load/store
> instructions depending on the architecture flags.
>
> On ARMv5 and older, we always use the second version, to let the compiler
> use byte accesses. On ARMv6 and newer, we currently use the access_ok.h
> version, so the compiler can use any instruction including stm/ldm and
> ldrd/strd that will cause an alignment trap. This trap can significantly
> impact performance when we have to do a lot of fixups and, worse, has
> led to crashes in the LZ4 decompressor code that does not have a trap
> handler.
>
> This adds an ARM specific version of asm/unaligned.h that uses the
> le_struct.h/be_struct.h implementation unconditionally. This should lead
> to essentially the same code on ARMv6+ as before, with the exception of
> using regular load/store instructions instead of the trapping instructions
> multi-register variants.
>
> The crash in the LZ4 decompressor code was probably introduced by the
> patch replacing the LZ4 implementation, commit 4e1a33b105dd ("lib: update
> LZ4 compressor module"), so linux-4.11 and higher would be affected most.
> However, we probably want to have this backported to all older stable
> kernels as well, to help with the performance issues.
>
> There are two follow-ups that I think we should also work on, but not
> backport to stable kernels, first to change the asm-generic version of
> the header to remove the ARM special case, and second to review all
> other uses of CONFIG_HAVE_EFFICIENT_UNALIGNED_ACCESS to see if they
> might be affected by the same problem on ARM.
>
> Cc: stable@vger.kernel.org
> Signed-off-by: Arnd Bergmann <arnd@arndb.de>
Acked-by: Ard Biesheuvel <ard.biesheuvel@linaro.org>
> ---
> Untested so far, please verify that this fixes all the known problems
> with the alignment traps.
> ---
> arch/arm/include/asm/Kbuild | 1 -
> arch/arm/include/asm/unaligned.h | 27 +++++++++++++++++++++++++++
> 2 files changed, 27 insertions(+), 1 deletion(-)
> create mode 100644 arch/arm/include/asm/unaligned.h
>
> diff --git a/arch/arm/include/asm/Kbuild b/arch/arm/include/asm/Kbuild
> index 721ab5ecfb9b..0f2c8a2a8131 100644
> --- a/arch/arm/include/asm/Kbuild
> +++ b/arch/arm/include/asm/Kbuild
> @@ -20,7 +20,6 @@ generic-y += simd.h
> generic-y += sizes.h
> generic-y += timex.h
> generic-y += trace_clock.h
> -generic-y += unaligned.h
>
> generated-y += mach-types.h
> generated-y += unistd-nr.h
> diff --git a/arch/arm/include/asm/unaligned.h b/arch/arm/include/asm/unaligned.h
> new file mode 100644
> index 000000000000..ab905ffcf193
> --- /dev/null
> +++ b/arch/arm/include/asm/unaligned.h
> @@ -0,0 +1,27 @@
> +#ifndef __ASM_ARM_UNALIGNED_H
> +#define __ASM_ARM_UNALIGNED_H
> +
> +/*
> + * We generally want to set CONFIG_HAVE_EFFICIENT_UNALIGNED_ACCESS on ARMv6+,
> + * but we don't want to use linux/unaligned/access_ok.h since that can lead
> + * to traps on unaligned stm/ldm or strd/ldrd.
> + */
> +#include <asm/byteorder.h>
> +
> +#if defined(__LITTLE_ENDIAN)
> +# include <linux/unaligned/le_struct.h>
> +# include <linux/unaligned/be_byteshift.h>
> +# include <linux/unaligned/generic.h>
> +# define get_unaligned __get_unaligned_le
> +# define put_unaligned __put_unaligned_le
> +#elif defined(__BIG_ENDIAN)
> +# include <linux/unaligned/be_struct.h>
> +# include <linux/unaligned/le_byteshift.h>
> +# include <linux/unaligned/generic.h>
> +# define get_unaligned __get_unaligned_be
> +# define put_unaligned __put_unaligned_be
> +#else
> +# error need to define endianess
> +#endif
> +
> +#endif /* __ASM_ARM_UNALIGNED_H */
> --
> 2.9.0
>
next prev parent reply other threads:[~2017-10-20 20:22 UTC|newest]
Thread overview: 35+ messages / expand[flat|nested] mbox.gz Atom feed top
2017-10-20 20:01 [PATCH] ARM: add a private asm/unaligned.h Arnd Bergmann
2017-10-20 20:22 ` Ard Biesheuvel [this message]
2017-10-23 15:04 ` Romain Izard
2017-10-27 15:19 ` Gregory CLEMENT
2017-10-27 15:27 ` Russell King - ARM Linux
2017-10-30 13:48 ` Gregory CLEMENT
2017-10-30 14:55 ` Ard Biesheuvel
2017-10-30 15:05 ` Gregory CLEMENT
2017-10-30 15:07 ` Ard Biesheuvel
2017-10-30 15:09 ` Gregory CLEMENT
2017-10-30 15:20 ` Ard Biesheuvel
2017-10-30 15:33 ` Gregory CLEMENT
2017-10-30 15:35 ` Ard Biesheuvel
2017-10-30 15:40 ` Gregory CLEMENT
2017-10-30 16:59 ` Russell King - ARM Linux
2017-10-30 15:55 ` Russell King - ARM Linux
2017-10-30 16:04 ` Gregory CLEMENT
2017-10-30 15:48 ` Russell King - ARM Linux
2017-10-30 16:01 ` Gregory CLEMENT
2017-10-30 16:12 ` Russell King - ARM Linux
2017-10-30 16:24 ` Gregory CLEMENT
2017-10-30 16:38 ` Russell King - ARM Linux
2017-10-31 12:47 ` Russell King - ARM Linux
2017-10-31 12:57 ` Ard Biesheuvel
2017-10-31 13:22 ` Gregory CLEMENT
2017-11-01 15:57 ` Ard Biesheuvel
2017-11-01 18:00 ` Russell King - ARM Linux
2017-11-01 18:02 ` Ard Biesheuvel
2017-11-01 18:11 ` Russell King - ARM Linux
2017-11-01 18:20 ` Ard Biesheuvel
2017-11-01 19:10 ` Russell King - ARM Linux
2017-10-30 15:09 ` Arnd Bergmann
2017-10-30 15:50 ` Russell King - ARM Linux
2017-10-30 17:01 ` Arnd Bergmann
2017-10-30 17:13 ` Russell King - ARM Linux
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=CAKv+Gu-opWFqrPxEK2zSUp4swBLtPPECecMuSDbRvhQDGvWfWg@mail.gmail.com \
--to=ard.biesheuvel@linaro.org \
--cc=4sschmid@informatik.uni-hamburg.de \
--cc=aaro.koskinen@iki.fi \
--cc=andrea.adami@gmail.com \
--cc=arnd@arndb.de \
--cc=gregory.clement@free-electrons.com \
--cc=linux-arm-kernel@lists.infradead.org \
--cc=linux-kernel@vger.kernel.org \
--cc=linux@armlinux.org.uk \
--cc=petrcvekcz@gmail.com \
--cc=robert.jarzmik@free.fr \
--cc=romain.izard.pro@gmail.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).