All of lore.kernel.org
 help / color / mirror / Atom feed
* [PATCH v2] x86/insn, tools/x86: Fix some potential undefined behavior.
@ 2020-10-15 16:12 Ian Rogers
  2020-10-15 21:34 ` hpa
  2020-10-16  1:33 ` Masami Hiramatsu
  0 siblings, 2 replies; 7+ messages in thread
From: Ian Rogers @ 2020-10-15 16:12 UTC (permalink / raw)
  To: Thomas Gleixner, Ingo Molnar, Borislav Petkov, x86,
	H. Peter Anvin, Peter Zijlstra (Intel),
	Masami Hiramatsu, Josh Poimboeuf, linux-kernel, Adrian Hunter,
	Arnaldo Carvalho de Melo
  Cc: Numfor Mbiziwo-Tiapo, Ian Rogers

From: Numfor Mbiziwo-Tiapo <nums@google.com>

Don't perform unaligned loads in __get_next and __peek_nbyte_next as
these are forms of undefined behavior.

These problems were identified using the undefined behavior sanitizer
(ubsan) with the tools version of the code and perf test. Part of this
patch was previously posted here:
https://lore.kernel.org/lkml/20190724184512.162887-4-nums@google.com/

v2. removes the validate_next check and merges the 2 changes into one as
requested by Masami Hiramatsu <mhiramat@kernel.org>

Signed-off-by: Ian Rogers <irogers@google.com>
Signed-off-by: Numfor Mbiziwo-Tiapo <nums@google.com>
---
 arch/x86/lib/insn.c       | 4 ++--
 tools/arch/x86/lib/insn.c | 4 ++--
 2 files changed, 4 insertions(+), 4 deletions(-)

diff --git a/arch/x86/lib/insn.c b/arch/x86/lib/insn.c
index 404279563891..be88ab250146 100644
--- a/arch/x86/lib/insn.c
+++ b/arch/x86/lib/insn.c
@@ -20,10 +20,10 @@
 	((insn)->next_byte + sizeof(t) + n <= (insn)->end_kaddr)
 
 #define __get_next(t, insn)	\
-	({ t r = *(t*)insn->next_byte; insn->next_byte += sizeof(t); r; })
+	({ t r; memcpy(&r, insn->next_byte, sizeof(t)); insn->next_byte += sizeof(t); r; })
 
 #define __peek_nbyte_next(t, insn, n)	\
-	({ t r = *(t*)((insn)->next_byte + n); r; })
+	({ t r; memcpy(&r, (insn)->next_byte + n, sizeof(t)); r; })
 
 #define get_next(t, insn)	\
 	({ if (unlikely(!validate_next(t, insn, 0))) goto err_out; __get_next(t, insn); })
diff --git a/tools/arch/x86/lib/insn.c b/tools/arch/x86/lib/insn.c
index 0151dfc6da61..92358c71a59e 100644
--- a/tools/arch/x86/lib/insn.c
+++ b/tools/arch/x86/lib/insn.c
@@ -20,10 +20,10 @@
 	((insn)->next_byte + sizeof(t) + n <= (insn)->end_kaddr)
 
 #define __get_next(t, insn)	\
-	({ t r = *(t*)insn->next_byte; insn->next_byte += sizeof(t); r; })
+	({ t r; memcpy(&r, insn->next_byte, sizeof(t)); insn->next_byte += sizeof(t); r; })
 
 #define __peek_nbyte_next(t, insn, n)	\
-	({ t r = *(t*)((insn)->next_byte + n); r; })
+	({ t r; memcpy(&r, (insn)->next_byte + n, sizeof(t)); r; })
 
 #define get_next(t, insn)	\
 	({ if (unlikely(!validate_next(t, insn, 0))) goto err_out; __get_next(t, insn); })
-- 
2.28.0.1011.ga647a8990f-goog


^ permalink raw reply related	[flat|nested] 7+ messages in thread

* Re: [PATCH v2] x86/insn, tools/x86: Fix some potential undefined behavior.
  2020-10-15 16:12 [PATCH v2] x86/insn, tools/x86: Fix some potential undefined behavior Ian Rogers
@ 2020-10-15 21:34 ` hpa
  2020-10-15 21:47   ` Ian Rogers
                     ` (2 more replies)
  2020-10-16  1:33 ` Masami Hiramatsu
  1 sibling, 3 replies; 7+ messages in thread
From: hpa @ 2020-10-15 21:34 UTC (permalink / raw)
  To: Ian Rogers, Thomas Gleixner, Ingo Molnar, Borislav Petkov, x86,
	Peter Zijlstra (Intel),
	Masami Hiramatsu, Josh Poimboeuf, linux-kernel, Adrian Hunter,
	Arnaldo Carvalho de Melo
  Cc: Numfor Mbiziwo-Tiapo

On October 15, 2020 9:12:16 AM PDT, Ian Rogers <irogers@google.com> wrote:
>From: Numfor Mbiziwo-Tiapo <nums@google.com>
>
>Don't perform unaligned loads in __get_next and __peek_nbyte_next as
>these are forms of undefined behavior.
>
>These problems were identified using the undefined behavior sanitizer
>(ubsan) with the tools version of the code and perf test. Part of this
>patch was previously posted here:
>https://lore.kernel.org/lkml/20190724184512.162887-4-nums@google.com/
>
>v2. removes the validate_next check and merges the 2 changes into one
>as
>requested by Masami Hiramatsu <mhiramat@kernel.org>
>
>Signed-off-by: Ian Rogers <irogers@google.com>
>Signed-off-by: Numfor Mbiziwo-Tiapo <nums@google.com>
>---
> arch/x86/lib/insn.c       | 4 ++--
> tools/arch/x86/lib/insn.c | 4 ++--
> 2 files changed, 4 insertions(+), 4 deletions(-)
>
>diff --git a/arch/x86/lib/insn.c b/arch/x86/lib/insn.c
>index 404279563891..be88ab250146 100644
>--- a/arch/x86/lib/insn.c
>+++ b/arch/x86/lib/insn.c
>@@ -20,10 +20,10 @@
> 	((insn)->next_byte + sizeof(t) + n <= (insn)->end_kaddr)
> 
> #define __get_next(t, insn)	\
>-	({ t r = *(t*)insn->next_byte; insn->next_byte += sizeof(t); r; })
>+	({ t r; memcpy(&r, insn->next_byte, sizeof(t)); insn->next_byte +=
>sizeof(t); r; })
> 
> #define __peek_nbyte_next(t, insn, n)	\
>-	({ t r = *(t*)((insn)->next_byte + n); r; })
>+	({ t r; memcpy(&r, (insn)->next_byte + n, sizeof(t)); r; })
> 
> #define get_next(t, insn)	\
>	({ if (unlikely(!validate_next(t, insn, 0))) goto err_out;
>__get_next(t, insn); })
>diff --git a/tools/arch/x86/lib/insn.c b/tools/arch/x86/lib/insn.c
>index 0151dfc6da61..92358c71a59e 100644
>--- a/tools/arch/x86/lib/insn.c
>+++ b/tools/arch/x86/lib/insn.c
>@@ -20,10 +20,10 @@
> 	((insn)->next_byte + sizeof(t) + n <= (insn)->end_kaddr)
> 
> #define __get_next(t, insn)	\
>-	({ t r = *(t*)insn->next_byte; insn->next_byte += sizeof(t); r; })
>+	({ t r; memcpy(&r, insn->next_byte, sizeof(t)); insn->next_byte +=
>sizeof(t); r; })
> 
> #define __peek_nbyte_next(t, insn, n)	\
>-	({ t r = *(t*)((insn)->next_byte + n); r; })
>+	({ t r; memcpy(&r, (insn)->next_byte + n, sizeof(t)); r; })
> 
> #define get_next(t, insn)	\
>	({ if (unlikely(!validate_next(t, insn, 0))) goto err_out;
>__get_next(t, insn); })

Wait, what?

You are taking about x86-specific code, and on x86 unaligned memory accesses are supported, well-defined, and ubiquitous. 

This is B.S. at best, and unless the compiler turns the memcpy() right back into what you started with, deleterious for performance.

If you have a *very* good reason for this kind of churn, wrap it in the unaligned access macros, but using memcpy() is insane. All you are doing is making the code worse.

-- 
Sent from my Android device with K-9 Mail. Please excuse my brevity.

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [PATCH v2] x86/insn, tools/x86: Fix some potential undefined behavior.
  2020-10-15 21:34 ` hpa
@ 2020-10-15 21:47   ` Ian Rogers
  2020-10-16  8:24     ` David Laight
  2020-10-16  8:47   ` David Laight
  2020-10-16  9:37   ` Peter Zijlstra
  2 siblings, 1 reply; 7+ messages in thread
From: Ian Rogers @ 2020-10-15 21:47 UTC (permalink / raw)
  To: hpa
  Cc: Thomas Gleixner, Ingo Molnar, Borislav Petkov, x86,
	Peter Zijlstra (Intel),
	Masami Hiramatsu, Josh Poimboeuf, LKML, Adrian Hunter,
	Arnaldo Carvalho de Melo, Numfor Mbiziwo-Tiapo

On Thu, Oct 15, 2020 at 2:35 PM <hpa@zytor.com> wrote:
>
> On October 15, 2020 9:12:16 AM PDT, Ian Rogers <irogers@google.com> wrote:
> >From: Numfor Mbiziwo-Tiapo <nums@google.com>
> >
> >Don't perform unaligned loads in __get_next and __peek_nbyte_next as
> >these are forms of undefined behavior.
> >
> >These problems were identified using the undefined behavior sanitizer
> >(ubsan) with the tools version of the code and perf test. Part of this
> >patch was previously posted here:
> >https://lore.kernel.org/lkml/20190724184512.162887-4-nums@google.com/
> >
> >v2. removes the validate_next check and merges the 2 changes into one
> >as
> >requested by Masami Hiramatsu <mhiramat@kernel.org>
> >
> >Signed-off-by: Ian Rogers <irogers@google.com>
> >Signed-off-by: Numfor Mbiziwo-Tiapo <nums@google.com>
> >---
> > arch/x86/lib/insn.c       | 4 ++--
> > tools/arch/x86/lib/insn.c | 4 ++--
> > 2 files changed, 4 insertions(+), 4 deletions(-)
> >
> >diff --git a/arch/x86/lib/insn.c b/arch/x86/lib/insn.c
> >index 404279563891..be88ab250146 100644
> >--- a/arch/x86/lib/insn.c
> >+++ b/arch/x86/lib/insn.c
> >@@ -20,10 +20,10 @@
> >       ((insn)->next_byte + sizeof(t) + n <= (insn)->end_kaddr)
> >
> > #define __get_next(t, insn)   \
> >-      ({ t r = *(t*)insn->next_byte; insn->next_byte += sizeof(t); r; })
> >+      ({ t r; memcpy(&r, insn->next_byte, sizeof(t)); insn->next_byte +=
> >sizeof(t); r; })
> >
> > #define __peek_nbyte_next(t, insn, n) \
> >-      ({ t r = *(t*)((insn)->next_byte + n); r; })
> >+      ({ t r; memcpy(&r, (insn)->next_byte + n, sizeof(t)); r; })
> >
> > #define get_next(t, insn)     \
> >       ({ if (unlikely(!validate_next(t, insn, 0))) goto err_out;
> >__get_next(t, insn); })
> >diff --git a/tools/arch/x86/lib/insn.c b/tools/arch/x86/lib/insn.c
> >index 0151dfc6da61..92358c71a59e 100644
> >--- a/tools/arch/x86/lib/insn.c
> >+++ b/tools/arch/x86/lib/insn.c
> >@@ -20,10 +20,10 @@
> >       ((insn)->next_byte + sizeof(t) + n <= (insn)->end_kaddr)
> >
> > #define __get_next(t, insn)   \
> >-      ({ t r = *(t*)insn->next_byte; insn->next_byte += sizeof(t); r; })
> >+      ({ t r; memcpy(&r, insn->next_byte, sizeof(t)); insn->next_byte +=
> >sizeof(t); r; })
> >
> > #define __peek_nbyte_next(t, insn, n) \
> >-      ({ t r = *(t*)((insn)->next_byte + n); r; })
> >+      ({ t r; memcpy(&r, (insn)->next_byte + n, sizeof(t)); r; })
> >
> > #define get_next(t, insn)     \
> >       ({ if (unlikely(!validate_next(t, insn, 0))) goto err_out;
> >__get_next(t, insn); })
>
> Wait, what?
>
> You are taking about x86-specific code, and on x86 unaligned memory accesses are supported, well-defined, and ubiquitous.

On why this is undefined behavior:
https://lore.kernel.org/lkml/CAP-5=fU2XBoOa2=00VCuWYqsLUzMSMzUXY63ZJt9rz-NJ+vYwA@mail.gmail.com/

> This is B.S. at best, and unless the compiler turns the memcpy() right back into what you started with, deleterious for performance.

On performance, the memcpys are fixed size and so lowered to loads on
x86 by any reasonable compiler. See the thread above.

> If you have a *very* good reason for this kind of churn, wrap it in the unaligned access macros, but using memcpy() is insane. All you are doing is making the code worse.

The decoder is a shared code and using unaligned macros makes life
hard for the other users of the code. Memcpy is the "standard"
workaround for this kind of undefined behavior.
https://lore.kernel.org/lkml/e4269cb2-d8e6-da26-6afd-a9df72d4be36@intel.com/

For motivation, beyond just having perf be sanitizer clean, see discussion here:
https://lore.kernel.org/lkml/CAP-5=fUoSGy3NAzTSbF3YLEPABSs7oPsxLkCx36XkEzzm341yw@mail.gmail.com/
https://lore.kernel.org/lkml/160208761921.7002.1321765913567405137.tip-bot2@tip-bot2/

Thanks,
Ian

> --
> Sent from my Android device with K-9 Mail. Please excuse my brevity.

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [PATCH v2] x86/insn, tools/x86: Fix some potential undefined behavior.
  2020-10-15 16:12 [PATCH v2] x86/insn, tools/x86: Fix some potential undefined behavior Ian Rogers
  2020-10-15 21:34 ` hpa
@ 2020-10-16  1:33 ` Masami Hiramatsu
  1 sibling, 0 replies; 7+ messages in thread
From: Masami Hiramatsu @ 2020-10-16  1:33 UTC (permalink / raw)
  To: Ian Rogers
  Cc: Thomas Gleixner, Ingo Molnar, Borislav Petkov, x86,
	H. Peter Anvin, Peter Zijlstra (Intel),
	Josh Poimboeuf, linux-kernel, Adrian Hunter,
	Arnaldo Carvalho de Melo, Numfor Mbiziwo-Tiapo

On Thu, 15 Oct 2020 09:12:16 -0700
Ian Rogers <irogers@google.com> wrote:

> From: Numfor Mbiziwo-Tiapo <nums@google.com>
> 
> Don't perform unaligned loads in __get_next and __peek_nbyte_next as
> these are forms of undefined behavior.
> 
> These problems were identified using the undefined behavior sanitizer
> (ubsan) with the tools version of the code and perf test. Part of this
> patch was previously posted here:
> https://lore.kernel.org/lkml/20190724184512.162887-4-nums@google.com/
> 
> v2. removes the validate_next check and merges the 2 changes into one as
> requested by Masami Hiramatsu <mhiramat@kernel.org>

Looks good to me. This may be OK on x86, but now this code will be run
on other arches for cross compilation.

Acked-by: Masami Hiramatsu <mhiramat@kernel.org>

Thank you,

> 
> Signed-off-by: Ian Rogers <irogers@google.com>
> Signed-off-by: Numfor Mbiziwo-Tiapo <nums@google.com>
> ---
>  arch/x86/lib/insn.c       | 4 ++--
>  tools/arch/x86/lib/insn.c | 4 ++--
>  2 files changed, 4 insertions(+), 4 deletions(-)
> 
> diff --git a/arch/x86/lib/insn.c b/arch/x86/lib/insn.c
> index 404279563891..be88ab250146 100644
> --- a/arch/x86/lib/insn.c
> +++ b/arch/x86/lib/insn.c
> @@ -20,10 +20,10 @@
>  	((insn)->next_byte + sizeof(t) + n <= (insn)->end_kaddr)
>  
>  #define __get_next(t, insn)	\
> -	({ t r = *(t*)insn->next_byte; insn->next_byte += sizeof(t); r; })
> +	({ t r; memcpy(&r, insn->next_byte, sizeof(t)); insn->next_byte += sizeof(t); r; })
>  
>  #define __peek_nbyte_next(t, insn, n)	\
> -	({ t r = *(t*)((insn)->next_byte + n); r; })
> +	({ t r; memcpy(&r, (insn)->next_byte + n, sizeof(t)); r; })
>  
>  #define get_next(t, insn)	\
>  	({ if (unlikely(!validate_next(t, insn, 0))) goto err_out; __get_next(t, insn); })
> diff --git a/tools/arch/x86/lib/insn.c b/tools/arch/x86/lib/insn.c
> index 0151dfc6da61..92358c71a59e 100644
> --- a/tools/arch/x86/lib/insn.c
> +++ b/tools/arch/x86/lib/insn.c
> @@ -20,10 +20,10 @@
>  	((insn)->next_byte + sizeof(t) + n <= (insn)->end_kaddr)
>  
>  #define __get_next(t, insn)	\
> -	({ t r = *(t*)insn->next_byte; insn->next_byte += sizeof(t); r; })
> +	({ t r; memcpy(&r, insn->next_byte, sizeof(t)); insn->next_byte += sizeof(t); r; })
>  
>  #define __peek_nbyte_next(t, insn, n)	\
> -	({ t r = *(t*)((insn)->next_byte + n); r; })
> +	({ t r; memcpy(&r, (insn)->next_byte + n, sizeof(t)); r; })
>  
>  #define get_next(t, insn)	\
>  	({ if (unlikely(!validate_next(t, insn, 0))) goto err_out; __get_next(t, insn); })
> -- 
> 2.28.0.1011.ga647a8990f-goog
> 


-- 
Masami Hiramatsu <mhiramat@kernel.org>

^ permalink raw reply	[flat|nested] 7+ messages in thread

* RE: [PATCH v2] x86/insn, tools/x86: Fix some potential undefined behavior.
  2020-10-15 21:47   ` Ian Rogers
@ 2020-10-16  8:24     ` David Laight
  0 siblings, 0 replies; 7+ messages in thread
From: David Laight @ 2020-10-16  8:24 UTC (permalink / raw)
  To: 'Ian Rogers', hpa
  Cc: Thomas Gleixner, Ingo Molnar, Borislav Petkov, x86,
	Peter Zijlstra (Intel),
	Masami Hiramatsu, Josh Poimboeuf, LKML, Adrian Hunter,
	Arnaldo Carvalho de Melo, Numfor Mbiziwo-Tiapo

From: Ian Rogers
> Sent: 15 October 2020 22:47
...
> The decoder is a shared code and using unaligned macros makes life
> hard for the other users of the code. Memcpy is the "standard"
> workaround for this kind of undefined behavior.
> https://lore.kernel.org/lkml/e4269cb2-d8e6-da26-6afd-a9df72d4be36@intel.com/

You can't always use memcpy() to copy unaligned data.
If the compiler can find any reason why the misaligned address
should be aligned (eg because it's type prior to some casts
requires alignment) it will use word-sized accesses that will fault.

Now in this specific code the pointers are probably all 'char *'
so have alignment 1 - so memcpy() will DTRT.

But it may generate an expensive function call.

There are 'read unaligned 32bit' (etc) macros out there.

	David

-
Registered Address Lakeside, Bramley Road, Mount Farm, Milton Keynes, MK1 1PT, UK
Registration No: 1397386 (Wales)

^ permalink raw reply	[flat|nested] 7+ messages in thread

* RE: [PATCH v2] x86/insn, tools/x86: Fix some potential undefined behavior.
  2020-10-15 21:34 ` hpa
  2020-10-15 21:47   ` Ian Rogers
@ 2020-10-16  8:47   ` David Laight
  2020-10-16  9:37   ` Peter Zijlstra
  2 siblings, 0 replies; 7+ messages in thread
From: David Laight @ 2020-10-16  8:47 UTC (permalink / raw)
  To: 'hpa@zytor.com',
	Ian Rogers, Thomas Gleixner, Ingo Molnar, Borislav Petkov, x86,
	Peter Zijlstra (Intel),
	Masami Hiramatsu, Josh Poimboeuf, linux-kernel, Adrian Hunter,
	Arnaldo Carvalho de Melo
  Cc: Numfor Mbiziwo-Tiapo

From: hpa@zytor.com
> Sent: 15 October 2020 22:35
> >Don't perform unaligned loads in __get_next and __peek_nbyte_next as
> >these are forms of undefined behavior.
...
> > #define __get_next(t, insn)	\
> >-	({ t r = *(t*)insn->next_byte; insn->next_byte += sizeof(t); r; })
> >+	({ t r; memcpy(&r, insn->next_byte, sizeof(t)); insn->next_byte += sizeof(t); r; })

Interesting other idea.
Can you add an 'aligned(1)' into the '*(t *)' cast?
I think '*(t aligned(1) *)' is the right cast.
(gcc seems to accept it either size of the type.)

	David

-
Registered Address Lakeside, Bramley Road, Mount Farm, Milton Keynes, MK1 1PT, UK
Registration No: 1397386 (Wales)

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [PATCH v2] x86/insn, tools/x86: Fix some potential undefined behavior.
  2020-10-15 21:34 ` hpa
  2020-10-15 21:47   ` Ian Rogers
  2020-10-16  8:47   ` David Laight
@ 2020-10-16  9:37   ` Peter Zijlstra
  2 siblings, 0 replies; 7+ messages in thread
From: Peter Zijlstra @ 2020-10-16  9:37 UTC (permalink / raw)
  To: hpa
  Cc: Ian Rogers, Thomas Gleixner, Ingo Molnar, Borislav Petkov, x86,
	Masami Hiramatsu, Josh Poimboeuf, linux-kernel, Adrian Hunter,
	Arnaldo Carvalho de Melo, Numfor Mbiziwo-Tiapo

On Thu, Oct 15, 2020 at 02:34:57PM -0700, hpa@zytor.com wrote:
> Wait, what?
> 
> You are taking about x86-specific code, and on x86 unaligned memory
> accesses are supported, well-defined, and ubiquitous. 

Objtool uses this x86 instruction decoder, people are cross-buildling
objtool to cross-build x86 kernels.

So it's entirely possible to run the x86 insn decode on BE RISC, which
very much doesn't like unaligned loads.

^ permalink raw reply	[flat|nested] 7+ messages in thread

end of thread, other threads:[~2020-10-16  9:37 UTC | newest]

Thread overview: 7+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2020-10-15 16:12 [PATCH v2] x86/insn, tools/x86: Fix some potential undefined behavior Ian Rogers
2020-10-15 21:34 ` hpa
2020-10-15 21:47   ` Ian Rogers
2020-10-16  8:24     ` David Laight
2020-10-16  8:47   ` David Laight
2020-10-16  9:37   ` Peter Zijlstra
2020-10-16  1:33 ` Masami Hiramatsu

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.