linux-kernel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* [PATCH v2 0/4] kernel hacking: GCC optimization for better debug experience (-Og)
@ 2018-10-19 12:49 Du Changbin
  2018-10-19 12:49 ` [PATCH v2 1/4] x86/mm: surround level4_kernel_pgt with #ifdef CONFIG_X86_5LEVEL...#endif Du Changbin
                   ` (3 more replies)
  0 siblings, 4 replies; 14+ messages in thread
From: Du Changbin @ 2018-10-19 12:49 UTC (permalink / raw)
  To: yamada.masahiro, michal.lkml, tglx, mingo, linux, akpm
  Cc: rostedt, x86, linux-kbuild, linux-kernel, linux-arm-kernel,
	linux-sparse, robin.murphy, Du Changbin

Hi all,
I have posted this series several months ago but interrupted by personal
affairs. Now I get time to complete this task. Thanks for all of the
reviewers.

I know some kernel developers was searching for a method to dissable GCC
optimizations, probably they want to apply GCC '-O0' option. But since Linux
kernel relies on GCC optimization to remove some dead code, so '-O0' just
breaks the build. They do need this because they want to debug kernel with
qemu, simics, kgtp or kgdb.

Thanks for the GCC '-Og' optimization level introduced in GCC 4.8, which
offers a reasonable level of optimization while maintaining fast compilation
and a good debugging experience. It is similar to '-O1' while perferring to
keep debug ability over runtime speed. With '-Og', we can build a kernel with
better debug ability and little performance drop after some simple change.

In this series, firstly introduce a new config CONFIG_NO_AUTO_INLINE after two
fixes for this new option. With this option, only functions explicitly marked
with "inline" will  be inlined. This will allow the function tracer to trace
more functions because it only traces functions that the compiler has not
inlined.

Then introduce new config CC_OPTIMIZE_FOR_DEBUGGING which apply '-Og'
optimization level for whole kernel, with a simple fix in fix_to_virt().
Currently I have only tested this option on x86 and ARM platform. Other
platforms should also work but probably need some compiling fixes as what
having done in this series. I leave that to who want to try this debug
option.

Comparison of vmlinux size: a bit smaller.

    w/o CONFIG_CC_OPTIMIZE_FOR_DEBUGGING
    $ size vmlinux
       text    data     bss     dec     hex filename
    22665554   9709674  2920908 35296136        21a9388 vmlinux

    w/ CONFIG_CC_OPTIMIZE_FOR_DEBUGGING
    $ size vmlinux
       text    data     bss     dec     hex filename
    21499032   10102758 2920908 34522698        20ec64a vmlinux


Comparison of system performance: a bit drop (~6%).
    This benchmark of kernel compilation is suggested by Ingo Molnar.
    https://lkml.org/lkml/2018/5/2/74

    Preparation: Set cpufreq to 'performance'.
    for ((cpu=0; cpu<120; cpu++)); do
      G=/sys/devices/system/cpu/cpu$cpu/cpufreq/scaling_governor
      [ -f $G ] && echo performance > $G
    done

    w/o CONFIG_CC_OPTIMIZE_FOR_DEBUGGING
    $ perf stat --repeat 5 --null --pre                 '\
        cp -a kernel ../kernel.copy.$(date +%s);         \
        rm -rf *;                                        \
        git checkout .;                                  \
        echo 1 > /proc/sys/vm/drop_caches;               \
        find ../kernel* -type f | xargs cat >/dev/null;  \
        make -j kernel >/dev/null;                       \
        make clean >/dev/null 2>&1;                      \
        sync                                            '\
                                                         \
        make -j8 >/dev/null

     Performance counter stats for 'make -j8' (5 runs):

        219.764246652 seconds time elapsed                   ( +-  0.78% )

    w/ CONFIG_CC_OPTIMIZE_FOR_DEBUGGING
    $ perf stat --repeat 5 --null --pre                 '\
        cp -a kernel ../kernel.copy.$(date +%s);         \
        rm -rf *;                                        \
        git checkout .;                                  \
        echo 1 > /proc/sys/vm/drop_caches;               \
        find ../kernel* -type f | xargs cat >/dev/null;  \
        make -j kernel >/dev/null;                       \
        make clean >/dev/null 2>&1;                      \
        sync                                            '\
                                                         \
        make -j8 >/dev/null

    Performance counter stats for 'make -j8' (5 runs):

         233.574187771 seconds time elapsed                  ( +-  0.19% )


Du Changbin (4):
  x86/mm: surround level4_kernel_pgt with #ifdef
    CONFIG_X86_5LEVEL...#endif
  kernel hacking: new config NO_AUTO_INLINE to disable compiler
    auto-inline optimizations
  ARM: mm: fix build error in fix_to_virt with
    CONFIG_CC_OPTIMIZE_FOR_DEBUGGING
  kernel hacking: new config CC_OPTIMIZE_FOR_DEBUGGING to apply GCC -Og
    optimization

 Makefile                          | 11 +++++++++++
 arch/arm/mm/mmu.c                 |  2 +-
 arch/x86/include/asm/pgtable_64.h |  2 ++
 arch/x86/kernel/head64.c          | 13 ++++++-------
 include/linux/compiler-gcc.h      |  2 +-
 include/linux/compiler.h          |  2 +-
 init/Kconfig                      | 19 +++++++++++++++++++
 lib/Kconfig.debug                 | 17 +++++++++++++++++
 8 files changed, 58 insertions(+), 10 deletions(-)

-- 
2.17.1


^ permalink raw reply	[flat|nested] 14+ messages in thread

* [PATCH v2 1/4] x86/mm: surround level4_kernel_pgt with #ifdef CONFIG_X86_5LEVEL...#endif
  2018-10-19 12:49 [PATCH v2 0/4] kernel hacking: GCC optimization for better debug experience (-Og) Du Changbin
@ 2018-10-19 12:49 ` Du Changbin
  2018-10-28  1:44   ` Masahiro Yamada
  2018-10-19 12:49 ` [PATCH v2 2/4] kernel hacking: new config NO_AUTO_INLINE to disable compiler auto-inline optimizations Du Changbin
                   ` (2 subsequent siblings)
  3 siblings, 1 reply; 14+ messages in thread
From: Du Changbin @ 2018-10-19 12:49 UTC (permalink / raw)
  To: yamada.masahiro, michal.lkml, tglx, mingo, linux, akpm
  Cc: rostedt, x86, linux-kbuild, linux-kernel, linux-arm-kernel,
	linux-sparse, robin.murphy, Du Changbin

The level4_kernel_pgt is only defined when X86_5LEVEL is enabled. So
surround level4_kernel_pgt with #ifdef CONFIG_X86_5LEVEL...#endif to
make code correct.

Signed-off-by: Du Changbin <changbin.du@gmail.com>
Acked-by: Steven Rostedt (VMware) <rostedt@goodmis.org>
---
 arch/x86/include/asm/pgtable_64.h |  2 ++
 arch/x86/kernel/head64.c          | 13 ++++++-------
 2 files changed, 8 insertions(+), 7 deletions(-)

diff --git a/arch/x86/include/asm/pgtable_64.h b/arch/x86/include/asm/pgtable_64.h
index 9c85b54bf03c..9333f7fa5bdb 100644
--- a/arch/x86/include/asm/pgtable_64.h
+++ b/arch/x86/include/asm/pgtable_64.h
@@ -16,7 +16,9 @@
 #include <linux/threads.h>
 #include <asm/fixmap.h>
 
+#ifdef CONFIG_X86_5LEVEL
 extern p4d_t level4_kernel_pgt[512];
+#endif
 extern p4d_t level4_ident_pgt[512];
 extern pud_t level3_kernel_pgt[512];
 extern pud_t level3_ident_pgt[512];
diff --git a/arch/x86/kernel/head64.c b/arch/x86/kernel/head64.c
index ddee1f0870c4..4a59ef93c258 100644
--- a/arch/x86/kernel/head64.c
+++ b/arch/x86/kernel/head64.c
@@ -151,16 +151,15 @@ unsigned long __head __startup_64(unsigned long physaddr,
 
 	pgd = fixup_pointer(&early_top_pgt, physaddr);
 	p = pgd + pgd_index(__START_KERNEL_map);
-	if (la57)
-		*p = (unsigned long)level4_kernel_pgt;
-	else
-		*p = (unsigned long)level3_kernel_pgt;
-	*p += _PAGE_TABLE_NOENC - __START_KERNEL_map + load_delta;
-
+#ifdef CONFIG_X86_5LEVEL
 	if (la57) {
+		*p = (unsigned long)level4_kernel_pgt;
 		p4d = fixup_pointer(&level4_kernel_pgt, physaddr);
 		p4d[511] += load_delta;
-	}
+	} else
+#endif
+		*p = (unsigned long)level3_kernel_pgt;
+	*p += _PAGE_TABLE_NOENC - __START_KERNEL_map + load_delta;
 
 	pud = fixup_pointer(&level3_kernel_pgt, physaddr);
 	pud[510] += load_delta;
-- 
2.17.1


^ permalink raw reply related	[flat|nested] 14+ messages in thread

* [PATCH v2 2/4] kernel hacking: new config NO_AUTO_INLINE to disable compiler auto-inline optimizations
  2018-10-19 12:49 [PATCH v2 0/4] kernel hacking: GCC optimization for better debug experience (-Og) Du Changbin
  2018-10-19 12:49 ` [PATCH v2 1/4] x86/mm: surround level4_kernel_pgt with #ifdef CONFIG_X86_5LEVEL...#endif Du Changbin
@ 2018-10-19 12:49 ` Du Changbin
  2018-10-28 13:11   ` Masahiro Yamada
  2018-10-19 12:49 ` [PATCH v2 3/4] ARM: mm: fix build error in fix_to_virt with CONFIG_CC_OPTIMIZE_FOR_DEBUGGING Du Changbin
  2018-10-19 12:49 ` [PATCH v2 4/4] kernel hacking: new config CC_OPTIMIZE_FOR_DEBUGGING to apply GCC -Og optimization Du Changbin
  3 siblings, 1 reply; 14+ messages in thread
From: Du Changbin @ 2018-10-19 12:49 UTC (permalink / raw)
  To: yamada.masahiro, michal.lkml, tglx, mingo, linux, akpm
  Cc: rostedt, x86, linux-kbuild, linux-kernel, linux-arm-kernel,
	linux-sparse, robin.murphy, Du Changbin

This patch add a new kernel hacking option NO_AUTO_INLINE. Selecting
this option will prevent the compiler from optimizing the kernel by
auto-inlining functions not marked with the inline keyword.

With this option, only functions explicitly marked with "inline" will
be inlined. This will allow the function tracer to trace more functions
because it only traces functions that the compiler has not inlined.

Signed-off-by: Du Changbin <changbin.du@gmail.com>
Acked-by: Steven Rostedt (VMware) <rostedt@goodmis.org>
---
 Makefile          |  6 ++++++
 lib/Kconfig.debug | 17 +++++++++++++++++
 2 files changed, 23 insertions(+)

diff --git a/Makefile b/Makefile
index e8b599b4dcde..757d6507cb5c 100644
--- a/Makefile
+++ b/Makefile
@@ -749,6 +749,12 @@ KBUILD_CFLAGS 	+= $(call cc-option, -femit-struct-debug-baseonly) \
 		   $(call cc-option,-fno-var-tracking)
 endif
 
+ifdef CONFIG_NO_AUTO_INLINE
+KBUILD_CFLAGS   += $(call cc-option, -fno-inline-functions) \
+		   $(call cc-option, -fno-inline-small-functions) \
+		   $(call cc-option, -fno-inline-functions-called-once)
+endif
+
 ifdef CONFIG_FUNCTION_TRACER
 ifdef CONFIG_FTRACE_MCOUNT_RECORD
   # gcc 5 supports generating the mcount tables directly
diff --git a/lib/Kconfig.debug b/lib/Kconfig.debug
index 4966c4fbe7f7..c7c28ee01dfc 100644
--- a/lib/Kconfig.debug
+++ b/lib/Kconfig.debug
@@ -211,6 +211,23 @@ config GDB_SCRIPTS
 	  instance. See Documentation/dev-tools/gdb-kernel-debugging.rst
 	  for further details.
 
+config NO_AUTO_INLINE
+	bool "Disable compiler auto-inline optimizations"
+	help
+	  This will prevent the compiler from optimizing the kernel by
+	  auto-inlining functions not marked with the inline keyword.
+	  With this option, only functions explicitly marked with
+	  "inline" will be inlined. This will allow the function tracer
+	  to trace more functions because it only traces functions that
+	  the compiler has not inlined.
+
+	  Enabling this function can help debugging a kernel if using
+	  the function tracer. But it can also change how the kernel
+	  works, because inlining functions may change the timing,
+	  which could make it difficult while debugging race conditions.
+
+	  If unsure, select N.
+
 config ENABLE_MUST_CHECK
 	bool "Enable __must_check logic"
 	default y
-- 
2.17.1


^ permalink raw reply related	[flat|nested] 14+ messages in thread

* [PATCH v2 3/4] ARM: mm: fix build error in fix_to_virt with CONFIG_CC_OPTIMIZE_FOR_DEBUGGING
  2018-10-19 12:49 [PATCH v2 0/4] kernel hacking: GCC optimization for better debug experience (-Og) Du Changbin
  2018-10-19 12:49 ` [PATCH v2 1/4] x86/mm: surround level4_kernel_pgt with #ifdef CONFIG_X86_5LEVEL...#endif Du Changbin
  2018-10-19 12:49 ` [PATCH v2 2/4] kernel hacking: new config NO_AUTO_INLINE to disable compiler auto-inline optimizations Du Changbin
@ 2018-10-19 12:49 ` Du Changbin
  2018-10-19 12:49 ` [PATCH v2 4/4] kernel hacking: new config CC_OPTIMIZE_FOR_DEBUGGING to apply GCC -Og optimization Du Changbin
  3 siblings, 0 replies; 14+ messages in thread
From: Du Changbin @ 2018-10-19 12:49 UTC (permalink / raw)
  To: yamada.masahiro, michal.lkml, tglx, mingo, linux, akpm
  Cc: rostedt, x86, linux-kbuild, linux-kernel, linux-arm-kernel,
	linux-sparse, robin.murphy, Du Changbin

With '-Og' optimization level, GCC would not optimize a count for a loop
as a constant value. But BUILD_BUG_ON() only accept compile-time constant
values. Let's use __fix_to_virt() to avoid the error.

arch/arm/mm/mmu.o: In function `fix_to_virt':
/home/changbin/work/linux/./include/asm-generic/fixmap.h:31: undefined reference to `__compiletime_assert_31'
Makefile:1051: recipe for target 'vmlinux' failed
make: *** [vmlinux] Error 1

Signed-off-by: Du Changbin <changbin.du@gmail.com>
Acked-by: Steven Rostedt (VMware) <rostedt@goodmis.org>
---
 arch/arm/mm/mmu.c | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/arch/arm/mm/mmu.c b/arch/arm/mm/mmu.c
index e46a6a446cdd..c08d74e76714 100644
--- a/arch/arm/mm/mmu.c
+++ b/arch/arm/mm/mmu.c
@@ -1599,7 +1599,7 @@ static void __init early_fixmap_shutdown(void)
 		pte_t *pte;
 		struct map_desc map;
 
-		map.virtual = fix_to_virt(i);
+		map.virtual = __fix_to_virt(i);
 		pte = pte_offset_early_fixmap(pmd_off_k(map.virtual), map.virtual);
 
 		/* Only i/o device mappings are supported ATM */
-- 
2.17.1


^ permalink raw reply related	[flat|nested] 14+ messages in thread

* [PATCH v2 4/4] kernel hacking: new config CC_OPTIMIZE_FOR_DEBUGGING to apply GCC -Og optimization
  2018-10-19 12:49 [PATCH v2 0/4] kernel hacking: GCC optimization for better debug experience (-Og) Du Changbin
                   ` (2 preceding siblings ...)
  2018-10-19 12:49 ` [PATCH v2 3/4] ARM: mm: fix build error in fix_to_virt with CONFIG_CC_OPTIMIZE_FOR_DEBUGGING Du Changbin
@ 2018-10-19 12:49 ` Du Changbin
  2018-10-28  2:28   ` Masahiro Yamada
  3 siblings, 1 reply; 14+ messages in thread
From: Du Changbin @ 2018-10-19 12:49 UTC (permalink / raw)
  To: yamada.masahiro, michal.lkml, tglx, mingo, linux, akpm
  Cc: rostedt, x86, linux-kbuild, linux-kernel, linux-arm-kernel,
	linux-sparse, robin.murphy, Du Changbin

This will apply GCC '-Og' optimization level which is supported
since GCC 4.8. This optimization level offers a reasonable level
of optimization while maintaining fast compilation and a good
debugging experience. It is similar to '-O1' while perferring
to keep debug ability over runtime speed.

If enabling this option breaks your kernel, you should either
disable this or find a fix (mostly in the arch code). Currently
this option has only been tested on x86_64 and arm platform.

This option can satisfy people who was searching for a method
to disable compiler optimizations so to achieve better kernel
debugging experience with kgdb or qemu.

The main problem of '-Og' is we must not use __attribute__((error(msg))).
The compiler will report error though the call to error function
still can be optimize out. So we must fallback to array tricky.

Comparison of vmlinux size: a bit smaller.

    w/o CONFIG_CC_OPTIMIZE_FOR_DEBUGGING
    $ size vmlinux
       text    data     bss     dec     hex filename
    22665554   9709674  2920908 35296136        21a9388 vmlinux

    w/ CONFIG_CC_OPTIMIZE_FOR_DEBUGGING
    $ size vmlinux
       text    data     bss     dec     hex filename
    21499032   10102758 2920908 34522698        20ec64a vmlinux

Comparison of system performance: a bit drop (~6%).
    This benchmark of kernel compilation is suggested by Ingo Molnar.
    https://lkml.org/lkml/2018/5/2/74

    Preparation: Set cpufreq to 'performance'.
    for ((cpu=0; cpu<120; cpu++)); do
      G=/sys/devices/system/cpu/cpu$cpu/cpufreq/scaling_governor
      [ -f $G ] && echo performance > $G
    done

    w/o CONFIG_CC_OPTIMIZE_FOR_DEBUGGING
    $ perf stat --repeat 5 --null --pre                 '\
        cp -a kernel ../kernel.copy.$(date +%s);         \
        rm -rf *;                                        \
        git checkout .;                                  \
        echo 1 > /proc/sys/vm/drop_caches;               \
        find ../kernel* -type f | xargs cat >/dev/null;  \
        make -j kernel >/dev/null;                       \
        make clean >/dev/null 2>&1;                      \
        sync                                            '\
                                                         \
        make -j8 >/dev/null

    Performance counter stats for 'make -j8' (5 runs):

        219.764246652 seconds time elapsed                   ( +-  0.78% )

    w/ CONFIG_CC_OPTIMIZE_FOR_DEBUGGING
    $ perf stat --repeat 5 --null --pre                 '\
        cp -a kernel ../kernel.copy.$(date +%s);         \
        rm -rf *;                                        \
        git checkout .;                                  \
        echo 1 > /proc/sys/vm/drop_caches;               \
        find ../kernel* -type f | xargs cat >/dev/null;  \
        make -j kernel >/dev/null;                       \
        make clean >/dev/null 2>&1;                      \
        sync                                            '\
                                                         \
        make -j8 >/dev/null

    Performance counter stats for 'make -j8' (5 runs):

         233.574187771 seconds time elapsed                  ( +-  0.19% )

Signed-off-by: Du Changbin <changbin.du@gmail.com>
Acked-by: Steven Rostedt (VMware) <rostedt@goodmis.org>
---
 Makefile                     |  5 +++++
 include/linux/compiler-gcc.h |  2 +-
 include/linux/compiler.h     |  2 +-
 init/Kconfig                 | 19 +++++++++++++++++++
 4 files changed, 26 insertions(+), 2 deletions(-)

diff --git a/Makefile b/Makefile
index 757d6507cb5c..ea908cfe8594 100644
--- a/Makefile
+++ b/Makefile
@@ -657,6 +657,10 @@ KBUILD_CFLAGS	+= $(call cc-disable-warning, format-truncation)
 KBUILD_CFLAGS	+= $(call cc-disable-warning, format-overflow)
 KBUILD_CFLAGS	+= $(call cc-disable-warning, int-in-bool-context)
 
+ifdef CONFIG_CC_OPTIMIZE_FOR_DEBUGGING
+KBUILD_CFLAGS	+= $(call cc-option, -Og)
+KBUILD_CFLAGS	+= $(call cc-disable-warning,maybe-uninitialized,)
+else
 ifdef CONFIG_CC_OPTIMIZE_FOR_SIZE
 KBUILD_CFLAGS	+= $(call cc-option,-Oz,-Os)
 KBUILD_CFLAGS	+= $(call cc-disable-warning,maybe-uninitialized,)
@@ -667,6 +671,7 @@ else
 KBUILD_CFLAGS   += -O2
 endif
 endif
+endif
 
 KBUILD_CFLAGS += $(call cc-ifversion, -lt, 0409, \
 			$(call cc-disable-warning,maybe-uninitialized,))
diff --git a/include/linux/compiler-gcc.h b/include/linux/compiler-gcc.h
index 4d36b27214fd..2a76f7c64b54 100644
--- a/include/linux/compiler-gcc.h
+++ b/include/linux/compiler-gcc.h
@@ -85,7 +85,7 @@
 
 #define __compiletime_object_size(obj) __builtin_object_size(obj, 0)
 
-#ifndef __CHECKER__
+#if !defined(__CHECKER__) && !defined(CONFIG_CC_OPTIMIZE_FOR_DEBUGGING)
 #define __compiletime_warning(message) __attribute__((warning(message)))
 #define __compiletime_error(message) __attribute__((error(message)))
 
diff --git a/include/linux/compiler.h b/include/linux/compiler.h
index 681d866efb1e..9385c62e9f00 100644
--- a/include/linux/compiler.h
+++ b/include/linux/compiler.h
@@ -320,7 +320,7 @@ static inline void *offset_to_ptr(const int *off)
  * sparse see a constant array size without breaking compiletime_assert on old
  * versions of GCC (e.g. 4.2.4), so hide the array from sparse altogether.
  */
-# ifndef __CHECKER__
+# if !defined(__CHECKER__) && !defined(CONFIG_CC_OPTIMIZE_FOR_DEBUGGING)
 #  define __compiletime_error_fallback(condition) \
 	do { ((void)sizeof(char[1 - 2 * condition])); } while (0)
 # endif
diff --git a/init/Kconfig b/init/Kconfig
index 1e234e2f1cba..b01c7c654570 100644
--- a/init/Kconfig
+++ b/init/Kconfig
@@ -1081,6 +1081,25 @@ config CC_OPTIMIZE_FOR_SIZE
 
 	  If unsure, say N.
 
+config CC_OPTIMIZE_FOR_DEBUGGING
+	bool "Optimize for better debugging experience (-Og)"
+	select NO_AUTO_INLINE
+	help
+	  This will apply GCC '-Og' optimization level which is supported
+	  since GCC 4.8. This optimization level offers a reasonable level
+	  of optimization while maintaining fast compilation and a good
+	  debugging experience. It is similar to '-O1' while preferring to
+	  keep debug ability over runtime speed. The overall performance
+	  will drop a bit (~6%).
+
+	  Use only if you want to debug the kernel, especially if you want
+	  to have better kernel debugging experience with gdb facilities
+	  like kgdb or qemu. If enabling this option breaks your kernel,
+	  you should either disable this or find a fix (mostly in the arch
+	  code).
+
+	  If unsure, select N.
+
 endchoice
 
 config HAVE_LD_DEAD_CODE_DATA_ELIMINATION
-- 
2.17.1


^ permalink raw reply related	[flat|nested] 14+ messages in thread

* Re: [PATCH v2 1/4] x86/mm: surround level4_kernel_pgt with #ifdef CONFIG_X86_5LEVEL...#endif
  2018-10-19 12:49 ` [PATCH v2 1/4] x86/mm: surround level4_kernel_pgt with #ifdef CONFIG_X86_5LEVEL...#endif Du Changbin
@ 2018-10-28  1:44   ` Masahiro Yamada
  2018-10-28 12:46     ` Changbin Du
  0 siblings, 1 reply; 14+ messages in thread
From: Masahiro Yamada @ 2018-10-28  1:44 UTC (permalink / raw)
  To: changbin.du
  Cc: Michal Marek, Thomas Gleixner, Ingo Molnar, Russell King,
	Andrew Morton, Steven Rostedt, X86 ML, Linux Kbuild mailing list,
	Linux Kernel Mailing List, linux-arm-kernel, linux-sparse,
	Robin Murphy

On Fri, Oct 19, 2018 at 9:50 PM Du Changbin <changbin.du@gmail.com> wrote:
>
> The level4_kernel_pgt is only defined when X86_5LEVEL is enabled. So
> surround level4_kernel_pgt with #ifdef CONFIG_X86_5LEVEL...#endif to
> make code correct.

For clarification, is it better to mention
that this is a preparation for CONFIG_CC_OPTIMIZE_FOR_DEBUGGING ?


> Signed-off-by: Du Changbin <changbin.du@gmail.com>
> Acked-by: Steven Rostedt (VMware) <rostedt@goodmis.org>
> ---
>  arch/x86/include/asm/pgtable_64.h |  2 ++
>  arch/x86/kernel/head64.c          | 13 ++++++-------
>  2 files changed, 8 insertions(+), 7 deletions(-)
>
> diff --git a/arch/x86/include/asm/pgtable_64.h b/arch/x86/include/asm/pgtable_64.h
> index 9c85b54bf03c..9333f7fa5bdb 100644
> --- a/arch/x86/include/asm/pgtable_64.h
> +++ b/arch/x86/include/asm/pgtable_64.h
> @@ -16,7 +16,9 @@
>  #include <linux/threads.h>
>  #include <asm/fixmap.h>
>
> +#ifdef CONFIG_X86_5LEVEL
>  extern p4d_t level4_kernel_pgt[512];
> +#endif


Is this #ifdef  necessary?

It is harmless to declaring unused stuff.



>  extern p4d_t level4_ident_pgt[512];
>  extern pud_t level3_kernel_pgt[512];
>  extern pud_t level3_ident_pgt[512];
> diff --git a/arch/x86/kernel/head64.c b/arch/x86/kernel/head64.c
> index ddee1f0870c4..4a59ef93c258 100644
> --- a/arch/x86/kernel/head64.c
> +++ b/arch/x86/kernel/head64.c
> @@ -151,16 +151,15 @@ unsigned long __head __startup_64(unsigned long physaddr,
>
>         pgd = fixup_pointer(&early_top_pgt, physaddr);
>         p = pgd + pgd_index(__START_KERNEL_map);
> -       if (la57)
> -               *p = (unsigned long)level4_kernel_pgt;
> -       else
> -               *p = (unsigned long)level3_kernel_pgt;
> -       *p += _PAGE_TABLE_NOENC - __START_KERNEL_map + load_delta;
> -
> +#ifdef CONFIG_X86_5LEVEL
>         if (la57) {
> +               *p = (unsigned long)level4_kernel_pgt;
>                 p4d = fixup_pointer(&level4_kernel_pgt, physaddr);
>                 p4d[511] += load_delta;
> -       }
> +       } else
> +#endif
> +               *p = (unsigned long)level3_kernel_pgt;
> +       *p += _PAGE_TABLE_NOENC - __START_KERNEL_map + load_delta;
>
>         pud = fixup_pointer(&level3_kernel_pgt, physaddr);
>         pud[510] += load_delta;
> --
> 2.17.1
>


Hmm, this code looks a bit ugly...

Does the following one liner work with CONFIG_CC_OPTIMIZE_FOR_DEBUGGING ?


diff --git a/arch/x86/kernel/head64.c b/arch/x86/kernel/head64.c
index 8047379..579847f 100644
--- a/arch/x86/kernel/head64.c
+++ b/arch/x86/kernel/head64.c
@@ -97,7 +97,7 @@ static bool __head check_la57_support(unsigned long physaddr)
        return true;
 }
 #else
-static bool __head check_la57_support(unsigned long physaddr)
+static __always_inline bool __head check_la57_support(unsigned long physaddr)
 {
        return false;
 }




--
Best Regards
Masahiro Yamada

^ permalink raw reply related	[flat|nested] 14+ messages in thread

* Re: [PATCH v2 4/4] kernel hacking: new config CC_OPTIMIZE_FOR_DEBUGGING to apply GCC -Og optimization
  2018-10-19 12:49 ` [PATCH v2 4/4] kernel hacking: new config CC_OPTIMIZE_FOR_DEBUGGING to apply GCC -Og optimization Du Changbin
@ 2018-10-28  2:28   ` Masahiro Yamada
  2018-10-28 12:51     ` Changbin Du
  0 siblings, 1 reply; 14+ messages in thread
From: Masahiro Yamada @ 2018-10-28  2:28 UTC (permalink / raw)
  To: changbin.du
  Cc: Michal Marek, Thomas Gleixner, Ingo Molnar, Russell King,
	Andrew Morton, Steven Rostedt, X86 ML, Linux Kbuild mailing list,
	Linux Kernel Mailing List, linux-arm-kernel, linux-sparse,
	Robin Murphy

On Fri, Oct 19, 2018 at 9:50 PM Du Changbin <changbin.du@gmail.com> wrote:
>
> This will apply GCC '-Og' optimization level which is supported
> since GCC 4.8. This optimization level offers a reasonable level
> of optimization while maintaining fast compilation and a good
> debugging experience. It is similar to '-O1' while perferring
> to keep debug ability over runtime speed.
>
> If enabling this option breaks your kernel, you should either
> disable this or find a fix (mostly in the arch code). Currently
> this option has only been tested on x86_64 and arm platform.
>
> This option can satisfy people who was searching for a method
> to disable compiler optimizations so to achieve better kernel
> debugging experience with kgdb or qemu.
>
> The main problem of '-Og' is we must not use __attribute__((error(msg))).
> The compiler will report error though the call to error function
> still can be optimize out. So we must fallback to array tricky.
>
> Comparison of vmlinux size: a bit smaller.
>
>     w/o CONFIG_CC_OPTIMIZE_FOR_DEBUGGING
>     $ size vmlinux
>        text    data     bss     dec     hex filename
>     22665554   9709674  2920908 35296136        21a9388 vmlinux
>
>     w/ CONFIG_CC_OPTIMIZE_FOR_DEBUGGING
>     $ size vmlinux
>        text    data     bss     dec     hex filename
>     21499032   10102758 2920908 34522698        20ec64a vmlinux
>
> Comparison of system performance: a bit drop (~6%).
>     This benchmark of kernel compilation is suggested by Ingo Molnar.
>     https://lkml.org/lkml/2018/5/2/74
>
>     Preparation: Set cpufreq to 'performance'.
>     for ((cpu=0; cpu<120; cpu++)); do
>       G=/sys/devices/system/cpu/cpu$cpu/cpufreq/scaling_governor
>       [ -f $G ] && echo performance > $G
>     done
>
>     w/o CONFIG_CC_OPTIMIZE_FOR_DEBUGGING
>     $ perf stat --repeat 5 --null --pre                 '\
>         cp -a kernel ../kernel.copy.$(date +%s);         \
>         rm -rf *;                                        \
>         git checkout .;                                  \
>         echo 1 > /proc/sys/vm/drop_caches;               \
>         find ../kernel* -type f | xargs cat >/dev/null;  \
>         make -j kernel >/dev/null;                       \
>         make clean >/dev/null 2>&1;                      \
>         sync                                            '\
>                                                          \
>         make -j8 >/dev/null
>
>     Performance counter stats for 'make -j8' (5 runs):
>
>         219.764246652 seconds time elapsed                   ( +-  0.78% )
>
>     w/ CONFIG_CC_OPTIMIZE_FOR_DEBUGGING
>     $ perf stat --repeat 5 --null --pre                 '\
>         cp -a kernel ../kernel.copy.$(date +%s);         \
>         rm -rf *;                                        \
>         git checkout .;                                  \
>         echo 1 > /proc/sys/vm/drop_caches;               \
>         find ../kernel* -type f | xargs cat >/dev/null;  \
>         make -j kernel >/dev/null;                       \
>         make clean >/dev/null 2>&1;                      \
>         sync                                            '\
>                                                          \
>         make -j8 >/dev/null
>
>     Performance counter stats for 'make -j8' (5 runs):
>
>          233.574187771 seconds time elapsed                  ( +-  0.19% )
>
> Signed-off-by: Du Changbin <changbin.du@gmail.com>
> Acked-by: Steven Rostedt (VMware) <rostedt@goodmis.org>
> ---
>  Makefile                     |  5 +++++
>  include/linux/compiler-gcc.h |  2 +-
>  include/linux/compiler.h     |  2 +-
>  init/Kconfig                 | 19 +++++++++++++++++++
>  4 files changed, 26 insertions(+), 2 deletions(-)
>
> diff --git a/Makefile b/Makefile
> index 757d6507cb5c..ea908cfe8594 100644
> --- a/Makefile
> +++ b/Makefile
> @@ -657,6 +657,10 @@ KBUILD_CFLAGS      += $(call cc-disable-warning, format-truncation)
>  KBUILD_CFLAGS  += $(call cc-disable-warning, format-overflow)
>  KBUILD_CFLAGS  += $(call cc-disable-warning, int-in-bool-context)
>
> +ifdef CONFIG_CC_OPTIMIZE_FOR_DEBUGGING
> +KBUILD_CFLAGS  += $(call cc-option, -Og)


This line is a problem.

As you stated in your patch description, -Og is supported on GCC 4.8+
but the minimum GCC version supported for Linux kernel is GCC 4.6


If you use either GCC 4.6 or 4.7,
$(call cc-option, -Og) is evaluated to empty.

It means no optimization flag is given to the compiler, then
the build fails.

I'd like to suggest to change as follows:




diff --git a/Makefile b/Makefile
index 122ab74..9af3dc4 100644
--- a/Makefile
+++ b/Makefile
@@ -663,7 +663,7 @@ KBUILD_CFLAGS       += $(call cc-disable-warning,
format-overflow)
 KBUILD_CFLAGS  += $(call cc-disable-warning, int-in-bool-context)

 ifdef CONFIG_CC_OPTIMIZE_FOR_DEBUGGING
-KBUILD_CFLAGS  += $(call cc-option, -Og)
+KBUILD_CFLAGS  += -Og
 KBUILD_CFLAGS  += $(call cc-disable-warning,maybe-uninitialized,)
 else
 ifdef CONFIG_CC_OPTIMIZE_FOR_SIZE
diff --git a/init/Kconfig b/init/Kconfig
index b01c7c6..a0424dc 100644
--- a/init/Kconfig
+++ b/init/Kconfig
@@ -1083,6 +1083,7 @@ config CC_OPTIMIZE_FOR_SIZE

 config CC_OPTIMIZE_FOR_DEBUGGING
        bool "Optimize for better debugging experience (-Og)"
+       depends on $(cc-option,-Og)
        select NO_AUTO_INLINE
        help
          This will apply GCC '-Og' optimization level which is supported





If -Og is unsupported by the compiler, this is hidden from the Kconfig menu.



Also, in order to fix a warning from 'make tiny.config'
could you add the following change?



diff --git a/kernel/configs/tiny.config b/kernel/configs/tiny.config
index 7fa0c4a..599ea86 100644
--- a/kernel/configs/tiny.config
+++ b/kernel/configs/tiny.config
@@ -1,5 +1,6 @@
 # CONFIG_CC_OPTIMIZE_FOR_PERFORMANCE is not set
 CONFIG_CC_OPTIMIZE_FOR_SIZE=y
+# CONFIG_CC_OPTIMIZE_FOR_DEBUGGING is not set
 # CONFIG_KERNEL_GZIP is not set
 # CONFIG_KERNEL_BZIP2 is not set
 # CONFIG_KERNEL_LZMA is not set







> +KBUILD_CFLAGS  += $(call cc-disable-warning,maybe-uninitialized,)
> +else
>  ifdef CONFIG_CC_OPTIMIZE_FOR_SIZE
>  KBUILD_CFLAGS  += $(call cc-option,-Oz,-Os)
>  KBUILD_CFLAGS  += $(call cc-disable-warning,maybe-uninitialized,)
> @@ -667,6 +671,7 @@ else
>  KBUILD_CFLAGS   += -O2
>  endif
>  endif
> +endif
>
>  KBUILD_CFLAGS += $(call cc-ifversion, -lt, 0409, \
>                         $(call cc-disable-warning,maybe-uninitialized,))
> diff --git a/include/linux/compiler-gcc.h b/include/linux/compiler-gcc.h
> index 4d36b27214fd..2a76f7c64b54 100644
> --- a/include/linux/compiler-gcc.h
> +++ b/include/linux/compiler-gcc.h
> @@ -85,7 +85,7 @@
>
>  #define __compiletime_object_size(obj) __builtin_object_size(obj, 0)
>
> -#ifndef __CHECKER__
> +#if !defined(__CHECKER__) && !defined(CONFIG_CC_OPTIMIZE_FOR_DEBUGGING)
>  #define __compiletime_warning(message) __attribute__((warning(message)))
>  #define __compiletime_error(message) __attribute__((error(message)))
>
> diff --git a/include/linux/compiler.h b/include/linux/compiler.h
> index 681d866efb1e..9385c62e9f00 100644
> --- a/include/linux/compiler.h
> +++ b/include/linux/compiler.h
> @@ -320,7 +320,7 @@ static inline void *offset_to_ptr(const int *off)
>   * sparse see a constant array size without breaking compiletime_assert on old
>   * versions of GCC (e.g. 4.2.4), so hide the array from sparse altogether.
>   */
> -# ifndef __CHECKER__
> +# if !defined(__CHECKER__) && !defined(CONFIG_CC_OPTIMIZE_FOR_DEBUGGING)
>  #  define __compiletime_error_fallback(condition) \
>         do { ((void)sizeof(char[1 - 2 * condition])); } while (0)
>  # endif
> diff --git a/init/Kconfig b/init/Kconfig
> index 1e234e2f1cba..b01c7c654570 100644
> --- a/init/Kconfig
> +++ b/init/Kconfig
> @@ -1081,6 +1081,25 @@ config CC_OPTIMIZE_FOR_SIZE
>
>           If unsure, say N.
>
> +config CC_OPTIMIZE_FOR_DEBUGGING
> +       bool "Optimize for better debugging experience (-Og)"
> +       select NO_AUTO_INLINE
> +       help
> +         This will apply GCC '-Og' optimization level which is supported
> +         since GCC 4.8. This optimization level offers a reasonable level
> +         of optimization while maintaining fast compilation and a good
> +         debugging experience. It is similar to '-O1' while preferring to
> +         keep debug ability over runtime speed. The overall performance
> +         will drop a bit (~6%).
> +
> +         Use only if you want to debug the kernel, especially if you want
> +         to have better kernel debugging experience with gdb facilities
> +         like kgdb or qemu. If enabling this option breaks your kernel,
> +         you should either disable this or find a fix (mostly in the arch
> +         code).
> +
> +         If unsure, select N.
> +
>  endchoice
>
>  config HAVE_LD_DEAD_CODE_DATA_ELIMINATION
> --
> 2.17.1
>


-- 
Best Regards
Masahiro Yamada

^ permalink raw reply related	[flat|nested] 14+ messages in thread

* Re: [PATCH v2 1/4] x86/mm: surround level4_kernel_pgt with #ifdef CONFIG_X86_5LEVEL...#endif
  2018-10-28  1:44   ` Masahiro Yamada
@ 2018-10-28 12:46     ` Changbin Du
  0 siblings, 0 replies; 14+ messages in thread
From: Changbin Du @ 2018-10-28 12:46 UTC (permalink / raw)
  To: Masahiro Yamada
  Cc: changbin.du, Michal Marek, Thomas Gleixner, Ingo Molnar,
	Russell King, Andrew Morton, Steven Rostedt, X86 ML,
	Linux Kbuild mailing list, Linux Kernel Mailing List,
	linux-arm-kernel, linux-sparse, Robin Murphy

On Sun, Oct 28, 2018 at 10:44:46AM +0900, Masahiro Yamada wrote:
> On Fri, Oct 19, 2018 at 9:50 PM Du Changbin <changbin.du@gmail.com> wrote:
> >
> > The level4_kernel_pgt is only defined when X86_5LEVEL is enabled. So
> > surround level4_kernel_pgt with #ifdef CONFIG_X86_5LEVEL...#endif to
> > make code correct.
> 
> For clarification, is it better to mention
> that this is a preparation for CONFIG_CC_OPTIMIZE_FOR_DEBUGGING ?
>
yes, I will update the commit msg.

> 
> > Signed-off-by: Du Changbin <changbin.du@gmail.com>
> > Acked-by: Steven Rostedt (VMware) <rostedt@goodmis.org>
> > ---
> >  arch/x86/include/asm/pgtable_64.h |  2 ++
> >  arch/x86/kernel/head64.c          | 13 ++++++-------
> >  2 files changed, 8 insertions(+), 7 deletions(-)
> >
> > diff --git a/arch/x86/include/asm/pgtable_64.h b/arch/x86/include/asm/pgtable_64.h
> > index 9c85b54bf03c..9333f7fa5bdb 100644
> > --- a/arch/x86/include/asm/pgtable_64.h
> > +++ b/arch/x86/include/asm/pgtable_64.h
> > @@ -16,7 +16,9 @@
> >  #include <linux/threads.h>
> >  #include <asm/fixmap.h>
> >
> > +#ifdef CONFIG_X86_5LEVEL
> >  extern p4d_t level4_kernel_pgt[512];
> > +#endif
> 
> 
> Is this #ifdef  necessary?
> 
> It is harmless to declaring unused stuff.
> 
> 
> 
> >  extern p4d_t level4_ident_pgt[512];
> >  extern pud_t level3_kernel_pgt[512];
> >  extern pud_t level3_ident_pgt[512];
> > diff --git a/arch/x86/kernel/head64.c b/arch/x86/kernel/head64.c
> > index ddee1f0870c4..4a59ef93c258 100644
> > --- a/arch/x86/kernel/head64.c
> > +++ b/arch/x86/kernel/head64.c
> > @@ -151,16 +151,15 @@ unsigned long __head __startup_64(unsigned long physaddr,
> >
> >         pgd = fixup_pointer(&early_top_pgt, physaddr);
> >         p = pgd + pgd_index(__START_KERNEL_map);
> > -       if (la57)
> > -               *p = (unsigned long)level4_kernel_pgt;
> > -       else
> > -               *p = (unsigned long)level3_kernel_pgt;
> > -       *p += _PAGE_TABLE_NOENC - __START_KERNEL_map + load_delta;
> > -
> > +#ifdef CONFIG_X86_5LEVEL
> >         if (la57) {
> > +               *p = (unsigned long)level4_kernel_pgt;
> >                 p4d = fixup_pointer(&level4_kernel_pgt, physaddr);
> >                 p4d[511] += load_delta;
> > -       }
> > +       } else
> > +#endif
> > +               *p = (unsigned long)level3_kernel_pgt;
> > +       *p += _PAGE_TABLE_NOENC - __START_KERNEL_map + load_delta;
> >
> >         pud = fixup_pointer(&level3_kernel_pgt, physaddr);
> >         pud[510] += load_delta;
> > --
> > 2.17.1
> >
> 
> 
> Hmm, this code looks a bit ugly...
> 
> Does the following one liner work with CONFIG_CC_OPTIMIZE_FOR_DEBUGGING ?
> 
> 
> diff --git a/arch/x86/kernel/head64.c b/arch/x86/kernel/head64.c
> index 8047379..579847f 100644
> --- a/arch/x86/kernel/head64.c
> +++ b/arch/x86/kernel/head64.c
> @@ -97,7 +97,7 @@ static bool __head check_la57_support(unsigned long physaddr)
>         return true;
>  }
>  #else
> -static bool __head check_la57_support(unsigned long physaddr)
> +static __always_inline bool __head check_la57_support(unsigned long physaddr)
>  {
>         return false;
>  }
> 
This is much better. I just declared it as 'inline'. Thanks for your suggestion.

> 
> 
> 
> --
> Best Regards
> Masahiro Yamada

-- 
Thanks,
Changbin Du

^ permalink raw reply	[flat|nested] 14+ messages in thread

* Re: [PATCH v2 4/4] kernel hacking: new config CC_OPTIMIZE_FOR_DEBUGGING to apply GCC -Og optimization
  2018-10-28  2:28   ` Masahiro Yamada
@ 2018-10-28 12:51     ` Changbin Du
  2018-10-28 13:09       ` Masahiro Yamada
  0 siblings, 1 reply; 14+ messages in thread
From: Changbin Du @ 2018-10-28 12:51 UTC (permalink / raw)
  To: Masahiro Yamada
  Cc: changbin.du, Michal Marek, Thomas Gleixner, Ingo Molnar,
	Russell King, Andrew Morton, Steven Rostedt, X86 ML,
	Linux Kbuild mailing list, Linux Kernel Mailing List,
	linux-arm-kernel, linux-sparse, Robin Murphy

On Sun, Oct 28, 2018 at 11:28:24AM +0900, Masahiro Yamada wrote:
> On Fri, Oct 19, 2018 at 9:50 PM Du Changbin <changbin.du@gmail.com> wrote:
> >
> > This will apply GCC '-Og' optimization level which is supported
> > since GCC 4.8. This optimization level offers a reasonable level
> > of optimization while maintaining fast compilation and a good
> > debugging experience. It is similar to '-O1' while perferring
> > to keep debug ability over runtime speed.
> >
> > If enabling this option breaks your kernel, you should either
> > disable this or find a fix (mostly in the arch code). Currently
> > this option has only been tested on x86_64 and arm platform.
> >
> > This option can satisfy people who was searching for a method
> > to disable compiler optimizations so to achieve better kernel
> > debugging experience with kgdb or qemu.
> >
> > The main problem of '-Og' is we must not use __attribute__((error(msg))).
> > The compiler will report error though the call to error function
> > still can be optimize out. So we must fallback to array tricky.
> >
> > Comparison of vmlinux size: a bit smaller.
> >
> >     w/o CONFIG_CC_OPTIMIZE_FOR_DEBUGGING
> >     $ size vmlinux
> >        text    data     bss     dec     hex filename
> >     22665554   9709674  2920908 35296136        21a9388 vmlinux
> >
> >     w/ CONFIG_CC_OPTIMIZE_FOR_DEBUGGING
> >     $ size vmlinux
> >        text    data     bss     dec     hex filename
> >     21499032   10102758 2920908 34522698        20ec64a vmlinux
> >
> > Comparison of system performance: a bit drop (~6%).
> >     This benchmark of kernel compilation is suggested by Ingo Molnar.
> >     https://lkml.org/lkml/2018/5/2/74
> >
> >     Preparation: Set cpufreq to 'performance'.
> >     for ((cpu=0; cpu<120; cpu++)); do
> >       G=/sys/devices/system/cpu/cpu$cpu/cpufreq/scaling_governor
> >       [ -f $G ] && echo performance > $G
> >     done
> >
> >     w/o CONFIG_CC_OPTIMIZE_FOR_DEBUGGING
> >     $ perf stat --repeat 5 --null --pre                 '\
> >         cp -a kernel ../kernel.copy.$(date +%s);         \
> >         rm -rf *;                                        \
> >         git checkout .;                                  \
> >         echo 1 > /proc/sys/vm/drop_caches;               \
> >         find ../kernel* -type f | xargs cat >/dev/null;  \
> >         make -j kernel >/dev/null;                       \
> >         make clean >/dev/null 2>&1;                      \
> >         sync                                            '\
> >                                                          \
> >         make -j8 >/dev/null
> >
> >     Performance counter stats for 'make -j8' (5 runs):
> >
> >         219.764246652 seconds time elapsed                   ( +-  0.78% )
> >
> >     w/ CONFIG_CC_OPTIMIZE_FOR_DEBUGGING
> >     $ perf stat --repeat 5 --null --pre                 '\
> >         cp -a kernel ../kernel.copy.$(date +%s);         \
> >         rm -rf *;                                        \
> >         git checkout .;                                  \
> >         echo 1 > /proc/sys/vm/drop_caches;               \
> >         find ../kernel* -type f | xargs cat >/dev/null;  \
> >         make -j kernel >/dev/null;                       \
> >         make clean >/dev/null 2>&1;                      \
> >         sync                                            '\
> >                                                          \
> >         make -j8 >/dev/null
> >
> >     Performance counter stats for 'make -j8' (5 runs):
> >
> >          233.574187771 seconds time elapsed                  ( +-  0.19% )
> >
> > Signed-off-by: Du Changbin <changbin.du@gmail.com>
> > Acked-by: Steven Rostedt (VMware) <rostedt@goodmis.org>
> > ---
> >  Makefile                     |  5 +++++
> >  include/linux/compiler-gcc.h |  2 +-
> >  include/linux/compiler.h     |  2 +-
> >  init/Kconfig                 | 19 +++++++++++++++++++
> >  4 files changed, 26 insertions(+), 2 deletions(-)
> >
> > diff --git a/Makefile b/Makefile
> > index 757d6507cb5c..ea908cfe8594 100644
> > --- a/Makefile
> > +++ b/Makefile
> > @@ -657,6 +657,10 @@ KBUILD_CFLAGS      += $(call cc-disable-warning, format-truncation)
> >  KBUILD_CFLAGS  += $(call cc-disable-warning, format-overflow)
> >  KBUILD_CFLAGS  += $(call cc-disable-warning, int-in-bool-context)
> >
> > +ifdef CONFIG_CC_OPTIMIZE_FOR_DEBUGGING
> > +KBUILD_CFLAGS  += $(call cc-option, -Og)
> 
> 
> This line is a problem.
> 
> As you stated in your patch description, -Og is supported on GCC 4.8+
> but the minimum GCC version supported for Linux kernel is GCC 4.6
> 
> 
> If you use either GCC 4.6 or 4.7,
> $(call cc-option, -Og) is evaluated to empty.
> 
> It means no optimization flag is given to the compiler, then
> the build fails.
> 
> I'd like to suggest to change as follows:
> 
>
Thanks for pointing this out and kind suggestions. I have tested with your
modification. Will update patch serias soon.

> 
> 
> diff --git a/Makefile b/Makefile
> index 122ab74..9af3dc4 100644
> --- a/Makefile
> +++ b/Makefile
> @@ -663,7 +663,7 @@ KBUILD_CFLAGS       += $(call cc-disable-warning,
> format-overflow)
>  KBUILD_CFLAGS  += $(call cc-disable-warning, int-in-bool-context)
> 
>  ifdef CONFIG_CC_OPTIMIZE_FOR_DEBUGGING
> -KBUILD_CFLAGS  += $(call cc-option, -Og)
> +KBUILD_CFLAGS  += -Og
>  KBUILD_CFLAGS  += $(call cc-disable-warning,maybe-uninitialized,)
>  else
>  ifdef CONFIG_CC_OPTIMIZE_FOR_SIZE
> diff --git a/init/Kconfig b/init/Kconfig
> index b01c7c6..a0424dc 100644
> --- a/init/Kconfig
> +++ b/init/Kconfig
> @@ -1083,6 +1083,7 @@ config CC_OPTIMIZE_FOR_SIZE
> 
>  config CC_OPTIMIZE_FOR_DEBUGGING
>         bool "Optimize for better debugging experience (-Og)"
> +       depends on $(cc-option,-Og)
>         select NO_AUTO_INLINE
>         help
>           This will apply GCC '-Og' optimization level which is supported
> 
> 
> 
> 
> 
> If -Og is unsupported by the compiler, this is hidden from the Kconfig menu.
> 
> 
> 
> Also, in order to fix a warning from 'make tiny.config'
> could you add the following change?
> 
> 
> 
> diff --git a/kernel/configs/tiny.config b/kernel/configs/tiny.config
> index 7fa0c4a..599ea86 100644
> --- a/kernel/configs/tiny.config
> +++ b/kernel/configs/tiny.config
> @@ -1,5 +1,6 @@
>  # CONFIG_CC_OPTIMIZE_FOR_PERFORMANCE is not set
>  CONFIG_CC_OPTIMIZE_FOR_SIZE=y
> +# CONFIG_CC_OPTIMIZE_FOR_DEBUGGING is not set
>  # CONFIG_KERNEL_GZIP is not set
>  # CONFIG_KERNEL_BZIP2 is not set
>  # CONFIG_KERNEL_LZMA is not set
> 
> 
> 
> 
> 
> 
> 
> > +KBUILD_CFLAGS  += $(call cc-disable-warning,maybe-uninitialized,)
> > +else
> >  ifdef CONFIG_CC_OPTIMIZE_FOR_SIZE
> >  KBUILD_CFLAGS  += $(call cc-option,-Oz,-Os)
> >  KBUILD_CFLAGS  += $(call cc-disable-warning,maybe-uninitialized,)
> > @@ -667,6 +671,7 @@ else
> >  KBUILD_CFLAGS   += -O2
> >  endif
> >  endif
> > +endif
> >
> >  KBUILD_CFLAGS += $(call cc-ifversion, -lt, 0409, \
> >                         $(call cc-disable-warning,maybe-uninitialized,))
> > diff --git a/include/linux/compiler-gcc.h b/include/linux/compiler-gcc.h
> > index 4d36b27214fd..2a76f7c64b54 100644
> > --- a/include/linux/compiler-gcc.h
> > +++ b/include/linux/compiler-gcc.h
> > @@ -85,7 +85,7 @@
> >
> >  #define __compiletime_object_size(obj) __builtin_object_size(obj, 0)
> >
> > -#ifndef __CHECKER__
> > +#if !defined(__CHECKER__) && !defined(CONFIG_CC_OPTIMIZE_FOR_DEBUGGING)
> >  #define __compiletime_warning(message) __attribute__((warning(message)))
> >  #define __compiletime_error(message) __attribute__((error(message)))
> >
> > diff --git a/include/linux/compiler.h b/include/linux/compiler.h
> > index 681d866efb1e..9385c62e9f00 100644
> > --- a/include/linux/compiler.h
> > +++ b/include/linux/compiler.h
> > @@ -320,7 +320,7 @@ static inline void *offset_to_ptr(const int *off)
> >   * sparse see a constant array size without breaking compiletime_assert on old
> >   * versions of GCC (e.g. 4.2.4), so hide the array from sparse altogether.
> >   */
> > -# ifndef __CHECKER__
> > +# if !defined(__CHECKER__) && !defined(CONFIG_CC_OPTIMIZE_FOR_DEBUGGING)
> >  #  define __compiletime_error_fallback(condition) \
> >         do { ((void)sizeof(char[1 - 2 * condition])); } while (0)
> >  # endif
> > diff --git a/init/Kconfig b/init/Kconfig
> > index 1e234e2f1cba..b01c7c654570 100644
> > --- a/init/Kconfig
> > +++ b/init/Kconfig
> > @@ -1081,6 +1081,25 @@ config CC_OPTIMIZE_FOR_SIZE
> >
> >           If unsure, say N.
> >
> > +config CC_OPTIMIZE_FOR_DEBUGGING
> > +       bool "Optimize for better debugging experience (-Og)"
> > +       select NO_AUTO_INLINE
> > +       help
> > +         This will apply GCC '-Og' optimization level which is supported
> > +         since GCC 4.8. This optimization level offers a reasonable level
> > +         of optimization while maintaining fast compilation and a good
> > +         debugging experience. It is similar to '-O1' while preferring to
> > +         keep debug ability over runtime speed. The overall performance
> > +         will drop a bit (~6%).
> > +
> > +         Use only if you want to debug the kernel, especially if you want
> > +         to have better kernel debugging experience with gdb facilities
> > +         like kgdb or qemu. If enabling this option breaks your kernel,
> > +         you should either disable this or find a fix (mostly in the arch
> > +         code).
> > +
> > +         If unsure, select N.
> > +
> >  endchoice
> >
> >  config HAVE_LD_DEAD_CODE_DATA_ELIMINATION
> > --
> > 2.17.1
> >
> 
> 
> -- 
> Best Regards
> Masahiro Yamada

-- 
Thanks,
Changbin Du

^ permalink raw reply	[flat|nested] 14+ messages in thread

* Re: [PATCH v2 4/4] kernel hacking: new config CC_OPTIMIZE_FOR_DEBUGGING to apply GCC -Og optimization
  2018-10-28 12:51     ` Changbin Du
@ 2018-10-28 13:09       ` Masahiro Yamada
  2018-10-28 13:14         ` Changbin Du
  0 siblings, 1 reply; 14+ messages in thread
From: Masahiro Yamada @ 2018-10-28 13:09 UTC (permalink / raw)
  To: Changbin Du
  Cc: Michal Marek, Thomas Gleixner, Ingo Molnar, Russell King,
	Andrew Morton, Steven Rostedt, X86 ML, Linux Kbuild mailing list,
	Linux Kernel Mailing List, linux-arm-kernel, linux-sparse,
	Robin Murphy

Hi Changbin,

On Sun, Oct 28, 2018 at 9:52 PM Changbin Du <changbin.du@gmail.com> wrote:

> >
> Thanks for pointing this out and kind suggestions. I have tested with your
> modification. Will update patch serias soon.


I have one more request.

The patch subject
"kernel hacking: new config CC_OPTIMIZE_FOR_DEBUGGING to apply GCC -Og
optimization"
is too long (83 characters)


I'd like the subject to be within 70 columns or so.

Thanks.




-- 
Best Regards
Masahiro Yamada

^ permalink raw reply	[flat|nested] 14+ messages in thread

* Re: [PATCH v2 2/4] kernel hacking: new config NO_AUTO_INLINE to disable compiler auto-inline optimizations
  2018-10-19 12:49 ` [PATCH v2 2/4] kernel hacking: new config NO_AUTO_INLINE to disable compiler auto-inline optimizations Du Changbin
@ 2018-10-28 13:11   ` Masahiro Yamada
  0 siblings, 0 replies; 14+ messages in thread
From: Masahiro Yamada @ 2018-10-28 13:11 UTC (permalink / raw)
  To: Changbin Du
  Cc: Michal Marek, Thomas Gleixner, Ingo Molnar, Russell King,
	Andrew Morton, Steven Rostedt, X86 ML, Linux Kbuild mailing list,
	Linux Kernel Mailing List, linux-arm-kernel, linux-sparse,
	Robin Murphy

On Fri, Oct 19, 2018 at 9:50 PM Du Changbin <changbin.du@gmail.com> wrote:
>
> This patch add a new kernel hacking option NO_AUTO_INLINE. Selecting
> this option will prevent the compiler from optimizing the kernel by
> auto-inlining functions not marked with the inline keyword.
>
> With this option, only functions explicitly marked with "inline" will
> be inlined. This will allow the function tracer to trace more functions
> because it only traces functions that the compiler has not inlined.
>
> Signed-off-by: Du Changbin <changbin.du@gmail.com>
> Acked-by: Steven Rostedt (VMware) <rostedt@goodmis.org>


This subject
"kernel hacking: new config NO_AUTO_INLINE to disable compiler
auto-inline optimizations"
is also too long.

Could you please make it a bit shorter?


-- 
Best Regards
Masahiro Yamada

^ permalink raw reply	[flat|nested] 14+ messages in thread

* Re: [PATCH v2 4/4] kernel hacking: new config CC_OPTIMIZE_FOR_DEBUGGING to apply GCC -Og optimization
  2018-10-28 13:09       ` Masahiro Yamada
@ 2018-10-28 13:14         ` Changbin Du
  2018-10-29 13:01           ` Masahiro Yamada
  0 siblings, 1 reply; 14+ messages in thread
From: Changbin Du @ 2018-10-28 13:14 UTC (permalink / raw)
  To: Masahiro Yamada
  Cc: Changbin Du, Michal Marek, Thomas Gleixner, Ingo Molnar,
	Russell King, Andrew Morton, Steven Rostedt, X86 ML,
	Linux Kbuild mailing list, Linux Kernel Mailing List,
	linux-arm-kernel, linux-sparse, Robin Murphy

On Sun, Oct 28, 2018 at 10:09:21PM +0900, Masahiro Yamada wrote:
> Hi Changbin,
> 
> On Sun, Oct 28, 2018 at 9:52 PM Changbin Du <changbin.du@gmail.com> wrote:
> 
> > >
> > Thanks for pointing this out and kind suggestions. I have tested with your
> > modification. Will update patch serias soon.
> 
> 
> I have one more request.
> 
> The patch subject
> "kernel hacking: new config CC_OPTIMIZE_FOR_DEBUGGING to apply GCC -Og
> optimization"
> is too long (83 characters)
> 
> 
> I'd like the subject to be within 70 columns or so.
> 
oh, I just sent new version. Could me update the title if there are more
suggestions then?

> Thanks.
> 
> 
> 
> 
> -- 
> Best Regards
> Masahiro Yamada

-- 
Thanks,
Changbin Du

^ permalink raw reply	[flat|nested] 14+ messages in thread

* Re: [PATCH v2 4/4] kernel hacking: new config CC_OPTIMIZE_FOR_DEBUGGING to apply GCC -Og optimization
  2018-10-28 13:14         ` Changbin Du
@ 2018-10-29 13:01           ` Masahiro Yamada
  2018-10-29 14:56             ` Changbin Du
  0 siblings, 1 reply; 14+ messages in thread
From: Masahiro Yamada @ 2018-10-29 13:01 UTC (permalink / raw)
  To: Changbin Du
  Cc: Michal Marek, Thomas Gleixner, Ingo Molnar, Russell King,
	Andrew Morton, Steven Rostedt, X86 ML, Linux Kbuild mailing list,
	Linux Kernel Mailing List, linux-arm-kernel, linux-sparse,
	Robin Murphy

Hi.


On Sun, Oct 28, 2018 at 10:14 PM Changbin Du <changbin.du@gmail.com> wrote:
>
> On Sun, Oct 28, 2018 at 10:09:21PM +0900, Masahiro Yamada wrote:
> > Hi Changbin,
> >
> > On Sun, Oct 28, 2018 at 9:52 PM Changbin Du <changbin.du@gmail.com> wrote:
> >
> > > >
> > > Thanks for pointing this out and kind suggestions. I have tested with your
> > > modification. Will update patch serias soon.
> >
> >
> > I have one more request.
> >
> > The patch subject
> > "kernel hacking: new config CC_OPTIMIZE_FOR_DEBUGGING to apply GCC -Og
> > optimization"
> > is too long (83 characters)
> >
> >
> > I'd like the subject to be within 70 columns or so.
> >
> oh, I just sent new version. Could me update the title if there are more
> suggestions then?



How about this?

2/4: kernel hacking: add a config option to disable compiler auto-inlining
3/4: ARM: mm: fix build error in fix_to_virt with -Og optimization level
4/4: kernel hacking: support building kernel with -Og optimization level

(The CONFIG name is so long that the subject
would easily exceeds 80-columns with it.)


I applied this series to linux-kbuild,
and will send another pull request in this MW.


If you come up with better patch titles,
please let me know.
I will fix them up locally.



-- 
Best Regards
Masahiro Yamada

^ permalink raw reply	[flat|nested] 14+ messages in thread

* Re: [PATCH v2 4/4] kernel hacking: new config CC_OPTIMIZE_FOR_DEBUGGING to apply GCC -Og optimization
  2018-10-29 13:01           ` Masahiro Yamada
@ 2018-10-29 14:56             ` Changbin Du
  0 siblings, 0 replies; 14+ messages in thread
From: Changbin Du @ 2018-10-29 14:56 UTC (permalink / raw)
  To: Masahiro Yamada
  Cc: Changbin Du, Michal Marek, Thomas Gleixner, Ingo Molnar,
	Russell King, Andrew Morton, Steven Rostedt, X86 ML,
	Linux Kbuild mailing list, Linux Kernel Mailing List,
	linux-arm-kernel, linux-sparse, Robin Murphy

On Mon, Oct 29, 2018 at 10:01:44PM +0900, Masahiro Yamada wrote:
> Hi.
> 
> 
> On Sun, Oct 28, 2018 at 10:14 PM Changbin Du <changbin.du@gmail.com> wrote:
> >
> > On Sun, Oct 28, 2018 at 10:09:21PM +0900, Masahiro Yamada wrote:
> > > Hi Changbin,
> > >
> > > On Sun, Oct 28, 2018 at 9:52 PM Changbin Du <changbin.du@gmail.com> wrote:
> > >
> > > > >
> > > > Thanks for pointing this out and kind suggestions. I have tested with your
> > > > modification. Will update patch serias soon.
> > >
> > >
> > > I have one more request.
> > >
> > > The patch subject
> > > "kernel hacking: new config CC_OPTIMIZE_FOR_DEBUGGING to apply GCC -Og
> > > optimization"
> > > is too long (83 characters)
> > >
> > >
> > > I'd like the subject to be within 70 columns or so.
> > >
> > oh, I just sent new version. Could me update the title if there are more
> > suggestions then?
> 
> 
> 
> How about this?
> 
> 2/4: kernel hacking: add a config option to disable compiler auto-inlining
> 3/4: ARM: mm: fix build error in fix_to_virt with -Og optimization level
> 4/4: kernel hacking: support building kernel with -Og optimization level
> 
> (The CONFIG name is so long that the subject
> would easily exceeds 80-columns with it.)
> 
> 
> I applied this series to linux-kbuild,
> and will send another pull request in this MW.
> 
> 
> If you come up with better patch titles,
> please let me know.
> I will fix them up locally.
>
I am just fine with these new titles. Thanks.

> 
> 
> -- 
> Best Regards
> Masahiro Yamada

-- 
Thanks,
Changbin Du

^ permalink raw reply	[flat|nested] 14+ messages in thread

end of thread, other threads:[~2018-10-29 14:57 UTC | newest]

Thread overview: 14+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2018-10-19 12:49 [PATCH v2 0/4] kernel hacking: GCC optimization for better debug experience (-Og) Du Changbin
2018-10-19 12:49 ` [PATCH v2 1/4] x86/mm: surround level4_kernel_pgt with #ifdef CONFIG_X86_5LEVEL...#endif Du Changbin
2018-10-28  1:44   ` Masahiro Yamada
2018-10-28 12:46     ` Changbin Du
2018-10-19 12:49 ` [PATCH v2 2/4] kernel hacking: new config NO_AUTO_INLINE to disable compiler auto-inline optimizations Du Changbin
2018-10-28 13:11   ` Masahiro Yamada
2018-10-19 12:49 ` [PATCH v2 3/4] ARM: mm: fix build error in fix_to_virt with CONFIG_CC_OPTIMIZE_FOR_DEBUGGING Du Changbin
2018-10-19 12:49 ` [PATCH v2 4/4] kernel hacking: new config CC_OPTIMIZE_FOR_DEBUGGING to apply GCC -Og optimization Du Changbin
2018-10-28  2:28   ` Masahiro Yamada
2018-10-28 12:51     ` Changbin Du
2018-10-28 13:09       ` Masahiro Yamada
2018-10-28 13:14         ` Changbin Du
2018-10-29 13:01           ` Masahiro Yamada
2018-10-29 14:56             ` Changbin Du

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).