All of lore.kernel.org
 help / color / mirror / Atom feed
* [PATCH 1/2] lockdep: Introduce CROSSRELEASE_STACK_TRACE and make it not unwind as default
@ 2017-10-18  9:13 ` Byungchul Park
  0 siblings, 0 replies; 44+ messages in thread
From: Byungchul Park @ 2017-10-18  9:13 UTC (permalink / raw)
  To: peterz, mingo; +Cc: tglx, linux-kernel, linux-mm, kernel-team

Johan Hovold reported a performance regression by crossrelease like:

> Boot time (from "Linux version" to login prompt) had in fact doubled
> since 4.13 where it took 17 seconds (with my current config) compared to
> the 35 seconds I now see with 4.14-rc4.
>
> I quick bisect pointed to lockdep and specifically the following commit:
>
> 	28a903f63ec0 ("locking/lockdep: Handle non(or multi)-acquisition
> 	               of a crosslock")
>
> which I've verified is the commit which doubled the boot time (compared
> to 28a903f63ec0^) (added by lockdep crossrelease series [1]).

Currently crossrelease performs unwind on every acquisition. But, that
overloads systems too much. So this patch makes unwind optional and set
it to N as default. Instead, it records only acquire_ip normally. Of
course, unwind is sometimes required for full analysis. In that case, we
can set CROSSRELEASE_STACK_TRACE to Y and use it.

In my qemu ubuntu machin (x86_64, 4 cores, 512M), the regression was
fixed like, measuring timestamp of "Freeing unused kernel memory":

1. No lockdep enabled
   Average : 1.543353 secs

2. Lockdep enabled
   Average : 1.570806 secs

3. Lockdep enabled + crossrelease enabled
   Average : 1.870317 secs

4. Lockdep enabled + crossrelease enabled + this patch applied
   Average : 1.574143 secs

Signed-off-by: Byungchul Park <byungchul.park@lge.com>
---
 include/linux/lockdep.h  |  4 ++++
 kernel/locking/lockdep.c |  5 +++++
 lib/Kconfig.debug        | 15 +++++++++++++++
 3 files changed, 24 insertions(+)

diff --git a/include/linux/lockdep.h b/include/linux/lockdep.h
index bfa8e0b..70358b5 100644
--- a/include/linux/lockdep.h
+++ b/include/linux/lockdep.h
@@ -278,7 +278,11 @@ struct held_lock {
 };
 
 #ifdef CONFIG_LOCKDEP_CROSSRELEASE
+#ifdef CONFIG_CROSSRELEASE_STACK_TRACE
 #define MAX_XHLOCK_TRACE_ENTRIES 5
+#else
+#define MAX_XHLOCK_TRACE_ENTRIES 1
+#endif
 
 /*
  * This is for keeping locks waiting for commit so that true dependencies
diff --git a/kernel/locking/lockdep.c b/kernel/locking/lockdep.c
index e36e652..5c2ddf2 100644
--- a/kernel/locking/lockdep.c
+++ b/kernel/locking/lockdep.c
@@ -4863,8 +4863,13 @@ static void add_xhlock(struct held_lock *hlock)
 	xhlock->trace.nr_entries = 0;
 	xhlock->trace.max_entries = MAX_XHLOCK_TRACE_ENTRIES;
 	xhlock->trace.entries = xhlock->trace_entries;
+#ifdef CONFIG_CROSSRELEASE_STACK_TRACE
 	xhlock->trace.skip = 3;
 	save_stack_trace(&xhlock->trace);
+#else
+	xhlock->trace.nr_entries = 1;
+	xhlock->trace.entries[0] = hlock->acquire_ip;
+#endif
 }
 
 static inline int same_context_xhlock(struct hist_lock *xhlock)
diff --git a/lib/Kconfig.debug b/lib/Kconfig.debug
index 3db9167..5be7bdd 100644
--- a/lib/Kconfig.debug
+++ b/lib/Kconfig.debug
@@ -1225,6 +1225,21 @@ config LOCKDEP_COMPLETIONS
 	 A deadlock caused by wait_for_completion() and complete() can be
 	 detected by lockdep using crossrelease feature.
 
+config CROSSRELEASE_STACK_TRACE
+	bool "Record more than one entity of stack trace in crossrelease"
+	depends on LOCKDEP_CROSSRELEASE
+	default n
+	help
+	 Crossrelease feature needs to record stack traces for all
+	 acquisitions for later use. And only acquire_ip is normally
+	 recorded because the unwind operation is too expensive. However,
+	 sometimes more than acquire_ip are required for full analysis.
+	 In the case that we need to record more than one entity of
+	 stack trace using unwind, this feature would be useful, with
+	 taking more overhead.
+
+	 If unsure, say N.
+
 config DEBUG_LOCKDEP
 	bool "Lock dependency engine debugging"
 	depends on DEBUG_KERNEL && LOCKDEP
-- 
1.9.1

^ permalink raw reply related	[flat|nested] 44+ messages in thread

end of thread, other threads:[~2017-10-19  9:41 UTC | newest]

Thread overview: 44+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2017-10-18  9:13 [PATCH 1/2] lockdep: Introduce CROSSRELEASE_STACK_TRACE and make it not unwind as default Byungchul Park
2017-10-18  9:13 ` Byungchul Park
2017-10-18  9:13 ` [PATCH 2/2] lockdep: Remove BROKEN flag of LOCKDEP_CROSSRELEASE Byungchul Park
2017-10-18  9:13   ` Byungchul Park
2017-10-18 10:12   ` Ingo Molnar
2017-10-18 10:12     ` Ingo Molnar
2017-10-19  1:58     ` Byungchul Park
2017-10-19  1:58       ` Byungchul Park
2017-10-18 10:09 ` [PATCH 1/2] lockdep: Introduce CROSSRELEASE_STACK_TRACE and make it not unwind as default Ingo Molnar
2017-10-18 10:09   ` Ingo Molnar
2017-10-19  4:32   ` Byungchul Park
2017-10-19  4:32     ` Byungchul Park
2017-10-19  5:57     ` Ingo Molnar
2017-10-19  5:57       ` Ingo Molnar
2017-10-19  6:11       ` Byungchul Park
2017-10-19  6:11         ` Byungchul Park
2017-10-19  6:22         ` Ingo Molnar
2017-10-19  6:22           ` Ingo Molnar
2017-10-19  6:36           ` Byungchul Park
2017-10-19  6:36             ` Byungchul Park
2017-10-19  8:05             ` Ingo Molnar
2017-10-19  8:05               ` Ingo Molnar
2017-10-19  6:22         ` Byungchul Park
2017-10-19  6:22           ` Byungchul Park
2017-10-19  8:10           ` Ingo Molnar
2017-10-19  8:10             ` Ingo Molnar
2017-10-19  9:02             ` 박병철/선임연구원/SW Platform(연)AOT팀(byungchul.park@lge.com)
2017-10-19  9:02               ` 박병철/선임연구원/SW Platform(연)AOT팀(byungchul.park@lge.com)
2017-10-19  9:41               ` Ingo Molnar
2017-10-19  9:41                 ` Ingo Molnar
2017-10-18 13:23 ` Thomas Gleixner
2017-10-18 13:23   ` Thomas Gleixner
2017-10-18 13:30   ` Ingo Molnar
2017-10-18 13:30     ` Ingo Molnar
2017-10-18 13:36     ` Thomas Gleixner
2017-10-18 13:36       ` Thomas Gleixner
2017-10-18 14:15       ` Matthew Wilcox
2017-10-18 14:15         ` Matthew Wilcox
2017-10-18 14:35         ` Thomas Gleixner
2017-10-18 14:35           ` Thomas Gleixner
2017-10-18 17:05           ` Ingo Molnar
2017-10-18 17:05             ` Ingo Molnar
2017-10-19  2:00       ` Byungchul Park
2017-10-19  2:00         ` Byungchul Park

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.