linux-kernel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* [BUG] function_graph trace causes  hang when using sleepgraph (4.15.0-rc1 and newer)
@ 2018-01-08 23:55 Todd Brandt
  2018-01-09  1:02 ` Todd Brandt
  0 siblings, 1 reply; 8+ messages in thread
From: Todd Brandt @ 2018-01-08 23:55 UTC (permalink / raw)
  To: linux-kernel, rostedt, linux-pm; +Cc: len.brown, todd.e.brandt

I've discovered an issue in the kernel which is causing sleepgraph
(analyse_suspend) to hang the system when it tries to get a callgraph
trace over suspend/resume. It happens on kernels 4.15.0-rc1 and newer.

I've whittled it down to 3 items which have to be set for the hang to
occur:

1) enable function_graph trace
2) enable global clock
3) enable trace points device_pm_callback_start or
device_pm_callback_end

These things all occur when sleepgraph is run with -f. It needs to
collect a function graph trace with global clock in tandem with the the
device_pm_callback trace events. 

This script reproduces the error:

echo "global" > /sys/kernel/debug/tracing/trace_clock
echo "function_graph" > /sys/kernel/debug/tracing/current_tracer
echo 1 >
/sys/kernel/debug/tracing/events/power/device_pm_callback_start/enable
echo 1 >
/sys/kernel/debug/tracing/events/power/device_pm_callback_end/enable
echo 1 >  /sys/kernel/debug/tracing/tracing_on
echo "mem" > /sys/power/state
# hang occurs here: system has to be rebooted
echo 0 >  /sys/kernel/debug/tracing/tracing_on

I performed a bisect between 4.14.0 and 4.15.0-rc1 and it landed on an
ftrace patch that was merged in for 4.15.0-rc1. I'm still debugging it,
but I just wanted to get this out there ASAP so it can be hopefully be
fixed before 4.15.0. I'm not sure if the issue is in ftrace or in the
pm subsystem device_pm_callback code (though nothing has changed in the
latter).

-- [BISECT LOG START] --

git bisect start
# bad: [4fbd8d194f06c8a3fd2af1ce560ddb31f7ec8323] Linux 4.15-rc1
git bisect bad 4fbd8d194f06c8a3fd2af1ce560ddb31f7ec8323
# good: [bebc6082da0a9f5d47a1ea2edc099bf671058bd4] Linux 4.14
git bisect good bebc6082da0a9f5d47a1ea2edc099bf671058bd4
# good: [1be2172e96e33bfa22a5c7a651f768ef30ce3984] Merge tag 'modules-
for-v4.15' of git://git.kernel.org/pub/scm/linux/kernel/git/jeyu/linux
git bisect good 1be2172e96e33bfa22a5c7a651f768ef30ce3984
# good: [8c609698569578913ad40bb160b97c3f6cfa15ec] Merge tag 'armsoc-
soc' of git://git.kernel.org/pub/scm/linux/kernel/git/arm/arm-soc
git bisect good 8c609698569578913ad40bb160b97c3f6cfa15ec
# good: [f6705bf959efac87bca76d40050d342f1d212587] Merge tag 'drm-for-
v4.15-amd-dc' of git://people.freedesktop.org/~airlied/linux
git bisect good f6705bf959efac87bca76d40050d342f1d212587
# bad: [c633e898bde8990a34907c91b7d5245cab866c6e] Merge tag
'devicetree-fixes-for-4.15' of
git://git.kernel.org/pub/scm/linux/kernel/git/robh/linux
git bisect bad c633e898bde8990a34907c91b7d5245cab866c6e
# bad: [fc35c1966e1372a21a88f6655279361e2f92713f] Merge tag 'clk-for-
linus' of git://git.kernel.org/pub/scm/linux/kernel/git/clk/linux
git bisect bad fc35c1966e1372a21a88f6655279361e2f92713f
# bad: [09bd7c75e55cbaa6c731b0c3a5512ad89159f26f] Merge tag 'kbuild-
v4.15' of
git://git.kernel.org/pub/scm/linux/kernel/git/masahiroy/linux-kbuild
git bisect bad 09bd7c75e55cbaa6c731b0c3a5512ad89159f26f
# good: [5eb9e8ac9a8f8ae98ae4386357683a0b5684bb48]
arch/ia64/include/asm/topology.h: remove unused parent_node() macro
git bisect good 5eb9e8ac9a8f8ae98ae4386357683a0b5684bb48
# bad: [2dcd9c71c1ffa9a036e09047f60e08383bb0abb6] Merge tag 'trace-
v4.15' of git://git.kernel.org/pub/scm/linux/kernel/git/rostedt/linux-
trace
git bisect bad 2dcd9c71c1ffa9a036e09047f60e08383bb0abb6
# bad: [466c81c45b650deca213fda3d0ec4761667379a9] perf/ftrace: Fix
function trace events
git bisect bad 466c81c45b650deca213fda3d0ec4761667379a9
# bad: [3e234289f86b12985ef8909cd34525fcb66c4efb] ftrace: Allow module
init functions to be traced
git bisect bad 3e234289f86b12985ef8909cd34525fcb66c4efb
# bad: [83c07ecc4203728e85fc4a2ce6fdf25d16ea118e] tracing: Remove
lookups from tracing_map hitcount
git bisect bad 83c07ecc4203728e85fc4a2ce6fdf25d16ea118e
# bad: [12ecef0cb12102d8c034770173d2d1363cb97d52] tracing: Reverse the
order of trace_types_lock and event_mutex
git bisect bad 12ecef0cb12102d8c034770173d2d1363cb97d52
# good: [6e7a2398114a2597a0995f96f44d908741ae5035] tracing: Remove
redundant unread variable ret
git bisect good 6e7a2398114a2597a0995f96f44d908741ae5035
# first bad commit: [12ecef0cb12102d8c034770173d2d1363cb97d52] tracing:
Reverse the order of trace_types_lock and event_mutex

-- [BISECT LOG END] --

This is the commit it landed on.

12ecef0cb12102d8c034770173d2d1363cb97d52 is the first bad commit
commit 12ecef0cb12102d8c034770173d2d1363cb97d52
Author: Steven Rostedt (VMware) <rostedt@goodmis.org>
Date:   Thu Sep 21 16:22:49 2017 -0400

    tracing: Reverse the order of trace_types_lock and event_mutex
    
    In order to make future changes where we need to call
    tracing_set_clock() from within an event command, the order of
    trace_types_lock and event_mutex must be reversed, as the event
command
    will hold event_mutex and the trace_types_lock is taken from within
    tracing_set_clock().
    
    Link: http://lkml.kernel.org/r/20170921162249.0dde3dca@gandalf.loca
l.home
    
    Requested-by: Tom Zanussi <tom.zanussi@linux.intel.com>
    Signed-off-by: Steven Rostedt (VMware) <rostedt@goodmis.org>

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [BUG] function_graph trace causes  hang when using sleepgraph (4.15.0-rc1 and newer)
  2018-01-08 23:55 [BUG] function_graph trace causes hang when using sleepgraph (4.15.0-rc1 and newer) Todd Brandt
@ 2018-01-09  1:02 ` Todd Brandt
  2018-01-09  1:07   ` Steven Rostedt
  0 siblings, 1 reply; 8+ messages in thread
From: Todd Brandt @ 2018-01-09  1:02 UTC (permalink / raw)
  To: linux-kernel, rostedt, linux-pm; +Cc: len.brown, todd.e.brandt

Stephen, the problem is reversed by removing the following two commits,
the one the bisect showed and the very next. So the problem is here:

commit 1a149d7d3f45d311da1f63473736c05f30ae8a75
Author: Steven Rostedt (VMware) <rostedt@goodmis.org>
Date:   Fri Sep 22 16:59:02 2017 -0400

    ring-buffer: Rewrite trace_recursive_(un)lock() to be simpler

    The current method to prevent the ring buffer from entering into a
recursize
    loop is to use a bitmask and set the bit that maps to the current
context
    (normal, softirq, irq or NMI), and if that bit was already set, it
is
    considered a recursive loop.

    New code is being added that may require the ring buffer to be
entered a
    second time in the current context. The recursive locking prevents
that from
    happening. Instead of mapping a bitmask to the current context,
just allow 4
    levels of nesting in the ring buffer. This matches the 4 context
levels that
    it can already nest. It is highly unlikely to have more than two
levels,
    thus it should be fine when we add the second entry into the ring
buffer. If
    that proves to be a problem, we can always up the number to 8.

    An added benefit is that reading preempt_count() to get the current
level
    adds a very slight but noticeable overhead. This removes that need.

    Signed-off-by: Steven Rostedt (VMware) <rostedt@goodmis.org>

commit 12ecef0cb12102d8c034770173d2d1363cb97d52
Author: Steven Rostedt (VMware) <rostedt@goodmis.org>
Date:   Thu Sep 21 16:22:49 2017 -0400

    tracing: Reverse the order of trace_types_lock and event_mutex

    In order to make future changes where we need to call
    tracing_set_clock() from within an event command, the order of
    trace_types_lock and event_mutex must be reversed, as the event
command
    will hold event_mutex and the trace_types_lock is taken from within
    tracing_set_clock().

    Link: http://lkml.kernel.org/r/20170921162249.0dde3dca@gandalf.loca
l.home

    Requested-by: Tom Zanussi <tom.zanussi@linux.intel.com>
    Signed-off-by: Steven Rostedt (VMware) <rostedt@goodmis.org>


On Mon, 2018-01-08 at 15:55 -0800, Todd Brandt wrote:
> I've discovered an issue in the kernel which is causing sleepgraph
> (analyse_suspend) to hang the system when it tries to get a callgraph
> trace over suspend/resume. It happens on kernels 4.15.0-rc1 and
> newer.
> 
> I've whittled it down to 3 items which have to be set for the hang to
> occur:
> 
> 1) enable function_graph trace
> 2) enable global clock
> 3) enable trace points device_pm_callback_start or
> device_pm_callback_end
> 
> These things all occur when sleepgraph is run with -f. It needs to
> collect a function graph trace with global clock in tandem with the
> the
> device_pm_callback trace events. 
> 
> This script reproduces the error:
> 
> echo "global" > /sys/kernel/debug/tracing/trace_clock
> echo "function_graph" > /sys/kernel/debug/tracing/current_tracer
> echo 1 >
> /sys/kernel/debug/tracing/events/power/device_pm_callback_start/enabl
> e
> echo 1 >
> /sys/kernel/debug/tracing/events/power/device_pm_callback_end/enable
> echo 1 >  /sys/kernel/debug/tracing/tracing_on
> echo "mem" > /sys/power/state
> # hang occurs here: system has to be rebooted
> echo 0 >  /sys/kernel/debug/tracing/tracing_on
> 
> I performed a bisect between 4.14.0 and 4.15.0-rc1 and it landed on
> an
> ftrace patch that was merged in for 4.15.0-rc1. I'm still debugging
> it,
> but I just wanted to get this out there ASAP so it can be hopefully
> be
> fixed before 4.15.0. I'm not sure if the issue is in ftrace or in the
> pm subsystem device_pm_callback code (though nothing has changed in
> the
> latter).
> 
> -- [BISECT LOG START] --
> 
> git bisect start
> # bad: [4fbd8d194f06c8a3fd2af1ce560ddb31f7ec8323] Linux 4.15-rc1
> git bisect bad 4fbd8d194f06c8a3fd2af1ce560ddb31f7ec8323
> # good: [bebc6082da0a9f5d47a1ea2edc099bf671058bd4] Linux 4.14
> git bisect good bebc6082da0a9f5d47a1ea2edc099bf671058bd4
> # good: [1be2172e96e33bfa22a5c7a651f768ef30ce3984] Merge tag
> 'modules-
> for-v4.15' of
> git://git.kernel.org/pub/scm/linux/kernel/git/jeyu/linux
> git bisect good 1be2172e96e33bfa22a5c7a651f768ef30ce3984
> # good: [8c609698569578913ad40bb160b97c3f6cfa15ec] Merge tag 'armsoc-
> soc' of git://git.kernel.org/pub/scm/linux/kernel/git/arm/arm-soc
> git bisect good 8c609698569578913ad40bb160b97c3f6cfa15ec
> # good: [f6705bf959efac87bca76d40050d342f1d212587] Merge tag 'drm-
> for-
> v4.15-amd-dc' of git://people.freedesktop.org/~airlied/linux
> git bisect good f6705bf959efac87bca76d40050d342f1d212587
> # bad: [c633e898bde8990a34907c91b7d5245cab866c6e] Merge tag
> 'devicetree-fixes-for-4.15' of
> git://git.kernel.org/pub/scm/linux/kernel/git/robh/linux
> git bisect bad c633e898bde8990a34907c91b7d5245cab866c6e
> # bad: [fc35c1966e1372a21a88f6655279361e2f92713f] Merge tag 'clk-for-
> linus' of git://git.kernel.org/pub/scm/linux/kernel/git/clk/linux
> git bisect bad fc35c1966e1372a21a88f6655279361e2f92713f
> # bad: [09bd7c75e55cbaa6c731b0c3a5512ad89159f26f] Merge tag 'kbuild-
> v4.15' of
> git://git.kernel.org/pub/scm/linux/kernel/git/masahiroy/linux-kbuild
> git bisect bad 09bd7c75e55cbaa6c731b0c3a5512ad89159f26f
> # good: [5eb9e8ac9a8f8ae98ae4386357683a0b5684bb48]
> arch/ia64/include/asm/topology.h: remove unused parent_node() macro
> git bisect good 5eb9e8ac9a8f8ae98ae4386357683a0b5684bb48
> # bad: [2dcd9c71c1ffa9a036e09047f60e08383bb0abb6] Merge tag 'trace-
> v4.15' of
> git://git.kernel.org/pub/scm/linux/kernel/git/rostedt/linux-
> trace
> git bisect bad 2dcd9c71c1ffa9a036e09047f60e08383bb0abb6
> # bad: [466c81c45b650deca213fda3d0ec4761667379a9] perf/ftrace: Fix
> function trace events
> git bisect bad 466c81c45b650deca213fda3d0ec4761667379a9
> # bad: [3e234289f86b12985ef8909cd34525fcb66c4efb] ftrace: Allow
> module
> init functions to be traced
> git bisect bad 3e234289f86b12985ef8909cd34525fcb66c4efb
> # bad: [83c07ecc4203728e85fc4a2ce6fdf25d16ea118e] tracing: Remove
> lookups from tracing_map hitcount
> git bisect bad 83c07ecc4203728e85fc4a2ce6fdf25d16ea118e
> # bad: [12ecef0cb12102d8c034770173d2d1363cb97d52] tracing: Reverse
> the
> order of trace_types_lock and event_mutex
> git bisect bad 12ecef0cb12102d8c034770173d2d1363cb97d52
> # good: [6e7a2398114a2597a0995f96f44d908741ae5035] tracing: Remove
> redundant unread variable ret
> git bisect good 6e7a2398114a2597a0995f96f44d908741ae5035
> # first bad commit: [12ecef0cb12102d8c034770173d2d1363cb97d52]
> tracing:
> Reverse the order of trace_types_lock and event_mutex
> 
> -- [BISECT LOG END] --
> 
> This is the commit it landed on.
> 
> 12ecef0cb12102d8c034770173d2d1363cb97d52 is the first bad commit
> commit 12ecef0cb12102d8c034770173d2d1363cb97d52
> Author: Steven Rostedt (VMware) <rostedt@goodmis.org>
> Date:   Thu Sep 21 16:22:49 2017 -0400
> 
>     tracing: Reverse the order of trace_types_lock and event_mutex
>     
>     In order to make future changes where we need to call
>     tracing_set_clock() from within an event command, the order of
>     trace_types_lock and event_mutex must be reversed, as the event
> command
>     will hold event_mutex and the trace_types_lock is taken from
> within
>     tracing_set_clock().
>     
>     Link: http://lkml.kernel.org/r/20170921162249.0dde3dca@gandalf.lo
> ca
> l.home
>     
>     Requested-by: Tom Zanussi <tom.zanussi@linux.intel.com>
>     Signed-off-by: Steven Rostedt (VMware) <rostedt@goodmis.org>

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [BUG] function_graph trace causes  hang when using sleepgraph (4.15.0-rc1 and newer)
  2018-01-09  1:02 ` Todd Brandt
@ 2018-01-09  1:07   ` Steven Rostedt
  2018-01-09  1:25     ` Todd Brandt
  0 siblings, 1 reply; 8+ messages in thread
From: Steven Rostedt @ 2018-01-09  1:07 UTC (permalink / raw)
  To: Todd Brandt; +Cc: linux-kernel, linux-pm, len.brown, todd.e.brandt

On Mon, 08 Jan 2018 17:02:29 -0800
Todd Brandt <todd.e.brandt@linux.intel.com> wrote:

> Stephen, the problem is reversed by removing the following two commits,
> the one the bisect showed and the very next. So the problem is here:
> 
> commit 1a149d7d3f45d311da1f63473736c05f30ae8a75
> Author: Steven Rostedt (VMware) <rostedt@goodmis.org>
> Date:   Fri Sep 22 16:59:02 2017 -0400
> 
>     ring-buffer: Rewrite trace_recursive_(un)lock() to be simpler

This one still doesn't make sense, for why it would cause the hang.


> commit 12ecef0cb12102d8c034770173d2d1363cb97d52
> Author: Steven Rostedt (VMware) <rostedt@goodmis.org>
> Date:   Thu Sep 21 16:22:49 2017 -0400
> 
>     tracing: Reverse the order of trace_types_lock and event_mutex

This one does.

Can you run lockdep when you do this and see if lockdep catches
anything? If it does, it should point directly to where the inversed
locking happened.

-- Steve

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [BUG] function_graph trace causes  hang when using sleepgraph (4.15.0-rc1 and newer)
  2018-01-09  1:07   ` Steven Rostedt
@ 2018-01-09  1:25     ` Todd Brandt
  2018-01-09  3:07       ` Todd Brandt
  2018-01-09 16:13       ` Steven Rostedt
  0 siblings, 2 replies; 8+ messages in thread
From: Todd Brandt @ 2018-01-09  1:25 UTC (permalink / raw)
  To: Steven Rostedt; +Cc: linux-kernel, linux-pm, len.brown, todd.e.brandt

On Mon, 2018-01-08 at 20:07 -0500, Steven Rostedt wrote:
> On Mon, 08 Jan 2018 17:02:29 -0800
> Todd Brandt <todd.e.brandt@linux.intel.com> wrote:
> 
> > Stephen, the problem is reversed by removing the following two
> > commits,
> > the one the bisect showed and the very next. So the problem is
> > here:
> > 
> > commit 1a149d7d3f45d311da1f63473736c05f30ae8a75
> > Author: Steven Rostedt (VMware) <rostedt@goodmis.org>
> > Date:   Fri Sep 22 16:59:02 2017 -0400
> > 
> >     ring-buffer: Rewrite trace_recursive_(un)lock() to be simpler
> 
> This one still doesn't make sense, for why it would cause the hang.
> 
> 
> > commit 12ecef0cb12102d8c034770173d2d1363cb97d52
> > Author: Steven Rostedt (VMware) <rostedt@goodmis.org>
> > Date:   Thu Sep 21 16:22:49 2017 -0400
> > 
> >     tracing: Reverse the order of trace_types_lock and event_mutex
> 
> This one does.
> 
> Can you run lockdep when you do this and see if lockdep catches
> anything? If it does, it should point directly to where the inversed
> locking happened.

Can you reproduce the issue there? I just want to be sure it's not
something local to our machines here, as long as you have CONFIG_PM
enabled it should work the same hopefully. 

I'll give lockdep a try here.

> 
> -- Steve

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [BUG] function_graph trace causes  hang when using sleepgraph (4.15.0-rc1 and newer)
  2018-01-09  1:25     ` Todd Brandt
@ 2018-01-09  3:07       ` Todd Brandt
  2018-01-09 16:13       ` Steven Rostedt
  1 sibling, 0 replies; 8+ messages in thread
From: Todd Brandt @ 2018-01-09  3:07 UTC (permalink / raw)
  To: Steven Rostedt; +Cc: linux-kernel, linux-pm, len.brown, todd.e.brandt

On Mon, 2018-01-08 at 17:25 -0800, Todd Brandt wrote:
> On Mon, 2018-01-08 at 20:07 -0500, Steven Rostedt wrote:
> > On Mon, 08 Jan 2018 17:02:29 -0800
> > Todd Brandt <todd.e.brandt@linux.intel.com> wrote:
> > 
> > > Stephen, the problem is reversed by removing the following two
> > > commits,
> > > the one the bisect showed and the very next. So the problem is
> > > here:
> > > 
> > > commit 1a149d7d3f45d311da1f63473736c05f30ae8a75
> > > Author: Steven Rostedt (VMware) <rostedt@goodmis.org>
> > > Date:   Fri Sep 22 16:59:02 2017 -0400
> > > 
> > >     ring-buffer: Rewrite trace_recursive_(un)lock() to be simpler
> > 
> > This one still doesn't make sense, for why it would cause the hang.
> > 
> > 
> > > commit 12ecef0cb12102d8c034770173d2d1363cb97d52
> > > Author: Steven Rostedt (VMware) <rostedt@goodmis.org>
> > > Date:   Thu Sep 21 16:22:49 2017 -0400
> > > 
> > >     tracing: Reverse the order of trace_types_lock and
> > > event_mutex
> > 
> > This one does.
> > 
> > Can you run lockdep when you do this and see if lockdep catches
> > anything? If it does, it should point directly to where the
> > inversed
> > locking happened.
> 
> Can you reproduce the issue there? I just want to be sure it's not
> something local to our machines here, as long as you have CONFIG_PM
> enabled it should work the same hopefully. 
> 
> I'll give lockdep a try here.

I tried lockdep here (/proc/lockdep*) but I can't get any useful data
after the hang beyond the last printk. Is there a way I can enclose the
device_pm_callback function in some kind of debug harness to prevent
the hang (sorry, I'm not terribly familiar with lockdep for kernel
level hangs)?


> 
> > 
> > -- Steve

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [BUG] function_graph trace causes  hang when using sleepgraph (4.15.0-rc1 and newer)
  2018-01-09  1:25     ` Todd Brandt
  2018-01-09  3:07       ` Todd Brandt
@ 2018-01-09 16:13       ` Steven Rostedt
  2018-01-11  0:21         ` Todd Brandt
  2018-01-18 17:21         ` Todd Brandt
  1 sibling, 2 replies; 8+ messages in thread
From: Steven Rostedt @ 2018-01-09 16:13 UTC (permalink / raw)
  To: Todd Brandt; +Cc: linux-kernel, linux-pm, len.brown, todd.e.brandt

On Mon, 08 Jan 2018 17:25:15 -0800
Todd Brandt <todd.e.brandt@linux.intel.com> wrote:

> Can you reproduce the issue there? I just want to be sure it's not
> something local to our machines here, as long as you have CONFIG_PM
> enabled it should work the same hopefully. 

Just to let you know, I'm currently traveling, so my testing ability is
limited at the time being.

-- Steve

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [BUG] function_graph trace causes  hang when using sleepgraph (4.15.0-rc1 and newer)
  2018-01-09 16:13       ` Steven Rostedt
@ 2018-01-11  0:21         ` Todd Brandt
  2018-01-18 17:21         ` Todd Brandt
  1 sibling, 0 replies; 8+ messages in thread
From: Todd Brandt @ 2018-01-11  0:21 UTC (permalink / raw)
  To: Steven Rostedt; +Cc: linux-kernel, linux-pm, len.brown, todd.e.brandt

On Tue, 2018-01-09 at 11:13 -0500, Steven Rostedt wrote:
> On Mon, 08 Jan 2018 17:25:15 -0800
> Todd Brandt <todd.e.brandt@linux.intel.com> wrote:
> 
> > Can you reproduce the issue there? I just want to be sure it's not
> > something local to our machines here, as long as you have CONFIG_PM
> > enabled it should work the same hopefully. 
> 
> Just to let you know, I'm currently traveling, so my testing ability
> is
> limited at the time being.

ok, it's not an emergency, we're using a patched version of the kernel
with those two commits removed in the interim. I tried running kgdb
over serial console, but unfortunately it doesn't work over a USB-to-
Serial dongle (even Greg K-H says it won't work, :P). This is the best
output I could get, considering:

# run the test script here
[  545.733458] PM: suspend entry (deep)
[  545.734535] PM: Syncing filesystems ... done.
[  546.580551] Freezing user space processes ... (elapsed 0.013
seconds) done.
[  546.594846] OOM killer disabled.
[  546.595002] Freezing remaining freezable tasks ... (elapsed 0.005
seconds) done.
[  557.808719] NMI watchdog: Watchdog detected hard LOCKUP on cpu 0

So it's definitely a hard lockup, I just can't get the stack trace
without a system with a built in serial console.

> 
> -- Steve

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [BUG] function_graph trace causes  hang when using sleepgraph (4.15.0-rc1 and newer)
  2018-01-09 16:13       ` Steven Rostedt
  2018-01-11  0:21         ` Todd Brandt
@ 2018-01-18 17:21         ` Todd Brandt
  1 sibling, 0 replies; 8+ messages in thread
From: Todd Brandt @ 2018-01-18 17:21 UTC (permalink / raw)
  To: Steven Rostedt; +Cc: linux-kernel, linux-pm

On Tue, 2018-01-09 at 11:13 -0500, Steven Rostedt wrote:
> On Mon, 08 Jan 2018 17:25:15 -0800
> Todd Brandt <todd.e.brandt@linux.intel.com> wrote:
> 
> > Can you reproduce the issue there? I just want to be sure it's not
> > something local to our machines here, as long as you have CONFIG_PM
> > enabled it should work the same hopefully. 
> 
> Just to let you know, I'm currently traveling, so my testing ability
> is
> limited at the time being.
> 
> -- Steve

I just tested 4.15.0-rc8 (with the ring_buffer changes) and it appears
to work correctly now, thanks!

^ permalink raw reply	[flat|nested] 8+ messages in thread

end of thread, other threads:[~2018-01-18 17:21 UTC | newest]

Thread overview: 8+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2018-01-08 23:55 [BUG] function_graph trace causes hang when using sleepgraph (4.15.0-rc1 and newer) Todd Brandt
2018-01-09  1:02 ` Todd Brandt
2018-01-09  1:07   ` Steven Rostedt
2018-01-09  1:25     ` Todd Brandt
2018-01-09  3:07       ` Todd Brandt
2018-01-09 16:13       ` Steven Rostedt
2018-01-11  0:21         ` Todd Brandt
2018-01-18 17:21         ` Todd Brandt

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).