* [ceph-mgr] Signal Raised in tcmalloc
@ 2017-06-21 6:07 zengran zhang
2017-06-21 13:21 ` Sage Weil
0 siblings, 1 reply; 2+ messages in thread
From: zengran zhang @ 2017-06-21 6:07 UTC (permalink / raw)
To: tchaikov; +Cc: ceph-devel
Hi, Kefu:
we found a mgr instance hang, and attach it with gdb. we can see a
signal raised(#8) in or after
`DaemonPerfCounters::update(MMgrReport*)` , so its because the
tcmalloc heap corrupt. can you help to disgnose the problem? ceph
version is 12.0.2. thanks so much.
#0 0x00007f2245fc8026 in SpinLock::SlowLock() () from /usr/lib/libtcmalloc.so.4
#1 0x00007f2245fb907a in
tcmalloc::CentralFreeList::RemoveRange(void**, void**, int) () from
/usr/lib/libtcmalloc.so.4
#2 0x00007f2245fbbfaa in
tcmalloc::ThreadCache::FetchFromCentralCache(unsigned long, unsigned
long) () from /usr/lib/libtcmalloc.so.4
#3 0x00007f2245fce368 in tc_newarray () from /usr/lib/libtcmalloc.so.4
#4 0x000055840e8f670d in ?? ()
#5 0x000055840e8f6a9c in pretty_version_to_str[abi:cxx11]() ()
#6 0x000055840e7122b6 in ceph::BackTrace::print(std::ostream&) const ()
#7 0x000055840e6f8a27 in ?? ()
#8 <signal handler called>
#9 0x00007f2245fb8c33 in
tcmalloc::CentralFreeList::FetchFromOneSpans(int, void**, void**) ()
from /usr/lib/libtcmalloc.so.4
#10 0x00007f2245fb8f0a in
tcmalloc::CentralFreeList::FetchFromOneSpansSafe(int, void**, void**)
() from /usr/lib/libtcmalloc.so.4
#11 0x00007f2245fb8fbf in
tcmalloc::CentralFreeList::RemoveRange(void**, void**, int) () from
/usr/lib/libtcmalloc.so.4
#12 0x00007f2245fbbfaa in
tcmalloc::ThreadCache::FetchFromCentralCache(unsigned long, unsigned
long) () from /usr/lib/libtcmalloc.so.4
#13 0x00007f2245fce368 in tc_newarray () from /usr/lib/libtcmalloc.so.4
#14 0x000055840e67036d in ?? ()
#15 0x000055840e671e46 in DaemonPerfCounters::update(MMgrReport*) ()
#16 0x000055840e6770e4 in DaemonServer::handle_report(MMgrReport*) ()
#17 0x000055840e67cb70 in DaemonServer::ms_dispatch(Message*) ()
#18 0x000055840e970f6a in DispatchQueue::entry() ()
#19 0x000055840e809d7d in DispatchQueue::DispatchThread::entry() ()
#20 0x00007f22453d96ba in start_thread (arg=0x7f223b6dc700) at
pthread_create.c:333
#21 0x00007f224444582d in clone () at
../sysdeps/unix/sysv/linux/x86_64/clone.S:109
^ permalink raw reply [flat|nested] 2+ messages in thread
* Re: [ceph-mgr] Signal Raised in tcmalloc
2017-06-21 6:07 [ceph-mgr] Signal Raised in tcmalloc zengran zhang
@ 2017-06-21 13:21 ` Sage Weil
0 siblings, 0 replies; 2+ messages in thread
From: Sage Weil @ 2017-06-21 13:21 UTC (permalink / raw)
To: zengran zhang; +Cc: tchaikov, ceph-devel
On Wed, 21 Jun 2017, zengran zhang wrote:
> Hi, Kefu:
>
> we found a mgr instance hang, and attach it with gdb. we can see a
> signal raised(#8) in or after
> `DaemonPerfCounters::update(MMgrReport*)` , so its because the
> tcmalloc heap corrupt. can you help to disgnose the problem? ceph
> version is 12.0.2. thanks so much.
This is likely to be fixed on the latest master. We were seeing lots of
crashes when the mgr went from active to inactive; now it is quite
stable.
sage
>
> #0 0x00007f2245fc8026 in SpinLock::SlowLock() () from /usr/lib/libtcmalloc.so.4
> #1 0x00007f2245fb907a in
> tcmalloc::CentralFreeList::RemoveRange(void**, void**, int) () from
> /usr/lib/libtcmalloc.so.4
> #2 0x00007f2245fbbfaa in
> tcmalloc::ThreadCache::FetchFromCentralCache(unsigned long, unsigned
> long) () from /usr/lib/libtcmalloc.so.4
> #3 0x00007f2245fce368 in tc_newarray () from /usr/lib/libtcmalloc.so.4
> #4 0x000055840e8f670d in ?? ()
> #5 0x000055840e8f6a9c in pretty_version_to_str[abi:cxx11]() ()
> #6 0x000055840e7122b6 in ceph::BackTrace::print(std::ostream&) const ()
> #7 0x000055840e6f8a27 in ?? ()
> #8 <signal handler called>
> #9 0x00007f2245fb8c33 in
> tcmalloc::CentralFreeList::FetchFromOneSpans(int, void**, void**) ()
> from /usr/lib/libtcmalloc.so.4
> #10 0x00007f2245fb8f0a in
> tcmalloc::CentralFreeList::FetchFromOneSpansSafe(int, void**, void**)
> () from /usr/lib/libtcmalloc.so.4
> #11 0x00007f2245fb8fbf in
> tcmalloc::CentralFreeList::RemoveRange(void**, void**, int) () from
> /usr/lib/libtcmalloc.so.4
> #12 0x00007f2245fbbfaa in
> tcmalloc::ThreadCache::FetchFromCentralCache(unsigned long, unsigned
> long) () from /usr/lib/libtcmalloc.so.4
> #13 0x00007f2245fce368 in tc_newarray () from /usr/lib/libtcmalloc.so.4
> #14 0x000055840e67036d in ?? ()
> #15 0x000055840e671e46 in DaemonPerfCounters::update(MMgrReport*) ()
> #16 0x000055840e6770e4 in DaemonServer::handle_report(MMgrReport*) ()
> #17 0x000055840e67cb70 in DaemonServer::ms_dispatch(Message*) ()
> #18 0x000055840e970f6a in DispatchQueue::entry() ()
> #19 0x000055840e809d7d in DispatchQueue::DispatchThread::entry() ()
> #20 0x00007f22453d96ba in start_thread (arg=0x7f223b6dc700) at
> pthread_create.c:333
> #21 0x00007f224444582d in clone () at
> ../sysdeps/unix/sysv/linux/x86_64/clone.S:109
> --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at http://vger.kernel.org/majordomo-info.html
>
>
^ permalink raw reply [flat|nested] 2+ messages in thread
end of thread, other threads:[~2017-06-21 13:21 UTC | newest]
Thread overview: 2+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2017-06-21 6:07 [ceph-mgr] Signal Raised in tcmalloc zengran zhang
2017-06-21 13:21 ` Sage Weil
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.