All of lore.kernel.org
 help / color / mirror / Atom feed
From: Stefan Priebe <s.priebe-2Lf/h1ldwEHR5kwTpVNS9A@public.gmane.org>
To: Sage Weil <sage-4GqslpFJ+cxBDgjK7y7TUQ@public.gmane.org>
Cc: ceph-devel-u79uwXL29TY76Z2rM5mHXA@public.gmane.org,
	ceph-users-Qp0mS5GaXlQ@public.gmane.org
Subject: Re: v0.61.5 Cuttlefish update released
Date: Fri, 19 Jul 2013 22:25:51 +0200	[thread overview]
Message-ID: <51E9A0CF.3010509@profihost.ag> (raw)
In-Reply-To: <alpine.DEB.2.00.1307190813570.10870-vIokxiIdD2AQNTJnQDzGJqxOck334EZe@public.gmane.org>

Hi,

sorry as all my mons were down with the same error - i was in a hurry 
made sadly no copy of the mons and workaround by hack ;-( but i posted a 
log to pastebin with debug mon 20. (see last email)

Stefan
Am 19.07.2013 17:14, schrieb Sage Weil:
> On Fri, 19 Jul 2013, Stefan Priebe - Profihost AG wrote:
>> crash is this one:
>
> Can you post a full lost (debug mon = 20, debug paxos = 20, debug ms = 1),
> and/or hit us up on irc?
>
>>
>> 2013-07-19 08:59:32.137646 7f484a872780  0 ceph version
>> 0.61.5-17-g83f8b88 (83f8b88e5be41371cb77b39c0966e79cad92087b), process
>
> What are the 17 patches on top? Anything in the mon?
>
> Thanks!
> sage
>
>> ceph-mon, pid 22172
>> 2013-07-19 08:59:32.173975 7f484a872780 -1 mon/OSDMonitor.cc: In
>> function 'virtual void OSDMonitor::update_from_paxos(bool*)' thread
>> 7f484a872780 time 2013-07-19 08:59:32.173506
>> mon/OSDMonitor.cc: 132: FAILED assert(latest_bl.length() != 0)
>>
>>   ceph version 0.61.5-17-g83f8b88 (83f8b88e5be41371cb77b39c0966e79cad92087b)
>>   1: (OSDMonitor::update_from_paxos(bool*)+0x16e1) [0x51d341]
>>   2: (PaxosService::refresh(bool*)+0xe6) [0x4f2c66]
>>   3: (Monitor::refresh_from_paxos(bool*)+0x57) [0x48f7b7]
>>   4: (Monitor::init_paxos()+0xe5) [0x48f955]
>>   5: (Monitor::preinit()+0x679) [0x4bba79]
>>   6: (main()+0x36b0) [0x484bb0]
>>   7: (__libc_start_main()+0xfd) [0x7f48489cec8d]
>>   8: /usr/bin/ceph-mon() [0x4801e9]
>>   NOTE: a copy of the executable, or `objdump -rdS <executable>` is
>> needed to interpret this.
>>
>> --- begin dump of recent events ---
>>     -13> 2013-07-19 08:59:32.136172 7f484a872780  5 asok(0x131a000)
>> register_command perfcounters_dump hook 0x1304010
>>     -12> 2013-07-19 08:59:32.136191 7f484a872780  5 asok(0x131a000)
>> register_command 1 hook 0x1304010
>>     -11> 2013-07-19 08:59:32.136194 7f484a872780  5 asok(0x131a000)
>> register_command perf dump hook 0x1304010
>>     -10> 2013-07-19 08:59:32.136200 7f484a872780  5 asok(0x131a000)
>> register_command perfcounters_schema hook 0x1304010
>>      -9> 2013-07-19 08:59:32.136204 7f484a872780  5 asok(0x131a000)
>> register_command 2 hook 0x1304010
>>      -8> 2013-07-19 08:59:32.136206 7f484a872780  5 asok(0x131a000)
>> register_command perf schema hook 0x1304010
>>      -7> 2013-07-19 08:59:32.136208 7f484a872780  5 asok(0x131a000)
>> register_command config show hook 0x1304010
>>      -6> 2013-07-19 08:59:32.136211 7f484a872780  5 asok(0x131a000)
>> register_command config set hook 0x1304010
>>      -5> 2013-07-19 08:59:32.136214 7f484a872780  5 asok(0x131a000)
>> register_command log flush hook 0x1304010
>>      -4> 2013-07-19 08:59:32.136216 7f484a872780  5 asok(0x131a000)
>> register_command log dump hook 0x1304010
>>      -3> 2013-07-19 08:59:32.136219 7f484a872780  5 asok(0x131a000)
>> register_command log reopen hook 0x1304010
>>      -2> 2013-07-19 08:59:32.137646 7f484a872780  0 ceph version
>> 0.61.5-17-g83f8b88 (83f8b88e5be41371cb77b39c0966e79cad92087b), process
>> ceph-mon, pid 22172
>>      -1> 2013-07-19 08:59:32.137967 7f484a872780  1 finished
>> global_init_daemonize
>>       0> 2013-07-19 08:59:32.173975 7f484a872780 -1 mon/OSDMonitor.cc: In
>> function 'virtual void OSDMonitor::update_from_paxos(bool*)' thread
>> 7f484a872780 time 2013-07-19 08:59:32.173506
>> mon/OSDMonitor.cc: 132: FAILED assert(latest_bl.length() != 0)
>>
>>   ceph version 0.61.5-17-g83f8b88 (83f8b88e5be41371cb77b39c0966e79cad92087b)
>>   1: (OSDMonitor::update_from_paxos(bool*)+0x16e1) [0x51d341]
>>   2: (PaxosService::refresh(bool*)+0xe6) [0x4f2c66]
>>   3: (Monitor::refresh_from_paxos(bool*)+0x57) [0x48f7b7]
>>   4: (Monitor::init_paxos()+0xe5) [0x48f955]
>>   5: (Monitor::preinit()+0x679) [0x4bba79]
>>   6: (main()+0x36b0) [0x484bb0]
>>   7: (__libc_start_main()+0xfd) [0x7f48489cec8d]
>>   8: /usr/bin/ceph-mon() [0x4801e9]
>>   NOTE: a copy of the executable, or `objdump -rdS <executable>` is
>> needed to interpret this.
>>
>> Mi
>>
>> Am 19.07.2013 08:58, schrieb Stefan Priebe - Profihost AG:
>>> All mons do not work anymore:
>>>
>>> === mon.a ===
>>> Starting Ceph mon.a on ccad...
>>> [21207]: (33) Numerical argument out of domain
>>> failed: 'ulimit -n 8192;  /usr/bin/ceph-mon -i a --pid-file
>>> /var/run/ceph/mon.a.pid -c /etc/ceph/ceph.conf '
>>>
>>> Stefan
>>>
>>> Am 19.07.2013 07:59, schrieb Sage Weil:
>>>> A note on upgrading:
>>>>
>>>> One of the fixes in 0.61.5 is with a 32bit vs 64bit bug with the feature
>>>> bits.  We did not realize it before, but the fix will prevent 0.61.4 (or
>>>> earlier) from forming a quorum with 0.61.5. This is similar to the upgrade
>>>> from bobtail (and the future upgrade to dumpling). As such, we recommend
>>>> you upgrade all monitors at once to avoid the potential for discruption in
>>>> service.
>>>>
>>>> I'm adding a note to the release notes.
>>>>
>>>> Thanks!
>>>> sage
>>>>
>>>>
>>>> On Thu, 18 Jul 2013, Sage Weil wrote:
>>>>
>>>>> We've prepared another update for the Cuttlefish v0.61.x series. This
>>>>> release primarily contains monitor stability improvements, although there
>>>>> are also some important fixes for ceph-osd for large clusters and a few
>>>>> important CephFS fixes. We recommend that all v0.61.x users upgrade.
>>>>>
>>>>>   * mon: misc sync improvements (faster, more reliable, better tuning)
>>>>>   * mon: enable leveldb cache by default (big performance improvement)
>>>>>   * mon: new scrub feature (primarily for diagnostic, testing purposes)
>>>>>   * mon: fix occasional leveldb assertion on startup
>>>>>   * mon: prevent reads until initial state is committed
>>>>>   * mon: improved logic for trimming old osdmaps
>>>>>   * mon: fix pick_addresses bug when expanding mon cluster
>>>>>   * mon: several small paxos fixes, improvements
>>>>>   * mon: fix bug osdmap trim behavior
>>>>>   * osd: fix several bugs with PG stat reporting
>>>>>   * osd: limit number of maps shared with peers (which could cause domino failures)
>>>>>   * rgw: fix radosgw-admin buckets list (for all buckets)
>>>>>   * mds: fix occasional client failure to reconnect
>>>>>   * mds: fix bad list traversal after unlink
>>>>>   * mds: fix underwater dentry cleanup (occasional crash after mds restart)
>>>>>   * libcephfs, ceph-fuse: fix occasional hangs on umount
>>>>>   * libcephfs, ceph-fuse: fix old bug with O_LAZY vs O_NOATIME confusion
>>>>>   * ceph-disk: more robust journal device detection on RHEL/CentOS
>>>>>   * ceph-disk: better, simpler locking
>>>>>   * ceph-disk: do not inadvertantely mount over existing osd mounts
>>>>>   * ceph-disk: better handling for unusual device names
>>>>>   * sysvinit, upstart: handle symlinks in /var/lib/ceph/*
>>>>>
>>>>> Please also refer to the complete release notes:
>>>>>
>>>>>     http://ceph.com/docs/master/release-notes/#v0-61-5-cuttlefish
>>>>>
>>>>> You can get v0.61.5 from the usual locations:
>>>>>
>>>>>   * Git at git://github.com/ceph/ceph.git
>>>>>   * Tarball at http://ceph.com/download/ceph-0.61.5.tar.gz
>>>>>   * For Debian/Ubuntu packages, see http://ceph.com/docs/master/install/debian
>>>>>   * For RPMs, see http://ceph.com/docs/master/install/rpm
>>>>>
>>>> _______________________________________________
>>>> ceph-users mailing list
>>>> ceph-users-idqoXFIVOFJgJs9I8MT0rw@public.gmane.org
>>>> http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
>>>>
>>
>>

      parent reply	other threads:[~2013-07-19 20:25 UTC|newest]

Thread overview: 9+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2013-07-18 16:45 v0.61.5 Cuttlefish update released Sage Weil
2013-07-19  5:59 ` Sage Weil
2013-07-19  6:58   ` [ceph-users] " Stefan Priebe - Profihost AG
     [not found]     ` <51E8E3B3.8020300-2Lf/h1ldwEHR5kwTpVNS9A@public.gmane.org>
2013-07-19  7:00       ` Stefan Priebe - Profihost AG
2013-07-19  7:56         ` [ceph-users] " Stefan Priebe - Profihost AG
     [not found]         ` <51E8E409.8020201-2Lf/h1ldwEHR5kwTpVNS9A@public.gmane.org>
2013-07-19  7:56           ` Dan van der Ster
2013-07-19  7:57             ` [ceph-users] " Stefan Priebe - Profihost AG
2013-07-19 15:14         ` Sage Weil
     [not found]           ` <alpine.DEB.2.00.1307190813570.10870-vIokxiIdD2AQNTJnQDzGJqxOck334EZe@public.gmane.org>
2013-07-19 20:25             ` Stefan Priebe [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=51E9A0CF.3010509@profihost.ag \
    --to=s.priebe-2lf/h1ldwehr5kwtpvns9a@public.gmane.org \
    --cc=ceph-devel-u79uwXL29TY76Z2rM5mHXA@public.gmane.org \
    --cc=ceph-users-Qp0mS5GaXlQ@public.gmane.org \
    --cc=sage-4GqslpFJ+cxBDgjK7y7TUQ@public.gmane.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.