* Latest bobtail branch still crashing KVM VMs in bh_write_commit()
@ 2013-03-20 19:49 Travis Rhoden
2013-03-20 19:51 ` Campbell, Bill
2013-03-20 19:53 ` Stefan Priebe
0 siblings, 2 replies; 16+ messages in thread
From: Travis Rhoden @ 2013-03-20 19:49 UTC (permalink / raw)
To: ceph-devel
Hey folks,
We were hoping this one was fixed. I upgraded all my nodes to the
latest bobtail branch, but still hit this today:
osdc/ObjectCacher.cc: In function 'void
ObjectCacher::bh_write_commit(int64_t, sobject_t, loff_t, uint64_t,
tid_t, int)' thread 7f650e62f700 time 2013-03-20 19:34:39.952616
osdc/ObjectCacher.cc: 834: FAILED assert(ob->last_commit_tid < tid)
ceph version 0.56.3-42-ga30903c (a30903c6adaa023587d3147179d6038ad37ca520)
1: (ObjectCacher::bh_write_commit(long, sobject_t, long, unsigned
long, unsigned long, int)+0xd68) [0x7f651d0ada48]
2: (ObjectCacher::C_WriteCommit::finish(int)+0x6b) [0x7f651d0b460b]
3: (Context::complete(int)+0xa) [0x7f651d06c9fa]
4: (librbd::C_Request::finish(int)+0x85) [0x7f651d09c315]
5: (Context::complete(int)+0xa) [0x7f651d06c9fa]
6: (librbd::rados_req_cb(void*, void*)+0x47) [0x7f651d081387]
7: (librados::C_AioSafe::finish(int)+0x1d) [0x7f651c43163d]
8: (Finisher::finisher_thread_entry()+0x1c0) [0x7f651c49c920]
9: (()+0x7e9a) [0x7f6519cffe9a]
10: (clone()+0x6d) [0x7f6519a2bcbd]
NOTE: a copy of the executable, or `objdump -rdS <executable>` is
needed to interpret this.
Is this occuring in librbd caching? If so, I could disable it for the
time being.
First saw this mentioned on-list here:
http://thread.gmane.org/gmane.comp.file-systems.ceph.devel/13577
Will be happy to provide anything I can for this one -- definitely
critical for my use case. It happens with about 10% of the VMs I
create. Always within the first 60 seconds of the VM booting and
being network accessible.
- Travis
^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: Latest bobtail branch still crashing KVM VMs in bh_write_commit()
2013-03-20 19:49 Latest bobtail branch still crashing KVM VMs in bh_write_commit() Travis Rhoden
@ 2013-03-20 19:51 ` Campbell, Bill
2013-03-20 19:53 ` Stefan Priebe
1 sibling, 0 replies; 16+ messages in thread
From: Campbell, Bill @ 2013-03-20 19:51 UTC (permalink / raw)
To: Travis Rhoden; +Cc: ceph-devel
Travis, are you using format 1 or 2 images? I've seen the same behavior on format 2 images using cloned snapshots, but haven't run into this issue on any normal format 2 images.
----- Original Message -----
From: "Travis Rhoden" <trhoden@gmail.com>
To: "ceph-devel" <ceph-devel@vger.kernel.org>
Sent: Wednesday, March 20, 2013 3:49:23 PM
Subject: Latest bobtail branch still crashing KVM VMs in bh_write_commit()
Hey folks,
We were hoping this one was fixed. I upgraded all my nodes to the
latest bobtail branch, but still hit this today:
osdc/ObjectCacher.cc: In function 'void
ObjectCacher::bh_write_commit(int64_t, sobject_t, loff_t, uint64_t,
tid_t, int)' thread 7f650e62f700 time 2013-03-20 19:34:39.952616
osdc/ObjectCacher.cc: 834: FAILED assert(ob->last_commit_tid < tid)
ceph version 0.56.3-42-ga30903c (a30903c6adaa023587d3147179d6038ad37ca520)
1: (ObjectCacher::bh_write_commit(long, sobject_t, long, unsigned
long, unsigned long, int)+0xd68) [0x7f651d0ada48]
2: (ObjectCacher::C_WriteCommit::finish(int)+0x6b) [0x7f651d0b460b]
3: (Context::complete(int)+0xa) [0x7f651d06c9fa]
4: (librbd::C_Request::finish(int)+0x85) [0x7f651d09c315]
5: (Context::complete(int)+0xa) [0x7f651d06c9fa]
6: (librbd::rados_req_cb(void*, void*)+0x47) [0x7f651d081387]
7: (librados::C_AioSafe::finish(int)+0x1d) [0x7f651c43163d]
8: (Finisher::finisher_thread_entry()+0x1c0) [0x7f651c49c920]
9: (()+0x7e9a) [0x7f6519cffe9a]
10: (clone()+0x6d) [0x7f6519a2bcbd]
NOTE: a copy of the executable, or `objdump -rdS <executable>` is
needed to interpret this.
Is this occuring in librbd caching? If so, I could disable it for the
time being.
First saw this mentioned on-list here:
http://thread.gmane.org/gmane.comp.file-systems.ceph.devel/13577
Will be happy to provide anything I can for this one -- definitely
critical for my use case. It happens with about 10% of the VMs I
create. Always within the first 60 seconds of the VM booting and
being network accessible.
- Travis
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
NOTICE: Protect the information in this message in accordance with the company's security policies. If you received this message in error, immediately notify the sender and destroy all copies.
^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: Latest bobtail branch still crashing KVM VMs in bh_write_commit()
2013-03-20 19:49 Latest bobtail branch still crashing KVM VMs in bh_write_commit() Travis Rhoden
2013-03-20 19:51 ` Campbell, Bill
@ 2013-03-20 19:53 ` Stefan Priebe
2013-03-20 20:04 ` Travis Rhoden
1 sibling, 1 reply; 16+ messages in thread
From: Stefan Priebe @ 2013-03-20 19:53 UTC (permalink / raw)
To: Travis Rhoden; +Cc: ceph-devel
Hi,
strange i've never seen this. Which qemu version?
Stefan
Am 20.03.2013 20:49, schrieb Travis Rhoden:
> Hey folks,
>
> We were hoping this one was fixed. I upgraded all my nodes to the
> latest bobtail branch, but still hit this today:
>
> osdc/ObjectCacher.cc: In function 'void
> ObjectCacher::bh_write_commit(int64_t, sobject_t, loff_t, uint64_t,
> tid_t, int)' thread 7f650e62f700 time 2013-03-20 19:34:39.952616
> osdc/ObjectCacher.cc: 834: FAILED assert(ob->last_commit_tid < tid)
> ceph version 0.56.3-42-ga30903c (a30903c6adaa023587d3147179d6038ad37ca520)
> 1: (ObjectCacher::bh_write_commit(long, sobject_t, long, unsigned
> long, unsigned long, int)+0xd68) [0x7f651d0ada48]
> 2: (ObjectCacher::C_WriteCommit::finish(int)+0x6b) [0x7f651d0b460b]
> 3: (Context::complete(int)+0xa) [0x7f651d06c9fa]
> 4: (librbd::C_Request::finish(int)+0x85) [0x7f651d09c315]
> 5: (Context::complete(int)+0xa) [0x7f651d06c9fa]
> 6: (librbd::rados_req_cb(void*, void*)+0x47) [0x7f651d081387]
> 7: (librados::C_AioSafe::finish(int)+0x1d) [0x7f651c43163d]
> 8: (Finisher::finisher_thread_entry()+0x1c0) [0x7f651c49c920]
> 9: (()+0x7e9a) [0x7f6519cffe9a]
> 10: (clone()+0x6d) [0x7f6519a2bcbd]
> NOTE: a copy of the executable, or `objdump -rdS <executable>` is
> needed to interpret this.
>
> Is this occuring in librbd caching? If so, I could disable it for the
> time being.
>
> First saw this mentioned on-list here:
> http://thread.gmane.org/gmane.comp.file-systems.ceph.devel/13577
>
> Will be happy to provide anything I can for this one -- definitely
> critical for my use case. It happens with about 10% of the VMs I
> create. Always within the first 60 seconds of the VM booting and
> being network accessible.
>
> - Travis
> --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at http://vger.kernel.org/majordomo-info.html
>
^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: Latest bobtail branch still crashing KVM VMs in bh_write_commit()
2013-03-20 19:53 ` Stefan Priebe
@ 2013-03-20 20:04 ` Travis Rhoden
2013-03-20 20:14 ` Stefan Priebe
0 siblings, 1 reply; 16+ messages in thread
From: Travis Rhoden @ 2013-03-20 20:04 UTC (permalink / raw)
To: Stefan Priebe, bcampbell; +Cc: ceph-devel
Hello.
> Travis, are you using format 1 or 2 images? I've seen the same behavior on format 2 images using cloned snapshots, but haven't run into this issue on any normal format 2 images.
In this case, they are format 2. And they are from cloned snapshots.
Exactly like the following:
# rbd ls -l -p volumes
NAME SIZE
PARENT FMT PROT LOCK
volume-099a6d74-05bd-4f00-a12e-009d60629aa8 5120M
images/b8bdda90-664b-4906-86d6-dd33735441f2@snap 2
I'm doing an OpenStack boot-from-volume setup.
> strange i've never seen this. Which qemu version?
# qemu-x86_64 -version
qemu-x86_64 version 1.0 (qemu-kvm-1.0), Copyright (c) 2003-2008 Fabrice Bellard
that's coming from Ubuntu 12.04 apt repos.
- Travis
On Wed, Mar 20, 2013 at 3:53 PM, Stefan Priebe <s.priebe@profihost.ag> wrote:
> Hi,
>
> strange i've never seen this. Which qemu version?
>
> Stefan
> Am 20.03.2013 20:49, schrieb Travis Rhoden:
>>
>> Hey folks,
>>
>> We were hoping this one was fixed. I upgraded all my nodes to the
>> latest bobtail branch, but still hit this today:
>>
>> osdc/ObjectCacher.cc: In function 'void
>> ObjectCacher::bh_write_commit(int64_t, sobject_t, loff_t, uint64_t,
>> tid_t, int)' thread 7f650e62f700 time 2013-03-20 19:34:39.952616
>> osdc/ObjectCacher.cc: 834: FAILED assert(ob->last_commit_tid < tid)
>> ceph version 0.56.3-42-ga30903c
>> (a30903c6adaa023587d3147179d6038ad37ca520)
>> 1: (ObjectCacher::bh_write_commit(long, sobject_t, long, unsigned
>> long, unsigned long, int)+0xd68) [0x7f651d0ada48]
>> 2: (ObjectCacher::C_WriteCommit::finish(int)+0x6b) [0x7f651d0b460b]
>> 3: (Context::complete(int)+0xa) [0x7f651d06c9fa]
>> 4: (librbd::C_Request::finish(int)+0x85) [0x7f651d09c315]
>> 5: (Context::complete(int)+0xa) [0x7f651d06c9fa]
>> 6: (librbd::rados_req_cb(void*, void*)+0x47) [0x7f651d081387]
>> 7: (librados::C_AioSafe::finish(int)+0x1d) [0x7f651c43163d]
>> 8: (Finisher::finisher_thread_entry()+0x1c0) [0x7f651c49c920]
>> 9: (()+0x7e9a) [0x7f6519cffe9a]
>> 10: (clone()+0x6d) [0x7f6519a2bcbd]
>> NOTE: a copy of the executable, or `objdump -rdS <executable>` is
>> needed to interpret this.
>>
>> Is this occuring in librbd caching? If so, I could disable it for the
>> time being.
>>
>> First saw this mentioned on-list here:
>> http://thread.gmane.org/gmane.comp.file-systems.ceph.devel/13577
>>
>> Will be happy to provide anything I can for this one -- definitely
>> critical for my use case. It happens with about 10% of the VMs I
>> create. Always within the first 60 seconds of the VM booting and
>> being network accessible.
>>
>> - Travis
>> --
>> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
>> the body of a message to majordomo@vger.kernel.org
>> More majordomo info at http://vger.kernel.org/majordomo-info.html
>>
>
^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: Latest bobtail branch still crashing KVM VMs in bh_write_commit()
2013-03-20 20:04 ` Travis Rhoden
@ 2013-03-20 20:14 ` Stefan Priebe
2013-03-20 20:17 ` Travis Rhoden
` (2 more replies)
0 siblings, 3 replies; 16+ messages in thread
From: Stefan Priebe @ 2013-03-20 20:14 UTC (permalink / raw)
To: Travis Rhoden; +Cc: bcampbell, ceph-devel
Hi,
> In this case, they are format 2. And they are from cloned snapshots.
> Exactly like the following:
>
> # rbd ls -l -p volumes
> NAME SIZE
> PARENT FMT PROT LOCK
> volume-099a6d74-05bd-4f00-a12e-009d60629aa8 5120M
> images/b8bdda90-664b-4906-86d6-dd33735441f2@snap 2
>
> I'm doing an OpenStack boot-from-volume setup.
OK i've never used cloned snapshots so maybe this is the reason.
>> strange i've never seen this. Which qemu version?
>
> # qemu-x86_64 -version
> qemu-x86_64 version 1.0 (qemu-kvm-1.0), Copyright (c) 2003-2008 Fabrice Bellard
>
> that's coming from Ubuntu 12.04 apt repos.
maybe you should try qemu 1.4 there are a LOT of bugfixes. qemu-kvm does
not exist anymore it was merged into qemu with 1.3 or 1.4.
Stefan
^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: Latest bobtail branch still crashing KVM VMs in bh_write_commit()
2013-03-20 20:14 ` Stefan Priebe
@ 2013-03-20 20:17 ` Travis Rhoden
2013-03-20 20:19 ` Josh Durgin
2013-03-21 0:58 ` Jacky.He
2 siblings, 0 replies; 16+ messages in thread
From: Travis Rhoden @ 2013-03-20 20:17 UTC (permalink / raw)
To: Stefan Priebe; +Cc: bcampbell, ceph-devel
On Wed, Mar 20, 2013 at 4:14 PM, Stefan Priebe <s.priebe@profihost.ag> wrote:
> Hi,
>
>
>> In this case, they are format 2. And they are from cloned snapshots.
>> Exactly like the following:
>>
>> # rbd ls -l -p volumes
>> NAME SIZE
>> PARENT FMT PROT LOCK
>> volume-099a6d74-05bd-4f00-a12e-009d60629aa8 5120M
>> images/b8bdda90-664b-4906-86d6-dd33735441f2@snap 2
>>
>> I'm doing an OpenStack boot-from-volume setup.
>
>
> OK i've never used cloned snapshots so maybe this is the reason.
>
>
>>> strange i've never seen this. Which qemu version?
>>
>>
>> # qemu-x86_64 -version
>> qemu-x86_64 version 1.0 (qemu-kvm-1.0), Copyright (c) 2003-2008 Fabrice
>> Bellard
>>
>> that's coming from Ubuntu 12.04 apt repos.
>
>
> maybe you should try qemu 1.4 there are a LOT of bugfixes. qemu-kvm does not
> exist anymore it was merged into qemu with 1.3 or 1.4.
Since the crash is in librbd, would an update of qemu help anything?
>
> Stefan
^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: Latest bobtail branch still crashing KVM VMs in bh_write_commit()
2013-03-20 20:14 ` Stefan Priebe
2013-03-20 20:17 ` Travis Rhoden
@ 2013-03-20 20:19 ` Josh Durgin
2013-03-20 20:32 ` Josh Durgin
2013-03-21 0:58 ` Jacky.He
2 siblings, 1 reply; 16+ messages in thread
From: Josh Durgin @ 2013-03-20 20:19 UTC (permalink / raw)
To: Stefan Priebe; +Cc: Travis Rhoden, bcampbell, ceph-devel
On 03/20/2013 01:14 PM, Stefan Priebe wrote:
> Hi,
>
>> In this case, they are format 2. And they are from cloned snapshots.
>> Exactly like the following:
>>
>> # rbd ls -l -p volumes
>> NAME SIZE
>> PARENT FMT PROT LOCK
>> volume-099a6d74-05bd-4f00-a12e-009d60629aa8 5120M
>> images/b8bdda90-664b-4906-86d6-dd33735441f2@snap 2
>>
>> I'm doing an OpenStack boot-from-volume setup.
>
> OK i've never used cloned snapshots so maybe this is the reason.
>
>>> strange i've never seen this. Which qemu version?
>>
>> # qemu-x86_64 -version
>> qemu-x86_64 version 1.0 (qemu-kvm-1.0), Copyright (c) 2003-2008
>> Fabrice Bellard
>>
>> that's coming from Ubuntu 12.04 apt repos.
>
> maybe you should try qemu 1.4 there are a LOT of bugfixes. qemu-kvm does
> not exist anymore it was merged into qemu with 1.3 or 1.4.
This particular problem won't be solved by upgrading qemu. It's a ceph
bug. Disabling caching would work around the issue.
Travis, could you get a log from qemu of this happening with:
debug ms = 20
debug objectcacher = 20
debug rbd = 20
log file = /path/writeable/by/qemu
From those we can tell whether the issue is on the client side at least,
and hopefully what's causing it.
Thanks!
Josh
^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: Latest bobtail branch still crashing KVM VMs in bh_write_commit()
2013-03-20 20:19 ` Josh Durgin
@ 2013-03-20 20:32 ` Josh Durgin
2013-03-20 20:39 ` Travis Rhoden
0 siblings, 1 reply; 16+ messages in thread
From: Josh Durgin @ 2013-03-20 20:32 UTC (permalink / raw)
To: Stefan Priebe; +Cc: Travis Rhoden, bcampbell, ceph-devel
On 03/20/2013 01:19 PM, Josh Durgin wrote:
> On 03/20/2013 01:14 PM, Stefan Priebe wrote:
>> Hi,
>>
>>> In this case, they are format 2. And they are from cloned snapshots.
>>> Exactly like the following:
>>>
>>> # rbd ls -l -p volumes
>>> NAME SIZE
>>> PARENT FMT PROT LOCK
>>> volume-099a6d74-05bd-4f00-a12e-009d60629aa8 5120M
>>> images/b8bdda90-664b-4906-86d6-dd33735441f2@snap 2
>>>
>>> I'm doing an OpenStack boot-from-volume setup.
>>
>> OK i've never used cloned snapshots so maybe this is the reason.
>>
>>>> strange i've never seen this. Which qemu version?
>>>
>>> # qemu-x86_64 -version
>>> qemu-x86_64 version 1.0 (qemu-kvm-1.0), Copyright (c) 2003-2008
>>> Fabrice Bellard
>>>
>>> that's coming from Ubuntu 12.04 apt repos.
>>
>> maybe you should try qemu 1.4 there are a LOT of bugfixes. qemu-kvm does
>> not exist anymore it was merged into qemu with 1.3 or 1.4.
>
> This particular problem won't be solved by upgrading qemu. It's a ceph
> bug. Disabling caching would work around the issue.
>
> Travis, could you get a log from qemu of this happening with:
>
> debug ms = 20
> debug objectcacher = 20
> debug rbd = 20
> log file = /path/writeable/by/qemu
If it doesn't reproduce with those settings, try changing debug ms to 1
instead of 20.
> From those we can tell whether the issue is on the client side at least,
> and hopefully what's causing it.
>
> Thanks!
> Josh
^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: Latest bobtail branch still crashing KVM VMs in bh_write_commit()
2013-03-20 20:32 ` Josh Durgin
@ 2013-03-20 20:39 ` Travis Rhoden
2013-03-20 21:47 ` Travis Rhoden
0 siblings, 1 reply; 16+ messages in thread
From: Travis Rhoden @ 2013-03-20 20:39 UTC (permalink / raw)
To: Josh Durgin; +Cc: Stefan Priebe, bcampbell, ceph-devel
Thanks Josh. I will respond when I have something useful!
On Wed, Mar 20, 2013 at 4:32 PM, Josh Durgin <josh.durgin@inktank.com> wrote:
> On 03/20/2013 01:19 PM, Josh Durgin wrote:
>>
>> On 03/20/2013 01:14 PM, Stefan Priebe wrote:
>>>
>>> Hi,
>>>
>>>> In this case, they are format 2. And they are from cloned snapshots.
>>>> Exactly like the following:
>>>>
>>>> # rbd ls -l -p volumes
>>>> NAME SIZE
>>>> PARENT FMT PROT LOCK
>>>> volume-099a6d74-05bd-4f00-a12e-009d60629aa8 5120M
>>>> images/b8bdda90-664b-4906-86d6-dd33735441f2@snap 2
>>>>
>>>> I'm doing an OpenStack boot-from-volume setup.
>>>
>>>
>>> OK i've never used cloned snapshots so maybe this is the reason.
>>>
>>>>> strange i've never seen this. Which qemu version?
>>>>
>>>>
>>>> # qemu-x86_64 -version
>>>> qemu-x86_64 version 1.0 (qemu-kvm-1.0), Copyright (c) 2003-2008
>>>> Fabrice Bellard
>>>>
>>>> that's coming from Ubuntu 12.04 apt repos.
>>>
>>>
>>> maybe you should try qemu 1.4 there are a LOT of bugfixes. qemu-kvm does
>>> not exist anymore it was merged into qemu with 1.3 or 1.4.
>>
>>
>> This particular problem won't be solved by upgrading qemu. It's a ceph
>> bug. Disabling caching would work around the issue.
>>
>> Travis, could you get a log from qemu of this happening with:
>>
>> debug ms = 20
>> debug objectcacher = 20
>> debug rbd = 20
>> log file = /path/writeable/by/qemu
>
>
> If it doesn't reproduce with those settings, try changing debug ms to 1
> instead of 20.
>
>
>> From those we can tell whether the issue is on the client side at least,
>> and hopefully what's causing it.
>>
>> Thanks!
>> Josh
>
>
^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: Latest bobtail branch still crashing KVM VMs in bh_write_commit()
2013-03-20 20:39 ` Travis Rhoden
@ 2013-03-20 21:47 ` Travis Rhoden
2013-03-22 17:11 ` Josh Durgin
0 siblings, 1 reply; 16+ messages in thread
From: Travis Rhoden @ 2013-03-20 21:47 UTC (permalink / raw)
To: Josh Durgin; +Cc: Stefan Priebe, bcampbell, ceph-devel
Didn't take long to re-create with the detailed debugging (ms = 20).
I'm sending Josh a link to the gzip'd log off-list, I"m not sure if
the log will contain any CephX keys or anything like that.
On Wed, Mar 20, 2013 at 4:39 PM, Travis Rhoden <trhoden@gmail.com> wrote:
> Thanks Josh. I will respond when I have something useful!
>
> On Wed, Mar 20, 2013 at 4:32 PM, Josh Durgin <josh.durgin@inktank.com> wrote:
>> On 03/20/2013 01:19 PM, Josh Durgin wrote:
>>>
>>> On 03/20/2013 01:14 PM, Stefan Priebe wrote:
>>>>
>>>> Hi,
>>>>
>>>>> In this case, they are format 2. And they are from cloned snapshots.
>>>>> Exactly like the following:
>>>>>
>>>>> # rbd ls -l -p volumes
>>>>> NAME SIZE
>>>>> PARENT FMT PROT LOCK
>>>>> volume-099a6d74-05bd-4f00-a12e-009d60629aa8 5120M
>>>>> images/b8bdda90-664b-4906-86d6-dd33735441f2@snap 2
>>>>>
>>>>> I'm doing an OpenStack boot-from-volume setup.
>>>>
>>>>
>>>> OK i've never used cloned snapshots so maybe this is the reason.
>>>>
>>>>>> strange i've never seen this. Which qemu version?
>>>>>
>>>>>
>>>>> # qemu-x86_64 -version
>>>>> qemu-x86_64 version 1.0 (qemu-kvm-1.0), Copyright (c) 2003-2008
>>>>> Fabrice Bellard
>>>>>
>>>>> that's coming from Ubuntu 12.04 apt repos.
>>>>
>>>>
>>>> maybe you should try qemu 1.4 there are a LOT of bugfixes. qemu-kvm does
>>>> not exist anymore it was merged into qemu with 1.3 or 1.4.
>>>
>>>
>>> This particular problem won't be solved by upgrading qemu. It's a ceph
>>> bug. Disabling caching would work around the issue.
>>>
>>> Travis, could you get a log from qemu of this happening with:
>>>
>>> debug ms = 20
>>> debug objectcacher = 20
>>> debug rbd = 20
>>> log file = /path/writeable/by/qemu
>>
>>
>> If it doesn't reproduce with those settings, try changing debug ms to 1
>> instead of 20.
>>
>>
>>> From those we can tell whether the issue is on the client side at least,
>>> and hopefully what's causing it.
>>>
>>> Thanks!
>>> Josh
>>
>>
^ permalink raw reply [flat|nested] 16+ messages in thread
* RE: Latest bobtail branch still crashing KVM VMs in bh_write_commit()
2013-03-20 20:14 ` Stefan Priebe
2013-03-20 20:17 ` Travis Rhoden
2013-03-20 20:19 ` Josh Durgin
@ 2013-03-21 0:58 ` Jacky.He
2 siblings, 0 replies; 16+ messages in thread
From: Jacky.He @ 2013-03-21 0:58 UTC (permalink / raw)
To: 'Stefan Priebe', 'Travis Rhoden'
Cc: bcampbell, 'ceph-devel'
> -----Original Message-----
> From: ceph-devel-owner@vger.kernel.org
> [mailto:ceph-devel-owner@vger.kernel.org] On Behalf Of Stefan Priebe
> Sent: Thursday, March 21, 2013 4:14 AM
> To: Travis Rhoden
> Cc: bcampbell@axcess-financial.com; ceph-devel
> Subject: Re: Latest bobtail branch still crashing KVM VMs in
bh_write_commit()
>
> Hi,
>
> > In this case, they are format 2. And they are from cloned snapshots.
> > Exactly like the following:
> >
> > # rbd ls -l -p volumes
> > NAME SIZE
> > PARENT FMT PROT LOCK
> > volume-099a6d74-05bd-4f00-a12e-009d60629aa8 5120M
> > images/b8bdda90-664b-4906-86d6-dd33735441f2@snap 2
> >
> > I'm doing an OpenStack boot-from-volume setup.
>
> OK i've never used cloned snapshots so maybe this is the reason.
>
> >> strange i've never seen this. Which qemu version?
> >
> > # qemu-x86_64 -version
> > qemu-x86_64 version 1.0 (qemu-kvm-1.0), Copyright (c) 2003-2008
> > Fabrice Bellard
> >
> > that's coming from Ubuntu 12.04 apt repos.
>
> maybe you should try qemu 1.4 there are a LOT of bugfixes. qemu-kvm does
not
> exist anymore it was merged into qemu with 1.3 or 1.4.
>
[jacky_he] I also encountered the same issue, ceph version is 0.56.3.
I have tried Qemu 1.3.1 and Qemu 1.4.0, KVM VM with format 2 cloned image
crashs.
My host OS is ubuntu 12.04, guest OS are CentOS 6.3 and Windows XP/Windows 7
> Stefan
> --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of
> a message to majordomo@vger.kernel.org More majordomo info at
> http://vger.kernel.org/majordomo-info.html
>
>
> __________ Information from ESET NOD32 Antivirus, version of virus
signature
> database 8141 (20130320) __________
>
> The message was checked by ESET NOD32 Antivirus.
>
> http://www.eset.com
>
^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: Latest bobtail branch still crashing KVM VMs in bh_write_commit()
2013-03-20 21:47 ` Travis Rhoden
@ 2013-03-22 17:11 ` Josh Durgin
2013-03-22 18:30 ` Travis Rhoden
0 siblings, 1 reply; 16+ messages in thread
From: Josh Durgin @ 2013-03-22 17:11 UTC (permalink / raw)
To: Travis Rhoden; +Cc: Stefan Priebe, bcampbell, ceph-devel
I think I found the root cause based on your logs:
http://tracker.ceph.com/issues/4531
Josh
On 03/20/2013 02:47 PM, Travis Rhoden wrote:
> Didn't take long to re-create with the detailed debugging (ms = 20).
> I'm sending Josh a link to the gzip'd log off-list, I"m not sure if
> the log will contain any CephX keys or anything like that.
>
> On Wed, Mar 20, 2013 at 4:39 PM, Travis Rhoden <trhoden@gmail.com> wrote:
>> Thanks Josh. I will respond when I have something useful!
>>
>> On Wed, Mar 20, 2013 at 4:32 PM, Josh Durgin <josh.durgin@inktank.com> wrote:
>>> On 03/20/2013 01:19 PM, Josh Durgin wrote:
>>>>
>>>> On 03/20/2013 01:14 PM, Stefan Priebe wrote:
>>>>>
>>>>> Hi,
>>>>>
>>>>>> In this case, they are format 2. And they are from cloned snapshots.
>>>>>> Exactly like the following:
>>>>>>
>>>>>> # rbd ls -l -p volumes
>>>>>> NAME SIZE
>>>>>> PARENT FMT PROT LOCK
>>>>>> volume-099a6d74-05bd-4f00-a12e-009d60629aa8 5120M
>>>>>> images/b8bdda90-664b-4906-86d6-dd33735441f2@snap 2
>>>>>>
>>>>>> I'm doing an OpenStack boot-from-volume setup.
>>>>>
>>>>>
>>>>> OK i've never used cloned snapshots so maybe this is the reason.
>>>>>
>>>>>>> strange i've never seen this. Which qemu version?
>>>>>>
>>>>>>
>>>>>> # qemu-x86_64 -version
>>>>>> qemu-x86_64 version 1.0 (qemu-kvm-1.0), Copyright (c) 2003-2008
>>>>>> Fabrice Bellard
>>>>>>
>>>>>> that's coming from Ubuntu 12.04 apt repos.
>>>>>
>>>>>
>>>>> maybe you should try qemu 1.4 there are a LOT of bugfixes. qemu-kvm does
>>>>> not exist anymore it was merged into qemu with 1.3 or 1.4.
>>>>
>>>>
>>>> This particular problem won't be solved by upgrading qemu. It's a ceph
>>>> bug. Disabling caching would work around the issue.
>>>>
>>>> Travis, could you get a log from qemu of this happening with:
>>>>
>>>> debug ms = 20
>>>> debug objectcacher = 20
>>>> debug rbd = 20
>>>> log file = /path/writeable/by/qemu
>>>
>>>
>>> If it doesn't reproduce with those settings, try changing debug ms to 1
>>> instead of 20.
>>>
>>>
>>>> From those we can tell whether the issue is on the client side at least,
>>>> and hopefully what's causing it.
>>>>
>>>> Thanks!
>>>> Josh
>>>
>>>
> --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at http://vger.kernel.org/majordomo-info.html
>
^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: Latest bobtail branch still crashing KVM VMs in bh_write_commit()
2013-03-22 17:11 ` Josh Durgin
@ 2013-03-22 18:30 ` Travis Rhoden
2013-04-11 0:53 ` Josh Durgin
0 siblings, 1 reply; 16+ messages in thread
From: Travis Rhoden @ 2013-03-22 18:30 UTC (permalink / raw)
To: Josh Durgin; +Cc: Stefan Priebe, bcampbell, ceph-devel
That's awesome Josh. Thanks for looking into it. Good luck with the fix!
- Travis
On Fri, Mar 22, 2013 at 1:11 PM, Josh Durgin <josh.durgin@inktank.com> wrote:
> I think I found the root cause based on your logs:
>
> http://tracker.ceph.com/issues/4531
>
> Josh
>
>
> On 03/20/2013 02:47 PM, Travis Rhoden wrote:
>>
>> Didn't take long to re-create with the detailed debugging (ms = 20).
>> I'm sending Josh a link to the gzip'd log off-list, I"m not sure if
>> the log will contain any CephX keys or anything like that.
>>
>> On Wed, Mar 20, 2013 at 4:39 PM, Travis Rhoden <trhoden@gmail.com> wrote:
>>>
>>> Thanks Josh. I will respond when I have something useful!
>>>
>>> On Wed, Mar 20, 2013 at 4:32 PM, Josh Durgin <josh.durgin@inktank.com>
>>> wrote:
>>>>
>>>> On 03/20/2013 01:19 PM, Josh Durgin wrote:
>>>>>
>>>>>
>>>>> On 03/20/2013 01:14 PM, Stefan Priebe wrote:
>>>>>>
>>>>>>
>>>>>> Hi,
>>>>>>
>>>>>>> In this case, they are format 2. And they are from cloned snapshots.
>>>>>>> Exactly like the following:
>>>>>>>
>>>>>>> # rbd ls -l -p volumes
>>>>>>> NAME SIZE
>>>>>>> PARENT FMT PROT LOCK
>>>>>>> volume-099a6d74-05bd-4f00-a12e-009d60629aa8 5120M
>>>>>>> images/b8bdda90-664b-4906-86d6-dd33735441f2@snap 2
>>>>>>>
>>>>>>> I'm doing an OpenStack boot-from-volume setup.
>>>>>>
>>>>>>
>>>>>>
>>>>>> OK i've never used cloned snapshots so maybe this is the reason.
>>>>>>
>>>>>>>> strange i've never seen this. Which qemu version?
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>> # qemu-x86_64 -version
>>>>>>> qemu-x86_64 version 1.0 (qemu-kvm-1.0), Copyright (c) 2003-2008
>>>>>>> Fabrice Bellard
>>>>>>>
>>>>>>> that's coming from Ubuntu 12.04 apt repos.
>>>>>>
>>>>>>
>>>>>>
>>>>>> maybe you should try qemu 1.4 there are a LOT of bugfixes. qemu-kvm
>>>>>> does
>>>>>> not exist anymore it was merged into qemu with 1.3 or 1.4.
>>>>>
>>>>>
>>>>>
>>>>> This particular problem won't be solved by upgrading qemu. It's a ceph
>>>>> bug. Disabling caching would work around the issue.
>>>>>
>>>>> Travis, could you get a log from qemu of this happening with:
>>>>>
>>>>> debug ms = 20
>>>>> debug objectcacher = 20
>>>>> debug rbd = 20
>>>>> log file = /path/writeable/by/qemu
>>>>
>>>>
>>>>
>>>> If it doesn't reproduce with those settings, try changing debug ms to 1
>>>> instead of 20.
>>>>
>>>>
>>>>> From those we can tell whether the issue is on the client side at
>>>>> least,
>>>>> and hopefully what's causing it.
>>>>>
>>>>> Thanks!
>>>>> Josh
>>>>
>>>>
>>>>
>> --
>> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
>> the body of a message to majordomo@vger.kernel.org
>> More majordomo info at http://vger.kernel.org/majordomo-info.html
>>
>
^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: Latest bobtail branch still crashing KVM VMs in bh_write_commit()
2013-03-22 18:30 ` Travis Rhoden
@ 2013-04-11 0:53 ` Josh Durgin
2013-04-11 15:41 ` Travis Rhoden
0 siblings, 1 reply; 16+ messages in thread
From: Josh Durgin @ 2013-04-11 0:53 UTC (permalink / raw)
To: Travis Rhoden; +Cc: Stefan Priebe, bcampbell, ceph-devel
Finally got some time to fix this (hopefully).
Could you try librbd from the wip-objectcacher-handler-ordered branch?
Just librbd on the host running qemu needs to be updated.
Thanks,
Josh
On 03/22/2013 11:30 AM, Travis Rhoden wrote:
> That's awesome Josh. Thanks for looking into it. Good luck with the fix!
>
> - Travis
>
> On Fri, Mar 22, 2013 at 1:11 PM, Josh Durgin <josh.durgin@inktank.com> wrote:
>> I think I found the root cause based on your logs:
>>
>> http://tracker.ceph.com/issues/4531
>>
>> Josh
>>
>>
>> On 03/20/2013 02:47 PM, Travis Rhoden wrote:
>>>
>>> Didn't take long to re-create with the detailed debugging (ms = 20).
>>> I'm sending Josh a link to the gzip'd log off-list, I"m not sure if
>>> the log will contain any CephX keys or anything like that.
>>>
>>> On Wed, Mar 20, 2013 at 4:39 PM, Travis Rhoden <trhoden@gmail.com> wrote:
>>>>
>>>> Thanks Josh. I will respond when I have something useful!
>>>>
>>>> On Wed, Mar 20, 2013 at 4:32 PM, Josh Durgin <josh.durgin@inktank.com>
>>>> wrote:
>>>>>
>>>>> On 03/20/2013 01:19 PM, Josh Durgin wrote:
>>>>>>
>>>>>>
>>>>>> On 03/20/2013 01:14 PM, Stefan Priebe wrote:
>>>>>>>
>>>>>>>
>>>>>>> Hi,
>>>>>>>
>>>>>>>> In this case, they are format 2. And they are from cloned snapshots.
>>>>>>>> Exactly like the following:
>>>>>>>>
>>>>>>>> # rbd ls -l -p volumes
>>>>>>>> NAME SIZE
>>>>>>>> PARENT FMT PROT LOCK
>>>>>>>> volume-099a6d74-05bd-4f00-a12e-009d60629aa8 5120M
>>>>>>>> images/b8bdda90-664b-4906-86d6-dd33735441f2@snap 2
>>>>>>>>
>>>>>>>> I'm doing an OpenStack boot-from-volume setup.
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>> OK i've never used cloned snapshots so maybe this is the reason.
>>>>>>>
>>>>>>>>> strange i've never seen this. Which qemu version?
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>> # qemu-x86_64 -version
>>>>>>>> qemu-x86_64 version 1.0 (qemu-kvm-1.0), Copyright (c) 2003-2008
>>>>>>>> Fabrice Bellard
>>>>>>>>
>>>>>>>> that's coming from Ubuntu 12.04 apt repos.
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>> maybe you should try qemu 1.4 there are a LOT of bugfixes. qemu-kvm
>>>>>>> does
>>>>>>> not exist anymore it was merged into qemu with 1.3 or 1.4.
>>>>>>
>>>>>>
>>>>>>
>>>>>> This particular problem won't be solved by upgrading qemu. It's a ceph
>>>>>> bug. Disabling caching would work around the issue.
>>>>>>
>>>>>> Travis, could you get a log from qemu of this happening with:
>>>>>>
>>>>>> debug ms = 20
>>>>>> debug objectcacher = 20
>>>>>> debug rbd = 20
>>>>>> log file = /path/writeable/by/qemu
>>>>>
>>>>>
>>>>>
>>>>> If it doesn't reproduce with those settings, try changing debug ms to 1
>>>>> instead of 20.
>>>>>
>>>>>
>>>>>> From those we can tell whether the issue is on the client side at
>>>>>> least,
>>>>>> and hopefully what's causing it.
>>>>>>
>>>>>> Thanks!
>>>>>> Josh
>>>>>
>>>>>
>>>>>
^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: Latest bobtail branch still crashing KVM VMs in bh_write_commit()
2013-04-11 0:53 ` Josh Durgin
@ 2013-04-11 15:41 ` Travis Rhoden
2013-04-11 20:15 ` Josh Durgin
0 siblings, 1 reply; 16+ messages in thread
From: Travis Rhoden @ 2013-04-11 15:41 UTC (permalink / raw)
To: Josh Durgin; +Cc: Stefan Priebe, bcampbell, ceph-devel
Hi Josh,
Thanks for the heads up. I've been testing the fix all morning, and
haven't run into a single crash yet! I turned on the RBD logging
during a couple of VM startups just to look and make sure I saw a
bunch of objectcacher traffic (to know I was really doing caching).
I'll keep the new version installed for now and see how things play
out through the day. So far things are looking very promising.
A couple of obligatory questions:
Any idea when the fixes will be backported to bobtail?
I"m running the latest bobtail packages everywhere else. I now have
0.60+ for librbd, librados, and ceph-common on my host running qemu
(all that host does is run virtual machiens with librbd). Do you know
of anything that would make this mixed environment a cause for
concern? Once the backport is done, I will revert these packages to
the bobtail version.
Thanks so much for the good work.
- Travis
On Wed, Apr 10, 2013 at 8:53 PM, Josh Durgin <josh.durgin@inktank.com> wrote:
> Finally got some time to fix this (hopefully).
> Could you try librbd from the wip-objectcacher-handler-ordered branch?
> Just librbd on the host running qemu needs to be updated.
>
> Thanks,
> Josh
>
>
> On 03/22/2013 11:30 AM, Travis Rhoden wrote:
>>
>> That's awesome Josh. Thanks for looking into it. Good luck with the fix!
>>
>> - Travis
>>
>> On Fri, Mar 22, 2013 at 1:11 PM, Josh Durgin <josh.durgin@inktank.com>
>> wrote:
>>>
>>> I think I found the root cause based on your logs:
>>>
>>> http://tracker.ceph.com/issues/4531
>>>
>>> Josh
>>>
>>>
>>> On 03/20/2013 02:47 PM, Travis Rhoden wrote:
>>>>
>>>>
>>>> Didn't take long to re-create with the detailed debugging (ms = 20).
>>>> I'm sending Josh a link to the gzip'd log off-list, I"m not sure if
>>>> the log will contain any CephX keys or anything like that.
>>>>
>>>> On Wed, Mar 20, 2013 at 4:39 PM, Travis Rhoden <trhoden@gmail.com>
>>>> wrote:
>>>>>
>>>>>
>>>>> Thanks Josh. I will respond when I have something useful!
>>>>>
>>>>> On Wed, Mar 20, 2013 at 4:32 PM, Josh Durgin <josh.durgin@inktank.com>
>>>>> wrote:
>>>>>>
>>>>>>
>>>>>> On 03/20/2013 01:19 PM, Josh Durgin wrote:
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>> On 03/20/2013 01:14 PM, Stefan Priebe wrote:
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>> Hi,
>>>>>>>>
>>>>>>>>> In this case, they are format 2. And they are from cloned
>>>>>>>>> snapshots.
>>>>>>>>> Exactly like the following:
>>>>>>>>>
>>>>>>>>> # rbd ls -l -p volumes
>>>>>>>>> NAME SIZE
>>>>>>>>> PARENT FMT PROT LOCK
>>>>>>>>> volume-099a6d74-05bd-4f00-a12e-009d60629aa8 5120M
>>>>>>>>> images/b8bdda90-664b-4906-86d6-dd33735441f2@snap 2
>>>>>>>>>
>>>>>>>>> I'm doing an OpenStack boot-from-volume setup.
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>> OK i've never used cloned snapshots so maybe this is the reason.
>>>>>>>>
>>>>>>>>>> strange i've never seen this. Which qemu version?
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> # qemu-x86_64 -version
>>>>>>>>> qemu-x86_64 version 1.0 (qemu-kvm-1.0), Copyright (c) 2003-2008
>>>>>>>>> Fabrice Bellard
>>>>>>>>>
>>>>>>>>> that's coming from Ubuntu 12.04 apt repos.
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>> maybe you should try qemu 1.4 there are a LOT of bugfixes. qemu-kvm
>>>>>>>> does
>>>>>>>> not exist anymore it was merged into qemu with 1.3 or 1.4.
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>> This particular problem won't be solved by upgrading qemu. It's a
>>>>>>> ceph
>>>>>>> bug. Disabling caching would work around the issue.
>>>>>>>
>>>>>>> Travis, could you get a log from qemu of this happening with:
>>>>>>>
>>>>>>> debug ms = 20
>>>>>>> debug objectcacher = 20
>>>>>>> debug rbd = 20
>>>>>>> log file = /path/writeable/by/qemu
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>> If it doesn't reproduce with those settings, try changing debug ms to
>>>>>> 1
>>>>>> instead of 20.
>>>>>>
>>>>>>
>>>>>>> From those we can tell whether the issue is on the client side at
>>>>>>> least,
>>>>>>> and hopefully what's causing it.
>>>>>>>
>>>>>>> Thanks!
>>>>>>> Josh
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>
^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: Latest bobtail branch still crashing KVM VMs in bh_write_commit()
2013-04-11 15:41 ` Travis Rhoden
@ 2013-04-11 20:15 ` Josh Durgin
0 siblings, 0 replies; 16+ messages in thread
From: Josh Durgin @ 2013-04-11 20:15 UTC (permalink / raw)
To: Travis Rhoden; +Cc: Stefan Priebe, bcampbell, ceph-devel
On 04/11/2013 08:41 AM, Travis Rhoden wrote:
> Hi Josh,
>
> Thanks for the heads up. I've been testing the fix all morning, and
> haven't run into a single crash yet! I turned on the RBD logging
> during a couple of VM startups just to look and make sure I saw a
> bunch of objectcacher traffic (to know I was really doing caching).
>
> I'll keep the new version installed for now and see how things play
> out through the day. So far things are looking very promising.
Great!
> A couple of obligatory questions:
>
> Any idea when the fixes will be backported to bobtail?
Hopefully tomorrow. There are a couple other bugs I'd like to fix, and
then I'll backport several recent fixes at once so I can test the
backports all together.
> I"m running the latest bobtail packages everywhere else. I now have
> 0.60+ for librbd, librados, and ceph-common on my host running qemu
> (all that host does is run virtual machiens with librbd). Do you know
> of anything that would make this mixed environment a cause for
> concern? Once the backport is done, I will revert these packages to
> the bobtail version.
I'm not aware of anything that would cause problems with upgraded
client-side packages.
> Thanks so much for the good work.
Thanks for helping track down these bugs!
Josh
> - Travis
>
> On Wed, Apr 10, 2013 at 8:53 PM, Josh Durgin <josh.durgin@inktank.com> wrote:
>> Finally got some time to fix this (hopefully).
>> Could you try librbd from the wip-objectcacher-handler-ordered branch?
>> Just librbd on the host running qemu needs to be updated.
>>
>> Thanks,
>> Josh
>>
>>
>> On 03/22/2013 11:30 AM, Travis Rhoden wrote:
>>>
>>> That's awesome Josh. Thanks for looking into it. Good luck with the fix!
>>>
>>> - Travis
>>>
>>> On Fri, Mar 22, 2013 at 1:11 PM, Josh Durgin <josh.durgin@inktank.com>
>>> wrote:
>>>>
>>>> I think I found the root cause based on your logs:
>>>>
>>>> http://tracker.ceph.com/issues/4531
>>>>
>>>> Josh
>>>>
>>>>
>>>> On 03/20/2013 02:47 PM, Travis Rhoden wrote:
>>>>>
>>>>>
>>>>> Didn't take long to re-create with the detailed debugging (ms = 20).
>>>>> I'm sending Josh a link to the gzip'd log off-list, I"m not sure if
>>>>> the log will contain any CephX keys or anything like that.
>>>>>
>>>>> On Wed, Mar 20, 2013 at 4:39 PM, Travis Rhoden <trhoden@gmail.com>
>>>>> wrote:
>>>>>>
>>>>>>
>>>>>> Thanks Josh. I will respond when I have something useful!
>>>>>>
>>>>>> On Wed, Mar 20, 2013 at 4:32 PM, Josh Durgin <josh.durgin@inktank.com>
>>>>>> wrote:
>>>>>>>
>>>>>>>
>>>>>>> On 03/20/2013 01:19 PM, Josh Durgin wrote:
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>> On 03/20/2013 01:14 PM, Stefan Priebe wrote:
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> Hi,
>>>>>>>>>
>>>>>>>>>> In this case, they are format 2. And they are from cloned
>>>>>>>>>> snapshots.
>>>>>>>>>> Exactly like the following:
>>>>>>>>>>
>>>>>>>>>> # rbd ls -l -p volumes
>>>>>>>>>> NAME SIZE
>>>>>>>>>> PARENT FMT PROT LOCK
>>>>>>>>>> volume-099a6d74-05bd-4f00-a12e-009d60629aa8 5120M
>>>>>>>>>> images/b8bdda90-664b-4906-86d6-dd33735441f2@snap 2
>>>>>>>>>>
>>>>>>>>>> I'm doing an OpenStack boot-from-volume setup.
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> OK i've never used cloned snapshots so maybe this is the reason.
>>>>>>>>>
>>>>>>>>>>> strange i've never seen this. Which qemu version?
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>> # qemu-x86_64 -version
>>>>>>>>>> qemu-x86_64 version 1.0 (qemu-kvm-1.0), Copyright (c) 2003-2008
>>>>>>>>>> Fabrice Bellard
>>>>>>>>>>
>>>>>>>>>> that's coming from Ubuntu 12.04 apt repos.
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> maybe you should try qemu 1.4 there are a LOT of bugfixes. qemu-kvm
>>>>>>>>> does
>>>>>>>>> not exist anymore it was merged into qemu with 1.3 or 1.4.
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>> This particular problem won't be solved by upgrading qemu. It's a
>>>>>>>> ceph
>>>>>>>> bug. Disabling caching would work around the issue.
>>>>>>>>
>>>>>>>> Travis, could you get a log from qemu of this happening with:
>>>>>>>>
>>>>>>>> debug ms = 20
>>>>>>>> debug objectcacher = 20
>>>>>>>> debug rbd = 20
>>>>>>>> log file = /path/writeable/by/qemu
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>> If it doesn't reproduce with those settings, try changing debug ms to
>>>>>>> 1
>>>>>>> instead of 20.
>>>>>>>
>>>>>>>
>>>>>>>> From those we can tell whether the issue is on the client side at
>>>>>>>> least,
>>>>>>>> and hopefully what's causing it.
>>>>>>>>
>>>>>>>> Thanks!
>>>>>>>> Josh
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>
^ permalink raw reply [flat|nested] 16+ messages in thread
end of thread, other threads:[~2013-04-11 20:16 UTC | newest]
Thread overview: 16+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2013-03-20 19:49 Latest bobtail branch still crashing KVM VMs in bh_write_commit() Travis Rhoden
2013-03-20 19:51 ` Campbell, Bill
2013-03-20 19:53 ` Stefan Priebe
2013-03-20 20:04 ` Travis Rhoden
2013-03-20 20:14 ` Stefan Priebe
2013-03-20 20:17 ` Travis Rhoden
2013-03-20 20:19 ` Josh Durgin
2013-03-20 20:32 ` Josh Durgin
2013-03-20 20:39 ` Travis Rhoden
2013-03-20 21:47 ` Travis Rhoden
2013-03-22 17:11 ` Josh Durgin
2013-03-22 18:30 ` Travis Rhoden
2013-04-11 0:53 ` Josh Durgin
2013-04-11 15:41 ` Travis Rhoden
2013-04-11 20:15 ` Josh Durgin
2013-03-21 0:58 ` Jacky.He
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.