* Alibaba's work on recovery process
@ 2017-05-05 12:48 Piotr Dałek
2017-05-05 18:25 ` LIU, Fei
0 siblings, 1 reply; 7+ messages in thread
From: Piotr Dałek @ 2017-05-05 12:48 UTC
To: ceph-devel
Hello,
At yesterday's perf meeting, guys from Alibaba presented their progress on
improving the recovery process. Has anybody received the slides from their talk,
and can they share them?
Thanks in advance,
--
Piotr Dalek
piotr.dalek@corp.ovh.com
https://www.ovh.com/us/
* Re: Alibaba's work on recovery process
2017-05-05 12:48 Alibaba's work on recovery process Piotr Dałek
@ 2017-05-05 18:25 ` LIU, Fei
2017-05-05 18:30 ` Huang Zhiteng
0 siblings, 1 reply; 7+ messages in thread
From: LIU, Fei @ 2017-05-05 18:25 UTC
To: Piotr Dałek, ceph-devel
Hi Piotr,
I will send them to you soon.
Regards,
James
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
* Re: Alibaba's work on recovery process
2017-05-05 18:25 ` LIU, Fei
@ 2017-05-05 18:30 ` Huang Zhiteng
[not found] ` <F28FEF91-F57A-48FB-B673-817758BCAF58@alibaba-inc.com>
0 siblings, 1 reply; 7+ messages in thread
From: Huang Zhiteng @ 2017-05-05 18:30 UTC
To: LIU, Fei; +Cc: Piotr Dałek, ceph-devel
Hi James,
Could you share the slide deck with the list instead of with individuals? Thanks.
--
Regards
Huang Zhiteng
* Re: Alibaba's work on recovery process
[not found] ` <F28FEF91-F57A-48FB-B673-817758BCAF58@alibaba-inc.com>
@ 2017-05-09 8:18 ` Piotr Dałek
2017-05-11 16:34 ` LIU, Fei
0 siblings, 1 reply; 7+ messages in thread
From: Piotr Dałek @ 2017-05-09 8:18 UTC
Cc: ceph-devel
On 05/05/2017 08:36 PM, LIU, Fei wrote:
> Hi All,
> Here are the slides that we presented at this week's Ceph performance meeting.
>
> Regards,
> James
I think it didn't make it to the list itself. Can you upload the .pdf
somewhere and post the link to it so others can download it?
--
Piotr Dalek
piotr.dalek@corp.ovh.com
https://www.ovh.com/us/
* Re: Alibaba's work on recovery process
2017-05-09 8:18 ` Piotr Dałek
@ 2017-05-11 16:34 ` LIU, Fei
2017-05-11 17:44 ` Sage Weil
0 siblings, 1 reply; 7+ messages in thread
From: LIU, Fei @ 2017-05-11 16:34 UTC
To: Piotr Dałek
Cc: ceph-devel, Ming Lin, 徐延江, Mark Nelson, Sage Weil
Hi Piotr,
Here you go. We just uploaded the slides to SlideShare for your reference. Please feel free to let us know if you have any comments.
https://www.slideshare.net/jupiturliu/ceph-recovery-improvement-v02
Regards,
James
* Re: Alibaba's work on recovery process
2017-05-11 16:34 ` LIU, Fei
@ 2017-05-11 17:44 ` Sage Weil
2017-05-11 18:14 ` LIU, Fei
0 siblings, 1 reply; 7+ messages in thread
From: Sage Weil @ 2017-05-11 17:44 UTC
To: LIU, Fei
Cc: Piotr Dałek, ceph-devel, Ming Lin, 徐延江,
Mark Nelson
Hi James-
This work is very promising! Putting the write info in the log is
probably one of the easier pieces to tackle (and Josh is already looking at a
variation of the async recovery). If you have the time I'd love to
resurrect that PR and get it into a mergeable state. Is there an open PR
with the current code?
sage
* Re: Alibaba's work on recovery process
2017-05-11 17:44 ` Sage Weil
@ 2017-05-11 18:14 ` LIU, Fei
0 siblings, 0 replies; 7+ messages in thread
From: LIU, Fei @ 2017-05-11 18:14 UTC
To: Sage Weil
Cc: Piotr Dałek, ceph-devel, Ming Lin, 徐延江,
Mark Nelson
Hi Sage,
Great, we will clean up the code, including several of our bug fixes, and get back to you for your help soon.
So far, it runs well in our testing environment. We caught several bugs related to xattr/omap/data mismatches.
We have fixed pretty much all of them and are planning to add several new test cases. We are still running tests aggressively on a bigger cluster to see whether any issues remain.
If it passes all of our aggressive testing, we will move it into our production cluster for more observation.
By the way, we have built an in-house Alibaba Teuthology with more test cases, including hardware injection, networking failure injection, network switch error injection, server failure injection, etc.
Hopefully, it can pass all of the tests within a week and cover all of the corner cases.
Regards,
James