All of lore.kernel.org
 help / color / mirror / Atom feed
* Alibaba's work on recovery process
@ 2017-05-05 12:48 Piotr Dałek
  2017-05-05 18:25 ` LIU, Fei
  0 siblings, 1 reply; 7+ messages in thread
From: Piotr Dałek @ 2017-05-05 12:48 UTC (permalink / raw)
  To: ceph-devel

Hello,

On yesterday's perf meeting, guys from Alibaba presented their progress on 
improving recovery process. Has anybody received the slides form their talk 
- and can share them?

Thanks in advance,

-- 
Piotr Dalek
piotr.dalek@corp.ovh.com
https://www.ovh.com/us/

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: Alibaba's work on recovery process
  2017-05-05 12:48 Alibaba's work on recovery process Piotr Dałek
@ 2017-05-05 18:25 ` LIU, Fei
  2017-05-05 18:30   ` Huang Zhiteng
  0 siblings, 1 reply; 7+ messages in thread
From: LIU, Fei @ 2017-05-05 18:25 UTC (permalink / raw)
  To: Piotr Dałek, ceph-devel

Hi Piotr,

   Will send to you soon.

   Regards,
   James

On 5/5/17, 5:48 AM, "Piotr Dałek" <ceph-devel-owner@vger.kernel.org on behalf of piotr.dalek@corp.ovh.com> wrote:

    Hello,
    
    On yesterday's perf meeting, guys from Alibaba presented their progress on 
    improving recovery process. Has anybody received the slides form their talk 
    - and can share them?
    
    Thanks in advance,
    
    -- 
    Piotr Dalek
    piotr.dalek@corp.ovh.com
    https://www.ovh.com/us/
    --
    To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
    the body of a message to majordomo@vger.kernel.org
    More majordomo info at  http://vger.kernel.org/majordomo-info.html
    



^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: Alibaba's work on recovery process
  2017-05-05 18:25 ` LIU, Fei
@ 2017-05-05 18:30   ` Huang Zhiteng
       [not found]     ` <F28FEF91-F57A-48FB-B673-817758BCAF58@alibaba-inc.com>
  0 siblings, 1 reply; 7+ messages in thread
From: Huang Zhiteng @ 2017-05-05 18:30 UTC (permalink / raw)
  To: LIU, Fei; +Cc: Piotr Dałek, ceph-devel

Hi James,

Could you share the slidedeck to the list instead of individuals?  Thanks.

On Sat, May 6, 2017 at 2:25 AM, LIU, Fei <james.liu@alibaba-inc.com> wrote:
> Hi Piotr,
>
>    Will send to you soon.
>
>    Regards,
>    James
>
> On 5/5/17, 5:48 AM, "Piotr Dałek" <ceph-devel-owner@vger.kernel.org on behalf of piotr.dalek@corp.ovh.com> wrote:
>
>     Hello,
>
>     On yesterday's perf meeting, guys from Alibaba presented their progress on
>     improving recovery process. Has anybody received the slides form their talk
>     - and can share them?
>
>     Thanks in advance,
>
>     --
>     Piotr Dalek
>     piotr.dalek@corp.ovh.com
>     https://www.ovh.com/us/
>     --
>     To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
>     the body of a message to majordomo@vger.kernel.org
>     More majordomo info at  http://vger.kernel.org/majordomo-info.html
>
>
>
> --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html



-- 
Regards
Huang Zhiteng

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: Alibaba's work on recovery process
       [not found]     ` <F28FEF91-F57A-48FB-B673-817758BCAF58@alibaba-inc.com>
@ 2017-05-09  8:18       ` Piotr Dałek
  2017-05-11 16:34         ` LIU, Fei
  0 siblings, 1 reply; 7+ messages in thread
From: Piotr Dałek @ 2017-05-09  8:18 UTC (permalink / raw)
  Cc: ceph-devel

On 05/05/2017 08:36 PM, LIU, Fei wrote:
> Hi All,
>    Here is the  slide that  we presented in this week Ceph performance meeting.
>
>    Regards,
>    James

I think it didn't make it to the list itself. Can you upload the .pdf 
somewhere and post the link to it so others can download it?

-- 
Piotr Dalek
piotr.dalek@corp.ovh.com
https://www.ovh.com/us/

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: Alibaba's work on recovery process
  2017-05-09  8:18       ` Piotr Dałek
@ 2017-05-11 16:34         ` LIU, Fei
  2017-05-11 17:44           ` Sage Weil
  0 siblings, 1 reply; 7+ messages in thread
From: LIU, Fei @ 2017-05-11 16:34 UTC (permalink / raw)
  To: Piotr Dałek
  Cc: ceph-devel, Ming Lin, 徐延江, Mark Nelson, Sage Weil

Hi Piotr,
   Here you go. We just uploaded the slide into slideshare for your reference. Please feel free to let us know if you have any comments.

   https://www.slideshare.net/jupiturliu/ceph-recovery-improvement-v02

   Regards,
   James

On 5/9/17, 1:18 AM, "Piotr Dałek" <ceph-devel-owner@vger.kernel.org on behalf of piotr.dalek@corp.ovh.com> wrote:

    On 05/05/2017 08:36 PM, LIU, Fei wrote:
    > Hi All,
    >    Here is the  slide that  we presented in this week Ceph performance meeting.
    >
    >    Regards,
    >    James
    
    I think it didn't make it to the list itself. Can you upload the .pdf 
    somewhere and post the link to it so others can download it?
    
    -- 
    Piotr Dalek
    piotr.dalek@corp.ovh.com
    https://www.ovh.com/us/
    --
    To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
    the body of a message to majordomo@vger.kernel.org
    More majordomo info at  http://vger.kernel.org/majordomo-info.html
    





^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: Alibaba's work on recovery process
  2017-05-11 16:34         ` LIU, Fei
@ 2017-05-11 17:44           ` Sage Weil
  2017-05-11 18:14             ` LIU, Fei
  0 siblings, 1 reply; 7+ messages in thread
From: Sage Weil @ 2017-05-11 17:44 UTC (permalink / raw)
  To: LIU, Fei
  Cc: Piotr Dałek, ceph-devel, Ming Lin, 徐延江,
	Mark Nelson

On Fri, 12 May 2017, LIU, Fei wrote:
> Hi Piotr,
>    Here you go. We just uploaded the slide into slideshare for your reference. Please feel free to let us know if you have any comments.
> 
>    https://www.slideshare.net/jupiturliu/ceph-recovery-improvement-v02
> 
>    Regards,
>    James

Hi James-

This work is very promising!  Putting the write info in the log is 
probably one of the easier pieces to tackle (and Josh is already looking a 
variation of the async recovery).  If you have the time I'd love to 
resurrect that PR and get it into a mergeable state.  Is there an open PR 
with the current code?

sage

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: Alibaba's work on recovery process
  2017-05-11 17:44           ` Sage Weil
@ 2017-05-11 18:14             ` LIU, Fei
  0 siblings, 0 replies; 7+ messages in thread
From: LIU, Fei @ 2017-05-11 18:14 UTC (permalink / raw)
  To: Sage Weil
  Cc: Piotr Dałek, ceph-devel, Ming Lin, 徐延江,
	Mark Nelson

Hi Sage,
  Great, we will clean the code with several of our bug fixes and get back to you for your help soon.
So far ,It runs well in our testing environment, we caught  several bugs regarding to xattr/omap/data dismatch. 
We pretty much fixed all of them and planning to add several new testing cases. We are still running tests aggressively in bigger cluster to see whether there is any existing issues.
If passed all of our aggressive testing bench ,we will transfer them into our production cluster for more observations.

By the way, we have built Alibaba in house Teuthology with more testing cases including hardware injection/networking failure injection/network switch error injection/Server failure injection etc. 
Hopefully , it can pass all of the test in a week and cover all of the corners.

  Regards,
  James

On 5/11/17, 10:44 AM, "Sage Weil" <sage@newdream.net> wrote:

    On Fri, 12 May 2017, LIU, Fei wrote:
    > Hi Piotr,
    >    Here you go. We just uploaded the slide into slideshare for your reference. Please feel free to let us know if you have any comments.
    > 
    >    https://www.slideshare.net/jupiturliu/ceph-recovery-improvement-v02
    > 
    >    Regards,
    >    James
    
    Hi James-
    
    This work is very promising!  Putting the write info in the log is 
    probably one of the easier pieces to tackle (and Josh is already looking a 
    variation of the async recovery).  If you have the time I'd love to 
    resurrect that PR and get it into a mergeable state.  Is there an open PR 
    with the current code?
    
    sage
    



^ permalink raw reply	[flat|nested] 7+ messages in thread

end of thread, other threads:[~2017-05-11 18:14 UTC | newest]

Thread overview: 7+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2017-05-05 12:48 Alibaba's work on recovery process Piotr Dałek
2017-05-05 18:25 ` LIU, Fei
2017-05-05 18:30   ` Huang Zhiteng
     [not found]     ` <F28FEF91-F57A-48FB-B673-817758BCAF58@alibaba-inc.com>
2017-05-09  8:18       ` Piotr Dałek
2017-05-11 16:34         ` LIU, Fei
2017-05-11 17:44           ` Sage Weil
2017-05-11 18:14             ` LIU, Fei

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.