Snapshots of consistency groups

* Snapshots of consistency groups
@ 2016-08-15 23:46 Victor Denisov
  2016-08-16 13:26 ` Jason Dillaman
  0 siblings, 1 reply; 43+ messages in thread
From: Victor Denisov @ 2016-08-15 23:46 UTC (permalink / raw)
  To: ceph-devel, Jason Dillaman, Josh Durgin, Mykola Golub

Gentlemen,

I'm writing to you to ask for your opinion regarding quiescing writes.

Here is the situation. In order to take snapshots of all images in a
consistency group,
we first need to quiesce all the image writers in the consistency group.
Let me call
group client - a client which requests a consistency group to take a snapshot.
Image client - the client that writes to an image.
Let's say group client starts sending notify_quiesce to all image
clients that write to the images in the group. After quiescing half of
the image clients the group client can die.

It presents us with a dilemma - what should we do with those quiesced
image clients.

Option 1 - is to wait till someone manually runs recover for that
consistency group.
We can show warning next to those unfinished groups when user runs
group list command.
There will be a command like group recover, which allows users to
rollback unsuccessful snapshots
or continue them using create snapshot command.

Option 2 - is to establish some heart beats between group client and
image client. If group client fails to heart beat then image client
unquiesces itself and continues normal operation.

Option 3 - is to have a timeout for each image client. If group client
fails to make a group snapshot within this timeout then we resume our
normal operation informing group client of the fact.

Which of these options do you prefer? Probably there are other options
that I miss.

Thanks,
Victor.

^ permalink raw reply	[flat|nested] 43+ messages in thread