All of lore.kernel.org
 help / color / mirror / Atom feed
* [Bug 1881231] [NEW] colo: Can not recover colo after svm failover twice
@ 2020-05-29  4:06 ye.zou
  2020-05-29  7:05 ` ye.zou
                   ` (5 more replies)
  0 siblings, 6 replies; 8+ messages in thread
From: ye.zou @ 2020-05-29  4:06 UTC (permalink / raw)
  To: qemu-devel

Public bug reported:

Hi Expert,
x-blockdev-change met some error, during testing colo

Host os:
CentOS Linux release 7.6.1810 (Core)

Reproduce steps:
1. create colo vm following https://github.com/qemu/qemu/blob/master/docs/COLO-FT.txt
2. kill secondary vm and remove the nbd child from the quorum to wait for recover
  type those commands on primary vm console:
  { 'execute': 'x-blockdev-change', 'arguments': {'parent': 'colo-disk0', 'child': 'children.1'}}
  { 'execute': 'human-monitor-command','arguments': {'command-line': 'drive_del replication0'}}
  { 'execute': 'x-colo-lost-heartbeat'}
3. recover colo
4. kill secondary vm again after recover colo and type same commands as step 2:
  { 'execute': 'x-blockdev-change', 'arguments': {'parent': 'colo-disk0', 'child': 'children.1'}}
  { 'execute': 'human-monitor-command','arguments': {'command-line': 'drive_del replication0'}}
  { 'execute': 'x-colo-lost-heartbeat'}
  but the first command got error
  { 'execute': 'x-blockdev-change', 'arguments': {'parent': 'colo-disk0', 'child': 'children.1'}}
{"error": {"class": "GenericError", "desc": "Node 'colo-disk0' does not have child 'children.1'"}}

according to https://www.qemu.org/docs/master/qemu-qmp-ref.html
Command: x-blockdev-change
Dynamically reconfigure the block driver state graph. It can be used to add, remove, insert or replace a graph node. Currently only the Quorum driver implements this feature to add or remove its child. This is useful to fix a broken quorum child.

It seems x-blockdev-change not worked as expected.

Thanks.

** Affects: qemu
     Importance: Undecided
         Status: New

-- 
You received this bug notification because you are a member of qemu-
devel-ml, which is subscribed to QEMU.
https://bugs.launchpad.net/bugs/1881231

Title:
  colo: Can not  recover colo after svm failover twice

Status in QEMU:
  New

Bug description:
  Hi Expert,
  x-blockdev-change met some error, during testing colo

  Host os:
  CentOS Linux release 7.6.1810 (Core)

  Reproduce steps:
  1. create colo vm following https://github.com/qemu/qemu/blob/master/docs/COLO-FT.txt
  2. kill secondary vm and remove the nbd child from the quorum to wait for recover
    type those commands on primary vm console:
    { 'execute': 'x-blockdev-change', 'arguments': {'parent': 'colo-disk0', 'child': 'children.1'}}
    { 'execute': 'human-monitor-command','arguments': {'command-line': 'drive_del replication0'}}
    { 'execute': 'x-colo-lost-heartbeat'}
  3. recover colo
  4. kill secondary vm again after recover colo and type same commands as step 2:
    { 'execute': 'x-blockdev-change', 'arguments': {'parent': 'colo-disk0', 'child': 'children.1'}}
    { 'execute': 'human-monitor-command','arguments': {'command-line': 'drive_del replication0'}}
    { 'execute': 'x-colo-lost-heartbeat'}
    but the first command got error
    { 'execute': 'x-blockdev-change', 'arguments': {'parent': 'colo-disk0', 'child': 'children.1'}}
  {"error": {"class": "GenericError", "desc": "Node 'colo-disk0' does not have child 'children.1'"}}

  according to https://www.qemu.org/docs/master/qemu-qmp-ref.html
  Command: x-blockdev-change
  Dynamically reconfigure the block driver state graph. It can be used to add, remove, insert or replace a graph node. Currently only the Quorum driver implements this feature to add or remove its child. This is useful to fix a broken quorum child.

  It seems x-blockdev-change not worked as expected.

  Thanks.

To manage notifications about this bug go to:
https://bugs.launchpad.net/qemu/+bug/1881231/+subscriptions


^ permalink raw reply	[flat|nested] 8+ messages in thread

* [Bug 1881231] [NEW] colo: Can not recover colo after svm failover twice
  2020-05-29  4:06 [Bug 1881231] [NEW] colo: Can not recover colo after svm failover twice ye.zou
@ 2020-05-29  7:05 ` ye.zou
  2020-05-29  9:47 ` [Bug 1881231] " ye.zou
                   ` (4 subsequent siblings)
  5 siblings, 0 replies; 8+ messages in thread
From: ye.zou @ 2020-05-29  7:05 UTC (permalink / raw)
  To: qemu-devel

-- 
You received this bug notification because you are a member of qemu-
devel-ml, which is subscribed to QEMU.
https://bugs.launchpad.net/bugs/1881231

Title:
  colo: Can not  recover colo after svm failover twice

Status in QEMU:
  New

Bug description:
  Hi Expert,
  x-blockdev-change met some error, during testing colo

  Host os:
  CentOS Linux release 7.6.1810 (Core)

  Reproduce steps:
  1. create colo vm following https://github.com/qemu/qemu/blob/master/docs/COLO-FT.txt
  2. kill secondary vm and remove the nbd child from the quorum to wait for recover
    type those commands on primary vm console:
    { 'execute': 'x-blockdev-change', 'arguments': {'parent': 'colo-disk0', 'child': 'children.1'}}
    { 'execute': 'human-monitor-command','arguments': {'command-line': 'drive_del replication0'}}
    { 'execute': 'x-colo-lost-heartbeat'}
  3. recover colo
  4. kill secondary vm again after recover colo and type same commands as step 2:
    { 'execute': 'x-blockdev-change', 'arguments': {'parent': 'colo-disk0', 'child': 'children.1'}}
    { 'execute': 'human-monitor-command','arguments': {'command-line': 'drive_del replication0'}}
    { 'execute': 'x-colo-lost-heartbeat'}
    but the first command got error
    { 'execute': 'x-blockdev-change', 'arguments': {'parent': 'colo-disk0', 'child': 'children.1'}}
  {"error": {"class": "GenericError", "desc": "Node 'colo-disk0' does not have child 'children.1'"}}

  according to https://www.qemu.org/docs/master/qemu-qmp-ref.html
  Command: x-blockdev-change
  Dynamically reconfigure the block driver state graph. It can be used to add, remove, insert or replace a graph node. Currently only the Quorum driver implements this feature to add or remove its child. This is useful to fix a broken quorum child.

  It seems x-blockdev-change not worked as expected.

  Thanks.

To manage notifications about this bug go to:
https://bugs.launchpad.net/qemu/+bug/1881231/+subscriptions


^ permalink raw reply	[flat|nested] 8+ messages in thread

* [Bug 1881231] Re: colo: Can not recover colo after svm failover twice
  2020-05-29  4:06 [Bug 1881231] [NEW] colo: Can not recover colo after svm failover twice ye.zou
  2020-05-29  7:05 ` ye.zou
@ 2020-05-29  9:47 ` ye.zou
  2020-05-29 15:42   ` Zhang, Chen
  2020-05-29  9:49 ` ye.zou
                   ` (3 subsequent siblings)
  5 siblings, 1 reply; 8+ messages in thread
From: ye.zou @ 2020-05-29  9:47 UTC (permalink / raw)
  To: qemu-devel

In step 3 I used following commands:
on primary vm console:
{"execute": "drive-mirror", "arguments":{ "device": "colo-disk0", "job-id": "resync", "target": "nbd://169.254.66.10:9999/parent0", "mode": "existing","format":"raw","sync":"full"} }

// till the job ready
{ "execute": "query-block-jobs" }

{"execute": "stop"}
{"execute": "block-job-cancel", "arguments":{ "device": "resync"} }

{'execute': 'human-monitor-command', 'arguments': {'command-line': 'drive_add -n buddy driver=replication,mode=primary,file.driver=nbd,file.host=169.254.66.10,file.port=9999,file.export=parent0,node-name=replication0'}}
{'execute': 'x-blockdev-change', 'arguments':{'parent': 'colo-disk0', 'node': 'replication0' } }
{'execute': 'migrate-set-capabilities', 'arguments': {'capabilities': [ {'capability': 'x-colo', 'state': true } ] } }
{'execute': 'migrate', 'arguments': {'uri': 'tcp:169.254.66.10:9998' } }
{ "execute": "migrate-set-parameters" , "arguments":{ "x-checkpoint-delay": 10000 } }

-- 
You received this bug notification because you are a member of qemu-
devel-ml, which is subscribed to QEMU.
https://bugs.launchpad.net/bugs/1881231

Title:
  colo: Can not  recover colo after svm failover twice

Status in QEMU:
  New

Bug description:
  Hi Expert,
  x-blockdev-change met some error, during testing colo

  Host os:
  CentOS Linux release 7.6.1810 (Core)

  Reproduce steps:
  1. create colo vm following https://github.com/qemu/qemu/blob/master/docs/COLO-FT.txt
  2. kill secondary vm and remove the nbd child from the quorum to wait for recover
    type those commands on primary vm console:
    { 'execute': 'x-blockdev-change', 'arguments': {'parent': 'colo-disk0', 'child': 'children.1'}}
    { 'execute': 'human-monitor-command','arguments': {'command-line': 'drive_del replication0'}}
    { 'execute': 'x-colo-lost-heartbeat'}
  3. recover colo
  4. kill secondary vm again after recover colo and type same commands as step 2:
    { 'execute': 'x-blockdev-change', 'arguments': {'parent': 'colo-disk0', 'child': 'children.1'}}
    { 'execute': 'human-monitor-command','arguments': {'command-line': 'drive_del replication0'}}
    { 'execute': 'x-colo-lost-heartbeat'}
    but the first command got error
    { 'execute': 'x-blockdev-change', 'arguments': {'parent': 'colo-disk0', 'child': 'children.1'}}
  {"error": {"class": "GenericError", "desc": "Node 'colo-disk0' does not have child 'children.1'"}}

  according to https://www.qemu.org/docs/master/qemu-qmp-ref.html
  Command: x-blockdev-change
  Dynamically reconfigure the block driver state graph. It can be used to add, remove, insert or replace a graph node. Currently only the Quorum driver implements this feature to add or remove its child. This is useful to fix a broken quorum child.

  It seems x-blockdev-change not worked as expected.

  Thanks.

To manage notifications about this bug go to:
https://bugs.launchpad.net/qemu/+bug/1881231/+subscriptions


^ permalink raw reply	[flat|nested] 8+ messages in thread

* [Bug 1881231] Re: colo: Can not recover colo after svm failover twice
  2020-05-29  4:06 [Bug 1881231] [NEW] colo: Can not recover colo after svm failover twice ye.zou
  2020-05-29  7:05 ` ye.zou
  2020-05-29  9:47 ` [Bug 1881231] " ye.zou
@ 2020-05-29  9:49 ` ye.zou
  2020-06-01  7:34 ` Zhang Chen
                   ` (2 subsequent siblings)
  5 siblings, 0 replies; 8+ messages in thread
From: ye.zou @ 2020-05-29  9:49 UTC (permalink / raw)
  To: qemu-devel

primary vm command:
./qemu-system-x86_64 -L /usr/share/qemu-kvm/ -enable-kvm -cpu qemu64,+kvmclock -m 2048 -smp 2 -qmp stdio -vnc :0 -device piix3-usb-uhci -device usb-tablet -name primary -netdev tap,id=hn0,vhost=off,br=br_bond1,helper=/usr/libexec/qemu-bridge-helper -device rtl8139,id=e0,netdev=hn0,romfile="" -chardev socket,id=mirror0,host=169.254.66.11,port=9003,server,nowait -chardev socket,id=compare1,host=169.254.66.11,port=9004,server,wait -chardev socket,id=compare0,host=169.254.66.11,port=9001,server,nowait -chardev socket,id=compare0-0,host=169.254.66.11,port=9001 -chardev socket,id=compare_out,host=169.254.66.11,port=9005,server,nowait -chardev socket,id=compare_out0,host=169.254.66.11,port=9005 -object filter-mirror,id=m0,netdev=hn0,queue=tx,outdev=mirror0 -object filter-redirector,netdev=hn0,id=redire0,queue=rx,indev=compare_out -object filter-redirector,netdev=hn0,id=redire1,queue=rx,outdev=compare0 -object iothread,id=iothread1 -object colo-compare,id=comp0,primary_in=compare0-0,secondary_in=compare1,outdev=compare_out0,iothread=iothread1 -drive if=ide,id=colo-disk0,driver=quorum,read-pattern=fifo,vote-threshold=1,children.0.file.filename=/root/Centos7.4.qcow2,children.0.driver=qcow2 -S

secondary vm:
./qemu-system-x86_64 -L /usr/share/qemu-kvm/ -enable-kvm -cpu qemu64,+kvmclock -m 2048 -smp 2 -qmp stdio -vnc :1 -device piix3-usb-uhci -device usb-tablet -name secondary -netdev tap,id=hn0,vhost=off,br=br_bond1,helper=/usr/libexec/qemu-bridge-helper -device rtl8139,id=e0,netdev=hn0,romfile="" -chardev socket,id=red0,host=169.254.66.11,port=9003,reconnect=1 -chardev socket,id=red1,host=169.254.66.11,port=9004,reconnect=1 -object filter-redirector,id=f1,netdev=hn0,queue=tx,indev=red0 -object filter-redirector,id=f2,netdev=hn0,queue=rx,outdev=red1 -object filter-rewriter,id=rew0,netdev=hn0,queue=all -drive if=none,id=parent0,file.filename=/root/Centos7.4.qcow2,driver=qcow2 -drive if=none,id=childs0,driver=replication,mode=secondary,file.driver=qcow2,top-id=childs0,file.file.filename=/root/active.qcow2,file.backing.driver=qcow2,file.backing.file.filename=/root/hidden.qcow2,file.backing.backing=parent0 -drive if=none,id=colo-disk0,driver=quorum,read-pattern=fifo,vote-threshold=1,children.0=childs0 -incoming tcp:0:9998

-- 
You received this bug notification because you are a member of qemu-
devel-ml, which is subscribed to QEMU.
https://bugs.launchpad.net/bugs/1881231

Title:
  colo: Can not  recover colo after svm failover twice

Status in QEMU:
  New

Bug description:
  Hi Expert,
  x-blockdev-change met some error, during testing colo

  Host os:
  CentOS Linux release 7.6.1810 (Core)

  Reproduce steps:
  1. create colo vm following https://github.com/qemu/qemu/blob/master/docs/COLO-FT.txt
  2. kill secondary vm and remove the nbd child from the quorum to wait for recover
    type those commands on primary vm console:
    { 'execute': 'x-blockdev-change', 'arguments': {'parent': 'colo-disk0', 'child': 'children.1'}}
    { 'execute': 'human-monitor-command','arguments': {'command-line': 'drive_del replication0'}}
    { 'execute': 'x-colo-lost-heartbeat'}
  3. recover colo
  4. kill secondary vm again after recover colo and type same commands as step 2:
    { 'execute': 'x-blockdev-change', 'arguments': {'parent': 'colo-disk0', 'child': 'children.1'}}
    { 'execute': 'human-monitor-command','arguments': {'command-line': 'drive_del replication0'}}
    { 'execute': 'x-colo-lost-heartbeat'}
    but the first command got error
    { 'execute': 'x-blockdev-change', 'arguments': {'parent': 'colo-disk0', 'child': 'children.1'}}
  {"error": {"class": "GenericError", "desc": "Node 'colo-disk0' does not have child 'children.1'"}}

  according to https://www.qemu.org/docs/master/qemu-qmp-ref.html
  Command: x-blockdev-change
  Dynamically reconfigure the block driver state graph. It can be used to add, remove, insert or replace a graph node. Currently only the Quorum driver implements this feature to add or remove its child. This is useful to fix a broken quorum child.

  It seems x-blockdev-change not worked as expected.

  Thanks.

To manage notifications about this bug go to:
https://bugs.launchpad.net/qemu/+bug/1881231/+subscriptions


^ permalink raw reply	[flat|nested] 8+ messages in thread

* RE: [Bug 1881231] Re: colo: Can not recover colo after svm failover twice
  2020-05-29  9:47 ` [Bug 1881231] " ye.zou
@ 2020-05-29 15:42   ` Zhang, Chen
  0 siblings, 0 replies; 8+ messages in thread
From: Zhang, Chen @ 2020-05-29 15:42 UTC (permalink / raw)
  To: Bug 1881231, qemu-devel; +Cc: ye.zou

Hi Ye,

Thank you for your test and report, I will try to fix this issue.

Thanks
Zhang Chen

> -----Original Message-----
> From: Qemu-devel <qemu-devel-
> bounces+chen.zhang=intel.com@nongnu.org> On Behalf Of ye.zou
> Sent: Friday, May 29, 2020 5:47 PM
> To: qemu-devel@nongnu.org
> Subject: [Bug 1881231] Re: colo: Can not recover colo after svm failover twice
> 
> In step 3 I used following commands:
> on primary vm console:
> {"execute": "drive-mirror", "arguments":{ "device": "colo-disk0", "job-id":
> "resync", "target": "nbd://169.254.66.10:9999/parent0", "mode":
> "existing","format":"raw","sync":"full"} }
> 
> // till the job ready
> { "execute": "query-block-jobs" }
> 
> {"execute": "stop"}
> {"execute": "block-job-cancel", "arguments":{ "device": "resync"} }
> 
> {'execute': 'human-monitor-command', 'arguments': {'command-line':
> 'drive_add -n buddy
> driver=replication,mode=primary,file.driver=nbd,file.host=169.254.66.10,file.
> port=9999,file.export=parent0,node-name=replication0'}}
> {'execute': 'x-blockdev-change', 'arguments':{'parent': 'colo-disk0', 'node':
> 'replication0' } }
> {'execute': 'migrate-set-capabilities', 'arguments': {'capabilities': [ {'capability':
> 'x-colo', 'state': true } ] } }
> {'execute': 'migrate', 'arguments': {'uri': 'tcp:169.254.66.10:9998' } }
> { "execute": "migrate-set-parameters" , "arguments":{ "x-checkpoint-delay":
> 10000 } }
> 
> --
> You received this bug notification because you are a member of qemu-
> devel-ml, which is subscribed to QEMU.
> https://bugs.launchpad.net/bugs/1881231
> 
> Title:
>   colo: Can not  recover colo after svm failover twice
> 
> Status in QEMU:
>   New
> 
> Bug description:
>   Hi Expert,
>   x-blockdev-change met some error, during testing colo
> 
>   Host os:
>   CentOS Linux release 7.6.1810 (Core)
> 
>   Reproduce steps:
>   1. create colo vm following
> https://github.com/qemu/qemu/blob/master/docs/COLO-FT.txt
>   2. kill secondary vm and remove the nbd child from the quorum to wait for
> recover
>     type those commands on primary vm console:
>     { 'execute': 'x-blockdev-change', 'arguments': {'parent': 'colo-disk0', 'child':
> 'children.1'}}
>     { 'execute': 'human-monitor-command','arguments': {'command-line':
> 'drive_del replication0'}}
>     { 'execute': 'x-colo-lost-heartbeat'}
>   3. recover colo
>   4. kill secondary vm again after recover colo and type same commands as
> step 2:
>     { 'execute': 'x-blockdev-change', 'arguments': {'parent': 'colo-disk0', 'child':
> 'children.1'}}
>     { 'execute': 'human-monitor-command','arguments': {'command-line':
> 'drive_del replication0'}}
>     { 'execute': 'x-colo-lost-heartbeat'}
>     but the first command got error
>     { 'execute': 'x-blockdev-change', 'arguments': {'parent': 'colo-disk0', 'child':
> 'children.1'}}
>   {"error": {"class": "GenericError", "desc": "Node 'colo-disk0' does not have
> child 'children.1'"}}
> 
>   according to https://www.qemu.org/docs/master/qemu-qmp-ref.html
>   Command: x-blockdev-change
>   Dynamically reconfigure the block driver state graph. It can be used to add,
> remove, insert or replace a graph node. Currently only the Quorum driver
> implements this feature to add or remove its child. This is useful to fix a
> broken quorum child.
> 
>   It seems x-blockdev-change not worked as expected.
> 
>   Thanks.
> 
> To manage notifications about this bug go to:
> https://bugs.launchpad.net/qemu/+bug/1881231/+subscriptions


^ permalink raw reply	[flat|nested] 8+ messages in thread

* [Bug 1881231] Re: colo: Can not recover colo after svm failover twice
  2020-05-29  4:06 [Bug 1881231] [NEW] colo: Can not recover colo after svm failover twice ye.zou
                   ` (2 preceding siblings ...)
  2020-05-29  9:49 ` ye.zou
@ 2020-06-01  7:34 ` Zhang Chen
  2020-09-30 11:02 ` Dr. David Alan Gilbert
  2020-12-10  8:40 ` Thomas Huth
  5 siblings, 0 replies; 8+ messages in thread
From: Zhang Chen @ 2020-06-01  7:34 UTC (permalink / raw)
  To: qemu-devel

Please try this patch:
https://lists.nongnu.org/archive/html/qemu-devel/2020-06/msg00016.html

Thanks
Zhang Chen

-- 
You received this bug notification because you are a member of qemu-
devel-ml, which is subscribed to QEMU.
https://bugs.launchpad.net/bugs/1881231

Title:
  colo: Can not  recover colo after svm failover twice

Status in QEMU:
  New

Bug description:
  Hi Expert,
  x-blockdev-change met some error, during testing colo

  Host os:
  CentOS Linux release 7.6.1810 (Core)

  Reproduce steps:
  1. create colo vm following https://github.com/qemu/qemu/blob/master/docs/COLO-FT.txt
  2. kill secondary vm and remove the nbd child from the quorum to wait for recover
    type those commands on primary vm console:
    { 'execute': 'x-blockdev-change', 'arguments': {'parent': 'colo-disk0', 'child': 'children.1'}}
    { 'execute': 'human-monitor-command','arguments': {'command-line': 'drive_del replication0'}}
    { 'execute': 'x-colo-lost-heartbeat'}
  3. recover colo
  4. kill secondary vm again after recover colo and type same commands as step 2:
    { 'execute': 'x-blockdev-change', 'arguments': {'parent': 'colo-disk0', 'child': 'children.1'}}
    { 'execute': 'human-monitor-command','arguments': {'command-line': 'drive_del replication0'}}
    { 'execute': 'x-colo-lost-heartbeat'}
    but the first command got error
    { 'execute': 'x-blockdev-change', 'arguments': {'parent': 'colo-disk0', 'child': 'children.1'}}
  {"error": {"class": "GenericError", "desc": "Node 'colo-disk0' does not have child 'children.1'"}}

  according to https://www.qemu.org/docs/master/qemu-qmp-ref.html
  Command: x-blockdev-change
  Dynamically reconfigure the block driver state graph. It can be used to add, remove, insert or replace a graph node. Currently only the Quorum driver implements this feature to add or remove its child. This is useful to fix a broken quorum child.

  It seems x-blockdev-change not worked as expected.

  Thanks.

To manage notifications about this bug go to:
https://bugs.launchpad.net/qemu/+bug/1881231/+subscriptions


^ permalink raw reply	[flat|nested] 8+ messages in thread

* [Bug 1881231] Re: colo: Can not recover colo after svm failover twice
  2020-05-29  4:06 [Bug 1881231] [NEW] colo: Can not recover colo after svm failover twice ye.zou
                   ` (3 preceding siblings ...)
  2020-06-01  7:34 ` Zhang Chen
@ 2020-09-30 11:02 ` Dr. David Alan Gilbert
  2020-12-10  8:40 ` Thomas Huth
  5 siblings, 0 replies; 8+ messages in thread
From: Dr. David Alan Gilbert @ 2020-09-30 11:02 UTC (permalink / raw)
  To: qemu-devel

I think Lukas's fix:
  block/quorum.c: stable children names

was merged a few days ago as 5eb9a3c7b0571562c0289747690e25e6855bc96f

Can someone confirm that helped?

** Changed in: qemu
       Status: New => Fix Committed

-- 
You received this bug notification because you are a member of qemu-
devel-ml, which is subscribed to QEMU.
https://bugs.launchpad.net/bugs/1881231

Title:
  colo: Can not  recover colo after svm failover twice

Status in QEMU:
  Fix Committed

Bug description:
  Hi Expert,
  x-blockdev-change met some error, during testing colo

  Host os:
  CentOS Linux release 7.6.1810 (Core)

  Reproduce steps:
  1. create colo vm following https://github.com/qemu/qemu/blob/master/docs/COLO-FT.txt
  2. kill secondary vm and remove the nbd child from the quorum to wait for recover
    type those commands on primary vm console:
    { 'execute': 'x-blockdev-change', 'arguments': {'parent': 'colo-disk0', 'child': 'children.1'}}
    { 'execute': 'human-monitor-command','arguments': {'command-line': 'drive_del replication0'}}
    { 'execute': 'x-colo-lost-heartbeat'}
  3. recover colo
  4. kill secondary vm again after recover colo and type same commands as step 2:
    { 'execute': 'x-blockdev-change', 'arguments': {'parent': 'colo-disk0', 'child': 'children.1'}}
    { 'execute': 'human-monitor-command','arguments': {'command-line': 'drive_del replication0'}}
    { 'execute': 'x-colo-lost-heartbeat'}
    but the first command got error
    { 'execute': 'x-blockdev-change', 'arguments': {'parent': 'colo-disk0', 'child': 'children.1'}}
  {"error": {"class": "GenericError", "desc": "Node 'colo-disk0' does not have child 'children.1'"}}

  according to https://www.qemu.org/docs/master/qemu-qmp-ref.html
  Command: x-blockdev-change
  Dynamically reconfigure the block driver state graph. It can be used to add, remove, insert or replace a graph node. Currently only the Quorum driver implements this feature to add or remove its child. This is useful to fix a broken quorum child.

  It seems x-blockdev-change not worked as expected.

  Thanks.

To manage notifications about this bug go to:
https://bugs.launchpad.net/qemu/+bug/1881231/+subscriptions


^ permalink raw reply	[flat|nested] 8+ messages in thread

* [Bug 1881231] Re: colo: Can not recover colo after svm failover twice
  2020-05-29  4:06 [Bug 1881231] [NEW] colo: Can not recover colo after svm failover twice ye.zou
                   ` (4 preceding siblings ...)
  2020-09-30 11:02 ` Dr. David Alan Gilbert
@ 2020-12-10  8:40 ` Thomas Huth
  5 siblings, 0 replies; 8+ messages in thread
From: Thomas Huth @ 2020-12-10  8:40 UTC (permalink / raw)
  To: qemu-devel

QEMU v5.2 has been released now and should contain the fix

** Changed in: qemu
       Status: Fix Committed => Fix Released

-- 
You received this bug notification because you are a member of qemu-
devel-ml, which is subscribed to QEMU.
https://bugs.launchpad.net/bugs/1881231

Title:
  colo: Can not  recover colo after svm failover twice

Status in QEMU:
  Fix Released

Bug description:
  Hi Expert,
  x-blockdev-change met some error, during testing colo

  Host os:
  CentOS Linux release 7.6.1810 (Core)

  Reproduce steps:
  1. create colo vm following https://github.com/qemu/qemu/blob/master/docs/COLO-FT.txt
  2. kill secondary vm and remove the nbd child from the quorum to wait for recover
    type those commands on primary vm console:
    { 'execute': 'x-blockdev-change', 'arguments': {'parent': 'colo-disk0', 'child': 'children.1'}}
    { 'execute': 'human-monitor-command','arguments': {'command-line': 'drive_del replication0'}}
    { 'execute': 'x-colo-lost-heartbeat'}
  3. recover colo
  4. kill secondary vm again after recover colo and type same commands as step 2:
    { 'execute': 'x-blockdev-change', 'arguments': {'parent': 'colo-disk0', 'child': 'children.1'}}
    { 'execute': 'human-monitor-command','arguments': {'command-line': 'drive_del replication0'}}
    { 'execute': 'x-colo-lost-heartbeat'}
    but the first command got error
    { 'execute': 'x-blockdev-change', 'arguments': {'parent': 'colo-disk0', 'child': 'children.1'}}
  {"error": {"class": "GenericError", "desc": "Node 'colo-disk0' does not have child 'children.1'"}}

  according to https://www.qemu.org/docs/master/qemu-qmp-ref.html
  Command: x-blockdev-change
  Dynamically reconfigure the block driver state graph. It can be used to add, remove, insert or replace a graph node. Currently only the Quorum driver implements this feature to add or remove its child. This is useful to fix a broken quorum child.

  It seems x-blockdev-change not worked as expected.

  Thanks.

To manage notifications about this bug go to:
https://bugs.launchpad.net/qemu/+bug/1881231/+subscriptions


^ permalink raw reply	[flat|nested] 8+ messages in thread

end of thread, other threads:[~2020-12-10  8:52 UTC | newest]

Thread overview: 8+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2020-05-29  4:06 [Bug 1881231] [NEW] colo: Can not recover colo after svm failover twice ye.zou
2020-05-29  7:05 ` ye.zou
2020-05-29  9:47 ` [Bug 1881231] " ye.zou
2020-05-29 15:42   ` Zhang, Chen
2020-05-29  9:49 ` ye.zou
2020-06-01  7:34 ` Zhang Chen
2020-09-30 11:02 ` Dr. David Alan Gilbert
2020-12-10  8:40 ` Thomas Huth

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.