From: Liwei <xieliwei@gmail.com>
To: linux-lvm@redhat.com
Subject: [linux-lvm] Unsync-ed LVM Mirror
Date: Sat, 3 Feb 2018 17:43:05 +0800 [thread overview]
Message-ID: <CAPE0SYxuLG3sqoaez=CV4ZjQEcB+iqzB+CM7q6jrLg9z+L6kzQ@mail.gmail.com> (raw)
Hi list,
I had a LV that I was converting from linear to mirrored (not
raid1) whose source device failed partway-through during the initial
sync.
I've since recovered the source device, but it seems like the
mirror is still acting as if some blocks are not readable? I'm getting
this in my logs, and the FS is full of errors:
[ +1.613126] device-mapper: raid1: Unable to read primary mirror
during recovery
[ +0.000278] device-mapper: raid1: Primary mirror (253:25) failed
while out-of-sync: Reads may fail.
[ +0.085916] device-mapper: raid1: Mirror read failed.
[ +0.196562] device-mapper: raid1: Mirror read failed.
[ +0.000237] Buffer I/O error on dev dm-27, logical block 5371800560,
async page read
[ +0.592135] device-mapper: raid1: Unable to read primary mirror
during recovery
[ +0.082882] device-mapper: raid1: Unable to read primary mirror
during recovery
[ +0.246945] device-mapper: raid1: Unable to read primary mirror
during recovery
[ +0.107374] device-mapper: raid1: Unable to read primary mirror
during recovery
[ +0.083344] device-mapper: raid1: Unable to read primary mirror
during recovery
[ +0.114949] device-mapper: raid1: Unable to read primary mirror
during recovery
[ +0.085056] device-mapper: raid1: Unable to read primary mirror
during recovery
[ +0.203929] device-mapper: raid1: Unable to read primary mirror
during recovery
[ +0.157953] device-mapper: raid1: Unable to read primary mirror
during recovery
[ +3.065247] recovery_complete: 23 callbacks suppressed
[ +0.000001] device-mapper: raid1: Unable to read primary mirror
during recovery
[ +0.128064] device-mapper: raid1: Unable to read primary mirror
during recovery
[ +0.103100] device-mapper: raid1: Unable to read primary mirror
during recovery
[ +0.107827] device-mapper: raid1: Unable to read primary mirror
during recovery
[ +0.140871] device-mapper: raid1: Unable to read primary mirror
during recovery
[ +0.132844] device-mapper: raid1: Unable to read primary mirror
during recovery
[ +0.124698] device-mapper: raid1: Unable to read primary mirror
during recovery
[ +0.138502] device-mapper: raid1: Unable to read primary mirror
during recovery
[ +0.117827] device-mapper: raid1: Unable to read primary mirror
during recovery
[ +0.125705] device-mapper: raid1: Unable to read primary mirror
during recovery
[Feb 3 17:09] device-mapper: raid1: Mirror read failed.
[ +0.167553] device-mapper: raid1: Mirror read failed.
[ +0.000268] Buffer I/O error on dev dm-27, logical block 5367765816,
async page read
[ +0.135138] device-mapper: raid1: Mirror read failed.
[ +0.000238] Buffer I/O error on dev dm-27, logical block 5367765816,
async page read
[ +0.000365] device-mapper: raid1: Mirror read failed.
[ +0.000315] device-mapper: raid1: Mirror read failed.
[ +0.000213] Buffer I/O error on dev dm-27, logical block 5367896888,
async page read
[ +0.000276] device-mapper: raid1: Mirror read failed.
[ +0.000199] Buffer I/O error on dev dm-27, logical block 5367765816,
async page read
However, if I take down the destination device and restart the LV
with --activateoption partial, I can read my data and everything
checks out.
My theory (and what I observed) is that lvm continued the initial
sync even after the source drive stopped responding, and has now
mapped the blocks that it 'synced' as dead. How can I make lvm retry
those blocks again?
In fact, I don't trust the mirror anymore, is there a way I can
conduct a scrub of the mirror after the initial sync is done? I read
about --syncaction check, but seems like it only notes the number of
inconsistencies. Can I have lvm re-mirror the inconsistencies from the
source to destination device? I trust the source device because we ran
a btrfs scrub on it and it reported that all checksums are valid.
It took months for the mirror sync to get to this stage (actually,
why does it take months to mirror 20TB?), I don't want to start it all
over again.
Warm regards,
Liwei
next reply other threads:[~2018-02-03 9:43 UTC|newest]
Thread overview: 7+ messages / expand[flat|nested] mbox.gz Atom feed top
2018-02-03 9:43 Liwei [this message]
2018-02-05 3:21 ` [linux-lvm] Unsync-ed LVM Mirror Liwei
2018-02-05 7:27 ` Eric Ren
2018-02-05 7:42 ` Liwei
2018-02-05 8:43 ` Eric Ren
2018-02-05 9:26 ` Liwei
2018-02-05 10:07 ` Eric Ren
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to='CAPE0SYxuLG3sqoaez=CV4ZjQEcB+iqzB+CM7q6jrLg9z+L6kzQ@mail.gmail.com' \
--to=xieliwei@gmail.com \
--cc=linux-lvm@redhat.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).