* Error on rebooting
@ 2013-07-26 0:19 Pete
2013-07-26 7:47 ` Hugo Mills
0 siblings, 1 reply; 3+ messages in thread
From: Pete @ 2013-07-26 0:19 UTC (permalink / raw)
To: linux-btrfs
Dear All,
Have I anything to be concerned about?
I have got some error messages on booting. The scenario was that I had
installed some ram and I suspect that I had disturbed a cable as one
disk was not visible. I could not mount the other disk (did not try
degraded, but the messages seemed to indicate something serious was up).
After installing ram booted. But some issue with some files, anything
accessing those files froze. Had to reboot. Failed to shutdown
correctly (shutdown stalled on unmount)
Reboot.
/home etc not mounted (btrfs in question)
Btrfsck /dev/sdb showed various errors.
When complete turned off machine. Fiddled with cables. Affected drive
now seen on reboot.
Rebooted. Mounted disks (perhaps) error messages may have been present
on boot. Much disk IO. Disk IO stopped. Machine appeared frozen
except that Caps lock and Num lock worked. Ctrl-alt-backspace did not
sort out stalled x(?)dm session. Hard power down.
Last reboot. Error messages. However, works. Example messages from dmesg:
[ 8.063138] btrfs: enabling inode map caching
[ 8.067617] btrfs: use lzo compression
[ 8.072092] btrfs: disk space caching is enabled
[ 8.147324] btrfs: bdev /dev/sdb errs: wr 4015, rd 464, flush 0,
corrupt 0, gen 0
[ 8.802275] NET: Registered protocol family 10
[ 15.462313] device fsid 2628a800-e095-4460-9b93-8847e9fb626b devid 2
transid 27794 /dev/sdc
[ 15.511463] device fsid 2628a800-e095-4460-9b93-8847e9fb626b devid 2
transid 27794 /dev/sdc
[ 15.566689] device fsid 2628a800-e095-4460-9b93-8847e9fb626b devid 2
transid 27794 /dev/sdc
[ 15.587851] device fsid 2628a800-e095-4460-9b93-8847e9fb626b devid 2
transid 27794 /dev/sdc
[ 15.620678] device fsid 2628a800-e095-4460-9b93-8847e9fb626b devid 2
transid 27794 /dev/sdc
[ 16.024295] IPv6: ADDRCONF(NETDEV_UP): eth0: link is not ready
...
...
...
...
19.491507] tun: (C) 1999-2004 Max Krasnyansky <maxk@qualcomm.com>
[ 56.064899] parent transid verify failed on 1142639534080 wanted
27788 found 26856
[ 56.154721] btrfs read error corrected: ino 1 off 1142639534080 (dev
/dev/sdb sector 2179305424)
[ 56.166301] parent transid verify failed on 1142597795840 wanted
27777 found 27772
[ 56.186790] btrfs read error corrected: ino 1 off 1142597795840 (dev
/dev/sdb sector 2179223904)
[ 56.460857] parent transid verify failed on 1142599532544 wanted
27779 found 27772
[ 56.461396] btrfs read error corrected: ino 1 off 1142599532544 (dev
/dev/sdb sector 2179227296)
[ 59.927078] ata1.00: configured for UDMA/133
[ 59.927082] ata1: EH complete
[ 59.933467] ata2.00: configured for UDMA/133
[ 59.933473] ata2: EH complete
[ 60.129445] ata3.00: configured for UDMA/133
[ 60.129458] ata3: EH complete
[ 61.449810] parent transid verify failed on 1142629605376 wanted
27784 found 26856
[ 61.473817] btrfs read error corrected: ino 1 off 1142629605376 (dev
/dev/sdb sector 2179286032)
[ 61.478075] parent transid verify failed on 1142629638144 wanted
27784 found 26856
[ 61.478574] btrfs read error corrected: ino 1 off 1142629638144 (dev
/dev/sdb sector 2179286096)
[ 61.478743] parent transid verify failed on 1142629658624 wanted
27784 found 26856
[ 61.478946] btrfs read error corrected: ino 1 off 1142629658624 (dev
/dev/sdb sector 2179286136)
[ 61.479147] parent transid verify failed on 1142629847040 wanted
27784 found 26856
[ 61.479382] btrfs read error corrected: ino 1 off 1142629847040 (dev
/dev/sdb sector 2179286504)
[ 61.479767] parent transid verify failed on 1142630506496 wanted
27784 found 26856
[ 61.480691] btrfs read error corrected: ino 1 off 1142630506496 (dev
/dev/sdb sector 2179287792)
[ 61.501092] parent transid verify failed on 1142629761024 wanted
27784 found 26856
[ 61.501423] btrfs read error corrected: ino 1 off 1142629761024 (dev
/dev/sdb sector 2179286336)
[ 62.704754] kded4[2419]: segfault at 10 ip 00007f99a11b26e0 sp
00007fff4305e578 error 4 in libkscreen.so.0.9.0[7f99a11a7000+e000]
[ 85.012565] parent transid verify failed on 1142612619264 wanted
27777 found 26856
[ 85.049566] btrfs read error corrected: ino 1 off 1142612619264 (dev
/dev/sdb sector 2179252856)
[ 87.961731] btrfs csum failed ino 749162 off 0 csum 2452727536
private 1516042199
[ 87.975603] btrfs read error corrected: ino 749162 off 0 (dev
/dev/sdb sector 2181130648)
[ 87.981595] btrfs csum failed ino 749163 off 0 csum 459327135 private
1516042199
[ 87.992897] btrfs read error corrected: ino 749163 off 0 (dev
/dev/sdb sector 2181149880)
[ 104.179638] parent transid verify failed on 1142638817280 wanted
27786 found 26856
[ 104.189146] btrfs read error corrected: ino 1 off 1142638817280 (dev
/dev/sdb sector 2179304024)
[ 104.197071] btrfs csum failed ino 1544486 off 0 csum 4176447263
private 467839912
[ 104.197136] btrfs csum failed ino 1544486 off 4096 csum 3482415336
private 475019870
[ 104.198076] btrfs csum failed ino 1544486 off 0 csum 4176447263
private 467839912
[ 104.198140] btrfs csum failed ino 1544486 off 4096 csum 3482415336
private 475019870
[ 104.204035] btrfs read error corrected: ino 1544486 off 0 (dev
/dev/sdb sector 2182960392)
[ 104.204551] btrfs read error corrected: ino 1544486 off 4096 (dev
/dev/sdb sector 2182960400)
[ 117.249253] parent transid verify failed on 1142609051648 wanted
27774 found 26856
[ 117.255886] btrfs read error corrected: ino 1 off 1142609051648 (dev
/dev/sdb sector 2179245888)
[ 117.419294] parent transid verify failed on 1142599507968 wanted
27779 found 27772
[ 117.437317] btrfs read error corrected: ino 1 off 1142599507968 (dev
/dev/sdb sector 2179227248)
[ 137.502176] NFSD: Unable to end grace period: -110
Given that I have booted now - does this mean that the above was btrfs
sorting itself out?
Thanks
Pete
^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: Error on rebooting
2013-07-26 0:19 Error on rebooting Pete
@ 2013-07-26 7:47 ` Hugo Mills
2013-07-26 12:10 ` Pete
0 siblings, 1 reply; 3+ messages in thread
From: Hugo Mills @ 2013-07-26 7:47 UTC (permalink / raw)
To: Pete; +Cc: linux-btrfs
[-- Attachment #1: Type: text/plain, Size: 4372 bytes --]
On Fri, Jul 26, 2013 at 01:19:40AM +0100, Pete wrote:
> Dear All,
>
> Have I anything to be concerned about?
>
> I have got some error messages on booting. The scenario was that I
> had installed some ram and I suspect that I had disturbed a cable as
> one disk was not visible. I could not mount the other disk (did not
> try degraded, but the messages seemed to indicate something serious
> was up).
>
> After installing ram booted. But some issue with some files,
> anything accessing those files froze. Had to reboot. Failed to
> shutdown correctly (shutdown stalled on unmount)
>
> Reboot.
>
> /home etc not mounted (btrfs in question)
>
> Btrfsck /dev/sdb showed various errors.
>
> When complete turned off machine. Fiddled with cables. Affected
> drive now seen on reboot.
>
> Rebooted. Mounted disks (perhaps) error messages may have been
> present on boot. Much disk IO. Disk IO stopped. Machine appeared
> frozen except that Caps lock and Num lock worked.
> Ctrl-alt-backspace did not sort out stalled x(?)dm session. Hard
> power down.
>
> Last reboot. Error messages. However, works. Example messages from dmesg:
>
> [ 8.063138] btrfs: enabling inode map caching
> [ 8.067617] btrfs: use lzo compression
> [ 8.072092] btrfs: disk space caching is enabled
> [ 8.147324] btrfs: bdev /dev/sdb errs: wr 4015, rd 464, flush 0,
> corrupt 0, gen 0
> [ 8.802275] NET: Registered protocol family 10
> [ 15.462313] device fsid 2628a800-e095-4460-9b93-8847e9fb626b
> devid 2 transid 27794 /dev/sdc
> [ 15.511463] device fsid 2628a800-e095-4460-9b93-8847e9fb626b
> devid 2 transid 27794 /dev/sdc
> [ 15.566689] device fsid 2628a800-e095-4460-9b93-8847e9fb626b
> devid 2 transid 27794 /dev/sdc
> [ 15.587851] device fsid 2628a800-e095-4460-9b93-8847e9fb626b
> devid 2 transid 27794 /dev/sdc
> [ 15.620678] device fsid 2628a800-e095-4460-9b93-8847e9fb626b
> devid 2 transid 27794 /dev/sdc
> [ 16.024295] IPv6: ADDRCONF(NETDEV_UP): eth0: link is not ready
> ...
>
> ...
>
> ...
>
> ...
> 19.491507] tun: (C) 1999-2004 Max Krasnyansky <maxk@qualcomm.com>
> [ 56.064899] parent transid verify failed on 1142639534080 wanted
> 27788 found 26856
> [ 56.154721] btrfs read error corrected: ino 1 off 1142639534080
> (dev /dev/sdb sector 2179305424)
> [ 56.166301] parent transid verify failed on 1142597795840 wanted
> 27777 found 27772
> [ 56.186790] btrfs read error corrected: ino 1 off 1142597795840
> (dev /dev/sdb sector 2179223904)
> [ 56.460857] parent transid verify failed on 1142599532544 wanted
> 27779 found 27772
> [ 56.461396] btrfs read error corrected: ino 1 off 1142599532544
> (dev /dev/sdb sector 2179227296)
> [ 59.927078] ata1.00: configured for UDMA/133
> [ 59.927082] ata1: EH complete
> [ 59.933467] ata2.00: configured for UDMA/133
> [ 59.933473] ata2: EH complete
> [ 60.129445] ata3.00: configured for UDMA/133
> [ 60.129458] ata3: EH complete
> [ 61.449810] parent transid verify failed on 1142629605376 wanted
> 27784 found 26856
> [ 61.473817] btrfs read error corrected: ino 1 off 1142629605376
> (dev /dev/sdb sector 2179286032)
[snip]
> [ 104.204035] btrfs read error corrected: ino 1544486 off 0 (dev
> /dev/sdb sector 2182960392)
> [ 104.204551] btrfs read error corrected: ino 1544486 off 4096 (dev
> /dev/sdb sector 2182960400)
> [ 117.249253] parent transid verify failed on 1142609051648 wanted
> 27774 found 26856
> [ 117.255886] btrfs read error corrected: ino 1 off 1142609051648
> (dev /dev/sdb sector 2179245888)
> [ 117.419294] parent transid verify failed on 1142599507968 wanted
> 27779 found 27772
> [ 117.437317] btrfs read error corrected: ino 1 off 1142599507968
> (dev /dev/sdb sector 2179227248)
> [ 137.502176] NFSD: Unable to end grace period: -110
>
> Given that I have booted now - does this mean that the above was
> btrfs sorting itself out?
Looks like it. I'd recommend a scrub to check for any other out of
date data on the affected drive. I've done pretty much the same thing
as this myself, and a scrub, though scary in the amount of noise it
made, fixed everything satisfactorily.
Hugo.
--
=== Hugo Mills: hugo@... carfax.org.uk | darksatanic.net | lug.org.uk ===
PGP key: 65E74AC0 from wwwkeys.eu.pgp.net or http://www.carfax.org.uk
--- Sometimes, when I'm alone, I Google myself. ---
[-- Attachment #2: Digital signature --]
[-- Type: application/pgp-signature, Size: 828 bytes --]
^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: Error on rebooting
2013-07-26 7:47 ` Hugo Mills
@ 2013-07-26 12:10 ` Pete
0 siblings, 0 replies; 3+ messages in thread
From: Pete @ 2013-07-26 12:10 UTC (permalink / raw)
To: linux-btrfs
Hugo,
thanks.
On 07/26/2013 08:47 AM, Hugo Mills wrote:
> Looks like it. I'd recommend a scrub to check for any other out of
> date data on the affected drive. I've done pretty much the same thing
> as this myself, and a scrub, though scary in the amount of noise it
> made, fixed everything satisfactorily.
bash-4.2# btrfs scrub start -Bd /mnt/data-pool/
scrub device /dev/sdb (id 1) done
scrub started at Fri Jul 26 08:18:00 2013 and finished after
9849 seconds
total bytes scrubbed: 984.77GB with 540 errors
error details: verify=20 csum=520
corrected errors: 540, uncorrectable errors: 0, unverified
errors: 0
So a bit of a wobble but raid1 to the rescue! Not sure what caused the
wobble. But all is well now.
Pete
^ permalink raw reply [flat|nested] 3+ messages in thread
end of thread, other threads:[~2013-07-26 12:10 UTC | newest]
Thread overview: 3+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2013-07-26 0:19 Error on rebooting Pete
2013-07-26 7:47 ` Hugo Mills
2013-07-26 12:10 ` Pete
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.