All of lore.kernel.org
 help / color / mirror / Atom feed
* Error on rebooting
@ 2013-07-26  0:19 Pete
  2013-07-26  7:47 ` Hugo Mills
  0 siblings, 1 reply; 3+ messages in thread
From: Pete @ 2013-07-26  0:19 UTC (permalink / raw)
  To: linux-btrfs

Dear All,

Have I anything to be concerned about?

I have got some error messages on booting.  The scenario was that I had 
installed some ram and I suspect that I had disturbed a cable as one 
disk was not visible.  I could not mount the other disk (did not try 
degraded, but the messages seemed to indicate something serious was up).

After installing ram booted.  But some issue with some files, anything 
accessing those files froze.  Had to reboot.  Failed to shutdown 
correctly (shutdown stalled on unmount)

Reboot.

/home etc not mounted (btrfs in question)

Btrfsck /dev/sdb showed various errors.

When complete turned off machine.  Fiddled with cables.  Affected drive 
now seen on reboot.

Rebooted.  Mounted disks (perhaps) error messages may have been present 
on boot.  Much disk IO.  Disk IO stopped.  Machine appeared frozen 
except that Caps lock and Num lock worked.  Ctrl-alt-backspace did not 
sort out stalled x(?)dm session.  Hard power down.

Last reboot.  Error messages.  However, works.  Example messages from dmesg:

[    8.063138] btrfs: enabling inode map caching
[    8.067617] btrfs: use lzo compression
[    8.072092] btrfs: disk space caching is enabled
[    8.147324] btrfs: bdev /dev/sdb errs: wr 4015, rd 464, flush 0, 
corrupt 0, gen 0
[    8.802275] NET: Registered protocol family 10
[   15.462313] device fsid 2628a800-e095-4460-9b93-8847e9fb626b devid 2 
transid 27794 /dev/sdc
[   15.511463] device fsid 2628a800-e095-4460-9b93-8847e9fb626b devid 2 
transid 27794 /dev/sdc
[   15.566689] device fsid 2628a800-e095-4460-9b93-8847e9fb626b devid 2 
transid 27794 /dev/sdc
[   15.587851] device fsid 2628a800-e095-4460-9b93-8847e9fb626b devid 2 
transid 27794 /dev/sdc
[   15.620678] device fsid 2628a800-e095-4460-9b93-8847e9fb626b devid 2 
transid 27794 /dev/sdc
[   16.024295] IPv6: ADDRCONF(NETDEV_UP): eth0: link is not ready
...

...

...

...
    19.491507] tun: (C) 1999-2004 Max Krasnyansky <maxk@qualcomm.com>
[   56.064899] parent transid verify failed on 1142639534080 wanted 
27788 found 26856
[   56.154721] btrfs read error corrected: ino 1 off 1142639534080 (dev 
/dev/sdb sector 2179305424)
[   56.166301] parent transid verify failed on 1142597795840 wanted 
27777 found 27772
[   56.186790] btrfs read error corrected: ino 1 off 1142597795840 (dev 
/dev/sdb sector 2179223904)
[   56.460857] parent transid verify failed on 1142599532544 wanted 
27779 found 27772
[   56.461396] btrfs read error corrected: ino 1 off 1142599532544 (dev 
/dev/sdb sector 2179227296)
[   59.927078] ata1.00: configured for UDMA/133
[   59.927082] ata1: EH complete
[   59.933467] ata2.00: configured for UDMA/133
[   59.933473] ata2: EH complete
[   60.129445] ata3.00: configured for UDMA/133
[   60.129458] ata3: EH complete
[   61.449810] parent transid verify failed on 1142629605376 wanted 
27784 found 26856
[   61.473817] btrfs read error corrected: ino 1 off 1142629605376 (dev 
/dev/sdb sector 2179286032)
[   61.478075] parent transid verify failed on 1142629638144 wanted 
27784 found 26856
[   61.478574] btrfs read error corrected: ino 1 off 1142629638144 (dev 
/dev/sdb sector 2179286096)
[   61.478743] parent transid verify failed on 1142629658624 wanted 
27784 found 26856
[   61.478946] btrfs read error corrected: ino 1 off 1142629658624 (dev 
/dev/sdb sector 2179286136)
[   61.479147] parent transid verify failed on 1142629847040 wanted 
27784 found 26856
[   61.479382] btrfs read error corrected: ino 1 off 1142629847040 (dev 
/dev/sdb sector 2179286504)
[   61.479767] parent transid verify failed on 1142630506496 wanted 
27784 found 26856
[   61.480691] btrfs read error corrected: ino 1 off 1142630506496 (dev 
/dev/sdb sector 2179287792)
[   61.501092] parent transid verify failed on 1142629761024 wanted 
27784 found 26856
[   61.501423] btrfs read error corrected: ino 1 off 1142629761024 (dev 
/dev/sdb sector 2179286336)
[   62.704754] kded4[2419]: segfault at 10 ip 00007f99a11b26e0 sp 
00007fff4305e578 error 4 in libkscreen.so.0.9.0[7f99a11a7000+e000]
[   85.012565] parent transid verify failed on 1142612619264 wanted 
27777 found 26856
[   85.049566] btrfs read error corrected: ino 1 off 1142612619264 (dev 
/dev/sdb sector 2179252856)
[   87.961731] btrfs csum failed ino 749162 off 0 csum 2452727536 
private 1516042199
[   87.975603] btrfs read error corrected: ino 749162 off 0 (dev 
/dev/sdb sector 2181130648)
[   87.981595] btrfs csum failed ino 749163 off 0 csum 459327135 private 
1516042199
[   87.992897] btrfs read error corrected: ino 749163 off 0 (dev 
/dev/sdb sector 2181149880)
[  104.179638] parent transid verify failed on 1142638817280 wanted 
27786 found 26856
[  104.189146] btrfs read error corrected: ino 1 off 1142638817280 (dev 
/dev/sdb sector 2179304024)
[  104.197071] btrfs csum failed ino 1544486 off 0 csum 4176447263 
private 467839912
[  104.197136] btrfs csum failed ino 1544486 off 4096 csum 3482415336 
private 475019870
[  104.198076] btrfs csum failed ino 1544486 off 0 csum 4176447263 
private 467839912
[  104.198140] btrfs csum failed ino 1544486 off 4096 csum 3482415336 
private 475019870
[  104.204035] btrfs read error corrected: ino 1544486 off 0 (dev 
/dev/sdb sector 2182960392)
[  104.204551] btrfs read error corrected: ino 1544486 off 4096 (dev 
/dev/sdb sector 2182960400)
[  117.249253] parent transid verify failed on 1142609051648 wanted 
27774 found 26856
[  117.255886] btrfs read error corrected: ino 1 off 1142609051648 (dev 
/dev/sdb sector 2179245888)
[  117.419294] parent transid verify failed on 1142599507968 wanted 
27779 found 27772
[  117.437317] btrfs read error corrected: ino 1 off 1142599507968 (dev 
/dev/sdb sector 2179227248)
[  137.502176] NFSD: Unable to end grace period: -110

Given that I have booted now - does this mean that the above was btrfs 
sorting itself out?

Thanks

Pete

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: Error on rebooting
  2013-07-26  0:19 Error on rebooting Pete
@ 2013-07-26  7:47 ` Hugo Mills
  2013-07-26 12:10   ` Pete
  0 siblings, 1 reply; 3+ messages in thread
From: Hugo Mills @ 2013-07-26  7:47 UTC (permalink / raw)
  To: Pete; +Cc: linux-btrfs

[-- Attachment #1: Type: text/plain, Size: 4372 bytes --]

On Fri, Jul 26, 2013 at 01:19:40AM +0100, Pete wrote:
> Dear All,
> 
> Have I anything to be concerned about?
> 
> I have got some error messages on booting.  The scenario was that I
> had installed some ram and I suspect that I had disturbed a cable as
> one disk was not visible.  I could not mount the other disk (did not
> try degraded, but the messages seemed to indicate something serious
> was up).
> 
> After installing ram booted.  But some issue with some files,
> anything accessing those files froze.  Had to reboot.  Failed to
> shutdown correctly (shutdown stalled on unmount)
> 
> Reboot.
> 
> /home etc not mounted (btrfs in question)
> 
> Btrfsck /dev/sdb showed various errors.
> 
> When complete turned off machine.  Fiddled with cables.  Affected
> drive now seen on reboot.
> 
> Rebooted.  Mounted disks (perhaps) error messages may have been
> present on boot.  Much disk IO.  Disk IO stopped.  Machine appeared
> frozen except that Caps lock and Num lock worked.
> Ctrl-alt-backspace did not sort out stalled x(?)dm session.  Hard
> power down.
> 
> Last reboot.  Error messages.  However, works.  Example messages from dmesg:
> 
> [    8.063138] btrfs: enabling inode map caching
> [    8.067617] btrfs: use lzo compression
> [    8.072092] btrfs: disk space caching is enabled
> [    8.147324] btrfs: bdev /dev/sdb errs: wr 4015, rd 464, flush 0,
> corrupt 0, gen 0
> [    8.802275] NET: Registered protocol family 10
> [   15.462313] device fsid 2628a800-e095-4460-9b93-8847e9fb626b
> devid 2 transid 27794 /dev/sdc
> [   15.511463] device fsid 2628a800-e095-4460-9b93-8847e9fb626b
> devid 2 transid 27794 /dev/sdc
> [   15.566689] device fsid 2628a800-e095-4460-9b93-8847e9fb626b
> devid 2 transid 27794 /dev/sdc
> [   15.587851] device fsid 2628a800-e095-4460-9b93-8847e9fb626b
> devid 2 transid 27794 /dev/sdc
> [   15.620678] device fsid 2628a800-e095-4460-9b93-8847e9fb626b
> devid 2 transid 27794 /dev/sdc
> [   16.024295] IPv6: ADDRCONF(NETDEV_UP): eth0: link is not ready
> ...
> 
> ...
> 
> ...
> 
> ...
>    19.491507] tun: (C) 1999-2004 Max Krasnyansky <maxk@qualcomm.com>
> [   56.064899] parent transid verify failed on 1142639534080 wanted
> 27788 found 26856
> [   56.154721] btrfs read error corrected: ino 1 off 1142639534080
> (dev /dev/sdb sector 2179305424)
> [   56.166301] parent transid verify failed on 1142597795840 wanted
> 27777 found 27772
> [   56.186790] btrfs read error corrected: ino 1 off 1142597795840
> (dev /dev/sdb sector 2179223904)
> [   56.460857] parent transid verify failed on 1142599532544 wanted
> 27779 found 27772
> [   56.461396] btrfs read error corrected: ino 1 off 1142599532544
> (dev /dev/sdb sector 2179227296)
> [   59.927078] ata1.00: configured for UDMA/133
> [   59.927082] ata1: EH complete
> [   59.933467] ata2.00: configured for UDMA/133
> [   59.933473] ata2: EH complete
> [   60.129445] ata3.00: configured for UDMA/133
> [   60.129458] ata3: EH complete
> [   61.449810] parent transid verify failed on 1142629605376 wanted
> 27784 found 26856
> [   61.473817] btrfs read error corrected: ino 1 off 1142629605376
> (dev /dev/sdb sector 2179286032)
[snip]
> [  104.204035] btrfs read error corrected: ino 1544486 off 0 (dev
> /dev/sdb sector 2182960392)
> [  104.204551] btrfs read error corrected: ino 1544486 off 4096 (dev
> /dev/sdb sector 2182960400)
> [  117.249253] parent transid verify failed on 1142609051648 wanted
> 27774 found 26856
> [  117.255886] btrfs read error corrected: ino 1 off 1142609051648
> (dev /dev/sdb sector 2179245888)
> [  117.419294] parent transid verify failed on 1142599507968 wanted
> 27779 found 27772
> [  117.437317] btrfs read error corrected: ino 1 off 1142599507968
> (dev /dev/sdb sector 2179227248)
> [  137.502176] NFSD: Unable to end grace period: -110
> 
> Given that I have booted now - does this mean that the above was
> btrfs sorting itself out?

   Looks like it. I'd recommend a scrub to check for any other out of
date data on the affected drive. I've done pretty much the same thing
as this myself, and a scrub, though scary in the amount of noise it
made, fixed everything satisfactorily.

   Hugo.

-- 
=== Hugo Mills: hugo@... carfax.org.uk | darksatanic.net | lug.org.uk ===
  PGP key: 65E74AC0 from wwwkeys.eu.pgp.net or http://www.carfax.org.uk
           --- Sometimes, when I'm alone, I Google myself. ---           

[-- Attachment #2: Digital signature --]
[-- Type: application/pgp-signature, Size: 828 bytes --]

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: Error on rebooting
  2013-07-26  7:47 ` Hugo Mills
@ 2013-07-26 12:10   ` Pete
  0 siblings, 0 replies; 3+ messages in thread
From: Pete @ 2013-07-26 12:10 UTC (permalink / raw)
  To: linux-btrfs

Hugo,

thanks.

On 07/26/2013 08:47 AM, Hugo Mills wrote:

>     Looks like it. I'd recommend a scrub to check for any other out of
> date data on the affected drive. I've done pretty much the same thing
> as this myself, and a scrub, though scary in the amount of noise it
> made, fixed everything satisfactorily.

bash-4.2# btrfs scrub start -Bd /mnt/data-pool/
scrub device /dev/sdb (id 1) done
         scrub started at Fri Jul 26 08:18:00 2013 and finished after 
9849 seconds
         total bytes scrubbed: 984.77GB with 540 errors
         error details: verify=20 csum=520
         corrected errors: 540, uncorrectable errors: 0, unverified 
errors: 0


So a bit of a wobble but raid1 to the rescue!  Not sure what caused the 
wobble.  But all is well now.

Pete




^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2013-07-26 12:10 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2013-07-26  0:19 Error on rebooting Pete
2013-07-26  7:47 ` Hugo Mills
2013-07-26 12:10   ` Pete

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.