* [Ocfs2-devel] Problems with fsck
@ 2011-01-12 15:46 Massimo Cetra
2011-01-12 16:16 ` Sunil Mushran
0 siblings, 1 reply; 9+ messages in thread
From: Massimo Cetra @ 2011-01-12 15:46 UTC (permalink / raw)
To: ocfs2-devel
Hi List,
i'd like to share with you what happened yesterday.
Kernel 2.6.36.1
ocfs2-tools 1.6.3 (latest).
I had an old OCFS2 partition created with a 2.6.32 kernel and ocfs2
tools 1.4.5.
I unmounted all partitions on all nodes in order to enable discontig-bg.
I then used tunefs to add discontig-bg, inline-data and indexed-dirs.
During indexed-dirs tunefs segfaulted and since then, fsck didn't work
anymore.
I managed to mount the partition again but after some errors like the
following
Jan 11 23:11:56 www1 kernel: [ 2339.642683]
(mc,3305,0):ocfs2_block_check_validate:443 ERROR: CRC32 failed: stored:
0x76176db1, computed 0x9e4c2434. Applying ECC.
Jan 11 23:11:56 www1 kernel: [ 2339.645074]
(mc,3305,0):ocfs2_block_check_validate:457 ERROR: Fixed CRC32 failed:
stored: 0x76176db1, computed 0x91119fb2
Jan 11 23:11:56 www1 kernel: [ 2339.647196]
(mc,3305,0):ocfs2_validate_extent_block:903 ERROR: Checksum failed for
extent block 6924877
Jan 11 23:11:56 www1 kernel: [ 2339.649212]
(mc,3305,0):__ocfs2_find_path:1837 ERROR: status = -5
Jan 11 23:11:56 www1 kernel: [ 2339.650409]
(mc,3305,0):ocfs2_remove_rightmost_path:3090 ERROR: status = -5
Jan 11 23:11:56 www1 kernel: [ 2339.651719]
(mc,3305,0):ocfs2_rotate_tree_left:3225 ERROR: status = -5
Jan 11 23:11:56 www1 kernel: [ 2339.653076]
(mc,3305,0):ocfs2_truncate_rec:5442 ERROR: status = -5
Jan 11 23:11:56 www1 kernel: [ 2339.654272]
(mc,3305,0):ocfs2_remove_extent:5526 ERROR: status = -5
Jan 11 23:11:56 www1 kernel: [ 2339.655531]
(mc,3305,0):ocfs2_remove_btree_range:5717 ERROR: status = -5
Jan 11 23:11:56 www1 kernel: [ 2339.656908]
(mc,3305,0):ocfs2_commit_truncate:7117 ERROR: status = -5
Jan 11 23:11:56 www1 kernel: [ 2339.658152]
(mc,3305,0):ocfs2_truncate_for_delete:622 ERROR: status = -5
Jan 11 23:11:56 www1 kernel: [ 2339.659423]
(mc,3305,0):ocfs2_wipe_inode:793 ERROR: status = -5
Jan 11 23:11:56 www1 kernel: [ 2339.660700]
(mc,3305,0):ocfs2_delete_inode:1085 ERROR: status = -5
Jan 11 23:15:41 www1 kernel: [ 2565.101905] OCFS2: ERROR (device drbd1):
ocfs2_commit_truncate: Inode 7418891 has an empty extent record, depth 2
Jan 11 23:15:41 www1 kernel: [ 2565.101908].
Jan 11 23:15:41 www1 kernel: [ 2565.105104] File system is now read-only
due to the potential of on-disk corruption. Please run fsck.ocfs2 once
the file system is unmounted.
Jan 11 23:15:41 www1 kernel: [ 2565.108155]
(kworker/u:3,3361,0):ocfs2_truncate_for_delete:622 ERROR: status = -30
Jan 11 23:15:41 www1 kernel: [ 2565.110190]
(kworker/u:3,3361,0):ocfs2_wipe_inode:793 ERROR: status = -30
Jan 11 23:15:41 www1 kernel: [ 2565.111772]
(kworker/u:3,3361,0):ocfs2_delete_inode:1085 ERROR: status = -30
Jan 11 23:15:41 www1 kernel: [ 2565.134131] OCFS2: ERROR (device drbd1):
ocfs2_commit_truncate: Inode 7418889 has an empty extent record, depth 2
Jan 11 23:15:41 www1 kernel: [ 2565.134133].
i wasn't able to mount the filesystem anymore in RW.
I could mount only in RO.
fsck was failing like this:
www1:~# fsck.ocfs2 -f /dev/drbd1
fsck.ocfs2 1.6.3
Checking OCFS2 filesystem in /dev/drbd1:
Label: www-code
UUID: 03F008AFA8BA458E9C8614A9B4A3E6E8
Number of blocks: 26213582
Block size: 2048
Number of clusters: 13106791
Cluster size: 4096
Number of slots: 8
/dev/drbd1 was run with -f, check forced.
Pass 0a: Checking cluster allocation chains
Pass 0b: Checking inode allocation chains
Pass 0c: Checking extent block allocation chains
Pass 1: Checking inodes and blocks.
extent.c: I/O error on channel reading extent block at 9590812 in owner
3231503 for verification
extent.c: I/O error on channel reading extent block at 6924320 in owner
3231503 for verification
pass1: I/O error on channel while iterating over the blocks for inode
3231503
fsck.ocfs2: I/O error on channel while performing pass 1
www1:~#
-----------------------------------------------
It was late and i didn't have time to investigate more on a production
server so i did a complete backup, used mkfs to wipe everything and
restore the backup.
I'm sorry i can't provide more data on the problem. I tried to google
and search the mailing list archives but i didn't find anything interesting.
Obviously i was quite disappointed by this problem and i hope those
informations may, in some way, help identifying and fix the problem.
Thanks for your work,
Massimo
^ permalink raw reply [flat|nested] 9+ messages in thread
* [Ocfs2-devel] Problems with fsck
2011-01-12 15:46 [Ocfs2-devel] Problems with fsck Massimo Cetra
@ 2011-01-12 16:16 ` Sunil Mushran
2011-01-13 19:19 ` Massimo Cetra
` (2 more replies)
0 siblings, 3 replies; 9+ messages in thread
From: Sunil Mushran @ 2011-01-12 16:16 UTC (permalink / raw)
To: ocfs2-devel
fsck is failing because it is encountering block(s) with incorrect
checksums. An easy solution is to disable checksums and rerun
fsck. Checksums can be renabled later.
The problem started with the segfault when activating indexed-dirs.
Do you have the coredump?
On 01/12/2011 07:46 AM, Massimo Cetra wrote:
> Hi List,
>
> i'd like to share with you what happened yesterday.
>
> Kernel 2.6.36.1
> ocfs2-tools 1.6.3 (latest).
>
> I had an old OCFS2 partition created with a 2.6.32 kernel and ocfs2
> tools 1.4.5.
>
> I unmounted all partitions on all nodes in order to enable discontig-bg.
>
> I then used tunefs to add discontig-bg, inline-data and indexed-dirs.
>
> During indexed-dirs tunefs segfaulted and since then, fsck didn't work
> anymore.
>
> I managed to mount the partition again but after some errors like the
> following
>
> Jan 11 23:11:56 www1 kernel: [ 2339.642683]
> (mc,3305,0):ocfs2_block_check_validate:443 ERROR: CRC32 failed: stored:
> 0x76176db1, computed 0x9e4c2434. Applying ECC.
> Jan 11 23:11:56 www1 kernel: [ 2339.645074]
> (mc,3305,0):ocfs2_block_check_validate:457 ERROR: Fixed CRC32 failed:
> stored: 0x76176db1, computed 0x91119fb2
> Jan 11 23:11:56 www1 kernel: [ 2339.647196]
> (mc,3305,0):ocfs2_validate_extent_block:903 ERROR: Checksum failed for
> extent block 6924877
> Jan 11 23:11:56 www1 kernel: [ 2339.649212]
> (mc,3305,0):__ocfs2_find_path:1837 ERROR: status = -5
> Jan 11 23:11:56 www1 kernel: [ 2339.650409]
> (mc,3305,0):ocfs2_remove_rightmost_path:3090 ERROR: status = -5
> Jan 11 23:11:56 www1 kernel: [ 2339.651719]
> (mc,3305,0):ocfs2_rotate_tree_left:3225 ERROR: status = -5
> Jan 11 23:11:56 www1 kernel: [ 2339.653076]
> (mc,3305,0):ocfs2_truncate_rec:5442 ERROR: status = -5
> Jan 11 23:11:56 www1 kernel: [ 2339.654272]
> (mc,3305,0):ocfs2_remove_extent:5526 ERROR: status = -5
> Jan 11 23:11:56 www1 kernel: [ 2339.655531]
> (mc,3305,0):ocfs2_remove_btree_range:5717 ERROR: status = -5
> Jan 11 23:11:56 www1 kernel: [ 2339.656908]
> (mc,3305,0):ocfs2_commit_truncate:7117 ERROR: status = -5
> Jan 11 23:11:56 www1 kernel: [ 2339.658152]
> (mc,3305,0):ocfs2_truncate_for_delete:622 ERROR: status = -5
> Jan 11 23:11:56 www1 kernel: [ 2339.659423]
> (mc,3305,0):ocfs2_wipe_inode:793 ERROR: status = -5
> Jan 11 23:11:56 www1 kernel: [ 2339.660700]
> (mc,3305,0):ocfs2_delete_inode:1085 ERROR: status = -5
>
>
> Jan 11 23:15:41 www1 kernel: [ 2565.101905] OCFS2: ERROR (device drbd1):
> ocfs2_commit_truncate: Inode 7418891 has an empty extent record, depth 2
> Jan 11 23:15:41 www1 kernel: [ 2565.101908].
> Jan 11 23:15:41 www1 kernel: [ 2565.105104] File system is now read-only
> due to the potential of on-disk corruption. Please run fsck.ocfs2 once
> the file system is unmounted.
> Jan 11 23:15:41 www1 kernel: [ 2565.108155]
> (kworker/u:3,3361,0):ocfs2_truncate_for_delete:622 ERROR: status = -30
> Jan 11 23:15:41 www1 kernel: [ 2565.110190]
> (kworker/u:3,3361,0):ocfs2_wipe_inode:793 ERROR: status = -30
> Jan 11 23:15:41 www1 kernel: [ 2565.111772]
> (kworker/u:3,3361,0):ocfs2_delete_inode:1085 ERROR: status = -30
> Jan 11 23:15:41 www1 kernel: [ 2565.134131] OCFS2: ERROR (device drbd1):
> ocfs2_commit_truncate: Inode 7418889 has an empty extent record, depth 2
> Jan 11 23:15:41 www1 kernel: [ 2565.134133].
>
> i wasn't able to mount the filesystem anymore in RW.
> I could mount only in RO.
>
> fsck was failing like this:
>
> www1:~# fsck.ocfs2 -f /dev/drbd1
> fsck.ocfs2 1.6.3
> Checking OCFS2 filesystem in /dev/drbd1:
> Label: www-code
> UUID: 03F008AFA8BA458E9C8614A9B4A3E6E8
> Number of blocks: 26213582
> Block size: 2048
> Number of clusters: 13106791
> Cluster size: 4096
> Number of slots: 8
>
> /dev/drbd1 was run with -f, check forced.
> Pass 0a: Checking cluster allocation chains
> Pass 0b: Checking inode allocation chains
> Pass 0c: Checking extent block allocation chains
> Pass 1: Checking inodes and blocks.
> extent.c: I/O error on channel reading extent block at 9590812 in owner
> 3231503 for verification
> extent.c: I/O error on channel reading extent block at 6924320 in owner
> 3231503 for verification
> pass1: I/O error on channel while iterating over the blocks for inode
> 3231503
> fsck.ocfs2: I/O error on channel while performing pass 1
> www1:~#
>
> -----------------------------------------------
>
> It was late and i didn't have time to investigate more on a production
> server so i did a complete backup, used mkfs to wipe everything and
> restore the backup.
>
> I'm sorry i can't provide more data on the problem. I tried to google
> and search the mailing list archives but i didn't find anything interesting.
>
> Obviously i was quite disappointed by this problem and i hope those
> informations may, in some way, help identifying and fix the problem.
>
> Thanks for your work,
>
> Massimo
>
>
>
>
>
>
>
> _______________________________________________
> Ocfs2-devel mailing list
> Ocfs2-devel at oss.oracle.com
> http://oss.oracle.com/mailman/listinfo/ocfs2-devel
^ permalink raw reply [flat|nested] 9+ messages in thread
* [Ocfs2-devel] Problems with fsck
2011-01-12 16:16 ` Sunil Mushran
@ 2011-01-13 19:19 ` Massimo Cetra
2011-01-13 20:25 ` Massimo Cetra
2011-01-14 8:03 ` Tao Ma
2 siblings, 0 replies; 9+ messages in thread
From: Massimo Cetra @ 2011-01-13 19:19 UTC (permalink / raw)
To: ocfs2-devel
On 12/01/2011 17:16, Sunil Mushran wrote:
> fsck is failing because it is encountering block(s) with incorrect
> checksums. An easy solution is to disable checksums and rerun
> fsck. Checksums can be renabled later.
>
Uh. I noticed that CRC32 errors from the kernel but i didn't
rearealizedlly that it was related to the problem.
fsck reporting I/O errors was driving me to something more serious
(broken disks) but obviously i didn't find anything about it on the
controller.
I would suggest to modify that error about I/O indicating some causes of
the problem
I would also ask how to disable checksums as i don't find it in the docs
(hope this informations will help others).
> The problem started with the segfault when activating indexed-dirs.
> Do you have the coredump?
No sorry, i wish i had.
Max
^ permalink raw reply [flat|nested] 9+ messages in thread
* [Ocfs2-devel] Problems with fsck
2011-01-12 16:16 ` Sunil Mushran
2011-01-13 19:19 ` Massimo Cetra
@ 2011-01-13 20:25 ` Massimo Cetra
2011-01-14 8:03 ` Tao Ma
2 siblings, 0 replies; 9+ messages in thread
From: Massimo Cetra @ 2011-01-13 20:25 UTC (permalink / raw)
To: ocfs2-devel
On 12/01/2011 17:16, Sunil Mushran wrote:
> fsck is failing because it is encountering block(s) with incorrect
> checksums. An easy solution is to disable checksums and rerun
> fsck. Checksums can be renabled later.
>
Uh. I noticed that CRC32 errors from the kernel but i didn't really
think that it was related to the problem.
fsck reporting I/O errors was driving me to something more serious
(broken disks) but obviously i didn't find anything about it on the
controller.
I would suggest to modify that error about I/O indicating some causes of
the problem
I would also ask how to disable checksums as i don't find it in the docs
(hope this informations will help others).
> The problem started with the segfault when activating indexed-dirs.
> Do you have the coredump?
No sorry, i wish i had.
Max
^ permalink raw reply [flat|nested] 9+ messages in thread
* [Ocfs2-devel] Problems with fsck
2011-01-12 16:16 ` Sunil Mushran
2011-01-13 19:19 ` Massimo Cetra
2011-01-13 20:25 ` Massimo Cetra
@ 2011-01-14 8:03 ` Tao Ma
2011-01-14 22:38 ` Sunil Mushran
2 siblings, 1 reply; 9+ messages in thread
From: Tao Ma @ 2011-01-14 8:03 UTC (permalink / raw)
To: ocfs2-devel
On 01/13/2011 12:16 AM, Sunil Mushran wrote:
> fsck is failing because it is encountering block(s) with incorrect
> checksums. An easy solution is to disable checksums and rerun
> fsck. Checksums can be renabled later.
>
> The problem started with the segfault when activating indexed-dirs.
> Do you have the coredump?
I met with segfault when enabling indexed-dirs severl months ago. They
are still pending for review and integration.
http://oss.oracle.com/pipermail/ocfs2-tools-devel/2010-September/003574.html
Regards,
Tao
>
> On 01/12/2011 07:46 AM, Massimo Cetra wrote:
>> Hi List,
>>
>> i'd like to share with you what happened yesterday.
>>
>> Kernel 2.6.36.1
>> ocfs2-tools 1.6.3 (latest).
>>
>> I had an old OCFS2 partition created with a 2.6.32 kernel and ocfs2
>> tools 1.4.5.
>>
>> I unmounted all partitions on all nodes in order to enable discontig-bg.
>>
>> I then used tunefs to add discontig-bg, inline-data and indexed-dirs.
>>
>> During indexed-dirs tunefs segfaulted and since then, fsck didn't work
>> anymore.
>>
>> I managed to mount the partition again but after some errors like the
>> following
>>
>> Jan 11 23:11:56 www1 kernel: [ 2339.642683]
>> (mc,3305,0):ocfs2_block_check_validate:443 ERROR: CRC32 failed: stored:
>> 0x76176db1, computed 0x9e4c2434. Applying ECC.
>> Jan 11 23:11:56 www1 kernel: [ 2339.645074]
>> (mc,3305,0):ocfs2_block_check_validate:457 ERROR: Fixed CRC32 failed:
>> stored: 0x76176db1, computed 0x91119fb2
>> Jan 11 23:11:56 www1 kernel: [ 2339.647196]
>> (mc,3305,0):ocfs2_validate_extent_block:903 ERROR: Checksum failed for
>> extent block 6924877
>> Jan 11 23:11:56 www1 kernel: [ 2339.649212]
>> (mc,3305,0):__ocfs2_find_path:1837 ERROR: status = -5
>> Jan 11 23:11:56 www1 kernel: [ 2339.650409]
>> (mc,3305,0):ocfs2_remove_rightmost_path:3090 ERROR: status = -5
>> Jan 11 23:11:56 www1 kernel: [ 2339.651719]
>> (mc,3305,0):ocfs2_rotate_tree_left:3225 ERROR: status = -5
>> Jan 11 23:11:56 www1 kernel: [ 2339.653076]
>> (mc,3305,0):ocfs2_truncate_rec:5442 ERROR: status = -5
>> Jan 11 23:11:56 www1 kernel: [ 2339.654272]
>> (mc,3305,0):ocfs2_remove_extent:5526 ERROR: status = -5
>> Jan 11 23:11:56 www1 kernel: [ 2339.655531]
>> (mc,3305,0):ocfs2_remove_btree_range:5717 ERROR: status = -5
>> Jan 11 23:11:56 www1 kernel: [ 2339.656908]
>> (mc,3305,0):ocfs2_commit_truncate:7117 ERROR: status = -5
>> Jan 11 23:11:56 www1 kernel: [ 2339.658152]
>> (mc,3305,0):ocfs2_truncate_for_delete:622 ERROR: status = -5
>> Jan 11 23:11:56 www1 kernel: [ 2339.659423]
>> (mc,3305,0):ocfs2_wipe_inode:793 ERROR: status = -5
>> Jan 11 23:11:56 www1 kernel: [ 2339.660700]
>> (mc,3305,0):ocfs2_delete_inode:1085 ERROR: status = -5
>>
>>
>> Jan 11 23:15:41 www1 kernel: [ 2565.101905] OCFS2: ERROR (device drbd1):
>> ocfs2_commit_truncate: Inode 7418891 has an empty extent record, depth 2
>> Jan 11 23:15:41 www1 kernel: [ 2565.101908].
>> Jan 11 23:15:41 www1 kernel: [ 2565.105104] File system is now read-only
>> due to the potential of on-disk corruption. Please run fsck.ocfs2 once
>> the file system is unmounted.
>> Jan 11 23:15:41 www1 kernel: [ 2565.108155]
>> (kworker/u:3,3361,0):ocfs2_truncate_for_delete:622 ERROR: status = -30
>> Jan 11 23:15:41 www1 kernel: [ 2565.110190]
>> (kworker/u:3,3361,0):ocfs2_wipe_inode:793 ERROR: status = -30
>> Jan 11 23:15:41 www1 kernel: [ 2565.111772]
>> (kworker/u:3,3361,0):ocfs2_delete_inode:1085 ERROR: status = -30
>> Jan 11 23:15:41 www1 kernel: [ 2565.134131] OCFS2: ERROR (device drbd1):
>> ocfs2_commit_truncate: Inode 7418889 has an empty extent record, depth 2
>> Jan 11 23:15:41 www1 kernel: [ 2565.134133].
>>
>> i wasn't able to mount the filesystem anymore in RW.
>> I could mount only in RO.
>>
>> fsck was failing like this:
>>
>> www1:~# fsck.ocfs2 -f /dev/drbd1
>> fsck.ocfs2 1.6.3
>> Checking OCFS2 filesystem in /dev/drbd1:
>> Label: www-code
>> UUID: 03F008AFA8BA458E9C8614A9B4A3E6E8
>> Number of blocks: 26213582
>> Block size: 2048
>> Number of clusters: 13106791
>> Cluster size: 4096
>> Number of slots: 8
>>
>> /dev/drbd1 was run with -f, check forced.
>> Pass 0a: Checking cluster allocation chains
>> Pass 0b: Checking inode allocation chains
>> Pass 0c: Checking extent block allocation chains
>> Pass 1: Checking inodes and blocks.
>> extent.c: I/O error on channel reading extent block at 9590812 in owner
>> 3231503 for verification
>> extent.c: I/O error on channel reading extent block at 6924320 in owner
>> 3231503 for verification
>> pass1: I/O error on channel while iterating over the blocks for inode
>> 3231503
>> fsck.ocfs2: I/O error on channel while performing pass 1
>> www1:~#
>>
>> -----------------------------------------------
>>
>> It was late and i didn't have time to investigate more on a production
>> server so i did a complete backup, used mkfs to wipe everything and
>> restore the backup.
>>
>> I'm sorry i can't provide more data on the problem. I tried to google
>> and search the mailing list archives but i didn't find anything interesting.
>>
>> Obviously i was quite disappointed by this problem and i hope those
>> informations may, in some way, help identifying and fix the problem.
>>
>> Thanks for your work,
>>
>> Massimo
>>
>>
>>
>>
>>
>>
>>
>> _______________________________________________
>> Ocfs2-devel mailing list
>> Ocfs2-devel at oss.oracle.com
>> http://oss.oracle.com/mailman/listinfo/ocfs2-devel
>
>
> _______________________________________________
> Ocfs2-devel mailing list
> Ocfs2-devel at oss.oracle.com
> http://oss.oracle.com/mailman/listinfo/ocfs2-devel
^ permalink raw reply [flat|nested] 9+ messages in thread
* [Ocfs2-devel] Problems with fsck
2011-01-14 8:03 ` Tao Ma
@ 2011-01-14 22:38 ` Sunil Mushran
2011-01-18 5:47 ` Tiger Yang
0 siblings, 1 reply; 9+ messages in thread
From: Sunil Mushran @ 2011-01-14 22:38 UTC (permalink / raw)
To: ocfs2-devel
Tao,
Thanks. Mark signed off the first two. I can sign off the other two.
Tiger,
Please can you take these four patches and test enabling indexdirs
on a volume. Please do this ASAP. I want this in the git tree by early
next week.
Thanks
Sunil
On 01/14/2011 12:03 AM, Tao Ma wrote:
> On 01/13/2011 12:16 AM, Sunil Mushran wrote:
>> fsck is failing because it is encountering block(s) with incorrect
>> checksums. An easy solution is to disable checksums and rerun
>> fsck. Checksums can be renabled later.
>>
>> The problem started with the segfault when activating indexed-dirs.
>> Do you have the coredump?
> I met with segfault when enabling indexed-dirs severl months ago. They are still pending for review and integration.
> http://oss.oracle.com/pipermail/ocfs2-tools-devel/2010-September/003574.html
>
> Regards,
> Tao
>>
>> On 01/12/2011 07:46 AM, Massimo Cetra wrote:
>>> Hi List,
>>>
>>> i'd like to share with you what happened yesterday.
>>>
>>> Kernel 2.6.36.1
>>> ocfs2-tools 1.6.3 (latest).
>>>
>>> I had an old OCFS2 partition created with a 2.6.32 kernel and ocfs2
>>> tools 1.4.5.
>>>
>>> I unmounted all partitions on all nodes in order to enable discontig-bg.
>>>
>>> I then used tunefs to add discontig-bg, inline-data and indexed-dirs.
>>>
>>> During indexed-dirs tunefs segfaulted and since then, fsck didn't work
>>> anymore.
>>>
>>> I managed to mount the partition again but after some errors like the
>>> following
>>>
>>> Jan 11 23:11:56 www1 kernel: [ 2339.642683]
>>> (mc,3305,0):ocfs2_block_check_validate:443 ERROR: CRC32 failed: stored:
>>> 0x76176db1, computed 0x9e4c2434. Applying ECC.
>>> Jan 11 23:11:56 www1 kernel: [ 2339.645074]
>>> (mc,3305,0):ocfs2_block_check_validate:457 ERROR: Fixed CRC32 failed:
>>> stored: 0x76176db1, computed 0x91119fb2
>>> Jan 11 23:11:56 www1 kernel: [ 2339.647196]
>>> (mc,3305,0):ocfs2_validate_extent_block:903 ERROR: Checksum failed for
>>> extent block 6924877
>>> Jan 11 23:11:56 www1 kernel: [ 2339.649212]
>>> (mc,3305,0):__ocfs2_find_path:1837 ERROR: status = -5
>>> Jan 11 23:11:56 www1 kernel: [ 2339.650409]
>>> (mc,3305,0):ocfs2_remove_rightmost_path:3090 ERROR: status = -5
>>> Jan 11 23:11:56 www1 kernel: [ 2339.651719]
>>> (mc,3305,0):ocfs2_rotate_tree_left:3225 ERROR: status = -5
>>> Jan 11 23:11:56 www1 kernel: [ 2339.653076]
>>> (mc,3305,0):ocfs2_truncate_rec:5442 ERROR: status = -5
>>> Jan 11 23:11:56 www1 kernel: [ 2339.654272]
>>> (mc,3305,0):ocfs2_remove_extent:5526 ERROR: status = -5
>>> Jan 11 23:11:56 www1 kernel: [ 2339.655531]
>>> (mc,3305,0):ocfs2_remove_btree_range:5717 ERROR: status = -5
>>> Jan 11 23:11:56 www1 kernel: [ 2339.656908]
>>> (mc,3305,0):ocfs2_commit_truncate:7117 ERROR: status = -5
>>> Jan 11 23:11:56 www1 kernel: [ 2339.658152]
>>> (mc,3305,0):ocfs2_truncate_for_delete:622 ERROR: status = -5
>>> Jan 11 23:11:56 www1 kernel: [ 2339.659423]
>>> (mc,3305,0):ocfs2_wipe_inode:793 ERROR: status = -5
>>> Jan 11 23:11:56 www1 kernel: [ 2339.660700]
>>> (mc,3305,0):ocfs2_delete_inode:1085 ERROR: status = -5
>>>
>>>
>>> Jan 11 23:15:41 www1 kernel: [ 2565.101905] OCFS2: ERROR (device drbd1):
>>> ocfs2_commit_truncate: Inode 7418891 has an empty extent record, depth 2
>>> Jan 11 23:15:41 www1 kernel: [ 2565.101908].
>>> Jan 11 23:15:41 www1 kernel: [ 2565.105104] File system is now read-only
>>> due to the potential of on-disk corruption. Please run fsck.ocfs2 once
>>> the file system is unmounted.
>>> Jan 11 23:15:41 www1 kernel: [ 2565.108155]
>>> (kworker/u:3,3361,0):ocfs2_truncate_for_delete:622 ERROR: status = -30
>>> Jan 11 23:15:41 www1 kernel: [ 2565.110190]
>>> (kworker/u:3,3361,0):ocfs2_wipe_inode:793 ERROR: status = -30
>>> Jan 11 23:15:41 www1 kernel: [ 2565.111772]
>>> (kworker/u:3,3361,0):ocfs2_delete_inode:1085 ERROR: status = -30
>>> Jan 11 23:15:41 www1 kernel: [ 2565.134131] OCFS2: ERROR (device drbd1):
>>> ocfs2_commit_truncate: Inode 7418889 has an empty extent record, depth 2
>>> Jan 11 23:15:41 www1 kernel: [ 2565.134133].
>>>
>>> i wasn't able to mount the filesystem anymore in RW.
>>> I could mount only in RO.
>>>
>>> fsck was failing like this:
>>>
>>> www1:~# fsck.ocfs2 -f /dev/drbd1
>>> fsck.ocfs2 1.6.3
>>> Checking OCFS2 filesystem in /dev/drbd1:
>>> Label: www-code
>>> UUID: 03F008AFA8BA458E9C8614A9B4A3E6E8
>>> Number of blocks: 26213582
>>> Block size: 2048
>>> Number of clusters: 13106791
>>> Cluster size: 4096
>>> Number of slots: 8
>>>
>>> /dev/drbd1 was run with -f, check forced.
>>> Pass 0a: Checking cluster allocation chains
>>> Pass 0b: Checking inode allocation chains
>>> Pass 0c: Checking extent block allocation chains
>>> Pass 1: Checking inodes and blocks.
>>> extent.c: I/O error on channel reading extent block at 9590812 in owner
>>> 3231503 for verification
>>> extent.c: I/O error on channel reading extent block at 6924320 in owner
>>> 3231503 for verification
>>> pass1: I/O error on channel while iterating over the blocks for inode
>>> 3231503
>>> fsck.ocfs2: I/O error on channel while performing pass 1
>>> www1:~#
>>>
>>> -----------------------------------------------
>>>
>>> It was late and i didn't have time to investigate more on a production
>>> server so i did a complete backup, used mkfs to wipe everything and
>>> restore the backup.
>>>
>>> I'm sorry i can't provide more data on the problem. I tried to google
>>> and search the mailing list archives but i didn't find anything interesting.
>>>
>>> Obviously i was quite disappointed by this problem and i hope those
>>> informations may, in some way, help identifying and fix the problem.
>>>
>>> Thanks for your work,
>>>
>>> Massimo
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>> _______________________________________________
>>> Ocfs2-devel mailing list
>>> Ocfs2-devel at oss.oracle.com
>>> http://oss.oracle.com/mailman/listinfo/ocfs2-devel
>>
>>
>> _______________________________________________
>> Ocfs2-devel mailing list
>> Ocfs2-devel at oss.oracle.com
>> http://oss.oracle.com/mailman/listinfo/ocfs2-devel
>
^ permalink raw reply [flat|nested] 9+ messages in thread
* [Ocfs2-devel] Problems with fsck
2011-01-14 22:38 ` Sunil Mushran
@ 2011-01-18 5:47 ` Tiger Yang
2011-01-18 18:15 ` Sunil Mushran
0 siblings, 1 reply; 9+ messages in thread
From: Tiger Yang @ 2011-01-18 5:47 UTC (permalink / raw)
To: ocfs2-devel
On 01/15/2011 06:38 AM, Sunil Mushran wrote:
> Tiger,
> Please can you take these four patches and test enabling indexdirs
> on a volume. Please do this ASAP. I want this in the git tree by early
> next week.
Hi, Sunil,
I have finished the testing. If enable indexed-dirs on a volume which
has metaecc, tunefs.ocfs2 will get segment fault. Tao's patch fixes this
problem.
Regards,
Tiger
^ permalink raw reply [flat|nested] 9+ messages in thread
* [Ocfs2-devel] Problems with fsck
2011-01-18 5:47 ` Tiger Yang
@ 2011-01-18 18:15 ` Sunil Mushran
2011-01-21 17:29 ` Joel Becker
0 siblings, 1 reply; 9+ messages in thread
From: Sunil Mushran @ 2011-01-18 18:15 UTC (permalink / raw)
To: ocfs2-devel
On 01/17/2011 09:47 PM, Tiger Yang wrote:
> On 01/15/2011 06:38 AM, Sunil Mushran wrote:
>> Tiger,
>> Please can you take these four patches and test enabling indexdirs
>> on a volume. Please do this ASAP. I want this in the git tree by early
>> next week.
> Hi, Sunil,
>
> I have finished the testing. If enable indexed-dirs on a volume which has metaecc, tunefs.ocfs2 will get segment fault. Tao's patch fixes this problem.
Thanks. Add the acks and push them to the git repo.
^ permalink raw reply [flat|nested] 9+ messages in thread
* [Ocfs2-devel] Problems with fsck
2011-01-18 18:15 ` Sunil Mushran
@ 2011-01-21 17:29 ` Joel Becker
0 siblings, 0 replies; 9+ messages in thread
From: Joel Becker @ 2011-01-21 17:29 UTC (permalink / raw)
To: ocfs2-devel
On Tue, Jan 18, 2011 at 10:15:35AM -0800, Sunil Mushran wrote:
> On 01/17/2011 09:47 PM, Tiger Yang wrote:
> > On 01/15/2011 06:38 AM, Sunil Mushran wrote:
> >> Tiger,
> >> Please can you take these four patches and test enabling indexdirs
> >> on a volume. Please do this ASAP. I want this in the git tree by early
> >> next week.
> > Hi, Sunil,
> >
> > I have finished the testing. If enable indexed-dirs on a volume which has metaecc, tunefs.ocfs2 will get segment fault. Tao's patch fixes this problem.
>
> Thanks. Add the acks and push them to the git repo.
Yeah, they all look good.
Joel
--
^ permalink raw reply [flat|nested] 9+ messages in thread
end of thread, other threads:[~2011-01-21 17:29 UTC | newest]
Thread overview: 9+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2011-01-12 15:46 [Ocfs2-devel] Problems with fsck Massimo Cetra
2011-01-12 16:16 ` Sunil Mushran
2011-01-13 19:19 ` Massimo Cetra
2011-01-13 20:25 ` Massimo Cetra
2011-01-14 8:03 ` Tao Ma
2011-01-14 22:38 ` Sunil Mushran
2011-01-18 5:47 ` Tiger Yang
2011-01-18 18:15 ` Sunil Mushran
2011-01-21 17:29 ` Joel Becker
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.