From: Sander Eikelenboom <linux@eikelenboom.it> To: Theodore Tso <tytso@MIT.EDU> Cc: linux-ext4@vger.kernel.org, linux-kernel@vger.kernel.org Subject: Re: can't recover ext4 on lvm from ext4_mb_generate_buddy:739: group 1687, 32254 clusters in bitmap, 32258 in gd Date: Thu, 5 Jan 2012 15:52:18 +0100 [thread overview] Message-ID: <903212040.20120105155218@eikelenboom.it> (raw) In-Reply-To: <6FC155DD-80C1-4088-B745-6B74D9D5AA48@mit.edu> Thursday, January 5, 2012, 3:45:01 PM, you wrote: > On Jan 5, 2012, at 8:21 AM, Sander Eikelenboom wrote: >> Hmm it seems to be over by reverting from a 3.2.0 to a 3.1.5 kernel, i now can copy the files after the fsck without it being remounted-ro due to the error. > Hmm… So the question is whether this is caused by changes to ext4 or in the device-mapper / LVM. > The error which ext4 is reporting is that a block bitmap appears to be corrupted; the block group descriptors are reporting that there are 32258 free blocks, while only 32254 free blocks are found in the block bitmap. Since one or the other is must be wrong, and continuing could potentially cause data loss, the file system gets mounted remounted read-only. > What's funny is that fsck didn't report anything wrong. That implies that the LVM volume is returning different block contents, at least under some circumstances. > Hmm…. can you try reproducing this? What happens if you now reboot into 3.2? Do you still get the file system getting remounted read-only? Can you try running dumpe2fs on the file system before and after running e2fsck, and when you try to reproduce it, can you make a special note of the EXT4-fs error message: > [ 220.748928] EXT4-fs error (device dm-2): ext4_mb_generate_buddy:739: group 1687, 32254 clusters in bitmap, 32258 in gd > Do the numbers stay the same each time you reproduce the problem? And are there any changes in the output of dumpe2fs (run diff; it will probably be a very tiny difference). > Also, what is the underlying devices underlying the LVM? Are you using a MD device? Or is the 200T volume spread out across multiple hard drives directly (i.e., no RAID)? > -- Ted Yes well, since it worked again, i continued reshuffling the lvm layout that was planned for today, but this specific LV is still there, so i will try booting 3.2 again to see if i can reproduce and do the things you asked for. No MD is used, it's one sata disk, containing 2 partitions (boot and a lvm PV), it has one VG that has the PV. That one is split into multiple LV's. >> >> -- >> Sander >> >> >> This is a forwarded message >> From: Sander Eikelenboom <linux@eikelenboom.it> >> To: "Theodore Ts'o" <tytso@mit.edu> >> Date: Thursday, January 5, 2012, 11:37:59 AM >> Subject: can't recover ext4 on lvm from ext4_mb_generate_buddy:739: group 1687, 32254 clusters in bitmap, 32258 in gd >> >> ===8<==============Original message text=============== >> >> I'm having some troubles with a ext4 filesystem on LVM, it seems bricked and fsck doesn't seem to find and correct the problem. >> >> Steps: >> 1) fsck -v -p -f the filesystem >> 2) mount the filesystem >> 3) Try to copy a file >> 4) filesystem will be mounted RO on error (see below) >> 5) fsck again, journal will be recovered, no other errors >> 6) start at 1) >> >> >> I think the way i bricked it is: >> - make a lvm snapshot from that lvm logical disk >> - mount that lvm snapshot as RO >> - try to copy a file from that mounted RO snapshot to a diffrent dir on the lvm logical disk the snapshot is from. >> - it fails and i can't recover (see above) >> >> >> Is there a way to recover from this ? >> >> >> >> [ 220.748928] EXT4-fs error (device dm-2): ext4_mb_generate_buddy:739: group 1687, 32254 clusters in bitmap, 32258 in gd >> [ 220.749415] Aborting journal on device dm-2-8. >> [ 220.771633] EXT4-fs error (device dm-2): ext4_journal_start_sb:327: Detected aborted journal >> [ 220.772593] EXT4-fs (dm-2): Remounting filesystem read-only >> [ 220.792455] EXT4-fs (dm-2): Remounting filesystem read-only >> [ 220.805118] EXT4-fs (dm-2): ext4_da_writepages: jbd2_start: 9680 pages, ino 4079617; err -30 >> serveerstertje:/mnt/xen_images/domains/production# cd / >> serveerstertje:/# umount /mnt/xen_images/ >> serveerstertje:/# fsck -f -v -p /dev/serveerstertje/xen_images >> fsck from util-linux-ng 2.17.2 >> /dev/mapper/serveerstertje-xen_images: recovering journal >> >> 277 inodes used (0.00%) >> 5 non-contiguous files (1.8%) >> 0 non-contiguous directories (0.0%) >> # of inodes with ind/dind/tind blocks: 41/41/3 >> Extent depth histogram: 69/28/2 >> 51890920 blocks used (79.18%) >> 0 bad blocks >> 41 large files >> >> 199 regular files >> 53 directories >> 0 character device files >> 0 block device files >> 0 fifos >> 0 links >> 16 symbolic links (16 fast symbolic links) >> 0 sockets >> -------- >> 268 files >> serveerstertje:/# >> >> >> >> >> System: >> - Kernel 3.2.0 >> - Debian Squeeze with: >> ii e2fslibs 1.41.12-4stable1 ext2/ext3/ext4 file system libraries >> ii e2fsprogs 1.41.12-4stable1 ext2/ext3/ext4 file system utilities >> >> ===8<===========End of original message text=========== >> >> >> >> -- >> Best regards, >> Sander mailto:linux@eikelenboom.it<Message01.eml> -- Best regards, Sander mailto:linux@eikelenboom.it
WARNING: multiple messages have this Message-ID (diff)
From: Sander Eikelenboom <linux@eikelenboom.it> To: Theodore Tso <tytso@MIT.EDU> Cc: linux-ext4@vger.kernel.org, linux-kernel@vger.kernel.org Subject: Re: can't recover ext4 on lvm from ext4_mb_generate_buddy:739: group 1687, 32254 clusters in bitmap, 32258 in gd Date: Thu, 5 Jan 2012 15:52:18 +0100 [thread overview] Message-ID: <903212040.20120105155218@eikelenboom.it> (raw) In-Reply-To: <6FC155DD-80C1-4088-B745-6B74D9D5AA48@mit.edu> Thursday, January 5, 2012, 3:45:01 PM, you wrote: > On Jan 5, 2012, at 8:21 AM, Sander Eikelenboom wrote: >> Hmm it seems to be over by reverting from a 3.2.0 to a 3.1.5 kernel, i now can copy the files after the fsck without it being remounted-ro due to the error. > Hmm… So the question is whether this is caused by changes to ext4 or in the device-mapper / LVM. > The error which ext4 is reporting is that a block bitmap appears to be corrupted; the block group descriptors are reporting that there are 32258 free blocks, while only 32254 free blocks are found in the block bitmap. Since one or the other is must be wrong, and continuing could potentially cause data loss, the file system gets mounted remounted read-only. > What's funny is that fsck didn't report anything wrong. That implies that the LVM volume is returning different block contents, at least under some circumstances. > Hmm…. can you try reproducing this? What happens if you now reboot into 3.2? Do you still get the file system getting remounted read-only? Can you try running dumpe2fs on the file system before and after running e2fsck, and when you try to reproduce it, can you make a special note of the EXT4-fs error message: > [ 220.748928] EXT4-fs error (device dm-2): ext4_mb_generate_buddy:739: group 1687, 32254 clusters in bitmap, 32258 in gd > Do the numbers stay the same each time you reproduce the problem? And are there any changes in the output of dumpe2fs (run diff; it will probably be a very tiny difference). > Also, what is the underlying devices underlying the LVM? Are you using a MD device? Or is the 200T volume spread out across multiple hard drives directly (i.e., no RAID)? > -- Ted Yes well, since it worked again, i continued reshuffling the lvm layout that was planned for today, but this specific LV is still there, so i will try booting 3.2 again to see if i can reproduce and do the things you asked for. No MD is used, it's one sata disk, containing 2 partitions (boot and a lvm PV), it has one VG that has the PV. That one is split into multiple LV's. >> >> -- >> Sander >> >> >> This is a forwarded message >> From: Sander Eikelenboom <linux@eikelenboom.it> >> To: "Theodore Ts'o" <tytso@mit.edu> >> Date: Thursday, January 5, 2012, 11:37:59 AM >> Subject: can't recover ext4 on lvm from ext4_mb_generate_buddy:739: group 1687, 32254 clusters in bitmap, 32258 in gd >> >> ===8<==============Original message text=============== >> >> I'm having some troubles with a ext4 filesystem on LVM, it seems bricked and fsck doesn't seem to find and correct the problem. >> >> Steps: >> 1) fsck -v -p -f the filesystem >> 2) mount the filesystem >> 3) Try to copy a file >> 4) filesystem will be mounted RO on error (see below) >> 5) fsck again, journal will be recovered, no other errors >> 6) start at 1) >> >> >> I think the way i bricked it is: >> - make a lvm snapshot from that lvm logical disk >> - mount that lvm snapshot as RO >> - try to copy a file from that mounted RO snapshot to a diffrent dir on the lvm logical disk the snapshot is from. >> - it fails and i can't recover (see above) >> >> >> Is there a way to recover from this ? >> >> >> >> [ 220.748928] EXT4-fs error (device dm-2): ext4_mb_generate_buddy:739: group 1687, 32254 clusters in bitmap, 32258 in gd >> [ 220.749415] Aborting journal on device dm-2-8. >> [ 220.771633] EXT4-fs error (device dm-2): ext4_journal_start_sb:327: Detected aborted journal >> [ 220.772593] EXT4-fs (dm-2): Remounting filesystem read-only >> [ 220.792455] EXT4-fs (dm-2): Remounting filesystem read-only >> [ 220.805118] EXT4-fs (dm-2): ext4_da_writepages: jbd2_start: 9680 pages, ino 4079617; err -30 >> serveerstertje:/mnt/xen_images/domains/production# cd / >> serveerstertje:/# umount /mnt/xen_images/ >> serveerstertje:/# fsck -f -v -p /dev/serveerstertje/xen_images >> fsck from util-linux-ng 2.17.2 >> /dev/mapper/serveerstertje-xen_images: recovering journal >> >> 277 inodes used (0.00%) >> 5 non-contiguous files (1.8%) >> 0 non-contiguous directories (0.0%) >> # of inodes with ind/dind/tind blocks: 41/41/3 >> Extent depth histogram: 69/28/2 >> 51890920 blocks used (79.18%) >> 0 bad blocks >> 41 large files >> >> 199 regular files >> 53 directories >> 0 character device files >> 0 block device files >> 0 fifos >> 0 links >> 16 symbolic links (16 fast symbolic links) >> 0 sockets >> -------- >> 268 files >> serveerstertje:/# >> >> >> >> >> System: >> - Kernel 3.2.0 >> - Debian Squeeze with: >> ii e2fslibs 1.41.12-4stable1 ext2/ext3/ext4 file system libraries >> ii e2fsprogs 1.41.12-4stable1 ext2/ext3/ext4 file system utilities >> >> ===8<===========End of original message text=========== >> >> >> >> -- >> Best regards, >> Sander mailto:linux@eikelenboom.it<Message01.eml> -- Best regards, Sander mailto:linux@eikelenboom.it -- To unsubscribe from this list: send the line "unsubscribe linux-ext4" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
next prev parent reply other threads:[~2012-01-05 14:52 UTC|newest] Thread overview: 26+ messages / expand[flat|nested] mbox.gz Atom feed top 2012-01-05 10:37 can't recover ext4 on lvm from ext4_mb_generate_buddy:739: group 1687, 32254 clusters in bitmap, 32258 in gd Sander Eikelenboom 2012-01-05 13:21 ` Sander Eikelenboom 2012-01-05 14:45 ` Theodore Tso 2012-01-05 14:45 ` Theodore Tso 2012-01-05 14:52 ` Sander Eikelenboom [this message] 2012-01-05 14:52 ` Sander Eikelenboom 2012-01-05 15:46 ` Sander Eikelenboom 2012-01-05 15:46 ` Sander Eikelenboom [not found] ` <4910694144.20120105171428@eikelenboom.it> 2012-01-05 18:15 ` Ted Ts'o 2012-01-05 20:04 ` Sander Eikelenboom 2012-01-05 20:04 ` Sander Eikelenboom 2012-01-05 20:45 ` Sander Eikelenboom 2012-01-05 20:45 ` Sander Eikelenboom 2012-01-05 21:31 ` Sander Eikelenboom 2012-01-05 21:31 ` Sander Eikelenboom 2012-01-05 22:43 ` Sander Eikelenboom 2012-01-05 22:43 ` Sander Eikelenboom 2012-01-06 16:40 ` [dm-devel] " Mikulas Patocka 2012-01-28 4:53 ` WIMPy 2012-01-28 8:14 ` WIMPy 2012-01-28 8:34 ` Andreas Dilger 2012-01-28 15:31 ` WIMPy 2012-01-28 21:04 ` WIMPy 2012-02-03 5:30 ` WIMPy 2012-04-12 6:45 ` Landry Minoza 2012-04-12 6:45 ` Landry Minoza
Reply instructions: You may reply publicly to this message via plain-text email using any one of the following methods: * Save the following mbox file, import it into your mail client, and reply-to-all from there: mbox Avoid top-posting and favor interleaved quoting: https://en.wikipedia.org/wiki/Posting_style#Interleaved_style * Reply using the --to, --cc, and --in-reply-to switches of git-send-email(1): git send-email \ --in-reply-to=903212040.20120105155218@eikelenboom.it \ --to=linux@eikelenboom.it \ --cc=linux-ext4@vger.kernel.org \ --cc=linux-kernel@vger.kernel.org \ --cc=tytso@MIT.EDU \ /path/to/YOUR_REPLY https://kernel.org/pub/software/scm/git/docs/git-send-email.html * If your mail client supports setting the In-Reply-To header via mailto: links, try the mailto: linkBe sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes, see mirroring instructions on how to clone and mirror all data and code used by this external index.