* massive filesystem corruption with 2.4.9 @ 2001-08-21 8:00 Kristian 2001-08-21 8:34 ` Christian Widmer 0 siblings, 1 reply; 9+ messages in thread From: Kristian @ 2001-08-21 8:00 UTC (permalink / raw) To: linux-kernel Hello. Since linux-2.4.5 always the same errors occur sporadically after the cold boot in the morning. (My computer is powered off during the night.) Every second day I noticed my syslog sais something like the following: Aug 21 09:01:06 adlib kernel: EXT2-fs error (device ide0(3,5)): ext2_new_block: Allocating block in system zone - block = 3 Aug 21 09:01:06 adlib kernel: EXT2-fs error (device ide0(3,5)): ext2_free_blocks: Freeing blocks in system zones - Block = 4, count = 1 Aug 21 09:01:06 adlib kernel: EXT2-fs error (device ide0(3,5)): ext2_new_block: Allocating block in system zone - block = 37 Aug 21 09:01:06 adlib kernel: EXT2-fs error (device ide0(3,5)): ext2_new_block: Allocating block in system zone - block = 45 Aug 21 09:01:07 adlib kernel: mtrr: base(0x42000000) is not aligned on a size(0x1800000) boundary Aug 21 09:01:09 adlib last message repeated 2 times Aug 21 09:01:26 adlib PAM_unix[1929]: (login) session opened for user root by LOGIN(uid=0) Aug 21 09:01:26 adlib -- root[1929]: ROOT LOGIN ON tty1 Aug 21 09:01:30 adlib kernel: EXT2-fs error (device ide0(3,5)): ext2_free_blocks: Freeing blocks in system zones - Block = 41, count = 4 Aug 21 09:01:30 adlib kernel: EXT2-fs error (device ide0(3,5)): ext2_new_block: Allocating block in system zone - block = 4 Aug 21 09:01:30 adlib kernel: EXT2-fs error (device ide0(3,5)): ext2_new_block: Allocating block in system zone - block = 7 Aug 21 09:01:30 adlib kernel: EXT2-fs error (device ide0(3,5)): ext2_free_blocks: Freeing blocks in system zones - Block = 8, count = 2 Today it destroyed my super block and all my root-directories were placed in /lost+found. I rescued everything with e2fsck-1.14 from a very old rescue-disk and then again with 1.23, renaming and replacing the directories by hand. A lot of devices and some .h-files were not recoverable. These fatal errors are occuring since 2.4.5 (2.4.8 I've not tested.). When I work with 2.4.4 everything is fine ! I already use the newest version of e2fsck (1.23) and util-linux (2.11f). My RedHat (Rotkäppchen) 6.2 is rather old, but I don't like gcc 2.96 at all. I posted this report as the errors occured after a complete crash with 2.4.6 also to the ext2-developers directly but they didn't answered. Maybe you could help me here ? Kristian ·· · · reach me :: · ·· ·· · · ·· · ·· · ··· · · :: http://www.korseby.net :: http://www.tomlab.de kristian@korseby.net ....:: ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: massive filesystem corruption with 2.4.9 2001-08-21 8:00 massive filesystem corruption with 2.4.9 Kristian @ 2001-08-21 8:34 ` Christian Widmer 2001-08-21 10:14 ` Kristian 0 siblings, 1 reply; 9+ messages in thread From: Christian Widmer @ 2001-08-21 8:34 UTC (permalink / raw) To: Kristian; +Cc: linux-kernel i had similar problems with 2.4.6. unfortunately i didn't save the errors so i can't compare the msg's. i just can say that with 2.4.6 it destroyed the ext2 on a 40GB and 60GB maxtor disk. since then my nfs server is running 2.2.19 with works fine (with minor promblems*). * after a client mounted an exprots once. i cant unmount that partition on the server after the client unmounted the exports. On Tuesday 21 August 2001 10:00, Kristian wrote: > Hello. > > Since linux-2.4.5 always the same errors occur sporadically after the cold > boot in the morning. (My computer is powered off during the night.) Every > second day I noticed my syslog sais something like the following: > > Aug 21 09:01:06 adlib kernel: EXT2-fs error (device ide0(3,5)): > ext2_new_block: Allocating block in system zone - block = 3 > Aug 21 09:01:06 adlib kernel: EXT2-fs error (device ide0(3,5)): > ext2_free_blocks: Freeing blocks in system zones - Block = 4, count = 1 > Aug 21 09:01:06 adlib kernel: EXT2-fs error (device ide0(3,5)): > ext2_new_block: Allocating block in system zone - block = 37 > Aug 21 09:01:06 adlib kernel: EXT2-fs error (device ide0(3,5)): > ext2_new_block: Allocating block in system zone - block = 45 > Aug 21 09:01:07 adlib kernel: mtrr: base(0x42000000) is not aligned on a > size(0x1800000) boundary > Aug 21 09:01:09 adlib last message repeated 2 times > Aug 21 09:01:26 adlib PAM_unix[1929]: (login) session opened for user root > by LOGIN(uid=0) > Aug 21 09:01:26 adlib -- root[1929]: ROOT LOGIN ON tty1 > Aug 21 09:01:30 adlib kernel: EXT2-fs error (device ide0(3,5)): > ext2_free_blocks: Freeing blocks in system zones - Block = 41, count = 4 > Aug 21 09:01:30 adlib kernel: EXT2-fs error (device ide0(3,5)): > ext2_new_block: Allocating block in system zone - block = 4 > Aug 21 09:01:30 adlib kernel: EXT2-fs error (device ide0(3,5)): > ext2_new_block: Allocating block in system zone - block = 7 > Aug 21 09:01:30 adlib kernel: EXT2-fs error (device ide0(3,5)): > ext2_free_blocks: Freeing blocks in system zones - Block = 8, count = 2 > > Today it destroyed my super block and all my root-directories were placed > in /lost+found. I rescued everything with e2fsck-1.14 from a very old > rescue-disk and then again with 1.23, renaming and replacing the > directories by hand. A lot of devices and some .h-files were not > recoverable. > > These fatal errors are occuring since 2.4.5 (2.4.8 I've not tested.). When > I work with 2.4.4 everything is fine ! > > I already use the newest version of e2fsck (1.23) and util-linux (2.11f). > My RedHat (Rotkäppchen) 6.2 is rather old, but I don't like gcc 2.96 at > all. > > I posted this report as the errors occured after a complete crash with > 2.4.6 also to the ext2-developers directly but they didn't answered. > > Maybe you could help me here ? > > Kristian > > ·· · · reach me :: · ·· ·· · · ·· · ·· · ··· · · > > :: http://www.korseby.net > :: http://www.tomlab.de > > kristian@korseby.net ....:: > > > > - > To unsubscribe from this list: send the line "unsubscribe linux-kernel" in > the body of a message to majordomo@vger.kernel.org > More majordomo info at http://vger.kernel.org/majordomo-info.html > Please read the FAQ at http://www.tux.org/lkml/ -- christian widmer zurlindenstrasse 294, 8003 zurich, switzerland email: cwidmer@iiic.ethz.ch phone: ++41 (0)1 491 03 68 ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: massive filesystem corruption with 2.4.9 2001-08-21 8:34 ` Christian Widmer @ 2001-08-21 10:14 ` Kristian 0 siblings, 0 replies; 9+ messages in thread From: Kristian @ 2001-08-21 10:14 UTC (permalink / raw) To: cwidmer; +Cc: linux-kernel Christian Widmer wrote: > i had similar problems with 2.4.6. unfortunately i didn't save the errors so > i can't compare the msg's. i just can say that with 2.4.6 it destroyed the > ext2 on a 40GB and 60GB maxtor disk. since then my nfs server is running > 2.2.19 with works fine (with minor promblems*). > > * after a client mounted an exprots once. i cant unmount that partition on > the server after the client unmounted the exports. I have several entries more in my logfile. It would be no problem collecting them if that is helpful. I forgot to say that I'm using an IBM 41 GB (hda: IBM-DTLA-305040, ATA DISK drive) and that this problem only occurs on my root-partition (hda5), the always mounted /boot-Partition (hda1) and partially mounted misc-Partition (hda7) are not effected. I don't use any NFS. Kristian ·· · · reach me :: · ·· ·· · · ·· · ·· · ··· · · :: http://www.korseby.net :: http://www.tomlab.de kristian@korseby.net ....:: ^ permalink raw reply [flat|nested] 9+ messages in thread
[parent not found: <no.id>]
* Re: massive filesystem corruption with 2.4.9 [not found] <no.id> @ 2001-08-21 13:58 ` Alan Cox 2001-08-21 16:00 ` Kristian 2001-08-21 16:23 ` Alan Cox 2001-08-21 16:26 ` Alan Cox 2 siblings, 1 reply; 9+ messages in thread From: Alan Cox @ 2001-08-21 13:58 UTC (permalink / raw) To: cwidmer; +Cc: Kristian, linux-kernel > > Aug 21 09:01:06 adlib kernel: EXT2-fs error (device ide0(3,5)): > > ext2_new_block: Allocating block in system zone - block =3D 3 > > Aug 21 09:01:06 adlib kernel: EXT2-fs error (device ide0(3,5)): > > ext2_free_blocks: Freeing blocks in system zones - Block =3D 4, count= > =3D 1 Typically this indicates disk, memory or chipset problems. If your disk is in UDMA33/66 mode you can pretty rule the disk out as the data is protected If you have a VIA chipset, especially if there is an SB Live! in the machine then that may be the cause (fixes in 2.4.8-ac, should be a fix in 2.4.9 but Linus tree also applies another bogus change but which should be harmless) > > These fatal errors are occuring since 2.4.5 (2.4.8 I've not tested.).= > When > > I work with 2.4.4 everything is fine ! What hardware ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: massive filesystem corruption with 2.4.9 2001-08-21 13:58 ` Alan Cox @ 2001-08-21 16:00 ` Kristian 2001-08-21 16:18 ` Christian Widmer 0 siblings, 1 reply; 9+ messages in thread From: Kristian @ 2001-08-21 16:00 UTC (permalink / raw) To: Alan Cox; +Cc: cwidmer, linux-kernel Alan Cox wrote: > Typically this indicates disk, memory or chipset problems. If your disk is > in UDMA33/66 mode you can pretty rule the disk out as the data is > protected > > If you have a VIA chipset, especially if there is an SB Live! in the machine > then that may be the cause (fixes in 2.4.8-ac, should be a fix in 2.4.9 but > Linus tree also applies another bogus change but which should be harmless) No. I can't find any VIA chipset. I'm really surprised. :-) But it is an original Compaq-Board (EP-Series..) with a horroble BIOS. It seems that they're using intel only.. I did a probe of my harddisk with IBM's Drive Fitness program. It detected no errors. Here's the output of it: Model : IBM-DTLA-305040 Serial no. : YJ025714 Capacity : 41.17 GB Cache size : 380 KB Microcode level : TW4OA60A ATA Compliance : ATA-5 Ultra DMA Highest mode : 5 Active mode : 1 Settings Write cache : Enabled Read look-ahead : Enabled Auto reassign : Enabled S.M.A.R.T. operations : Enabled S.M.A.R.T. status : Good ABLE : Disabled AAM : Disabled Security feature : Supported Password : Not Set Here is the output of cat /proc/pci: PCI devices found: Bus 0, device 0, function 0: Host bridge: Intel Corporation 440BX/ZX - 82443BX/ZX Host bridge (rev 3). Master Capable. Latency=64. Prefetchable 32 bit memory at 0x44000000 [0x47ffffff]. Bus 0, device 1, function 0: PCI bridge: Intel Corporation 440BX/ZX - 82443BX/ZX AGP bridge (rev 3). Master Capable. Latency=64. Min Gnt=140. Bus 0, device 14, function 0: Ethernet controller: Intel Corporation 82557 [Ethernet Pro 100] (rev 5). IRQ 11. Master Capable. Latency=66. Min Gnt=8.Max Lat=56. Prefetchable 32 bit memory at 0x48100000 [0x48100fff]. I/O at 0x1000 [0x101f]. Non-prefetchable 32 bit memory at 0x48000000 [0x480fffff]. Bus 0, device 15, function 0: Multimedia audio controller: Ensoniq ES1371 [AudioPCI-97] (rev 6). IRQ 11. Master Capable. Latency=64. Min Gnt=12.Max Lat=128. I/O at 0x1080 [0x10bf]. Bus 0, device 16, function 0: Multimedia video controller: Brooktree Corporation Bt878 (rev 17). IRQ 11. Master Capable. Latency=66. Min Gnt=16.Max Lat=40. Prefetchable 32 bit memory at 0x48200000 [0x48200fff]. Bus 0, device 16, function 1: Multimedia controller: Brooktree Corporation Bt878 (rev 17). IRQ 11. Master Capable. Latency=66. Min Gnt=4.Max Lat=255. Prefetchable 32 bit memory at 0x48300000 [0x48300fff]. Bus 0, device 20, function 0: ISA bridge: Intel Corporation 82371AB PIIX4 ISA (rev 2). Bus 0, device 20, function 1: IDE interface: Intel Corporation 82371AB PIIX4 IDE (rev 1). Master Capable. Latency=64. I/O at 0x1040 [0x104f]. Bus 0, device 20, function 2: USB Controller: Intel Corporation 82371AB PIIX4 USB (rev 1). IRQ 11. Master Capable. Latency=64. I/O at 0x1020 [0x103f]. Bus 0, device 20, function 3: Bridge: Intel Corporation 82371AB PIIX4 ACPI (rev 2). IRQ 9. Bus 1, device 0, function 0: VGA compatible controller: Matrox Graphics, Inc. MGA G400 AGP (rev 130). IRQ 11. Master Capable. Latency=64. Min Gnt=16.Max Lat=32. Prefetchable 32 bit memory at 0x42000000 [0x43ffffff]. Non-prefetchable 32 bit memory at 0x40800000 [0x40803fff]. Non-prefetchable 32 bit memory at 0x40000000 [0x407fffff]. /proc/cpuinfo: processor : 0 vendor_id : GenuineIntel cpu family : 6 model : 8 model name : Pentium III (Coppermine) stepping : 3 cpu MHz : 597.413 cache size : 256 KB fdiv_bug : no hlt_bug : no f00f_bug : no coma_bug : no fpu : yes fpu_exception : yes cpuid level : 2 wp : yes flags : fpu vme de pse tsc msr pae mce cx8 sep mtrr pge mca cmov pat pse36 mmx fxsr sse bogomips : 1192.75 /proc/interrupts: 0: 1337585 XT-PIC timer 1: 46916 XT-PIC keyboard 2: 0 XT-PIC cascade 8: 1 XT-PIC rtc 11: 162990 XT-PIC es1371, bttv, eth0 12: 294331 XT-PIC PS/2 Mouse 14: 21839 XT-PIC ide0 15: 17 XT-PIC ide1 NMI: 0 ERR: 0 Kristian ·· · · reach me :: · ·· ·· · · ·· · ·· · ··· · · :: http://www.korseby.net :: http://www.tomlab.de kristian@korseby.net ....:: ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: massive filesystem corruption with 2.4.9 2001-08-21 16:00 ` Kristian @ 2001-08-21 16:18 ` Christian Widmer 0 siblings, 0 replies; 9+ messages in thread From: Christian Widmer @ 2001-08-21 16:18 UTC (permalink / raw) To: Kristian, Alan Cox; +Cc: linux-kernel > > If your disk is in UDMA33/66 mode you can pretty rule the > > disk out as the data is protected i think this should be with the promise driver. > > If you have a VIA chipset, especially if there is an SB Live! in the > > machine then that may be the cause (fixes in 2.4.8-ac, should be a fix > > in 2.4.9 but Linus tree also applies another bogus change but which > > should be harmless) it was an intel LX chipset that it is a memory problem i also don't belive. that ram work for over 2 year with no errors found with memtest (memtset86, intels memtest) compiling seveal times xfree86 and an many many times several kernels. and i never had any problems. until i tried the first time a 2.4.x kernel on the fileserver (that was 2.4.6). so i moved the fileserver back to 2.2.19. ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: massive filesystem corruption with 2.4.9 [not found] <no.id> 2001-08-21 13:58 ` Alan Cox @ 2001-08-21 16:23 ` Alan Cox 2001-08-21 19:06 ` Kristian 2001-08-21 16:26 ` Alan Cox 2 siblings, 1 reply; 9+ messages in thread From: Alan Cox @ 2001-08-21 16:23 UTC (permalink / raw) To: Kristian; +Cc: Alan Cox, cwidmer, linux-kernel > No. I can't find any VIA chipset. I'm really surprised. :-) But it is a= > n > original Compaq-Board (EP-Series..) with a horroble BIOS. It seems that= > they're > using intel only.. 440BX - good chipset. Does memtest86 show up anything on this box ? ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: massive filesystem corruption with 2.4.9 2001-08-21 16:23 ` Alan Cox @ 2001-08-21 19:06 ` Kristian 0 siblings, 0 replies; 9+ messages in thread From: Kristian @ 2001-08-21 19:06 UTC (permalink / raw) To: Alan Cox; +Cc: cwidmer, linux-kernel Alan Cox wrote: > Does memtest86 show up anything on this box ? No errors... Btw: As far as I know did the problem occur since I patched 2.4.5 with ac13 or ac15. Maybe a clean 2.4.5 works fine. I'm not sure about this. It's some time ago... Did you have made some important ext2-related changes with 2.4.5-ac?. I could revert to the old kernel and test him if it is relevant. Kristian ·· · · reach me :: · ·· ·· · · ·· · ·· · ··· · · :: http://www.korseby.net :: http://www.tomlab.de kristian@korseby.net ....:: ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: massive filesystem corruption with 2.4.9 [not found] <no.id> 2001-08-21 13:58 ` Alan Cox 2001-08-21 16:23 ` Alan Cox @ 2001-08-21 16:26 ` Alan Cox 2 siblings, 0 replies; 9+ messages in thread From: Alan Cox @ 2001-08-21 16:26 UTC (permalink / raw) To: cwidmer; +Cc: Kristian, Alan Cox, linux-kernel > that it is a memory problem i also don't belive. that ram work for over 2 year > with no errors found with memtest (memtset86, intels memtest) compiling > seveal times xfree86 and an many many times several kernels. > > and i never had any problems. until i tried the first time a 2.4.x kernel on > the fileserver (that was 2.4.6). so i moved the fileserver back to 2.2.19. Nod. I can follow that reasoning, I've come across boxes that fialed with 2.4 with memory errors, but not 2.2. So far however those have all shown up with memtest86, or been Athlon optimisation triggered via things Curiouser and curiouser ^ permalink raw reply [flat|nested] 9+ messages in thread
end of thread, other threads:[~2001-08-21 19:07 UTC | newest] Thread overview: 9+ messages (download: mbox.gz / follow: Atom feed) -- links below jump to the message on this page -- 2001-08-21 8:00 massive filesystem corruption with 2.4.9 Kristian 2001-08-21 8:34 ` Christian Widmer 2001-08-21 10:14 ` Kristian [not found] <no.id> 2001-08-21 13:58 ` Alan Cox 2001-08-21 16:00 ` Kristian 2001-08-21 16:18 ` Christian Widmer 2001-08-21 16:23 ` Alan Cox 2001-08-21 19:06 ` Kristian 2001-08-21 16:26 ` Alan Cox
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).