linux-kernel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Alexander Beregalov <a.beregalov@gmail.com>
To: Dmitry Monakhov <dmonakhov@openvz.org>
Cc: Linux Kernel Mailing List <linux-kernel@vger.kernel.org>,
	linux-ext4@vger.kernel.org, Jens Axboe <jens.axboe@oracle.com>,
	"Theodore Ts'o" <tytso@mit.edu>,
	dmitry.torokhov@gmail.com
Subject: Re: 2.6.33-rc1: kernel BUG at fs/ext4/inode.c:1063 (sparc)
Date: Sun, 27 Dec 2009 23:32:25 +0300	[thread overview]
Message-ID: <a4423d670912271232y5bb928f7wb667ca71f1a93f8a@mail.gmail.com> (raw)
In-Reply-To: <87eimir4yk.fsf@openvz.org>

It seems Dmitry Torokhov has the same issue, Cc'ed.

2009/12/26 Dmitry Monakhov <dmonakhov@openvz.org>:
> Alexander Beregalov <a.beregalov@gmail.com> writes:
>
>>>> It seems I can easily reproduce it.
>>>> But I can't compile 2.6.33-rc2 :)
> BTW what sha1 of the git-commit you have used to reproduce
> the bug (2.6.33-rc1 HEAD has no this BUG_ON).
> This is important to me to know it, or just post the
> fs/ext4/inode.c file.

It was in the first post - 2f99f5c
There is only OCFS update between it and -rc2.

>>>>
>>>> scripts/kconfig/conf -s arch/sparc/Kconfig
>>>>   CHK     include/linux/version.h
>>>>   CHK     include/generated/utsrelease.h
>>>>   CALL    scripts/checksyscalls.sh
>>>>   CHK     include/generated/compile.h
>>>>   GZIP    kernel/config_data.gz
>>>>   CC      fs/configfs/inode.o
>>>>   IKCFG   kernel/config_data.h
>>>>   LD [M]  fs/btrfs/btrfs.o
>>>>   CC      kernel/configs.o
>>>> fs/btrfs/sysfs.o: file not recognized: File truncated
>>> This happens because of  delayed allocation. Each time BUG or
>>> unexpected power off happens during object files usually becomes
>>> broken. IMHO this is expected issue. Just recompile from beginning
>>> # make clean; make -j4
>>
>> It does not help, it still fails.
> Again strange, please run fsck. What about compile it from very
> beginning (start from unpacking tar-ball from kernel.org)
> Or may be compile it on another file-system(ext3 or
> ext4 with nodelalloc option)

I tried fsck, it did not find any problem, kernel build still fails after it.

>> I will try to crosscompile the kernel with Ted's patch on another host.

Here is output of 2.6.33-rc2 plus Ted's patch

EXT4-fs (sda1): inode #1387643: mdb_free (1) < mdb_claim (2) BUG

------------[ cut here ]------------
WARNING: at fs/ext4/inode.c:1067 ext4_get_blocks+0x3f0/0x440()
Modules linked in:
Call Trace:
 [0000000000456bb0] warn_slowpath_common+0x50/0xa0
 [0000000000456c1c] warn_slowpath_null+0x1c/0x40
 [0000000000545010] ext4_get_blocks+0x3f0/0x440
 [0000000000545420] mpage_da_map_blocks+0x80/0x800
 [0000000000546260] mpage_add_bh_to_extent+0x40/0x100
 [00000000005464cc] __mpage_da_writepage+0x1ac/0x220
 [00000000004a957c] write_cache_pages+0x19c/0x380
 [0000000000545e1c] ext4_da_writepages+0x27c/0x680
 [00000000004a97ec] do_writepages+0x2c/0x60
 [00000000004f952c] writeback_single_inode+0xcc/0x3c0
 [00000000004fa438] writeback_inodes_wb+0x338/0x500
 [00000000004fa748] wb_writeback+0x148/0x220
 [00000000004fab60] wb_do_writeback+0x240/0x260
 [00000000004fabec] bdi_writeback_task+0x6c/0xc0
 [00000000004b6fb0] bdi_start_fn+0x70/0xe0
 [000000000047036c] kthread+0x6c/0x80
---[ end trace 46a56c443941c84d ]---

>>
> It is sad, but i still can not reproduce your bug.
> At this time i've tested following configurations:
> system   :    2.6.33-rc2, x86 two cores cpu with 2GB of ram
> block dev: real sata drive, loopdev over tmpfs
> mkfs     : 4k and 1k blocksize
> mount    : w/o quota, quota, journaled quota
> quota    : both ON and OFF states
> fs-load  : - fsstress with 1,4,16,32 concurrent tasks
>           - kernel compilation -j4, -j32
>           - In fact currently my mail-dir is under quota control.
> Please clarify your use-case:
> 0) Your system speciffication: cpu_num, mem_size, page_size(i guess 8k)
>   block device.
UltraSparc IIe, UP, 2Gb, 8kb, real SCSI disk (sym53c8xx driver)
> 1) mkfs options
I do not remember.
Perhaps dumpe2fs can help

root@v120 ~ # dumpe2fs -h /dev/sda1
dumpe2fs 1.41.9 (22-Aug-2009)
Filesystem volume name:   <none>
Last mounted on:          /
Filesystem UUID:          b34f302e-78a3-4f80-bae6-31639456216c
Filesystem magic number:  0xEF53
Filesystem revision #:    1 (dynamic)
Filesystem features:      has_journal ext_attr resize_inode dir_index
filetype needs_recovery sparse_super large_file
Filesystem flags:         signed_directory_hash
Default mount options:    (none)
Filesystem state:         clean
Errors behavior:          Continue
Filesystem OS type:       Linux
Inode count:              2113536
Block count:              8448000
Reserved block count:     422400
Free blocks:              6661110
Free inodes:              1861302
First block:              0
Block size:               4096
Fragment size:            4096
Reserved GDT blocks:      1021
Blocks per group:         32768
Fragments per group:      32768
Inodes per group:         8192
Inode blocks per group:   512
Filesystem created:       Tue Nov 10 00:44:17 2009
Last mount time:          Sun Dec 27 20:05:48 2009
Last write time:          Sat Dec 26 10:59:00 2009
Mount count:              3
Maximum mount count:      21
Last checked:             Sat Dec 26 06:07:50 2009
Check interval:           15552000 (6 months)
Next check after:         Thu Jun 24 07:07:50 2010
Lifetime writes:          30 GB
Reserved blocks uid:      0 (user root)
Reserved blocks gid:      0 (group root)
First inode:              11
Inode size:               256
Required extra isize:     28
Desired extra isize:      28
Journal inode:            8
Default directory hash:   half_md4
Directory Hash Seed:      ae1ec2f1-0f86-4f26-ace5-eb656fd25709
Journal backup:           inode blocks
Journal size:             128M


> 2) mount options
noatime
> 3) quota options (if any)
No
> 4) your fs load test-case
Have not tried to find a simpler testcase yet.
make CROSS_COMPILE="ccache sparc64-unknown-linux-gnu-" -j4 zImage modules

Hm, perhaps ccache is the real trigger of the problem.

> 5) How long does it takes you to reproduce the bug.
Few seconds (~5)

  reply	other threads:[~2009-12-27 20:32 UTC|newest]

Thread overview: 16+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2009-12-24 22:28 2.6.33-rc1: kernel BUG at fs/ext4/inode.c:1063 (sparc) Alexander Beregalov
2009-12-24 22:49 ` Alexander Beregalov
2009-12-25 12:31   ` Dmitry Monakhov
2009-12-25 19:33     ` Alexander Beregalov
2009-12-25 23:47       ` Dmitry Monakhov
2009-12-27 20:32         ` Alexander Beregalov [this message]
2009-12-27 21:38           ` Dmitry Torokhov
2009-12-27 22:52           ` tytso
2009-12-27 23:02             ` Alexander Beregalov
2009-12-28  3:51               ` tytso
2009-12-30  5:37                 ` tytso
2009-12-30 13:18                   ` Dmitry Monakhov
2009-12-30 17:45                     ` tytso
2009-12-30 17:48                     ` tytso
2009-12-24 23:05 ` tytso
2009-12-24 23:15   ` tytso

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=a4423d670912271232y5bb928f7wb667ca71f1a93f8a@mail.gmail.com \
    --to=a.beregalov@gmail.com \
    --cc=dmitry.torokhov@gmail.com \
    --cc=dmonakhov@openvz.org \
    --cc=jens.axboe@oracle.com \
    --cc=linux-ext4@vger.kernel.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=tytso@mit.edu \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).