linux-kernel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Neil Brown <neilb@cse.unsw.edu.au>
To: ext3-users@redhat.com
Cc: lkml <linux-kernel@vger.kernel.org>,
	"Stephen C. Tweedie" <sct@redhat.com>,
	Andreas Dilger <adilger@turbolinux.com>,
	"Peter J. Braam" <braam@clusterfilesystem.com>
Subject: Re: ext3-2.4-0.9.0
Date: Sun, 8 Jul 2001 08:15:17 +1000 (EST)	[thread overview]
Message-ID: <15175.35317.985921.670835@notabene.cse.unsw.edu.au> (raw)
In-Reply-To: message from Andrew Morton on Saturday July 7
In-Reply-To: <3B45D6DB.70B9D754@uow.edu.au>

On Saturday July 7, andrewm@uow.edu.au wrote:
> An update of the ext3 journalling filesystem for 2.4 kernels
> is available at
> 
> 	http://www.uow.edu.au/~andrewm/linux/ext3/
> 
> Patches are against 2.4.6-ac1 and 2.4.6.

I thought it was time to try out ext3 between nfsd and raid5, so I
built 2.4.6  plus this patch, and an ext3 filesystem on a largish
raid5 volume, exported it (with the "sync" flag), mounted it from
another machines with NFSv2, and ran "dbench 4".

This produces a live-lock (I think that it the right term).
Throughput would drop to zero (determined by watching the counts in 
/proc/nfs/rpc/nfsd), but could be coaxed along by generating other
filesystem activity.

I tried nfs over ext3 on a plain ide disc and it worked fine.
I tried dbench directly on ext3/raid5 and it worked fine.
I tried dbench/nfs/ext2/raid5 and it worked fine.

So I think it is some interaction between ext3fs and raid5 triggered
by the high rate of "fsync" calls made by nfsd.  Naturally I blame
ext3 because I know more about raid5 and nfsd :-)

One particular aspect of raid5 that *could* be related is that it is
very reticent to schedule write requests. It tries to hang on the them
as long as possible in the hope of getting more write requests in the
same stripe.  My guess as to what is happening is that as write
request is submitted and then waited-for without an intervening 
		run_task_queue(&tq_disk);

When the system is livelocked, all I can tell at the moment (I am at
home and the console is at work so I cannot use alt-sysrq) is that 
kjournal is waiting in wait_on_buffer and an nfsd thread is waiting on
the journal.

I will try to explore it more deeply next time I am at work, but if
there are any suggestions as to what it might be, or how I might more
easily find out what is going on, I am all ears.

NeilBrown

  reply	other threads:[~2001-07-07 22:16 UTC|newest]

Thread overview: 4+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2001-07-06 15:18 ext3-2.4-0.9.0 Andrew Morton
2001-07-07 22:15 ` Neil Brown [this message]
2001-07-08  1:05   ` ext3-2.4-0.9.0 Andrew Morton
2001-07-08  6:02     ` ext3-2.4-0.9.0 Neil Brown

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=15175.35317.985921.670835@notabene.cse.unsw.edu.au \
    --to=neilb@cse.unsw.edu.au \
    --cc=adilger@turbolinux.com \
    --cc=braam@clusterfilesystem.com \
    --cc=ext3-users@redhat.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=sct@redhat.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).