linux-kernel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* 2.6.25 released with bug, which leads to XFS crash?
@ 2008-04-17  6:49 Denys Fedoryshchenko
  2008-04-17  8:12 ` markus reichelt
                   ` (2 more replies)
  0 siblings, 3 replies; 8+ messages in thread
From: Denys Fedoryshchenko @ 2008-04-17  6:49 UTC (permalink / raw)
  To: linux-kernel

Hi again

I reported about http://bugzilla.kernel.org/show_bug.cgi?id=10421 , and it is triggerable on different loaded servers with XFS (squid with aufs), 
just it is happening even on heavy load after 1-2 days. IMHO such bugs is critical (same as getting kernel panic, and etc), 
cause they are unrecoverable, causing minor filesystem corruption, and only way to fix them - wakeup sysadmin. Worst thing, it is hapenning at night,
when i restart squid, and probably it is doing agressive unlinking stale cache entries. It doesn't do panic, or even oops, but filesystem will be disconnected, 
and squid will remain in loop trying to restart. Sure it is easy to restart it, but maybe it has to be OOPS? so at least i can do 
sysctl -w kernel.panic_on_oops = 1, and FS will be recovered on reboot.

Just want to warn people who is using XFS on loaded servers to keep attention while using 2.6.25, and if you face same bug, report to bugzilla.

-- 
------
Technical Manager
Virtual ISP S.A.L.
Lebanon

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: 2.6.25 released with bug, which leads to XFS crash?
  2008-04-17  6:49 2.6.25 released with bug, which leads to XFS crash? Denys Fedoryshchenko
@ 2008-04-17  8:12 ` markus reichelt
  2008-04-18  0:06 ` David Chinner
  2008-04-18  0:39 ` Eric Sandeen
  2 siblings, 0 replies; 8+ messages in thread
From: markus reichelt @ 2008-04-17  8:12 UTC (permalink / raw)
  To: linux-kernel

[-- Attachment #1: Type: text/plain, Size: 265 bytes --]

* Denys Fedoryshchenko <denys@visp.net.lb> wrote:

> Just want to warn people who is using XFS on loaded servers to keep
> attention while using 2.6.25, and if you face same bug, report to
> bugzilla.

Thanks for the heads-up!

-- 
left blank, right bald

[-- Attachment #2: Type: application/pgp-signature, Size: 197 bytes --]

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: 2.6.25 released with bug, which leads to XFS crash?
  2008-04-17  6:49 2.6.25 released with bug, which leads to XFS crash? Denys Fedoryshchenko
  2008-04-17  8:12 ` markus reichelt
@ 2008-04-18  0:06 ` David Chinner
  2008-04-18  0:39 ` Eric Sandeen
  2 siblings, 0 replies; 8+ messages in thread
From: David Chinner @ 2008-04-18  0:06 UTC (permalink / raw)
  To: Denys Fedoryshchenko; +Cc: linux-kernel, xfs

On Thu, Apr 17, 2008 at 09:49:36AM +0300, Denys Fedoryshchenko wrote:
> Hi again
> 
> I reported about http://bugzilla.kernel.org/show_bug.cgi?id=10421 , and it
> is triggerable on different loaded servers with XFS (squid with aufs),
> just it is happening even on heavy load after 1-2 days. IMHO such bugs is
> critical (same as getting kernel panic, and etc),

Well, yes, and we treat shutdown bugs as such. A filesystem shutdown
is effectively a filesystem panic and is indicative of either a
corruption or a bug. The reality is that it takes time to triage
such a problem that only occurs on one workload on one set of
identical machines once every day or two. This does not make the
problem a release blocker, though.

The other side of it is that problems like this in Linux are often
the result of a bug in a lower layer and not XFS itself. Given this
particular problem seems to be memory corruption it could be anything
that is causing it....

> cause they are unrecoverable, causing minor filesystem corruption, and only
> way to fix them - wakeup sysadmin. Worst thing, it is hapenning at night,
> when i restart squid, and probably it is doing agressive unlinking stale
> cache entries. It doesn't do panic, or even oops, but filesystem will be
> disconnected, > and squid will remain in loop trying to restart. Sure it is
> easy to restart it, but maybe it has to be OOPS? so at least i can do
> sysctl -w kernel.panic_on_oops = 1, and FS will be recovered on reboot.

Rather than fearmongering, perhaps you should ask on the XFS list
(xfs@oss.sgi.com) whether anything like this can be done.  Then you
might have learnt about Documentation/filesystems/xfs.txt and
/proc/sys/fs/xfs/panic_mask:

  fs.xfs.panic_mask             (Min: 0  Default: 0  Max: 127)
        Causes certain error conditions to call BUG(). Value is a bitmask;
        AND together the tags which represent errors which should cause panics:

> Just want to warn people who is using XFS on loaded servers to keep
> attention while using 2.6.25, and if you face same bug, report to bugzilla.

Actually, I'd much prefer XFS bug reports to go to xfs@oss.sgi.com
rather than the kernel bugzilla - that way most of the XFS community
will see the bug report and the triage being done and then there's
no need for spamming lkml like this....

Cheers,

Dave.
-- 
Dave Chinner
Principal Engineer
SGI Australian Software Group

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: 2.6.25 released with bug, which leads to XFS crash?
  2008-04-17  6:49 2.6.25 released with bug, which leads to XFS crash? Denys Fedoryshchenko
  2008-04-17  8:12 ` markus reichelt
  2008-04-18  0:06 ` David Chinner
@ 2008-04-18  0:39 ` Eric Sandeen
  2008-04-18  7:40   ` Christoph Hellwig
  2 siblings, 1 reply; 8+ messages in thread
From: Eric Sandeen @ 2008-04-18  0:39 UTC (permalink / raw)
  To: Denys Fedoryshchenko; +Cc: linux-kernel

Denys Fedoryshchenko wrote:
> Hi again
> 
> I reported about http://bugzilla.kernel.org/show_bug.cgi?id=10421 , and it is triggerable on different loaded servers with XFS (squid with aufs),

Oh, aufs was not listed in the bug report.  that's an out of tree
module, right?  Is it completely up to date and safe with 2.6.25?

-Eric

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: 2.6.25 released with bug, which leads to XFS crash?
  2008-04-18  0:39 ` Eric Sandeen
@ 2008-04-18  7:40   ` Christoph Hellwig
  2008-04-18 10:32     ` Denys Fedoryshchenko
  0 siblings, 1 reply; 8+ messages in thread
From: Christoph Hellwig @ 2008-04-18  7:40 UTC (permalink / raw)
  To: Eric Sandeen; +Cc: Denys Fedoryshchenko, linux-kernel

On Thu, Apr 17, 2008 at 07:39:50PM -0500, Eric Sandeen wrote:
> Denys Fedoryshchenko wrote:
> > Hi again
> > 
> > I reported about http://bugzilla.kernel.org/show_bug.cgi?id=10421 , and it is triggerable on different loaded servers with XFS (squid with aufs),
> 
> Oh, aufs was not listed in the bug report.  that's an out of tree
> module, right?  Is it completely up to date and safe with 2.6.25?

And it's a completely crappy one.  I personally would through away every
bug report with that piece of junk involved.


^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: 2.6.25 released with bug, which leads to XFS crash?
  2008-04-18  7:40   ` Christoph Hellwig
@ 2008-04-18 10:32     ` Denys Fedoryshchenko
  2008-04-18 10:37       ` Christoph Hellwig
  2008-04-18 13:40       ` Eric Sandeen
  0 siblings, 2 replies; 8+ messages in thread
From: Denys Fedoryshchenko @ 2008-04-18 10:32 UTC (permalink / raw)
  To: Christoph Hellwig; +Cc: Eric Sandeen, linux-kernel

AUFS(same as COSS, UFS, NULL) is way of handling with cache storage in squid, and nothing else. It is userspace and not involved deeply with kernel things.
Since long long time i didn't had any single problem with it.
Kernel is fully vanilla.

On Friday 18 April 2008 10:40, Christoph Hellwig wrote:
> On Thu, Apr 17, 2008 at 07:39:50PM -0500, Eric Sandeen wrote:
> > Denys Fedoryshchenko wrote:
> > > Hi again
> > > 
> > > I reported about http://bugzilla.kernel.org/show_bug.cgi?id=10421 , and it is triggerable on different loaded servers with XFS (squid with aufs),
> > 
> > Oh, aufs was not listed in the bug report.  that's an out of tree
> > module, right?  Is it completely up to date and safe with 2.6.25?
> 
> And it's a completely crappy one.  I personally would through away every
> bug report with that piece of junk involved.
> 
> 

-- 
------
Technical Manager
Virtual ISP S.A.L.
Lebanon

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: 2.6.25 released with bug, which leads to XFS crash?
  2008-04-18 10:32     ` Denys Fedoryshchenko
@ 2008-04-18 10:37       ` Christoph Hellwig
  2008-04-18 13:40       ` Eric Sandeen
  1 sibling, 0 replies; 8+ messages in thread
From: Christoph Hellwig @ 2008-04-18 10:37 UTC (permalink / raw)
  To: Denys Fedoryshchenko; +Cc: Christoph Hellwig, Eric Sandeen, linux-kernel

On Fri, Apr 18, 2008 at 01:32:30PM +0300, Denys Fedoryshchenko wrote:
> AUFS(same as COSS, UFS, NULL) is way of handling with cache storage in squid, and nothing else. It is userspace and not involved deeply with kernel things.
> Since long long time i didn't had any single problem with it.
> Kernel is fully vanilla.

Ah, okay.  There's also a kernel filesystem driver doing unioning that's
also called aufs that I'm referring to.  sorry for the noise.


^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: 2.6.25 released with bug, which leads to XFS crash?
  2008-04-18 10:32     ` Denys Fedoryshchenko
  2008-04-18 10:37       ` Christoph Hellwig
@ 2008-04-18 13:40       ` Eric Sandeen
  1 sibling, 0 replies; 8+ messages in thread
From: Eric Sandeen @ 2008-04-18 13:40 UTC (permalink / raw)
  To: Denys Fedoryshchenko; +Cc: Christoph Hellwig, linux-kernel

Denys Fedoryshchenko wrote:
> AUFS(same as COSS, UFS, NULL) is way of handling with cache storage in squid, and nothing else. It is userspace and not involved deeply with kernel things.
> Since long long time i didn't had any single problem with it.
> Kernel is fully vanilla.

ah, ok.  I had assumed kernel module too, sorry.

-Eric

^ permalink raw reply	[flat|nested] 8+ messages in thread

end of thread, other threads:[~2008-04-18 13:41 UTC | newest]

Thread overview: 8+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2008-04-17  6:49 2.6.25 released with bug, which leads to XFS crash? Denys Fedoryshchenko
2008-04-17  8:12 ` markus reichelt
2008-04-18  0:06 ` David Chinner
2008-04-18  0:39 ` Eric Sandeen
2008-04-18  7:40   ` Christoph Hellwig
2008-04-18 10:32     ` Denys Fedoryshchenko
2008-04-18 10:37       ` Christoph Hellwig
2008-04-18 13:40       ` Eric Sandeen

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).