* btree re-write and XFS_WANT_CORRUPTED_GOTO
@ 2011-05-19 15:59 Peter Watkins
  2011-05-20  0:25 ` Dave Chinner
  0 siblings, 1 reply; 2+ messages in thread
From: Peter Watkins @ 2011-05-19 15:59 UTC
  To: xfs

Hello again,

I've occasionally seen the XFS_WANT_CORRUPTED_GOTO error from
xfs_free_extent, usually when cleaning up an unlinked file or
truncating a file.

Is the btree rewrite known to fix any WANT_CORRUPTED problems? More
generally, why was the btree code reworked? I'm in the process of
going from 2.6.27 to 2.6.32 (still old I know), which seems to span
the btree re-write.

Are any particular patches recommended for this problem? I came across
24446fc66fdebbdd8baca0f44fd2a47ad77ba580. It's discussed at
http://oss.sgi.com/archives/xfs/2011-01/msg00266.html  Do any others
come to mind?

Still trying to create a reproducer case, but no luck so far.

Thanks for any comments!

-Peter


Here's an example stack:

XFS internal error XFS_WANT_CORRUPTED_GOTO at line 1545 of file
fs/xfs/xfs_alloc.c.  Caller 0xf8d5bef3
[<f8d85770>] xfs_error_report+0x60/0x70 [xfs]
[<f8d5bef3>] ? xfs_free_extent+0xc3/0xe0 [xfs]
[<f8d5ac21>] xfs_free_ag_extent+0x2e1/0x670 [xfs]
[<f8d5bef3>] ? xfs_free_extent+0xc3/0xe0 [xfs]
[<f8d5bef3>] xfs_free_extent+0xc3/0xe0 [xfs]
[<f8d6dd73>] xfs_bmap_finish+0xf3/0x170 [xfs]
[<f8d8f66c>] xfs_itruncate_finish+0x27c/0x3f0 [xfs]
[<f8dab30c>] xfs_setattr+0xaac/0xc00 [xfs]
[<f8db8942>] xfs_vn_setattr+0x22/0x30 [xfs]
[<c01d5d13>] fnotify_change+0x383/0x3a0
[<c01bd6ac>] do_truncate+0x6c/0x90
[<c03ad96d>] ? _spin_lock+0xd/0x10
[<c01bd88e>] do_sys_truncate+0x1be/0x220
[<c01bdae6>] sys_truncate64+0x16/0x20
[<c0109d73>] sysenter_do_call+0x12/0x2f

_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs


* Re: btree re-write and XFS_WANT_CORRUPTED_GOTO
  2011-05-19 15:59 btree re-write and XFS_WANT_CORRUPTED_GOTO Peter Watkins
@ 2011-05-20  0:25 ` Dave Chinner
  0 siblings, 0 replies; 2+ messages in thread
From: Dave Chinner @ 2011-05-20  0:25 UTC
  To: Peter Watkins; +Cc: xfs

On Thu, May 19, 2011 at 11:59:18AM -0400, Peter Watkins wrote:
> Hello again,
> 
> I've occasionally seen the XFS_WANT_CORRUPTED_GOTO error from
> xfs_free_extent, usually when cleaning up an unlinked file or
> truncating a file.
> 
> Is the btree rewrite known to fix any WANT_CORRUPTED problems? More

No.

> generally, why was the btree code reworked?

Three copies of almost the same btree code, spanning 15,000 lines
of code, reduced to one btree core file plus roughly 1,000 lines of
type-specific code for each of the three btree types. All btrees get
WANT_CORRUPTED coverage instead of just the freespace btree. It is
much easier to implement new btrees (~1,000 lines of code instead of
~4,000-7,000 lines). Bug fixes to the btree core apply to all types,
instead of just the one tree the bug was discovered in. Optimisations
need to touch one piece of code, not three, etc, etc.
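
To illustrate the points above, here is a minimal userspace sketch of
the "one shared core, small per-type ops" structure, with a
WANT_CORRUPTED-style check living in the shared code so that every
tree type is covered by it. This is not the kernel code: every name
prefixed with sketch_ or SKETCH_ is hypothetical, the error value is a
placeholder, and the only XFS detail assumed is that one freespace
btree is ordered by start block and the other by extent length.

```c
#include <stdio.h>
#include <stddef.h>

/* Placeholder error code for this sketch (an assumption, not taken from XFS). */
#define SKETCH_EFSCORRUPTED 117

/*
 * Hypothetical stand-in for a WANT_CORRUPTED_GOTO-style check: if the
 * condition does not hold, report where, set the local "error" variable
 * and jump to the caller's cleanup label.
 */
#define SKETCH_WANT_CORRUPTED_GOTO(cond, label)                           \
	do {                                                              \
		if (!(cond)) {                                            \
			fprintf(stderr, "corruption check failed at %s:%d\n", \
				__FILE__, __LINE__);                      \
			error = SKETCH_EFSCORRUPTED;                      \
			goto label;                                       \
		}                                                         \
	} while (0)

/* A free-extent record: where it starts and how long it is. */
struct sketch_rec {
	unsigned int startblock;
	unsigned int blockcount;
};

/* The only part that differs per tree type: how records are ordered. */
struct sketch_btree_ops {
	const char *name;
	int (*rec_cmp)(const struct sketch_rec *a, const struct sketch_rec *b);
};

/* Order by start block. */
static int sketch_bno_cmp(const struct sketch_rec *a, const struct sketch_rec *b)
{
	return (a->startblock > b->startblock) - (a->startblock < b->startblock);
}

/* Order by extent length, then by start block. */
static int sketch_cnt_cmp(const struct sketch_rec *a, const struct sketch_rec *b)
{
	if (a->blockcount != b->blockcount)
		return (a->blockcount > b->blockcount) - (a->blockcount < b->blockcount);
	return sketch_bno_cmp(a, b);
}

const struct sketch_btree_ops sketch_bno_ops = { "by-block", sketch_bno_cmp };
const struct sketch_btree_ops sketch_cnt_ops = { "by-size", sketch_cnt_cmp };

/*
 * Shared core: verify that the records of one block are in strictly
 * increasing order for the given tree type. Returns 0 if they are,
 * SKETCH_EFSCORRUPTED otherwise.
 */
int sketch_btree_check_order(const struct sketch_btree_ops *ops,
			     const struct sketch_rec *recs, size_t nrecs)
{
	int error = 0;
	size_t i;

	for (i = 1; i < nrecs; i++) {
		SKETCH_WANT_CORRUPTED_GOTO(ops->rec_cmp(&recs[i - 1], &recs[i]) < 0,
					   out);
	}
out:
	return error;
}
```

Adding another tree type in this sketch means writing one comparison
function and one ops table; the ordering check comes along for free.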

> going from 2.6.27 to 2.6.32 (still old I know), which seems to span
> the btree re-write.
> 
> Are any particular patches recommended for this problem? I came across
> 24446fc66fdebbdd8baca0f44fd2a47ad77ba580. It's discussed at
> http://oss.sgi.com/archives/xfs/2011-01/msg00266.html  Do any others
> come to mind?

That problem required CXFS to reproduce - mainline XFS never
executes the particular code path that triggered the bug.

Seriously, if you are having problems with btree corruption, the
first thing you need to do is run with the latest and greatest code.
We fix problems all the time, so asking "what commits from the past
12 releases will fix this problem" is kinda pointless - we can't
give you any sort of reasonable answer to that.

If you can reproduce the problem on a recent kernel (2.6.38 or .39)
then we know we've got a bug that has not been fixed yet, and that
means we need to spend the effort to find it. However, if you
can't reproduce it on a current kernel, then there's really nothing
much we can do to help you identify the cause - you can run a bisect
on your reproducing workload and find the exact patch that fixed the
problem much more easily than we can....
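
For reference, the search that a bisect performs can be sketched as
follows. This is only an illustration of the idea, which git bisect
automates over real commits: the function names are hypothetical,
revisions are modelled as plain indices, and it assumes the problem
reproduces on every revision before the fix and on none after it.

```c
#include <stddef.h>

/*
 * Hypothetical callback: returns nonzero if the problem still
 * reproduces when running the workload on revision "rev".
 */
typedef int (*sketch_repro_fn)(size_t rev, void *ctx);

/*
 * Example callback for demonstration only: pretends the fix landed at
 * the revision index stored in ctx, so anything earlier reproduces.
 */
int sketch_repro_before(size_t rev, void *ctx)
{
	const size_t *first_fixed = ctx;

	return rev < *first_fixed;
}

/*
 * Given a revision "bad" where the problem reproduces and a later
 * revision "good" where it does not, halve the range until the two
 * are adjacent and return the first revision that no longer reproduces.
 */
size_t sketch_bisect_first_fixed(size_t bad, size_t good,
				 sketch_repro_fn reproduces, void *ctx)
{
	while (good - bad > 1) {
		size_t mid = bad + (good - bad) / 2;

		if (reproduces(mid, ctx))
			bad = mid;	/* still broken: the fix is later */
		else
			good = mid;	/* already fixed: the fix is here or earlier */
	}
	return good;
}
```

Each step needs one run of the reproducing workload, so the number of
test runs grows with the logarithm of the number of revisions in the
range. That is why a reliable reproducer matters more than anything
else here.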

Of course, if you really want someone to do all this work and fix
these sorts of problems on older kernels for you, then that's the
value proposition that using RHEL or SLES brings to the table.

Cheers,

Dave.
-- 
Dave Chinner
david@fromorbit.com

