linux-kernel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* swap_dup/swap_free errors with 2.4.20-pre10
@ 2002-10-15 11:47 David Coulson
  2002-10-16 21:23 ` Marcelo Tosatti
  0 siblings, 1 reply; 5+ messages in thread
From: David Coulson @ 2002-10-15 11:47 UTC (permalink / raw)
  To: linux-kernel, UML devel

I'm running 2.4.20-pre10 on a Dual PIII system with 2Gb of RAM and three 
2Gb swap logical volumes.

It runs fine for a while, then I get lots of;

Oct 15 12:41:31 maeve kernel: swap_dup: Bad swap file entry 00000020
Oct 15 12:41:31 maeve kernel: swap_dup: Bad swap file entry 00000020
Oct 15 12:41:31 maeve kernel: swap_free: Bad swap file entry 00000020
Oct 15 12:41:31 maeve kernel: swap_free: Bad swap file entry 00000020
Oct 15 12:41:31 maeve kernel: swap_dup: Bad swap file entry 00000020
Oct 15 12:41:31 maeve kernel: swap_dup: Bad swap file entry 00000020
Oct 15 12:41:31 maeve kernel: swap_free: Bad swap file entry 00000020

The address is always 00000020. I've tried the machine without any swap 
space, and I get exactly the same error, so I'm assuming it's either bad 
RAM or a kernel issue. I ran memtest86 on it yesterday, and it didn't 
throw up any errors, but I'm going to swap the RAM out and see if that 
fixes it.

Thanks,
David

-- 
David Coulson                                  http://davidcoulson.net/
d@vidcoulson.com                       http://journal.davidcoulson.net/


^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: swap_dup/swap_free errors with 2.4.20-pre10
  2002-10-15 11:47 swap_dup/swap_free errors with 2.4.20-pre10 David Coulson
@ 2002-10-16 21:23 ` Marcelo Tosatti
  2002-10-16 22:14   ` David Coulson
  0 siblings, 1 reply; 5+ messages in thread
From: Marcelo Tosatti @ 2002-10-16 21:23 UTC (permalink / raw)
  To: David Coulson; +Cc: linux-kernel, UML devel



On Tue, 15 Oct 2002, David Coulson wrote:

> I'm running 2.4.20-pre10 on a Dual PIII system with 2Gb of RAM and three
> 2Gb swap logical volumes.
>
> It runs fine for a while, then I get lots of;
>
> Oct 15 12:41:31 maeve kernel: swap_dup: Bad swap file entry 00000020
> Oct 15 12:41:31 maeve kernel: swap_dup: Bad swap file entry 00000020
> Oct 15 12:41:31 maeve kernel: swap_free: Bad swap file entry 00000020
> Oct 15 12:41:31 maeve kernel: swap_free: Bad swap file entry 00000020
> Oct 15 12:41:31 maeve kernel: swap_dup: Bad swap file entry 00000020
> Oct 15 12:41:31 maeve kernel: swap_dup: Bad swap file entry 00000020
> Oct 15 12:41:31 maeve kernel: swap_free: Bad swap file entry 00000020
>
> The address is always 00000020. I've tried the machine without any swap
> space, and I get exactly the same error, so I'm assuming it's either bad
> RAM or a kernel issue. I ran memtest86 on it yesterday, and it didn't
> throw up any errors, but I'm going to swap the RAM out and see if that
> fixes it.

Any news on this one, David?


^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: swap_dup/swap_free errors with 2.4.20-pre10
  2002-10-16 21:23 ` Marcelo Tosatti
@ 2002-10-16 22:14   ` David Coulson
  2002-10-17  2:02     ` [uml-devel] " Jeff Dike
  0 siblings, 1 reply; 5+ messages in thread
From: David Coulson @ 2002-10-16 22:14 UTC (permalink / raw)
  To: Marcelo Tosatti; +Cc: linux-kernel, UML devel

Hey Marcelo,

> Any news on this one, David?

Sorry, I forgot to follow up - I spend most of yesterday morning trying 
to stabalise the thing and didn't end up posting my results. Basically, 
my board can only handle 1.5Gb of PC133 properly, even though it will 
try to use 2Gb if you put that much in it. Interestingly enough, it 
passed the memtest86 tests I ran on it the other night, so I'm not sure 
what's going on there. Tyan, the board manufacturer, confirmed that the 
system is only stable with 1.5Gb of PC133, which is somewhat 
disappointing, but I guess I'll have to live. Interestingly, it ran fine 
for about 8hrs before going funny the first time, then it would spit out 
swap_dup/free errors within 30 to 60mins five times in a row.

I had weird lockups under 2.4.20-pre9, where the system would behave 
oddly - Most commands would work, but 'ps' simply locked up and I 
couldn't Ctrl-C out of it. I've moved back to 2.4.19-ck7-rmap, which 
seems to be stable at the moment, although I may take another look at 
the 2.4.20-pre10 kernel sometime. As always, I didn't have a keyboard or 
monitor hooked up to it, so I couldn't do too much with sysrq, but I'll 
be ready if it does it again.

Any points about the above lock-ups would be useful - Since 'ps' locks 
and others don't (e.g. 'ls'), I think it's quite a specific issue, 
although I've so far been unable to track it down.

Thanks,
David

-- 
David Coulson                                  http://davidcoulson.net/
d@vidcoulson.com                       http://journal.davidcoulson.net/


^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: [uml-devel] Re: swap_dup/swap_free errors with 2.4.20-pre10
  2002-10-16 22:14   ` David Coulson
@ 2002-10-17  2:02     ` Jeff Dike
  2002-10-17 10:20       ` David Coulson
  0 siblings, 1 reply; 5+ messages in thread
From: Jeff Dike @ 2002-10-17  2:02 UTC (permalink / raw)
  To: David Coulson, Marcelo Tosatti; +Cc: wstearns, linux-kernel, UML devel

david@davidcoulson.net said:
> I had weird lockups under 2.4.20-pre9, where the system would behave
> oddly - Most commands would work, but 'ps' simply locked up and I
> couldn't Ctrl-C out of it.

I've seen this bug multiple times.  Basically, something is holding a
mm_sem and not letting go.  Anything that walks the process list hangs.
Ultimately, this hangs anything that's remotely useful, and you have to
crash the box.

I've seen it on my laptop several times, and it hung a UML server that we
have.  UML is frequently, but not always involved.

We got a sysrq t from the UML server.  I posted to lkml about it, with no
response.  You can see that at
	http://marc.theaimsgroup.com/?l=linux-kernel&m=103351640614665&w=2

One factoid that I forgot to mention there is that when it happens on my 
laptop, the disk activity light is stuck on.

				Jeff


^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: [uml-devel] Re: swap_dup/swap_free errors with 2.4.20-pre10
  2002-10-17  2:02     ` [uml-devel] " Jeff Dike
@ 2002-10-17 10:20       ` David Coulson
  0 siblings, 0 replies; 5+ messages in thread
From: David Coulson @ 2002-10-17 10:20 UTC (permalink / raw)
  To: Jeff Dike; +Cc: Marcelo Tosatti, wstearns, linux-kernel, UML devel

Jeff Dike wrote:
> I've seen this bug multiple times.  Basically, something is holding a
> mm_sem and not letting go.  Anything that walks the process list hangs.
> Ultimately, this hangs anything that's remotely useful, and you have to
> crash the box.

Indeed. I experienced the problem every 24-36hrs around five times in a 
row last week. Pretty much every morning, I'd come in and the box was 
completly dead. It seems to be okay at the moment, but I'm not holding 
my breath.

> One factoid that I forgot to mention there is that when it happens on my 
> laptop, the disk activity light is stuck on.

My box doesn't have a light, but what I got from SNMP before the box 
died suggested that the system (e.g. the 'system' CPU usage MIB) was 
using a considerable amount of CPU time (>95%), so I'm not sure if it 
was swapping madly, or if something else was going on.

David

-- 
David Coulson                                  http://davidcoulson.net/
d@vidcoulson.com                       http://journal.davidcoulson.net/


^ permalink raw reply	[flat|nested] 5+ messages in thread

end of thread, other threads:[~2002-10-17 10:13 UTC | newest]

Thread overview: 5+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2002-10-15 11:47 swap_dup/swap_free errors with 2.4.20-pre10 David Coulson
2002-10-16 21:23 ` Marcelo Tosatti
2002-10-16 22:14   ` David Coulson
2002-10-17  2:02     ` [uml-devel] " Jeff Dike
2002-10-17 10:20       ` David Coulson

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).