linux-kernel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* Problem with mm in 2.4.19 and 2.4.20
@ 2003-02-08 12:16 Przemysław Maciuszko
  2003-08-11  7:34 ` 2.4.18/2.4.20 filemap.c pmd bug (was Re: Problem with mm in 2.4.19 and 2.4.20) Harald Welte
  0 siblings, 1 reply; 4+ messages in thread
From: Przemysław Maciuszko @ 2003-02-08 12:16 UTC (permalink / raw)
  To: linux-kernel

Hello.
I have a problem with one news server (feeder) box running INN.
Under heavy load i get the following error on the console:

filemap.c:2084: bad pmd 2bc001e3

This showed few times during last few days and few times server 'hanged up'
after this.
Anyone has an idea what can cause it?

I'm using Linux Debian on dual PIII 1.1Ghz, 1GB RAM, LVM version 1.0.6,
Qlogic FC 2200F driver version 6.01
Any help would be apreciated...


-- 
Przemysław Maciuszko
Agora S.A.

^ permalink raw reply	[flat|nested] 4+ messages in thread

* 2.4.18/2.4.20 filemap.c pmd bug (was Re: Problem with mm in 2.4.19 and 2.4.20)
  2003-02-08 12:16 Problem with mm in 2.4.19 and 2.4.20 Przemysław Maciuszko
@ 2003-08-11  7:34 ` Harald Welte
  2003-08-11  9:48   ` Christoph Hellwig
  0 siblings, 1 reply; 4+ messages in thread
From: Harald Welte @ 2003-08-11  7:34 UTC (permalink / raw)
  To: sal; +Cc: linux-kernel

[-- Attachment #1: Type: text/plain, Size: 2834 bytes --]

Przemys?aw Maciuszko wrote:

>I have a problem with one news server (feeder) box running INN.
>Under heavy load i get the following error on the console:
>
>filemap.c:2084: bad pmd 2bc001e3
>
>This showed few times during last few days and few times server 'hanged up'
>after this.

I can confirm this problem.  It happens on one of my newsservers as well,
currently at least once per day.  It is a dual PIII 650MHz, 1GB RAM,
200GB spool (scsi hardware raid array attached to adaptec aic7xxx), six
seperate SCSI disks attached to a seperate aic7xxx controller for
overview, running inn-2.3.2.

We've tried  RedHat kernels 2.4.18-3, 2.4.18-17.7, 2.4.20-19.7 and 
2.4.20-19.7bigmem as well as a kernel.org 2.4.20 - all with the same 
problem.

After the filemap.c / pmd_ERROR() printk, the box either hangs (no 
further printout, not that often) or has a stack overflow (most of the 
time):

filemap.c:2258: bad pmd c0003000(00000000000001e3).
do_IRQ: stack overflow: -864
c0252845 fffffca0 206d6564 c2426000 00000000 c0117b20 c0101018 c024bd2c
c2426000 00000018 00000018 00000000 c0117b20 c0101018 c2426470 6f6e0018
40320018 ffffff00 c0117b43 00000010 00000202 7369636e 3e65642e 613c200a
Call Trace:   [<c0117b20>] do_page_fault [kernel] 0x0 (0xc242634c))
[<c0117b20>] do_page_fault [kernel] 0x0 (0xc2426368))
[<c0117b43>] do_page_fault [kernel] 0x23 (0xc2426380))
[<c0117b20>] do_page_fault [kernel] 0x0 (0xc242645c))
[<c0108cc4>] error_code [kernel] 0x34 (0xc2426464))
[<c0117fc5>] do_page_fault [kernel] 0x4a5 (0xc2426498))
[<c0117b20>] do_page_fault [kernel] 0x0 (0xc2426574))
[<c0108cc4>] error_code [kernel] 0x34 (0xc242657c))
[<c0117fc5>] do_page_fault [kernel] 0x4a5 (0xc24265b0))
[<c0117b20>] do_page_fault [kernel] 0x0 (0xc242668c))
[<c0108cc4>] error_code [kernel] 0x34 (0xc2426694))
[<c0117fc5>] do_page_fault [kernel] 0x4a5 (0xc24266c8))
[<c0117b20>] do_page_fault [kernel] 0x0 (0xc24267a4))
[<c0108cc4>] error_code [kernel] 0x34 (0xc24267ac))

The messages are always preceded by a '(scsi0:A:0:0): Locking max tag 
count at 64' message. The scsi device number is changing, so it cannot 
be a single device

>Anyone has an idea what can cause it?

Unfortunately I'm not very familiar with the linux MM subsystem.  But 
since I consider this now as a confirmed bug, maybe some of the other 
lkml folks have an idea what might be going on.

>I'm using Linux Debian on dual PIII 1.1Ghz, 1GB RAM, LVM version 1.0.6
>Qlogic FC 2200F driver version 6.01

We don't use lvm, so the similarities seem to be:  Dual PIII, 
SCSI, INN

--
- Harald Welte <laforge@gnumonks.org>               http://www.gnumonks.org/
============================================================================
Programming is like sex: One mistake and you have to support it your lifetime

[-- Attachment #2: Type: application/pgp-signature, Size: 189 bytes --]

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: 2.4.18/2.4.20 filemap.c pmd bug (was Re: Problem with mm in 2.4.19 and 2.4.20)
  2003-08-11  7:34 ` 2.4.18/2.4.20 filemap.c pmd bug (was Re: Problem with mm in 2.4.19 and 2.4.20) Harald Welte
@ 2003-08-11  9:48   ` Christoph Hellwig
  2003-08-11 10:21     ` Harald Welte
  0 siblings, 1 reply; 4+ messages in thread
From: Christoph Hellwig @ 2003-08-11  9:48 UTC (permalink / raw)
  To: Harald Welte; +Cc: sal, linux-kernel

On Mon, Aug 11, 2003 at 09:34:43AM +0200, Harald Welte wrote:
> >I'm using Linux Debian on dual PIII 1.1Ghz, 1GB RAM, LVM version 1.0.6
> >Qlogic FC 2200F driver version 6.01
> 
> We don't use lvm, so the similarities seem to be:  Dual PIII, 
> SCSI, INN

Well, qlogic + lvm is vert prone of stack overflows.  You're using aic7xxx
I assume?  Some other interesting drivers?


^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: 2.4.18/2.4.20 filemap.c pmd bug (was Re: Problem with mm in 2.4.19 and 2.4.20)
  2003-08-11  9:48   ` Christoph Hellwig
@ 2003-08-11 10:21     ` Harald Welte
  0 siblings, 0 replies; 4+ messages in thread
From: Harald Welte @ 2003-08-11 10:21 UTC (permalink / raw)
  To: Christoph Hellwig, sal, linux-kernel

[-- Attachment #1: Type: text/plain, Size: 1010 bytes --]

Hi Christian.  First of all, thanks for your quick reply.

On Mon, Aug 11, 2003 at 10:48:23AM +0100, Christoph Hellwig wrote:
 
> Well, qlogic + lvm is vert prone of stack overflows.  

In my case, we use neither of them.

> You're using aic7xxx I assume?  

yes.  The device is reported as

scsi0 : Adaptec AIC7XXX EISA/VLB/PCI SCSI HBA DRIVER, Rev 6.2.8
        <Adaptec aic7890/91 Ultra2 SCSI adapter>
        aic7890/91: Ultra2 Wide Channel A, SCSI Id=7, 32/253 SCBs


> Some other interesting drivers?

Well, there's a tulip based network board and one symbios SCSI controller
(ncr53c8xx driver) in the system.  But since the '(scsi0:A:9:0): Locking
max tag count at 64' message always indicates 'scsi0', I think it has to
do with aic7xxx.

-- 
- Harald Welte <laforge@gnumonks.org>               http://www.gnumonks.org/
============================================================================
Programming is like sex: One mistake and you have to support it your lifetime

[-- Attachment #2: Type: application/pgp-signature, Size: 189 bytes --]

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2003-08-11 10:25 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2003-02-08 12:16 Problem with mm in 2.4.19 and 2.4.20 Przemysław Maciuszko
2003-08-11  7:34 ` 2.4.18/2.4.20 filemap.c pmd bug (was Re: Problem with mm in 2.4.19 and 2.4.20) Harald Welte
2003-08-11  9:48   ` Christoph Hellwig
2003-08-11 10:21     ` Harald Welte

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).