* Problem with mm in 2.4.19 and 2.4.20 @ 2003-02-08 12:16 Przemysław Maciuszko 2003-08-11 7:34 ` 2.4.18/2.4.20 filemap.c pmd bug (was Re: Problem with mm in 2.4.19 and 2.4.20) Harald Welte 0 siblings, 1 reply; 4+ messages in thread From: Przemysław Maciuszko @ 2003-02-08 12:16 UTC (permalink / raw) To: linux-kernel Hello. I have a problem with one news server (feeder) box running INN. Under heavy load i get the following error on the console: filemap.c:2084: bad pmd 2bc001e3 This showed few times during last few days and few times server 'hanged up' after this. Anyone has an idea what can cause it? I'm using Linux Debian on dual PIII 1.1Ghz, 1GB RAM, LVM version 1.0.6, Qlogic FC 2200F driver version 6.01 Any help would be apreciated... -- Przemysław Maciuszko Agora S.A. ^ permalink raw reply [flat|nested] 4+ messages in thread
* 2.4.18/2.4.20 filemap.c pmd bug (was Re: Problem with mm in 2.4.19 and 2.4.20) 2003-02-08 12:16 Problem with mm in 2.4.19 and 2.4.20 Przemysław Maciuszko @ 2003-08-11 7:34 ` Harald Welte 2003-08-11 9:48 ` Christoph Hellwig 0 siblings, 1 reply; 4+ messages in thread From: Harald Welte @ 2003-08-11 7:34 UTC (permalink / raw) To: sal; +Cc: linux-kernel [-- Attachment #1: Type: text/plain, Size: 2834 bytes --] Przemys?aw Maciuszko wrote: >I have a problem with one news server (feeder) box running INN. >Under heavy load i get the following error on the console: > >filemap.c:2084: bad pmd 2bc001e3 > >This showed few times during last few days and few times server 'hanged up' >after this. I can confirm this problem. It happens on one of my newsservers as well, currently at least once per day. It is a dual PIII 650MHz, 1GB RAM, 200GB spool (scsi hardware raid array attached to adaptec aic7xxx), six seperate SCSI disks attached to a seperate aic7xxx controller for overview, running inn-2.3.2. We've tried RedHat kernels 2.4.18-3, 2.4.18-17.7, 2.4.20-19.7 and 2.4.20-19.7bigmem as well as a kernel.org 2.4.20 - all with the same problem. After the filemap.c / pmd_ERROR() printk, the box either hangs (no further printout, not that often) or has a stack overflow (most of the time): filemap.c:2258: bad pmd c0003000(00000000000001e3). do_IRQ: stack overflow: -864 c0252845 fffffca0 206d6564 c2426000 00000000 c0117b20 c0101018 c024bd2c c2426000 00000018 00000018 00000000 c0117b20 c0101018 c2426470 6f6e0018 40320018 ffffff00 c0117b43 00000010 00000202 7369636e 3e65642e 613c200a Call Trace: [<c0117b20>] do_page_fault [kernel] 0x0 (0xc242634c)) [<c0117b20>] do_page_fault [kernel] 0x0 (0xc2426368)) [<c0117b43>] do_page_fault [kernel] 0x23 (0xc2426380)) [<c0117b20>] do_page_fault [kernel] 0x0 (0xc242645c)) [<c0108cc4>] error_code [kernel] 0x34 (0xc2426464)) [<c0117fc5>] do_page_fault [kernel] 0x4a5 (0xc2426498)) [<c0117b20>] do_page_fault [kernel] 0x0 (0xc2426574)) [<c0108cc4>] error_code [kernel] 0x34 (0xc242657c)) [<c0117fc5>] do_page_fault [kernel] 0x4a5 (0xc24265b0)) [<c0117b20>] do_page_fault [kernel] 0x0 (0xc242668c)) [<c0108cc4>] error_code [kernel] 0x34 (0xc2426694)) [<c0117fc5>] do_page_fault [kernel] 0x4a5 (0xc24266c8)) [<c0117b20>] do_page_fault [kernel] 0x0 (0xc24267a4)) [<c0108cc4>] error_code [kernel] 0x34 (0xc24267ac)) The messages are always preceded by a '(scsi0:A:0:0): Locking max tag count at 64' message. The scsi device number is changing, so it cannot be a single device >Anyone has an idea what can cause it? Unfortunately I'm not very familiar with the linux MM subsystem. But since I consider this now as a confirmed bug, maybe some of the other lkml folks have an idea what might be going on. >I'm using Linux Debian on dual PIII 1.1Ghz, 1GB RAM, LVM version 1.0.6 >Qlogic FC 2200F driver version 6.01 We don't use lvm, so the similarities seem to be: Dual PIII, SCSI, INN -- - Harald Welte <laforge@gnumonks.org> http://www.gnumonks.org/ ============================================================================ Programming is like sex: One mistake and you have to support it your lifetime [-- Attachment #2: Type: application/pgp-signature, Size: 189 bytes --] ^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: 2.4.18/2.4.20 filemap.c pmd bug (was Re: Problem with mm in 2.4.19 and 2.4.20) 2003-08-11 7:34 ` 2.4.18/2.4.20 filemap.c pmd bug (was Re: Problem with mm in 2.4.19 and 2.4.20) Harald Welte @ 2003-08-11 9:48 ` Christoph Hellwig 2003-08-11 10:21 ` Harald Welte 0 siblings, 1 reply; 4+ messages in thread From: Christoph Hellwig @ 2003-08-11 9:48 UTC (permalink / raw) To: Harald Welte; +Cc: sal, linux-kernel On Mon, Aug 11, 2003 at 09:34:43AM +0200, Harald Welte wrote: > >I'm using Linux Debian on dual PIII 1.1Ghz, 1GB RAM, LVM version 1.0.6 > >Qlogic FC 2200F driver version 6.01 > > We don't use lvm, so the similarities seem to be: Dual PIII, > SCSI, INN Well, qlogic + lvm is vert prone of stack overflows. You're using aic7xxx I assume? Some other interesting drivers? ^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: 2.4.18/2.4.20 filemap.c pmd bug (was Re: Problem with mm in 2.4.19 and 2.4.20) 2003-08-11 9:48 ` Christoph Hellwig @ 2003-08-11 10:21 ` Harald Welte 0 siblings, 0 replies; 4+ messages in thread From: Harald Welte @ 2003-08-11 10:21 UTC (permalink / raw) To: Christoph Hellwig, sal, linux-kernel [-- Attachment #1: Type: text/plain, Size: 1010 bytes --] Hi Christian. First of all, thanks for your quick reply. On Mon, Aug 11, 2003 at 10:48:23AM +0100, Christoph Hellwig wrote: > Well, qlogic + lvm is vert prone of stack overflows. In my case, we use neither of them. > You're using aic7xxx I assume? yes. The device is reported as scsi0 : Adaptec AIC7XXX EISA/VLB/PCI SCSI HBA DRIVER, Rev 6.2.8 <Adaptec aic7890/91 Ultra2 SCSI adapter> aic7890/91: Ultra2 Wide Channel A, SCSI Id=7, 32/253 SCBs > Some other interesting drivers? Well, there's a tulip based network board and one symbios SCSI controller (ncr53c8xx driver) in the system. But since the '(scsi0:A:9:0): Locking max tag count at 64' message always indicates 'scsi0', I think it has to do with aic7xxx. -- - Harald Welte <laforge@gnumonks.org> http://www.gnumonks.org/ ============================================================================ Programming is like sex: One mistake and you have to support it your lifetime [-- Attachment #2: Type: application/pgp-signature, Size: 189 bytes --] ^ permalink raw reply [flat|nested] 4+ messages in thread
end of thread, other threads:[~2003-08-11 10:25 UTC | newest] Thread overview: 4+ messages (download: mbox.gz / follow: Atom feed) -- links below jump to the message on this page -- 2003-02-08 12:16 Problem with mm in 2.4.19 and 2.4.20 Przemysław Maciuszko 2003-08-11 7:34 ` 2.4.18/2.4.20 filemap.c pmd bug (was Re: Problem with mm in 2.4.19 and 2.4.20) Harald Welte 2003-08-11 9:48 ` Christoph Hellwig 2003-08-11 10:21 ` Harald Welte
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).