From: Xiaogang Wang <xiaogang.wang@umontreal.ca>
To: linux-kernel@vger.kernel.org
Subject: page_alloc.c bug and heavy I/O
Date: Fri, 8 Aug 2003 10:39:45 -0400
Message-ID: <Pine.SGI.4.44.0308081020540.107825-100000@esirch2.ESI.UMontreal.CA>

Hi,

My hardware and software:

  Asus P4P800, 2GB memory, 2.8GHz P4 with HT enabled.
  On-board 3Com Gigabit network card
  1 parallel ATA 160GB Maxtor disk
  Nvidia GeForce4 MX440-8x graphics card (Asus V9180)

  Red Hat 7.3, original kernel 2.4.18-3
  Intel Fortran Compiler 7.1
  Intel Math Kernel Library 6.0

My problem is that one of my Fortran codes always crashes after 10-24 hours.
This code does heavy I/O: it writes out a 5MB binary file every minute.
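
In case it helps anyone reproduce the I/O pattern without my Fortran code,
it is roughly the following. This is only a sketch, not my actual program:
the function names, the dump_NNNNN.bin file names, the 64KB chunk size, the
zero-filled data and the fsync call are all assumptions made for
illustration.

```python
import os
import time


def write_dump(path, size_bytes=5 * 1024 * 1024, chunk=64 * 1024):
    """Write size_bytes of filler data to path and force it to disk."""
    block = b"\0" * chunk
    remaining = size_bytes
    with open(path, "wb") as f:
        while remaining > 0:
            n = min(chunk, remaining)
            f.write(block[:n])
            remaining -= n
        f.flush()
        os.fsync(f.fileno())  # assumption: push the data out of the page cache
    return os.path.getsize(path)


def run(directory, count, interval_s=60.0):
    """Write count dump files into directory, one every interval_s seconds."""
    paths = []
    for i in range(count):
        path = os.path.join(directory, "dump_%05d.bin" % i)
        write_dump(path)
        paths.append(path)
        if i + 1 < count:
            time.sleep(interval_s)
    return paths
```

Something like run("/tmp/iotest", 1440) would cover about 24 hours at one
file per minute, assuming /tmp/iotest already exists and has about 7GB free.
I have not verified that this triggers the same BUG.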

The error message in /var/log/messages is (coulson is the name of the computer):

Aug  7 21:11:29 coulson kernel: kernel BUG at page_alloc.c:226!
Aug  7 21:11:29 coulson kernel: invalid operand: 0000
Aug  7 21:11:29 coulson kernel: nfsd lockd sunrpc binfmt_misc sr_mod soundcore
parport_pc lp parport autofs 3c
....

The line number in page_alloc.c varies between crashes (it is not always 226).

I have done a couple of tests to try to find the cause, but without success so
far.

1) I unloaded the 3com2000.0 network driver with rmmod. I had downloaded the
driver source from the Asus website and compiled it myself. The crash still
occurs.

2) I have heard that the Nvidia binary driver can cause page_alloc.c kernel
bugs. I do have an Nvidia GeForce4 MX440-8x graphics card (Asus V9180), but I
did not use the Nvidia binary driver. Instead I used the vesa driver that
comes with Red Hat 7.3. Nevertheless, I switched to an old PCI ATI Rage
graphics card, and the crash still occurs.

3) When the crash occurs, my local X is up and running. I have not tried
the case with local X shut down.

I also get the same crash when I run the code on a second computer with the
same hardware and software. This makes a hardware defect less likely to be
the cause.

I am thinking of compiling a new kernel. But for now I am focusing on whether
the crash is caused by some incompatible driver modules.

I would appreciate your input on this. Please cc your answer to me, as I am
not on the list.

Xiaogang



------------------------------------------------
Dr Xiaogang Wang
Departement de chimie
Universite de Montreal
C.P. 6128, succursale Centre-ville
Montreal (Quebec) H3C 3J7

Tel. (514) 3436111 ext 3947 (office)
FAX  (514) 3437586 (office)
e-mail: xiaogang.wang@umontreal.ca
homepage: http://www.esi.umontreal.ca/~wangx
------------------------------------------------


