linux-kernel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Willy Tarreau <willy@w.ods.org>
To: Chris Stromsoe <cbs@cts.ucla.edu>
Cc: Marcelo Tosatti <marcelo.tosatti@cyclades.com>,
	linux-kernel@vger.kernel.org
Subject: Re: bad pmd filemap.c, oops; 2.4.30 and 2.4.32
Date: Sat, 31 Dec 2005 08:12:16 +0100	[thread overview]
Message-ID: <20051231071215.GX15993@alpha.home.local> (raw)
In-Reply-To: <Pine.LNX.4.64.0512301732170.21145@potato.cts.ucla.edu>


On Fri, Dec 30, 2005 at 05:48:15PM -0800, Chris Stromsoe wrote:
> I'm starting to suspect bad hardware.  Booting is now hanging (with 
> 2.4.27, 2.4.30 and 2.4.32) after scsi drivers load:

And nothing changed since previous boot, except UP ?

(...) 
> If I wait several minutes (around 10 or 15 minutes), I get:
> 
> scsi0:0:0:0: Attempting to queue an ABORT message
> CDB: 0x12 0x0 0x0 0x0 0xff 0x0
> scsi0:0:0:0: Command already completed
> aic7xxx_abort returns 0x2002
> scsi0:0:0:0: Attempting to queue an ABORT message
> CDB: 0x0 0x0 0x0 0x0 0x0 0x0
> scsi0:0:0:0: Command already completed
> aic7xxx_abort returns 0x2002
> scsi0:0:0:0: Attempting to queue a TARGET RESET message
> CDB: 0x12 0x0 0x0 0x0 0xff 0x0
> scsi0:0:0:0: Is not an active device
> aic7xxx_dev_reset returns 0x2002
> scsi0:0:0:0: Attempting to queue an ABORT message
> CDB: 0x0 0x0 0x0 0x0 0x0 0x0
> scsi0:0:0:0: Command already completed
> aic7xxx_abort returns 0x2002
> scsi0:0:0:0: Attempting to queue an ABORT message
> CDB: 0x0 0x0 0x0 0x0 0x0 0x0
> scsi0:0:0:0: Command already completed
> aic7xxx_abort returns 0x2002
> scsi: device set offline - not ready or command retry failed after bus 
> reset: host 0 channel 0 id 0 lun 0
> 
> 
> The messages repeated for all 15 targets on scsi0.  It's looking like it 
> will repeat for scsi1 as well.
(...)

it recalls me bad memories on my machine a very long time ago when the
driver was buggy :-(
It's not necessarily bad hardware. I also had trouble on one version
of the 29160 bios where it hanged during device scan if there were
too many terminations. Oh, BTW, please check that you have disabled
"automatic" termination in the BIOS. Manually set it either to ON or
OFF (low/high depending on your setup).

> How likely is it that a failing scsi controller contribute to the other 
> problems I was seeing?

Not much. Perhaps at worst, a failing controller could corrupt memory
by writing garbage at wrong locations, but you would not always get
the same messages. It seems to be a different problem here. To be
honnest, it's where I think you should try the new driver.

Regards,
Willy


  parent reply	other threads:[~2005-12-31  7:14 UTC|newest]

Thread overview: 40+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2005-12-27 16:58 bad pmd filemap.c, oops; 2.4.30 and 2.4.32 Chris Stromsoe
2005-12-28  0:10 ` Marcelo Tosatti
2005-12-29  2:52   ` Chris Stromsoe
2005-12-29  5:12     ` Willy Tarreau
2005-12-29  9:33       ` Chris Stromsoe
2005-12-29 10:08         ` Willy Tarreau
2005-12-29 12:01           ` Chris Stromsoe
2005-12-31  0:12     ` Chris Stromsoe
2005-12-31  1:48       ` Chris Stromsoe
2005-12-31  4:00         ` Chris Stromsoe
2005-12-31  7:25           ` Willy Tarreau
2005-12-31 11:06             ` Chris Stromsoe
2005-12-31  7:12         ` Willy Tarreau [this message]
2005-12-31 10:39           ` Chris Stromsoe
2005-12-31 10:56             ` Willy Tarreau
2005-12-31 12:08         ` Alan Cox
2005-12-31 13:01           ` Willy Tarreau
2006-01-05  3:52             ` Chris Stromsoe
2006-01-05  5:43               ` Willy Tarreau
2006-01-06 21:54                 ` Chris Stromsoe
2006-01-06 22:14                   ` Chris Stromsoe
2006-01-06 22:16                     ` Chris Stromsoe
2006-01-07  9:19                     ` Roberto Nibali
2006-01-09 18:28                       ` Chris Stromsoe
2006-01-09 20:16                         ` Roberto Nibali
2006-01-09 20:22                           ` Chris Stromsoe
2006-01-09 22:22                             ` Roberto Nibali
2006-01-10  0:59                               ` Chris Stromsoe
2006-01-15 11:29                                 ` Chris Stromsoe
2006-01-15 12:12                                   ` Willy Tarreau
2006-01-15 21:18                                     ` Chris Stromsoe
2006-01-15 22:38                                   ` Chris Stromsoe
2006-01-15 22:46                                     ` Willy TARREAU
2006-01-15 22:54                                       ` Chris Stromsoe
2006-01-16 20:52                                         ` Roberto Nibali
2006-01-16 21:32                                           ` Chris Stromsoe
2006-02-08  6:32                                         ` Chris Stromsoe
2006-02-08  6:37                                           ` Willy Tarreau
2006-01-08  9:45                   ` Willy Tarreau
2006-01-09 18:33                     ` Chris Stromsoe

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20051231071215.GX15993@alpha.home.local \
    --to=willy@w.ods.org \
    --cc=cbs@cts.ucla.edu \
    --cc=linux-kernel@vger.kernel.org \
    --cc=marcelo.tosatti@cyclades.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).