linux-kernel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: n0ano@indstorage.com
To: Ryan Sweet <rsweet@atos-group.nl>
Cc: linux-kernel@vger.kernel.org
Subject: Re: random reboots of diskless nodes - 2.4.7 (fwd)
Date: Fri, 19 Oct 2001 09:52:07 -0600	[thread overview]
Message-ID: <20011019095207.F13141@tlaloc.indstorage.com> (raw)
In-Reply-To: <Pine.LNX.4.30.0110160228000.18043-100000@core-0>
In-Reply-To: <Pine.LNX.4.30.0110160228000.18043-100000@core-0>; from rsweet@atos-group.nl on Tue, Oct 16, 2001 at 02:28:46AM +0200

My first guess would be power.  You said you tested the power source.
Can you get ahold of a power line monitor with a strip chart recorder?
You might have a situation where the power is normally fine but for some
reason it could fluctuate at times and kick a machine into reset.
I assume you've eliminated the possibility of the janitor who unplugs
a machine to find an outlet for his floor polisher (don't laugh, it's
happened).

How's the temperature on the machines?  Even if it's OK it would be
good to get another strip chart recorder on it to make sure the temp
stays within bounds 24hrs/day.

Also, do you have a serial console attached to each machine?  This is
the only reliable way to make sure you have every console message that
came out right before the reboot.

On Tue, Oct 16, 2001 at 02:28:46AM +0200, Ryan Sweet wrote:
> 
> I've posted about this problem before, but in the meantime I've managed to
> test under several different configurations to help rule out some possible
> causes.
> 
> Short version: 2.4.7 on nfsroot diskless nodes randomly re-boots and I
> don't think it is a hardware problem or a problem with the server (which
> is stable).  Rather than "try this, try that..." I (and more importantly
> my boss) would really like to find (and then hopefully fix) the root cause
> of the problem.
> 
>
>...
>
> 	- upgraded (replaced) the power supply in all nodes
> 	- tested power source to computer room, moved to another computer
> room with better available power, etc...

-- 
Don Dugger
"Censeo Toto nos in Kansa esse decisse." - D. Gale
n0ano@indstorage.com
Ph: 303/652-0870x117

      parent reply	other threads:[~2001-10-19 15:32 UTC|newest]

Thread overview: 6+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2001-10-16  0:28 random reboots of diskless nodes - 2.4.7 (fwd) Ryan Sweet
2001-10-16  4:58 ` Keith Owens
2001-11-05 14:50   ` Ryan Sweet
2001-10-16  8:14 ` Alan Cox
2001-10-16 20:06 ` Hans-Peter Jansen
2001-10-19 15:52 ` n0ano [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20011019095207.F13141@tlaloc.indstorage.com \
    --to=n0ano@indstorage.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=rsweet@atos-group.nl \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).