* Need help with lockups on tyan s2460 motherboard in SMP mode
@ 2003-04-22 13:38 James Strandboge
  2003-04-22 19:04 ` James Strandboge
  2003-04-23 21:26 ` James Strandboge
  0 siblings, 2 replies; 3+ messages in thread
From: James Strandboge @ 2003-04-22 13:38 UTC
  To: Linux Kernel Mailing List

I have googled a lot on this issue, but if there is somewhere else I can
look, please tell me.

I have a dual AMD 1600+ (MP processors) system with tyan s2460
motherboard that freezes with no error messages in syslog.  I have read
(among a lot of other things):
http://ouray.cudenver.edu/~etumenba/smp-howto/SMP-HOWTO.html

and what I have done so far is:

replaced power supply with Enermax 550
BIOS upgrade to 1.05
kernels used are 2.4.18 from debian, 2.4.20 and 2.4.21-pre6 
compiled in acpi (acpi-200303028-2.4.21-pre6)
compiled out acpi
compiled with and without Athlon support
Currently using 2.4.21-pre6 for i386

booted with combinations of the following:
noapic
noacpi
acpi=off
nopentium

tried the following bios settings in various combinations:
MultiProcessor specification 1.1 and 1.4
Use PCI Interrupt Entries in MP as yes and no
enabled/disabled acpi
enabled/disabled power management

The only way I have found to reliably crash the system is to compile two
kernels simultaneously while in smp mode.  If I boot with 'nosmp' the
system seems ok (still running and compiling after 9 hours).
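
[Editor's sketch] The reproduction described above (two kernel builds running at the same time) could be scripted roughly as follows. This is a minimal sketch, not the poster's actual procedure: the helper names, the source-tree paths passed in, and the `bzImage` target are assumptions made for illustration.

```python
import subprocess


def kernel_build_commands(trees, target="bzImage"):
    """Build one `make` command line per kernel source tree.

    `trees` are hypothetical paths to already-configured source trees;
    `target` is assumed to be the usual 2.4-era image target.
    """
    return [["make", "-C", tree, target] for tree in trees]


def run_parallel(commands):
    """Start every command at once, then wait for all of them.

    Returns the exit codes in the same order as `commands`.
    """
    procs = [subprocess.Popen(cmd) for cmd in commands]
    return [proc.wait() for proc in procs]
```

Calling `run_parallel(kernel_build_commands([tree_a, tree_b]))` with two real source trees would keep both processors busy; on a machine that locks up, the call simply never returns.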

Because the system runs ok with one processor, it leads me to think it
may be kernel code, and not heat or memory related.  Though I guess it
could be the motherboard still.  Is this a correct assessment?  I'd be
happy to send any additional information.

I am new to SMP, so any and all suggestions are welcome.

Thanks,

Jamie Strandboge

-- 
James Strandboge
Targeted Performance Partners, LLC
Web: http://www.tpptraining.com
E-mail: jamie@tpptraining.com
Tel: (585) 271-8370
Fax: (585) 271-8373



* Re: Need help with lockups on tyan s2460 motherboard in SMP mode
From: James Strandboge @ 2003-04-22 19:04 UTC
  To: Linux Kernel Mailing List

On Tue, 2003-04-22 at 09:38, James Strandboge wrote:

> The only way I have found to reliably crash the system is to compile two
> kernels simultaneously while in smp mode.  If I boot with 'nosmp' the
> system seems ok (still running and compiling after 9 hours).

Well, it crashed after 15 hours with nosmp.  I guess it's hardware related.

Jamie



* Re: Need help with lockups on tyan s2460 motherboard in SMP mode
From: James Strandboge @ 2003-04-23 21:26 UTC
  To: Linux Kernel Mailing List

On Tue, 2003-04-22 at 09:38, James Strandboge wrote:
> I have a dual AMD 1600+ (MP processors) system with tyan s2460
> motherboard that freezes with no error messages in syslog.  

First off, thank you to everyone who responded; your input helped me
figure this out.  I wanted to send this to the list so others might
benefit.

As it turns out, the problem appears to have been solely heat related. 
I ended up installing a vanilla 2.4.20 kernel with SMP for 386 (i.e., no
pentium or athlon optimizations), without acpi, but with apm as a
module.  I disabled power management in the BIOS, and boot with only
apm=power-off for kernel command line arguments.  Notice I did NOT use
noapic.  The kernel boots fine and has no APIC errors (though there is
that errata #22 message).  I then installed lm_sensors 2.7.0 and
i2c-2.7.0 so I could read the temperature.

The temperature readings for the 3904 transistor are the ones to use,
since they are closest to those in the BIOS.  I called Tyan and spoke
with someone on the phone, and he said that the system will lock up
around 60C (not the advertised 80-90C, because the sensor is reading air
temperature).  With the case all closed up I ran two kernel compiles,
and two 'top -d .01' instances all at once.  Then I ran 'sensors' every
30 seconds, and sure enough, the computer got near 60C and crashed.  I
took the sides off the case, and used an oscillating fan to blow air
into the computer and rebooted.  'sensors' now shows the temperature
between 39 and 43C, and the machine has been running the above stress
test for several hours (it has never lasted that long before).
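
[Editor's sketch] The "run 'sensors' every 30 seconds" step above can be automated so that the last reading before a lockup survives on disk. This is a rough sketch under stated assumptions: the regular expression guesses at the `sensors` output layout (which varies by lm_sensors version and chip), and the log file name, the function names, and the 55C warning margin below the roughly 60C lockup point are all illustrative choices, not part of the original report.

```python
import os
import re
import subprocess
import time

# Assumed line layout, e.g. "temp1:   +58.0°C  (limit = +60°C)".
# Only the first temperature on each line is captured.
TEMP_RE = re.compile(
    r"^[ \t]*([^:\n]+):[ \t]*\+?(-?\d+(?:\.\d+)?)[ \t]*°?[ \t]*C\b",
    re.MULTILINE,
)


def parse_temps(text):
    """Map each sensor label to its first temperature reading in Celsius."""
    return {label.strip(): float(value) for label, value in TEMP_RE.findall(text)}


def hot_sensors(temps, warn_at=55.0):
    """Return the labels whose reading is at or above `warn_at`, sorted."""
    return sorted(name for name, value in temps.items() if value >= warn_at)


def log_temps(path, temps, warn_at=55.0):
    """Append one timestamped line and force it to disk before returning."""
    stamp = time.strftime("%Y-%m-%d %H:%M:%S")
    hot = hot_sensors(temps, warn_at)
    line = f"{stamp} {temps}" + (f" WARN {','.join(hot)}" if hot else "")
    with open(path, "a") as handle:
        handle.write(line + "\n")
        handle.flush()
        os.fsync(handle.fileno())  # a hard lockup must not lose this line
    return line


def monitor(path="temps.log", interval=30):
    """Poll `sensors` forever, logging one line per interval."""
    while True:
        result = subprocess.run(["sensors"], capture_output=True, text=True)
        log_temps(path, parse_temps(result.stdout), warn_at=55.0)
        time.sleep(interval)
```

Running `monitor()` in one terminal while the stress test runs in another leaves a log whose final line shows how hot the board was just before it froze.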

So now I will be buying some additional fans and making sure the air
flow is good in the case.  I also wholeheartedly recommend using
lm_sensors.  It REALLY helped with debugging this issue.

Jamie


