All of lore.kernel.org
 help / color / mirror / Atom feed
* x60 - spontaneous thermal shutdown
@ 2006-09-04 21:40 Pavel Machek
  2006-09-04 22:26 ` Andreas Mohr
  2006-09-11  9:46   ` Stefan Seyfried
  0 siblings, 2 replies; 10+ messages in thread
From: Pavel Machek @ 2006-09-04 21:40 UTC (permalink / raw)
  To: ACPI mailing list, kernel list

Hi!

x60 shut down after quite a while of uptime, in period of quite heavy
load:

Sep  4 23:33:01 amd kernel: ACPI: Critical trip point
Sep  4 23:33:01 amd kernel: Critical temperature reached (128 C), shutting down.
Sep  4 23:33:01 amd shutdown[32585]: shutting down for system halt
Sep  4 23:34:42 amd init: Switching to runlevel: 0

I do not think cpu reached 128C, as I still have my machine... Did
anyone else see that?
									Pavel
-- 
(english) http://www.livejournal.com/~pavelmachek
(cesky, pictures) http://atrey.karlin.mff.cuni.cz/~pavel/picture/horses/blog.html

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: x60 - spontaneous thermal shutdown
  2006-09-04 21:40 x60 - spontaneous thermal shutdown Pavel Machek
@ 2006-09-04 22:26 ` Andreas Mohr
  2006-09-04 22:35   ` Pavel Machek
  2006-09-11  9:46   ` Stefan Seyfried
  1 sibling, 1 reply; 10+ messages in thread
From: Andreas Mohr @ 2006-09-04 22:26 UTC (permalink / raw)
  To: Pavel Machek; +Cc: ACPI mailing list, kernel list

Hi,

On Mon, Sep 04, 2006 at 11:40:59PM +0200, Pavel Machek wrote:
> Hi!
> 
> x60 shut down after quite a while of uptime, in period of quite heavy
> load:
> 
> Sep  4 23:33:01 amd kernel: ACPI: Critical trip point
> Sep  4 23:33:01 amd kernel: Critical temperature reached (128 C), shutting down.
> Sep  4 23:33:01 amd shutdown[32585]: shutting down for system halt
> Sep  4 23:34:42 amd init: Switching to runlevel: 0
> 
> I do not think cpu reached 128C, as I still have my machine... Did
> anyone else see that?

Could this be in any way related to the (in)famous Random Shutdown issues
on a little too many Apple MacBooks?
(since the x60 incidentally just happens to be Core Duo architecture, too)

Those Random Shutdown issues at least in several cases appear to happen
due to trouble with the temperature sensor or mainboard issues.
Thermal management is in quite some trouble there, judging from
the rather diverse aspects of machine shutdown failure...
(fan not working, CPU overheating, NOT overheating but shutting down
directly after boot, ...)

There's nothing like rushing out immature hardware to unsuspecting consumers...

Andreas Mohr

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: x60 - spontaneous thermal shutdown
  2006-09-04 22:26 ` Andreas Mohr
@ 2006-09-04 22:35   ` Pavel Machek
  2006-09-04 22:53     ` Andreas Mohr
  2006-09-05  6:10     ` Jan Engelhardt
  0 siblings, 2 replies; 10+ messages in thread
From: Pavel Machek @ 2006-09-04 22:35 UTC (permalink / raw)
  To: Andreas Mohr; +Cc: ACPI mailing list, kernel list

Hi!

> > x60 shut down after quite a while of uptime, in period of quite heavy
> > load:
> > 
> > Sep  4 23:33:01 amd kernel: ACPI: Critical trip point
> > Sep  4 23:33:01 amd kernel: Critical temperature reached (128 C), shutting down.
> > Sep  4 23:33:01 amd shutdown[32585]: shutting down for system halt
> > Sep  4 23:34:42 amd init: Switching to runlevel: 0
> > 
> > I do not think cpu reached 128C, as I still have my machine... Did
> > anyone else see that?
> 
> Could this be in any way related to the (in)famous Random Shutdown issues
> on a little too many Apple MacBooks?
> (since the x60 incidentally just happens to be Core Duo
 > architecture, too)

Well, but those macbooks were really overheating, no? This seems like
sensor failure, because I do not think cpu had 128 Celsius, without
going through 100 Celsius, first.

> Those Random Shutdown issues at least in several cases appear to happen
> due to trouble with the temperature sensor or mainboard issues.
> Thermal management is in quite some trouble there, judging from
> the rather diverse aspects of machine shutdown failure...
> (fan not working, CPU overheating, NOT overheating but shutting down
> directly after boot, ...)

I had fan working at the time of shutdown, and machine was able to
boot immediately afterwards. That means that 128 celsius was sensor
error.


-- 
(english) http://www.livejournal.com/~pavelmachek
(cesky, pictures) http://atrey.karlin.mff.cuni.cz/~pavel/picture/horses/blog.html

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: x60 - spontaneous thermal shutdown
  2006-09-04 22:35   ` Pavel Machek
@ 2006-09-04 22:53     ` Andreas Mohr
  2006-09-05  6:10     ` Jan Engelhardt
  1 sibling, 0 replies; 10+ messages in thread
From: Andreas Mohr @ 2006-09-04 22:53 UTC (permalink / raw)
  To: Pavel Machek; +Cc: ACPI mailing list, kernel list

Hi,

On Tue, Sep 05, 2006 at 12:35:20AM +0200, Pavel Machek wrote:
> Well, but those macbooks were really overheating, no? This seems like
> sensor failure, because I do not think cpu had 128 Celsius, without
> going through 100 Celsius, first.

No, in several cases it was a problem with a broken/damaged sensor cable.
But it seems there are a number of different problems with thermal
management, a non-working sensor cable only being (albeit a significant)
one of those.
Google "MacBook random shutdown" will provide tons of information.

> I had fan working at the time of shutdown, and machine was able to
> boot immediately afterwards. That means that 128 celsius was sensor
> error.

Let's hope people get that braindamage resolved, either via BIOS updates
(hmm, but probably not helpful in case of ACPI?) or by shipping/repairing
into working hardware (an astonishing amount of people already had their
2nd or 3rd non-working repair).

Andreas Mohr

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: x60 - spontaneous thermal shutdown
  2006-09-04 22:35   ` Pavel Machek
  2006-09-04 22:53     ` Andreas Mohr
@ 2006-09-05  6:10     ` Jan Engelhardt
  1 sibling, 0 replies; 10+ messages in thread
From: Jan Engelhardt @ 2006-09-05  6:10 UTC (permalink / raw)
  To: Pavel Machek; +Cc: Andreas Mohr, ACPI mailing list, kernel list


>> > Sep  4 23:33:01 amd kernel: ACPI: Critical trip point
>> > Sep  4 23:33:01 amd kernel: Critical temperature reached (128 C), shutting down.
>> > Sep  4 23:33:01 amd shutdown[32585]: shutting down for system halt
>> > Sep  4 23:34:42 amd init: Switching to runlevel: 0
>> > 
>> > I do not think cpu reached 128C, as I still have my machine... Did
>> > anyone else see that?
>> 
>> Could this be in any way related to the (in)famous Random Shutdown issues
>> on a little too many Apple MacBooks?
>> (since the x60 incidentally just happens to be Core Duo
>> architecture, too)
>
>Well, but those macbooks were really overheating, no? This seems like
>sensor failure, because I do not think cpu had 128 Celsius, without
>going through 100 Celsius, first.
>
>I had fan working at the time of shutdown, and machine was able to
>boot immediately afterwards. That means that 128 celsius was sensor
>error.

If it was near 128 C for some time, the plastic case the mainboard is 
housed in would have been extremely hot and one would have probably burned 
his fingers.


Jan Engelhardt
-- 

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: x60 - spontaneous thermal shutdown
  2006-09-04 21:40 x60 - spontaneous thermal shutdown Pavel Machek
@ 2006-09-11  9:46   ` Stefan Seyfried
  2006-09-11  9:46   ` Stefan Seyfried
  1 sibling, 0 replies; 10+ messages in thread
From: Stefan Seyfried @ 2006-09-11  9:46 UTC (permalink / raw)
  To: Pavel Machek; +Cc: ACPI mailing list, kernel list

On Mon, Sep 04, 2006 at 11:40:59PM +0200, Pavel Machek wrote:
> Hi!
> 
> x60 shut down after quite a while of uptime, in period of quite heavy
> load:
> 
> Sep  4 23:33:01 amd kernel: ACPI: Critical trip point
> Sep  4 23:33:01 amd kernel: Critical temperature reached (128 C), shutting down.
> Sep  4 23:33:01 amd shutdown[32585]: shutting down for system halt
> Sep  4 23:34:42 amd init: Switching to runlevel: 0
> 
> I do not think cpu reached 128C, as I still have my machine... Did
> anyone else see that?

my usual suspect: use ec_intr=0. I have seen this rather often on HP machines.
I attributed it to "communication problems with embedded controller" and
ec_intr=0 seemed to help somehow. But then, this was some kernel versions
ago and i did not encounter it recently.
-- 
Stefan Seyfried                  \ "I didn't want to write for pay. I
QA / R&D Team Mobile Devices      \ wanted to be paid for what I write."
SUSE LINUX Products GmbH, Nürnberg \                    -- Leonard Cohen
-
To unsubscribe from this list: send the line "unsubscribe linux-acpi" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: x60 - spontaneous thermal shutdown
@ 2006-09-11  9:46   ` Stefan Seyfried
  0 siblings, 0 replies; 10+ messages in thread
From: Stefan Seyfried @ 2006-09-11  9:46 UTC (permalink / raw)
  To: Pavel Machek; +Cc: ACPI mailing list, kernel list

On Mon, Sep 04, 2006 at 11:40:59PM +0200, Pavel Machek wrote:
> Hi!
> 
> x60 shut down after quite a while of uptime, in period of quite heavy
> load:
> 
> Sep  4 23:33:01 amd kernel: ACPI: Critical trip point
> Sep  4 23:33:01 amd kernel: Critical temperature reached (128 C), shutting down.
> Sep  4 23:33:01 amd shutdown[32585]: shutting down for system halt
> Sep  4 23:34:42 amd init: Switching to runlevel: 0
> 
> I do not think cpu reached 128C, as I still have my machine... Did
> anyone else see that?

my usual suspect: use ec_intr=0. I have seen this rather often on HP machines.
I attributed it to "communication problems with embedded controller" and
ec_intr=0 seemed to help somehow. But then, this was some kernel versions
ago and i did not encounter it recently.
-- 
Stefan Seyfried                  \ "I didn't want to write for pay. I
QA / R&D Team Mobile Devices      \ wanted to be paid for what I write."
SUSE LINUX Products GmbH, Nürnberg \                    -- Leonard Cohen

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: x60 - spontaneous thermal shutdown
  2006-09-11  9:46   ` Stefan Seyfried
  (?)
@ 2006-09-11 14:10   ` Rafael J. Wysocki
  2006-09-11 15:13       ` Stefan Seyfried
  -1 siblings, 1 reply; 10+ messages in thread
From: Rafael J. Wysocki @ 2006-09-11 14:10 UTC (permalink / raw)
  To: Stefan Seyfried; +Cc: Pavel Machek, ACPI mailing list, kernel list

On Monday, 11 September 2006 11:46, Stefan Seyfried wrote:
> On Mon, Sep 04, 2006 at 11:40:59PM +0200, Pavel Machek wrote:
> > Hi!
> > 
> > x60 shut down after quite a while of uptime, in period of quite heavy
> > load:
> > 
> > Sep  4 23:33:01 amd kernel: ACPI: Critical trip point
> > Sep  4 23:33:01 amd kernel: Critical temperature reached (128 C), shutting down.
> > Sep  4 23:33:01 amd shutdown[32585]: shutting down for system halt
> > Sep  4 23:34:42 amd init: Switching to runlevel: 0
> > 
> > I do not think cpu reached 128C, as I still have my machine... Did
> > anyone else see that?
> 
> my usual suspect: use ec_intr=0.

Is this a kernel command line parameter?

I'm having some suspend/resume related problems on HPC 6325 now, and they
seem to be related to the embedded controller.

Greetings,
Rafael


-- 
You never change things by fighting the existing reality.
		R. Buckminster Fuller

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: x60 - spontaneous thermal shutdown
  2006-09-11 14:10   ` Rafael J. Wysocki
@ 2006-09-11 15:13       ` Stefan Seyfried
  0 siblings, 0 replies; 10+ messages in thread
From: Stefan Seyfried @ 2006-09-11 15:13 UTC (permalink / raw)
  To: Rafael J. Wysocki; +Cc: Pavel Machek, ACPI mailing list, kernel list

On Mon, Sep 11, 2006 at 04:10:36PM +0200, Rafael J. Wysocki wrote:
> On Monday, 11 September 2006 11:46, Stefan Seyfried wrote:
> > On Mon, Sep 04, 2006 at 11:40:59PM +0200, Pavel Machek wrote:
> > > Hi!
> > > 
> > > x60 shut down after quite a while of uptime, in period of quite heavy
> > > load:
> > > 
> > > Sep  4 23:33:01 amd kernel: ACPI: Critical trip point
> > > Sep  4 23:33:01 amd kernel: Critical temperature reached (128 C), shutting down.
> > > Sep  4 23:33:01 amd shutdown[32585]: shutting down for system halt
> > > Sep  4 23:34:42 amd init: Switching to runlevel: 0
> > > 
> > > I do not think cpu reached 128C, as I still have my machine... Did
> > > anyone else see that?
> > 
> > my usual suspect: use ec_intr=0.
> 
> Is this a kernel command line parameter?

yes.

seife@susi:~> dmesg | grep "^ACPI: EC"
ACPI: EC polling mode.
seife@susi:~> cat /proc/cmdline
root=/dev/hda5 vga=0x317 sysrq=yes resume=/dev/hda1  splash=silent showopts ec_intr=0

with ec_intr=1 (default), you'll get "ACPI: EC interrupt mode."

> I'm having some suspend/resume related problems on HPC 6325 now, and they
> seem to be related to the embedded controller.

Well, polling mode is always on my "things to try"-list for those unspecified
ACPI failures :-)
-- 
Stefan Seyfried
QA / R&D Team Mobile Devices        |              "Any ideas, John?"
SUSE LINUX Products GmbH, Nürnberg  | "Well, surrounding them's out." 
-
To unsubscribe from this list: send the line "unsubscribe linux-acpi" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: x60 - spontaneous thermal shutdown
@ 2006-09-11 15:13       ` Stefan Seyfried
  0 siblings, 0 replies; 10+ messages in thread
From: Stefan Seyfried @ 2006-09-11 15:13 UTC (permalink / raw)
  To: Rafael J. Wysocki; +Cc: Pavel Machek, ACPI mailing list, kernel list

On Mon, Sep 11, 2006 at 04:10:36PM +0200, Rafael J. Wysocki wrote:
> On Monday, 11 September 2006 11:46, Stefan Seyfried wrote:
> > On Mon, Sep 04, 2006 at 11:40:59PM +0200, Pavel Machek wrote:
> > > Hi!
> > > 
> > > x60 shut down after quite a while of uptime, in period of quite heavy
> > > load:
> > > 
> > > Sep  4 23:33:01 amd kernel: ACPI: Critical trip point
> > > Sep  4 23:33:01 amd kernel: Critical temperature reached (128 C), shutting down.
> > > Sep  4 23:33:01 amd shutdown[32585]: shutting down for system halt
> > > Sep  4 23:34:42 amd init: Switching to runlevel: 0
> > > 
> > > I do not think cpu reached 128C, as I still have my machine... Did
> > > anyone else see that?
> > 
> > my usual suspect: use ec_intr=0.
> 
> Is this a kernel command line parameter?

yes.

seife@susi:~> dmesg | grep "^ACPI: EC"
ACPI: EC polling mode.
seife@susi:~> cat /proc/cmdline
root=/dev/hda5 vga=0x317 sysrq=yes resume=/dev/hda1  splash=silent showopts ec_intr=0

with ec_intr=1 (default), you'll get "ACPI: EC interrupt mode."

> I'm having some suspend/resume related problems on HPC 6325 now, and they
> seem to be related to the embedded controller.

Well, polling mode is always on my "things to try"-list for those unspecified
ACPI failures :-)
-- 
Stefan Seyfried
QA / R&D Team Mobile Devices        |              "Any ideas, John?"
SUSE LINUX Products GmbH, Nürnberg  | "Well, surrounding them's out." 

^ permalink raw reply	[flat|nested] 10+ messages in thread

end of thread, other threads:[~2006-09-11 15:13 UTC | newest]

Thread overview: 10+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2006-09-04 21:40 x60 - spontaneous thermal shutdown Pavel Machek
2006-09-04 22:26 ` Andreas Mohr
2006-09-04 22:35   ` Pavel Machek
2006-09-04 22:53     ` Andreas Mohr
2006-09-05  6:10     ` Jan Engelhardt
2006-09-11  9:46 ` Stefan Seyfried
2006-09-11  9:46   ` Stefan Seyfried
2006-09-11 14:10   ` Rafael J. Wysocki
2006-09-11 15:13     ` Stefan Seyfried
2006-09-11 15:13       ` Stefan Seyfried

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.