From mboxrd@z Thu Jan 1 00:00:00 1970
From: "Rafael J. Wysocki"
Subject: Re: linux-next: Tree for June 13: IO APIC breakage on HP nx6325
Date: Wed, 18 Jun 2008 16:37:21 +0200
Message-ID: <200806181637.22241.rjw@sisk.pl>
References: <20080613232214.394fd6fd.sfr@canb.auug.org.au>
Mime-Version: 1.0
Content-Type: text/plain; charset="iso-8859-1"
Content-Transfer-Encoding: 7bit
Return-path:
Received: from ogre.sisk.pl ([217.79.144.158]:44063 "EHLO ogre.sisk.pl" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752421AbYFROgN (ORCPT ); Wed, 18 Jun 2008 10:36:13 -0400
In-Reply-To:
Content-Disposition: inline
Sender: linux-next-owner@vger.kernel.org
List-ID:
To: Thomas Gleixner
Cc: "Maciej W. Rozycki", Stephen Rothwell, linux-next@vger.kernel.org, LKML, Ingo Molnar, ACPI Devel Maling List, Len Brown

On Wednesday, 18 of June 2008, Thomas Gleixner wrote:
> On Wed, 18 Jun 2008, Thomas Gleixner wrote:
> > On Wed, 18 Jun 2008, Rafael J. Wysocki wrote:
> > > On Wednesday, 18 of June 2008, Rafael J. Wysocki wrote:
> > > > On Tuesday, 17 of June 2008, Rafael J. Wysocki wrote:
> > > > > On Tuesday, 17 of June 2008, Thomas Gleixner wrote:
> > > > > > On Tue, 17 Jun 2008, Rafael J. Wysocki wrote:
> > > > > > >
> > > > > > > BTW, with the C1E patches reverted I don't get the
> > > > > > > WARNING: at /home/rafael/src/linux-next/kernel/smp.c:215 smp_call_function_single+0x3d/0xa2
> > > > > > > in the log. Thomas?
> > > > > >
> > > > > > Yeah, my bad. Fix below.
> > > > >
> > > > > Thanks, it eliminates the WARNING, but still the box doesn't work with
> > > > > the "x86: add C1E aware idle function" patch applied, even with 'highres=off'.
> > > > >
> > > > > The main symptom is that CPU loads are computed incorrectly (I got X using 126%
> > > > > of CPU time from 'top', for example). Apart from this, some processes (like
> > > > > gkrellm) seem to be 'frozen' and only change their state in 'jumps', as though
> > > > > they only got CPU from time to time at random.
> > > > >
> > > > > Reverting the above-mentioned patch fixes those problems.
> > > >
> > > > Ah. If your fix is replaced with the appended one, the system happily works
> > > > with C1E and highres.
> > >
> > > Scratch that. The symptoms appeared later this time, that's all. I've just got
> > > b43 consuming 90+ % of the CPU time. :-(
> >
> > I would have been pretty surprised if it had helped :)
> >
> > Does the box boot when you disable the local apic timer on the kernel
> > command line with the patch applied ?
> >
> > Also does forcing hpet change anything ?
>
> I just checked that the original c1e series and the affected code in
> tip are not different. IIRC you confirmed that the C1E patches would
> work on your box. So I wonder what else got changed which causes these
> problems.

Well, probably I didn't test that long enough. The symptoms do not always
appear immediately, they sometimes appear only after several minutes.

Thanks,
Rafael