From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S933215Ab0LTU46 (ORCPT ); Mon, 20 Dec 2010 15:56:58 -0500 Received: from ogre.sisk.pl ([217.79.144.158]:41777 "EHLO ogre.sisk.pl" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S933048Ab0LTU45 (ORCPT ); Mon, 20 Dec 2010 15:56:57 -0500 From: "Rafael J. Wysocki" To: Tejun Heo Subject: Re: [Bug #20232] kworker consumes ~100% CPU on HP Elitebook 8540w running 2.6.36_rc6-git4 Date: Mon, 20 Dec 2010 21:56:00 +0100 User-Agent: KMail/1.13.5 (Linux/2.6.37-rc6+; KDE/4.4.4; x86_64; ; ) Cc: Peter Zijlstra , Linux Kernel Mailing List , Kernel Testers List , Maciej Rutecki , Florian Mickler , Ozan Caglayan References: <1292841313.2708.137.camel@laptop> <4D0F8890.2070307@kernel.org> In-Reply-To: <4D0F8890.2070307@kernel.org> MIME-Version: 1.0 Message-Id: <201012202156.00621.rjw@sisk.pl> Content-Type: Text/Plain; charset="utf-8" Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Monday, December 20, 2010, Tejun Heo wrote: > Hello, > > On 12/20/2010 11:35 AM, Peter Zijlstra wrote: > > On Sun, 2010-12-19 at 13:50 +0100, Rafael J. Wysocki wrote: > >> This message has been generated automatically as a part of a report > >> of regressions introduced between 2.6.35 and 2.6.36. > >> > >> The following bug entry is on the current list of known regressions > >> introduced between 2.6.35 and 2.6.36. Please verify if it still should > >> be listed and let the tracking team know (either way). > >> > >> > >> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=20232 > >> Subject : kworker consumes ~100% CPU on HP Elitebook 8540w running 2.6.36_rc6-git4 > >> Submitter : Ozan Caglayan > >> Date : 2010-10-13 06:13 (68 days old) > > > > I'd be thinking that kworker going wonky is something for Tejun to have > > a look at.. Anyway, is it still relevant for current kernels? > > It looks like the work is scheduled in loop, so the kworker acting out > seems to be the symptom of the problem not the cause. Looks like > Rafael already has a proper fix on mind, so... Rather, something that _might_ work. I'm quite confident that this is a BIOS issue. Apparently, the BIOS tells us we can control PCI Express hotplug, but then it tries to do that itself via ACPI at the same time and that leads to a GPE storm. We may try to poke the BIOS a bit differently than we do right now, but whether or not it helps is to be seen. Also, we can try to handle both ACPI-based and native PCIe hotplug simultaneously at the same port, but that's going to be tricky. We still can use DMI-based blacklisting as the last resort. Thanks, Rafael From mboxrd@z Thu Jan 1 00:00:00 1970 From: "Rafael J. Wysocki" Subject: Re: [Bug #20232] kworker consumes ~100% CPU on HP Elitebook 8540w running 2.6.36_rc6-git4 Date: Mon, 20 Dec 2010 21:56:00 +0100 Message-ID: <201012202156.00621.rjw@sisk.pl> References: <1292841313.2708.137.camel@laptop> <4D0F8890.2070307@kernel.org> Mime-Version: 1.0 Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: <4D0F8890.2070307-DgEjT+Ai2ygdnm+yROfE0A@public.gmane.org> Sender: kernel-testers-owner-u79uwXL29TY76Z2rM5mHXA@public.gmane.org List-ID: Content-Type: Text/Plain; charset="us-ascii" To: Tejun Heo Cc: Peter Zijlstra , Linux Kernel Mailing List , Kernel Testers List , Maciej Rutecki , Florian Mickler , Ozan Caglayan On Monday, December 20, 2010, Tejun Heo wrote: > Hello, > > On 12/20/2010 11:35 AM, Peter Zijlstra wrote: > > On Sun, 2010-12-19 at 13:50 +0100, Rafael J. Wysocki wrote: > >> This message has been generated automatically as a part of a report > >> of regressions introduced between 2.6.35 and 2.6.36. > >> > >> The following bug entry is on the current list of known regressions > >> introduced between 2.6.35 and 2.6.36. Please verify if it still should > >> be listed and let the tracking team know (either way). > >> > >> > >> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=20232 > >> Subject : kworker consumes ~100% CPU on HP Elitebook 8540w running 2.6.36_rc6-git4 > >> Submitter : Ozan Caglayan > >> Date : 2010-10-13 06:13 (68 days old) > > > > I'd be thinking that kworker going wonky is something for Tejun to have > > a look at.. Anyway, is it still relevant for current kernels? > > It looks like the work is scheduled in loop, so the kworker acting out > seems to be the symptom of the problem not the cause. Looks like > Rafael already has a proper fix on mind, so... Rather, something that _might_ work. I'm quite confident that this is a BIOS issue. Apparently, the BIOS tells us we can control PCI Express hotplug, but then it tries to do that itself via ACPI at the same time and that leads to a GPE storm. We may try to poke the BIOS a bit differently than we do right now, but whether or not it helps is to be seen. Also, we can try to handle both ACPI-based and native PCIe hotplug simultaneously at the same port, but that's going to be tricky. We still can use DMI-based blacklisting as the last resort. Thanks, Rafael