From: "Rafael J. Wysocki" <rjw@sisk.pl>
To: Tejun Heo <tj@kernel.org>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>,
	Linux Kernel Mailing List <linux-kernel@vger.kernel.org>,
	Kernel Testers List <kernel-testers@vger.kernel.org>,
	Maciej Rutecki <maciej.rutecki@gmail.com>,
	Florian Mickler <florian@mickler.org>,
	Ozan Caglayan <ozan@pardus.org.tr>
Subject: Re: [Bug #20232] kworker consumes ~100% CPU on HP Elitebook 8540w running 2.6.36_rc6-git4
Date: Mon, 20 Dec 2010 21:56:00 +0100
Message-ID: <201012202156.00621.rjw@sisk.pl>
In-Reply-To: <4D0F8890.2070307@kernel.org>

On Monday, December 20, 2010, Tejun Heo wrote:
> Hello,
> 
> On 12/20/2010 11:35 AM, Peter Zijlstra wrote:
> > On Sun, 2010-12-19 at 13:50 +0100, Rafael J. Wysocki wrote:
> >> This message has been generated automatically as a part of a report
> >> of regressions introduced between 2.6.35 and 2.6.36.
> >>
> >> The following bug entry is on the current list of known regressions
> >> introduced between 2.6.35 and 2.6.36.  Please verify if it still should
> >> be listed and let the tracking team know (either way).
> >>
> >>
> >> Bug-Entry	: http://bugzilla.kernel.org/show_bug.cgi?id=20232
> >> Subject		: kworker consumes ~100% CPU on HP Elitebook 8540w running 2.6.36_rc6-git4
> >> Submitter	: Ozan Caglayan <ozan@pardus.org.tr>
> >> Date		: 2010-10-13 06:13 (68 days old)
> > 
> > I'd be thinking that kworker going wonky is something for Tejun to have
> > a look at.. Anyway, is it still relevant for current kernels?
> 
> It looks like the work is scheduled in a loop, so the kworker acting out
> seems to be a symptom of the problem, not the cause.  Looks like
> Rafael already has a proper fix in mind, so...

Rather, something that _might_ work.

I'm quite confident that this is a BIOS issue.  Apparently, the BIOS tells us
we can control PCI Express hotplug, but then it tries to do that itself via
ACPI at the same time and that leads to a GPE storm.  We may try to poke the
BIOS a bit differently than we do right now, but whether or not it helps
remains to be seen.

Also, we can try to handle both ACPI-based and native PCIe hotplug
simultaneously at the same port, but that's going to be tricky.

We still can use DMI-based blacklisting as the last resort.

Thanks,
Rafael
