All of lore.kernel.org
 help / color / mirror / Atom feed
* Regression between 2.6.35 and 2.6.38 (freeze at resume)
@ 2011-09-17  7:35 Éric Brunet
  2011-09-20 20:06 ` Éric Brunet
  0 siblings, 1 reply; 6+ messages in thread
From: Éric Brunet @ 2011-09-17  7:35 UTC (permalink / raw)
  To: linux-kernel

Hello,

Since I upgraded my Dell E4200 laptop to Fedora 15, I have had some problem 
with suspend/resume: occasionaly, the computer would freeze on resume. 
(I suspend by pressing fn-F1. My power manager is the kde plasmoid. I am not 
completely sure which program(s) get(s) launched with what options when I 
press fn-F1...)

Most of the times when it happens, it freezes before leaving anything 
interesting in the logs, and, on some rare occasion, there is a series of 
WARNING: and a BUG: in the log. 

The computer was working fine with Fedora 14, and is working fine while 
running Fedora 15 with the latest Fedora 14 kernel (2.6.35.6), so the problem 
is kernel related. I have seen it with all the Fedora 15 kernels that I have 
installed from the first one (2.6.38.6) to the latest one (2.6.40.4). I have 
also seen it with a vanilla 3.0.4 kernel that I compiled, so it is not a 
problem specific to fedora.

The bug does not occur every time. The frequency of occurence depends on the 
kernel version (nearly always on 2.6.38 and 3.04, every 5 or 10 times on 
2.6.38).

The full relevant logs are on 
https://bugzilla.redhat.com/show_bug.cgi?id=735404

but in short, the first warning is

WARNING: at lib/list_debug.c:47 __list_del_entry+0x8d/0x98()
Hardware name: Latitude E4200                  
list_del corruption, ffff88008b410bb0->next is LIST_POISON1 (dead000000100100)
Modules linked in: ppdev parport_pc lp parport cpufreq_ondemand acpi_cpufreq 
freq_table mperf ip6t_REJECT nf_conntrack_ipv6 nf_defrag_ipv6 ip6table_filter 
ip6_tables nf_conntrack_ipv4 nf_defrag_ipv4 xt_state nf_conntrack arc4 
dell_wmi sparse_keymap snd_hda_codec_hdmi snd_hda_codec_idt dell_laptop 
microcode dcdbas snd_hda_intel snd_hda_codec snd_hwdep snd_seq snd_seq_device 
iwlagn uvcvideo i2c_i801 snd_pcm videodev iTCO_wdt joydev iTCO_vendor_support 
media v4l2_compat_ioctl32 mac80211 e1000e cfg80211 snd_timer snd rfkill 
soundcore snd_page_alloc ipv6 firewire_ohci sdhci_pci sdhci mmc_core 
firewire_core crc_itu_t wmi i915 drm_kms_helper drm i2c_algo_bit i2c_core 
video [last unloaded: scsi_wait_scan]
Pid: 8798, comm: pm-suspend Not tainted 2.6.40.3-0.fc15.x86_64 #1
Call Trace:
 [<ffffffff81054c8e>] warn_slowpath_common+0x83/0x9b
 [<ffffffff81054d49>] warn_slowpath_fmt+0x46/0x48
 [<ffffffff812457bd>] __list_del_entry+0x8d/0x98
 [<ffffffff812457d6>] list_del+0xe/0x2d
 [<ffffffff813b0b3d>] led_trigger_unregister+0x29/0x9c
 [<ffffffff813b0bc9>] led_trigger_unregister_simple+0x19/0x26
 [<ffffffff8138c69e>] power_supply_remove_triggers+0x21/0x8f
 [<ffffffff8138bafe>] power_supply_unregister+0x1f/0x2c
 [<ffffffff812af9b2>] sysfs_remove_battery+0x2f/0x3e
 [<ffffffff812b0461>] battery_notify+0x21/0x2f
 [<ffffffff8148ad8b>] notifier_call_chain+0x37/0x63
 [<ffffffff810747a3>] __blocking_notifier_call_chain+0x4b/0x60
 [<ffffffff810747cc>] blocking_notifier_call_chain+0x14/0x16
 [<ffffffff81089023>] pm_notifier_call_chain+0x1a/0x33
 [<ffffffff810898f1>] enter_state+0x10a/0x137
 [<ffffffff81088f42>] state_store+0xaf/0xc5
 [<ffffffff81237bd3>] kobj_attr_store+0x17/0x19
 [<ffffffff8117fe80>] sysfs_write_file+0x111/0x14d
 [<ffffffff811271ad>] vfs_write+0xac/0xf3
 [<ffffffff8112739c>] sys_write+0x4a/0x6e
 [<ffffffff8148e182>] system_call_fastpath+0x16/0x1b

and then all hell breaks loose. 

The warning seems related to battery, and I just want to mention a point which 
is or is not relevant: when I wake up from the working kernel (2.6.35), the 
kde battery plasmoid shows the correct power level instantaneously. When I 
wake up from the non working kernel (2.6.40), the battery plasmoid first 
displays an empty battery for about one second before displaying the correct 
power level.

Does that ring a bell to someone ? What could I do to help debug this ?

Thanks

	Eric Brunet

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: Regression between 2.6.35 and 2.6.38 (freeze at resume)
  2011-09-17  7:35 Regression between 2.6.35 and 2.6.38 (freeze at resume) Éric Brunet
@ 2011-09-20 20:06 ` Éric Brunet
  2011-09-21 12:29   ` Takashi Iwai
  0 siblings, 1 reply; 6+ messages in thread
From: Éric Brunet @ 2011-09-20 20:06 UTC (permalink / raw)
  To: linux-kernel

I reported a bug three days ago about my computer occasionaly not waking up 
(see below) and I have since compiled several kernels to see when the bug 
arrived.

Till now, I have seen the bug trigger on a vanilla 2.6.39-rc3. (Only once, in 
about ten suspend/resume) and I haven't seen it trigger in 2.6.38 nor 2.6.39-
rc1. I am now running 2.6.39-rc1 to see if the bug will eventualy trigger or 
not.

The WARNING with call trace on the fedora kernel that I reported in my 
previous message (see below) also happened in the vanilla kernel in exactly 
the same way. After that initial WARNING, I had four more WARNINGS and one BUG 
in rapid succession, and one final BUG 14 seconds later.

I'd like to know what I can do to help nail this annoying bug. I am not sure 
if I can git bisect it; it is hard to tell offhand if a kernel is good; it may 
resume correctly many times before crashing.

Thanks,

	Éric Brunet

Le samedi 17 septembre 2011 09:35:17, Éric Brunet a écrit :
> Hello,
> 
> Since I upgraded my Dell E4200 laptop to Fedora 15, I have had some problem
> with suspend/resume: occasionaly, the computer would freeze on resume.
> (I suspend by pressing fn-F1. My power manager is the kde plasmoid. I am
> not completely sure which program(s) get(s) launched with what options
> when I press fn-F1...)
> 
> Most of the times when it happens, it freezes before leaving anything
> interesting in the logs, and, on some rare occasion, there is a series of
> WARNING: and a BUG: in the log.
> 
> The computer was working fine with Fedora 14, and is working fine while
> running Fedora 15 with the latest Fedora 14 kernel (2.6.35.6), so the
> problem is kernel related. I have seen it with all the Fedora 15 kernels
> that I have installed from the first one (2.6.38.6) to the latest one
> (2.6.40.4). I have also seen it with a vanilla 3.0.4 kernel that I
> compiled, so it is not a problem specific to fedora.
> 
> The bug does not occur every time. The frequency of occurence depends on
> the kernel version (nearly always on 2.6.38 and 3.04, every 5 or 10 times
> on 2.6.38).
> 
> The full relevant logs are on
> https://bugzilla.redhat.com/show_bug.cgi?id=735404
> 
> but in short, the first warning is
> 
> WARNING: at lib/list_debug.c:47 __list_del_entry+0x8d/0x98()
> Hardware name: Latitude E4200
> list_del corruption, ffff88008b410bb0->next is LIST_POISON1
> (dead000000100100) Modules linked in: ppdev parport_pc lp parport
> cpufreq_ondemand acpi_cpufreq freq_table mperf ip6t_REJECT
> nf_conntrack_ipv6 nf_defrag_ipv6 ip6table_filter ip6_tables
> nf_conntrack_ipv4 nf_defrag_ipv4 xt_state nf_conntrack arc4 dell_wmi
> sparse_keymap snd_hda_codec_hdmi snd_hda_codec_idt dell_laptop microcode
> dcdbas snd_hda_intel snd_hda_codec snd_hwdep snd_seq snd_seq_device iwlagn
> uvcvideo i2c_i801 snd_pcm videodev iTCO_wdt joydev iTCO_vendor_support
> media v4l2_compat_ioctl32 mac80211 e1000e cfg80211 snd_timer snd rfkill
> soundcore snd_page_alloc ipv6 firewire_ohci sdhci_pci sdhci mmc_core
> firewire_core crc_itu_t wmi i915 drm_kms_helper drm i2c_algo_bit i2c_core
> video [last unloaded: scsi_wait_scan]
> Pid: 8798, comm: pm-suspend Not tainted 2.6.40.3-0.fc15.x86_64 #1
> Call Trace:
>  [<ffffffff81054c8e>] warn_slowpath_common+0x83/0x9b
>  [<ffffffff81054d49>] warn_slowpath_fmt+0x46/0x48
>  [<ffffffff812457bd>] __list_del_entry+0x8d/0x98
>  [<ffffffff812457d6>] list_del+0xe/0x2d
>  [<ffffffff813b0b3d>] led_trigger_unregister+0x29/0x9c
>  [<ffffffff813b0bc9>] led_trigger_unregister_simple+0x19/0x26
>  [<ffffffff8138c69e>] power_supply_remove_triggers+0x21/0x8f
>  [<ffffffff8138bafe>] power_supply_unregister+0x1f/0x2c
>  [<ffffffff812af9b2>] sysfs_remove_battery+0x2f/0x3e
>  [<ffffffff812b0461>] battery_notify+0x21/0x2f
>  [<ffffffff8148ad8b>] notifier_call_chain+0x37/0x63
>  [<ffffffff810747a3>] __blocking_notifier_call_chain+0x4b/0x60
>  [<ffffffff810747cc>] blocking_notifier_call_chain+0x14/0x16
>  [<ffffffff81089023>] pm_notifier_call_chain+0x1a/0x33
>  [<ffffffff810898f1>] enter_state+0x10a/0x137
>  [<ffffffff81088f42>] state_store+0xaf/0xc5
>  [<ffffffff81237bd3>] kobj_attr_store+0x17/0x19
>  [<ffffffff8117fe80>] sysfs_write_file+0x111/0x14d
>  [<ffffffff811271ad>] vfs_write+0xac/0xf3
>  [<ffffffff8112739c>] sys_write+0x4a/0x6e
>  [<ffffffff8148e182>] system_call_fastpath+0x16/0x1b
> 
> and then all hell breaks loose.
> 
> The warning seems related to battery, and I just want to mention a point
> which is or is not relevant: when I wake up from the working kernel
> (2.6.35), the kde battery plasmoid shows the correct power level
> instantaneously. When I wake up from the non working kernel (2.6.40), the
> battery plasmoid first displays an empty battery for about one second
> before displaying the correct power level.
> 
> Does that ring a bell to someone ? What could I do to help debug this ?
> 
> Thanks
> 
> 	Eric Brunet

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: Regression between 2.6.35 and 2.6.38 (freeze at resume)
  2011-09-20 20:06 ` Éric Brunet
@ 2011-09-21 12:29   ` Takashi Iwai
       [not found]     ` <20110921200939.5898164EE5@quatramaran.ens.fr>
  0 siblings, 1 reply; 6+ messages in thread
From: Takashi Iwai @ 2011-09-21 12:29 UTC (permalink / raw)
  To: Éric Brunet; +Cc: linux-kernel

At Tue, 20 Sep 2011 22:06:35 +0200,
Éric Brunet wrote:
> 
> I reported a bug three days ago about my computer occasionaly not waking up 
> (see below) and I have since compiled several kernels to see when the bug 
> arrived.
> 
> Till now, I have seen the bug trigger on a vanilla 2.6.39-rc3. (Only once, in 
> about ten suspend/resume) and I haven't seen it trigger in 2.6.38 nor 2.6.39-
> rc1. I am now running 2.6.39-rc1 to see if the bug will eventualy trigger or 
> not.
> 
> The WARNING with call trace on the fedora kernel that I reported in my 
> previous message (see below) also happened in the vanilla kernel in exactly 
> the same way. After that initial WARNING, I had four more WARNINGS and one BUG 
> in rapid succession, and one final BUG 14 seconds later.
> 
> I'd like to know what I can do to help nail this annoying bug. I am not sure 
> if I can git bisect it; it is hard to tell offhand if a kernel is good; it may 
> resume correctly many times before crashing.

Did you try the recent 3.1-rc?  There have been a few commits relevant
with battery module and PM after 3.0, and I guess this will be the fix
for your case, too.


Takashi

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: Regression between 2.6.35 and 2.6.38 (freeze at resume)
       [not found]     ` <20110921200939.5898164EE5@quatramaran.ens.fr>
@ 2011-09-22  5:32       ` Takashi Iwai
  2011-09-23  9:37         ` Éric Brunet
  0 siblings, 1 reply; 6+ messages in thread
From: Takashi Iwai @ 2011-09-22  5:32 UTC (permalink / raw)
  To: Ãric Brunet; +Cc: linux-kernel

At Wed, 21 Sep 2011 22:09:39 +0200,
Ãric Brunet wrote:
> 
> >> I'd like to know what I can do to help nail this annoying bug. I am
> >> not sure if I can git bisect it; it is hard to tell offhand if a
> >> kernel is good; it may resume correctly many times before crashing.
> >
> > Did you try the recent 3.1-rc?  There have been a few commits relevant
> > with battery module and PM after 3.0, and I guess this will be the fix
> > for your case, too.
> 
> I did try 3.0.4 and it crashed the same way.

As mentioned, the series of patches were added first in 3.1-rc.
3.0.x have no these commits.


Takashi

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: Regression between 2.6.35 and 2.6.38 (freeze at resume)
  2011-09-22  5:32       ` Takashi Iwai
@ 2011-09-23  9:37         ` Éric Brunet
  0 siblings, 0 replies; 6+ messages in thread
From: Éric Brunet @ 2011-09-23  9:37 UTC (permalink / raw)
  To: Takashi Iwai; +Cc: linux-kernel


Dans son message du jeudi 22/09/11 à  7:32, Takashi Iwai a écrit:
> 
> As mentioned, the series of patches were added first in 3.1-rc.
> 3.0.x have no these commits.
> 
3.1-rc6 crashed on me this morning at resume time. Unfortunately, nothing
appeared in the logs so I cannot be 100% sure it is the same bug as has
been hitting me since circa 2.6.38, but it looks the same.

What should I do now ? Run 3.1-rc7 and hope for the best ? Go on with
3.1-rc6 and hope that I'll get a crash with information in the logs so as
to make sure it is indeed the same bug ? Or go back to 2.6.38 to try to
pinpoint the first problematic kernel and (hopefully) the change that
introduced the problem ?

Thanks,

	Éric Brunet

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: Regression between 2.6.35 and 2.6.38 (freeze at resume)
@ 2011-09-21 20:49 Éric Brunet
  0 siblings, 0 replies; 6+ messages in thread
From: Éric Brunet @ 2011-09-21 20:49 UTC (permalink / raw)
  To: linux-kernel

In-Reply-To: <s5hy5xigjts.wl%tiwai@suse.de>
References: <201109170935.17777.Eric.Brunet@lps.ens.fr> <201109202206.35705.Eric.Brunet@lps.ens.fr> <s5hy5xigjts.wl%tiwai@suse.de>

>> I'd like to know what I can do to help nail this annoying bug. I am
>> not sure if I can git bisect it; it is hard to tell offhand if a
>> kernel is good; it may resume correctly many times before crashing.
>
> Did you try the recent 3.1-rc?  There have been a few commits relevant
> with battery module and PM after 3.0, and I guess this will be the fix
> for your case, too.

I did try 3.0.4 and it crashed the same way. I'll try 3.1-rc as you
suggest.

The point is that it worked very well in F14 with 2.6.35 kernels. I was
trying to find out in which kernel exactly the problems began. I couldn't
crash 2.6.36, 2.6.37 and 2.6.38 but, today, 2.6.39-rc1 crashed on me. I
suspect (because there is a small change of behaviour) that the problem
appears in 2.6.38 even though I haven't crashed it yet. Were there some
changes in battery and PM around 2.6.38 ?

Anyway, I start now to compile 3.1.

Thanks,

        Éric Brunet

^ permalink raw reply	[flat|nested] 6+ messages in thread

end of thread, other threads:[~2011-09-23  9:37 UTC | newest]

Thread overview: 6+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2011-09-17  7:35 Regression between 2.6.35 and 2.6.38 (freeze at resume) Éric Brunet
2011-09-20 20:06 ` Éric Brunet
2011-09-21 12:29   ` Takashi Iwai
     [not found]     ` <20110921200939.5898164EE5@quatramaran.ens.fr>
2011-09-22  5:32       ` Takashi Iwai
2011-09-23  9:37         ` Éric Brunet
2011-09-21 20:49 Éric Brunet

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.