From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752561Ab3JYJIR (ORCPT ); Fri, 25 Oct 2013 05:08:17 -0400 Received: from e36.co.us.ibm.com ([32.97.110.154]:33410 "EHLO e36.co.us.ibm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751970Ab3JYJIP (ORCPT ); Fri, 25 Oct 2013 05:08:15 -0400 Date: Fri, 25 Oct 2013 02:08:09 -0700 From: "Paul E. McKenney" To: Linus Torvalds Cc: Knut Petersen , Ingo Molnar , Thomas Gleixner , =?iso-8859-1?Q?Fr=E9d=E9ric?= Weisbecker , "Rafael J. Wysocki" , Viresh Kumar , Greg Kroah-Hartman , Greg KH , linux-kernel , cpufreq@vger.kernel.org, rusty@rustcorp.com.au Subject: Re: [BUG 3.12.rc4] Oops: unable to handle kernel paging request during shutdown Message-ID: <20131025090809.GZ4118@linux.vnet.ibm.com> Reply-To: paulmck@linux.vnet.ibm.com References: <525BD08C.2080101@t-online.de> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.5.21 (2010-09-15) X-TM-AS-MML: No X-Content-Scanned: Fidelis XPS MAILER x-cbid: 13102509-3532-0000-0000-0000027562A1 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, Oct 25, 2013 at 10:02:22AM +0100, Linus Torvalds wrote: > Adding more people, so quoting the whole email for them. > > We definitely have some module unload issues. Guys, try the following > a few times to unload modules: > > lsmod | grep ' 0 '| cut -d' ' -f1 | xargs sudo rmmod > > (a few times because unloading one module will then potentially make > other modules unloadable). Isn't it still the user's responsibility to ensure that a module will no longer being used before rmmod-ing it? Or were all those race conditions fixed? Thanx, Paul > On my machine, I can trigger this, for example: > > ------------[ cut here ]------------ > WARNING: CPU: 0 PID: 3217 at fs/sysfs/file.c:498 sysfs_attr_ns+0x91/0xa0() > sysfs: kobject (null) without dirent > Modules linked in: fuse nf_conntrack_broadcast ipt_MASQUERADE ip6t_REJECT xt_$ > CPU: 0 PID: 3217 Comm: rmmod Not tainted 3.12.0-rc6-00284-ge6036c0b8896 #19 > Hardware name: Sony Corporation SVP11213CXB/VAIO, BIOS R0270V7 05/17/2013 > 0000000000000009 ffff8800aca35df8 ffffffff8160aab5 ffff8800aca35e40 > ffff8800aca35e30 ffffffff810514b8 ffffffffa013f080 ffff8801194a6040 > 0000000000000800 0000000000000000 0000000000c5b3e0 ffff8800aca35e90 > Call Trace: > [] dump_stack+0x45/0x56 > [] warn_slowpath_common+0x78/0xa0 > [] warn_slowpath_fmt+0x47/0x50 > [] ? module_refcount+0xb0/0xb0 > [] sysfs_attr_ns+0x91/0xa0 > [] sysfs_remove_file+0x1a/0x50 > [] cpufreq_sysfs_remove_file+0x13/0x30 > [] acpi_cpufreq_exit+0x2e/0xcde [acpi_cpufreq] > [] SyS_delete_module+0x15d/0x2c0 > [] ? do_notify_resume+0x59/0x90 > [] system_call_fastpath+0x16/0x1b > ---[ end trace f887112caaa5c4ab ]--- > > so at least we have a cpufreq/sysfs interaction bug. There may be others. > > This particular cpufreq issue may be triggered by the fact that > acpi-cpufreq isn't actually in use (pstate is). Or it might be some > generic cpufreq/sysfs bug. Rafael, Greg, ideas? > > I don't see that this particular one would be the one that causes the > timer issues, but it's an example of the fact that module unload tends > to be special and not necessarily well tested. > > Linus > > On Fri, Oct 25, 2013 at 9:38 AM, Linus Torvalds > wrote: > > > > Hmm.. I just got a run_timer_softirq oops on my own laptop, slightly > > different. That was not during shutdown, although there was a "yum > > upgrade" finishing when that happened, so it's quite likely that there > > was a service shutdown (and then restart). > > > > I think it's related. But my oops has almost no information: the IP > > that was jumped to was bogus, and the callchain is just CPU idle > > followed by the softirq -> run_timers_softirq handling, so there's no > > real way to see *what* triggered it. > > > > The bad rip was ffffffffa051e250, which is not a valid code address. > > It *might* be a module address, though. So this might be triggered by > > rmmod on some module that doesn't remove all its timers... > > > > Ideas? > > > > Linus >