From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751863Ab1IVMgF (ORCPT ); Thu, 22 Sep 2011 08:36:05 -0400 Received: from bedivere.hansenpartnership.com ([66.63.167.143]:51215 "EHLO bedivere.hansenpartnership.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751743Ab1IVMgD (ORCPT ); Thu, 22 Sep 2011 08:36:03 -0400 Subject: Re: [Bug 25832] kernel crashes when a mounted ext3/4 file system is physically removed From: James Bottomley To: Hannes Reinecke Cc: "Jun'ichi Nomura" , Ben Hutchings , jaxboe@fusionio.com, Alan Stern , Rocko Requin , tytso@mit.edu, Kernel development list , linux-scsi In-Reply-To: <4E7B296D.8020900@suse.de> References: <1316386825.14749.207.camel@deadeye> <4E7841A4.8040104@ce.jp.nec.com> <4E7B296D.8020900@suse.de> Content-Type: text/plain; charset="UTF-8" Date: Thu, 22 Sep 2011 16:35:57 +0400 Message-ID: <1316694957.10571.26.camel@dabdike> Mime-Version: 1.0 X-Mailer: Evolution 2.32.1 Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, 2011-09-22 at 14:26 +0200, Hannes Reinecke wrote: > On 09/20/2011 09:32 AM, Jun'ichi Nomura wrote: > > On 09/19/11 08:00, Ben Hutchings wrote: > [ .. ] > >> > >> There have been reports of this in Debian going back to 2.6.39: > >> > >> http://bugs.debian.org/631187 > >> http://bugs.debian.org/636263 > >> http://bugs.debian.org/642043 > >> > >> Plus possibly related crashes in elv_put_request after CD-ROM removal: > >> > >> http://bugs.debian.org/633890 > >> http://bugs.debian.org/634681 > >> http://bugs.debian.org/636103 > >> > >> The former was also reported in Ubuntu since their 2.6.38-10: > >> > >> https://bugs.launchpad.net/debian/+source/linux-2.6/+bug/793796 > >> > >> The result of the discussion there was that it appeared to be a > >> regression due to commit 86cbfb5607d4b81b1a993ff689bbd2addd5d3a9b > >> ("[SCSI] put stricter guards on queue dead checks") which was also > >> included in a stable update for 2.6.38. > >> > >> There was also a report on bugzilla.kernel.org, though no-one can see > >> quite what that says now: > >> > >> https://bugzilla.kernel.org/show_bug.cgi?id=38842 > >> > >> I also reported most of the above to James Bottomley and linux-scsi > >> nearly 2 months ago, to no response. > > > > I've reported a similar oops related to the above commit: > > [BUG] Oops when SCSI device under multipath is removed > > https://lkml.org/lkml/2011/8/10/11 > > > > Elevator being removed is the core of the problem. > > And the essential issue seems 2 different models of queue/driver relation > > implied by queue_lock. > > > > If reverting the commit is not an option, > > until somebody comes up to fix the essential issue, > > the patch below should close the regressions introduced by the commit. > > > Why do you have to do it that complicated? > Couldn't we just state that any external lock is being disconnected from > queue_lock after blk_cleanup_queue()? > > Then something like this should suffice here: > > diff --git a/block/blk-core.c b/block/blk-core.c > index 90e1ffd..a4ac005 100644 > --- a/block/blk-core.c > +++ b/block/blk-core.c > @@ -367,10 +367,8 @@ void blk_cleanup_queue(struct request_queue *q) > queue_flag_set_unlocked(QUEUE_FLAG_DEAD, q); > mutex_unlock(&q->sysfs_lock); > > - if (q->elevator) > - elevator_exit(q->elevator); > - > - blk_throtl_exit(q); > + if (q->queue_lock != q->__queue_lock) > + q->queue_lock = q->__queue_lock; > > blk_put_queue(q); > } > diff --git a/block/blk-sysfs.c b/block/blk-sysfs.c > index 0ee17b5..a5a756b 100644 > --- a/block/blk-sysfs.c > +++ b/block/blk-sysfs.c > @@ -477,6 +477,11 @@ static void blk_release_queue(struct kobject *kobj) > > blk_sync_queue(q); > > + if (q->elevator) > + elevator_exit(q->elevator); > + > + blk_throtl_exit(q); > + OK, I'll buy this one (when you fix the whitespace issue ... you have spaces instead of tabs). The fact that the lock check/replacement doesn't actually need any locking is probably worthy of a comment. James