From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751080AbVIVHEx (ORCPT ); Thu, 22 Sep 2005 03:04:53 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1751101AbVIVHEx (ORCPT ); Thu, 22 Sep 2005 03:04:53 -0400 Received: from smtp.osdl.org ([65.172.181.4]:44490 "EHLO smtp.osdl.org") by vger.kernel.org with ESMTP id S1750916AbVIVHEw (ORCPT ); Thu, 22 Sep 2005 03:04:52 -0400 Date: Thu, 22 Sep 2005 00:03:50 -0700 From: Andrew Morton To: Reuben Farrelly Cc: linux-kernel@vger.kernel.org, linux-ide@vger.kernel.org, linux-scsi@vger.kernel.org, James Bottomley Subject: Re: 2.6.14-rc2-mm1 Message-Id: <20050922000350.63b68060.akpm@osdl.org> In-Reply-To: <4332535D.1010309@reub.net> References: <20050921222839.76c53ba1.akpm@osdl.org> <4332535D.1010309@reub.net> X-Mailer: Sylpheed version 1.0.4 (GTK+ 1.2.10; i386-redhat-linux-gnu) Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Reuben Farrelly wrote: > > Hi, > > On 22/09/2005 5:28 p.m., Andrew Morton wrote: > > ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.14-rc2/2.6.14-rc2-mm1/ > > > > - Added git tree `git-sas.patch': Luben Tuikov's SAS driver and its support. > > > > - Various random other things - nothing major. > > Overall boots up and looks fine, but still seeing this oops which comes up on > warm reboot intermittently: Nasty. > ahci(0000:00:1f.2) AHCI 0001.0000 32 slots 4 ports 1.5 Gbps 0xf impl SATA mode > ahci(0000:00:1f.2) flags: 64bit ncq led slum part > ata1: SATA max UDMA/133 cmd 0xF8802D00 ctl 0x0 bmdma 0x0 irq 193 > ata2: SATA max UDMA/133 cmd 0xF8802D80 ctl 0x0 bmdma 0x0 irq 193 > ata3: SATA max UDMA/133 cmd 0xF8802E00 ctl 0x0 bmdma 0x0 irq 193 > ata4: SATA max UDMA/133 cmd 0xF8802E80 ctl 0x0 bmdma 0x0 irq 193 > ata1: dev 0 ATA-6, max UDMA/133, 156301488 sectors: LBA48 > ata1: dev 0 configured for UDMA/133 > scsi0 : ahci > ata2: dev 0 ATA-6, max UDMA/133, 156301488 sectors: LBA48 > ata2: dev 0 configured for UDMA/133 > scsi1 : ahci > ata3: no device found (phy stat 00000000) > scsi2 : ahci > ata4: no device found (phy stat 00000000) > scsi3 : ahci > Vendor: ATA Model: ST380817AS Rev: 3.42 > Type: Direct-Access ANSI SCSI revision: 05 > Vendor: ATA Model: ST380817AS Rev: 3.42 > Type: Direct-Access ANSI SCSI revision: 05 > scheduling while atomic: ksoftirqd/0/0x00000100/3 > [] dump_stack+0x17/0x19 > [] schedule+0x8ba/0xccb > [] __down+0xe5/0x126 > [] __down_failed+0xa/0x10 > [] .text.lock.main+0x2b/0x3e > [] device_del+0x35/0x5d > [] scsi_target_reap+0x89/0xa3 > [] scsi_device_dev_release+0x114/0x18b > [] device_release+0x1a/0x5a > [] kobject_cleanup+0x43/0x6b > [] kobject_release+0xb/0xd > [] kref_put+0x2e/0x92 > [] kobject_put+0x14/0x16 > [] put_device+0x11/0x13 > [] scsi_put_command+0x7c/0x9e > [] scsi_next_command+0xf/0x19 > [] scsi_end_request+0x93/0xc5 > [] scsi_io_completion+0x281/0x46a > [] scsi_generic_done+0x2d/0x3a > [] scsi_finish_command+0x7f/0x93 > [] scsi_softirq+0xab/0x11c > [] __do_softirq+0x72/0xdc > [] do_softirq+0x37/0x39 > [] ksoftirqd+0x9f/0xf4 > [] kthread+0x99/0x9d > [] kernel_thread_helper+0x5/0xb There's a whole bunch of reasons why we cannot call scsi_target_reap() from softirq context. klist_del() locking and whatever semaphore that's taking are amongst them... > Unable to handle kernel paging request<5>SCSI device sda: 156301488 512-byte > hdwr sectors (80026 MB) > SCSI device sda: drive cache: write back > SCSI device sda: 156301488 512-byte hdwr sectors (80026 MB) > SCSI device sda: drive cache: write back > sda: at virtual address 6b6b6b6b > printing eip: > c025b81f > *pde = 00000000 > Oops: 0000 [#1] > SMP > last sysfs file: > Modules linked in: > CPU: 0 > EIP: 0060:[] Not tainted VLI > EFLAGS: 00010292 (2.6.14-rc2-mm1) > EIP is at scsi_run_queue+0x12/0xb8 > eax: 6b6b6b6b ebx: f7c36b70 ecx: 00000000 edx: 00000001 > esi: f7c4eb6c edi: 00000246 ebp: c1911eac esp: c1911e98 > ds: 007b es: 007b ss: 0068 > Process ksoftirqd/0 (pid: 3, threadinfo=c1910000 task=c1942a90) > Stack: c1baf5f8 f7c36b70 f7c36b70 f7c4eb6c 00000246 c1911eb8 c025b91f f7c386e8 > c1911ed0 c025b9db f7c36b70 f7c4eb6c 00000000 00000000 c1911f28 c025bdd4 > 00000001 00004f80 00000100 00000001 c1807ac0 00000000 00000000 00040000 > Call Trace: > [] show_stack+0x94/0xca > [] show_registers+0x15a/0x1ea > [] die+0x108/0x183 > [] do_page_fault+0x1ed/0x63d > [] error_code+0x4f/0x54 > [] scsi_next_command+0x16/0x19 > [] scsi_end_request+0x93/0xc5 > [] scsi_io_completion+0x281/0x46a > [] scsi_generic_done+0x2d/0x3a > [] scsi_finish_command+0x7f/0x93 > [] scsi_softirq+0xab/0x11c > [] __do_softirq+0x72/0xdc > [] do_softirq+0x37/0x39 > [] ksoftirqd+0x9f/0xf4 > [] kthread+0x99/0x9d > [] kernel_thread_helper+0x5/0xb > Code: fd ff 8b 4d ec 8b 41 44 e8 e4 a6 0b 00 89 45 f0 89 d8 e8 34 c1 ff ff eb > b2 55 89 e5 57 56 53 83 ec 08 89 45 f0 8b 80 10 01 00 00 <8b> 38 80 b8 85 01 > 00 00 00 0f 88 8b 00 00 00 8b 47 44 e8 af a6 > <0>Kernel panic - not syncing: Fatal exception in interrupt > <0>Rebooting in 60 seconds.. > It oopsed as well. That might be a second bug. > > This is not new to this -mm release (I had a screen dump of it 2 weeks ago but > I suspect it is actually a bit older than that even). > Thanks.